Comment by tootie 3 years ago
I'm not an expert, but isn't size the distinguishing feature of an LLM? It's the first L.
Reply by hiddencost 3 years ago
They needed an architecture that could take advantage of the scale first. That's what BERT did.