
Comment by himata4113

10 hours ago

Engineers at Google have publicly stated that the models are too big and are far from their potential. Glad they're being proven right with every release.

They continue to focus on smaller models while OpenAI and Anthropic are increasing compute requirements for their SOTA models.

> Engineers at Google have publicly stated that the models are too big and are far from their potential

Can you link to a source?

  • I wish I could. It was one of those YouTube podcast-type interviews with one of the engineers. A lot more was shared, but that line stuck with me the most.

Don’t let that fool you. Google will have SOTA models as big as or even bigger than their competitors.

They are just refining their current models while they finish training the next generation.

They will all come out at about the same time: Anthropic, OpenAI, Google, xAI.

Google’s pro models are almost certainly bigger than OpenAI’s lol

  • Why would that be? I'm curious why you think that.

    • E.g. because they are behind on research and so must compensate with size to achieve a similar level of intelligence. At least this is what I heard.

      On intelligence per size, only OpenAI and Anthropic are at the frontier. Google has more compute, so it can compensate for that with the size of the models...


    • Because TPUs are more efficient, and it's cheaper for them to field them in higher quantity since they own the chip.

I mean, yes and no.

Nobody really knows which of these is the better approach:

* Large model trained on a large amount of data across multiple domains, that doesn't need any extra content to answer questions.

* Smaller model that is smart enough to go fetch extra relevant content, and then operate on essentially "reformatting" the context into an answer.
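To make the second option concrete, here is a toy sketch of the "fetch, then reformat" flow. Everything in it is an assumption for illustration: `tokenize`, `retrieve`, and `build_prompt` are hypothetical helpers, the ranking is plain keyword overlap rather than a real search index, and no actual model is called. It only shows how the retrieved text ends up in the context that a small model would then rewrite into an answer.

```python
def tokenize(text: str) -> set[str]:
    # Lowercase words with basic punctuation stripped (toy tokenizer).
    return {w.strip(".,?!").lower() for w in text.split()}


def retrieve(query: str, docs: list[str], k: int = 2) -> list[str]:
    # Rank documents by how many words they share with the query.
    # A real system would likely use a search index or embeddings instead.
    q = tokenize(query)
    ranked = sorted(docs, key=lambda d: len(q & tokenize(d)), reverse=True)
    return ranked[:k]


def build_prompt(query: str, passages: list[str]) -> str:
    # Put the fetched passages in the context; the model's job is then
    # mostly to "reformat" this context into an answer.
    context = "\n".join(f"- {p}" for p in passages)
    return (
        f"Context:\n{context}\n\n"
        f"Question: {query}\n"
        "Answer using only the context above."
    )


# Hypothetical mini corpus, for illustration only.
docs = [
    "TPUs are accelerators designed by Google.",
    "Bread is made from flour and water.",
    "Retrieval lets a small model look facts up instead of memorizing them.",
]
query = "Who designed TPUs?"
passages = retrieve(query, docs, k=1)
prompt = build_prompt(query, passages)
print(prompt)
```

The first option needs none of this plumbing, since the knowledge would have to live in the weights. That is roughly the trade-off: more parameters versus more machinery around the model.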