Comment by pixl97

2 years ago

I just want to point out that GPT isn't a great model for math, and we've had better models for at least a year:

>Although LLMs can sometimes answer these types of question correctly, they more often get them wrong. In one early test of its reasoning abilities, ChatGPT scored just 26% when faced with a sample of questions from the ‘MATH’ data set of secondary-school-level mathematical problems.

>But back in June 2022, an LLM called Minerva, created by Google, had already defied these expectations — to some extent. Minerva scored 50% on questions in the MATH data set, a result that shocked some researchers in artificial intelligence (AI; see ‘Minerva’s mathematics test’).