Comment by jumploops

5 months ago

In the release, you say: "[..] in developing our reasoning models, we’ve optimized somewhat less for math and computer science competition problems, and instead shifted focus towards real-world tasks that better reflect how businesses actually use LLMs."

Can you tell us more about the trade-offs here?

Also, are you using synthetic data to improve the responses here, or are you relying purely on data from your own usage and your partners' usage?