Comment by chris_money202
17 days ago
"training good coding models" many would say that is a highly debatable statement, and some would say that is just flat out not true. Cursor has not trained a frontier model from scratch, what they did was take an already made (non-frontier) model and further trained it on their user data about coding outcomes from its coding agent. So, a form of distillation and RL.
No comments yet
Contribute on Hacker News ↗