Comment by S0y

1 day ago

These are simply benchmaxxed versions of either Qwen or Gemma 4.

4 comments

S0y

If so, it's impressive they managed to benchmaxx Qwen even further than it's already benchmaxxed.

v3ss0n 21 hours ago

Nah, they just put up graphs with different colors that prioritize themselves.

Citation needed

S0y 21 hours ago

Sure. https://deep-reinforce.com/ornith_1_0.html
>Built on top of pretrained Gemma 4 and Qwen 3.5, it achieves state-of-the-art performance among open-source models of comparable size on coding benchmarks.
>Ornith-1.0 is a self-improving training framework. Instead of relying on human-designed harnesses to drive solution generation in RL, Ornith-1.0 learns to generate both solution rollouts and the task-specific harnesses that guide those rollouts.