Comment by S0y

1 day ago

These are simply benchmaxxed versions of either Qwen or Gemma 4.

Citation needed

  • Sure. https://deep-reinforce.com/ornith_1_0.html

    >Built on top of pretrained Gemma 4 and Qwen 3.5, it achieves state-of-the-art performance among open-source models of comparable size on coding benchmarks.

    >Ornith-1.0 is a self-improving training framework. Instead of relying on human-designed harnesses to drive solution generation in RL, Ornith-1.0 learns to generate both solution rollouts and the task-specific harnesses that guide those rollouts.