skip to content
gigarouter gigarouter
models / text generation · coming soon

Ornith 1.0 35B

deepreinforce-ai/Ornith-1.0-35B-FP8

published Jun 2026 · updated Jun 2026

Ornith 1.0 35B is a text-generation model for agentic coding, fine-tuned from Qwen 3.5 using reinforcement learning to jointly optimize solution rollouts and scaffolding.

status
coming soon
API providers
0
downloads / mo
61.6K
license
mit

specs

TaskText Generation (Agentic Coding)
ArchitectureMixture of Experts (MoE)
Parameters35B
LicenseMIT

about this model

Ornith-1.0-35B architecture diagram Ornith-1.0-35B is a text-generation model that serves as the lightweight single-GPU member of the Ornith family, a self-improving family of open-source models for agentic coding post-trained on Gemma 4 and Qwen 3.5 architectures.

Training Framework

The model employs a self-improving reinforcement learning framework that jointly optimizes both the scaffolding (search trajectories) and the resulting solution rollouts. By learning to generate better scaffolds alongside solutions, Ornith-1.0-35B discovers higher-quality search trajectories and produces improved coding solutions. The model is released under the MIT license.

Key Capabilities

Ornith-1.0-35B is a reasoning model: by default, the assistant response opens with a <think> … </think> block before delivering the final answer. It achieves strong results across multiple agentic coding benchmarks.

Benchmark Results

Ornith-1.0-35B Qwen3.5-35B Qwen3.6-35B Gemma4-31B Qwen3.5-397B
Agentic Coding
Terminal-Bench 2.1 (Terminus-2)64.241.452.542.153.5
Terminal-Bench 2.1 (Claude Code)62.838.949.2-48.6
SWE-bench Verified75.67073.45276.4
SWE-bench Pro50.444.649.535.751.6
SWE-bench Multilingual69.360.367.251.769.3
NL2Repo34.620.529.415.536.8
Claw-eval Avg69.865.468.748.570.7
SWE Atlas - QnA37.113.215.5-20.4
SWE Atlas - RF29.710.211.4-18.4
SWE Atlas - TW27.89.813.3-18.5

Bold numbers indicate the highest score in each row among the compared models. Evaluation details: Terminal-Bench 2.1 uses Harbor/Terminus-2 and Claude Code frameworks with temperature 1.0; SWE-Bench uses OpenHands harness; SWE Atlas uses mini SWE agent harness; NL2Repo uses 400K context; ClawEval uses temperature 0.6 with 256K context. All results averaged over 5 runs where noted.

Ornith-1.0-35B benchmark graph

best for

FAQ

What is Ornith 1.0 35B best used for?

It is designed for agentic coding tasks such as automated software engineering, repository-level code generation, and terminal-based coding benchmarks.

What architecture does Ornith 1.0 35B use?

It is a 35B parameter Mixture of Experts (MoE) model, post-trained on top of Qwen 3.5.

What is the license for Ornith 1.0 35B?

It is MIT licensed, globally accessible with no regional restrictions.

How do I call Ornith 1.0 35B via the API?

Use the gigarouter OpenAI-compatible endpoint with your API key. The model is a reasoning model that outputs a <think> block before the final answer.

How does Ornith 1.0 35B compare to Qwen 3.5 35B?

Ornith 1.0 35B outperforms Qwen 3.5 35B on all reported agentic coding benchmarks, including SWE-Bench Verified (75.6 vs 70) and Terminal-Bench 2.1 (64.2 vs 41.4).

not yet live

We're benchmarking and onboarding Ornith 1.0 35B as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.

related text generation models

compare all →