rate card
Models & pricing
The specialist models we've benchmarked, hosted and priced, with the long tail we're onboarding next listed below. Prices are in each model's native unit; realtime is the on-demand rate, and batch is a discounted flexible tier (send X-Tier: batch).
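As a rough illustration of selecting the batch tier, the sketch below builds (but does not send) a request carrying X-Tier: batch. It assumes X-Tier is an HTTP request header. The endpoint URL, the payload fields, and the helper names `build_headers` and `build_request` are hypothetical placeholders, not part of any documented API, and authentication is omitted.

```python
import json
import urllib.request

# Hypothetical endpoint: the page does not give a URL, so this is a placeholder.
API_URL = "https://api.example.com/v1/infer"


def build_headers(batch: bool = False) -> dict:
    """Return request headers, adding X-Tier: batch when the batch tier is wanted."""
    headers = {"Content-Type": "application/json"}
    if batch:
        # Header name and value as given on the page; treating it as an
        # HTTP request header is an assumption.
        headers["X-Tier"] = "batch"
    return headers


def build_request(payload: dict, batch: bool = False) -> urllib.request.Request:
    """Build, without sending, a POST request for the chosen tier."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        API_URL, data=body, headers=build_headers(batch), method="POST"
    )


if __name__ == "__main__":
    # The payload shape is illustrative only.
    req = build_request({"model": "some-model", "input": "hello"}, batch=True)
    print(req.get_method(), req.full_url)
    print(dict(req.header_items()))
```

Leaving the header off (`batch=False`) corresponds to the realtime, on-demand rate described above.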
Tasks: all · embeddings · speech-to-text · vision-language · zero-shot image · reranker · image-to-text · text-to-speech · object detection · depth estimation · text generation
Filter: text generation (34 matches). No live models match this filter; see the roadmap below.
| model | task | tier | realtime | batch |
|---|---|---|---|---|
On the roadmap
34 models. High-demand specialist models with no hosted API. We benchmark and onboard them by task, and each has a page. Sign in and tell us which ones you need so they jump the queue.
text generation · 34
opt-125m
gpt2
tiny-Qwen2ForCausalLM-2.5
deepseek-v4-gguf
Qwen3.6-35B-A3B-NVFP4
gemma-3-270m
dolphin-2.9.1-yi-1.5-34b
gemma-4-12B-coder-fable5-composer2.5-v1-GGUF
VLM2Vec-Full
gemma-4-12B-agentic-fable5-composer2.5-v2-3.5x-tau2-GGUF
Ornith-1.0-35B-GGUF
Ornith-1.0-9B-GGUF
GLM-5.2-GGUF
Qwen-AgentWorld-35B-A3B-GGUF
DeepSeek-V4-Flash-GGUF
Ornith-1.0-35B
GLM-5.2-NVFP4
Qwen3.6-27B-NVFP4
Ornith-1.0-397B-FP8
Ornith-1.0-9B
Ornith-1.0-35B-FP8
Huihui-Qwythos-9B-Claude-Mythos-5-1M-abliterated-GGUF
Qwen-AgentWorld-35B-A3B
DeepSeek-V4-Flash-DSpark
LFM2.5-230M
Ornith-1.0-35B-MTP-APEX-GGUF
Ornith-1.0-9B-MTP-GGUF
Nemotron-Labs-TwoTower-30B-A3B-Base-BF16
DeepSeek-V4-Pro-DSpark
Ornith-1.0-35B-AEON-Ultimate-Uncensored-NVFP4
Ornith-1.0-397B
Huihui-GLM-5.2-abliterated-GGUF
Agents-A1
VLM2Vec-LoRA