rate card

Models & pricing

The specialist models we've benchmarked, hosted and priced, followed by the long tail we're onboarding next. Prices are in each model's native unit. Realtime is the on-demand rate; batch is a discounted, flexible tier (send the header X-Tier: batch).
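As a rough illustration of opting into the batch tier, the sketch below builds a request that carries the X-Tier: batch header. Only that header comes from this page. The endpoint URL, the JSON payload shape, the bearer-token auth and the function name build_rerank_request are all hypothetical placeholders, so check the real API reference before relying on any of them. The request is built but not sent.

```python
import json
import urllib.request

# Hypothetical endpoint, for illustration only; not taken from this page.
API_URL = "https://api.example.com/v1/rerank"


def build_rerank_request(model, query, documents, api_key, batch=False):
    """Build (but do not send) a POST request, adding X-Tier: batch on request."""
    headers = {
        "Content-Type": "application/json",
        # Bearer auth is an assumption; the page does not describe authentication.
        "Authorization": f"Bearer {api_key}",
    }
    if batch:
        # The only header named on this page: selects the discounted batch tier.
        headers["X-Tier"] = "batch"
    # The payload field names are assumptions as well.
    body = json.dumps({"model": model, "query": query, "documents": documents})
    return urllib.request.Request(
        API_URL, data=body.encode("utf-8"), headers=headers, method="POST"
    )


req = build_rerank_request(
    "BAAI/bge-reranker-base",
    "what is a reranker?",
    ["first candidate passage", "second candidate passage"],
    api_key="YOUR_KEY",
    batch=True,
)
```

Leaving batch at its default of False omits the header, which under the description above would mean the realtime (on-demand) rate applies.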

Tasks: embeddings, speech-to-text, vision-language, zero-shot image, reranker, image-to-text, text-to-speech, object detection, depth estimation, text generation

38 matches in reranker

model	task	tier	realtime	batch
cross-encoder/ms-marco-MiniLM-L6-v2	reranker	A	$0.008/1k docs	$0.0025/1k docs
jinaai/jina-reranker-v2-base-multilingual	reranker	A	$0.008/1k docs	$0.0025/1k docs
Qwen/Qwen3-Reranker-0.6B	reranker	A	$0.008/1k docs	$0.0025/1k docs
BAAI/bge-reranker-base	reranker	A	$0.008/1k docs	$0.0025/1k docs
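To make the per-1k-docs rates concrete, here is a small sketch that estimates the cost of a reranking job at the two tier-A rates listed above. It assumes billing is strictly pro rata per document with no minimum charge and no rounding, which this page does not state. The names RATES_PER_1K_DOCS and estimate_cost are made up for the example.

```python
from decimal import Decimal

# Tier-A reranker rates from the table above, in USD per 1,000 documents.
RATES_PER_1K_DOCS = {
    "realtime": Decimal("0.008"),
    "batch": Decimal("0.0025"),
}


def estimate_cost(num_docs, tier):
    """Estimated USD cost, assuming linear pro-rata billing (an assumption)."""
    if num_docs < 0:
        raise ValueError("num_docs must be non-negative")
    return Decimal(num_docs) / Decimal(1000) * RATES_PER_1K_DOCS[tier]


docs = 250_000
realtime_cost = estimate_cost(docs, "realtime")
batch_cost = estimate_cost(docs, "batch")
# Fraction of the realtime price saved by choosing the batch tier.
savings = (realtime_cost - batch_cost) / realtime_cost

print(f"realtime: ${realtime_cost:.4f}")
print(f"batch:    ${batch_cost:.4f}")
print(f"savings:  {savings:.2%}")
```

Decimal is used instead of float so that fractions of a cent are not distorted by binary rounding. Under these assumptions, 250,000 documents come to $2.00 at the realtime rate and $0.625 at the batch rate.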

On the roadmap

34 models

High-demand specialist models with no hosted API. We benchmark and onboard them by task, and each has its own page. Sign in and tell us which ones you need so they can jump the queue.