skip to content
gigarouter gigarouter
tasks / reranker

Hosted reranker models

38 models · 4 live as APIs · benchmarked & compared

Reranker models are specialized neural networks that take a query and a set of candidate documents and output relevance scores, reordering the list with the most relevant results first. They solve the problem of improving accuracy after an initial, cheap retrieval step—common in search engines, retrieval-augmented generation (RAG) pipelines, and enterprise question-answering. For example, a legal research platform might first retrieve 100 possibly relevant cases by keyword, then use a reranker to surface the top 5 that best match the user’s intent.

In production, rerankers are typically placed after a fast bi-encoder or keyword-based index. The retriever passes a limited number of candidates (e.g., 50–200) to the reranker, which re-scores them. This two-stage design balances recall and latency. When choosing between models, the primary trade-off is accuracy versus speed and memory. Smaller models like cross-encoder/ms-marco-MiniLM-L2-v2 offer higher throughput and lower latency, while larger ones like Qwen/Qwen3-Reranker-4B or cross-encoder/ms-marco-MiniLM-L12-v2 (and the multilingual jinaai/jina-reranker-v2-base-multilingual) provide stronger relevance signals at the cost of slower inference. On GigaRouter, 4 reranker models are live now, with 38 total being onboarded.

For most production call volumes, calling a hosted API eliminates the operational burden of managing GPU infrastructure, scaling, and model updates, making it simpler to integrate reranking without dedicating engineering resources to self-hosting.

compare

modelparamsdownloads/mopricestatus
cross-encoder/ms-marco-MiniLM-L6-v2-81.5M$0.008 / 1k docslive
jinaai/jina-reranker-v2-base-multilingual-1.8M$0.008 / 1k docslive
Qwen/Qwen3-Reranker-0.6B--$0.008 / 1k docslive
BAAI/bge-reranker-base--$0.008 / 1k docslive
cross-encoder/ms-marco-MiniLM-L4-v219.2M4.8M~$0.008 / 1k docscoming soon
Alibaba-NLP/gte-reranker-modernbert-base149.6M2.7M~$0.008 / 1k docscoming soon
cross-encoder/ms-marco-MiniLM-L12-v233.4M2.3M~$0.008 / 1k docscoming soon
Qwen/Qwen3-Reranker-4B4021.8M1.8M~$0.008 / 1k docscoming soon
cross-encoder/mmarco-mMiniLMv2-L12-H384-v1117.6M1.6M~$0.008 / 1k docscoming soon
cross-encoder/ms-marco-MiniLM-L2-v215.6M1.2M~$0.008 / 1k docscoming soon
Qwen/Qwen3-Reranker-8B8188.5M1M~$0.008 / 1k docscoming soon
jinaai/jina-reranker-v3596.8M949.9K~$0.008 / 1k docscoming soon
mixedbread-ai/mxbai-rerank-xsmall-v170.8M551K~$0.008 / 1k docscoming soon
Qwen/Qwen3-VL-Reranker-8B8767.1M431K~$0.008 / 1k docscoming soon
hotchpotch/japanese-reranker-cross-encoder-small-v1117.6M334.2K~$0.008 / 1k docscoming soon
Qwen/Qwen3-VL-Reranker-2B2127.5M300.3K~$0.008 / 1k docscoming soon
cross-encoder/stsb-roberta-large355.4M286.8K~$0.008 / 1k docscoming soon
cross-encoder/ms-marco-TinyBERT-L2-v24.4M283.3K~$0.008 / 1k docscoming soon
cl-nagoya/ruri-v3-reranker-310m315.2M274K~$0.008 / 1k docscoming soon
mixedbread-ai/mxbai-rerank-base-v1184.4M273.7K~$0.008 / 1k docscoming soon
tomaarsen/Qwen3-Reranker-0.6B-seq-cls595.8M262.5K~$0.008 / 1k docscoming soon
nvidia/llama-nemotron-rerank-1b-v21235.8M231K~$0.008 / 1k docscoming soon
Alibaba-NLP/gte-multilingual-reranker-base306M221.9K~$0.008 / 1k docscoming soon
antoinelouis/crossencoder-camembert-base-mmarcoFR110.6M185K~$0.008 / 1k docscoming soon
cross-encoder/stsb-roberta-base124.6M182.5K~$0.008 / 1k docscoming soon
nvidia/llama-nemotron-rerank-vl-1b-v21678.3M99.7K~$0.008 / 1k docscoming soon
mixedbread-ai/mxbai-rerank-base-v2494M97.7K~$0.008 / 1k docscoming soon
cross-encoder/stsb-distilroberta-base82.1M95.4K~$0.008 / 1k docscoming soon
Xenova/bge-reranker-base-84.3Kat launchcoming soon
mixedbread-ai/mxbai-rerank-large-v1-66.1Kat launchcoming soon
hotchpotch/japanese-reranker-cross-encoder-xsmall-v1107M55.4K~$0.008 / 1k docscoming soon
mixedbread-ai/mxbai-rerank-large-v21543.7M54.3K~$0.008 / 1k docscoming soon
ContextualAI/ctxl-rerank-v2-instruct-multilingual-1b1327M54.1K~$0.008 / 1k docscoming soon
hotchpotch/japanese-reranker-xsmall-v236.8M50.8K~$0.008 / 1k docscoming soon
jinaai/jina-reranker-v1-turbo-en37.8M49.2K~$0.008 / 1k docscoming soon
zeroentropy/zerank-2-reranker4022.5M34.6K~$0.008 / 1k docscoming soon
ggml-org/Qwen3-Reranker-0.6B-Q8_0-GGUF-32.7Kat launchcoming soon
cross-encoder/qnli-electra-base109.5M29.6K~$0.008 / 1k docscoming soon