models / reranker · coming soon

MXBAI Rerank Large V2

mixedbread-ai/mxbai-rerank-large-v2

published Mar 2025 · updated Apr 2026

MXBAI Rerank Large V2 is a rerank model that scores document-query pairs for improved retrieval ranking.

est. price

~$0.008

/ 1k docs · estimated, set at launch

API providers

downloads / mo

54.3K

license

apache-2.0

specs

Task	Reranking
Architecture	Cross-encoder transformer

about this model

mxbai-rerank-large-v2 is a reranking model that improves the relevance ordering of documents in response to a query, designed for use in retrieval-augmented generation and information retrieval pipelines. It is a specialized small language model (SLM) that achieves state-of-the-art reranking performance while maintaining computational efficiency.

Key Strengths

The model addresses fundamental limitations of SLMs in reranking: narrow representation space and difficulty understanding task prompts without fine-tuning. It uses a two-stage training approach called ProRank, which applies reinforcement learning to improve prompt understanding and fine-grained score learning to enhance representation expressiveness. This allows the model to outperform both advanced open-source and proprietary reranking models, including large LLMs with over 7B parameters.

Benchmark Results

On the BEIR benchmark, the 0.5B parameter version of this model surpasses powerful LLM reranking models, demonstrating that properly trained SLMs can achieve superior document reranking performance while maintaining computational efficiency. The model consistently outperforms the most advanced open-source and proprietary reranking models across extensive experiments.

Architecture and Training

The model uses a two-stage training approach called ProRank. The first stage applies reinforcement learning to improve understanding of task prompts. The second stage introduces fine-grained score learning to enhance representation expressiveness, addressing the narrow representation space and prompt understanding limitations typical of SLMs.

best for

·Reranking search results in retrieval-augmented generation (RAG) pipelines
·Improving relevance ranking for enterprise search and Q&A systems

FAQ

What is MXBAI Rerank Large V2 best used for?

It is best for reranking document-query pairs to improve the relevance of top results in search and RAG systems.

How do I call this model via the gigarouter API?

Use the OpenAI-compatible endpoint with your gigarouter API key, sending a list of query-document pairs for scoring.

What is the input and output format?

Input is a query and a list of documents; output is a relevance score for each document.

not yet live

We're benchmarking and onboarding MXBAI Rerank Large V2 as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.

related reranker models

compare all →

ms-marco-MiniLM-L6-v2

81.5M dl/mo · live

ms-marco-MiniLM-L4-v2

4.8M dl/mo

gte-reranker-modernbert-base

2.7M dl/mo

ms-marco-MiniLM-L12-v2

2.3M dl/mo

jina-reranker-v2-base-multilingual

1.8M dl/mo · live

Qwen3-Reranker-4B

1.8M dl/mo