MXBAI Rerank Large V2
mixedbread-ai/mxbai-rerank-large-v2
published Mar 2025 · updated Apr 2026
MXBAI Rerank Large V2 is a rerank model that scores document-query pairs for improved retrieval ranking.
specs
| Task | Reranking |
| Architecture | Cross-encoder transformer |
about this model
mxbai-rerank-large-v2 is a reranking model that improves the relevance ordering of documents in response to a query, designed for use in retrieval-augmented generation and information retrieval pipelines. It is a specialized small language model (SLM) that achieves state-of-the-art reranking performance while maintaining computational efficiency.
Key Strengths
The model addresses fundamental limitations of SLMs in reranking: narrow representation space and difficulty understanding task prompts without fine-tuning. It uses a two-stage training approach called ProRank, which applies reinforcement learning to improve prompt understanding and fine-grained score learning to enhance representation expressiveness. This allows the model to outperform both advanced open-source and proprietary reranking models, including large LLMs with over 7B parameters.
Benchmark Results
On the BEIR benchmark, the 0.5B parameter version of this model surpasses powerful LLM reranking models, demonstrating that properly trained SLMs can achieve superior document reranking performance while maintaining computational efficiency. The model consistently outperforms the most advanced open-source and proprietary reranking models across extensive experiments.
Architecture and Training
The model uses a two-stage training approach called ProRank. The first stage applies reinforcement learning to improve understanding of task prompts. The second stage introduces fine-grained score learning to enhance representation expressiveness, addressing the narrow representation space and prompt understanding limitations typical of SLMs.
best for
- ·Reranking search results in retrieval-augmented generation (RAG) pipelines
- ·Improving relevance ranking for enterprise search and Q&A systems
FAQ
It is best for reranking document-query pairs to improve the relevance of top results in search and RAG systems.
Use the OpenAI-compatible endpoint with your gigarouter API key, sending a list of query-document pairs for scoring.
Input is a query and a list of documents; output is a relevance score for each document.
We're benchmarking and onboarding MXBAI Rerank Large V2 as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.