Question 1

What is Qwen3 VL Reranker 2B best for?

Accepted Answer

It is best for re-ranking initial retrieval results by scoring relevance between a query (text, image, or video) and candidate documents (text, image, or video), significantly improving retrieval accuracy in multimodal pipelines.

Question 2

How does it compare to the Qwen3 VL Embedding model?

Accepted Answer

The Embedding model generates vector embeddings for efficient first-stage recall, while the Reranker performs fine-grained cross-encoder scoring on the retrieved candidates. Used together, they form a high-accuracy two-stage retrieval pipeline.

Question 3

What input formats does the model support?

Accepted Answer

The model accepts pairs of queries and documents, where each can be plain text, an image URL, a video, or a mix of text and image. It supports over 30 languages and a context length of 32K tokens.

Question 4

How do I call this model via the gigarouter API?

Accepted Answer

Use the gigarouter OpenAI-compatible endpoint with your API key, sending a request with the query and documents as inputs. The response will include relevance scores for each document.

Question 5

What license is the model released under?

Accepted Answer

It is released under the Apache 2.0 license, allowing commercial use and modification.

Task	Multimodal Reranking
Architecture	Cross-encoder with cross-attention
Parameters	2B
License	Apache 2.0
Context Length	32K tokens

Benchmark	Metric	Score
MMEB-v2 (Retrieval) – Avg	Average	75.1
MMEB-v2 (Retrieval) – Image	Image retrieval	73.8
MMEB-v2 (Retrieval) – Video	Video retrieval	52.1
MMEB-v2 (Retrieval) – VisDoc	Visual document retrieval	83.4
MMTEB (Retrieval)	Text retrieval	70.0
JinaVDR	Visual document retrieval	80.9
ViDoRe v3	Visual document retrieval	60.8

Qwen3 VL Reranker 2B

specs

about this model

best for

FAQ

related reranker models