Question 1

What is this model best used for?

Accepted Answer

It is designed to rerank a set of candidate documents for a given query, improving retrieval accuracy. It supports 100+ languages and can leverage custom instructions for up to 5% better performance.

Question 2

How does the 0.6B size compare to the 4B or 8B versions?

Accepted Answer

The 0.6B variant is faster and more lightweight, suitable for latency-sensitive applications, while larger versions offer higher accuracy at the cost of speed.

Question 3

What is the license for this model?

Accepted Answer

The model is released under the Apache-2.0 license, allowing commercial and personal use with attribution.

Question 4

What input format does the model expect?

Accepted Answer

It accepts query-document pairs, typically as strings. Using SentenceTransformers CrossEncoder, you pass (query, document) and receive a relevance score. It can also process batched inputs.

Question 5

How can I call this model via the gigarouter API?

Accepted Answer

Use the gigarouter OpenAI-compatible endpoint with your API key. Refer to the gigarouter documentation for the exact endpoint and payload format.

Task	Text ranking (reranking)
Architecture	Transformer with 28 layers
Parameters	0.6B
License	Apache-2.0
Context Length	32K tokens

Qwen3 Reranker 0.6B

specs

about this model

Key Capabilities

Architecture

Benchmark Context

License

best for

FAQ

related reranker models