Question 1

What is the embedding dimension of INF Retriever V1 1.5B?

Accepted Answer

The embedding dimension is 1536.

Question 2

What is the maximum input token length?

Accepted Answer

The model supports a maximum of 32768 tokens.

Question 3

What license is this model released under?

Accepted Answer

It is released under Apache-2.0.

Question 4

How does this 1.5B model compare to the larger INF-Retriever-v1 (7B)?

Accepted Answer

It is a lighter version that still ranks No.1 on the AIR-Bench bilingual sub-leaderboard among models with fewer than 7B parameters, making it a top choice for efficient bilingual retrieval.

Question 5

How can I use this model via the gigarouter API?

Accepted Answer

Send requests to the OpenAI-compatible endpoint with your API key; use the model name as provided in the deployment.

Task	Embedding / Dense Retrieval
Architecture	Transformer (based on GTE-Qwen2-1.5B-instruct)
Parameters	1.5B
License	Apache-2.0
Embedding Dimension	1536
Max Input Tokens	32768

INF Retriever V1 1.5B

specs

best for

FAQ

related embeddings models