Question 1

What is this model best for?

Accepted Answer

It is best for multilingual embedding tasks such as retrieval, classification, and clustering, with optional instruction prompts for classification and clustering.

Question 2

How does it compare to other multilingual embedding models?

Accepted Answer

It outperforms similar-sized models like multilingual-e5-large and bge-m3 on the MTEB benchmark, achieving an average score of 64.16.

Question 3

What is the input and output format?

Accepted Answer

Input is text strings; output is normalized embedding vectors (dense). The model supports a prompt prefix for instruction tasks and a max sequence length of 512 tokens.

Question 4

How do I call this model via the gigarouter API?

Accepted Answer

Use the gigarouter OpenAI-compatible endpoint with your API key. Send a request with the model name and input text to get the embedding vector.

Question 5

What license does this model use?

Accepted Answer

The KaLM-Embedding repository is released under the MIT license, allowing free use, modification, and distribution.

Task	Embedding
Architecture	Adapted auto-regressive LLM (Qwen2-0.5B) with mean pooling
Parameters	494M
License	MIT
Max Sequence Length	512 tokens
Instruction Support	Yes (for classification and clustering)

KaLM Embedding Multilingual Mini Instruct V1

specs

best for

FAQ

related embeddings models