Question 1

What is the embedding dimension and max input length?

Accepted Answer

The embedding dimension is 2304, and the maximum input length is 512 tokens.

Question 2

What input format does MiniCPM-Embedding expect?

Accepted Answer

Queries accept an optional instruction in the format "Instruction: {{ instruction }} Query: {{ query }}". Without an instruction, use "Query: {{ query }}". Documents are passed in as-is, with no prefix.
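The two query formats above can be sketched as a small helper. This is only an illustration: the function names `format_query` and `format_document` are hypothetical and not part of any MiniCPM-Embedding library.

```python
from typing import Optional


def format_query(query: str, instruction: Optional[str] = None) -> str:
    """Build the query-side input string in the format described above."""
    if instruction:
        # Instruction mode: "Instruction: {{ instruction }} Query: {{ query }}"
        return f"Instruction: {instruction} Query: {query}"
    # Instruction-free mode: "Query: {{ query }}"
    return f"Query: {query}"


def format_document(document: str) -> str:
    """Documents are input directly, so this returns the text unchanged."""
    return document
```

Only the query side is rewritten; applying the prefix to documents as well would not match the format described in the answer.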

Question 3

What is the model size and how does it compare to other embedding models?

Accepted Answer

MiniCPM-Embedding has 2.4B parameters. It scores 76.76 NDCG@10 on C-MTEB/Retrieval and 58.56 NDCG@10 on BEIR, outperforming many larger models on cross-lingual tasks.

Question 4

What are the license terms for using MiniCPM-Embedding commercially?

Accepted Answer

The code is licensed under Apache-2.0. The model weights are governed by the MiniCPM Model License: they are free for academic research, and free for commercial use after you complete a registration questionnaire.

Question 5

How can I call MiniCPM-Embedding via the gigarouter API?

Accepted Answer

Send a request to gigarouter's OpenAI-compatible embeddings endpoint, authenticating with your API key and passing your text as the input.
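As a rough sketch of what such a call might look like, the snippet below builds a request in the shape used by OpenAI-compatible embeddings endpoints (a POST to `/embeddings` with `model` and `input` fields and a Bearer token). The base URL `https://gigarouter.example/v1` and the model ID `minicpm-embedding` are placeholders invented for illustration, not confirmed gigarouter values; substitute the ones from your account.

```python
import json
import urllib.request
from typing import List

# Hypothetical placeholders: replace with the values from your gigarouter account.
BASE_URL = "https://gigarouter.example/v1"
MODEL_ID = "minicpm-embedding"


def build_embeddings_request(
    texts: List[str],
    api_key: str,
    base_url: str = BASE_URL,
    model: str = MODEL_ID,
) -> urllib.request.Request:
    """Build (but do not send) an OpenAI-style embeddings request."""
    payload = json.dumps({"model": model, "input": texts}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/embeddings",
        data=payload,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def fetch_embeddings(request: urllib.request.Request) -> List[List[float]]:
    """Send the request and return one vector per input text.

    Assumes the OpenAI-style response shape: {"data": [{"embedding": [...]}, ...]}.
    """
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    return [item["embedding"] for item in body["data"]]
```

Building the request separately from sending it makes it easy to inspect the URL, headers, and payload before any network traffic occurs.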

Task	Text Embedding
Architecture	Bidirectional attention with Weighted Mean Pooling, based on MiniCPM-2B
Parameters	2.4B
Embedding Dimension	2304
Max Input Tokens	512
License	Apache-2.0 (code); MiniCPM Model License (weights, free for academic and commercial use after registration)
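
To make the "Weighted Mean Pooling" entry above concrete, here is a minimal pure-Python sketch of one position-weighted variant, in which each real token is weighted by its running position and padding tokens get weight zero. Treating this as the exact scheme used by the model is an assumption; real implementations would operate on batched tensors rather than Python lists.

```python
from typing import List


def weighted_mean_pooling(
    hidden: List[List[float]], attention_mask: List[int]
) -> List[float]:
    """Pool per-token vectors into one vector, weighting later tokens more.

    Assumed scheme: weight_i = mask_i * (number of real tokens up to and including i).
    """
    weights = []
    running = 0
    for mask_value in attention_mask:
        running += mask_value
        weights.append(mask_value * running)  # padding (mask 0) gets weight 0

    total = sum(weights)
    if total == 0:
        raise ValueError("attention_mask contains no real tokens")

    dim = len(hidden[0])
    return [
        sum(w * token[j] for w, token in zip(weights, hidden)) / total
        for j in range(dim)
    ]
```

For a sequence with two real tokens followed by one padding token, the weights are 1, 2, and 0, so the second token contributes twice as much as the first and the padding vector is ignored.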

MiniCPM-Embedding
