Question 1

What is the primary use case for this model?

Accepted Answer

Semantic search and information retrieval where speed and low resource usage are critical. It is designed for efficient nearest neighbor search.

Question 2

How does this model compare in speed to all-mpnet-base-v2?

Accepted Answer

It is 100-400x faster on CPU and 10-25x faster on GPU while achieving 87.4% of its retrieval performance.

Question 3

What license is this model released under?

Accepted Answer

Apache 2.0.

Question 4

Can I use a smaller embedding dimensionality?

Accepted Answer

Yes, the model was trained with Matryoshka loss, allowing you to truncate the embedding dimension (e.g., to 256) with minimal performance loss. Use the truncate_dim parameter.

Question 5

How do I call this model via the gigarouter API?

Accepted Answer

Use the OpenAI-compatible endpoint with your API key. Refer to gigarouter documentation for endpoint details.

Task	Embedding / Semantic Search
Architecture	StaticEmbedding (EmbeddingBag with BERT uncased tokenizer)
Parameters	0 active parameters (pre-computed token embeddings)
License	Apache 2.0
Output Dimensionality	1024 (truncatable via Matryoshka)
Similarity Function	Cosine Similarity

Static Retrieval MRL EN V1

specs

best for

FAQ

related embeddings models