Question 1

What is BGE Small EN v1.5 best for?

Accepted Answer

It is best for English text embedding tasks like semantic search, retrieval-augmented generation, and sentence similarity where low latency and small model size are important.

Question 2

How does it compare in size and speed to larger BGE models?

Accepted Answer

BGE Small EN v1.5 has fewer parameters (about 33M) than base and large versions, making it faster and more memory-efficient, though with slightly lower accuracy on benchmarks.

Question 3

What are the input and output formats for the API?

Accepted Answer

The API accepts text strings as input and returns a vector (list of floats) as output. Use the gigarouter OpenAI-compatible endpoint with an API key.

Question 4

Does it require an instruction prefix for queries?

Accepted Answer

Yes, for retrieval tasks you should prepend the query with "Represent this sentence for searching relevant passages: ". No instruction is needed for documents.

Question 5

What is the license for using this model?

Accepted Answer

The model is released under the MIT license, allowing free use, modification, and distribution.

Task	Embedding
Architecture	Transformer encoder
Parameters	Small (approx. 33M)
License	MIT

BGE Small EN v1.5

specs

about this model

best for

FAQ

try it live

related embeddings models