skip to content
gigarouter gigarouter
models / embeddings · coming soon

BGE Base EN v1.5

Xenova/bge-base-en-v1.5

published Sep 2023 · updated Jul 2025

BGE Base EN v1.5 is an embed model that converts English text into 768-dimensional dense vectors for semantic search, clustering, and classification.

status
coming soon
API providers
0
downloads / mo
1.8M
license
mit

specs

TaskFeature Extraction / Text Embedding
ArchitectureBERT-base (BGE)
Output Dimensions768
LicenseMIT

about this model

Xenova/bge-base-en-v1.5 is an English text embedding model optimized for retrieval and semantic similarity tasks, converted to ONNX weights for efficient web deployment and hosted as a managed API by gigarouter.

Part of BAAI’s General Embedding (BGE) series, the v1.5 update alleviates similarity distribution issues and enhances retrieval capability without requiring an instruction prefix (a recommended query prefix can still be used for retrieval tasks). The model outputs 768-dimensional embeddings and is released under the MIT license.

Benchmark Performance

Evaluated on the MTEB benchmark, the model achieves competitive scores across classification, retrieval, and semantic textual similarity tasks:

TaskMetricScore
ArguAna RetrievalNDCG@1063.61
AmazonPolarityClassificationAccuracy93.39
BIOSSES STSCosine Spearman86.94
Banking77ClassificationAccuracy86.95

These scores reflect the model’s strong general-purpose embedding quality for English text, particularly for dense retrieval and classification workloads.

best for

FAQ

What is the input format for this model?

It accepts English text strings and outputs 768-dimensional dense vectors when using mean pooling and normalization.

How is this model optimized for retrieval?

It uses a recommended query prefix: "Represent this sentence for searching relevant passages: " for retrieval tasks.

What is the license?

MIT.

How do I call this model via the gigarouter API?

Use the gigarouter OpenAI-compatible endpoint with your API key; set the model parameter to "Xenova/bge-base-en-v1.5" and send your text in the input.

What is the output dimension?

768.

not yet live

We're benchmarking and onboarding BGE Base EN v1.5 as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.

related embeddings models

compare all →