skip to content
gigarouter gigarouter
models / embeddings · coming soon

Jina Embeddings V2 Base EN

jinaai/jina-embeddings-v2-base-en

published Sep 2023 · updated Jan 2025

Jina Embeddings V2 Base EN is a text embedding model that supports up to 8192 tokens and is designed for long document retrieval, semantic search, and RAG applications.

est. price
~$0.008
/ 1M tokens · estimated, set at launch
API providers
0
downloads / mo
172K
license
apache-2.0

specs

TaskText Embedding
ArchitectureBERT with ALiBi (JinaBERT)
Parameters137 million
LicenseApache 2.0
Max Sequence Length8192 tokens
TaskText Embedding
ArchitectureBERT with ALiBi (JinaBERT)
Parameters137 million
LicenseApache 2.0
Max Sequence Length8192 tokens

about this model

jina-embeddings-v2-base-en is an English monolingual text embedding model that supports an 8192-token sequence length. It is based on a BERT architecture enhanced with the symmetric bidirectional variant of ALiBi (Attention with Linear Biases), which enables input length extrapolation beyond the 512-token training limit. The model was pretrained on the C4 dataset and further fine-tuned on over 400 million sentence pairs and hard negatives from diverse domains. With 137 million parameters, it delivers efficient inference suitable for single-GPU deployment. Gigarouter hosts this model as a managed, OpenAI-compatible API, allowing developers to use it via a single API call without managing infrastructure.

Jina AI embedding model banner

Key Strengths and Performance

In the MTEB benchmark, Jina Embeddings 2 achieves state-of-the-art performance and matches the quality of OpenAI’s text-embedding-ada-002, as reported in the accompanying technical report. The model is released under the Apache-2.0 license.

In retrieval-augmented generation (RAG) evaluations conducted by LlamaIndex, jina-embeddings-v2-base-en combined with a reranker yields strong results:

RerankerHit RateMRR
bge-reranker-large0.9382020.868539
CohereRerank0.9325840.873689
Performance comparison from LlamaIndex blog

best for

FAQ

What is the maximum input length for this model?

It supports up to 8192 tokens, thanks to ALiBi positional encoding.

How can I call this model via the gigarouter API?

Use the OpenAI-compatible endpoint with your gigarouter API key, specifying the model name jinaai/jina-embeddings-v2-base-en.

What is the model size and inference speed?

It has 137 million parameters and is recommended for single GPU inference for fast performance.

What license is this model released under?

Apache 2.0 license.

Do I need to apply mean pooling when using the model?

Yes, mean pooling is recommended to produce high-quality sentence embeddings; the encode function handles this automatically when using the transformers package.

What is the maximum input length for this model?

It supports up to 8192 tokens, thanks to ALiBi positional encoding.

How can I call this model via the gigarouter API?

Use the OpenAI-compatible endpoint with your gigarouter API key, specifying the model name jinaai/jina-embeddings-v2-base-en.

What is the model size and inference speed?

It has 137 million parameters and is recommended for single GPU inference for fast performance.

What license is this model released under?

Apache 2.0 license.

Do I need to apply mean pooling when using the model?

Yes, mean pooling is recommended to produce high-quality sentence embeddings; the encode function handles this automatically when using the transformers package.

not yet live

We're benchmarking and onboarding Jina Embeddings V2 Base EN as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.

related embeddings models

compare all →