Jina Embeddings V3

jinaai/jina-embeddings-v3

published Sep 2024 · updated Apr 2026

Jina Embeddings V3 is a multilingual text embedding model with task-specific LoRA adapters, supporting up to 8192 tokens and flexible output dimensions.

est. price

~$0.008

/ 1M tokens · estimated, set at launch

API providers

downloads / mo

2.7M

license

cc-by-nc-4.0

specs

Task	Text Embeddings
Architecture	Jina-XLM-RoBERTa with Rotary Position Embeddings
Parameters	570 million
License	CC BY-NC 4.0

about this model

jina-embeddings-v3 is a multilingual multi-task text embedding model that generates high-quality vector representations for retrieval, clustering, classification, and text matching tasks. Based on the Jina-XLM-RoBERTa architecture with 570 million parameters, it supports context lengths up to 8192 tokens via Rotary Position Embeddings (RoPE) and uses five task-specific LoRA adapters to produce optimized embeddings.

Diagram of jina-embeddings-v3 architecture showing LoRA adapters and task outputs

Key capabilities

Task-specific LoRA adapters: retrieval.query, retrieval.passage, separation, classification, and text-matching.
Matryoshka embeddings: Flexible output dimensions from 1024 down to 32 without degrading performance.
Multilingual support: Optimized for 30 languages (Arabic, Bengali, Chinese, Danish, Dutch, English, Finnish, French, Georgian, German, Greek, Hindi, Indonesian, Italian, Japanese, Korean, Latvian, Norwegian, Polish, Portuguese, Romanian, Russian, Slovak, Spanish, Swedish, Thai, Turkish, Ukrainian, Urdu, Vietnamese); foundation model covers 100 languages.

Benchmark results

On the MTEB benchmark, jina-embeddings-v3 outperforms OpenAI text-embedding-3-large and Cohere embed-multilingual-v3.0 on English tasks, and surpasses multilingual-e5-large-instruct on all multilingual tasks. It is the best-performing multilingual model among those with fewer than 1 billion parameters (ranked 2nd on the English MTEB leaderboard as of September 2024). On six long-document retrieval tasks (LongEmbed), it shows significant improvements over models using fixed or ALiBi positional embeddings.

Efficiency

Compared to e5-mistral-7b-instruct (7.1B parameters, 4096 output dimension), jina-embeddings-v3 is 12× smaller and uses 4× smaller output dimensions while achieving comparable or better English MTEB scores, making it suitable for production and edge deployment.

best for

·Multilingual semantic search and retrieval
·Text clustering and classification
·Sentence similarity and matching
·Long-document retrieval (up to 8192 tokens)

FAQ

What tasks does jina-embeddings-v3 support via LoRA adapters?

It supports retrieval.query, retrieval.passage, separation, classification, and text-matching.

What is the maximum input length and output dimension?

Maximum input length is 8192 tokens. Output dimension defaults to 1024 but can be truncated down to 32 using Matryoshka embeddings.

How many parameters does the model have?

It has 570 million parameters.

What is the license for this model?

The model is licensed under CC BY-NC 4.0. Commercial use requires contacting Jina AI.

How can I use this model via the gigarouter API?

Use the gigarouter OpenAI-compatible endpoint with your API key to generate embeddings.

not yet live

We're benchmarking and onboarding Jina Embeddings V3 as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.

related embeddings models

compare all →

nomic-embed-text-v1.5

granite-embedding-small-english-r2

2.2M dl/mo