Hosted embeddings models

91 models · 4 live as APIs · benchmarked & compared

Embeddings models convert text into dense vector representations that capture semantic meaning, enabling machines to compare and retrieve relevant information. Common use cases include semantic search, where queries are matched to documents by vector similarity; retrieval-augmented generation (RAG), where relevant context is fetched before prompting a language model; and clustering or classification of text based on thematic proximity.

In production, embeddings are typically precomputed for a corpus and stored in a vector database. At query time, the input is embedded and a nearest-neighbor search returns the most relevant items. The choice of model involves a trade-off between size, quality, and speed. Larger models like Qwen/Qwen3-Embedding-4B or jinaai/jina-embeddings-v3 often deliver higher accuracy on nuanced tasks, but require more compute and memory. Smaller models such as Xenova/all-MiniLM-L6-v2 or ibm-granite/granite-embedding-small-english-r2 are faster and cheaper to run, making them suitable for latency-sensitive or high-volume applications.
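The precompute-then-search flow described above can be sketched in a few lines. This is a minimal illustration, not a production setup: `toy_embed` is a hypothetical stand-in (a hashed bag-of-words) for a real embedding model, the in-memory list stands in for a vector database, and the brute-force scan stands in for an approximate nearest-neighbor index.

```python
import hashlib
import math

DIM = 64  # toy dimensionality; real models emit far larger vectors


def toy_embed(text: str) -> list[float]:
    """Hypothetical stand-in for a real model: hashed bag-of-words, L2-normalized."""
    vec = [0.0] * DIM
    for token in text.lower().split():
        digest = hashlib.md5(token.encode("utf-8")).digest()
        vec[int.from_bytes(digest[:4], "big") % DIM] += 1.0
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else vec


def cosine(a: list[float], b: list[float]) -> float:
    # Inputs are unit-length, so the dot product equals cosine similarity.
    return sum(x * y for x, y in zip(a, b))


def build_index(corpus: list[str]) -> list[tuple[str, list[float]]]:
    # Precompute one embedding per document (the "vector database" here is a list).
    return [(doc, toy_embed(doc)) for doc in corpus]


def search(index, query: str, k: int = 2) -> list[tuple[float, str]]:
    # Embed the query, score every document, and return the k best matches.
    q = toy_embed(query)
    scored = [(cosine(q, vec), doc) for doc, vec in index]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


if __name__ == "__main__":
    index = build_index([
        "how to reset a password",
        "quarterly revenue report",
        "reset your router",
    ])
    for score, doc in search(index, "reset my password"):
        print(f"{score:.3f}  {doc}")
```

Swapping `toy_embed` for a call to a real model changes the quality of the vectors but not the shape of the pipeline: documents are embedded once, and each query is embedded at request time.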

At most call volumes, calling a hosted API is simpler than self-hosting: it removes the infrastructure, scaling, and maintenance overhead of running multiple model variants.

compare

| model | params | downloads/mo | price | status |
| --- | --- | --- | --- | --- |
| Qwen/Qwen3-Embedding-4B | 4021.8M | 2.6M | $0.008 / 1M tokens | live |
| NovaSearch/stella_en_1.5B_v5 | 1543.3M | 30.1K | $0.008 / 1M tokens | live |
| Qwen/Qwen3-Embedding-0.6B | - | - | $0.008 / 1M tokens | live |
| BAAI/bge-small-en-v1.5 | - | - | $0.008 / 1M tokens | live |
| nomic-ai/nomic-embed-text-v1.5 | 136.7M | 16.9M | ~$0.008 / 1M tokens | coming soon |
| nomic-ai/nomic-embed-text-v1 | 136.7M | 4.2M | ~$0.008 / 1M tokens | coming soon |
| facebook/w2v-bert-2.0 | 580.5M | 3.7M | ~$0.008 / 1M tokens | coming soon |
| Xenova/all-MiniLM-L6-v2 | - | 2.8M | at launch | coming soon |
| jinaai/jina-embeddings-v3 | 572.3M | 2.7M | ~$0.008 / 1M tokens | coming soon |
| ibm-granite/granite-embedding-small-english-r2 | 47.7M | 2.2M | ~$0.008 / 1M tokens | coming soon |
| Xenova/bge-base-en-v1.5 | - | 1.8M | at launch | coming soon |
| microsoft/wavlm-large | - | 1.4M | at launch | coming soon |
| Qdrant/all-MiniLM-L6-v2-onnx | - | 1.3M | at launch | coming soon |
| jinaai/jina-embeddings-v2-small-en | 32.7M | 1.3M | ~$0.008 / 1M tokens | coming soon |
| Alibaba-NLP/gte-multilingual-base | 305.4M | 1.2M | ~$0.008 / 1M tokens | coming soon |
| Alibaba-NLP/gte-large-en-v1.5 | 434.1M | 1.1M | ~$0.008 / 1M tokens | coming soon |
| Qwen/Qwen3-VL-Embedding-8B | 8144.8M | 1.1M | ~$0.008 / 1M tokens | coming soon |
| jinaai/jina-embeddings-v5-text-nano | 211.8M | 1.1M | ~$0.008 / 1M tokens | coming soon |
| Salesforce/SFR-Embedding-2_R | 7110.7M | 1M | ~$0.008 / 1M tokens | coming soon |
| nomic-ai/nomic-embed-text-v2-moe | 475.3M | 854.7K | ~$0.008 / 1M tokens | coming soon |
| indobenchmark/indobert-base-p1 | - | 826.3K | at launch | coming soon |
| Alibaba-NLP/gte-Qwen2-1.5B-instruct | 1776.2M | 772.7K | ~$0.008 / 1M tokens | coming soon |
| microsoft/wavlm-base-plus | - | 771.5K | at launch | coming soon |
| Qdrant/bm25 | - | 769.3K | at launch | coming soon |
| nvidia/llama-nemotron-embed-1b-v2 | 1235.8M | 658.5K | ~$0.008 / 1M tokens | coming soon |
| boboliu/Qwen3-Embedding-4B-W4A16-G128 | 4050.2M | 549.1K | ~$0.008 / 1M tokens | coming soon |
| Alibaba-NLP/gte-base-en-v1.5 | 136.8M | 459.5K | ~$0.008 / 1M tokens | coming soon |
| intfloat/e5-mistral-7b-instruct | 7110.7M | 414.7K | ~$0.008 / 1M tokens | coming soon |
| jinaai/jina-embeddings-v2-base-en | 137.4M | 172K | ~$0.008 / 1M tokens | coming soon |
| Snowflake/snowflake-arctic-embed-m-v2.0 | 305.4M | 162.9K | ~$0.008 / 1M tokens | coming soon |
| McGill-NLP/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp-supervised | - | 112.3K | at launch | coming soon |
| Alibaba-NLP/gte-Qwen2-7B-instruct | 7612.6M | 79.3K | ~$0.008 / 1M tokens | coming soon |
| jinaai/jina-embeddings-v5-omni-small | 1626.3M | 76.6K | ~$0.008 / 1M tokens | coming soon |
| NovaSearch/stella_en_400M_v5 | 435.2M | 69.4K | $0.008 / 1M tokens | coming soon |
| jinaai/jina-clip-v1 | 222.7M | 61K | ~$0.008 / 1M tokens | coming soon |
| royokong/e5-v | 8355.3M | 58.2K | ~$0.008 / 1M tokens | coming soon |
| Snowflake/snowflake-arctic-embed-m-long | 136.7M | 53.5K | ~$0.008 / 1M tokens | coming soon |
| ibm-granite/granite-embedding-english-r2 | 149M | 45K | ~$0.008 / 1M tokens | coming soon |
| codefuse-ai/F2LLM-v2-4B | 4022.5M | 41.1K | ~$0.008 / 1M tokens | coming soon |
| jxm/cde-small-v2 | 305.7M | 25.9K | ~$0.008 / 1M tokens | coming soon |
| jinaai/jina-embeddings-v5-omni-nano | 986M | 25.2K | ~$0.008 / 1M tokens | coming soon |
| McGill-NLP/LLM2Vec-Mistral-7B-Instruct-v2-mntp-unsup-simcse | - | 24.7K | at launch | coming soon |
| nvidia/NV-Embed-v2 | 7851M | 24.5K | ~$0.008 / 1M tokens | coming soon |
| Salesforce/SFR-Embedding-Mistral | 7110.7M | 18.2K | ~$0.008 / 1M tokens | coming soon |
| Linq-AI-Research/Linq-Embed-Mistral | 7110.7M | 14.7K | ~$0.008 / 1M tokens | coming soon |
| Alibaba-NLP/gme-Qwen2-VL-2B-Instruct | 2209M | 9.1K | ~$0.008 / 1M tokens | coming soon |
| sdadas/mmlw-e5-large | 559.9M | 5.2K | ~$0.008 / 1M tokens | coming soon |
| infly/inf-retriever-v1-1.5b | 1543.3M | 4.3K | ~$0.008 / 1M tokens | coming soon |
| codefuse-ai/F2LLM-v2-1.7B | 1720.6M | 3.8K | ~$0.008 / 1M tokens | coming soon |
| jinaai/jina-embedding-b-en-v1 | - | 3.5K | at launch | coming soon |
| openbmb/MiniCPM-Embedding | 2724.9M | 2.5K | ~$0.008 / 1M tokens | coming soon |
| opensearch-project/opensearch-neural-sparse-encoding-doc-v3-gte | 137.4M | 2.1K | ~$0.008 / 1M tokens | coming soon |
| LCO-Embedding/LCO-Embedding-Omni-3B | 4703.5M | 2.1K | ~$0.008 / 1M tokens | coming soon |
| LCO-Embedding/LCO-Embedding-Omni-7B | 8931.8M | 1.2K | ~$0.008 / 1M tokens | coming soon |
| Lajavaness/bilingual-embedding-base | 278M | 1.1K | ~$0.008 / 1M tokens | coming soon |
| Alibaba-NLP/gme-Qwen2-VL-7B-Instruct | 8291.4M | 1K | ~$0.008 / 1M tokens | coming soon |
| BAAI/bge-en-icl | 7110.7M | 1K | ~$0.008 / 1M tokens | coming soon |
| codefuse-ai/F2LLM-v2-8B | 7568.4M | 923 | ~$0.008 / 1M tokens | coming soon |
| HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v2 | 494M | 919 | ~$0.008 / 1M tokens | coming soon |
| HIT-TMG/KaLM-embedding-multilingual-mini-instruct-v1 | 494M | 732 | ~$0.008 / 1M tokens | coming soon |
| jinaai/jina-embedding-s-en-v1 | - | 691 | at launch | coming soon |
| BidirLM/BidirLM-Omni-2.5B-Embedding | 2445M | 619 | ~$0.008 / 1M tokens | coming soon |
| nomic-ai/nomic-embed-text-v1-unsupervised | - | 609 | at launch | coming soon |
| McGill-NLP/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp-unsup-simcse | - | 446 | at launch | coming soon |
| McGill-NLP/LLM2Vec-Sheared-LLaMA-mntp-supervised | - | 440 | at launch | coming soon |
| sdadas/mmlw-e5-base | 278M | 369 | ~$0.008 / 1M tokens | coming soon |
| infly/inf-retriever-v1 | 7069.1M | 359 | ~$0.008 / 1M tokens | coming soon |
| izhx/udever-bloom-560m | - | 334 | at launch | coming soon |
| izhx/udever-bloom-1b1 | - | 321 | at launch | coming soon |
| McGill-NLP/LLM2Vec-Mistral-7B-Instruct-v2-mntp-supervised | - | 291 | at launch | coming soon |
| nomic-ai/nomic-embed-text-v1-ablated | - | 248 | at launch | coming soon |
| Hum-Works/lodestone-base-4096-v1 | - | 235 | at launch | coming soon |
| Mihaiii/Ivysaur | 22.7M | 232 | ~$0.008 / 1M tokens | coming soon |
| deepfile/embedder-100p | 278M | 223 | ~$0.008 / 1M tokens | coming soon |
| brahmairesearch/slx-v0.1 | 22.7M | 215 | ~$0.008 / 1M tokens | coming soon |
| w601sxs/b1ade-embed | - | 209 | at launch | coming soon |
| izhx/udever-bloom-3b | - | 206 | at launch | coming soon |
| Mihaiii/Bulbasaur | 17.4M | 200 | ~$0.008 / 1M tokens | coming soon |
| jxm/cde-small-v1 | 281.1M | 190 | ~$0.008 / 1M tokens | coming soon |
| Mihaiii/gte-micro-v4 | 19.2M | 162 | ~$0.008 / 1M tokens | coming soon |
| zeta-alpha-ai/Zeta-Alpha-E5-Mistral | 7110.7M | 153 | ~$0.008 / 1M tokens | coming soon |
| lightonai/DenseOn-unsupervised | 149M | 140 | ~$0.008 / 1M tokens | coming soon |
| yibinlei/LENS-d8000 | 7110.7M | 108 | ~$0.008 / 1M tokens | coming soon |
| yibinlei/LENS-d4000 | 7110.7M | 101 | ~$0.008 / 1M tokens | coming soon |
| lightonai/ColBERT-Zero-supervised | 149M | 84 | ~$0.008 / 1M tokens | coming soon |
| Alibaba-NLP/gte-Qwen1.5-7B-instruct | 7721.3M | 81 | ~$0.008 / 1M tokens | coming soon |
| lightonai/ColBERT-Zero-unsupervised | 149M | 81 | ~$0.008 / 1M tokens | coming soon |
| McGill-NLP/LLM2Vec-Llama-2-7b-chat-hf-mntp-unsup-simcse | - | 13 | at launch | coming soon |
| manveertamber/cadet-embed-base-v1 | 109.5M | 13 | ~$0.008 / 1M tokens | coming soon |
| McGill-NLP/LLM2Vec-Llama-2-7b-chat-hf-mntp-supervised | - | 12 | at launch | coming soon |
| sentence-transformers/static-retrieval-mrl-en-v1 | - | - | at launch | coming soon |
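
To put the listed price in context, the sketch below estimates what it would cost to embed a corpus at $0.008 per 1M tokens. The document count and average token length are hypothetical inputs chosen for illustration, and prices marked with `~` or "at launch" in the table may differ once those models go live.

```python
PRICE_PER_MILLION_TOKENS_USD = 0.008  # listed price for the live models


def embedding_cost_usd(
    num_documents: int,
    avg_tokens_per_document: int,
    price_per_million: float = PRICE_PER_MILLION_TOKENS_USD,
) -> float:
    """Estimate the one-off cost of embedding a corpus at a per-token price."""
    total_tokens = num_documents * avg_tokens_per_document
    return total_tokens / 1_000_000 * price_per_million


if __name__ == "__main__":
    # Hypothetical corpus: 1,000,000 documents averaging 500 tokens each.
    cost = embedding_cost_usd(1_000_000, 500)
    print(f"Estimated cost: ${cost:.2f}")
```

Under those assumptions the corpus contains 500M tokens, so the estimate comes to about $4. Query-time embedding is billed the same way, so ongoing cost scales with query volume and query length.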