skip to content
gigarouter gigarouter
rate card

Models & pricing

The specialist models we've benchmarked, hosted and priced — with the long tail we're onboarding next below. Prices are in each model's native unit; realtime is the on-demand rate, batch is a discounted flexible tier (send X-Tier: batch).

91 matches in embeddings · clear

modeltasktierrealtimebatch
Qwen/Qwen3-Embedding-0.6BembeddingsA$0.008/1M tok$0.0025/1M tok
BAAI/bge-small-en-v1.5embeddingsA$0.008/1M tok$0.0025/1M tok
Qwen/Qwen3-Embedding-4BembeddingsA$0.008/1M tok$0.0025/1M tok
NovaSearch/stella_en_1.5B_v5embeddingsA$0.008/1M tok$0.0025/1M tok

On the roadmap

87 models

High-demand specialist models with no hosted API. We benchmark and onboard them by task - each has a page; sign in and tell us which you need to jump the queue.

embeddings · 87
nomic-embed-text-v1.5nomic-embed-text-v1w2v-bert-2.0all-MiniLM-L6-v2jina-embeddings-v3granite-embedding-small-english-r2bge-base-en-v1.5wavlm-largeall-MiniLM-L6-v2-onnxjina-embeddings-v2-small-engte-multilingual-basegte-large-en-v1.5Qwen3-VL-Embedding-8Bjina-embeddings-v5-text-nanoSFR-Embedding-2_Rnomic-embed-text-v2-moeindobert-base-p1gte-Qwen2-1.5B-instructwavlm-base-plusbm25llama-nemotron-embed-1b-v2Qwen3-Embedding-4B-W4A16-G128gte-base-en-v1.5e5-mistral-7b-instructjina-embeddings-v2-base-ensnowflake-arctic-embed-m-v2.0LLM2Vec-Meta-Llama-3-8B-Instruct-mntp-supervisedgte-Qwen2-7B-instructjina-embeddings-v5-omni-smallstella_en_400M_v5jina-clip-v1e5-vsnowflake-arctic-embed-m-longgranite-embedding-english-r2F2LLM-v2-4Bcde-small-v2jina-embeddings-v5-omni-nanoLLM2Vec-Mistral-7B-Instruct-v2-mntp-unsup-simcseNV-Embed-v2SFR-Embedding-MistralLinq-Embed-Mistralgme-Qwen2-VL-2B-Instructmmlw-e5-largeinf-retriever-v1-1.5bF2LLM-v2-1.7Bjina-embedding-b-en-v1MiniCPM-Embeddingopensearch-neural-sparse-encoding-doc-v3-gteLCO-Embedding-Omni-3BLCO-Embedding-Omni-7Bbilingual-embedding-basegme-Qwen2-VL-7B-Instructbge-en-iclF2LLM-v2-8BKaLM-embedding-multilingual-mini-instruct-v2KaLM-embedding-multilingual-mini-instruct-v1jina-embedding-s-en-v1BidirLM-Omni-2.5B-Embeddingnomic-embed-text-v1-unsupervisedLLM2Vec-Meta-Llama-3-8B-Instruct-mntp-unsup-simcseLLM2Vec-Sheared-LLaMA-mntp-supervisedmmlw-e5-baseinf-retriever-v1udever-bloom-560mudever-bloom-1b1LLM2Vec-Mistral-7B-Instruct-v2-mntp-supervisednomic-embed-text-v1-ablatedlodestone-base-4096-v1Ivysaurembedder-100pslx-v0.1b1ade-embedudever-bloom-3bBulbasaurcde-small-v1gte-micro-v4Zeta-Alpha-E5-MistralDenseOn-unsupervisedLENS-d8000LENS-d4000ColBERT-Zero-supervisedgte-Qwen1.5-7B-instructColBERT-Zero-unsupervisedLLM2Vec-Llama-2-7b-chat-hf-mntp-unsup-simcsecadet-embed-base-v1LLM2Vec-Llama-2-7b-chat-hf-mntp-supervisedstatic-retrieval-mrl-en-v1