F2LLM V2 8B

codefuse-ai/F2LLM-v2-8B

published Mar 2026 · updated May 2026

F2LLM V2 8B is a general-purpose, multilingual embedding model that supports over 200 languages and is trained on 60 million high-quality data samples.

est. price

~$0.008

/ 1M tokens · estimated, set at launch

API providers

downloads / mo

923

license

apache-2.0

specs

Task	Text Embedding
Architecture	Transformer-based LLM with two-stage training pipeline
Parameters	8B
License	CC BY-NC-ND 4.0

about this model

F2LLM-v2-8B is a multilingual embedding model from the F2LLM-v2 family, designed for general-purpose text embedding tasks such as information retrieval, semantic search, text classification, clustering, and bitext mining. It supports over 200 languages (282 natural languages and 40 programming languages in the training corpus).

Architecture and Training

The model is trained on a curated composite of 60 million publicly available high-quality data samples. It employs a two-stage LLM-based embedding pipeline integrating matryoshka learning, model pruning, and knowledge distillation to deliver competitive performance with greater efficiency than previous LLM-based embedding models.

Benchmark Performance

Within the F2LLM-v2 family, the 14B model ranks first on 11 MTEB benchmarks. The 8B variant achieves state-of-the-art results for its size class, making it suitable for resource-constrained applications while retaining strong retrieval and similarity capabilities.

Model Variants

Model	Base	Instruct
80M		F2LLM-v2-80M
160M		F2LLM-v2-160M
330M		F2LLM-v2-330M
0.6B	F2LLM-v2-0.6B-Preview	F2LLM-v2-0.6B
1.7B	F2LLM-v2-1.7B-Preview	F2LLM-v2-1.7B
4B	F2LLM-v2-4B-Preview	F2LLM-v2-4B
8B	F2LLM-v2-8B-Preview	F2LLM-v2-8B
14B	F2LLM-v2-14B-Preview	F2LLM-v2-14B

Prompt Format

For asymmetric retrieval and reranking, prepend the instruction to the query using the format Instruct: your_instruction\nQuery: and leave documents unprompted. For symmetric tasks (STS, clustering, bitext mining), the model supports both prompted and unprompted encoding.

Pipeline: feature-extraction. License: CC BY-NC-ND 4.0.

best for

·Multilingual semantic search
·Information retrieval and reranking
·Text classification and clustering

FAQ

What is the best use case for F2LLM V2 8B?

It is best for multilingual information retrieval, semantic search, text classification, and clustering across 200+ languages.

What is the input and output format for this model?

Input: text strings. Queries should use the prompt format: "Instruct: your_instruction\nQuery: ". Output: a 4096-dimensional embedding vector.

How can I call F2LLM V2 8B via the gigarouter API?

Use the OpenAI-compatible endpoint with your gigarouter API key. Send a request with the model name "F2LLM V2 8B" and the input text.

What is the license for F2LLM V2 8B?

It is released under CC BY-NC-ND 4.0, which allows non-commercial use and sharing but prohibits commercial use and derivative works.

not yet live

We're benchmarking and onboarding F2LLM V2 8B as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.

related embeddings models

compare all →

nomic-embed-text-v1.5