F2LLM V2 8B
codefuse-ai/F2LLM-v2-8B
published Mar 2026 · updated May 2026
F2LLM V2 8B is a general-purpose, multilingual embedding model that supports over 200 languages and is trained on 60 million high-quality data samples.
specs
| Task | Text Embedding |
| Architecture | Transformer-based LLM with two-stage training pipeline |
| Parameters | 8B |
| License | CC BY-NC-ND 4.0 |
about this model
F2LLM-v2-8B is a multilingual embedding model from the F2LLM-v2 family, designed for general-purpose text embedding tasks such as information retrieval, semantic search, text classification, clustering, and bitext mining. It supports over 200 languages (282 natural languages and 40 programming languages in the training corpus).
Architecture and Training
The model is trained on a curated composite of 60 million publicly available high-quality data samples. It employs a two-stage LLM-based embedding pipeline integrating matryoshka learning, model pruning, and knowledge distillation to deliver competitive performance with greater efficiency than previous LLM-based embedding models.
Benchmark Performance
Within the F2LLM-v2 family, the 14B model ranks first on 11 MTEB benchmarks. The 8B variant achieves state-of-the-art results for its size class, making it suitable for resource-constrained applications while retaining strong retrieval and similarity capabilities.
Model Variants
| Model | Base | Instruct |
|---|---|---|
| 80M | F2LLM-v2-80M | |
| 160M | F2LLM-v2-160M | |
| 330M | F2LLM-v2-330M | |
| 0.6B | F2LLM-v2-0.6B-Preview | F2LLM-v2-0.6B |
| 1.7B | F2LLM-v2-1.7B-Preview | F2LLM-v2-1.7B |
| 4B | F2LLM-v2-4B-Preview | F2LLM-v2-4B |
| 8B | F2LLM-v2-8B-Preview | F2LLM-v2-8B |
| 14B | F2LLM-v2-14B-Preview | F2LLM-v2-14B |
Prompt Format
For asymmetric retrieval and reranking, prepend the instruction to the query using the format Instruct: your_instruction\nQuery: and leave documents unprompted. For symmetric tasks (STS, clustering, bitext mining), the model supports both prompted and unprompted encoding.
Pipeline: feature-extraction. License: CC BY-NC-ND 4.0.
best for
- ·Multilingual semantic search
- ·Information retrieval and reranking
- ·Text classification and clustering
FAQ
It is best for multilingual information retrieval, semantic search, text classification, and clustering across 200+ languages.
Input: text strings. Queries should use the prompt format: "Instruct: your_instruction\nQuery: ". Output: a 4096-dimensional embedding vector.
Use the OpenAI-compatible endpoint with your gigarouter API key. Send a request with the model name "F2LLM V2 8B" and the input text.
It is released under CC BY-NC-ND 4.0, which allows non-commercial use and sharing but prohibits commercial use and derivative works.
We're benchmarking and onboarding F2LLM V2 8B as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.