F2LLM V2 1.7B
codefuse-ai/F2LLM-v2-1.7B
published Mar 2026 · updated May 2026
F2LLM V2 1.7B is a multilingual embedding model that supports over 200 languages, optimized for retrieval, semantic search, and classification tasks.
specs
| Task | Embedding |
| Architecture | Qwen3 |
| Parameters | 1.7B |
| License | Apache 2.0 |
about this model
F2LLM-v2-1.7B is a general-purpose, multilingual embedding model that converts text into dense vector representations for information retrieval, semantic search, and text classification. It is the 1.7-billion-parameter instruct variant of the F2LLM-v2 family, fine-tuned from the codefuse-ai/F2LLM-v2-1.7B-Preview base model and released under the Apache 2.0 license.
Capabilities and Training
Trained on a curated composite of 60 million high-quality samples from the codefuse-ai/F2LLM-v2 dataset, the model supports over 200 languages with a particular focus on mid- and low-resource languages. According to the research paper (arXiv:2603.19223), the training corpus covers 282 natural languages and more than 40 programming languages. The architecture is based on Qwen3 and produces output embeddings with a fixed dimension of 2048.
Optimization Techniques
The model integrates a two-stage LLM-based embedding training pipeline with Matryoshka Representation Learning (MRL), model pruning, and knowledge distillation, enabling efficient inference while maintaining competitive retrieval performance.
Benchmark Performance
While specific MTEB scores for the 1.7B variant are not published, the family’s largest model, F2LLM-v2-14B, ranks first on 11 MTEB benchmarks. The 1.7B model inherits the training methodology and is designed to deliver strong embedding quality at a compact size suitable for resource-constrained applications.
Model Family Overview
| Size | Base Model | Instruct Model |
|---|---|---|
| 80M | — | F2LLM-v2-80M |
| 160M | — | F2LLM-v2-160M |
| 330M | — | F2LLM-v2-330M |
| 0.6B | Preview | F2LLM-v2-0.6B |
| 1.7B | Preview | F2LLM-v2-1.7B (this model) |
| 4B | Preview | F2LLM-v2-4B |
| 8B | Preview | F2LLM-v2-8B |
| 14B | Preview | F2LLM-v2-14B |
All models, training data, code, and intermediate checkpoints are publicly available. Intermediate checkpoints for this model are provided in the intermediate_checkpoints branch on Hugging Face.
best for
- ·Multilingual semantic search and information retrieval
- ·Text classification across 200+ languages
- ·Clustering and bitext mining tasks
FAQ
It excels at multilingual retrieval, semantic search, text classification, and clustering, especially for mid- and low-resource languages.
The embedding dimension is 2048.
It is released under the Apache 2.0 license.
Use the gigarouter OpenAI-compatible endpoint with your API key; refer to the gigarouter documentation for endpoint details.
Yes, for asymmetric retrieval use the format 'Instruct: your_instruction\nQuery: ' for queries; documents do not need a prompt.
We're benchmarking and onboarding F2LLM V2 1.7B as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.