Gemma 4 12B V2 Coding Agentic

yuxinlu1/gemma-4-12B-agentic-fable5-composer2.5-v2-3.5x-tau2-GGUF

published Jun 2026 · updated Jun 2026

Gemma 4 12B V2 Coding Agentic is a text-generation model that serves as a specialized local coding and tool-using agent for multi-step technical tasks.

status

coming soon

API providers

downloads / mo

329.4K

license

apache-2.0

specs

Task	Text Generation
Architecture	Gemma 4 12B
Parameters	12B
License	Apache 2.0

about this model

Gemma4-12B v2 (yuxinlu1/gemma-4-12B-agentic-fable5-composer2.5-v2-3.5x-tau2-GGUF) is a text-generation model specialized for coding and agentic tool-use tasks, fine-tuned from Google's Gemma 4 12B instruction-tuned base model. It is designed to read, reason, use tools, and work through multi-step technical tasks before acting, making it suitable for local, private agentic workflows.

Key Strengths and Benchmarks

tau2-bench telecom (20 tasks, Q8_0, local harness)	Score
Official `gemma-4-12B-it` (base)	~15%
Gemma4-12B v2	~55%

The model achieves roughly 3.5× higher performance than the base model on the tau2-bench telecom benchmark, which mirrors real terminal/debugging work by requiring diagnose → fix → verify loops. In a separate fabrication probe, the model grounds actions before acting (0% fabrication, on par with the base model). The base model tends to abandon tasks (transfer to human), while v2 persists through the agentic loop.

Trade-offs and Specialization

This fine-tune is specialized for coding, terminal, and technical-agentic work. It trades a small amount of general-knowledge breadth (MMLU-Pro scores slightly below the base model) for its agentic capabilities. It is not designed for customer-service tasks (e.g., tau2-bench retail).

Training and Architecture

v2 builds on v1 with a significant agentic push: multi-step tool-use trajectories (read → reason → act → verify) in Gemma 4's native tool protocol, verified chain-of-thought for Python coding tasks, and curated reasoning/instruction data to maintain broad competence. All reasoning is distilled CoT from Opus 4.8. The model uses Gemma's native thought channel before answering, and is distributed under Apache 2.0.

best for

·Local coding and debugging with multi-step tool use
·Terminal and command-line agentic tasks
·Private, offline technical problem-solving

FAQ

What is this model best for?

It is specialized for coding, terminal, and technical-agentic tasks like multi-step debugging and tool use.

What is the license for this model?

It is released under Apache 2.0, free to use, modify, and redistribute.

How does this model compare to the base Gemma 4 12B IT model?

It scores roughly 3.5x higher on the tau2-bench telecom agentic benchmark but slightly lower on general knowledge benchmarks like MMLU-Pro.

How do I call this model via the gigarouter API?

Use the gigarouter OpenAI-compatible endpoint with your API key, passing the model name and your prompt.

What is the recommended quantization for this model?

Q4_K_M is the recommended sweet spot, requiring about 6.87 GB of storage.

not yet live

We're benchmarking and onboarding Gemma 4 12B V2 Coding Agentic as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.

related text generation models

tiny-Qwen2ForCausalLM-2.5

9.2M dl/mo

deepseek-v4-gguf

6.4M dl/mo

Qwen3.6-35B-A3B-NVFP4

6.2M dl/mo

gemma-3-270m

5.1M dl/mo