Question 1

What is this model best for?

Accepted Answer

It is designed for end-to-end document parsing: layout detection, text recognition, table and formula recognition, and chart analysis from page images.

Question 2

How does it compare in size to other models?

Accepted Answer

It has 1.2B parameters, making it a lightweight nano tier. A larger ultra version is available through KoreaDeep.

Question 3

What are the license terms?

Accepted Answer

It is an open-weight model, but no specific license is mentioned in the model card. Contact KoreaDeep for usage terms.

Question 4

How do I call it via the gigarouter API?

Accepted Answer

Use the gigarouter OpenAI-compatible endpoint with your API key. Send an image (URL or base64) and the appropriate task prompt (e.g., "
Table Recognition:"). Set temperature=0 and skip_special_tokens=False.

Question 5

What input formats does it accept?

Accepted Answer

It accepts page images (one per request). Supported tasks include layout detection, text, table, formula, and figure analysis, each with a fixed prompt.

Task	Document Parsing
Architecture	Vision-Language Model (VLM)
Parameters	1.2B
Context Length	8192 tokens
Max Images Per Request	1

KDL Frontier Parser Nano

specs

best for

FAQ

related vision-language models