Question 1

How does Nougat Base differ from the small version?

Accepted Answer

The small version is the default tag (0.1.0-small); base is the larger variant (0.1.0-base), offering higher accuracy at the cost of more compute.

Question 2

What output format does Nougat produce?

Accepted Answer

It outputs Mathpix Markdown (.mmd), a plain-text format that preserves LaTeX math and table structures.

Question 3

How can I call Nougat Base via the gigarouter API?

Accepted Answer

Use the gigarouter OpenAI-compatible endpoint with your API key, sending a PDF image as input to receive Markdown text.

Question 4

What is the license of the Nougat Base model?

Accepted Answer

The model card does not specify a license; the associated code repository uses MIT, and a separate MODEL license file exists but its terms are not disclosed.

Question 5

Can Nougat handle multi-page PDFs?

Accepted Answer

Yes, it processes each page as an image; the CLI supports batch processing and page-range selection (e.g., --pages 1-4,7).

Task	image-to-text
Architecture	Swin Transformer encoder + mBART decoder
Input	PDF page images (pixels)
Output	Mathpix Markdown (.mmd)

Nougat Base

specs

about this model

best for

FAQ

related image-to-text models