UVDoc
PaddlePaddle/UVDoc
published Jun 2025 · updated Jul 2025
UVDoc is an image-to-text model that corrects geometric distortions in document images to improve OCR accuracy.
specs
| Task | Document Image Unwarping |
| Architecture | UVDoc |
| Framework | PaddlePaddle |
| Benchmark CER (DocUNet) | 0.179 |
about this model
For developers evaluating this model through gigarouter's API, UVDoc provides a single-call solution for document rectification, with no need to manage dependencies or hardware. It is suitable for any application where document images are captured under uncontrolled conditions and require normalization before text extraction.best for
- ·Preprocessing distorted document images for OCR pipelines
- ·Correcting perspective and curling in scanned or photographed documents
- ·Improving text recognition accuracy in document AI systems
FAQ
UVDoc is an image-to-text model that geometrically corrects distorted document images to reduce character error rate in OCR.
It applies geometric transformations to fix inclination, perspective, and curling, making subsequent text recognition more accurate.
Input: a distorted document image (JPEG/PNG). Output: an unwarped document image, which can be used directly or passed to an OCR engine.
Use the gigarouter OpenAI-compatible endpoint with your API key, specifying the model as UVDoc and providing the image URL or base64 data.
The CER is 0.179 on the DocUNet benchmark dataset.
We're benchmarking and onboarding UVDoc as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.