DETR Doc Table Detection
TahaDouaji/detr-doc-table-detection
published Mar 2022 · updated Nov 2025
DETR Doc Table Detection is an object detection model that locates both bordered and borderless tables in document images.
specs
| Task | Object Detection |
| Architecture | DETR (Detection Transformer) with ResNet-50 backbone |
| Training Data | ICDAR2019 Table Dataset |
| License | Apache-2.0 |
about this model
detr-doc-table-detection is an object detection model that detects both bordered and borderless tables in document images. It is a fine-tuned version of facebook/detr-resnet-50, the DETR (Detection Transformer) architecture introduced in End-to-End Object Detection with Transformers. The model is hosted on gigarouter as a managed, OpenAI-compatible API.
Architecture and Training
DETR treats object detection as a direct set prediction problem, eliminating hand-designed components such as non-maximum suppression and anchor generation. The model uses a transformer encoder-decoder with a fixed set of learned object queries and a bipartite-matching loss. This fine-tuned model was trained on the ICDAR2019 Table Dataset, enabling it to recognize both bordered and borderless table structures in documents.
Performance Context
The underlying DETR architecture (ResNet-50 backbone) achieves accuracy and run-time performance on par with the highly-optimized Faster R-CNN baseline on the COCO object detection dataset, as reported in the original paper. While specific table-detection benchmark numbers for this fine-tuned variant are not published, the architectural strengths—global context reasoning and parallel prediction—are well suited to the diverse layouts found in documents.
About the Hosted API
No installation or model loading is required. Call the gigarouter endpoint with a document image and receive bounding-box detections for tables. The model handles both bordered and borderless tables, making it suitable for downstream tasks such as table extraction and document parsing.
best for
- ·Locating tables in scanned PDF documents
- ·Automating table detection in document processing pipelines
- ·Preprocessing for table structure recognition or data extraction
FAQ
It detects both bordered and borderless tables in document images, outputting bounding boxes and confidence scores.
It is based on DETR (Detection Transformer) with a ResNet-50 backbone, originally introduced in the paper "End-to-End Object Detection with Transformers".
Send a POST request to the gigarouter OpenAI-compatible endpoint with your API key, providing an image file (base64 or URL) as input.
The parent model facebook/detr-resnet-50 is licensed under Apache-2.0, which likely extends to this fine-tuned version.
We're benchmarking and onboarding DETR Doc Table Detection as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.