DETR Doc Table Detection

TahaDouaji/detr-doc-table-detection

published Mar 2022 · updated Nov 2025

DETR Doc Table Detection is an object detection model that locates both bordered and borderless tables in document images.

est. price

~$0.047

/ 1k images · estimated, set at launch

API providers

downloads / mo

208.1K

license

apache-2.0

specs

Task	Object Detection
Architecture	DETR (Detection Transformer) with ResNet-50 backbone
Training Data	ICDAR2019 Table Dataset
License	Apache-2.0

about this model

detr-doc-table-detection is an object detection model that detects both bordered and borderless tables in document images. It is a fine-tuned version of facebook/detr-resnet-50, the DETR (Detection Transformer) architecture introduced in End-to-End Object Detection with Transformers. The model is hosted on gigarouter as a managed, OpenAI-compatible API.

Architecture and Training

DETR treats object detection as a direct set prediction problem, eliminating hand-designed components such as non-maximum suppression and anchor generation. The model uses a transformer encoder-decoder with a fixed set of learned object queries and a bipartite-matching loss. This fine-tuned model was trained on the ICDAR2019 Table Dataset, enabling it to recognize both bordered and borderless table structures in documents.

Performance Context

The underlying DETR architecture (ResNet-50 backbone) achieves accuracy and run-time performance on par with the highly-optimized Faster R-CNN baseline on the COCO object detection dataset, as reported in the original paper. While specific table-detection benchmark numbers for this fine-tuned variant are not published, the architectural strengths—global context reasoning and parallel prediction—are well suited to the diverse layouts found in documents.

About the Hosted API

No installation or model loading is required. Call the gigarouter endpoint with a document image and receive bounding-box detections for tables. The model handles both bordered and borderless tables, making it suitable for downstream tasks such as table extraction and document parsing.

best for

·Locating tables in scanned PDF documents
·Automating table detection in document processing pipelines
·Preprocessing for table structure recognition or data extraction

FAQ

What does this model detect?

It detects both bordered and borderless tables in document images, outputting bounding boxes and confidence scores.

What architecture is it based on?

It is based on DETR (Detection Transformer) with a ResNet-50 backbone, originally introduced in the paper "End-to-End Object Detection with Transformers".

How can I call this model via the gigarouter API?

Send a POST request to the gigarouter OpenAI-compatible endpoint with your API key, providing an image file (base64 or URL) as input.

What license applies to this model?

The parent model facebook/detr-resnet-50 is licensed under Apache-2.0, which likely extends to this fine-tuned version.

not yet live

We're benchmarking and onboarding DETR Doc Table Detection as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.

related object detection models

compare all →

table-transformer-structure-recognition

1.8M dl/mo

table-transformer-detection

1.5M dl/mo

yolos-small

713.6K dl/mo

PP-DocLayoutV3_safetensors

341.1K dl/mo

rtdetr_v2_r50vd

309.8K dl/mo

rtdetr_r50vd_coco_o365

254.5K dl/mo