Question 1

What is the input and output format for the Depth Anything Small API?

Accepted Answer

The API accepts an image file (e.g., PNG, JPEG) and returns a depth map as a grayscale image or raw tensor, depending on the endpoint configuration.
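The relationship between the two output forms can be sketched as follows. This is a minimal illustration, assuming the raw tensor arrives as a 2D grid of floats and that the grayscale image is a plain min-max rescaling to 0-255; the source does not say which normalization the endpoint actually applies, and `depth_to_grayscale` is a hypothetical helper name.

```python
def depth_to_grayscale(depth):
    """Min-max scale a 2D list of depth values to integers in 0..255."""
    flat = [v for row in depth for v in row]
    lo, hi = min(flat), max(flat)
    span = hi - lo
    if span == 0:
        # A constant map carries no relative depth; render it as black.
        return [[0 for _ in row] for row in depth]
    return [[round(255 * (v - lo) / span) for v in row] for row in depth]


# Made-up values standing in for a raw depth tensor (assumption, not API output).
raw = [[0.5, 1.0], [1.5, 2.5]]
gray = depth_to_grayscale(raw)
print(gray)
```

The raw tensor keeps the full floating-point range, while the grayscale form discards absolute scale and is mainly useful for viewing.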

Question 2

How does Depth Anything Small compare to larger depth models in speed and size?

Accepted Answer

At 24.8M parameters, it is significantly smaller (roughly 14 times fewer parameters) and faster than larger models such as MiDaS v3.1 BEiT L-512 (345M), while often matching or exceeding their zero-shot accuracy on benchmarks such as KITTI and NYUv2.
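The size difference can be made concrete with a little arithmetic. The sketch below uses only the two parameter counts quoted in the answer; the storage precisions (4 bytes for fp32, 2 bytes for fp16) are assumptions for illustration, and the figures cover the weights alone, not activations or runtime overhead.

```python
SMALL_PARAMS = 24.8e6  # Depth Anything Small, as quoted in the answer
MIDAS_PARAMS = 345e6   # MiDaS v3.1 BEiT L-512, as quoted in the answer


def weight_megabytes(params, bytes_per_param):
    """Approximate size of the weights alone, in decimal megabytes."""
    return params * bytes_per_param / 1e6


ratio = MIDAS_PARAMS / SMALL_PARAMS
fp32_mb = weight_megabytes(SMALL_PARAMS, 4)  # assumed 32-bit storage
fp16_mb = weight_megabytes(SMALL_PARAMS, 2)  # assumed 16-bit storage

print(f"{ratio:.1f}x fewer parameters")
print(f"about {fp32_mb:.1f} MB in fp32, about {fp16_mb:.1f} MB in fp16")
```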

Question 3

What is the license for using Depth Anything Small?

Accepted Answer

The model is released under the Apache 2.0 license, which permits commercial and non-commercial use provided the license text and attribution notices are retained.

Question 4

How can I call the Depth Anything Small model via the gigarouter API?

Accepted Answer

Use the OpenAI-compatible endpoint with your API key, sending a POST request with the image data to the designated depth estimation route.
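A request of this shape might be assembled as in the sketch below. The base URL, route, model identifier, and JSON payload fields are all hypothetical placeholders, since the answer does not specify them; only the POST method, the API key, and the image payload come from the text. Bearer-token authentication is assumed because it is the usual convention for OpenAI-compatible APIs. The code builds the request without sending it.

```python
import base64
import json
import urllib.request

# Hypothetical values: the real base URL, route, and schema are not given in the source.
BASE_URL = "https://api.example.com/v1"
DEPTH_ROUTE = "/depth"
MODEL_ID = "depth-anything-small"


def build_depth_request(image_bytes, api_key):
    """Build (but do not send) a POST request carrying a base64-encoded image."""
    payload = {
        "model": MODEL_ID,
        "image": base64.b64encode(image_bytes).decode("ascii"),
    }
    return urllib.request.Request(
        BASE_URL + DEPTH_ROUTE,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_depth_request(b"fake image bytes", "YOUR_API_KEY")
print(request.get_method(), request.full_url)
```

Sending it would be a matter of passing `request` to `urllib.request.urlopen` once the real endpoint details are substituted from the provider's documentation.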

Question 5

What training data was used for Depth Anything Small?

Accepted Answer

The model was trained on a combination of 1.5M labeled images and more than 62M unlabeled images, using a data engine that collects and automatically annotates the unlabeled images to scale up data coverage.

Depth Anything Small

specs

Task	Monocular Depth Estimation
Architecture	Vision Transformer (ViT-S) with DPT head
Parameters	24.8M
License	Apache 2.0

Zero-shot benchmark results (Ours-S / vits14)

Dataset	AbsRel	δ₁
KITTI	0.080	0.936
NYUv2	0.053	0.972
Sintel	0.464	0.739
DDAD	0.247	0.768
ETH3D	0.127	0.885
DIODE	0.076	0.939
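For readers unfamiliar with the AbsRel and δ₁ columns in the benchmark table, the sketch below computes both under their standard definitions: AbsRel is the mean of |prediction - ground truth| / ground truth (lower is better), and δ₁ is the fraction of pixels where max(prediction / ground truth, ground truth / prediction) is below 1.25 (higher is better). The sample values are made up, invalid pixels are assumed to be masked out already, and any alignment of relative predictions to the ground truth is assumed to have happened beforehand.

```python
def abs_rel(pred, target):
    """Mean of |pred - target| / target over all valid pixels."""
    return sum(abs(p - t) / t for p, t in zip(pred, target)) / len(target)


def delta1(pred, target, threshold=1.25):
    """Fraction of pixels where max(pred/target, target/pred) < threshold."""
    hits = sum(1 for p, t in zip(pred, target) if max(p / t, t / p) < threshold)
    return hits / len(target)


# Made-up flattened depth values; all must be strictly positive.
pred = [1.0, 2.0, 3.0, 8.0]
target = [1.0, 2.0, 4.0, 4.0]

print(abs_rel(pred, target))  # 0.3125
print(delta1(pred, target))   # 0.5
```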