Distill Any Depth Large

xingyang1/Distill-Any-Depth-Large-hf

published Mar 2025 · updated Mar 2025

Distill Any Depth Large is a monocular depth estimation model that uses novel knowledge distillation methods to achieve state-of-the-art zero-shot depth prediction.

est. price

~$0.094

/ 1k images · estimated, set at launch

API providers

downloads / mo

189.6K

license

mit

specs

Task	Depth Estimation
Architecture	DepthAnythingForDepthEstimation
Parameters	335M
License	MIT

about this model

Distill-Any-Depth-Large-hf is a monocular depth estimation model that performs zero-shot depth estimation using knowledge distillation. Built on the DepthAnythingForDepthEstimation architecture with 335 million parameters and released under the MIT license, it was introduced in the paper Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator (arXiv:2502.19204).

Technical approach

The model employs Cross-Context Distillation, which integrates global and local depth cues to improve pseudo-label quality, and an assistant-guided distillation strategy that incorporates complementary priors from a diffusion-based teacher model. This multi-teacher framework leverages the strengths of different depth estimation models, including generative diffusion-based depth models, to enhance supervision diversity and robustness.

Performance

Quantitative and qualitative evaluations on benchmark datasets show that Distill-Any-Depth-Large-hf significantly outperforms prior state-of-the-art methods such as MiDaS v3.1, DepthAnythingv2, Marigold, and Genpercept. The model produces finer granularity and more detailed depth estimates, particularly in challenging regions.

Hosted on gigarouter as a managed, OpenAI-compatible API, this model requires no local installation or hardware management, enabling seamless integration into depth estimation pipelines.

best for

·Zero-shot depth estimation in diverse real-world scenes
·Robotics and autonomous navigation
·3D scene reconstruction from a single image
·Augmented and virtual reality applications

FAQ

What is the primary advantage of Distill Any Depth Large over other depth models?

It uses cross-context distillation and assistant-guided distillation from a diffusion-based teacher to produce finer, more detailed depth maps with better generalization.

How do I call this model via the gigarouter API?

Use the gigarouter OpenAI-compatible endpoint with your API key, sending an image URL or base64-encoded image to the depth estimation endpoint.

What input and output formats are supported?

Input: an image (URL or base64). Output: a depth map as a 2D array or image (depth values normalized between 0 and 255).

What is the license of this model?

It is released under the MIT license.

How does this model compare in size to other depth models?

It has 335 million parameters and a file size of approximately 1.34 GB (float32), making it a large but efficient model for high-quality depth estimation.

not yet live

We're benchmarking and onboarding Distill Any Depth Large as a hosted, OpenAI-compatible API. Sign in for free credit and be ready when it lands, or tell us you want it and we'll prioritize it.

related depth estimation models

compare all →

Depth-Anything-V2-Small-hf

1.7M dl/mo

DA3METRIC-LARGE

825K dl/mo

depth-anything-large-hf

388.9K dl/mo

dpt-hybrid-midas

225.1K dl/mo

DA3NESTED-GIANT-LARGE-1.1

199.9K dl/mo

Depth-Anything-V2-Large-hf

199.1K dl/mo