---
license: apache-2.0
language:
- en
pipeline_tag: depth-estimation
tags:
- depth estimation
- high resolution
- image analysis
- computer vision
- in-the-wild
- zero-shot
---
High-Resolution Marigold Depth v1-0 Model Card
This is a model card for the `marigold-depth-hr-v1-0` model for monocular depth estimation from a single image.
The model is fine-tuned from the `marigold-depth-v1-0` [model](https://huggingface.co/prs-eth/marigold-depth-v1-0) as
described in
a follow-up of our [CVPR'2024 paper](https://arxiv.org/abs/2312.02145) titled "Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation".
## Model Details
- **Developed by:** [Bingxin Ke](http://www.kebingxin.com/), [Kevin Qu](https://ch.linkedin.com/in/kevin-qu-b3417621b), [Tianfu Wang](https://tianfwang.github.io/), [Nando Metzger](https://nandometzger.github.io/), [Shengyu Huang](https://shengyuh.github.io/), [Bo Li](https://www.linkedin.com/in/bobboli0202), [Anton Obukhov](https://www.obukhov.ai/), [Konrad Schindler](https://scholar.google.com/citations?user=FZuNgqIAAAAJ).
- **Model type:** Generative latent diffusion-based affine-invariant monocular depth estimation from a single image.
- **Language:** English.
- **License:** [Apache License License Version 2.0](https://www.apache.org/licenses/LICENSE-2.0).
- **Model Description:** This model can be used to generate an estimated depth map of an input image.
- **Resolution**: The model is designed to support large resolutions up to 4MP.
- **Steps and scheduler**: This model was designed for usage with the **DDIM** scheduler and between **10 and 50** denoising steps.
- **Outputs**:
- **Affine-invariant depth map**: The predicted values are between 0 and 1, interpolating between the near and far planes of the model's choice.
- **Resources for more information:** [Project Website](https://marigoldcomputervision.github.io/), [Paper](https://arxiv.org/abs/2312.02145), [Code](https://github.com/prs-eth/marigold).
- **Cite as:**
Placeholder for the citation block of the follow-up paper
```bibtex
@InProceedings{ke2023repurposing,
title={Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation},
author={Bingxin Ke and Anton Obukhov and Shengyu Huang and Nando Metzger and Rodrigo Caye Daudt and Konrad Schindler},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2024}
}
```