F-ViTA: Foundation Model Guided Visible to Thermal Translation
This repository contains the model described in the paper F-ViTA: Foundation Model Guided Visible to Thermal Translation.
F-ViTA leverages foundation models (SAM and Grounded DINO) to guide the visible-to-thermal image translation process using an InstructPix2Pix diffusion model. This approach improves translation accuracy and generalizes well to out-of-distribution scenarios.
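As a rough illustration of what "guidance from foundation models" could look like in practice, the sketch below turns zero-shot detection labels into a text instruction for an instruction-conditioned diffusion model. It is not taken from the F-ViTA code. The `Detection` container, the `build_instruction` helper, the score threshold, and the prompt wording are all hypothetical, and SAM masks are left out entirely. Refer to the linked repository for the actual conditioning scheme.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class Detection:
    """One zero-shot detection (hypothetical container, not part of F-ViTA)."""
    label: str
    score: float


def build_instruction(detections: Sequence[Detection], min_score: float = 0.3) -> str:
    """Compose a text instruction from detected object labels.

    Assumption: detector labels are folded into the text prompt.
    The conditioning actually used by F-ViTA may differ.
    """
    seen: List[str] = []
    # Visit the most confident detections first so they lead the prompt.
    for det in sorted(detections, key=lambda d: d.score, reverse=True):
        name = det.label.strip().lower()
        # Keep confident, non-empty labels and skip duplicates.
        if det.score >= min_score and name and name not in seen:
            seen.append(name)
    base = "turn this visible image into a thermal image"
    if not seen:
        return base
    return f"{base} containing {', '.join(seen)}"


if __name__ == "__main__":
    detections = [
        Detection("Person", 0.9),
        Detection("car", 0.5),
        Detection("person", 0.4),  # duplicate label, ignored
        Detection("tree", 0.1),    # below the threshold, ignored
    ]
    print(build_instruction(detections))
    # prints: turn this visible image into a thermal image containing person, car
```

The resulting string would then be passed as the prompt to an InstructPix2Pix-style pipeline together with the visible image. That step is omitted here because it depends on the released checkpoints.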
Code: https://github.com/jay-jnp/F-ViTA
Pre-trained checkpoints are available for several datasets.