Learned Lightweight Smartphone ISP with Unpaired Data
Abstract
A novel unpaired training method for a learnable ISP uses adversarial training with multiple discriminators to achieve high-quality image transformation without paired datasets.
The Image Signal Processor (ISP) is a fundamental component in modern smartphone cameras responsible for conversion of RAW sensor image data to RGB images with a strong focus on perceptual quality. Recent work highlights the potential of deep learning approaches and their ability to capture details with a quality increasingly close to that of professional cameras. A difficult and costly step when developing a learned ISP is the acquisition of pixel-wise aligned paired data that maps the raw captured by a smartphone camera sensor to high-quality reference images. In this work, we address this challenge by proposing a novel training method for a learnable ISP that eliminates the need for direct correspondences between raw images and ground-truth data with matching content. Our unpaired approach employs a multi-term loss function guided by adversarial training with multiple discriminators processing feature maps from pre-trained networks to maintain content structure while learning color and texture characteristics from the target RGB dataset. Using lightweight neural network architectures suitable for mobile devices as backbones, we evaluated our method on the Zurich RAW to RGB and Fujifilm UltraISP datasets. Compared to paired training methods, our unpaired learning strategy shows strong potential and achieves high fidelity across multiple evaluation metrics. The code and pre-trained models are available at https://github.com/AndreiiArhire/Learned-Lightweight-Smartphone-ISP-with-Unpaired-Data .
Community
This paper proposes a novel unpaired training method for learning image signal processors (ISPs) without the need for paired RAW-to-RGB data. By combining a multi-term loss with adversarial training and multiple discriminators, the method learns high-quality color and texture mappings using lightweight models. It performs strongly on the Zurich RAW to RGB and Fujifilm UltraISP datasets, making it well-suited for deployment on smartphones.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Exploring Semantic Feature Discrimination for Perceptual Image Super-Resolution and Opinion-Unaware No-Reference Image Quality Assessment (2025)
- DSDNet: Raw Domain Demoir'eing via Dual Color-Space Synergy (2025)
- ISPDiffuser: Learning RAW-to-sRGB Mappings with Texture-Aware Diffusion Models and Histogram-Guided Color Consistency (2025)
- Channel Consistency Prior and Self-Reconstruction Strategy Based Unsupervised Image Deraining (2025)
- Scene Perceived Image Perceptual Score (SPIPS): combining global and local perception for image quality assessment (2025)
- Towards Realistic Low-Light Image Enhancement via ISP Driven Data Modeling (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper