M2Retinexformer

This repository contains the official weights for M2Retinexformer (Multi-Modal Retinexformer), introduced in the paper M2Retinexformer: Multi-Modal Retinexformer for Low-Light Image Enhancement.

Introduction

Low-light image enhancement is challenging due to complex degradations, including amplified noise, artifacts, and color distortion. M2Retinexformer is a novel framework that extends Retinexformer by incorporating depth cues, luminance priors, and semantic features within a progressive refinement pipeline.

Depth provides geometric context invariant to lighting variations, while luminance and semantic features offer explicit guidance on brightness distribution and scene understanding. These modalities are fused through cross-attention with adaptive gating to dynamically balance illumination-guided self-attention and cross-attention based on the reliability of auxiliary cues.

Citation

If you find this work useful, please cite:

@misc{aboelwafa2026m2retinexformermultimodalretinexformerlowlight,
      title={M2Retinexformer: Multi-Modal Retinexformer for Low-Light Image Enhancement}, 
      author={Youssef Aboelwafa and Hicham G. Elmongui and Marwan Torki},
      year={2026},
      eprint={2605.12556},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2605.12556}, 
}

Acknowledgements

This project is built on the baseline architecture of Retinexformer.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Papers for YoussefAboelwafa/M2Retinexformer