M2Retinexformer / README.md
YoussefAboelwafa's picture
Add model card and metadata (#2)
83c1220
metadata
pipeline_tag: image-to-image

M2Retinexformer

This repository contains the official weights for M2Retinexformer (Multi-Modal Retinexformer), introduced in the paper M2Retinexformer: Multi-Modal Retinexformer for Low-Light Image Enhancement.

Introduction

Low-light image enhancement is challenging due to complex degradations, including amplified noise, artifacts, and color distortion. M2Retinexformer is a novel framework that extends Retinexformer by incorporating depth cues, luminance priors, and semantic features within a progressive refinement pipeline.

Depth provides geometric context invariant to lighting variations, while luminance and semantic features offer explicit guidance on brightness distribution and scene understanding. These modalities are fused through cross-attention with adaptive gating to dynamically balance illumination-guided self-attention and cross-attention based on the reliability of auxiliary cues.

Citation

If you find this work useful, please cite:

@misc{aboelwafa2026m2retinexformermultimodalretinexformerlowlight,
      title={M2Retinexformer: Multi-Modal Retinexformer for Low-Light Image Enhancement}, 
      author={Youssef Aboelwafa and Hicham G. Elmongui and Marwan Torki},
      year={2026},
      eprint={2605.12556},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2605.12556}, 
}

Acknowledgements

This project is built on the baseline architecture of Retinexformer.