AnyMod-LLVE: Low-Light Video Enhancement
with Modality-Agnostic Inference

Hangfeng Liang^{* 1 2} Yutao Hu^{* † 1 2} Yanhan Hu¹ Xiaohan Wu¹ Wenqi Shao³ Ying Fu⁴

¹ School of Computer Science and Engineering, Southeast University ² Key Laboratory of New Generation AI Technology and Its Interdisciplinary Applications, Southeast University ³ Shanghai Innovation Institute ⁴ School of Computer Science and Technology, Beijing Institute of Technology

* Equal contribution † Corresponding author

Abstract

Low-light video enhancement (LLVE) remains a challenging task due to severe information degradation under low-illumination conditions. Recent multimodal approaches have significantly improved enhancement performance by incorporating auxiliary modalities, such as event streams and infrared images. However, these methods typically assume the availability of these modalities at inference, which is often not feasible in real-world scenarios. To solve this problem, in this work, we propose AMNet, a unified multimodal framework for LLVE, to support flexible modality-agnostic inference, where auxiliary modalities may be unavailable. To address the issue of modality absence, we introduce a Spatial-Spectral Dual-Gated Translator that learns the correspondence between auxiliary modalities and RGB inputs, producing implicit auxiliary representations to support the robust enhancement. Additionally, to fully facilitate the learning of cross-modal correspondence, we conduct large-scale multimodal pretraining based on the RGB-only dataset with synthetic auxiliary modalities. Extensive experiments demonstrate that AMNet could handle arbitrary inference-time modality combinations and exhibits superior performance for LLVE under modality absence conditions.

Paper Code

Poster

Visual Results

Drag the slider to compare input and enhanced results

INPUT ENHANCED

Citation

Cite This Work

If you find this work useful, please cite our paper:


      @misc{liang2026amnet,
        title={AnyMod-LLVE: Low-Light Video Enhancement with Modality-Agnostic Inference},
        author={Hangfeng Liang and Yutao Hu and Yanhan Hu and Xiaohan Wu and Wenqi Shao and Ying Fu},
        year={2026},
        eprint={2606.11186},
        archivePrefix={arXiv},
        primaryClass={cs.CV},
        url={https://arxiv.org/abs/2606.11186}
      }

AnyMod-LLVE: Low-Light Video Enhancementwith Modality-Agnostic Inference

Demo Video

Visual Results

Citation

Cite This Work

AnyMod-LLVE: Low-Light Video Enhancement
with Modality-Agnostic Inference