Title: Enlighten-Your-Voice: When Multimodal Meets Zero-shot Low-light Image Enhancement

Dec 15, 2023

Xiaofeng Zhang, Zishan Xu, Hao Tang, Chaochen Gu, Wei Chen, Shanying Zhu, Xinping Guan

Abstract: Low-light image enhancement is a crucial visual task, and many unsupervised methods tend to overlook the degradation of visible information in low-light scenes, which adversely affects the fusion of complementary information and hinders the generation of satisfactory results. To address this, our study introduces "Enlighten-Your-Voice", a multimodal enhancement framework that innovatively enriches user interaction through voice and textual commands. This approach does not merely signify a technical leap but also represents a paradigm shift in user engagement. Our model is equipped with a Dual Collaborative Attention Module (DCAM) that meticulously caters to distinct content and color discrepancies, thereby facilitating nuanced enhancements. Complementarily, we introduce a Semantic Feature Fusion (SFM) plug-and-play module that synergizes semantic context with low-light enhancement operations, sharpening the algorithm's efficacy. Crucially, "Enlighten-Your-Voice" showcases remarkable generalization in unsupervised zero-shot scenarios. The source code can be accessed from https://github.com/zhangbaijin/Enlighten-Your-Voice

* arXiv admin note: text overlap with arXiv:2306.10286
