Picture for Honggang Chen

Honggang Chen

Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration

Add code
Nov 26, 2024
Viaarxiv icon

M$^2$IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension

Add code
Jul 01, 2024
Figure 1 for M$^2$IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension
Figure 2 for M$^2$IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension
Figure 3 for M$^2$IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension
Figure 4 for M$^2$IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension
Viaarxiv icon

Perception- and Fidelity-aware Reduced-Reference Super-Resolution Image Quality Assessment

Add code
May 15, 2024
Viaarxiv icon

DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding

Add code
May 10, 2024
Viaarxiv icon

Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application

Add code
May 07, 2024
Figure 1 for Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application
Figure 2 for Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application
Figure 3 for Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application
Figure 4 for Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application
Viaarxiv icon

Efficient Meta-Learning Enabled Lightweight Multiscale Few-Shot Object Detection in Remote Sensing Images

Add code
Apr 29, 2024
Viaarxiv icon

VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders

Add code
Sep 03, 2023
Viaarxiv icon

Real-World Single Image Super-Resolution: A Brief Review

Add code
Mar 03, 2021
Figure 1 for Real-World Single Image Super-Resolution: A Brief Review
Figure 2 for Real-World Single Image Super-Resolution: A Brief Review
Figure 3 for Real-World Single Image Super-Resolution: A Brief Review
Figure 4 for Real-World Single Image Super-Resolution: A Brief Review
Viaarxiv icon

Pixel-Semantic Revise of Position Learning A One-Stage Object Detector with A Shared Encoder-Decoder

Add code
Jan 04, 2020
Figure 1 for Pixel-Semantic Revise of Position Learning A One-Stage Object Detector with A Shared Encoder-Decoder
Figure 2 for Pixel-Semantic Revise of Position Learning A One-Stage Object Detector with A Shared Encoder-Decoder
Figure 3 for Pixel-Semantic Revise of Position Learning A One-Stage Object Detector with A Shared Encoder-Decoder
Figure 4 for Pixel-Semantic Revise of Position Learning A One-Stage Object Detector with A Shared Encoder-Decoder
Viaarxiv icon

Accurate and Fast reconstruction of Porous Media from Extremely Limited Information Using Conditional Generative Adversarial Network

Add code
Apr 04, 2019
Figure 1 for Accurate and Fast reconstruction of Porous Media from Extremely Limited Information Using Conditional Generative Adversarial Network
Figure 2 for Accurate and Fast reconstruction of Porous Media from Extremely Limited Information Using Conditional Generative Adversarial Network
Figure 3 for Accurate and Fast reconstruction of Porous Media from Extremely Limited Information Using Conditional Generative Adversarial Network
Figure 4 for Accurate and Fast reconstruction of Porous Media from Extremely Limited Information Using Conditional Generative Adversarial Network
Viaarxiv icon