Picture for Hongliang Li

Hongliang Li

Towards Cost-Effective Reward Guided Text Generation

Add code
Feb 06, 2025
Figure 1 for Towards Cost-Effective Reward Guided Text Generation
Figure 2 for Towards Cost-Effective Reward Guided Text Generation
Figure 3 for Towards Cost-Effective Reward Guided Text Generation
Figure 4 for Towards Cost-Effective Reward Guided Text Generation
Viaarxiv icon

EgoMe: Follow Me via Egocentric View in Real World

Add code
Jan 31, 2025
Viaarxiv icon

Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs

Add code
Jan 31, 2025
Figure 1 for Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs
Figure 2 for Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs
Figure 3 for Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs
Figure 4 for Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs
Viaarxiv icon

Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection

Add code
Jan 28, 2025
Viaarxiv icon

HiHa: Introducing Hierarchical Harmonic Decomposition to Implicit Neural Compression for Atmospheric Data

Add code
Nov 09, 2024
Viaarxiv icon

ARIC: An Activity Recognition Dataset in Classroom Surveillance Images

Add code
Oct 16, 2024
Figure 1 for ARIC: An Activity Recognition Dataset in Classroom Surveillance Images
Figure 2 for ARIC: An Activity Recognition Dataset in Classroom Surveillance Images
Figure 3 for ARIC: An Activity Recognition Dataset in Classroom Surveillance Images
Figure 4 for ARIC: An Activity Recognition Dataset in Classroom Surveillance Images
Viaarxiv icon

Cognition Transferring and Decoupling for Text-supervised Egocentric Semantic Segmentation

Add code
Oct 02, 2024
Figure 1 for Cognition Transferring and Decoupling for Text-supervised Egocentric Semantic Segmentation
Figure 2 for Cognition Transferring and Decoupling for Text-supervised Egocentric Semantic Segmentation
Figure 3 for Cognition Transferring and Decoupling for Text-supervised Egocentric Semantic Segmentation
Figure 4 for Cognition Transferring and Decoupling for Text-supervised Egocentric Semantic Segmentation
Viaarxiv icon

Few-Shot Continual Learning for Activity Recognition in Classroom Surveillance Images

Add code
Sep 05, 2024
Viaarxiv icon

DocLayLLM: An Efficient and Effective Multi-modal Extension of Large Language Models for Text-rich Document Understanding

Add code
Aug 27, 2024
Figure 1 for DocLayLLM: An Efficient and Effective Multi-modal Extension of Large Language Models for Text-rich Document Understanding
Figure 2 for DocLayLLM: An Efficient and Effective Multi-modal Extension of Large Language Models for Text-rich Document Understanding
Figure 3 for DocLayLLM: An Efficient and Effective Multi-modal Extension of Large Language Models for Text-rich Document Understanding
Figure 4 for DocLayLLM: An Efficient and Effective Multi-modal Extension of Large Language Models for Text-rich Document Understanding
Viaarxiv icon

Distribution-Level Memory Recall for Continual Learning: Preserving Knowledge and Avoiding Confusion

Add code
Aug 04, 2024
Figure 1 for Distribution-Level Memory Recall for Continual Learning: Preserving Knowledge and Avoiding Confusion
Figure 2 for Distribution-Level Memory Recall for Continual Learning: Preserving Knowledge and Avoiding Confusion
Figure 3 for Distribution-Level Memory Recall for Continual Learning: Preserving Knowledge and Avoiding Confusion
Figure 4 for Distribution-Level Memory Recall for Continual Learning: Preserving Knowledge and Avoiding Confusion
Viaarxiv icon