Picture for Geng Li

Geng Li

YARD: Y-Architecture Register Decoding for Efficient Hallucination Mitigation in Large Vision-Language Models

Add code
May 29, 2026
Viaarxiv icon

OccamToken: Efficient VLM Inference with Training-Free and Budget-Adaptive Token Pruning

Add code
May 28, 2026
Viaarxiv icon

Gaze2Act: Gaze-Conditioned Vision-Language-Action Policies for Interactive Robot Manipulation

Add code
May 28, 2026
Viaarxiv icon

FIKA-Bench: From Fine-grained Recognition to Fine-Grained Knowledge Acquisition

Add code
May 13, 2026
Viaarxiv icon

Assessing the Robustness of Climate Foundation Models under No-Analog Distribution Shifts

Add code
Mar 24, 2026
Viaarxiv icon

DyFo: A Training-Free Dynamic Focus Visual Search for Enhancing LMMs in Fine-Grained Visual Understanding

Add code
Apr 21, 2025
Figure 1 for DyFo: A Training-Free Dynamic Focus Visual Search for Enhancing LMMs in Fine-Grained Visual Understanding
Figure 2 for DyFo: A Training-Free Dynamic Focus Visual Search for Enhancing LMMs in Fine-Grained Visual Understanding
Figure 3 for DyFo: A Training-Free Dynamic Focus Visual Search for Enhancing LMMs in Fine-Grained Visual Understanding
Figure 4 for DyFo: A Training-Free Dynamic Focus Visual Search for Enhancing LMMs in Fine-Grained Visual Understanding
Viaarxiv icon

FingER: Content Aware Fine-grained Evaluation with Reasoning for AI-Generated Videos

Add code
Apr 14, 2025
Figure 1 for FingER: Content Aware Fine-grained Evaluation with Reasoning for AI-Generated Videos
Figure 2 for FingER: Content Aware Fine-grained Evaluation with Reasoning for AI-Generated Videos
Figure 3 for FingER: Content Aware Fine-grained Evaluation with Reasoning for AI-Generated Videos
Figure 4 for FingER: Content Aware Fine-grained Evaluation with Reasoning for AI-Generated Videos
Viaarxiv icon

Analyzing and Boosting the Power of Fine-Grained Visual Recognition for Multi-modal Large Language Models

Add code
Jan 25, 2025
Viaarxiv icon

UniRiT: Towards Few-Shot Non-Rigid Point Cloud Registration

Add code
Oct 30, 2024
Viaarxiv icon

GERA: Geometric Embedding for Efficient Point Registration Analysis

Add code
Oct 01, 2024
Viaarxiv icon