Picture for Conghui He

Conghui He

Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models

Add code
Jun 15, 2025
Viaarxiv icon

VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos

Add code
Jun 12, 2025
Viaarxiv icon

GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition

Add code
Jun 09, 2025
Viaarxiv icon

Multi-Step Visual Reasoning with Visual Tokens Scaling and Verification

Add code
Jun 08, 2025
Viaarxiv icon

Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning

Add code
Jun 08, 2025
Viaarxiv icon

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Add code
May 25, 2025
Viaarxiv icon

A Survey of LLM $\times$ DATA

Add code
May 24, 2025
Viaarxiv icon

Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering

Add code
May 22, 2025
Viaarxiv icon

IDEAL: Data Equilibrium Adaptation for Multi-Capability Language Model Alignment

Add code
May 19, 2025
Viaarxiv icon

Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning

Add code
May 18, 2025
Viaarxiv icon