Bin Li

Member, IEEE

UniMIC: Token-Based Multimodal Interactive Coding for Human-AI Collaboration

Sep 26, 2025
Via arXiv

Training-Free Pyramid Token Pruning for Efficient Large Vision-Language Models via Region, Token, and Instruction-Guided Importance

Sep 19, 2025
Via arXiv

MedFact-R1: Towards Factual Medical Reasoning via Pseudo-Label Augmentation

Sep 18, 2025
Via arXiv

HERO: Rethinking Visual Token Early Dropping in High-Resolution Large Vision-Language Models

Sep 16, 2025
Via arXiv

Brought a Gun to a Knife Fight: Modern VFM Baselines Outgun Specialized Detectors on In-the-Wild AI Image Detection

Sep 16, 2025
Via arXiv

On the Regularity and Fairness of Combinatorial Multi-Armed Bandit

Sep 15, 2025
Via arXiv

Webly-Supervised Image Manipulation Localization via Category-Aware Auto-Annotation

Aug 28, 2025
Via arXiv

From Intent to Execution: Multimodal Chain-of-Thought Reinforcement Learning for Precise CAD Code Generation

Aug 13, 2025
Via arXiv

Interpretable Oracle Bone Script Decipherment through Radical and Pictographic Analysis with LVLMs

Aug 13, 2025
Via arXiv

CLUE: Leveraging Low-Rank Adaptation to Capture Latent Uncovered Evidence for Image Forgery Localization

Aug 10, 2025
Via arXiv