Picture for Ming Hu

Ming Hu

OPGAgent: An Agent for Auditable Dental Panoramic X-ray Interpretation

Add code
Feb 28, 2026
Viaarxiv icon

Training-Free Acceleration for Document Parsing Vision-Language Model with Hierarchical Speculative Decoding

Add code
Feb 13, 2026
Viaarxiv icon

A Vision-Language Foundation Model for Zero-shot Clinical Collaboration and Automated Concept Discovery in Dermatology

Add code
Feb 11, 2026
Viaarxiv icon

MedScope: Incentivizing "Think with Videos" for Clinical Reasoning via Coarse-to-Fine Tool Calling

Add code
Feb 11, 2026
Viaarxiv icon

OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence

Add code
Feb 09, 2026
Viaarxiv icon

AdaptOVCD: Training-Free Open-Vocabulary Remote Sensing Change Detection via Adaptive Information Fusion

Add code
Feb 06, 2026
Viaarxiv icon

LLM-Inspired Pretrain-Then-Finetune for Small-Data, Large-Scale Optimization

Add code
Feb 03, 2026
Viaarxiv icon

A General Model for Retinal Segmentation and Quantification

Add code
Jan 31, 2026
Viaarxiv icon

A Federated and Parameter-Efficient Framework for Large Language Model Training in Medicine

Add code
Jan 29, 2026
Viaarxiv icon

LLM Collusion

Add code
Jan 03, 2026
Viaarxiv icon