Picture for Yuheng Li

Yuheng Li

MedVista3D: Vision-Language Modeling for Reducing Diagnostic Errors in 3D CT Disease Detection, Understanding and Reporting

Add code
Sep 04, 2025
Viaarxiv icon

Enhancing Large Multimodal Models with Adaptive Sparsity and KV Cache Compression

Add code
Jul 28, 2025
Viaarxiv icon

X-Fusion: Introducing New Modality to Frozen Large Language Models

Add code
Apr 29, 2025
Viaarxiv icon

YoChameleon: Personalized Vision and Language Generation

Add code
Apr 29, 2025
Viaarxiv icon

RoMedFormer: A Rotary-Embedding Transformer Foundation Model for 3D Genito-Pelvic Structure Segmentation in MRI and CT

Add code
Mar 18, 2025
Viaarxiv icon

Towards Universal Text-driven CT Image Segmentation

Add code
Mar 08, 2025
Viaarxiv icon

Removing Distributional Discrepancies in Captions Improves Image-Text Alignment

Add code
Oct 01, 2024
Figure 1 for Removing Distributional Discrepancies in Captions Improves Image-Text Alignment
Figure 2 for Removing Distributional Discrepancies in Captions Improves Image-Text Alignment
Figure 3 for Removing Distributional Discrepancies in Captions Improves Image-Text Alignment
Figure 4 for Removing Distributional Discrepancies in Captions Improves Image-Text Alignment
Viaarxiv icon

AnatoMask: Enhancing Medical Image Segmentation with Reconstruction-guided Self-masking

Add code
Jul 09, 2024
Viaarxiv icon

Yo'LLaVA: Your Personalized Language and Vision Assistant

Add code
Jun 13, 2024
Figure 1 for Yo'LLaVA: Your Personalized Language and Vision Assistant
Figure 2 for Yo'LLaVA: Your Personalized Language and Vision Assistant
Figure 3 for Yo'LLaVA: Your Personalized Language and Vision Assistant
Figure 4 for Yo'LLaVA: Your Personalized Language and Vision Assistant
Viaarxiv icon

Mammo-CLIP: Leveraging Contrastive Language-Image Pre-training (CLIP) for Enhanced Breast Cancer Diagnosis with Multi-view Mammography

Add code
Apr 24, 2024
Viaarxiv icon