Picture for Dong Yang

Dong Yang

LUMEN: Longitudinal Multi-Modal Radiology Model for Prognosis and Diagnosis

Add code
Feb 24, 2026
Viaarxiv icon

Improved Evidence Extraction for Document Inconsistency Detection with LLMs

Add code
Jan 06, 2026
Viaarxiv icon

Better Tokens for Better 3D: Advancing Vision-Language Modeling in 3D Medical Imaging

Add code
Oct 23, 2025
Viaarxiv icon

Exploring Task Performance with Interpretable Models via Sparse Auto-Encoders

Add code
Jul 08, 2025
Figure 1 for Exploring Task Performance with Interpretable Models via Sparse Auto-Encoders
Figure 2 for Exploring Task Performance with Interpretable Models via Sparse Auto-Encoders
Figure 3 for Exploring Task Performance with Interpretable Models via Sparse Auto-Encoders
Figure 4 for Exploring Task Performance with Interpretable Models via Sparse Auto-Encoders
Viaarxiv icon

Flexible and Efficient Drift Detection without Labels

Add code
Jun 10, 2025
Figure 1 for Flexible and Efficient Drift Detection without Labels
Figure 2 for Flexible and Efficient Drift Detection without Labels
Figure 3 for Flexible and Efficient Drift Detection without Labels
Figure 4 for Flexible and Efficient Drift Detection without Labels
Viaarxiv icon

Improved LLM Agents for Financial Document Question Answering

Add code
Jun 10, 2025
Figure 1 for Improved LLM Agents for Financial Document Question Answering
Figure 2 for Improved LLM Agents for Financial Document Question Answering
Figure 3 for Improved LLM Agents for Financial Document Question Answering
Figure 4 for Improved LLM Agents for Financial Document Question Answering
Viaarxiv icon

Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis

Add code
May 18, 2025
Viaarxiv icon

Text2CT: Towards 3D CT Volume Generation from Free-text Descriptions Using Diffusion Model

Add code
May 07, 2025
Figure 1 for Text2CT: Towards 3D CT Volume Generation from Free-text Descriptions Using Diffusion Model
Figure 2 for Text2CT: Towards 3D CT Volume Generation from Free-text Descriptions Using Diffusion Model
Figure 3 for Text2CT: Towards 3D CT Volume Generation from Free-text Descriptions Using Diffusion Model
Figure 4 for Text2CT: Towards 3D CT Volume Generation from Free-text Descriptions Using Diffusion Model
Viaarxiv icon

Span-level Emotion-Cause-Category Triplet Extraction with Instruction Tuning LLMs and Data Augmentation

Add code
Apr 13, 2025
Viaarxiv icon

Graph Learning-Driven Multi-Vessel Association: Fusing Multimodal Data for Maritime Intelligence

Add code
Apr 12, 2025
Figure 1 for Graph Learning-Driven Multi-Vessel Association: Fusing Multimodal Data for Maritime Intelligence
Figure 2 for Graph Learning-Driven Multi-Vessel Association: Fusing Multimodal Data for Maritime Intelligence
Figure 3 for Graph Learning-Driven Multi-Vessel Association: Fusing Multimodal Data for Maritime Intelligence
Figure 4 for Graph Learning-Driven Multi-Vessel Association: Fusing Multimodal Data for Maritime Intelligence
Viaarxiv icon