Picture for Pheng-Ann Heng

Pheng-Ann Heng

Omni-Captioner: Data Pipeline, Models, and Benchmark for Omni Detailed Perception

Add code
Oct 14, 2025
Viaarxiv icon

REAR: Rethinking Visual Autoregressive Models via Generator-Tokenizer Consistency Regularization

Add code
Oct 06, 2025
Viaarxiv icon

DisCo-Layout: Disentangling and Coordinating Semantic and Physical Refinement in a Multi-Agent Framework for 3D Indoor Layout Synthesis

Add code
Oct 02, 2025
Viaarxiv icon

From Supervision to Exploration: What Does Protein Language Model Learn During Reinforcement Learning?

Add code
Oct 02, 2025
Viaarxiv icon

MEJO: MLLM-Engaged Surgical Triplet Recognition via Inter- and Intra-Task Joint Optimization

Add code
Sep 16, 2025
Viaarxiv icon

From Learning to Unlearning: Biomedical Security Protection in Multimodal Large Language Models

Add code
Aug 06, 2025
Viaarxiv icon

ClipGS: Clippable Gaussian Splatting for Interactive Cinematic Visualization of Volumetric Medical Data

Add code
Jul 09, 2025
Viaarxiv icon

Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making

Add code
May 27, 2025
Viaarxiv icon

Medical Large Vision Language Models with Multi-Image Visual Ability

Add code
May 25, 2025
Viaarxiv icon

Benchmarking Laparoscopic Surgical Image Restoration and Beyond

Add code
May 25, 2025
Viaarxiv icon