Picture for Sicheng Yang

Sicheng Yang

Recover Semantics First, Generate Better: Improved Latent Modeling for 3D MRI Reconstruction and Cross-Contrast Synthesis

Add code
Jun 16, 2026
Viaarxiv icon

SegDINO: Introducing Multi-Scale Structure into DINO for Efficient Medical Image Segmentation

Add code
Jun 16, 2026
Viaarxiv icon

ClinHallu: A Benchmark for Diagnosing Stage-Wise Hallucinations in Medical MLLM Reasoning

Add code
Jun 12, 2026
Viaarxiv icon

PolySpeech-100: A Large-Scale Benchmark for Speech Understanding Across 100+ Languages and Dialects

Add code
May 31, 2026
Viaarxiv icon

Egocentric Co-Pilot: Web-Native Smart-Glasses Agents for Assistive Egocentric AI

Add code
Mar 01, 2026
Viaarxiv icon

Optimizing Multimodal LLMs for Egocentric Video Understanding: A Solution for the HD-EPIC VQA Challenge

Add code
Jan 15, 2026
Viaarxiv icon

VQ-Seg: Vector-Quantized Token Perturbation for Semi-Supervised Medical Image Segmentation

Add code
Jan 15, 2026
Viaarxiv icon

Plug-and-Play Clarifier: A Zero-Shot Multimodal Framework for Egocentric Intent Disambiguation

Add code
Nov 12, 2025
Figure 1 for Plug-and-Play Clarifier: A Zero-Shot Multimodal Framework for Egocentric Intent Disambiguation
Figure 2 for Plug-and-Play Clarifier: A Zero-Shot Multimodal Framework for Egocentric Intent Disambiguation
Figure 3 for Plug-and-Play Clarifier: A Zero-Shot Multimodal Framework for Egocentric Intent Disambiguation
Figure 4 for Plug-and-Play Clarifier: A Zero-Shot Multimodal Framework for Egocentric Intent Disambiguation
Viaarxiv icon

VAEVQ: Enhancing Discrete Visual Tokenization through Variational Modeling

Add code
Nov 10, 2025
Viaarxiv icon

K-Stain: Keypoint-Driven Correspondence for H&E-to-IHC Virtual Staining

Add code
Nov 10, 2025
Viaarxiv icon