Yixuan Li

video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models

Jun 18, 2025
Via arXiv

CLONE: Closed-Loop Whole-Body Humanoid Teleoperation for Long-Horizon Tasks

Jun 10, 2025
Via arXiv

Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning

Jun 05, 2025
Via arXiv

Prompt Candidates, then Distill: A Teacher-Student Framework for LLM-driven Data Annotation

Jun 04, 2025
Via arXiv

Towards Interpretability Without Sacrifice: Faithful Dense Layer Decomposition with Mixture of Decoders

May 27, 2025
Via arXiv

MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems

May 25, 2025
Via arXiv

Your Pre-trained LLM is Secretly an Unsupervised Confidence Calibrator

May 22, 2025
Via arXiv

HyGenar: An LLM-Driven Hybrid Genetic Algorithm for Few-Shot Grammar Generation

May 22, 2025
Via arXiv

Hunyuan-Game: Industrial-grade Intelligent Game Creation Model

May 20, 2025
Via arXiv

Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning

May 20, 2025
Via arXiv