Yixuan Zhou

Early warning prediction: Onsager-Machlup vs Schrödinger

Jan 29, 2026

PILOT: A Perceptive Integrated Low-level Controller for Loco-manipulation over Unstructured Scenes

Jan 24, 2026

UniSRCodec: Unified and Low-Bitrate Single Codebook Codec with Sub-Band Reconstruction

Jan 06, 2026

UltraEval-Audio: A Unified Framework for Comprehensive Evaluation of Audio Foundation Models

Jan 04, 2026

EgoLoc: A Generalizable Solution for Temporal Interaction Localization in Egocentric Videos

Aug 17, 2025

A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understanding

Aug 07, 2025

"In This Environment, As That Speaker": A Text-Driven Framework for Multi-Attribute Speech Conversion

Jun 08, 2025

Probe by Gaming: A Game-based Benchmark for Assessing Conceptual Knowledge in LLMs

May 23, 2025

UTTG: A Universal Teleoperation Approach via Online Trajectory Generation

Apr 28, 2025

DiffCSS: Diverse and Expressive Conversational Speech Synthesis with Diffusion Models

Feb 27, 2025