Picture for Yiwen Guo

Yiwen Guo

MSR-Codec: A Low-Bitrate Multi-Stream Residual Codec for High-Fidelity Speech Generation with Information Disentanglement

Add code
Sep 16, 2025
Viaarxiv icon

Emotion Omni: Enabling Empathetic Speech Response Generation through Large Language Models

Add code
Aug 26, 2025
Viaarxiv icon

Triple X: A LLM-Based Multilingual Speech Recognition System for the INTERSPEECH2025 MLC-SLM Challenge

Add code
Jul 23, 2025
Viaarxiv icon

Identifying and Understanding Cross-Class Features in Adversarial Training

Add code
Jun 05, 2025
Viaarxiv icon

Cultivating Game Sense for Yourself: Making VLMs Gaming Experts

Add code
Mar 27, 2025
Viaarxiv icon

Add-One-In: Incremental Sample Selection for Large Language Models via a Choice-Based Greedy Paradigm

Add code
Mar 04, 2025
Viaarxiv icon

Self-Consistency of the Internal Reward Models Improves Self-Rewarding Language Models

Add code
Feb 13, 2025
Viaarxiv icon

Enhancing Expressive Voice Conversion with Discrete Pitch-Conditioned Flow Matching Model

Add code
Feb 08, 2025
Figure 1 for Enhancing Expressive Voice Conversion with Discrete Pitch-Conditioned Flow Matching Model
Figure 2 for Enhancing Expressive Voice Conversion with Discrete Pitch-Conditioned Flow Matching Model
Figure 3 for Enhancing Expressive Voice Conversion with Discrete Pitch-Conditioned Flow Matching Model
Figure 4 for Enhancing Expressive Voice Conversion with Discrete Pitch-Conditioned Flow Matching Model
Viaarxiv icon

CueTip: An Interactive and Explainable Physics-aware Pool Assistant

Add code
Jan 30, 2025
Viaarxiv icon

Multi-Task Model Merging via Adaptive Weight Disentanglement

Add code
Nov 27, 2024
Figure 1 for Multi-Task Model Merging via Adaptive Weight Disentanglement
Figure 2 for Multi-Task Model Merging via Adaptive Weight Disentanglement
Figure 3 for Multi-Task Model Merging via Adaptive Weight Disentanglement
Figure 4 for Multi-Task Model Merging via Adaptive Weight Disentanglement
Viaarxiv icon