Picture for Chen Xu

Chen Xu

Power Homotopy for Zeroth-Order Non-Convex Optimizations

Add code
Nov 17, 2025
Viaarxiv icon

Is It Truly Necessary to Process and Fit Minutes-Long Reference Videos for Personalized Talking Face Generation?

Add code
Nov 11, 2025
Viaarxiv icon

LLM-as-a-Supervisor: Mistaken Therapeutic Behaviors Trigger Targeted Supervisory Feedback

Add code
Aug 12, 2025
Figure 1 for LLM-as-a-Supervisor: Mistaken Therapeutic Behaviors Trigger Targeted Supervisory Feedback
Figure 2 for LLM-as-a-Supervisor: Mistaken Therapeutic Behaviors Trigger Targeted Supervisory Feedback
Figure 3 for LLM-as-a-Supervisor: Mistaken Therapeutic Behaviors Trigger Targeted Supervisory Feedback
Figure 4 for LLM-as-a-Supervisor: Mistaken Therapeutic Behaviors Trigger Targeted Supervisory Feedback
Viaarxiv icon

Step-Audio 2 Technical Report

Add code
Jul 24, 2025
Figure 1 for Step-Audio 2 Technical Report
Figure 2 for Step-Audio 2 Technical Report
Figure 3 for Step-Audio 2 Technical Report
Figure 4 for Step-Audio 2 Technical Report
Viaarxiv icon

Prototypical Progressive Alignment and Reweighting for Generalizable Semantic Segmentation

Add code
Jul 16, 2025
Figure 1 for Prototypical Progressive Alignment and Reweighting for Generalizable Semantic Segmentation
Figure 2 for Prototypical Progressive Alignment and Reweighting for Generalizable Semantic Segmentation
Figure 3 for Prototypical Progressive Alignment and Reweighting for Generalizable Semantic Segmentation
Figure 4 for Prototypical Progressive Alignment and Reweighting for Generalizable Semantic Segmentation
Viaarxiv icon

Precise Zero-Shot Pointwise Ranking with LLMs through Post-Aggregated Global Context Information

Add code
Jun 12, 2025
Viaarxiv icon

Collapsing Sequence-Level Data-Policy Coverage via Poisoning Attack in Offline Reinforcement Learning

Add code
Jun 12, 2025
Viaarxiv icon

Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model

Add code
Jun 10, 2025
Figure 1 for Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model
Figure 2 for Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model
Figure 3 for Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model
Figure 4 for Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model
Viaarxiv icon

A Survey of Automatic Evaluation Methods on Text, Visual and Speech Generations

Add code
Jun 06, 2025
Viaarxiv icon

T2I-Eval-R1: Reinforcement Learning-Driven Reasoning for Interpretable Text-to-Image Evaluation

Add code
May 23, 2025
Viaarxiv icon