Cheng Gong

Rare Word Recognition and Translation Without Fine-Tuning via Task Vector in Speech Models

Dec 26, 2025

Efficient Reasoning via Reward Model

Nov 12, 2025

Complementary Learning System Empowers Online Continual Learning of Vehicle Motion Forecasting in Smart Cities

Aug 27, 2025

ASVspoof 5: Design, Collection and Validation of Resources for Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech

Feb 13, 2025

Characteristic-Specific Partial Fine-Tuning for Efficient Emotion and Speaker Adaptation in Codec Language Text-to-Speech Models

Jan 24, 2025

MOS-Attack: A Scalable Multi-objective Adversarial Attack Framework

Jan 13, 2025

EmoPro: A Prompt Selection Strategy for Emotional Expression in LM-based Speech Synthesis

Sep 27, 2024

VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing

Aug 11, 2024

Deep Feature Surgery: Towards Accurate and Efficient Multi-Exit Networks

Jul 19, 2024

Learning Pareto Set for Multi-Objective Continuous Robot Control

Jun 27, 2024