Picture for Hangting Chen

Hangting Chen

Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis

Add code
Apr 01, 2026
Viaarxiv icon

AUV: Teaching Audio Universal Vector Quantization with Single Nested Codebook

Add code
Sep 26, 2025
Viaarxiv icon

DualSpeechLM: Towards Unified Speech Understanding and Generation via Dual Speech Token Modeling with Large Language Models

Add code
Aug 12, 2025
Viaarxiv icon

Towards Hallucination-Free Music: A Reinforcement Learning Preference Optimization Framework for Reliable Song Generation

Add code
Aug 07, 2025
Viaarxiv icon

SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement

Add code
Jun 09, 2025
Viaarxiv icon

LeVo: High-Quality Song Generation with Multi-Preference Alignment

Add code
Jun 09, 2025
Figure 1 for LeVo: High-Quality Song Generation with Multi-Preference Alignment
Figure 2 for LeVo: High-Quality Song Generation with Multi-Preference Alignment
Figure 3 for LeVo: High-Quality Song Generation with Multi-Preference Alignment
Figure 4 for LeVo: High-Quality Song Generation with Multi-Preference Alignment
Viaarxiv icon

WAKE: Watermarking Audio with Key Enrichment

Add code
Jun 06, 2025
Figure 1 for WAKE: Watermarking Audio with Key Enrichment
Figure 2 for WAKE: Watermarking Audio with Key Enrichment
Figure 3 for WAKE: Watermarking Audio with Key Enrichment
Figure 4 for WAKE: Watermarking Audio with Key Enrichment
Viaarxiv icon

Layer-wise Investigation of Large-Scale Self-Supervised Music Representation Models

Add code
May 22, 2025
Viaarxiv icon

UniSep: Universal Target Audio Separation with Language Models at Scale

Add code
Mar 31, 2025
Figure 1 for UniSep: Universal Target Audio Separation with Language Models at Scale
Figure 2 for UniSep: Universal Target Audio Separation with Language Models at Scale
Figure 3 for UniSep: Universal Target Audio Separation with Language Models at Scale
Figure 4 for UniSep: Universal Target Audio Separation with Language Models at Scale
Viaarxiv icon

MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization

Add code
Jan 03, 2025
Figure 1 for MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization
Figure 2 for MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization
Figure 3 for MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization
Figure 4 for MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization
Viaarxiv icon