Picture for Zhen Ye

Zhen Ye

Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation

Add code
Oct 14, 2024
Figure 1 for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
Figure 2 for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
Figure 3 for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
Figure 4 for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
Viaarxiv icon

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Add code
Aug 30, 2024
Figure 1 for Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Figure 2 for Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Figure 3 for Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Figure 4 for Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Viaarxiv icon

MFC-Bench: Benchmarking Multimodal Fact-Checking with Large Vision-Language Models

Add code
Jun 17, 2024
Viaarxiv icon

FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation

Add code
May 13, 2024
Viaarxiv icon

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Add code
Apr 25, 2024
Viaarxiv icon

CoMoSVC: Consistency Model-based Singing Voice Conversion

Add code
Jan 03, 2024
Viaarxiv icon

NAS-FM: Neural Architecture Search for Tunable and Interpretable Sound Synthesis based on Frequency Modulation

Add code
May 22, 2023
Viaarxiv icon

CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model

Add code
May 11, 2023
Viaarxiv icon

Pairwise Point Cloud Registration using Graph Matching and Rotation-invariant Features

Add code
May 05, 2021
Figure 1 for Pairwise Point Cloud Registration using Graph Matching and Rotation-invariant Features
Figure 2 for Pairwise Point Cloud Registration using Graph Matching and Rotation-invariant Features
Figure 3 for Pairwise Point Cloud Registration using Graph Matching and Rotation-invariant Features
Figure 4 for Pairwise Point Cloud Registration using Graph Matching and Rotation-invariant Features
Viaarxiv icon

BLVD: Building A Large-scale 5D Semantics Benchmark for Autonomous Driving

Add code
Mar 15, 2019
Figure 1 for BLVD: Building A Large-scale 5D Semantics Benchmark for Autonomous Driving
Figure 2 for BLVD: Building A Large-scale 5D Semantics Benchmark for Autonomous Driving
Figure 3 for BLVD: Building A Large-scale 5D Semantics Benchmark for Autonomous Driving
Figure 4 for BLVD: Building A Large-scale 5D Semantics Benchmark for Autonomous Driving
Viaarxiv icon