Picture for Haohe Liu

Haohe Liu

HandSplat: Embedding-Driven Gaussian Splatting for High-Fidelity Hand Rendering

Add code
Mar 18, 2025
Viaarxiv icon

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Add code
Mar 11, 2025
Viaarxiv icon

Audio-FLAN: A Preliminary Release

Add code
Feb 23, 2025
Viaarxiv icon

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Add code
Feb 06, 2025
Viaarxiv icon

AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language Models

Add code
Nov 28, 2024
Viaarxiv icon

FlowSep: Language-Queried Sound Separation with Rectified Flow Matching

Add code
Sep 11, 2024
Viaarxiv icon

Efficient Audio Captioning with Encoder-Level Knowledge Distillation

Add code
Jul 19, 2024
Viaarxiv icon

Text-Queried Target Sound Event Localization

Add code
Jun 23, 2024
Figure 1 for Text-Queried Target Sound Event Localization
Figure 2 for Text-Queried Target Sound Event Localization
Figure 3 for Text-Queried Target Sound Event Localization
Figure 4 for Text-Queried Target Sound Event Localization
Viaarxiv icon

Fish Tracking, Counting, and Behaviour Analysis in Digital Aquaculture: A Comprehensive Review

Add code
Jun 20, 2024
Viaarxiv icon

Zero-Shot Audio Captioning Using Soft and Hard Prompts

Add code
Jun 10, 2024
Figure 1 for Zero-Shot Audio Captioning Using Soft and Hard Prompts
Figure 2 for Zero-Shot Audio Captioning Using Soft and Hard Prompts
Figure 3 for Zero-Shot Audio Captioning Using Soft and Hard Prompts
Figure 4 for Zero-Shot Audio Captioning Using Soft and Hard Prompts
Viaarxiv icon