Picture for Bingliang Li

Bingliang Li

Tri-Ergon: Fine-grained Video-to-Audio Generation with Multi-modal Conditions and LUFS Control

Add code
Dec 29, 2024
Viaarxiv icon

Open-World Human-Object Interaction Detection via Multi-modal Prompts

Add code
Jun 11, 2024
Viaarxiv icon

FreeMan: Towards Benchmarking 3D Human Pose Estimation in the Wild

Add code
Sep 12, 2023
Viaarxiv icon

Dance with You: The Diversity Controllable Dancer Generation via Diffusion Models

Add code
Sep 04, 2023
Viaarxiv icon

Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model

Add code
May 20, 2023
Viaarxiv icon