Qifeng Liu

Autonomous Driving in Unstructured Environments: How Far Have We Come?

Oct 10, 2024

PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion

Sep 16, 2024

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Aug 30, 2024

Importance Weighting Can Help Large Language Models Self-Improve

Aug 19, 2024

STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs

Aug 03, 2024

MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions

Jul 30, 2024

M-LRM: Multi-view Large Reconstruction Model

Jun 11, 2024

VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling

Jun 06, 2024

LLMs Meet Multimodal Generation and Editing: A Survey

May 29, 2024

CoCoGesture: Toward Coherent Co-speech 3D Gesture Generation in the Wild

May 27, 2024