Picture for Lu Xu

Lu Xu

Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent

Add code
Jul 31, 2024
Viaarxiv icon

NTIRE 2024 Challenge on Night Photography Rendering

Add code
Jun 18, 2024
Viaarxiv icon

Beyond Raw Videos: Understanding Edited Videos with Large Multimodal Model

Add code
Jun 15, 2024
Viaarxiv icon

Sparsity- and Hybridity-Inspired Visual Parameter-Efficient Fine-Tuning for Medical Diagnosis

Add code
May 28, 2024
Viaarxiv icon

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

Add code
May 09, 2024
Viaarxiv icon

End-to-end training of Multimodal Model and ranking Model

Add code
Apr 09, 2024
Viaarxiv icon

HUGS: Holistic Urban 3D Scene Understanding via Gaussian Splatting

Add code
Mar 19, 2024
Viaarxiv icon

Parameter-Efficient Conversational Recommender System as a Language Processing Task

Add code
Feb 03, 2024
Viaarxiv icon

PROXYQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language Models

Add code
Jan 26, 2024
Viaarxiv icon

The Memory Perturbation Equation: Understanding Model's Sensitivity to Data

Add code
Oct 30, 2023
Viaarxiv icon