Lu Xu

Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent

Jul 31, 2024

NTIRE 2024 Challenge on Night Photography Rendering

Jun 18, 2024

Beyond Raw Videos: Understanding Edited Videos with Large Multimodal Model

Jun 15, 2024

Sparsity- and Hybridity-Inspired Visual Parameter-Efficient Fine-Tuning for Medical Diagnosis

May 28, 2024

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

May 09, 2024

End-to-end training of Multimodal Model and ranking Model

Apr 09, 2024

HUGS: Holistic Urban 3D Scene Understanding via Gaussian Splatting

Mar 19, 2024

Parameter-Efficient Conversational Recommender System as a Language Processing Task

Feb 03, 2024

PROXYQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language Models

Jan 26, 2024

The Memory Perturbation Equation: Understanding Model's Sensitivity to Data

Oct 30, 2023