Picture for Jiuxiang Gu

Jiuxiang Gu

MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data

Add code
Dec 18, 2024
Viaarxiv icon

LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers

Add code
Dec 17, 2024
Viaarxiv icon

Numerical Pruning for Efficient Autoregressive Models

Add code
Dec 17, 2024
Viaarxiv icon

SUGAR: Subject-Driven Video Customization in a Zero-Shot Manner

Add code
Dec 13, 2024
Viaarxiv icon

Personalized Multimodal Large Language Models: A Survey

Add code
Dec 03, 2024
Viaarxiv icon

XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive Generation

Add code
Dec 02, 2024
Viaarxiv icon

LoRA-Contextualizing Adaptation of Large Multimodal Models for Long Document Understanding

Add code
Nov 02, 2024
Viaarxiv icon

Personalization of Large Language Models: A Survey

Add code
Oct 29, 2024
Viaarxiv icon

A Survey of Small Language Models

Add code
Oct 25, 2024
Figure 1 for A Survey of Small Language Models
Figure 2 for A Survey of Small Language Models
Figure 3 for A Survey of Small Language Models
Viaarxiv icon

VipAct: Visual-Perception Enhancement via Specialized VLM Agent Collaboration and Tool-use

Add code
Oct 21, 2024
Viaarxiv icon