Ruiyi Zhang

A High-Quality Text-Rich Image Instruction Tuning Dataset via Hybrid Instruction Generation

Dec 20, 2024

GUI Agents: A Survey

Dec 18, 2024

Numerical Pruning for Efficient Autoregressive Models

Dec 17, 2024

SUGAR: Subject-Driven Video Customization in a Zero-Shot Manner

Dec 13, 2024

Personalized Multimodal Large Language Models: A Survey

Dec 03, 2024

Enhancing Diffusion Posterior Sampling for Inverse Problems by Integrating Crafted Measurements

Nov 15, 2024

Optimizing Data Delivery: Insights from User Preferences on Visuals, Tables, and Text

Nov 12, 2024

DynaSaur: Large Language Agents Beyond Predefined Actions

Nov 04, 2024

LoRA-Contextualizing Adaptation of Large Multimodal Models for Long Document Understanding

Nov 02, 2024

Personalization of Large Language Models: A Survey

Oct 29, 2024