Yiyi Zhou

γ-MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models

Oct 17, 2024
Via arXiv

Fit and Prune: Fast and Training-free Visual Token Pruning for Multi-modal Large Language Models

Sep 16, 2024
Via arXiv

Image Captioning via Dynamic Path Customization

Jun 01, 2024
Via arXiv

Deep Instruction Tuning for Segment Anything Model

Mar 31, 2024
Via arXiv

Not All Attention is Needed: Parameter and Computation Efficient Transfer Learning for Multi-modal Large Language Models

Mar 22, 2024
Via arXiv

Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization

Mar 11, 2024
Via arXiv

Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models

Mar 05, 2024
Via arXiv

Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks

Jan 23, 2024
Via arXiv

Towards Omni-supervised Referring Expression Segmentation

Nov 01, 2023
Via arXiv

NICE: Improving Panoptic Narrative Detection and Segmentation with Cascading Collaborative Learning

Oct 23, 2023
Via arXiv