Picture for Peng Gao

Peng Gao

University of Massachusetts Amherst

VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning

Add code
Apr 10, 2025
Viaarxiv icon

OmniCaptioner: One Captioner to Rule Them All

Add code
Apr 09, 2025
Viaarxiv icon

Lumina-OmniLV: A Unified Multimodal Framework for General Low-Level Vision

Add code
Apr 08, 2025
Viaarxiv icon

Localization and Tracking for Cooperative Users in Multi-RIS-assisted Systems: Theoretical Analysis and Principles of Interpretations

Add code
Apr 07, 2025
Viaarxiv icon

NCL-CIR: Noise-aware Contrastive Learning for Composed Image Retrieval

Add code
Apr 06, 2025
Viaarxiv icon

Lumina-Image 2.0: A Unified and Efficient Image Generative Framework

Add code
Mar 27, 2025
Viaarxiv icon

LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis

Add code
Mar 27, 2025
Viaarxiv icon

3DAxisPrompt: Promoting the 3D Grounding and Reasoning in GPT-4o

Add code
Mar 17, 2025
Viaarxiv icon

TIDE : Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation

Add code
Mar 10, 2025
Viaarxiv icon

CAML: Collaborative Auxiliary Modality Learning for Multi-Agent Systems

Add code
Feb 25, 2025
Viaarxiv icon