Picture for Fan Ma

Fan Ma

DreamDPO: Aligning Text-to-3D Generation with Human Preferences via Direct Preference Optimization

Add code
Feb 05, 2025
Viaarxiv icon

TV-Dialogue: Crafting Theme-Aware Video Dialogues with Immersive Interaction

Add code
Jan 31, 2025
Viaarxiv icon

BrainGuard: Privacy-Preserving Multisubject Image Reconstructions from Brain Activities

Add code
Jan 24, 2025
Viaarxiv icon

InfiniDreamer: Arbitrarily Long Human Motion Generation via Segment Score Distillation

Add code
Nov 27, 2024
Viaarxiv icon

AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks

Add code
Nov 24, 2024
Viaarxiv icon

Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy

Add code
Nov 24, 2024
Figure 1 for Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy
Figure 2 for Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy
Figure 3 for Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy
Figure 4 for Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy
Viaarxiv icon

Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models

Add code
Nov 14, 2024
Figure 1 for Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models
Figure 2 for Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models
Figure 3 for Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models
Figure 4 for Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models
Viaarxiv icon

Autonomous LLM-Enhanced Adversarial Attack for Text-to-Motion

Add code
Aug 01, 2024
Viaarxiv icon

VividDreamer: Invariant Score Distillation For Hyper-Realistic Text-to-3D Generation

Add code
Jul 13, 2024
Figure 1 for VividDreamer: Invariant Score Distillation For Hyper-Realistic Text-to-3D Generation
Figure 2 for VividDreamer: Invariant Score Distillation For Hyper-Realistic Text-to-3D Generation
Figure 3 for VividDreamer: Invariant Score Distillation For Hyper-Realistic Text-to-3D Generation
Figure 4 for VividDreamer: Invariant Score Distillation For Hyper-Realistic Text-to-3D Generation
Viaarxiv icon

MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis

Add code
Jul 02, 2024
Figure 1 for MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis
Figure 2 for MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis
Figure 3 for MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis
Figure 4 for MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis
Viaarxiv icon