Minheng Ni

Visual-O1: Understanding Ambiguous Instructions via Multi-modal Multi-turn Chain-of-thoughts Reasoning

Oct 04, 2024

AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition

Aug 21, 2024

Responsible Visual Editing

Apr 08, 2024

Ref-Diff: Zero-shot Referring Image Segmentation with Generative Models

Sep 01, 2023

ORES: Open-vocabulary Responsible Visual Synthesis

Aug 26, 2023

NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation

Mar 22, 2023

Learning 3D Photography Videos via Self-supervised Diffusion on Single Images

Feb 21, 2023

ImaginaryNet: Learning Object Detectors without Real Images and Annotations

Oct 13, 2022

NÜWA-LIP: Language Guided Image Inpainting with Defect-free VQGAN

Feb 10, 2022

Co-GAT: A Co-Interactive Graph Attention Network for Joint Dialog Act Recognition and Sentiment Classification

Dec 24, 2020