Picture for Minheng Ni

Minheng Ni

Don't Let Your Robot be Harmful: Responsible Robotic Manipulation

Add code
Nov 27, 2024
Viaarxiv icon

Visual-O1: Understanding Ambiguous Instructions via Multi-modal Multi-turn Chain-of-thoughts Reasoning

Add code
Oct 04, 2024
Figure 1 for Visual-O1: Understanding Ambiguous Instructions via Multi-modal Multi-turn Chain-of-thoughts Reasoning
Figure 2 for Visual-O1: Understanding Ambiguous Instructions via Multi-modal Multi-turn Chain-of-thoughts Reasoning
Figure 3 for Visual-O1: Understanding Ambiguous Instructions via Multi-modal Multi-turn Chain-of-thoughts Reasoning
Figure 4 for Visual-O1: Understanding Ambiguous Instructions via Multi-modal Multi-turn Chain-of-thoughts Reasoning
Viaarxiv icon

AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition

Add code
Aug 21, 2024
Figure 1 for AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition
Figure 2 for AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition
Figure 3 for AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition
Figure 4 for AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition
Viaarxiv icon

Responsible Visual Editing

Add code
Apr 08, 2024
Viaarxiv icon

Ref-Diff: Zero-shot Referring Image Segmentation with Generative Models

Add code
Sep 01, 2023
Viaarxiv icon

ORES: Open-vocabulary Responsible Visual Synthesis

Add code
Aug 26, 2023
Viaarxiv icon

NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation

Add code
Mar 22, 2023
Viaarxiv icon

Learning 3D Photography Videos via Self-supervised Diffusion on Single Images

Add code
Feb 21, 2023
Figure 1 for Learning 3D Photography Videos via Self-supervised Diffusion on Single Images
Figure 2 for Learning 3D Photography Videos via Self-supervised Diffusion on Single Images
Figure 3 for Learning 3D Photography Videos via Self-supervised Diffusion on Single Images
Figure 4 for Learning 3D Photography Videos via Self-supervised Diffusion on Single Images
Viaarxiv icon

ImaginaryNet: Learning Object Detectors without Real Images and Annotations

Add code
Oct 13, 2022
Figure 1 for ImaginaryNet: Learning Object Detectors without Real Images and Annotations
Figure 2 for ImaginaryNet: Learning Object Detectors without Real Images and Annotations
Figure 3 for ImaginaryNet: Learning Object Detectors without Real Images and Annotations
Figure 4 for ImaginaryNet: Learning Object Detectors without Real Images and Annotations
Viaarxiv icon

NÜWA-LIP: Language Guided Image Inpainting with Defect-free VQGAN

Add code
Feb 10, 2022
Viaarxiv icon