Picture for Shuai Yang

Shuai Yang

Imagine360: Immersive 360 Video Generation from Perspective Anchor

Add code
Dec 04, 2024
Viaarxiv icon

Trajectory Attention for Fine-grained Video Motion Control

Add code
Nov 28, 2024
Viaarxiv icon

GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation

Add code
Nov 12, 2024
Viaarxiv icon

Unified Generative and Discriminative Training for Multi-modal Large Language Models

Add code
Nov 01, 2024
Figure 1 for Unified Generative and Discriminative Training for Multi-modal Large Language Models
Figure 2 for Unified Generative and Discriminative Training for Multi-modal Large Language Models
Figure 3 for Unified Generative and Discriminative Training for Multi-modal Large Language Models
Figure 4 for Unified Generative and Discriminative Training for Multi-modal Large Language Models
Viaarxiv icon

GroupDiff: Diffusion-based Group Portrait Editing

Add code
Sep 22, 2024
Viaarxiv icon

LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation

Add code
Aug 23, 2024
Figure 1 for LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
Figure 2 for LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
Figure 3 for LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
Figure 4 for LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
Viaarxiv icon

Deep Uncertainty-Based Explore for Index Construction and Retrieval in Recommendation System

Add code
Aug 06, 2024
Viaarxiv icon

Intelligent Artistic Typography: A Comprehensive Review of Artistic Text Design and Generation

Add code
Jul 20, 2024
Viaarxiv icon

SEED-Story: Multimodal Long Story Generation with Large Language Model

Add code
Jul 11, 2024
Viaarxiv icon

MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations

Add code
Jun 13, 2024
Viaarxiv icon