Picture for Shuai Yang

Shuai Yang

GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation

Add code
Nov 12, 2024
Viaarxiv icon

Unified Generative and Discriminative Training for Multi-modal Large Language Models

Add code
Nov 01, 2024
Viaarxiv icon

GroupDiff: Diffusion-based Group Portrait Editing

Add code
Sep 22, 2024
Viaarxiv icon

LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation

Add code
Aug 23, 2024
Figure 1 for LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
Figure 2 for LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
Figure 3 for LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
Figure 4 for LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation
Viaarxiv icon

Deep Uncertainty-Based Explore for Index Construction and Retrieval in Recommendation System

Add code
Aug 06, 2024
Viaarxiv icon

Intelligent Artistic Typography: A Comprehensive Review of Artistic Text Design and Generation

Add code
Jul 20, 2024
Viaarxiv icon

SEED-Story: Multimodal Long Story Generation with Large Language Model

Add code
Jul 11, 2024
Viaarxiv icon

MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations

Add code
Jun 13, 2024
Viaarxiv icon

Self-Supervised Skeleton Action Representation Learning: A Benchmark and Beyond

Add code
Jun 05, 2024
Viaarxiv icon

Video Diffusion Models are Training-free Motion Interpreter and Controller

Add code
May 23, 2024
Figure 1 for Video Diffusion Models are Training-free Motion Interpreter and Controller
Figure 2 for Video Diffusion Models are Training-free Motion Interpreter and Controller
Figure 3 for Video Diffusion Models are Training-free Motion Interpreter and Controller
Figure 4 for Video Diffusion Models are Training-free Motion Interpreter and Controller
Viaarxiv icon