Picture for Shuai Yang

Shuai Yang

OmniCam: Unified Multimodal Video Generation via Camera Control

Add code
Apr 03, 2025
Viaarxiv icon

A Survey on Remote Sensing Foundation Models: From Vision to Multimodality

Add code
Mar 28, 2025
Viaarxiv icon

Language-based Image Colorization: A Benchmark and Beyond

Add code
Mar 19, 2025
Viaarxiv icon

Alias-Free Latent Diffusion Models:Improving Fractional Shift Equivariance of Diffusion Latent Space

Add code
Mar 12, 2025
Viaarxiv icon

MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention

Add code
Mar 11, 2025
Viaarxiv icon

Balanced Image Stylization with Style Matching Score

Add code
Mar 10, 2025
Viaarxiv icon

OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images

Add code
Mar 08, 2025
Viaarxiv icon

PTDiffusion: Free Lunch for Generating Optical Illusion Hidden Pictures with Phase-Transferred Diffusion Model

Add code
Mar 08, 2025
Viaarxiv icon

Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support

Add code
Feb 26, 2025
Viaarxiv icon

Imagine360: Immersive 360 Video Generation from Perspective Anchor

Add code
Dec 04, 2024
Figure 1 for Imagine360: Immersive 360 Video Generation from Perspective Anchor
Figure 2 for Imagine360: Immersive 360 Video Generation from Perspective Anchor
Figure 3 for Imagine360: Immersive 360 Video Generation from Perspective Anchor
Figure 4 for Imagine360: Immersive 360 Video Generation from Perspective Anchor
Viaarxiv icon