Chen Yang

University of Science and Technology of China

LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors

Dec 12, 2024

Realistic Surgical Simulation from Monocular Videos

Dec 03, 2024

Facial Features Matter: a Dynamic Watermark based Proactive Deepfake Detection Approach

Nov 22, 2024

CoPS: Empowering LLM Agents with Provable Cross-Task Experience Sharing

Oct 22, 2024

SF-Speech: Straightened Flow for Zero-Shot Voice Clone on Small-Scale Dataset

Oct 16, 2024

GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks

Sep 26, 2024

CHASE: 3D-Consistent Human Avatars with Sparse Inputs via Gaussian Splatting and Contrastive Learning

Aug 20, 2024

SG-GS: Photo-realistic Animatable Human Avatars with Semantically-Guided Gaussian Splatting

Aug 19, 2024

Revisiting Reciprocal Recommender Systems: Metrics, Formulation, and Method

Aug 19, 2024

Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation

Jul 07, 2024