Picture for Yao Yao

Yao Yao

China University of Geosciences

4D SlingBAG: spatial-temporal coupled Gaussian ball for large-scale dynamic 3D photoacoustic iterative reconstruction

Add code
Dec 05, 2024
Viaarxiv icon

FATE: Full-head Gaussian Avatar with Textural Editing from Monocular Video

Add code
Nov 23, 2024
Viaarxiv icon

Model Developmental Safety: A Safety-Centric Method and Applications in Vision-Language Models

Add code
Oct 13, 2024
Figure 1 for Model Developmental Safety: A Safety-Centric Method and Applications in Vision-Language Models
Figure 2 for Model Developmental Safety: A Safety-Centric Method and Applications in Vision-Language Models
Figure 3 for Model Developmental Safety: A Safety-Centric Method and Applications in Vision-Language Models
Figure 4 for Model Developmental Safety: A Safety-Centric Method and Applications in Vision-Language Models
Viaarxiv icon

Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation

Add code
Oct 10, 2024
Figure 1 for Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation
Figure 2 for Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation
Figure 3 for Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation
Figure 4 for Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation
Viaarxiv icon

Towards Native Generative Model for 3D Head Avatar

Add code
Oct 02, 2024
Figure 1 for Towards Native Generative Model for 3D Head Avatar
Figure 2 for Towards Native Generative Model for 3D Head Avatar
Figure 3 for Towards Native Generative Model for 3D Head Avatar
Figure 4 for Towards Native Generative Model for 3D Head Avatar
Viaarxiv icon

Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models

Add code
Sep 30, 2024
Figure 1 for Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models
Figure 2 for Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models
Figure 3 for Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models
Figure 4 for Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models
Viaarxiv icon

4D Diffusion for Dynamic Protein Structure Prediction with Reference Guided Motion Alignment

Add code
Aug 22, 2024
Figure 1 for 4D Diffusion for Dynamic Protein Structure Prediction with Reference Guided Motion Alignment
Figure 2 for 4D Diffusion for Dynamic Protein Structure Prediction with Reference Guided Motion Alignment
Figure 3 for 4D Diffusion for Dynamic Protein Structure Prediction with Reference Guided Motion Alignment
Figure 4 for 4D Diffusion for Dynamic Protein Structure Prediction with Reference Guided Motion Alignment
Viaarxiv icon

Deep Joint Denoising and Detection for Enhanced Intracellular Particle Analysis

Add code
Aug 15, 2024
Viaarxiv icon

Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions

Add code
Aug 05, 2024
Figure 1 for Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions
Figure 2 for Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions
Figure 3 for Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions
Figure 4 for Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions
Viaarxiv icon

EmoTalk3D: High-Fidelity Free-View Synthesis of Emotional 3D Talking Head

Add code
Aug 01, 2024
Viaarxiv icon