Wenxiong Kang

Pseudo-label Refinement for Improving Self-Supervised Learning Systems

Oct 18, 2024

4DStyleGaussian: Zero-shot 4D Style Transfer with Gaussian Splatting

Oct 14, 2024

ControLRM: Fast and Controllable 3D Generation via Large Reconstruction Model

Oct 12, 2024

Improving 3D Finger Traits Recognition via Generalizable Neural Rendering

Oct 12, 2024

EmoFace: Emotion-Content Disentangled Speech-Driven 3D Talking Face with Mesh Attention

Aug 21, 2024

GLDiTalker: Speech-Driven 3D Facial Animation with Graph Latent Diffusion Transformer

Aug 03, 2024

PhysMamba: Leveraging Dual-Stream Cross-Attention SSD for Remote Physiological Measurement

Aug 02, 2024

STMR: Spiral Transformer for Hand Mesh Reconstruction

Jul 08, 2024

RobustMVS: Single Domain Generalized Deep Multi-view Stereo

May 15, 2024

SeCG: Semantic-Enhanced 3D Visual Grounding via Cross-modal Graph Attention

Mar 13, 2024