Wenxiong Kang

Pseudo-label Refinement for Improving Self-Supervised Learning Systems

Oct 18, 2024

4DStyleGaussian: Zero-shot 4D Style Transfer with Gaussian Splatting

Oct 14, 2024

Improving 3D Finger Traits Recognition via Generalizable Neural Rendering

Oct 12, 2024

ControLRM: Fast and Controllable 3D Generation via Large Reconstruction Model

Oct 12, 2024

EmoFace: Emotion-Content Disentangled Speech-Driven 3D Talking Face with Mesh Attention

Aug 21, 2024

GLDiTalker: Speech-Driven 3D Facial Animation with Graph Latent Diffusion Transformer

Aug 03, 2024

PhysMamba: Leveraging Dual-Stream Cross-Attention SSD for Remote Physiological Measurement

Aug 02, 2024

STMR: Spiral Transformer for Hand Mesh Reconstruction

Jul 08, 2024

RobustMVS: Single Domain Generalized Deep Multi-view Stereo

May 15, 2024

SeCG: Semantic-Enhanced 3D Visual Grounding via Cross-modal Graph Attention

Mar 13, 2024