Jingxiang Sun

VTok: A Unified Video Tokenizer with Decoupled Spatial-Temporal Latents

Feb 04, 2026
Via arXiv

FlexAvatar: Flexible Large Reconstruction Model for Animatable Gaussian Head Avatars with Detailed Deformation

Dec 19, 2025
Via arXiv

GeoSAM2: Unleashing the Power of SAM2 for 3D Part Segmentation

Aug 19, 2025
Via arXiv

SemanticSplat: Feed-Forward 3D Scene Understanding with Language-Aware Gaussian Fields

Jun 11, 2025
Via arXiv

Parametric Gaussian Human Model: Generalizable Prior for Efficient and Realistic Human Avatar Modeling

Jun 07, 2025
Via arXiv

DetailGen3D: Generative 3D Geometry Enhancement via Data-Dependent Flow

Nov 25, 2024
Via arXiv

DreamCraft3D++: Efficient Hierarchical 3D Generation with Multi-Plane Reconstruction Model

Oct 16, 2024
Via arXiv

Human4DiT: Free-view Human Video Generation with 4D Diffusion Transformer

May 27, 2024
Via arXiv

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Mar 11, 2024
Via arXiv

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Jan 05, 2024
Via arXiv