Picture for Xuemiao Xu

Xuemiao Xu

TAR-TVG: Enhancing VLMs with Timestamp Anchor-Constrained Reasoning for Temporal Video Grounding

Add code
Aug 11, 2025
Viaarxiv icon

Invert4TVG: A Temporal Video Grounding Framework with Inversion Tasks for Enhanced Action Understanding

Add code
Aug 10, 2025
Viaarxiv icon

SITA: Structurally Imperceptible and Transferable Adversarial Attacks for Stylized Image Generation

Add code
Mar 25, 2025
Viaarxiv icon

SCJD: Sparse Correlation and Joint Distillation for Efficient 3D Human Pose Estimation

Add code
Mar 18, 2025
Viaarxiv icon

RecDreamer: Consistent Text-to-3D Generation via Uniform Score Distillation

Add code
Feb 18, 2025
Viaarxiv icon

Rotation-Adaptive Point Cloud Domain Generalization via Intricate Orientation Learning

Add code
Feb 04, 2025
Figure 1 for Rotation-Adaptive Point Cloud Domain Generalization via Intricate Orientation Learning
Figure 2 for Rotation-Adaptive Point Cloud Domain Generalization via Intricate Orientation Learning
Figure 3 for Rotation-Adaptive Point Cloud Domain Generalization via Intricate Orientation Learning
Figure 4 for Rotation-Adaptive Point Cloud Domain Generalization via Intricate Orientation Learning
Viaarxiv icon

DSDFormer: An Innovative Transformer-Mamba Framework for Robust High-Precision Driver Distraction Identification

Add code
Sep 09, 2024
Figure 1 for DSDFormer: An Innovative Transformer-Mamba Framework for Robust High-Precision Driver Distraction Identification
Figure 2 for DSDFormer: An Innovative Transformer-Mamba Framework for Robust High-Precision Driver Distraction Identification
Figure 3 for DSDFormer: An Innovative Transformer-Mamba Framework for Robust High-Precision Driver Distraction Identification
Figure 4 for DSDFormer: An Innovative Transformer-Mamba Framework for Robust High-Precision Driver Distraction Identification
Viaarxiv icon

VrdONE: One-stage Video Visual Relation Detection

Add code
Aug 18, 2024
Figure 1 for VrdONE: One-stage Video Visual Relation Detection
Figure 2 for VrdONE: One-stage Video Visual Relation Detection
Figure 3 for VrdONE: One-stage Video Visual Relation Detection
Figure 4 for VrdONE: One-stage Video Visual Relation Detection
Viaarxiv icon

G2Face: High-Fidelity Reversible Face Anonymization via Generative and Geometric Priors

Add code
Aug 18, 2024
Figure 1 for G2Face: High-Fidelity Reversible Face Anonymization via Generative and Geometric Priors
Figure 2 for G2Face: High-Fidelity Reversible Face Anonymization via Generative and Geometric Priors
Figure 3 for G2Face: High-Fidelity Reversible Face Anonymization via Generative and Geometric Priors
Figure 4 for G2Face: High-Fidelity Reversible Face Anonymization via Generative and Geometric Priors
Viaarxiv icon

Beat-It: Beat-Synchronized Multi-Condition 3D Dance Generation

Add code
Jul 10, 2024
Viaarxiv icon