Picture for Xin Wang

Xin Wang

National Institute of Informatics, Japan

The Point, the Vision and the Text: Does Point Cloud Boost Spatial Reasoning of Large Language Models?

Add code
Apr 06, 2025
Viaarxiv icon

Dual-stream Transformer-GCN Model with Contextualized Representations Learning for Monocular 3D Human Pose Estimation

Add code
Apr 02, 2025
Viaarxiv icon

MAR-3D: Progressive Masked Auto-regressor for High-Resolution 3D Generation

Add code
Mar 27, 2025
Viaarxiv icon

GS-I$^{3}$: Gaussian Splatting for Surface Reconstruction from Illumination-Inconsistent Images

Add code
Mar 18, 2025
Viaarxiv icon

Gaussian On-the-Fly Splatting: A Progressive Framework for Robust Near Real-Time 3DGS Optimization

Add code
Mar 17, 2025
Viaarxiv icon

SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression

Add code
Mar 16, 2025
Viaarxiv icon

GS-3I: Gaussian Splatting for Surface Reconstruction from Illumination-Inconsistent Images

Add code
Mar 16, 2025
Viaarxiv icon

Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards

Add code
Mar 14, 2025
Viaarxiv icon

VizTrust: A Visual Analytics Tool for Capturing User Trust Dynamics in Human-AI Communication

Add code
Mar 10, 2025
Viaarxiv icon

GrInAdapt: Scaling Retinal Vessel Structural Map Segmentation Through Grounding, Integrating and Adapting Multi-device, Multi-site, and Multi-modal Fundus Domains

Add code
Mar 08, 2025
Viaarxiv icon