Picture for Xinyi Liu

Xinyi Liu

Cross-View Geo-Localization with Street-View and VHR Satellite Imagery in Decentrality Settings

Add code
Dec 16, 2024
Viaarxiv icon

V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and Prediction

Add code
Dec 02, 2024
Figure 1 for V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and Prediction
Figure 2 for V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and Prediction
Figure 3 for V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and Prediction
Figure 4 for V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and Prediction
Viaarxiv icon

Data-Centric and Heterogeneity-Adaptive Sequence Parallelism for Efficient LLM Training

Add code
Dec 02, 2024
Viaarxiv icon

AutoGLM: Autonomous Foundation Agents for GUIs

Add code
Oct 28, 2024
Viaarxiv icon

Perturbation-based Graph Active Learning for Weakly-Supervised Belief Representation Learning

Add code
Oct 24, 2024
Viaarxiv icon

RANSAC Back to SOTA: A Two-stage Consensus Filtering for Real-time 3D Registration

Add code
Oct 21, 2024
Figure 1 for RANSAC Back to SOTA: A Two-stage Consensus Filtering for Real-time 3D Registration
Figure 2 for RANSAC Back to SOTA: A Two-stage Consensus Filtering for Real-time 3D Registration
Figure 3 for RANSAC Back to SOTA: A Two-stage Consensus Filtering for Real-time 3D Registration
Figure 4 for RANSAC Back to SOTA: A Two-stage Consensus Filtering for Real-time 3D Registration
Viaarxiv icon

SplaTraj: Camera Trajectory Generation with Semantic Gaussian Splatting

Add code
Oct 08, 2024
Figure 1 for SplaTraj: Camera Trajectory Generation with Semantic Gaussian Splatting
Figure 2 for SplaTraj: Camera Trajectory Generation with Semantic Gaussian Splatting
Figure 3 for SplaTraj: Camera Trajectory Generation with Semantic Gaussian Splatting
Figure 4 for SplaTraj: Camera Trajectory Generation with Semantic Gaussian Splatting
Viaarxiv icon

MHAD: Multimodal Home Activity Dataset with Multi-Angle Videos and Synchronized Physiological Signals

Add code
Sep 14, 2024
Viaarxiv icon

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

Add code
Aug 12, 2024
Figure 1 for VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
Figure 2 for VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
Figure 3 for VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
Figure 4 for VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
Viaarxiv icon

Gaussian Lane Keeping: A Robust Prediction Baseline

Add code
Jul 26, 2024
Viaarxiv icon