Picture for Xinlei Chen

Xinlei Chen

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation

Add code
Jan 16, 2025
Viaarxiv icon

Gaussian Masked Autoencoders

Add code
Jan 06, 2025
Viaarxiv icon

MR-COGraphs: Communication-efficient Multi-Robot Open-vocabulary Mapping System via 3D Scene Graphs

Add code
Dec 24, 2024
Viaarxiv icon

MetaMorph: Multimodal Understanding and Generation via Instruction Tuning

Add code
Dec 18, 2024
Viaarxiv icon

On the Surprising Effectiveness of Attention Transfer for Vision Transformers

Add code
Nov 14, 2024
Viaarxiv icon

SniffySquad: Patchiness-Aware Gas Source Localization with Multi-Robot Collaboration

Add code
Nov 09, 2024
Viaarxiv icon

Learning Video Representations without Natural Videos

Add code
Oct 31, 2024
Viaarxiv icon

EmbodiedCity: A Benchmark Platform for Embodied Agent in Real-world City Environment

Add code
Oct 12, 2024
Viaarxiv icon

Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers

Add code
Sep 30, 2024
Viaarxiv icon

Range-SLAM: Ultra-Wideband-Based Smoke-Resistant Real-Time Localization and Mapping

Add code
Sep 15, 2024
Figure 1 for Range-SLAM: Ultra-Wideband-Based Smoke-Resistant Real-Time Localization and Mapping
Figure 2 for Range-SLAM: Ultra-Wideband-Based Smoke-Resistant Real-Time Localization and Mapping
Figure 3 for Range-SLAM: Ultra-Wideband-Based Smoke-Resistant Real-Time Localization and Mapping
Figure 4 for Range-SLAM: Ultra-Wideband-Based Smoke-Resistant Real-Time Localization and Mapping
Viaarxiv icon