Ian Reid

Motion Anything: Any to Motion Generation

Mar 10, 2025

Action Tokenizer Matters in In-Context Imitation Learning

Mar 05, 2025

Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting

Feb 20, 2025

Language and Planning in Robotic Navigation: A Multilingual Evaluation of State-of-the-Art Models

Jan 07, 2025

3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer

Jan 02, 2025

Category Level 6D Object Pose Estimation from a Single RGB Image using Diffusion

Dec 16, 2024

Constraint-Aware Zero-Shot Vision-Language Navigation in Continuous Environments

Dec 13, 2024

PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos

Dec 02, 2024

Rethinking Weight-Averaged Model-merging

Nov 14, 2024

Continual LLaVA: Continual Instruction Tuning in Large Vision-Language Models

Nov 04, 2024