Picture for Ian Reid

Ian Reid

Language and Planning in Robotic Navigation: A Multilingual Evaluation of State-of-the-Art Models

Add code
Jan 07, 2025
Figure 1 for Language and Planning in Robotic Navigation: A Multilingual Evaluation of State-of-the-Art Models
Figure 2 for Language and Planning in Robotic Navigation: A Multilingual Evaluation of State-of-the-Art Models
Figure 3 for Language and Planning in Robotic Navigation: A Multilingual Evaluation of State-of-the-Art Models
Figure 4 for Language and Planning in Robotic Navigation: A Multilingual Evaluation of State-of-the-Art Models
Viaarxiv icon

3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer

Add code
Jan 02, 2025
Viaarxiv icon

Category Level 6D Object Pose Estimation from a Single RGB Image using Diffusion

Add code
Dec 16, 2024
Viaarxiv icon

Constraint-Aware Zero-Shot Vision-Language Navigation in Continuous Environments

Add code
Dec 13, 2024
Viaarxiv icon

PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos

Add code
Dec 02, 2024
Figure 1 for PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos
Figure 2 for PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos
Figure 3 for PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos
Figure 4 for PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos
Viaarxiv icon

Rethinking Weight-Averaged Model-merging

Add code
Nov 14, 2024
Viaarxiv icon

Continual LLaVA: Continual Instruction Tuning in Large Vision-Language Models

Add code
Nov 04, 2024
Figure 1 for Continual LLaVA: Continual Instruction Tuning in Large Vision-Language Models
Figure 2 for Continual LLaVA: Continual Instruction Tuning in Large Vision-Language Models
Figure 3 for Continual LLaVA: Continual Instruction Tuning in Large Vision-Language Models
Figure 4 for Continual LLaVA: Continual Instruction Tuning in Large Vision-Language Models
Viaarxiv icon

BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV Alignment

Add code
Oct 28, 2024
Figure 1 for BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV Alignment
Figure 2 for BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV Alignment
Figure 3 for BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV Alignment
Figure 4 for BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV Alignment
Viaarxiv icon

Learn 2 Rage: Experiencing The Emotional Roller Coaster That Is Reinforcement Learning

Add code
Oct 24, 2024
Figure 1 for Learn 2 Rage: Experiencing The Emotional Roller Coaster That Is Reinforcement Learning
Figure 2 for Learn 2 Rage: Experiencing The Emotional Roller Coaster That Is Reinforcement Learning
Figure 3 for Learn 2 Rage: Experiencing The Emotional Roller Coaster That Is Reinforcement Learning
Figure 4 for Learn 2 Rage: Experiencing The Emotional Roller Coaster That Is Reinforcement Learning
Viaarxiv icon

Affordance-Centric Policy Learning: Sample Efficient and Generalisable Robot Policy Learning using Affordance-Centric Task Frames

Add code
Oct 15, 2024
Figure 1 for Affordance-Centric Policy Learning: Sample Efficient and Generalisable Robot Policy Learning using Affordance-Centric Task Frames
Figure 2 for Affordance-Centric Policy Learning: Sample Efficient and Generalisable Robot Policy Learning using Affordance-Centric Task Frames
Figure 3 for Affordance-Centric Policy Learning: Sample Efficient and Generalisable Robot Policy Learning using Affordance-Centric Task Frames
Figure 4 for Affordance-Centric Policy Learning: Sample Efficient and Generalisable Robot Policy Learning using Affordance-Centric Task Frames
Viaarxiv icon