Motonari Kambara

Pre-Manipulation Alignment Prediction with Parallel Deep State-Space and Transformer Models

Sep 17, 2025
Via arXiv

Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement

Jan 28, 2025
Via arXiv

Future Success Prediction in Open-Vocabulary Object Manipulation Tasks Based on End-Effector Trajectories

Jan 08, 2025
Via arXiv

Task Success Prediction and Open-Vocabulary Object Manipulation

Dec 26, 2024
Via arXiv

Task Success Prediction for Open-Vocabulary Manipulation Based on Multi-Level Aligned Representations

Oct 01, 2024
Via arXiv

Nearest Neighbor Future Captioning: Generating Descriptions for Possible Collisions in Object Placement Tasks

Jul 18, 2024
Via arXiv

Object Segmentation from Open-Vocabulary Manipulation Instructions Based on Optimal Transport Polygon Matching with Multimodal Foundation Models

Jul 01, 2024
Via arXiv

Learning-To-Rank Approach for Identifying Everyday Objects Using a Physical-World Search Engine

Dec 26, 2023
Via arXiv

DialMAT: Dialogue-Enabled Transformer with Moment-Based Adversarial Training

Nov 12, 2023
Via arXiv

Fully Automated Task Management for Generation, Execution, and Evaluation: A Framework for Fetch-and-Carry Tasks with Natural Language Instructions in Continuous Space

Nov 07, 2023
Via arXiv