Picture for Motonari Kambara

Motonari Kambara

Task Success Prediction for Open-Vocabulary Manipulation Based on Multi-Level Aligned Representations

Add code
Oct 01, 2024
Viaarxiv icon

Nearest Neighbor Future Captioning: Generating Descriptions for Possible Collisions in Object Placement Tasks

Add code
Jul 18, 2024
Figure 1 for Nearest Neighbor Future Captioning: Generating Descriptions for Possible Collisions in Object Placement Tasks
Figure 2 for Nearest Neighbor Future Captioning: Generating Descriptions for Possible Collisions in Object Placement Tasks
Figure 3 for Nearest Neighbor Future Captioning: Generating Descriptions for Possible Collisions in Object Placement Tasks
Figure 4 for Nearest Neighbor Future Captioning: Generating Descriptions for Possible Collisions in Object Placement Tasks
Viaarxiv icon

Object Segmentation from Open-Vocabulary Manipulation Instructions Based on Optimal Transport Polygon Matching with Multimodal Foundation Models

Add code
Jul 01, 2024
Figure 1 for Object Segmentation from Open-Vocabulary Manipulation Instructions Based on Optimal Transport Polygon Matching with Multimodal Foundation Models
Figure 2 for Object Segmentation from Open-Vocabulary Manipulation Instructions Based on Optimal Transport Polygon Matching with Multimodal Foundation Models
Figure 3 for Object Segmentation from Open-Vocabulary Manipulation Instructions Based on Optimal Transport Polygon Matching with Multimodal Foundation Models
Figure 4 for Object Segmentation from Open-Vocabulary Manipulation Instructions Based on Optimal Transport Polygon Matching with Multimodal Foundation Models
Viaarxiv icon

Learning-To-Rank Approach for Identifying Everyday Objects Using a Physical-World Search Engine

Add code
Dec 26, 2023
Viaarxiv icon

DialMAT: Dialogue-Enabled Transformer with Moment-Based Adversarial Training

Add code
Nov 12, 2023
Viaarxiv icon

Fully Automated Task Management for Generation, Execution, and Evaluation: A Framework for Fetch-and-Carry Tasks with Natural Language Instructions in Continuous Space

Add code
Nov 07, 2023
Viaarxiv icon

Switching Head-Tail Funnel UNITER for Dual Referring Expression Comprehension with Fetch-and-Carry Tasks

Add code
Jul 14, 2023
Figure 1 for Switching Head-Tail Funnel UNITER for Dual Referring Expression Comprehension with Fetch-and-Carry Tasks
Figure 2 for Switching Head-Tail Funnel UNITER for Dual Referring Expression Comprehension with Fetch-and-Carry Tasks
Figure 3 for Switching Head-Tail Funnel UNITER for Dual Referring Expression Comprehension with Fetch-and-Carry Tasks
Figure 4 for Switching Head-Tail Funnel UNITER for Dual Referring Expression Comprehension with Fetch-and-Carry Tasks
Viaarxiv icon

Relational Future Captioning Model for Explaining Likely Collisions in Daily Tasks

Add code
Jul 19, 2022
Figure 1 for Relational Future Captioning Model for Explaining Likely Collisions in Daily Tasks
Figure 2 for Relational Future Captioning Model for Explaining Likely Collisions in Daily Tasks
Figure 3 for Relational Future Captioning Model for Explaining Likely Collisions in Daily Tasks
Figure 4 for Relational Future Captioning Model for Explaining Likely Collisions in Daily Tasks
Viaarxiv icon

Case Relation Transformer: A Crossmodal Language Generation Model for Fetching Instructions

Add code
Jul 02, 2021
Figure 1 for Case Relation Transformer: A Crossmodal Language Generation Model for Fetching Instructions
Figure 2 for Case Relation Transformer: A Crossmodal Language Generation Model for Fetching Instructions
Figure 3 for Case Relation Transformer: A Crossmodal Language Generation Model for Fetching Instructions
Figure 4 for Case Relation Transformer: A Crossmodal Language Generation Model for Fetching Instructions
Viaarxiv icon