Picture for Selamawit Asfaw

Selamawit Asfaw

VDT-Auto: End-to-end Autonomous Driving with VLM-Guided Diffusion Transformers

Add code
Feb 27, 2025
Viaarxiv icon

Shake-VLA: Vision-Language-Action Model-Based System for Bimanual Robotic Manipulations and Liquid Mixing

Add code
Jan 12, 2025
Figure 1 for Shake-VLA: Vision-Language-Action Model-Based System for Bimanual Robotic Manipulations and Liquid Mixing
Figure 2 for Shake-VLA: Vision-Language-Action Model-Based System for Bimanual Robotic Manipulations and Liquid Mixing
Figure 3 for Shake-VLA: Vision-Language-Action Model-Based System for Bimanual Robotic Manipulations and Liquid Mixing
Viaarxiv icon

FlightAR: AR Flight Assistance Interface with Multiple Video Streams and Object Detection Aimed at Immersive Drone Control

Add code
Oct 22, 2024
Viaarxiv icon

FADet: A Multi-sensor 3D Object Detection Network based on Local Featured Attention

Add code
May 19, 2024
Viaarxiv icon