Picture for Eslam Mohamed Bakr

Eslam Mohamed Bakr

iMotion-LLM: Motion Prediction Instruction Tuning

Add code
Jun 11, 2024
Viaarxiv icon

Kestrel: Point Grounding Multimodal LLM for Part-Aware 3D Vision-Language Understanding

Add code
May 29, 2024
Viaarxiv icon

ToddlerDiffusion: Flash Interpretable Controllable Diffusion Model

Add code
Nov 24, 2023
Viaarxiv icon

CoT3DRef: Chain-of-Thoughts Data-Efficient 3D Visual Grounding

Add code
Oct 10, 2023
Viaarxiv icon

HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models

Add code
Apr 11, 2023
Viaarxiv icon

ImageCaptioner$^2$: Image Captioner for Image Captioning Bias Amplification Assessment

Add code
Apr 10, 2023
Viaarxiv icon

Look Around and Refer: 2D Synthetic Semantics Knowledge Distillation for 3D Visual Grounding

Add code
Nov 25, 2022
Viaarxiv icon