Picture for Abdelrahman Mohamed

Abdelrahman Mohamed

Casablanca: Data and Models for Multidialectal Arabic Speech Recognition

Add code
Oct 06, 2024
Viaarxiv icon

fCOP: Focal Length Estimation from Category-level Object Priors

Add code
Sep 29, 2024
Figure 1 for fCOP: Focal Length Estimation from Category-level Object Priors
Figure 2 for fCOP: Focal Length Estimation from Category-level Object Priors
Figure 3 for fCOP: Focal Length Estimation from Category-level Object Priors
Figure 4 for fCOP: Focal Length Estimation from Category-level Object Priors
Viaarxiv icon

A Large-Scale Evaluation of Speech Foundation Models

Add code
Apr 15, 2024
Viaarxiv icon

VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild

Add code
Mar 25, 2024
Viaarxiv icon

Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks

Add code
Mar 01, 2024
Viaarxiv icon

SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering

Add code
Jan 24, 2024
Viaarxiv icon

Violet: A Vision-Language Model for Arabic Image Captioning with Gemini Decoder

Add code
Nov 15, 2023
Viaarxiv icon

SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT

Add code
Oct 16, 2023
Viaarxiv icon

Self-Supervised Models of Speech Infer Universal Articulatory Kinematics

Add code
Oct 16, 2023
Viaarxiv icon

Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond

Add code
Oct 09, 2023
Figure 1 for Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond
Figure 2 for Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond
Figure 3 for Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond
Figure 4 for Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond
Viaarxiv icon