Picture for Mohamed Amine Ferrag

Mohamed Amine Ferrag

6G-Bench: An Open Benchmark for Semantic Communication and Network-Level Reasoning with Foundation Models in AI-Native 6G Networks

Add code
Feb 09, 2026
Viaarxiv icon

$α^3$-SecBench: A Large-Scale Evaluation Suite of Security, Resilience, and Trust for LLM-based UAV Agents over 6G Networks

Add code
Jan 26, 2026
Viaarxiv icon

AgentDrive: An Open Benchmark Dataset for Agentic AI Reasoning with LLM-Generated Scenarios in Autonomous Systems

Add code
Jan 23, 2026
Viaarxiv icon

$α^3$-Bench: A Unified Benchmark of Safety, Robustness, and Efficiency for LLM-Based UAV Agents over 6G Networks

Add code
Jan 01, 2026
Viaarxiv icon

UAVBench: An Open Benchmark Dataset for Autonomous and Agentic AI UAV Systems via LLM-Generated Flight Scenarios

Add code
Nov 14, 2025
Figure 1 for UAVBench: An Open Benchmark Dataset for Autonomous and Agentic AI UAV Systems via LLM-Generated Flight Scenarios
Figure 2 for UAVBench: An Open Benchmark Dataset for Autonomous and Agentic AI UAV Systems via LLM-Generated Flight Scenarios
Figure 3 for UAVBench: An Open Benchmark Dataset for Autonomous and Agentic AI UAV Systems via LLM-Generated Flight Scenarios
Figure 4 for UAVBench: An Open Benchmark Dataset for Autonomous and Agentic AI UAV Systems via LLM-Generated Flight Scenarios
Viaarxiv icon

From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review

Add code
Apr 28, 2025
Figure 1 for From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review
Figure 2 for From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review
Figure 3 for From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review
Figure 4 for From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review
Viaarxiv icon

Vulnerability Detection: From Formal Verification to Large Language Models and Hybrid Approaches: A Comprehensive Overview

Add code
Mar 13, 2025
Figure 1 for Vulnerability Detection: From Formal Verification to Large Language Models and Hybrid Approaches: A Comprehensive Overview
Figure 2 for Vulnerability Detection: From Formal Verification to Large Language Models and Hybrid Approaches: A Comprehensive Overview
Figure 3 for Vulnerability Detection: From Formal Verification to Large Language Models and Hybrid Approaches: A Comprehensive Overview
Viaarxiv icon

CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE Detection

Add code
Mar 12, 2025
Figure 1 for CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE Detection
Figure 2 for CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE Detection
Figure 3 for CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE Detection
Figure 4 for CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE Detection
Viaarxiv icon

Dynamic Intelligence Assessment: Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence

Add code
Oct 20, 2024
Figure 1 for Dynamic Intelligence Assessment: Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence
Figure 2 for Dynamic Intelligence Assessment: Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence
Figure 3 for Dynamic Intelligence Assessment: Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence
Figure 4 for Dynamic Intelligence Assessment: Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence
Viaarxiv icon

Generative AI and Large Language Models for Cyber Security: All Insights You Need

Add code
May 21, 2024
Figure 1 for Generative AI and Large Language Models for Cyber Security: All Insights You Need
Figure 2 for Generative AI and Large Language Models for Cyber Security: All Insights You Need
Figure 3 for Generative AI and Large Language Models for Cyber Security: All Insights You Need
Figure 4 for Generative AI and Large Language Models for Cyber Security: All Insights You Need
Viaarxiv icon