Picture for Lucas C. Cordeiro

Lucas C. Cordeiro

Dynamic Intelligence Assessment: Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence

Add code
Oct 20, 2024
Figure 1 for Dynamic Intelligence Assessment: Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence
Figure 2 for Dynamic Intelligence Assessment: Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence
Figure 3 for Dynamic Intelligence Assessment: Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence
Figure 4 for Dynamic Intelligence Assessment: Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence
Viaarxiv icon

Synthetic Data Aided Federated Learning Using Foundation Models

Add code
Jul 06, 2024
Figure 1 for Synthetic Data Aided Federated Learning Using Foundation Models
Figure 2 for Synthetic Data Aided Federated Learning Using Foundation Models
Figure 3 for Synthetic Data Aided Federated Learning Using Foundation Models
Figure 4 for Synthetic Data Aided Federated Learning Using Foundation Models
Viaarxiv icon

Automated Repair of AI Code with Large Language Models and Formal Verification

Add code
May 14, 2024
Figure 1 for Automated Repair of AI Code with Large Language Models and Formal Verification
Figure 2 for Automated Repair of AI Code with Large Language Models and Formal Verification
Figure 3 for Automated Repair of AI Code with Large Language Models and Formal Verification
Figure 4 for Automated Repair of AI Code with Large Language Models and Formal Verification
Viaarxiv icon

Do Neutral Prompts Produce Insecure Code? FormAI-v2 Dataset: Labelling Vulnerabilities in Code Generated by Large Language Models

Add code
Apr 29, 2024
Viaarxiv icon

Tasks People Prompt: A Taxonomy of LLM Downstream Tasks in Software Verification and Falsification Approaches

Add code
Apr 14, 2024
Viaarxiv icon

NeuroCodeBench: a plain C neural network benchmark for software verification

Add code
Sep 07, 2023
Figure 1 for NeuroCodeBench: a plain C neural network benchmark for software verification
Figure 2 for NeuroCodeBench: a plain C neural network benchmark for software verification
Figure 3 for NeuroCodeBench: a plain C neural network benchmark for software verification
Viaarxiv icon

SecureFalcon: The Next Cyber Reasoning System for Cyber Security

Add code
Jul 13, 2023
Viaarxiv icon

The FormAI Dataset: Generative AI in Software Security Through the Lens of Formal Verification

Add code
Jul 05, 2023
Viaarxiv icon

QNNRepair: Quantized Neural Network Repair

Add code
Jun 27, 2023
Viaarxiv icon

Revolutionizing Cyber Threat Detection with Large Language Models

Add code
Jun 25, 2023
Viaarxiv icon