Picture for Lajos Muzsai

Lajos Muzsai

HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing

Add code
Dec 02, 2024
Viaarxiv icon

Dynamic Intelligence Assessment: Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence

Add code
Oct 20, 2024
Figure 1 for Dynamic Intelligence Assessment: Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence
Figure 2 for Dynamic Intelligence Assessment: Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence
Figure 3 for Dynamic Intelligence Assessment: Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence
Figure 4 for Dynamic Intelligence Assessment: Benchmarking LLMs on the Road to AGI with a Focus on Model Confidence
Viaarxiv icon

LlamBERT: Large-scale low-cost data annotation in NLP

Add code
Mar 23, 2024
Viaarxiv icon