Picture for Junda He

Junda He

LessLeak-Bench: A First Investigation of Data Leakage in LLMs Across 83 Software Engineering Benchmarks

Add code
Feb 10, 2025
Viaarxiv icon

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Add code
Jun 26, 2024
Figure 1 for BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
Figure 2 for BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
Figure 3 for BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
Figure 4 for BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
Viaarxiv icon

Mind Your Data! Hiding Backdoors in Offline Reinforcement Learning Datasets

Add code
Oct 07, 2022
Figure 1 for Mind Your Data! Hiding Backdoors in Offline Reinforcement Learning Datasets
Figure 2 for Mind Your Data! Hiding Backdoors in Offline Reinforcement Learning Datasets
Figure 3 for Mind Your Data! Hiding Backdoors in Offline Reinforcement Learning Datasets
Figure 4 for Mind Your Data! Hiding Backdoors in Offline Reinforcement Learning Datasets
Viaarxiv icon