Picture for Ben He

Ben He

PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides

Add code
Jan 07, 2025
Viaarxiv icon

Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models

Add code
Jan 03, 2025
Figure 1 for Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models
Figure 2 for Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models
Figure 3 for Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models
Figure 4 for Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models
Viaarxiv icon

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Add code
Nov 18, 2024
Viaarxiv icon

DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models

Add code
Nov 05, 2024
Viaarxiv icon

Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree?

Add code
Oct 08, 2024
Viaarxiv icon

CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution

Add code
Aug 23, 2024
Figure 1 for CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution
Figure 2 for CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution
Figure 3 for CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution
Figure 4 for CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution
Viaarxiv icon

On-Policy Fine-grained Knowledge Feedback for Hallucination Mitigation

Add code
Jun 18, 2024
Viaarxiv icon

Navigating the Shadows: Unveiling Effective Disturbances for Modern AI Content Detectors

Add code
Jun 13, 2024
Viaarxiv icon

Towards Scalable Automated Alignment of LLMs: A Survey

Add code
Jun 03, 2024
Viaarxiv icon

Spiral of Silences: How is Large Language Model Killing Information Retrieval? -- A Case Study on Open Domain Question Answering

Add code
Apr 18, 2024
Viaarxiv icon