Ben He

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Nov 18, 2024
Via arXiv

DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models

Nov 05, 2024
Via arXiv

Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree?

Oct 08, 2024
Via arXiv

CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution

Aug 23, 2024
Via arXiv

On-Policy Fine-grained Knowledge Feedback for Hallucination Mitigation

Jun 18, 2024
Via arXiv

Navigating the Shadows: Unveiling Effective Disturbances for Modern AI Content Detectors

Jun 13, 2024
Via arXiv

Towards Scalable Automated Alignment of LLMs: A Survey

Jun 03, 2024
Via arXiv

Spiral of Silences: How is Large Language Model Killing Information Retrieval? A Case Study on Open Domain Question Answering

Apr 18, 2024
Via arXiv

Humanizing Machine-Generated Content: Evading AI-Text Detection through Adversarial Attack

Apr 02, 2024
Via arXiv

Self-Retrieval: Building an Information Retrieval System with One Large Language Model

Feb 23, 2024
Via arXiv