Tianyi Men

RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment

Dec 18, 2024
Via arXiv

A Troublemaker with Contagious Jailbreak Makes Chaos in Honest Towns

Oct 21, 2024
Via arXiv

Unlocking the Future: Exploring Look-Ahead Planning Mechanistic Interpretability in Large Language Models

Jun 23, 2024
Via arXiv