Picture for Yubo Chen

Yubo Chen

RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment

Add code
Dec 18, 2024
Figure 1 for RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
Figure 2 for RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
Figure 3 for RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
Figure 4 for RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
Viaarxiv icon

OODFace: Benchmarking Robustness of Face Recognition under Common Corruptions and Appearance Variations

Add code
Dec 03, 2024
Figure 1 for OODFace: Benchmarking Robustness of Face Recognition under Common Corruptions and Appearance Variations
Figure 2 for OODFace: Benchmarking Robustness of Face Recognition under Common Corruptions and Appearance Variations
Figure 3 for OODFace: Benchmarking Robustness of Face Recognition under Common Corruptions and Appearance Variations
Figure 4 for OODFace: Benchmarking Robustness of Face Recognition under Common Corruptions and Appearance Variations
Viaarxiv icon

One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models

Add code
Nov 26, 2024
Figure 1 for One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models
Figure 2 for One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models
Figure 3 for One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models
Figure 4 for One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models
Viaarxiv icon

DTELS: Towards Dynamic Granularity of Timeline Summarization

Add code
Nov 14, 2024
Figure 1 for DTELS: Towards Dynamic Granularity of Timeline Summarization
Figure 2 for DTELS: Towards Dynamic Granularity of Timeline Summarization
Figure 3 for DTELS: Towards Dynamic Granularity of Timeline Summarization
Figure 4 for DTELS: Towards Dynamic Granularity of Timeline Summarization
Viaarxiv icon

A Troublemaker with Contagious Jailbreak Makes Chaos in Honest Towns

Add code
Oct 21, 2024
Viaarxiv icon

LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning

Add code
Oct 12, 2024
Viaarxiv icon

MIRAGE: Evaluating and Explaining Inductive Reasoning Process in Language Models

Add code
Oct 12, 2024
Figure 1 for MIRAGE: Evaluating and Explaining Inductive Reasoning Process in Language Models
Figure 2 for MIRAGE: Evaluating and Explaining Inductive Reasoning Process in Language Models
Figure 3 for MIRAGE: Evaluating and Explaining Inductive Reasoning Process in Language Models
Figure 4 for MIRAGE: Evaluating and Explaining Inductive Reasoning Process in Language Models
Viaarxiv icon

Real-world Adversarial Defense against Patch Attacks based on Diffusion Model

Add code
Sep 14, 2024
Viaarxiv icon

Towards Robust Knowledge Unlearning: An Adversarial Framework for Assessing and Improving Unlearning Robustness in Large Language Models

Add code
Aug 20, 2024
Viaarxiv icon

Knowledge in Superposition: Unveiling the Failures of Lifelong Knowledge Editing for Large Language Models

Add code
Aug 14, 2024
Viaarxiv icon