Picture for Pengfei Cao

Pengfei Cao

DTELS: Towards Dynamic Granularity of Timeline Summarization

Add code
Nov 14, 2024
Viaarxiv icon

A Troublemaker with Contagious Jailbreak Makes Chaos in Honest Towns

Add code
Oct 21, 2024
Viaarxiv icon

LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning

Add code
Oct 12, 2024
Viaarxiv icon

MIRAGE: Evaluating and Explaining Inductive Reasoning Process in Language Models

Add code
Oct 12, 2024
Viaarxiv icon

Towards Robust Knowledge Unlearning: An Adversarial Framework for Assessing and Improving Unlearning Robustness in Large Language Models

Add code
Aug 20, 2024
Viaarxiv icon

Knowledge in Superposition: Unveiling the Failures of Lifelong Knowledge Editing for Large Language Models

Add code
Aug 14, 2024
Viaarxiv icon

Unlocking the Future: Exploring Look-Ahead Planning Mechanistic Interpretability in Large Language Models

Add code
Jun 23, 2024
Viaarxiv icon

Beyond Under-Alignment: Atomic Preference Enhanced Factuality Tuning for Large Language Models

Add code
Jun 18, 2024
Viaarxiv icon

MEMLA: Enhancing Multilingual Knowledge Editing with Neuron-Masked Low-Rank Adaptation

Add code
Jun 17, 2024
Viaarxiv icon

RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models

Add code
Jun 16, 2024
Viaarxiv icon