Picture for Zhuoran Jin

Zhuoran Jin

One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models

Add code
Nov 26, 2024
Viaarxiv icon

DTELS: Towards Dynamic Granularity of Timeline Summarization

Add code
Nov 14, 2024
Figure 1 for DTELS: Towards Dynamic Granularity of Timeline Summarization
Figure 2 for DTELS: Towards Dynamic Granularity of Timeline Summarization
Figure 3 for DTELS: Towards Dynamic Granularity of Timeline Summarization
Figure 4 for DTELS: Towards Dynamic Granularity of Timeline Summarization
Viaarxiv icon

A Troublemaker with Contagious Jailbreak Makes Chaos in Honest Towns

Add code
Oct 21, 2024
Viaarxiv icon

MIRAGE: Evaluating and Explaining Inductive Reasoning Process in Language Models

Add code
Oct 12, 2024
Figure 1 for MIRAGE: Evaluating and Explaining Inductive Reasoning Process in Language Models
Figure 2 for MIRAGE: Evaluating and Explaining Inductive Reasoning Process in Language Models
Figure 3 for MIRAGE: Evaluating and Explaining Inductive Reasoning Process in Language Models
Figure 4 for MIRAGE: Evaluating and Explaining Inductive Reasoning Process in Language Models
Viaarxiv icon

LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning

Add code
Oct 12, 2024
Viaarxiv icon

Towards Robust Knowledge Unlearning: An Adversarial Framework for Assessing and Improving Unlearning Robustness in Large Language Models

Add code
Aug 20, 2024
Viaarxiv icon

Unlocking the Future: Exploring Look-Ahead Planning Mechanistic Interpretability in Large Language Models

Add code
Jun 23, 2024
Viaarxiv icon

Beyond Under-Alignment: Atomic Preference Enhanced Factuality Tuning for Large Language Models

Add code
Jun 18, 2024
Viaarxiv icon

RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models

Add code
Jun 16, 2024
Viaarxiv icon

SimuCourt: Building Judicial Decision-Making Agents with Real-world Judgement Documents

Add code
Mar 05, 2024
Viaarxiv icon