Zixuan Zhang

LLMs Can Generate a Better Answer by Aggregating Their Own Responses

Mar 06, 2025

A Minimalist Example of Edge-of-Stability and Progressive Sharpening

Mar 04, 2025

COSMOS: A Hybrid Adaptive Optimizer for Memory-Efficient Training of LLMs

Feb 26, 2025

IHEval: Evaluating Language Models on Following the Instruction Hierarchy

Feb 12, 2025

Optimistic ε-Greedy Exploration for Cooperative Multi-Agent Reinforcement Learning

Feb 05, 2025

Double Distillation Network for Multi-Agent Reinforcement Learning

Feb 05, 2025

Robust Reinforcement Learning from Corrupted Human Feedback

Jun 21, 2024

Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization

Apr 02, 2024

EVEDIT: Event-based Knowledge Editing with Deductive Editing Boundaries

Feb 17, 2024

Chem-FINESE: Validating Fine-Grained Few-shot Entity Extraction through Text Reconstruction

Jan 25, 2024