Picture for Juanzi Li

Juanzi Li

MRCEval: A Comprehensive, Challenging and Accessible Machine Reading Comprehension Benchmark

Add code
Mar 10, 2025
Viaarxiv icon

Shifting Long-Context LLMs Research from Input to Output

Add code
Mar 07, 2025
Viaarxiv icon

Precise Localization of Memories: A Fine-grained Neuron-level Knowledge Editing Technique for LLMs

Add code
Mar 03, 2025
Viaarxiv icon

Sparse Auto-Encoder Interprets Linguistic Features in Large Language Models

Add code
Feb 27, 2025
Viaarxiv icon

Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

Add code
Feb 26, 2025
Viaarxiv icon

LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models

Add code
Feb 20, 2025
Viaarxiv icon

Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling

Add code
Jan 20, 2025
Figure 1 for Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling
Figure 2 for Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling
Figure 3 for Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling
Figure 4 for Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling
Viaarxiv icon

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

Add code
Dec 19, 2024
Figure 1 for LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks
Figure 2 for LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks
Figure 3 for LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks
Figure 4 for LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks
Viaarxiv icon

EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents

Add code
Dec 16, 2024
Figure 1 for EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents
Figure 2 for EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents
Figure 3 for EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents
Figure 4 for EventSum: A Large-Scale Event-Centric Summarization Dataset for Chinese Multi-News Documents
Viaarxiv icon

AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning

Add code
Nov 25, 2024
Figure 1 for AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning
Figure 2 for AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning
Figure 3 for AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning
Figure 4 for AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning
Viaarxiv icon