Picture for Juanzi Li

Juanzi Li

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Add code
Jul 02, 2025
Viaarxiv icon

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Add code
Jun 23, 2025
Viaarxiv icon

VerIF: Verification Engineering for Reinforcement Learning in Instruction Following

Add code
Jun 11, 2025
Viaarxiv icon

SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models

Add code
Jun 04, 2025
Viaarxiv icon

Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis

Add code
Jun 04, 2025
Viaarxiv icon

How does Transformer Learn Implicit Reasoning?

Add code
May 29, 2025
Viaarxiv icon

Are Reasoning Models More Prone to Hallucination?

Add code
May 29, 2025
Viaarxiv icon

Hard Negative Contrastive Learning for Fine-Grained Geometric Understanding in Large Multimodal Models

Add code
May 26, 2025
Viaarxiv icon

AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios

Add code
May 22, 2025
Viaarxiv icon

AdaptThink: Reasoning Models Can Learn When to Think

Add code
May 19, 2025
Viaarxiv icon