Zilong Zheng

Understanding and Leveraging the Expert Specialization of Context Faithfulness in Mixture-of-Experts LLMs

Aug 27, 2025

In-situ Value-aligned Human-Robot Interactions with Physical Constraints

Aug 11, 2025

TongSearch-QR: Reinforced Query Reasoning for Retrieval

Jun 16, 2025

RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling

Jun 10, 2025

When Large Multimodal Models Confront Evolving Knowledge: Challenges and Pathways

May 30, 2025

Discrete Markov Bridge

May 26, 2025

EuroCon: Benchmarking Parliament Deliberation for Political Consensus Finding

May 26, 2025

ReflectEvo: Improving Meta Introspection of Small LLMs by Learning Self-Reflection

May 22, 2025

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

May 19, 2025

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

May 07, 2025