Picture for Hongning Wang

Hongning Wang

GLM-5: from Vibe Coding to Agentic Engineering

Add code
Feb 17, 2026
Viaarxiv icon

Reasoning to Rank: An End-to-End Solution for Exploiting Large Language Models for Recommendation

Add code
Feb 13, 2026
Viaarxiv icon

The Missing Half: Unveiling Training-time Implicit Safety Risks Beyond Deployment

Add code
Feb 04, 2026
Viaarxiv icon

Trust-Region Adaptive Policy Optimization

Add code
Dec 19, 2025
Figure 1 for Trust-Region Adaptive Policy Optimization
Figure 2 for Trust-Region Adaptive Policy Optimization
Figure 3 for Trust-Region Adaptive Policy Optimization
Figure 4 for Trust-Region Adaptive Policy Optimization
Viaarxiv icon

Data-Efficient RLVR via Off-Policy Influence Guidance

Add code
Oct 30, 2025
Viaarxiv icon

Think Socially via Cognitive Reasoning

Add code
Sep 26, 2025
Viaarxiv icon

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Add code
Aug 08, 2025
Viaarxiv icon

JPS: Jailbreak Multimodal Large Language Models with Collaborative Visual Perturbation and Textual Steering

Add code
Aug 07, 2025
Viaarxiv icon

Beyond Nash Equilibrium: Bounded Rationality of LLMs and humans in Strategic Decision-making

Add code
Jun 11, 2025
Viaarxiv icon

How Should We Enhance the Safety of Large Reasoning Models: An Empirical Study

Add code
May 21, 2025
Viaarxiv icon