Picture for Wenqi Shi

Wenqi Shi

Scaling Medical Reasoning Verification via Tool-Integrated Reinforcement Learning

Add code
Jan 28, 2026
Viaarxiv icon

LLM-as-RNN: A Recurrent Language Model for Memory Updates and Sequence Prediction

Add code
Jan 19, 2026
Viaarxiv icon

MEDVISTAGYM: A Scalable Training Environment for Thinking with Medical Images via Tool-Integrated Reinforcement Learning

Add code
Jan 12, 2026
Viaarxiv icon

MENDR: Manifold Explainable Neural Data Representations

Add code
Aug 07, 2025
Viaarxiv icon

RAG in the Wild: On the (In)effectiveness of LLMs with Mixture-of-Knowledge Retrieval Augmentation

Add code
Jul 26, 2025
Viaarxiv icon

AsyncFlow: An Asynchronous Streaming RL Framework for Efficient LLM Post-Training

Add code
Jul 02, 2025
Viaarxiv icon

MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale

Add code
Jun 04, 2025
Figure 1 for MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale
Figure 2 for MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale
Figure 3 for MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale
Figure 4 for MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale
Viaarxiv icon

WorkForceAgent-R1: Incentivizing Reasoning Capability in LLM-based Web Agents via Reinforcement Learning

Add code
May 28, 2025
Viaarxiv icon

Novel Extraction of Discriminative Fine-Grained Feature to Improve Retinal Vessel Segmentation

Add code
May 06, 2025
Viaarxiv icon

Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration

Add code
Apr 07, 2025
Viaarxiv icon