Picture for Fangzhen Lin

Fangzhen Lin

Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning

Add code
Sep 03, 2025
Viaarxiv icon

Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL

Add code
May 23, 2025
Viaarxiv icon

Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning

Add code
May 21, 2025
Viaarxiv icon

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

Add code
Apr 10, 2025
Viaarxiv icon

SplitQuantV2: Enhancing Low-Bit Quantization of LLMs Without GPUs

Add code
Mar 07, 2025
Figure 1 for SplitQuantV2: Enhancing Low-Bit Quantization of LLMs Without GPUs
Figure 2 for SplitQuantV2: Enhancing Low-Bit Quantization of LLMs Without GPUs
Viaarxiv icon

Hierarchical Learning-based Graph Partition for Large-scale Vehicle Routing Problems

Add code
Feb 12, 2025
Figure 1 for Hierarchical Learning-based Graph Partition for Large-scale Vehicle Routing Problems
Figure 2 for Hierarchical Learning-based Graph Partition for Large-scale Vehicle Routing Problems
Figure 3 for Hierarchical Learning-based Graph Partition for Large-scale Vehicle Routing Problems
Figure 4 for Hierarchical Learning-based Graph Partition for Large-scale Vehicle Routing Problems
Viaarxiv icon

The Combined Problem of Online Task Assignment and Lifelong Path Finding in Logistics Warehouses: A Case Study

Add code
Feb 11, 2025
Figure 1 for The Combined Problem of Online Task Assignment and Lifelong Path Finding in Logistics Warehouses: A Case Study
Figure 2 for The Combined Problem of Online Task Assignment and Lifelong Path Finding in Logistics Warehouses: A Case Study
Figure 3 for The Combined Problem of Online Task Assignment and Lifelong Path Finding in Logistics Warehouses: A Case Study
Figure 4 for The Combined Problem of Online Task Assignment and Lifelong Path Finding in Logistics Warehouses: A Case Study
Viaarxiv icon

Adjustable Robust Reinforcement Learning for Online 3D Bin Packing

Add code
Oct 06, 2023
Viaarxiv icon

On Computing Universal Plans for Partially Observable Multi-Agent Path Finding

Add code
May 25, 2023
Viaarxiv icon

Using Language Models For Knowledge Acquisition in Natural Language Reasoning Problems

Add code
Apr 04, 2023
Viaarxiv icon