Min Lin

DEPTHOR++: Robust Depth Enhancement from a Real-World Lightweight dToF and RGB Guidance

Sep 30, 2025
Via arXiv

Language Models Can Learn from Verbal Feedback Without Scalar Rewards

Sep 26, 2025
Via arXiv

Variational Reasoning for Language Models

Sep 26, 2025
Via arXiv

PhyBlock: A Progressive Benchmark for Physical Understanding and Planning via 3D Block Assembly

Jun 10, 2025
Via arXiv

Reinforcing General Reasoning without Verifiers

May 27, 2025
Via arXiv

Lifelong Safety Alignment for Language Models

May 26, 2025
Via arXiv

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

May 19, 2025
Via arXiv

A0: An Affordance-Aware Hierarchical Model for General Robotic Manipulation

Apr 21, 2025
Via arXiv

FlowReasoner: Reinforcing Query-Level Meta-Agents

Apr 21, 2025
Via arXiv

Understanding R1-Zero-Like Training: A Critical Perspective

Mar 26, 2025
Via arXiv