Picture for Zhen Tan

Zhen Tan

OR-R1: Automating Modeling and Solving of Operations Research Optimization Problem via Test-Time Reinforcement Learning

Add code
Nov 12, 2025
Viaarxiv icon

Beyond Redundancy: Diverse and Specialized Multi-Expert Sparse Autoencoder

Add code
Nov 07, 2025
Figure 1 for Beyond Redundancy: Diverse and Specialized Multi-Expert Sparse Autoencoder
Figure 2 for Beyond Redundancy: Diverse and Specialized Multi-Expert Sparse Autoencoder
Figure 3 for Beyond Redundancy: Diverse and Specialized Multi-Expert Sparse Autoencoder
Figure 4 for Beyond Redundancy: Diverse and Specialized Multi-Expert Sparse Autoencoder
Viaarxiv icon

Metacognitive Self-Correction for Multi-Agent System via Prototype-Guided Next-Execution Reconstruction

Add code
Oct 16, 2025
Viaarxiv icon

Multi-Agent Debate for LLM Judges with Adaptive Stability Detection

Add code
Oct 14, 2025
Figure 1 for Multi-Agent Debate for LLM Judges with Adaptive Stability Detection
Figure 2 for Multi-Agent Debate for LLM Judges with Adaptive Stability Detection
Figure 3 for Multi-Agent Debate for LLM Judges with Adaptive Stability Detection
Figure 4 for Multi-Agent Debate for LLM Judges with Adaptive Stability Detection
Viaarxiv icon

Learning from Diverse Reasoning Paths with Routing and Collaboration

Add code
Aug 23, 2025
Figure 1 for Learning from Diverse Reasoning Paths with Routing and Collaboration
Figure 2 for Learning from Diverse Reasoning Paths with Routing and Collaboration
Figure 3 for Learning from Diverse Reasoning Paths with Routing and Collaboration
Figure 4 for Learning from Diverse Reasoning Paths with Routing and Collaboration
Viaarxiv icon

Transferring Expert Cognitive Models to Social Robots via Agentic Concept Bottleneck Models

Add code
Aug 06, 2025
Viaarxiv icon

Are Today's LLMs Ready to Explain Well-Being Concepts?

Add code
Aug 06, 2025
Viaarxiv icon

Model Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm

Add code
Jun 25, 2025
Figure 1 for Model Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm
Figure 2 for Model Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm
Figure 3 for Model Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm
Figure 4 for Model Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm
Viaarxiv icon

EQA-RM: A Generative Embodied Reward Model with Test-time Scaling

Add code
Jun 12, 2025
Viaarxiv icon

IndustryEQA: Pushing the Frontiers of Embodied Question Answering in Industrial Scenarios

Add code
May 27, 2025
Viaarxiv icon