Qi Liu

Tony

ToolGym: an Open-world Tool-using Environment for Scalable Agent Testing and Data Curation

Jan 09, 2026
Via arXiv

Mind2Report: A Cognitive Deep Research Agent for Expert-Level Commercial Report Synthesis

Jan 08, 2026
Via arXiv

Layer-Order Inversion: Rethinking Latent Multi-Hop Reasoning in Large Language Models

Jan 07, 2026
Via arXiv

CREPES-X: Hierarchical Bearing-Distance-Inertial Direct Cooperative Relative Pose Estimation System

Dec 31, 2025
Via arXiv

GeoBench: Rethinking Multimodal Geometric Problem-Solving via Hierarchical Evaluation

Dec 30, 2025
Via arXiv

A Turn Toward Better Alignment: Few-Shot Generative Adaptation with Equivariant Feature Rotation

Dec 24, 2025
Via arXiv

ClarifyMT-Bench: Benchmarking and Improving Multi-Turn Clarification for Conversational Large Language Models

Dec 24, 2025
Via arXiv

Towards Arbitrary Motion Completing via Hierarchical Continuous Representation

Dec 24, 2025
Via arXiv

Sample-Efficient Policy Constraint Offline Deep Reinforcement Learning based on Sample Filtering

Dec 23, 2025
Via arXiv

Tempo as the Stable Cue: Hierarchical Mixture of Tempo and Beat Experts for Music to 3D Dance Generation

Dec 21, 2025
Via arXiv