Chao Qu

INF Technology

DyJR: Preserving Diversity in Reinforcement Learning with Verifiable Rewards via Dynamic Jensen-Shannon Replay

Mar 17, 2026

SQL-ASTRA: Alleviating Sparse Feedback in Agentic SQL via Column-Set Matching and Trajectory Aggregation

Mar 17, 2026

PET-F2I: A Comprehensive Benchmark and Parameter-Efficient Fine-Tuning of LLMs for PET/CT Report Impression Generation

Mar 11, 2026

Equivariant Asynchronous Diffusion: An Adaptive Denoising Schedule for Accelerated Molecular Conformation Generation

Mar 10, 2026

The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward

Sep 09, 2025

Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision

Aug 07, 2025

Equivariant Spherical Transformer for Efficient Molecular Modeling

May 29, 2025

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

Apr 10, 2025

Near-infrared Image Deblurring and Event Denoising with Synergistic Neuromorphic Imaging

Mar 05, 2025

AURORA: Automated Training Framework of Universal Process Reward Models via Ensemble Prompting and Reverse Verification

Feb 17, 2025