Picture for Qi He

Qi He

May

Parameter-Free Adaptive Multi-Scale Channel-Spatial Attention Aggregation framework for 3D Indoor Semantic Scene Completion Toward Assisting Visually Impaired

Add code
Feb 19, 2026
Viaarxiv icon

How Far Are LLMs from Professional Poker Players? Revisiting Game-Theoretic Reasoning with Agentic Tool Use

Add code
Jan 31, 2026
Viaarxiv icon

STReasoner: Empowering LLMs for Spatio-Temporal Reasoning in Time Series via Spatial-Aware Reinforcement Learning

Add code
Jan 06, 2026
Viaarxiv icon

EpiQAL: Benchmarking Large Language Models in Epidemiological Question Answering for Enhanced Alignment and Reasoning

Add code
Jan 06, 2026
Viaarxiv icon

Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning

Add code
Oct 02, 2025
Figure 1 for Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning
Figure 2 for Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning
Figure 3 for Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning
Figure 4 for Veri-R1: Toward Precise and Faithful Claim Verification via Online Reinforcement Learning
Viaarxiv icon

Bradley-Terry and Multi-Objective Reward Modeling Are Complementary

Add code
Jul 10, 2025
Viaarxiv icon

EcomScriptBench: A Multi-task Benchmark for E-commerce Script Planning via Step-wise Intention-Driven Product Association

Add code
May 21, 2025
Viaarxiv icon

Application and Optimization of Large Models Based on Prompt Tuning for Fact-Check-Worthiness Estimation

Add code
Apr 25, 2025
Figure 1 for Application and Optimization of Large Models Based on Prompt Tuning for Fact-Check-Worthiness Estimation
Figure 2 for Application and Optimization of Large Models Based on Prompt Tuning for Fact-Check-Worthiness Estimation
Figure 3 for Application and Optimization of Large Models Based on Prompt Tuning for Fact-Check-Worthiness Estimation
Figure 4 for Application and Optimization of Large Models Based on Prompt Tuning for Fact-Check-Worthiness Estimation
Viaarxiv icon

ToolRL: Reward is All Tool Learning Needs

Add code
Apr 16, 2025
Figure 1 for ToolRL: Reward is All Tool Learning Needs
Figure 2 for ToolRL: Reward is All Tool Learning Needs
Figure 3 for ToolRL: Reward is All Tool Learning Needs
Figure 4 for ToolRL: Reward is All Tool Learning Needs
Viaarxiv icon

UXAgent: A System for Simulating Usability Testing of Web Design with LLM Agents

Add code
Apr 13, 2025
Figure 1 for UXAgent: A System for Simulating Usability Testing of Web Design with LLM Agents
Figure 2 for UXAgent: A System for Simulating Usability Testing of Web Design with LLM Agents
Figure 3 for UXAgent: A System for Simulating Usability Testing of Web Design with LLM Agents
Figure 4 for UXAgent: A System for Simulating Usability Testing of Web Design with LLM Agents
Viaarxiv icon