Picture for Qiguang Chen

Qiguang Chen

Beyond Surface Reasoning: Unveiling the True Long Chain-of-Thought Capacity of Diffusion Large Language Models

Add code
Oct 10, 2025
Viaarxiv icon

AutoPR: Let's Automate Your Academic Promotion!

Add code
Oct 10, 2025
Viaarxiv icon

Can LLMs Refuse Questions They Do Not Know? Measuring Knowledge-Aware Refusal in Factual Tasks

Add code
Oct 02, 2025
Viaarxiv icon

AI4Research: A Survey of Artificial Intelligence for Scientific Research

Add code
Jul 02, 2025
Viaarxiv icon

OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Add code
May 29, 2025
Viaarxiv icon

CCHall: A Novel Benchmark for Joint Cross-Lingual and Cross-Modal Hallucinations Detection in Large Language Models

Add code
May 25, 2025
Figure 1 for CCHall: A Novel Benchmark for Joint Cross-Lingual and Cross-Modal Hallucinations Detection in Large Language Models
Figure 2 for CCHall: A Novel Benchmark for Joint Cross-Lingual and Cross-Modal Hallucinations Detection in Large Language Models
Figure 3 for CCHall: A Novel Benchmark for Joint Cross-Lingual and Cross-Modal Hallucinations Detection in Large Language Models
Figure 4 for CCHall: A Novel Benchmark for Joint Cross-Lingual and Cross-Modal Hallucinations Detection in Large Language Models
Viaarxiv icon

Visual Thoughts: A Unified Perspective of Understanding Multimodal Chain-of-Thought

Add code
May 21, 2025
Viaarxiv icon

X-WebAgentBench: A Multilingual Interactive Web Benchmark for Evaluating Global Agentic System

Add code
May 21, 2025
Viaarxiv icon

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning

Add code
May 19, 2025
Viaarxiv icon

Efficient Process Reward Model Training via Active Learning

Add code
Apr 14, 2025
Viaarxiv icon