Picture for Yasheng Wang

Yasheng Wang

DAST: Difficulty-Aware Self-Training on Large Language Models

Add code
Mar 12, 2025
Viaarxiv icon

DocPuzzle: A Process-Aware Benchmark for Evaluating Realistic Long-Context Reasoning Capabilities

Add code
Feb 25, 2025
Viaarxiv icon

Adaptive Tool Use in Large Language Models with Meta-Cognition Trigger

Add code
Feb 18, 2025
Viaarxiv icon

Boost, Disentangle, and Customize: A Robust System2-to-System1 Pipeline for Code Generation

Add code
Feb 18, 2025
Viaarxiv icon

Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge

Add code
Feb 18, 2025
Figure 1 for Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge
Figure 2 for Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge
Figure 3 for Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge
Figure 4 for Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge
Viaarxiv icon

Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving

Add code
Feb 17, 2025
Figure 1 for Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving
Figure 2 for Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving
Figure 3 for Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving
Figure 4 for Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving
Viaarxiv icon

A Survey on Multi-Turn Interaction Capabilities of Large Language Models

Add code
Jan 17, 2025
Viaarxiv icon

NILE: Internal Consistency Alignment in Large Language Models

Add code
Dec 21, 2024
Viaarxiv icon

GUI Agents with Foundation Models: A Comprehensive Survey

Add code
Nov 07, 2024
Figure 1 for GUI Agents with Foundation Models: A Comprehensive Survey
Figure 2 for GUI Agents with Foundation Models: A Comprehensive Survey
Viaarxiv icon

ToolFlow: Boosting LLM Tool-Calling Through Natural and Coherent Dialogue Synthesis

Add code
Oct 24, 2024
Viaarxiv icon