Picture for Gokhan Tur

Gokhan Tur

Bilkent University, Ankara, Turkey

ATOD: An Evaluation Framework and Benchmark for Agentic Task-Oriented Dialogue System

Add code
Jan 17, 2026
Viaarxiv icon

Current Agents Fail to Leverage World Model as Tool for Foresight

Add code
Jan 08, 2026
Viaarxiv icon

MAC: A Multi-Agent Framework for Interactive User Clarification in Multi-turn Conversations

Add code
Dec 15, 2025
Viaarxiv icon

SpeakRL: Synergizing Reasoning, Speaking, and Acting in Language Models with Reinforcement Learning

Add code
Dec 15, 2025
Figure 1 for SpeakRL: Synergizing Reasoning, Speaking, and Acting in Language Models with Reinforcement Learning
Figure 2 for SpeakRL: Synergizing Reasoning, Speaking, and Acting in Language Models with Reinforcement Learning
Figure 3 for SpeakRL: Synergizing Reasoning, Speaking, and Acting in Language Models with Reinforcement Learning
Figure 4 for SpeakRL: Synergizing Reasoning, Speaking, and Acting in Language Models with Reinforcement Learning
Viaarxiv icon

Self-Improving LLM Agents at Test-Time

Add code
Oct 09, 2025
Figure 1 for Self-Improving LLM Agents at Test-Time
Figure 2 for Self-Improving LLM Agents at Test-Time
Figure 3 for Self-Improving LLM Agents at Test-Time
Figure 4 for Self-Improving LLM Agents at Test-Time
Viaarxiv icon

Goal Alignment in LLM-Based User Simulators for Conversational AI

Add code
Jul 27, 2025
Figure 1 for Goal Alignment in LLM-Based User Simulators for Conversational AI
Figure 2 for Goal Alignment in LLM-Based User Simulators for Conversational AI
Figure 3 for Goal Alignment in LLM-Based User Simulators for Conversational AI
Figure 4 for Goal Alignment in LLM-Based User Simulators for Conversational AI
Viaarxiv icon

Must Read: A Systematic Survey of Computational Persuasion

Add code
May 12, 2025
Viaarxiv icon

PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents

Add code
May 02, 2025
Figure 1 for PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents
Figure 2 for PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents
Figure 3 for PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents
Figure 4 for PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents
Viaarxiv icon

TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons

Add code
Apr 28, 2025
Figure 1 for TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons
Figure 2 for TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons
Figure 3 for TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons
Figure 4 for TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons
Viaarxiv icon

ToolRL: Reward is All Tool Learning Needs

Add code
Apr 16, 2025
Figure 1 for ToolRL: Reward is All Tool Learning Needs
Figure 2 for ToolRL: Reward is All Tool Learning Needs
Figure 3 for ToolRL: Reward is All Tool Learning Needs
Figure 4 for ToolRL: Reward is All Tool Learning Needs
Viaarxiv icon