Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Baizhi Chen

ChinaTravel: A Real-World Benchmark for Language Agents in Chinese Travel Planning

Dec 18, 2024

Jie-Jing Shao, Xiao-Wen Yang, Bo-Wen Zhang, Baizhi Chen, Wen-Da Wei, Lan-Zhe Guo, Yu-feng Li

Figure 1 for ChinaTravel: A Real-World Benchmark for Language Agents in Chinese Travel Planning

Figure 2 for ChinaTravel: A Real-World Benchmark for Language Agents in Chinese Travel Planning

Figure 3 for ChinaTravel: A Real-World Benchmark for Language Agents in Chinese Travel Planning

Figure 4 for ChinaTravel: A Real-World Benchmark for Language Agents in Chinese Travel Planning

Abstract:Recent advances in LLMs, particularly in language reasoning and tool integration, have rapidly sparked the real-world development of Language Agents. Among these, travel planning represents a prominent domain, combining academic challenges with practical value due to its complexity and market demand. However, existing benchmarks fail to reflect the diverse, real-world requirements crucial for deployment. To address this gap, we introduce ChinaTravel, a benchmark specifically designed for authentic Chinese travel planning scenarios. We collect the travel requirements from questionnaires and propose a compositionally generalizable domain-specific language that enables a scalable evaluation process, covering feasibility, constraint satisfaction, and preference comparison. Empirical studies reveal the potential of neuro-symbolic agents in travel planning, achieving a constraint satisfaction rate of 27.9%, significantly surpassing purely neural models at 2.6%. Moreover, we identify key challenges in real-world travel planning deployments, including open language reasoning and unseen concept composition. These findings highlight the significance of ChinaTravel as a pivotal milestone for advancing language agents in complex, real-world planning scenarios.

* Webpage: https://www.lamda.nju.edu.cn/shaojj/chinatravel

Via

Access Paper or Ask Questions