
ATLAS: Constraints-Aware Multi-Agent Collaboration for Real-World Travel Planning

About

While Large Language Models (LLMs) have shown remarkable advancements in reasoning and tool use, they often fail to generate optimal, grounded solutions under complex constraints. Real-world travel planning exemplifies these challenges: it tests an agent's ability to handle constraints that are explicit, implicit, and even evolving as the agent interacts with dynamic environments and changing user needs. In this paper, we present ATLAS, a general multi-agent framework designed to handle the complex nature of constraint awareness in real-world travel planning tasks. ATLAS introduces a principled approach to the fundamental challenges of constraint-aware planning through dedicated mechanisms for dynamic constraint management, iterative plan critique, and adaptive interleaved search. ATLAS demonstrates state-of-the-art performance on the TravelPlanner benchmark, improving the final pass rate from 23.3% to 44.4% over its best alternative. More importantly, our work is the first to demonstrate quantitative effectiveness on real-world travel planning tasks with live information search and multi-turn feedback. In this realistic setting, ATLAS achieves an 84% final pass rate, significantly outperforming baselines including ReAct (59%) and a monolithic agent (27%).
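The pass rates quoted above can be turned into absolute and relative gains with a few lines of arithmetic. The sketch below is only an illustration: the percentages are taken from the abstract, while the names `REPORTED`, `gains`, and `summarize` are hypothetical helpers and are not part of ATLAS or of any benchmark toolkit.

```python
# Final pass rates (percent) as quoted in the abstract.
# The dictionary layout and all names here are illustrative assumptions.
REPORTED = {
    "TravelPlanner": {"ATLAS": 44.4, "best alternative": 23.3},
    "live search, multi-turn": {
        "ATLAS": 84.0,
        "ReAct": 59.0,
        "monolithic agent": 27.0,
    },
}


def gains(atlas: float, baseline: float) -> tuple[float, float]:
    """Return (gain in percentage points, ATLAS-to-baseline ratio)."""
    return round(atlas - baseline, 1), round(atlas / baseline, 2)


def summarize(reported: dict) -> list[tuple[str, str, float, float]]:
    """Compare ATLAS against every other system within each setting."""
    rows = []
    for setting, rates in reported.items():
        atlas = rates["ATLAS"]
        for name, rate in rates.items():
            if name == "ATLAS":
                continue
            points, ratio = gains(atlas, rate)
            rows.append((setting, name, points, ratio))
    return rows


if __name__ == "__main__":
    for setting, name, points, ratio in summarize(REPORTED):
        print(f"{setting}: +{points} points vs {name} ({ratio}x)")
```

Read this way, the TravelPlanner improvement is 21.1 percentage points (about 1.91x the best alternative), and the gains in the live-search setting are 25 points over ReAct and 57 points over the monolithic agent.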

Jihye Choi, Jinsung Yoon, Jiefeng Chen, Somesh Jha, Tomas Pfister• 2025

Related benchmarks

Task                              Dataset                                  Metric                     Result  Rank
Planning                          TravelPlanner #180 (val)                 CS-Micro                   85.42   22
Travel Planning                   TravelPlanner 1000 tasks (test)          Commonsense Score (Micro)  85.81   13
Multi-Turn Constraint Adaptation  FlexTravelBench 2-Turn Local Scenario    Delivery Success Rate      100     4
Multi-Turn Constraint Adaptation  FlexTravelBench 2-Turn Global Scenario   Delivery Rate              100     4
Multi-Turn Constraint Adaptation  FlexTravelBench 3-Turn Local-to-Global   Delivery Success Rate      100     4
Multi-Turn Constraint Adaptation  FlexTravelBench 3-Turn Global-to-Local   Delivery Rate              100     4