
ProcTHOR

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Visually-Grounded Active View Selection | AVS-ProcTHOR (val) | Existence Score: 93.02 | 11 |
| Object Search | ProcTHOR 150 distinct maps (test) | Average Navigation Cost: 215.59 | 9 |
| Multi-Robot Planning | ProcTHOR Large | Average Cost: 429.83 | 9 |
| Multi-Robot Planning | ProcTHOR Medium | Average Cost: 269.45 | 9 |
| Multi-Robot Planning | ProcTHOR Small | Average Cost: 130.86 | 9 |
| Action Prediction | ProcTHOR single-object (OOD Systematic) | Accuracy: 75 | 9 |
| Action Prediction | ProcTHOR single-object (OOD Compositional) | Accuracy: 95 | 9 |
| Action Prediction | ProcTHOR single-object (IID) | Accuracy: 97 | 9 |
| Action Prediction | ProcTHOR multi-object | IID Accuracy: 94 | 8 |
| Pairwise Scene Generation Evaluation | Procthor-10K (Easy) | Score (SceneCritic) Method A: 75.6 | 6 |
| Pairwise evaluator agreement with human judgment | Procthor-10K Complex 1.0 (test) | Method A SceneCritic Score: 79.5 | 6 |
| Interactive Navigation | ProcTHOR-10k 7-10 rooms (test) | SR: 100 | 6 |
| Interactive Navigation | ProcTHOR-10k 4-6 rooms (test) | Success Rate: 100 | 6 |
| Interactive Navigation | ProcTHOR-10k 1-3 rooms (test) | Success Rate (SR): 100 | 6 |
| Task Planning | PROCTHOR Any-of-Three | Average Cost: 22.03 | 5 |
| Task Planning | PROCTHOR Breakfast+Coffee | Average Cost: 207.17 | 5 |
| Task Planning | PROCTHOR Coffee | Avg Cost: 112.76 | 5 |
| Task Planning | PROCTHOR Breakfast | Average Cost: 82.59 | 5 |
| Task Planning | PROCTHOR Deliver 3-Object | Avg. Cost: 94.58 | 5 |
| Navigation | ProcTHOR | Success Rate: 58 | 4 |