Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PDP

Benchmarks

Task NameDataset NameSOTA ResultTrend
Pronoun Disambiguation ProblemPDP 2016 (test)
Accuracy78.3
21
Pickup and Delivery ProblemPDP20 uniform
Objective Value4.595
9
Commonsense ReasoningPDP
Accuracy91.66
8
Coreference ResolutionPDP (test)
Accuracy95
7
Showing 4 of 4 rows