Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

NCQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Error predictionNCQA (test)
AUROC99.2
76
Selective PredictionNCQA (test)
PRR99.1
19
Showing 2 of 2 rows