Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PubHealth

Benchmarks

Task NameDataset NameSOTA ResultTrend
Fact-CheckingPubHealth
Balanced Accuracy78.66
26
Closed-set Question AnsweringPubHealth
Accuracy74.5
15
Multi-Label Verdict PredictionPUBHEALTH supplementary experiments
OI2.786
8
Active RAGPubhealth
Accuracy73.4
6
Question AnsweringPubHealth (test)
Accuracy40.86
3
Multi-label Verdict PredictionPUBHEALTH
Margin-0.0008
2
Showing 6 of 6 rows