Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ChronoTQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Question AnsweringChronoTQA Tier 1 (147 Phenopackets-grounded onset questions)
Accuracy86.4
16
Biomedical Temporal ReasoningChronoTQA 1.0 (120-question stratified subsample)
Cross-disease Comparison100
4
Biomedical Question AnsweringChronoTQA
Metric-
0
Showing 3 of 3 rows