Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Translated SQuAD

Benchmarks

Task NameDataset NameSOTA ResultTrend
Machine ComprehensionTranslated SQuAD QH-PH 5.1 (test)
F1 Score63.59
4
Machine ComprehensionTranslated SQuAD QH-PE 5.1 (test)
F1 Score59.19
4
Machine ComprehensionTranslated SQuAD QE-PH 5.1 (test)
F164.51
4
Machine ComprehensionTranslated SQuAD QE-PE 5.1 (test)
F1 Score94.56
4
Showing 4 of 4 rows