MOCHA

Benchmarks

Task Name                      Dataset Name   SOTA Result                  Trend
Question Answering             MOCHA (test)   Pearson's r: 0.741           36
NLG Meta-evaluation            MOCHA          Kendall Correlation: 0.72    6
Question Answering Evaluation  MOCHA          Spearman Correlation: 0.872  6