Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-form Multiple Choice Question Answering

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multiple Choice Question AnsweringLong-form Multiple Choice Question Answering benchmark
Accuracy83.2
10
Showing 1 of 1 rows