
CiQi-Bench

Benchmarks

Task Name                           Dataset Name  SOTA Result               Trend
Free-form Question Answering        CiQi-Bench    Accuracy (Dynasty): 71.3  11
Multiple-choice Question Answering  CiQi-Bench    Dynasty Accuracy: 77.6    11