Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MSC

Benchmarks

Task NameDataset NameSOTA ResultTrend
Dialogue Response GenerationMSC
B-4 Score35.8
38
Language ModelingMSC Session Openings 1.0 (val)
Perplexity7.78
10
Language ModelingMSC Session 5 1.0 (val)
Perplexity8.99
10
Language ModelingMSC Session 4 1.0 (val)
Perplexity9.07
10
Language ModelingMSC Session 3 1.0 (val)
Perplexity8.96
10
Language ModelingMSC Session 2 1.0 (val)
Perplexity9.08
10
Language ModelingMSC Session 1 1.0 (val)
Perplexity8.14
10
Text GenerationMSC
SacreBLEU1.23
5
Conversational MemoryMSC
RP@1077.2
5
Sparse Matrix-Vector multiplicationmsc10848
Memory (MB)1,014.04
4
Speech Mask DetectionMSC (test)
UAR72.5
3
Head-to-Head Comparative EvaluationMSC (test)
Wins289
2
Conversational Quality Evaluation (Conversational Turns)MSC 10% human-annotated sample
Topic Consistency64.12
1
Pulmonary nodule diagnosisMSC
AUC0.927
1
Showing 14 of 14 rows