Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ASAP

Benchmarks

Task NameDataset NameSOTA ResultTrend
Aspect-based Sentiment AnalysisASAP (test)
F1 Score94.59
95
Automated Essay ScoringASAP 1.0 (test)
Prompt 1 QWK0.836
51
Sheet music transcriptionASAP n=102
OMR-NED64.3
16
Evaluation AlignmentASAP 2.0
QWK0.7276
16
Automatic Text EvaluationASAP
QWK0.379
15
Automated Essay ScoringASAP Kaggle 2.0 (test)
QWK0.84
13
Trait-wise Automated Essay ScoringASAP and ASAP++ (five-fold cross-val)
Overall Score77.8
11
Automated Essay ScoringASAP and ASAP++ (five-fold cross-validation)
Score P10.73
11
Essay ScoringASAP++ five-fold averaged results
Overall Score0.712
10
Essay ScoringASAP-SAS
QWK (Prompt 3)0.661
10
Automated Essay ScoringASAP++ full-data setting
Score P10.734
10
Multi-trait Automated Essay ScoringASAP++ (full-data)
Overall Score0.781
10
Automatic Text ScoringASAP (test)
QWK0.764
9
Automatic Essay ScoringASAP In-domain (5-fold cross-validation)
Overall QWK0.785
8
Automated Essay ScoringASAP 2.0
QWK49.51
7
Rhythm QuantizationASAP ACPAS definitions (test)
Epsilon Onset Error12.3
7
Expressive Piano Performance RenderingASAP (test)
Velocity JS Div0.0427
7
Standard-Cell Performance PredictionASAP 7nm (test)
Rise Delay MAPE0.94
6
Multi-trait automated essay scoringASAP Prompt 8 (test)
Ideas0.694
6
Multi-trait automated essay scoringASAP Prompt 7 (test)
Ideas Score69.5
6
Automated Essay ScoringASAP++
QWK0.726
5
Automated Essay ScoringASAP
QWK0.743
5
Trait-level Essay ScoringASAP (test)
Content Score65.1
4
Automated Essay ScoringASAP Long Essays (Prompts 1, 2, 8)
Score (P1)83.6
4
Alignment(n)ASAP Dataset piano performances
Mean Error (ms)6
3
Showing 25 of 34 rows