Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
3-Way Classification on BEA Shared Task 2 Track 1 (Unseen Answers) 2026 (Evaluation)
Loading...
0.729
QWK
Meta-prompt thinking medium
0.69468
0.70359
0.7125
0.72141
May 11, 2026
QWK
WF1
Rank
Updated 21d ago
Evaluation Results
Method
Method
Links
QWK
WF1
Rank
Meta-prompt thinking medium
thinking_level=medium
2026.05
0.729
72.8
30
Meta-prompt thinking high (different prompt)
thinking_level=high, p...
2026.05
0.696
70.2
35
Feedback
Search any
task
Search any
task