Share your thoughts, 1 month free Claude Pro on usSee more

Automated Evaluation of Tutor Responses on Kochmar 2025 (demonstration set)

68Accuracy

LoMTL

Updated 5mo ago

Evaluation Results

Method	Links
LoMTL 2025.12		68	55
GPT-5 2025.12		66	58
Prometheus2 2025.12		41	33