Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LAILA: A Large Trait-Based Dataset for Arabic Automated Essay Scoring

About

Automated Essay Scoring (AES) has gained increasing attention in recent years, yet research on Arabic AES remains limited due to the lack of publicly available datasets. To address this, we introduce LAILA, the largest publicly available Arabic AES dataset to date, comprising 7,859 essays annotated with holistic and trait-specific scores on seven dimensions: relevance, organization, vocabulary, style, development, mechanics, and grammar. We detail the dataset design, collection, and annotations, and provide benchmark results using state-of-the-art Arabic and English models in prompt-specific and cross-prompt settings. LAILA fills a critical need in Arabic AES research, supporting the development of robust scoring systems.

May Bashendy, Walid Massoud, Sohaila Eltanbouly, Salam Albatarni, Marwan Sayed, Abrar Abir, Houda Bouamor, Tamer Elsayed• 2025

Related benchmarks

TaskDatasetResultRank
Trait-level Essay ScoringLAILA (test)
Relevance0.36
3
Automated essay scoringLAILA
P1 Score0.4
3
Showing 2 of 2 rows

Other info

Follow for update