Share your thoughts, 1 month free Claude Pro on usSee more

Argument Component Detection on Persuasive Essays (PE) (test)

88.6Macro F1

Human Upper Bound

Updated 4mo ago

Evaluation Results

Method	Links
Human Upper Bound 2026.03		88.6	-
Llama-3-8B 2026.03		87.78	90.04
CRF with features 2026.03		86.7	-
GPT-2-1.5B 2026.03		85.21	88.04
OPT-6.7B 2026.03		85.18	88.56
MT-all 2026.03		75.66	-
DeBERTa-v3 2026.03		71.12	-
RoBERTa 2026.03		69.33	-
Heuristic Baseline 2026.03		64.2	-