Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Poetry Continuation on Arabic Dialectal Poetry Gulf
Loading...
2.01
LLM-as-a-Judge Score
Qwen-3-8B (Random)
1.022
1.2785
1.535
1.7915
Apr 30, 2026
LLM-as-a-Judge Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
LLM-as-a-Judge Score
Qwen-3-8B (Random)
Training Type=Random
2026.04
2.01
ALLaM-7B-instruct (Curriculum)
Training Type=Curriculum
2026.04
1.97
ALLaM-7B-instruct (Random)
Training Type=Random
2026.04
1.97
LLaMA-3-8B (Random)
Training Type=Random
2026.04
1.95
Qwen-3-8B (Curriculum)
Training Type=Curriculum
2026.04
1.94
LLaMA-3-8B (Curriculum)
Training Type=Curriculum
2026.04
1.92
Fanar-1-9B (Curriculum)
Training Type=Curriculum
2026.04
1.43
Fanar-1-9B (Random)
Training Type=Random
2026.04
1.38
Qwen-3-8B
Training Type=Base
2026.04
1.38
ALLaM-7B-instruct
Training Type=Instruct
2026.04
1.36
LLaMA-3-8B
Training Type=Base
2026.04
1.19
Fanar-1-9B
Training Type=Base
2026.04
1.06
Feedback
Search any
task
Search any
task