
Text Generation on Aggregate NLP Tasks (GEC, Smart Reply, Summarization, Tone Adjustment, QA) (test)

Average Score: 32.9

Separate single-task LoRAs

Jan 24, 2026
Updated 1mo ago

Evaluation Results

Method    Date       Score    (unlabeled)    Links
          2026.01    32.9     100
          2026.01    31.7     100
          2026.01    29       100
          2026.01    26       12.5
          2026.01    24.6     12.5
          2026.01    22.5     12.5
          2026.01    22.4     12.5
          2026.01    22.3     12.5
          2026.01    21.9     12.5
          2026.01    21.5     12.5
          2026.01    20.4     12.5
          2026.01    17.3     12.5
          2026.01    16.1     12.5
          2026.01    15.5     12.5
          2026.01    15.2     0
          2026.01    15.2     12.5
          2026.01    14.7     0
          2026.01    6.5      0