Share your thoughts, 1 month free Claude Pro on usSee more

Text Generation on Aggregate NLP Tasks (GEC, Smart Reply, Summarization, Tone Adjustment, QA) (test)

32.9Average Score

Separate single-task LoRAs

Updated 3mo ago

Evaluation Results

Method	Links
Separate single-task LoRAs 2026.01		32.9	100
Separate single-task LoRAs 2026.01		31.7	100
Separate single-task LoRAs 2026.01		29	100
D2C 2026.01		26	12.5
D2C 2026.01		24.6	12.5
Random clustering 2026.01		22.5	12.5
K-Means w/ SVD clustering 2026.01		22.4	12.5
K-Means w/ SVD clustering 2026.01		22.3	12.5
K-Means clustering 2026.01		21.9	12.5
Random clustering 2026.01		21.5	12.5
K-Means clustering 2026.01		20.4	12.5
D2C 2026.01		17.3	12.5
K-Means w/ SVD clustering 2026.01		16.1	12.5
K-Means clustering 2026.01		15.5	12.5
Zero-shot 2026.01		15.2	0
Random clustering 2026.01		15.2	12.5
Zero-shot 2026.01		14.7	0
Zero-shot 2026.01		6.5	0