Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Syntax Evaluation on BLiMP
Loading...
84.61
Accuracy
GTCA
78.3388
79.9669
81.595
83.2231
Jan 23, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
GTCA
Backbone Model=Llama-3...
2026.01
84.61
GTCA
Backbone Model=Qwen-2....
2026.01
83.12
LoRA-only
Backbone Model=Qwen-2....
2026.01
80.87
LoRA-only
Backbone Model=Llama-3...
2026.01
80.7
Direct-Joint
Backbone Model=Llama-3...
2026.01
80.36
Direct-Joint
Backbone Model=Qwen-2....
2026.01
80.27
Backbone
Backbone Model=Llama-3...
2026.01
79.95
Backbone
Backbone Model=Qwen-2....
2026.01
78.58
Feedback
Search any
task
Search any
task