Linguistic Probing on BLiMP
[Chart: Performance over time. Highlighted point: Masked-Diffusion, 64.8, Dec 16, 2025.]
Evaluation Results
Method            Configuration              Date     Performance
Masked-Diffusion  Repetitions=32, Alpha...   2025.12  64.8
Dual              Alpha (α)=63/64, Data...   2025.12  63.7
Dual              Repetitions=128, Alpha...  2025.12  63.5
Masked-Diffusion  Repetitions=128, Alpha...  2025.12  63.3
Dual              Repetitions=32, Alpha...   2025.12  62.7
Autoregressive    Alpha (α)=1, Data repe...  2025.12  61.3
Dual              Alpha (α)=3/4, Data re...  2025.12  57.9
Dual              Alpha (α)=1/8, Data re...  2025.12  56.1
Autoregressive    Alpha (α)=1, Data repe...  2025.12  53.3
Autoregressive    Alpha (α)=1, Data repe...  2025.12  33.2