Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Revision Generation on RESEARCHARCADE
Loading...
0.733
SBERT Score
LLM
0.30452
0.41576
0.527
0.63824
Nov 27, 2025
SBERT Score
ROUGE-L
GPT-4o-mini Score
Updated 4d ago
Evaluation Results
Method
Method
Links
SBERT Score
ROUGE-L
GPT-4o-mini Score
LLM
Backbone=Qwen3-0.6B, T...
2025.11
0.733
0.554
0.572
LLM
Backbone=Qwen3-8B, Tra...
2025.11
0.704
0.446
0.889
LLM
Backbone=GPTOSS-120B,...
2025.11
0.669
0.265
0.999
LLM
Backbone=Qwen3-0.6B, T...
2025.11
0.321
0.21
0.447
Feedback
Search any
task
Search any
task