Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Lexical Paraphrasing on TURK (test)
Loading...
4.13
Mean Score
PO_Think
3.4436
3.6218
3.8
3.9782
Dec 6, 2025
Mean Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Mean Score
PO_Think
backbone=Phi3-3.8B
2025.12
4.13
GPT-4o
2025.12
3.65
LENS_SALSA
backbone=Phi3-3.8B
2025.12
3.47
Feedback
Search any
task
Search any
task