Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Semantic Representation Evaluation on UpWork Narrative Situations Firefighting (test)
Loading...
98
Preference Rate
Operator
-1.84
24.08
50
75.92
Nov 10, 2025
Preference Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Preference Rate
Operator
Compared Baseline=GLEN
2025.11
98
Operator
Compared Baseline=BERT...
2025.11
98
Operator
Compared Baseline=FST
2025.11
79
FST
Compared Against=Operator
2025.11
21
GLEN
Compared Against=Operator
2025.11
2
BERT-SRL
Compared Against=Operator
2025.11
2
Feedback
Search any
task
Search any
task