
UpWork Narrative Situations

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Semantic Representation Evaluation | UpWork Narrative Situations Tech. Development (test) | Preference Rate: 96 | 6 |
| Semantic Representation Evaluation | UpWork Narrative Situations Healthcare (test) | Preference Rate: 100 | 6 |
| Semantic Representation Evaluation | UpWork Narrative Situations Firefighting (test) | Preference Rate: 98 | 6 |
| Semantic Representation Evaluation | UpWork Narrative Situations Economy (test) | Preference Rate: 98 | 6 |
| Semantic Representation Evaluation | UpWork Narrative Situations Crime & Justice (test) | Preference Rate: 96 | 6 |