Human Correlation Analysis on Refined Human Judgment Dataset (Human vs. Model-Generated)
[Chart: SO-S and MAUVE-S over time. Labeled point: SO, SO-S = 0.995, May 17, 2023.]
Evaluation Results

Method | Method                    | Links   | SO-S  | MAUVE-S
SO     | Human Judgment Criteri... | 2023.05 | 0.995 | -
SO     | Human Judgment Criteri... | 2023.05 | 0.81  | -
SO     | Human Judgment Criteri... | 2023.05 | 0.357 | -
MAUVE  | Human Judgment Criteri... | 2023.05 | -     | 0.214
MAUVE  | Human Judgment Criteri... | 2023.05 | -     | 0.667
MAUVE  | Human Judgment Criteri... | 2023.05 | -     | 0.706