Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Inference correction review (reason) on SocialIQA
Loading...
100
MHA
Mistral
79.2
84.6
90
95.4
Apr 18, 2026
MHA
Updated 1mo ago
Evaluation Results
Method
Method
Links
MHA
Mistral
pipeline=IIE pipeline
2026.04
100
GPT
pipeline=IIE pipeline
2026.04
80
Feedback
Search any
task
Search any
task