Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Pairwise engagement prediction on CS 1.6
Loading...
66.7
Accuracy
InternVL
38.62
45.91
53.2
60.49
Mar 19, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
InternVL
strategy=S5
2026.03
66.7
InternVL
strategy=S6
2026.03
66.7
Qwen
strategy=S5
2026.03
55.6
Majority-class baseline
mode=Majority baseline
2026.03
55.1
InternVL
strategy=S1
2026.03
52.6
Qwen
strategy=S2
2026.03
52.6
InternVL
strategy=S3
2026.03
48.7
Qwen
strategy=S1
2026.03
48.7
InternVL
strategy=S4
2026.03
44.9
GPT-4o
strategy=S2
2026.03
44.9
Qwen
strategy=S6
2026.03
44.4
InternVL
strategy=S2
2026.03
43.6
Qwen
strategy=S4
2026.03
42.3
GPT-4o
strategy=S1
2026.03
42.3
Qwen
strategy=S3
2026.03
39.7
Feedback
Search any
task
Search any
task