Data Contamination Detection on Omni-MATH Dataset C
[Chart: Score (Reference). Highlighted point: ZCP, 23.22, May 21, 2026. Legend: Score (Reference), Score (Main), Contamination Score (C_cont).]
Updated 12d ago
Evaluation Results
Method  Configuration              Links    Score (Reference)  Score (Main)  Contamination Score (C_cont)
ZCP     Model=FT Qwen-Math, Me...  2026.05  23.22              28.04         99.7
ZCP     Model=FT Qwen3, Metric...  2026.05  19.55              25.13         100
ZCP     Model=FT Qwen-Math, Me...  2026.05  17.46              26.08         100
ZCP     Model=FT Qwen3, Metric...  2026.05  15.4               24.75         100
ZCP     Model=FT Qwen3, Metric...  2026.05  0.375              0.471         99.8
ZCP     Model=FT Qwen-Math, Me...  2026.05  0.212              0.334         99.8
ZCP     Model=FT Qwen-Math, Me...  2026.05  0.18               0.305         99.8
ZCP     Model=FT Qwen3, Metric...  2026.05  0.18               0.297         99.8