Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Misinformation Belief Evaluation on MISBELIEF hard misinformation with third round evidence

4.05Performance

Qwen-turbo

3.36363.54183.723.8982Jan 9, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.01
4.055
2026.01
3.914
2026.01
3.853
2026.01
3.692
2026.01
3.391