Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Explanation Quality Evaluation on LIAR-RAW (test)

1.53ChatGPT Meaningfulness Score

Oracle

1.4981.7141.932.146Nov 25, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.11
1.534.54.774.771.473.613.893.86
2025.11
1.654.794.864.881.753.763.923.96
2025.11
1.774.44.64.531.973.683.523.56
2025.11
1.874.54.674.672.123.483.373.49
2025.11
1.894.764.784.52.353.483.362.62
2025.11
2.074.434.674.732.223.223.383.57
2025.11
2.334.174.434.632.682.682.843.27