Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Explanation Quality Evaluation on LIAR RAW

2.29Meaningfulness Score

ChatGPT w/ evi

1.74921.88962.032.1704Nov 25, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.11
2.293.714.043.99
2025.11
2.273.934.294.5
2025.11
2.24.394.644.63
2025.11
2.064.124.284.47
2025.11
1.94.484.64.65
2025.11
1.854.444.64.69
2025.11
1.774.584.664.83