Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Logical Fallacy Tutoring on Elec2Deb20 Human Evaluation (pilot study)
Loading...
1.65
Divergence
BASE
1.584
2.0295
2.475
2.9205
May 31, 2026
Divergence
Stance Change
Repetition
Lack of Refutation
Lack of Evidence Inquiry
Strategy Fixation
Unexplained LF Terms
Passive Guidance
Helpfulness
Updated 1d ago
Evaluation Results
Method
Method
Links
Divergence
Stance Change
Repetition
Lack of Refutation
Lack of Evidence Inquiry
Strategy Fixation
Unexplained LF Terms
Passive Guidance
Helpfulness
BASE
Metric Type=Mean Liker...
2026.05
1.65
1.75
2.65
3
2.65
1.35
2.3
2.9
3.35
LFTutor
Metric Type=Mean Liker...
2026.05
3.3
3.1
3.1
4.15
4.2
2.15
3
4
4.15
Feedback
Search any
task
Search any
task