Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Dialogue Evaluation on Elec2Deb20 Normal students
Loading...
83
Divergence
BASE
70.52
73.76
77
80.24
May 31, 2026
Divergence
Stance Change
Repetition
Lack of Refutation
Lack of Evidence Inquiry
Strategy Fixation
Unexplained LF Terms
Passive Guidance
Avg. Performance
Updated 1d ago
Evaluation Results
Method
Method
Links
Divergence
Stance Change
Repetition
Lack of Refutation
Lack of Evidence Inquiry
Strategy Fixation
Unexplained LF Terms
Passive Guidance
Avg. Performance
BASE
Teacher model=LLaMA-3....
2026.05
83
32
88
80
24
68
66
44
60.6
LFTutor
Teacher model=LLaMA-3....
2026.05
71
79
86
95
79
92
84
37
77.9
Feedback
Search any
task
Search any
task