Dialogue Evaluation on Elec2Deb20 Normal Students 1.0 (test)
[Chart: Divergence score over time. LFTutor: 86 (May 31, 2026).]
Evaluation Results
| Method | Divergence | Stance Change | Repetition | Lack of Refutation | Lack of Evidence Inquiry | Strategy Fixation | Unexplained LF Terms | Passive Guidance | Avg. Performance |
|---|---|---|---|---|---|---|---|---|---|
| LFTutor (Teacher Model=Gemini-2..., 2026.05) | 86 | 87 | 62 | 100 | 98 | 94 | 92 | 84 | 87.9 |
| BASE (Teacher Model=Gemini-2..., 2026.05) | 97 | 70 | 98 | 94 | 56 | 92 | 56 | 73 | 79.5 |
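
The Avg. Performance values appear to be the unweighted mean of the eight per-category scores, rounded to one decimal place. The page does not state this; it is an inference from the numbers in the Evaluation Results above. A minimal Python sketch that checks the arithmetic follows (the `scores` dictionary and the `avg_performance` helper are hypothetical names introduced only for illustration):

```python
# Per-category scores copied from the Evaluation Results, in column order:
# Divergence, Stance Change, Repetition, Lack of Refutation,
# Lack of Evidence Inquiry, Strategy Fixation, Unexplained LF Terms,
# Passive Guidance.
scores = {
    "LFTutor": [86, 87, 62, 100, 98, 94, 92, 84],
    "BASE": [97, 70, 98, 94, 56, 92, 56, 73],
}


def avg_performance(values):
    """Unweighted mean of the category scores (assumed aggregation)."""
    return sum(values) / len(values)


for method, values in scores.items():
    print(method, avg_performance(values))
```

The computed means are 87.875 for LFTutor and 79.5 for BASE, which agree with the reported 87.9 and 79.5 once rounded to one decimal place.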