Contextual Question Answering on Gemma-2B-IT 5% forget set
[Chart: ROUGE-L over time. Highlighted point: PDU, 92.4 ROUGE-L, Oct 20, 2025. Selectable metrics: ROUGE-L, LLM Judge Score.]
Updated 6d ago
Evaluation Results
Method    Variant        Date     ROUGE-L  LLM Judge Score
PDU       Context-aware  2025.10  92.4     97.5
GradDiff  Context-aware  2025.10  86.8     94.5
SimNPO    Context-aware  2025.10  76.5     97
DPO       Context-aware  2025.10  74.9     85.5
SimNPO    Vanilla        2025.10  70.6     94
DPO       Vanilla        2025.10  40.5     55.5
PDU       Vanilla        2025.10  3        0
GradDiff  Vanilla        2025.10  0.1      0