PRP Faithfulness Evaluation on Alice original characters 8 persona statements
[Chart: ΔAPC (DeB) by date. Highlighted point: 2.9 ΔAPC (DeB), Retrieval-augmented Generation (RAG), May 13, 2024. Selectable metrics: ΔAPC (DeB), ΔAPC (GPT-4), Human Score.]
Evaluation Results
| Method | CPO (APC-based DPO) | Date | ΔAPC (DeB) | ΔAPC (GPT-4) | Human Score |
| --- | --- | --- | --- | --- | --- |
| Retrieval-augmented Generation (RAG) | True | 2024.05 | 2.9 | 2.2 | 7.6 |
| Retrieval-augmented Generation (RAG) | False | 2024.05 | 2.8 | 1.8 | 6.8 |
| Long-context Memory (LCM) | True | 2024.05 | 2.8 | 2.2 | 7.6 |
| Experience Upload (EU) | True | 2024.05 | 2.7 | 1.4 | 6.8 |
| Experience Upload (EU) | False | 2024.05 | 2.6 | 1.1 | 6.4 |
| Long-context Memory (LCM) | False | 2024.05 | 2.6 | 1.4 | 6.8 |
| Gemma-7B | False | 2024.05 | 0.7 | 0.3 | 1.8 |