Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Faithfulness Evaluation on CNN/DM
Loading...
33.2
AUPC
P-SHAP
8.76
15.105
21.45
27.795
Mar 21, 2024
AUPC
Updated 4d ago
Evaluation Results
Method
Method
Links
AUPC
P-SHAP
Model=Flan-UL2, Scalar...
2024.03
33.2
MExGen C-LIME
Model=Flan-UL2, Scalar...
2024.03
32.1
MExGen LOO
Model=Flan-UL2, Scalar...
2024.03
32.1
MExGen L-SHAP
Model=Flan-UL2, Scalar...
2024.03
32
MExGen C-LIME
Model=Llama-3-8B-Instr...
2024.03
26.4
MExGen L-SHAP
Model=Llama-3-8B-Instr...
2024.03
26.3
MExGen LOO
Model=Llama-3-8B-Instr...
2024.03
26.1
P-SHAP
Model=Llama-3-8B-Instr...
2024.03
22.1
MExGen L-SHAP
Model=DistilBART, Scal...
2024.03
14.7
MExGen C-LIME
Model=DistilBART, Scal...
2024.03
13.5
MExGen LOO
Model=DistilBART, Scal...
2024.03
13.2
P-SHAP
Model=DistilBART, Scal...
2024.03
9.7
Feedback
Search any
task
Search any
task