Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
RAG Robustness Evaluation on SIG Trivial
Loading...
90.5
Style Robustness
Mistral-7B-Instruct
84.78
86.265
87.75
89.235
Mar 7, 2025
Style Robustness
Source Fidelity
Logical Consistency
Format Adherence
Meta-Information Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Style Robustness
Source Fidelity
Logical Consistency
Format Adherence
Meta-Information Accuracy
Mistral-7B-Instruct
Evaluation paradigm=LL...
2025.03
90.5
91.5
92
93.8
96
Mistral-7B-Instruct
Evaluation paradigm=st...
2025.03
88
94
94.5
94
99
Llama-3.1-8B-Inst.
Evaluation paradigm=st...
2025.03
87.5
93.5
93
90.8
97
Llama-3.1-8B-Inst.
Evaluation paradigm=LL...
2025.03
85
92
91
90.8
93.3
Feedback
Search any
task
Search any
task