Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Online Out-of-Distribution Detection on Precalculus Far-shift OOD
Loading...
99.28
Accuracy
TV Score
79.572
84.6885
89.805
94.9215
May 22, 2024
Accuracy
Robustness
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Robustness
TV Score
Model=Llama2-7B, Evalu...
2024.05
99.28
0.67
Output Embedding
Model=Llama2-7B, Evalu...
2024.05
88.5
1.38
Input Embedding
Model=Llama2-7B, Evalu...
2024.05
80.33
6.13
Feedback
Search any
task
Search any
task