Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Online Out-of-Distribution Detection on Num. Theory (Far-shift OOD)
Loading...
92.08
Accuracy
TV Score
52.872
63.051
73.23
83.409
May 22, 2024
Accuracy
Robustness
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Robustness
TV Score
Model=Llama2-7B, Evalu...
2024.05
92.08
2.34
Input Embedding
Model=Llama2-7B, Evalu...
2024.05
85.8
3.31
Output Embedding
Model=Llama2-7B, Evalu...
2024.05
54.38
11.45
Feedback
Search any
task
Search any
task