Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Identifying plausible explanations on δ-ATOMIC
Loading...
87.6
Accuracy
Always Tell Me The Odds
45.9272
56.7461
67.565
78.3839
May 24, 2023
Sep 19, 2023
Jan 15, 2024
May 12, 2024
Sep 7, 2024
Jan 3, 2025
May 2, 2025
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Always Tell Me The Odds
Backbone=Qwen2.5-14B-I...
2025.05
87.6
Always Tell Me The Odds
Backbone=Qwen2.5-14B-I...
2025.05
81.2
Always Tell Me The Odds
Backbone=Qwen2.5-14B-I...
2025.05
80.1
Always Tell Me The Odds
Backbone=Qwen2.5-8B-In...
2025.05
78.9
Always Tell Me The Odds
Backbone=Qwen2.5-7B-In...
2025.05
78.6
RoBERTa
plausibility annotatio...
2023.05
78.3
LiPoR
plausibility annotatio...
2023.05
76.82
KDDC-ATOMIC (N)
knowledge-augmented=tr...
2023.05
75.9
RoBERTa-L
Type=Encoder
2025.05
75
Llama-3-Instruct
Evaluation Protocol=Pr...
2025.05
74.7
KDDC-CSKG (N)
knowledge-augmented=tr...
2023.05
72.2
GPT-4o
Evaluation Protocol=0-...
2025.05
70.7
DeepSeek-R1-Distill-Qwen-32B
Evaluation Protocol=0-...
2025.05
69.1
Tuned BART
setting=tuned, plausib...
2023.05
67.49
KDDC-CWWV (N)
knowledge-augmented=tr...
2023.05
62.48
ZS BART
setting=zero-shot, pla...
2023.05
59.05
ZS GPT3
setting=zero-shot, pla...
2023.05
50.73
ZS GPT-NEO
setting=zero-shot, pla...
2023.05
47.53
Feedback
Search any
task
Search any
task