Multi-hop Reasoning on StrategyQA (ASR/HSR)
[Chart: ASR and HSR by method. Top ASR: 88.4 (ShadowCoT), Apr 8, 2025.]
Evaluation Results
Method      Target Model   Date      ASR    HSR
ShadowCoT   LLaMA          2025.04   88.4   81.2
DarkMind    LLaMA          2025.04   79.6   71.5
SABER       LLaMA          2025.04   75     66.8
BadChain    LLaMA          2025.04   73.4   64.7