Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Story Question Answering on LongStoryQA Large
Loading...
61.9
F1 Score
SitEmb-v1.5-Qwen3 (QA)
52.332
54.816
57.3
59.784
Aug 3, 2025
F1 Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
F1 Score
SitEmb-v1.5-Qwen3 (QA)
Number of retrieved ch...
2025.08
61.9
SitEmb-v1.5-Qwen3 (QA+SA)
Number of retrieved ch...
2025.08
61.5
Qwen3 (QA)
Number of retrieved ch...
2025.08
61.4
Qwen3 (out-of-box)
Number of retrieved ch...
2025.08
61.2
SitEmb-v1.5-Qwen3 (QA+SA)
Number of retrieved ch...
2025.08
59.4
Qwen3 (QA)
Number of retrieved ch...
2025.08
59.2
SitEmb-v1.5-Qwen3 (QA)
Number of retrieved ch...
2025.08
58.7
Qwen3 (QA)
Number of retrieved ch...
2025.08
58.3
Qwen3 (out-of-box)
Number of retrieved ch...
2025.08
57.9
SitEmb-v1.5-Qwen3 (QA)
Number of retrieved ch...
2025.08
57.7
SitEmb-v1.5-Qwen3 (QA+SA)
Number of retrieved ch...
2025.08
57.7
Qwen3 (out-of-box)
Number of retrieved ch...
2025.08
52.7
Feedback
Search any
task
Search any
task