Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Qualitative Coding on ScrumRQ2
Loading...
87.5
Recall
Qwen3.5-P
69.3
74.025
78.75
83.475
May 23, 2026
Recall
Updated 8d ago
Evaluation Results
Method
Method
Links
Recall
Qwen3.5-P
Perspective=Base
2026.05
87.5
Qwen3.5-P (Data)
Perspective=Data, Agen...
2026.05
87.5
GPT-5 (Data)
Perspective=Data, Agen...
2026.05
85
Qwen3.5-P (Theory)
Perspective=Theory, Ag...
2026.05
82.5
Qwen3.5-P (Applied)
Perspective=Applied, A...
2026.05
82.5
GPT-5
Perspective=Base
2026.05
80
GPT-5 (Theory)
Perspective=Theory, Ag...
2026.05
80
GPT-5 (Applied)
Perspective=Applied, A...
2026.05
80
DS-V3.2
Perspective=Base
2026.05
77.5
DS-V3.2 (Theory)
Perspective=Theory, Ag...
2026.05
77.5
DS-V3.2 (Data)
Perspective=Data, Agen...
2026.05
72.5
DS-V3.2 (Applied)
Perspective=Applied, A...
2026.05
70
Feedback
Search any
task
Search any
task