Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Conversational Question Answering on QuAC 1,000 1
Loading...
59.4
Accuracy
SINKTRACK
37.6328
43.2839
48.935
54.5861
Apr 11, 2026
Accuracy
Macro F1
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Macro F1
SINKTRACK
Model=Llama3.1-8B-Inst...
2026.04
59.4
54.05
CoT
Model=Llama3.1-8B-Inst...
2026.04
54.9
46.09
Direct
Model=Llama3.1-8B-Inst...
2026.04
54.1
48.87
SINKTRACK
Model=MiniCPM3-4B, Pro...
2026.04
53.57
52.82
SINKTRACK
Model=Qwen2.5-7B-Instr...
2026.04
52.93
47.52
Direct
Model=MiniCPM3-4B, Pro...
2026.04
52.82
52.79
Direct
Model=Qwen2.5-7B-Instr...
2026.04
52.57
47.06
CoT
Model=MiniCPM3-4B, Pro...
2026.04
42.69
45.45
CoT
Model=Qwen2.5-7B-Instr...
2026.04
38.47
42.11
Feedback
Search any
task
Search any
task