Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Conversational Question Answering on QuAC (Acc, Macro-F1)
Loading...
53.51
Accuracy
SINKTRACK
35.7988
40.3969
44.995
49.5931
Apr 11, 2026
Accuracy
Macro-F1
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Macro-F1
SINKTRACK
Model=Llama3.1-8B-Inst...
2026.04
53.51
51.56
Direct
Model=Llama3.1-8B-Inst...
2026.04
52.45
50.66
SINKTRACK
Model=Qwen2.5-7B-Instr...
2026.04
50.25
46.73
Direct
Model=Qwen2.5-7B-Instr...
2026.04
49.47
46.71
SINKTRACK
Model=MiniCPM3-4B, Pro...
2026.04
47.29
48.08
Direct
Model=MiniCPM3-4B, Pro...
2026.04
47.17
48.07
CoT
Model=Llama3.1-8B-Inst...
2026.04
46.95
28.74
CoT
Model=MiniCPM3-4B, Pro...
2026.04
37.68
36.5
CoT
Model=Qwen2.5-7B-Instr...
2026.04
36.48
36.58
Feedback
Search any
task
Search any
task