Agent action prediction on Switchboard 138-session (test)
Top wF1: 63.3, DualTurn (A – LoRA (main)), Mar 9, 2026
Metrics: wF1, BC F1, Ant@-240
Updated 1mo ago
Evaluation Results

| Method | Config | Date | wF1 | BC F1 | Ant@-240 |
|---|---|---|---|---|---|
| DualTurn (A – LoRA (main)) | variant=A, finetuning=... | 2026.03 | 63.3 | 34.9 | 87.4 |
| DualTurn (E – Full FT) | variant=E, finetuning=... | 2026.03 | 62.6 | 33.7 | 87.9 |
| DualTurn (B – LoRA + CB) | variant=B, finetuning=... | 2026.03 | 61.3 | 7.7 | 87.6 |
| DualTurn (G – Text-Aware PT) | variant=G, pretraining... | 2026.03 | 60.5 | 8.5 | 87.3 |
| DualTurn (C – No Pretrain) | variant=C, pretraining... | 2026.03 | 60.4 | 7.9 | 86.3 |
| DualTurn (D – LSTM) | variant=D, architectur... | 2026.03 | 60.2 | 7.7 | 86.8 |
| DualTurn (F – Discrete) | variant=F, type=Discrete | 2026.03 | 60.2 | 7.2 | 86.7 |
| VAP (LR-6) | aggregator=logistic re... | 2026.03 | 38.9 | 0 | 78 |
| VAP (native) | type=baseline | 2026.03 | 27.6 | - | 78.5 |