Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Agent action prediction on otoSpeech 113-session (test)
Loading...
70.9
wF1
DualTurn (E – Full FT)
45.108
51.804
58.5
65.196
Mar 9, 2026
wF1
BC F1
Ant@−240
Updated 1mo ago
Evaluation Results
Method
Method
Links
wF1
BC F1
Ant@−240
DualTurn (E – Full FT)
variant=E, finetuning=...
2026.03
70.9
49.8
86
DualTurn (A – LoRA (main))
variant=A, finetuning=...
2026.03
70.7
51.2
84.8
VAP (LR-6)
aggregator=logistic re...
2026.03
46.1
0
71.9
Feedback
Search any
task
Search any
task