Query Auto-Completion on Human Evaluation Set

69.9 Item-wise Score

Full

Updated 3mo ago

Evaluation Results

Method	Links	Item-wise Score	
Full 2026.02		69.9	0.4
SFT + DPO w/o Eng 2026.02		69.8	0.69
SFT-only 2026.02		68.9	0.5
LTR Baseline 2026.02		65.3	-