TANDA: Transfer and Adapt Pre-Trained Transformer Models for Answer Sentence Selection
About
We propose TANDA, an effective technique for fine-tuning pre-trained Transformer models for natural language tasks. Specifically, we first transfer a pre-trained model into a model for a general task by fine-tuning it with a large and high-quality dataset. We then perform a second fine-tuning step to adapt the transferred model to the target domain. We demonstrate the benefits of our approach for answer sentence selection, which is a well-known inference task in Question Answering. We built a large scale dataset to enable the transfer step, exploiting the Natural Questions dataset. Our approach establishes the state of the art on two well-known benchmarks, WikiQA and TREC-QA, achieving MAP scores of 92% and 94.3%, respectively, which largely outperform the previous highest scores of 83.4% and 87.5%, obtained in very recent work. We empirically show that TANDA generates more stable and robust models reducing the effort required for selecting optimal hyper-parameters. Additionally, we show that the transfer step of TANDA makes the adaptation step more robust to noise. This enables a more effective use of noisy datasets for fine-tuning. Finally, we also confirm the positive impact of TANDA in an industrial setting, using domain specific datasets subject to different types of noise.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Answer Selection | WikiQA (test) | MAP0.92 | 149 | |
| Answer Sentence Selection | TREC-QA (test) | MAP94.3 | 63 | |
| Answer Sentence Selection | WQA (test) | P@11.9 | 19 | |
| Answer Sentence Selection | WikiQA clean (test) | MAP92 | 12 | |
| Answer Sentence Selection | Alexa Virtual Assistant traffic Sample 1 (test) | Prec@171.26 | 12 | |
| Answer Sentence Selection | Alexa Virtual Assistant traffic accurate Sample 2 (test) | Prec@174.85 | 12 | |
| Answer Sentence Selection | Alexa Virtual Assistant traffic accurate Sample 3 (test) | Prec@10.5814 | 12 | |
| Answer Selection | TREC-QA (test) | MAP91.4 | 9 |