End-to-End Open-Domain Question Answering with BERTserini

About

We demonstrate an end-to-end question answering system that integrates BERT with the open-source Anserini information retrieval toolkit. In contrast to most question answering and reading comprehension models today, which operate over small amounts of input text, our system integrates best practices from IR with a BERT-based reader to identify answers from a large corpus of Wikipedia articles in an end-to-end fashion. We report large improvements over previous results on a standard benchmark test collection, showing that fine-tuning pretrained BERT with SQuAD is sufficient to achieve high accuracy in identifying answer spans.

Wei Yang, Yuqing Xie, Aileen Lin, Xingyu Li, Luchen Tan, Kun Xiong, Ming Li, Jimmy Lin • 2019
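
The retrieve-then-read flow described in the abstract can be illustrated with a minimal, self-contained sketch. Everything in it is a stand-in: `toy_retrieve` ranks paragraphs by term overlap rather than calling Anserini, `toy_read` is a trivial heuristic rather than a fine-tuned BERT model, and the linear interpolation of the two scores with a weight `mu` is an assumption made for illustration, since this page does not say how the scores are combined. All function names and parameters are hypothetical.

```python
import string


def terms(text):
    """Return lowercased tokens with surrounding punctuation stripped."""
    stripped = (w.strip(string.punctuation) for w in text.split())
    return [w.lower() for w in stripped if w]


def toy_retrieve(question, corpus, k):
    """Stand-in retriever: score each paragraph by question-term overlap.

    A real system would query an inverted index (e.g. via Anserini) instead.
    """
    q = set(terms(question))
    scored = [(p, float(sum(t in q for t in terms(p)))) for p in corpus]
    return sorted(scored, key=lambda item: item[1], reverse=True)[:k]


def toy_read(question, paragraph):
    """Stand-in reader: return (span, score) for one paragraph.

    The span is the first paragraph token that does not occur in the
    question; the score is the fraction of question terms the paragraph
    covers. A real reader would be a model fine-tuned on SQuAD.
    """
    q = set(terms(question))
    score = len(q & set(terms(paragraph))) / len(q) if q else 0.0
    for word in paragraph.split():
        token = word.strip(string.punctuation)
        if token and token.lower() not in q:
            return token, score
    return "", 0.0


def answer_question(question, corpus, k=2, mu=0.5):
    """Retrieve k paragraphs, read each, and return the best-scoring span.

    Assumption: retriever and reader scores are combined by linear
    interpolation with weight mu.
    """
    best_answer, best_score = "", float("-inf")
    for paragraph, retriever_score in toy_retrieve(question, corpus, k):
        span, reader_score = toy_read(question, paragraph)
        combined = (1 - mu) * retriever_score + mu * reader_score
        if combined > best_score:
            best_answer, best_score = span, combined
    return best_answer


if __name__ == "__main__":
    corpus = [
        "Ottawa is the capital of Canada.",
        "Paris is the capital of France.",
        "The Rhine flows through Germany.",
    ]
    print(answer_question("What is the capital of France?", corpus))
```

The point of the sketch is the shape of the pipeline: the reader only ever sees the few paragraphs the retriever returns, so the final answer depends on both stages.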

Related benchmarks

Task	Dataset	Result
Open-domain Question Answering	SQUAD Open (test)	Exact Match 38.6	39
Open-domain Question Answering	SQuAD Open-domain 1.1 (test)	Exact Match (EM) 38.6	30
Question Answering	SQuAD-Open	EM 38.6	28
Open-domain Question Answering	SQuAD	EM 38.6	16
Open-domain Question Answering	SQuAD v1.1 (dev)	EM 38.6	13
Open-domain Question Answering	SQuAD (test)	Accuracy 38.6	7
Open-domain Question Answering	OpenSQuAD 1.1 (test)	EM 38.6	7
Open-domain Question Answering	OpenCMRC (test)	F1 Score 60.9	3
Open-domain Question Answering	OpenDRCD (test)	F1 65	3
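
For readers unfamiliar with the Exact Match (EM) metric reported in most rows above, the following is a minimal sketch of how it is commonly computed. It assumes the answer normalization used by the SQuAD v1.1 evaluation script (lowercasing, then removing punctuation, English articles, and extra whitespace); this page does not state which evaluation script produced the listed numbers, and the function names here are illustrative.

```python
import re
import string


def normalize_answer(text):
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction, gold_answers):
    """Return 1.0 if the prediction matches any gold answer after normalization."""
    pred = normalize_answer(prediction)
    return float(any(pred == normalize_answer(g) for g in gold_answers))


def exact_match_percent(predictions, references):
    """Average EM over a dataset, as a percentage.

    predictions: list of predicted answer strings
    references:  list of lists of acceptable gold answers, same order
    """
    if not predictions:
        return 0.0
    total = sum(exact_match(p, refs) for p, refs in zip(predictions, references))
    return 100.0 * total / len(predictions)


if __name__ == "__main__":
    preds = ["The Eiffel Tower.", "Rome"]
    golds = [["Eiffel Tower"], ["Madrid"]]
    print(exact_match_percent(preds, golds))
```

Under this definition a score of 38.6 would mean that 38.6% of predicted answers matched a gold answer exactly after normalization.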
