
A BERT Baseline for the Natural Questions

About

This technical note describes a new baseline for the Natural Questions. Our model is based on BERT and reduces the gap between the model F1 scores reported in the original dataset paper and the human upper bound by 30% and 50% relative for the long and short answer tasks respectively. This baseline has been submitted to the official NQ leaderboard at ai.google.com/research/NaturalQuestions. Code, preprocessed data and pretrained model are available at https://github.com/google-research/language/tree/master/language/question_answering/bert_joint.

Chris Alberti, Kenton Lee, Michael Collins • 2019
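The "relative" gap reduction quoted in the abstract is most naturally read as the fraction of the distance between the earlier model's F1 and the human upper bound that the new model closes. The sketch below illustrates that arithmetic only. The function name and the scores passed to it are hypothetical placeholders, not figures from the paper or the leaderboard.

```python
def relative_gap_reduction(baseline_f1: float, new_f1: float, human_f1: float) -> float:
    """Return the fraction of the baseline-to-human F1 gap closed by the new model."""
    gap = human_f1 - baseline_f1
    if gap <= 0:
        raise ValueError("human_f1 must be greater than baseline_f1")
    return (new_f1 - baseline_f1) / gap


# Hypothetical scores, chosen only to make the arithmetic easy to follow.
example = relative_gap_reduction(baseline_f1=50.0, new_f1=56.0, human_f1=70.0)
print(f"{example:.0%}")  # 6 points gained out of a 20-point gap
```

Under this reading, a 30% relative reduction means the new model recovers roughly three tenths of the F1 points separating the earlier baseline from human performance.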

Related benchmarks

| Task | Dataset | Result | Rank |
| Question Answering | Natural Question (NQ) (dev) | F1: 64.7 | 72 |
| Question Answering | Natural Questions (NQ) long answer (dev) | F1 Score: 64.7 | 11 |
| Short Answer Question Answering | Natural Questions (NQ) (test) | F1 Score: 52.1 | 7 |
| Long Answer Question Answering | Natural Questions (NQ) (test) | Precision: 64.1 | 6 |
