Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Uncertainty Estimation in Autoregressive Structured Prediction

About

Uncertainty estimation is important for ensuring safety and robustness of AI systems. While most research in the area has focused on un-structured prediction tasks, limited work has investigated general uncertainty estimation approaches for structured prediction. Thus, this work aims to investigate uncertainty estimation for autoregressive structured prediction tasks within a single unified and interpretable probabilistic ensemble-based framework. We consider: uncertainty estimation for sequence data at the token-level and complete sequence-level; interpretations for, and applications of, various measures of uncertainty; and discuss both the theoretical and practical challenges associated with obtaining them. This work also provides baselines for token-level and sequence-level error detection, and sequence-level out-of-domain input detection on the WMT'14 English-French and WMT'17 English-German translation and LibriSpeech speech recognition datasets.

Andrey Malinin, Mark Gales• 2020

Related benchmarks

TaskDatasetResultRank
Hallucination DetectionTriviaQA
AUROC0.7962
621
Hallucination DetectionHotpotQA
AUROC0.7449
249
Hallucination DetectionTriviaQA (test)
AUC-ROC80.2
243
Hallucination DetectionHaluEval (test)
AUC-ROC73.57
176
Hallucination DetectionNQ
AUC0.73
154
Hallucination DetectionHaluEval
AUROC0.7371
131
Correctness PredictionTriviaQA
AUROC0.8453
113
Hallucination DetectionTruthfulQA (test)
AUC-ROC63.6
112
Uncertainty EstimationTriviaQA
AUROC72.57
111
Uncertainty EstimationTriviaQA (test)
AUROC78.3
110
Showing 10 of 184 rows
...

Other info

Follow for update