SOTA Sentence-level Error Detection on HQ2A 1.0 (test) | PapersWithCode

Exact Accuracy: 25.49

GPT-3.5-Turbo

Updated 3mo ago

Evaluation Results

Method	Links
GPT-3.5-Turbo 2024.07		25.49	11.76	62.75	37.65	0.99