
Table Fact Verification on TABFACT small (test)

Accuracy: 0.921

Human Performance

[Chart: accuracy over time, Oct 1, 2020 to Jun 14, 2024; y-axis ticks at 0.48732, 0.59991, 0.7125, 0.82509]
Updated 1mo ago

Evaluation Results

Date      Accuracy   (unlabeled)   (unlabeled)
-         0.921      -             -
-         0.921      -             -
-         0.921      -             -
2022.11   0.906      -             -
2022.11   0.867      -             -
2022.11   0.865      -             -
2022.10   0.862      -             -
2021.07   0.859      -             -
2022.11   0.859      -             -
2024.06   0.8577     91.34         80.27
2024.06   0.856      91.2          80
2022.10   0.855      -             -
2024.06   0.851      -             -
-         0.847      -             -
-         0.846      -             -
2020.10   0.839      -             -
2021.07   0.839      -             -
2022.10   0.839      -             -
2022.11   0.839      -             -
2024.06   0.8261     88.46         76.84
2021.07   0.825      -             -
2022.10   0.823      -             -
2024.06   0.8167     88.36         75.07
2020.10   0.81       -             -
2020.10   0.804      -             -
2024.06   0.8009     86.07         74.19
2022.11   0.794      -             -
2024.06   0.7861     84.68         72.62
2020.10   0.774      -             -
2020.10   0.773      -             -
2021.07   0.762      -             -
2022.11   0.762      -             -
2024.06   0.7589     81.29         70.56
2024.06   0.7451     76.32         72.72
-         0.743      -             -
2021.07   0.743      -             -
2022.10   0.743      -             -
-         0.743      -             -
-         0.742      -             -
2021.07   0.742      -             -
2022.10   0.742      -             -
2024.06   0.7332     76.02         70.66
2020.10   0.722      -             -
2024.06   0.7184     76.82         66.93
2024.06   0.7041     73.73         67.12
-         0.689      -             -
-         0.689      -             -
-         0.681      -             -
2021.07   0.681      -             -
2022.11   0.681      -             -
2024.06   0.6789     73.13         62.71
2024.06   0.6625     65.97         66.54
2024.06   0.6586     70.75         61.04
2024.06   0.6517     70.65         59.76
2024.06   0.6191     64.88         58.98
2024.06   0.5771     59.5          55.94
-         0.504      -             -
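
The scores listed above are accuracies, that is, the fraction of test statements whose label a method predicts correctly. TabFact labels each statement as either entailed or refuted by its table, so the computation reduces to a binary match count. The sketch below is only an illustration, not the evaluation script behind these results: the function name `accuracy`, the 1/0 label encoding, and the toy data are all assumptions.

```python
from typing import Sequence


def accuracy(predictions: Sequence[int], labels: Sequence[int]) -> float:
    """Return the fraction of predictions that match the gold labels.

    Assumed encoding (hypothetical): 1 = entailed, 0 = refuted.
    """
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if not labels:
        raise ValueError("cannot compute accuracy on an empty set")
    # Count positions where the predicted label equals the gold label.
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


# Toy data, invented for illustration only (not taken from TabFact).
gold = [1, 0, 1, 1, 0]
pred = [1, 0, 0, 1, 0]
print(accuracy(pred, gold))  # 4 of 5 correct -> 0.8
```

On this scale, a score of 0.921 corresponds to 92.1% of statements classified correctly.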