Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Error Detection on Spreadsheet-Tables
Loading...
45.8
F1 Score
Table-Specialist
5.552
16.001
26.45
36.899
Oct 16, 2024
F1 Score
Updated 25d ago
Evaluation Results
Method
Method
Links
F1 Score
Table-Specialist
Base Model=GPT-4, Trai...
2024.10
45.8
GPT-4
Base Model=GPT-4, Trai...
2024.10
40.3
Table-Specialist
Base Model=GPT-3.5, Tr...
2024.10
20.7
GPT-3.5
Base Model=GPT-3.5, Tr...
2024.10
13.6
Table-Specialist
Base Model=Llama3.1-8B...
2024.10
13.6
Llama3.1-8B
Base Model=Llama3.1-8B...
2024.10
7.1
Feedback
Search any
task
Search any
task