Meeting Claim Evaluation on city_council (test)
[Figure: Accuracy over time. Top result: gpt-5.4, Accuracy 76, Apr 23, 2026. Metrics shown: Accuracy, Completeness, Coverage.]
Updated 1mo ago
Evaluation Results
Method   Details                    Date     Accuracy  Completeness  Coverage
gpt-5.4  Meetings=34, Evaluator...  2026.04  76        83            83.4
gpt-4.1  Meetings=34, Evaluator...  2026.04  74.5      78.1          78.1