Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Meeting claim evaluation on private_data (test)
Loading...
74.5
Accuracy
gpt-5.4
72.836
73.268
73.7
74.132
Apr 23, 2026
Accuracy
Completeness
Coverage
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Completeness
Coverage
gpt-5.4
Meetings=22, Evaluator...
2026.04
74.5
86.5
92
gpt-4.1
Meetings=22, Evaluator...
2026.04
72.9
80.1
81.6
Feedback
Search any
task
Search any
task