Long-Form Question Answering on LFQA (GPT-4o Evaluation)
[Chart: GPT-4o Score; No watermark at 4.115, Apr 15, 2026]
Updated 3d ago
Evaluation Results
| Method | Method details | Links | GPT-4o Score |
| --- | --- | --- | --- |
| No watermark | Maximum generation len... | 2026.04 | 4.115 |
| QuantileMark | Maximum generation len... | 2026.04 | 4.109 |
| StealthInk | Maximum generation len... | 2026.04 | 4.047 |
| MPAC | Maximum generation len... | 2026.04 | 3.921 |