Shared (GQA, POPE, etc.)

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Uncertainty Estimation | Shared (GQA, POPE, etc.) (test) | ECE 0.001 | 4