Image Classification on unmet-promise (Split 1)
[Chart: Task Performance over time. Labeled point: Lens 7B, 57.3, Nov 6, 2025.]
Updated 12d ago
Evaluation Results
Method                         Date      Task Performance
Lens 7B (variant=debiased)     2025.11   57.3
MMD²                           2025.11   57.3
Test mean                      2025.11   57.2
MDM                            2025.11   57
Lens 7B (variant=biased)       2025.11   56.7
Lens 32B (variant=debiased)    2025.11   56.4
Mauve                          2025.11   56
PAD                            2025.11   55.9
Lens 32B (variant=biased)      2025.11   53.4