Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Image Classification on unmet-promise (Split 3)
Loading...
64.1
Task Performance
MMD²
51.724
54.937
58.15
61.363
Nov 6, 2025
Task Performance
Updated 12d ago
Evaluation Results
Method
Method
Links
Task Performance
MMD²
2025.11
64.1
Lens 32B
variant=debiased
2025.11
60.2
Lens 7B
variant=debiased
2025.11
59.1
Lens 7B
variant=biased
2025.11
58.7
Mauve
2025.11
58.4
PAD
2025.11
58.2
Test mean
2025.11
57.7
Lens 32B
variant=biased
2025.11
57
MDM
2025.11
52.2
Feedback
Search any
task
Search any
task