Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

P2

Benchmarks

Task NameDataset NameSOTA ResultTrend
Prompt OptimizationP2-hard
DSGScore92
7
Bias MitigationP2 Race unconditional 1.0
FD0.317
5
Bias MitigationP2 Age unconditional 1.0
FD0.401
5
Bias MitigationP2 Gender 1.0 (unconditional)
FD0.002
5
Investment decision alignmentP2 v1 (test)
Overall MSE6.12
4
Showing 5 of 5 rows