Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CALCONFLICTBENCH

Benchmarks

Task NameDataset NameSOTA ResultTrend
Conflict ResolutionCALCONFLICTBENCH 1.0 (test)
Avg Error Rate (N=1)0.3
13
Preference AdaptationCALCONFLICTBENCH (test)
AER0.12
4
Showing 2 of 2 rows