
Bloom

Benchmarks

Task Name                         Dataset Name                        SOTA Result              Trend
Open domain dialogue              Bloom ZS                            RSR 47.8                 9
Red Teaming against BB-3B         Bloom ZS                            RSR 4,120                9
Red Teaming                       Bloom ZS (filtered hard positive)   RSR 15.6                 7
Open-domain dialogue red teaming  Bloom ZS (filtered) (test)          RSR 16.3                 7
Language Identification           BLOOM                               Macro F1 95.76           5
Language Modeling                 BLOOM-1b7 (train)                   Training PPL 15.1        3
Attention Head Health Analysis    BLOOM-1b7 internal attention heads  Healthy Heads Count 379  3