Snorkel DryBell: A Case Study in Deploying Weak Supervision at Industrial Scale

About

Labeling training data is one of the most costly bottlenecks in developing machine learning-based applications. We present a first-of-its-kind study showing how existing knowledge resources from across an organization can be used as weak supervision in order to bring development time and cost down by an order of magnitude, and introduce Snorkel DryBell, a new weak supervision management system for this setting. Snorkel DryBell builds on the Snorkel framework, extending it in three critical aspects: flexible, template-based ingestion of diverse organizational knowledge, cross-feature production serving, and scalable, sampling-free execution. On three classification tasks at Google, we find that Snorkel DryBell creates classifiers of comparable quality to ones trained with tens of thousands of hand-labeled examples, converts non-servable organizational resources to servable models for an average 52% performance improvement, and executes over millions of data points in tens of minutes.

Stephen H. Bach, Daniel Rodriguez, Yintao Liu, Chong Luo, Haidong Shao, Cassandra Xia, Souvik Sen, Alexander Ratner, Braden Hancock, Houman Alborzi, Rahul Kuchhal, Christopher Ré, Rob Malkin • 2018

Related benchmarks

Task	Dataset	Result
Comment Classification	Civil Comments	Accuracy 73.9	30
Binary/Pairwise Classification	Summarize	Accuracy 70.5	9
Binary/Pairwise Classification	Chatbot Arena	Accuracy 54.3	9
Binary/Pairwise Classification	SHP	Accuracy 61.9	9
Binary/Pairwise Classification	PKU-BETTER	Accuracy 57.5	9
Binary/Pairwise Classification	PKU-SAFER	Accuracy 57	9
scoring	ASSET	MAE 29.073	5
scoring	FeedbackQA	MAE 0.793	5
scoring	Review-5K	MAE 2.593	5
scoring	Summarize	MAE 1.364	5

Showing 10 of 12 rows
