Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

A Sequential Algorithm for Training Text Classifiers

About

The ability to cheaply train text classifiers is critical to their use in information retrieval, content analysis, natural language processing, and other tasks involving data which is partly or fully textual. An algorithm for sequential sampling during machine learning of statistical classifiers was developed and tested on a newswire text categorization task. This method, which we call uncertainty sampling, reduced by as much as 500-fold the amount of training data that would have to be manually classified to achieve a given level of effectiveness.

David D. Lewis, William A. Gale• 1994

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-100 (test)
Accuracy30
3518
Image ClassificationCIFAR-10 (test)
Accuracy52.6
3381
Image ClassificationFashionMNIST (test)--
218
Image ClassificationF-MNIST (test)
Accuracy58.8
64
Image ClassificationImageNet-10 (test)
Accuracy95.1
42
Image ClassificationImageNet-50 (test)
Test Accuracy25.6
39
Image ClassificationImageNet 1k (test)
Final Accuracy66.5
12
Image ClassificationImageNet 10/50-class
Accuracy57.9
8
is unemployed classification (IU)English tweets (test)
AP52.7
4
Showing 9 of 9 rows

Other info

Follow for update