Snorkel is a system for programmatically building and managing training datasets without manual labeling. In Snorkel, users can develop large training datasets in hours or days rather than hand-labeling them over weeks or months.
T. Finin, W. Murnane, A. Karandikar, N. Keller, J. Martineau, и M. Dredze. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, стр. 80--88. Stroudsburg, PA, USA, Association for Computational Linguistics, (2010)