Dealing with Unreliable Labelers and Implications for Labeling Strategies
Chuck Lam
Department of Electrical Engineering
Stanford University
Machine learning and pattern recognition researchers have long studied
the effect of labeling noise. Most of those work have assumed that the
labeling process is fixed. However, in many situations, such as with
the Open Mind Initiative (www.openmind.org), the system architect can
specify a labeling strategy. For example, she can have the labelers
spend more of their effort cross-checking each others' work, thus
trading off the size of the dataset for quality. This talk will examine
such labeling strategy trade-offs from a couple perspectives. The first
is in labeling test data for classifier evaluation. The second
perspective is in the learning rate of an ideal domain with labeling
noise. In both cases we found the classifier's achievable error rate to
play a significant role in designing the labeling strategy.
Date: Thursday, October 9 |
Time: 4:15-5:30PM |
Place: Ventura 17 |
Return to the seminar schedule