Abstract

 
 

References (15)



 


 



Variance-Based Active Learning


Maytal Saar-Tsechansky


New York University (NYU)

Foster Provost


New York University

2000

NYU Working Paper No. 2451/14167

Abstract:     
For many supervised learning tasks, the cost of acquiringtraining data is dominated by the cost of class labeling.In this work, we explore active learning forclass probability estimation (CPE). Active learning acquiresdata incrementally, using the model learned sofar to help identify especially useful additional data forlabeling. We present a new method for active learning,BootstrapLV, which chooses new data based onthe variance in probability estimates from bootstrapsamples. We then show empirically that the methodreduces the number of data items that must be labeled,across a wide variety of data sets. We also compareBootstrap-LV with Uncertainty Sampling, an existingactive-learning method for maximizing classificationaccuracy, and show not only that BootstrapLV dominatesfor CPE but also that it is quite competitive evenfor accuracy maximization.

Number of Pages in PDF File: 7

working papers series


Download This Paper

Date posted: October 13, 2008  

Suggested Citation

Saar-Tsechansky, Maytal and Provost, Foster, Variance-Based Active Learning (2000). Information Systems Working Papers Series, Vol. , pp. -, 2000. Available at SSRN: http://ssrn.com/abstract=1283009

Contact Information

Maytal Saar-Tsechansky
New York University (NYU) ( email )
Bobst Library, E-resource Acquisitions
20 Cooper Square 3rd Floor
New York, NY 10003-711
United States
Foster Provost
New York University ( email )
44 West Fourth Street
New York, NY 10012
United States
Feedback to SSRN (Beta)


Paper statistics
Abstract Views: 168
Downloads: 19
References:  15

© 2013 Social Science Electronic Publishing, Inc. All Rights Reserved.  FAQ   Terms of Use   Privacy Policy   Copyright
This page was processed by apollo3 in 0.421 seconds