|
||||
|
||||
Ascent EM for Efficient Curve-Clustering in Large Online Auction Databases
Wolfgang Jank University of Maryland - Decision and Information Technologies Department November 30, 2004 Robert H. Smith School Research Paper No. RHS-06-008 Abstract: In this paper we propose a sampling-based implementation of the EM algorithm for modelbased clustering. By sampling-based we mean that the algorithm uses only a small sample from the entire database in every iteration. Using only a small sample allows for significant computational improvements. In contrast to previous sampling-based versions, we suggest to select the sample randomly since a random selection allows for statistical evaluation of the algorithm's progress. By appealing to EM's famous likelihood ascent property, the algorithm chooses samples as small as possible, thus ensuring computational efficiency, at the same time the samples are large enough to advance the progress of the method. The algorithm is stochastic in nature and has the potential of overcoming local traps and suboptimal solutions. We apply the algorithm to the problem of clustering infinite-dimensional curves and illustrate it on a large database of online auctions.
Keywords: stochastic optimization, monte carlo, em algorithm, clustering, functional data, electronic commerce, online auction, eBay Working Paper SeriesDate posted: May 18, 2006 ; Last revised: May 18, 2006Suggested CitationContact Information
|
|
||||||||||||
© 2010 Social Science Electronic Publishing, Inc. All Rights Reserved.
FAQ
Terms of Use
Privacy Policy
Copyright
This page was served by apollob 1 in 0.312 seconds.