dc.contributor.author | Estivill-Castro, V | |
dc.contributor.author | Yang, J | |
dc.contributor.editor | Heikki Mannila | |
dc.date.accessioned | 2017-05-03T14:15:42Z | |
dc.date.available | 2017-05-03T14:15:42Z | |
dc.date.issued | 2004 | |
dc.date.modified | 2010-08-09T07:17:25Z | |
dc.identifier.issn | 1384-5810 | |
dc.identifier.doi | 10.1023/B:DAMI.0000015869.08323.b3 | |
dc.identifier.uri | http://hdl.handle.net/10072/5182 | |
dc.description.abstract | General-purpose and highly applicable clustering methods are usually required during the early stages of knowledge discovery exercises. k-MEANS has been adopted as the prototype of iterative model-based clustering because of its speed, simplicity and capability to work within the format of very large databases. However, k-MEANS has several disadvantages derived from its statistical simplicity. We propose an algorithm that remains very efficient, generally applicable, and multidimensional, but is more robust to noise and outliers. We achieve this by using medians rather than means as estimators for the centers of clusters. Comparison with k-MEANS, EXPECTATION and MAXIMIZATION sampling demonstrates the advantages of our algorithm. | |
dc.description.peerreviewed | Yes | |
dc.description.publicationstatus | Yes | |
dc.format.extent | 905036 bytes | |
dc.format.extent | 61987 bytes | |
dc.format.mimetype | application/pdf | |
dc.format.mimetype | text/plain | |
dc.language | English | |
dc.language.iso | eng | |
dc.publisher | Springer | |
dc.publisher.place | DORDRECHT, NETHERLAN | |
dc.relation.ispartofpagefrom | 127 | |
dc.relation.ispartofpageto | 150 | |
dc.relation.ispartofissue | 2 | |
dc.relation.ispartofjournal | Data Mining and Knowledge Discovery | |
dc.relation.ispartofvolume | 8 | |
dc.subject.fieldofresearch | Data management and data science | |
dc.subject.fieldofresearch | Information systems | |
dc.subject.fieldofresearchcode | 4605 | |
dc.subject.fieldofresearchcode | 4609 | |
dc.title | Fast and Robust General Purpose Clustering Algorithms | |
dc.type | Journal article | |
dc.type.description | C1 - Articles | |
dc.type.code | C - Journal Articles | |
gro.faculty | Griffith Sciences, School of Information and Communication Technology | |
gro.rights.copyright | © Springer 2004. This is the author-manuscript version of this paper. The original publication is available at www.springerlink.com | |
gro.date.issued | 2004 | |
gro.hasfulltext | Full Text | |
gro.griffith.author | Estivill-Castro, Vladimir | |