A Clustering Algorithm for Data Streams
Joint work with the research group of Prof. Dr. Christian Sohler.
Project partially funded by DFG priority program 1307.
Publications
- Marcel R. Ackermann, Christiane Lammersen, Marcus Märtens, Christoph Raupach, Christian Sohler, Kamil Swierkot
StreamKM++: A Clustering Algorithm for Data Streams
In: Proceedings of the 12th Workshop on Algorithm Engineering and Experiments (ALENEX '10), 2010, [DOI]
Source code
- algorithm StreamKM++: [Download] (15 kB)
- algorithm LocalSearch: http://infolab.stanford.edu/~loc/
- algorithm BIRCH: http://pages.cs.wisc.edu/~vganti/birchcode/
- testing environment: [Download] (1.1 MB)
Datasets
- Tower: http://homepages.uni-paderborn.de/frahling/coremeans.html
- Census 1990: http://archive.ics.uci.edu/ml/datasets/US+Census+Data+%281990%29
- Covertype: http://archive.ics.uci.edu/ml/datasets/Covertype
- BigCross: [Download] (223 MB)
Legal Mumbo-Jumbo
Source code and data files provided on this webpage come without any warranty. Use at your own risk!
The source code provided on this webpage is the intellectual property of its authors. You have permission to download and use this source code for inspection and for private and academic use. You are not allowed to redistribute this source code without our permission. If you want to use the source code in any way other than those granted above, please contact Christian Sohler.