Do you know of any python mapreduce ready clustering libraries?
I have found some good libraries in Java (http://lucene.apache.org/mahout/), I'd prefer to use python though.
http://wiki.github.com/klbostee/dumbo/ (Python mapreduce API )
Edit --- I'm looking for mapreduce ready : Canopy, K-means, Means-shift,etc..