Stars
The "Python Machine Learning (2nd edition)" book code repository and info resource
A Distributed Associative Classifier for Apache Spark, mirror of
📜 Understanding Probabilistic Topic Models with Simulation in Python
Simple Spark Streaming example applications, with fake streaming data source
Java library for parsing various datasets: ENRON email dataset, Wikipedia web pages, DBLP papers, Reuters news ...
Spark algorithms for building k-nn graphs
Zoe: Container Analytics as a Service -- mirror of https://gitlab.eurecom.fr/zoe/main/
Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...


