This directory contains helpful shell scripts for working with some of Mahout's examples. Here's a description of what each does: classify-20newsgroups.sh -- Run SGD and Bayes classifiers over the classic 20 News Groups. Downloads the data set automatically. cluster-reuters.sh -- Cluster the Reuters data set using a variety of algorithms. Downloads the data set automatically. cluster-syntheticcontrol.sh -- Cluster the Synthetic Control data set. Downloads the data set automatically. factorize-movielens-1m.sh -- Run the Alternating Least Squares Recommender on the Grouplens data set (size 1M). factorize-netflix.sh -- Run the ALS Recommender on the Netflix data set run-rf.sh -- Create some synthetic data, build a random forest, and test performance.