Table of Contents • ← Chapter 1.01 - Help • Chapter 2 - Classification →
Machine Learning is well supported on all major operating systems, thanks to analytical (...) such Anaconda. Anaconda is an open data science platform that pre-packages many of the required components for data science (and everything for this book!) and can be installed on Windows, OS X and Linux.
In fact, this entire book is written using components from Anaconda, including Python and R and their machine learning packages, as well as the super helpful Jupyter Notebook which allows for structured and repeatable machine learning.
To install Anaconda, follow the instructions for your operating system below:
conda install -c anaconda scikit-learn=0.18.1
pip install keras==2.0.0
sudo gem update --system
brew install apache-spark
export SPARK_HOME="/usr/local/Cellar/apache-spark/2.1.0/libexec/"
export PATH=$PATH:$SPARK_HOME/bin
export PYSPARK_SUBMIT_ARGS="--master local[2]"
export PYTHONPATH=$SPARK_HOME/python/:$PYTHONPATH
export PYTHONPATH=$SPARK_HOME/python/lib/py4j-0.9-src.zip:$PYTHONPATH
export PYSPARK_DRIVER_PYTHON=jupyter
export PYSPARK_DRIVER_PYTHON_OPTS='notebook'
pyspark
If you've downloaded a copy of this book as Jupyter notebooks, you can run Jupyter jupyter notebook
. This will open a new browser window with the Jupyter file browser, and you can select the directory containing your notebooks from there.
Table of Contents • ← Chapter 1.01 - Help • Chapter 2 - Classification →