.. _advanced-installation:
===================================
Advanced installation instructions
===================================
There are different ways to get scikit-learn installed:
* Install the version of scikit-learn provided by your
:ref:`operating system or Python distribution <install_by_distribution>`.
This is the quickest option for those who have operating systems that
distribute scikit-learn.
* :ref:`Install an official release <install_official_release>`. This
is the best approach for users who want a stable version number
and aren't concerned about running a slightly older version of
scikit-learn.
* :ref:`Install the latest development version
<install_bleeding_edge>`. This is best for users who want the
latest-and-greatest features and aren't afraid of running
brand-new code.
.. note::
If you wish to contribute to the project, you need to
:ref:`install the latest development version<install_bleeding_edge>`.
.. _install_official_release:
Installing an official release
==============================
Scikit-learn requires:
- Python (>= 2.6 or >= 3.3),
- NumPy (>= 1.6.1),
- SciPy (>= 0.9).
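An existing environment can be checked against these minimums with a short script. The following is a minimal sketch rather than an official tool: the helper names ``parse_version``, ``python_is_supported`` and ``check_requirements`` are hypothetical, and the version parsing only handles simple dotted numeric versions.

```python
import importlib
import re
import sys

# Minimum versions taken from the requirement list above.
MINIMUMS = {"numpy": (1, 6, 1), "scipy": (0, 9)}


def parse_version(text):
    """Turn a string such as '1.6.1' or '0.19.0rc1' into a tuple of ints.

    Parsing stops at the first component that does not start with a digit.
    """
    parts = []
    for piece in text.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)


def python_is_supported(version_info=sys.version_info):
    """Python must be 2.6 or later on the 2.x line, or 3.3 or later."""
    major, minor = version_info[0], version_info[1]
    return (major == 2 and minor >= 6) or (major, minor) >= (3, 3)


def check_requirements(minimums=MINIMUMS):
    """Return a dict mapping each dependency name to a short status string."""
    report = {}
    for name, minimum in minimums.items():
        try:
            module = importlib.import_module(name)
        except ImportError:
            report[name] = "not installed"
            continue
        installed = parse_version(getattr(module, "__version__", "0"))
        report[name] = "ok" if installed >= minimum else "too old"
    return report


if __name__ == "__main__":
    print("Python supported:", python_is_supported())
    for name, status in sorted(check_requirements().items()):
        print(name, status)
```

Versions are compared as integer tuples, so ``1.10`` correctly sorts after ``1.6.1``, which a plain string comparison would get wrong.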
Mac OSX
-------
Scikit-learn and its dependencies are all available as wheel packages for OSX::
pip install -U numpy scipy scikit-learn
Linux
-----
At this time scikit-learn does not provide official binary packages for Linux,
so you have to build from source if you want the latest version.
If you don't need the newest version, installing scikit-learn with your
package manager is usually the easiest option.
Installing build dependencies
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Installing from source requires you to have installed the scikit-learn runtime
dependencies, Python development headers and a working C/C++ compiler.
Under Debian-based operating systems, which include Ubuntu, if you have
Python 2 you can install all these requirements by issuing::
sudo apt-get install build-essential python-dev python-setuptools \
python-numpy python-scipy \
libatlas-dev libatlas3gf-base
If you have Python 3::
sudo apt-get install build-essential python3-dev python3-setuptools \
python3-numpy python3-scipy \
libatlas-dev libatlas3gf-base
On recent Debian and Ubuntu (e.g. Ubuntu 13.04 or later) make sure that ATLAS
is used to provide the implementation of the BLAS and LAPACK linear algebra
routines::
sudo update-alternatives --set libblas.so.3 \
/usr/lib/atlas-base/atlas/libblas.so.3
sudo update-alternatives --set liblapack.so.3 \
/usr/lib/atlas-base/atlas/liblapack.so.3
.. note::
In order to build the documentation and run the example code contained in
this documentation you will need matplotlib::
sudo apt-get install python-matplotlib
.. note::
The above installs the ATLAS implementation of BLAS
(the Basic Linear Algebra Subprograms library).
Ubuntu 11.10 and later, and recent (testing) versions of Debian,
offer an alternative implementation called OpenBLAS.
Using OpenBLAS can give speedups in some scikit-learn modules,
but can freeze joblib/multiprocessing prior to OpenBLAS version 0.2.8-4,
so using it is not recommended unless you know what you're doing.
If you do want to use OpenBLAS, then replacing ATLAS only requires a couple
of commands. ATLAS has to be removed, otherwise NumPy may not work::
sudo apt-get remove libatlas3gf-base libatlas-dev
sudo apt-get install libopenblas-dev
sudo update-alternatives --set libblas.so.3 \
/usr/lib/openblas-base/libopenblas.so.0
sudo update-alternatives --set liblapack.so.3 \
/usr/lib/lapack/liblapack.so.3
On Red Hat and clones (e.g. CentOS), install the dependencies using::
sudo yum -y install gcc gcc-c++ numpy python-devel scipy
Building scikit-learn with pip
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
This is usually the fastest way to install or upgrade to the latest stable
release::
pip install --user --install-option="--prefix=" -U scikit-learn
The ``--user`` flag asks pip to install scikit-learn in the ``$HOME/.local``
folder, so root permission is not required. This flag should make pip
ignore any old version of scikit-learn previously installed on the system while
benefiting from system packages for numpy and scipy. Building those
dependencies correctly from source can be slow and complex.
The ``--install-option="--prefix="`` flag is only required if Python has a
``distutils.cfg`` configuration with a predefined ``prefix=`` entry.
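To see where a ``--user`` install would place packages for the current interpreter, the standard library ``site`` module can be queried. The sketch below only reports paths and installs nothing; the function name ``user_install_locations`` is hypothetical. On Linux the user base is typically ``$HOME/.local``, but the exact location depends on the platform and on the ``PYTHONUSERBASE`` environment variable.

```python
import site
import sys


def user_install_locations():
    """Return the user base directory and the user site-packages directory."""
    return {
        "user_base": site.getuserbase(),
        "user_site": site.getusersitepackages(),
    }


if __name__ == "__main__":
    locations = user_install_locations()
    print("User base:", locations["user_base"])
    print("User site-packages:", locations["user_site"])
    # Packages installed with --user are only importable when the user
    # site directory is on sys.path.
    print("User site on sys.path:", locations["user_site"] in sys.path)
```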
From source package
~~~~~~~~~~~~~~~~~~~
Download the source package from
`PyPI <https://pypi.python.org/pypi/scikit-learn>`_, unpack the sources and
cd into the source directory.
This package uses distutils, which is the default way of installing
Python modules. The install command is::
python setup.py install
or alternatively (also from within the scikit-learn source folder)::
pip install .
.. warning::
Packages installed with the ``python setup.py install`` command cannot
be uninstalled or upgraded by ``pip`` later. To properly uninstall
scikit-learn in that case, it is necessary to delete the ``sklearn`` folder
from your Python ``site-packages`` directory.
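Before deleting anything by hand, it helps to confirm which folder the interpreter would actually import from. The following is a minimal sketch assuming Python 3.4 or later (for ``importlib.util.find_spec``); the helper name ``locate_package`` is hypothetical.

```python
import importlib.util
import os


def locate_package(name):
    """Return the directory an importable package would load from, or None."""
    spec = importlib.util.find_spec(name)
    if spec is None or spec.origin is None:
        return None
    return os.path.dirname(spec.origin)


if __name__ == "__main__":
    folder = locate_package("sklearn")
    if folder is None:
        print("sklearn is not importable from this interpreter")
    else:
        # Inspect this path before removing anything manually.
        print("sklearn would be imported from:", folder)
```

``find_spec`` locates the package without importing it, so the check works even when the installed copy is broken.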
Windows
-------
First, you need to install `numpy <http://www.numpy.org/>`_ and `scipy
<http://www.scipy.org/>`_ from their own official installers.
Wheel packages (.whl files) for scikit-learn from `PyPI
<https://pypi.python.org/pypi/scikit-learn/>`_ can be installed with the `pip
<https://pip.readthedocs.io/en/stable/installing/>`_ utility.
Open a console and type the following to install or upgrade scikit-learn to the
latest stable release::
pip install -U scikit-learn
If there are no binary packages matching your Python version, you might
try to install scikit-learn and its dependencies from `Christoph Gohlke's
unofficial Windows installers
<http://www.lfd.uci.edu/~gohlke/pythonlibs/#scikit-learn>`_
or from a :ref:`Python distribution <install_by_distribution>` instead.
.. _install_by_distribution:
Third party distributions of scikit-learn
=========================================
Some third-party distributions are now providing versions of
scikit-learn integrated with their package-management systems.
These can make installation and upgrading much easier for users since
the integration includes the ability to automatically install
dependencies (numpy, scipy) that scikit-learn requires.
The following is an incomplete list of Python and OS distributions
that provide their own version of scikit-learn.
MacPorts for Mac OSX
--------------------
The MacPorts package is named ``py<XY>-scikit-learn``,
where ``XY`` denotes the Python version.
It can be installed by typing the following
command::
sudo port install py26-scikit-learn
or::
sudo port install py27-scikit-learn
Arch Linux
----------
Arch Linux's package is provided through the `official repositories
<https://www.archlinux.org/packages/?q=scikit-learn>`_ as
``python-scikit-learn`` for Python 3 and ``python2-scikit-learn`` for Python 2.
It can be installed by typing the following command:
.. code-block:: none
# pacman -S python-scikit-learn
or:
.. code-block:: none
# pacman -S python2-scikit-learn
depending on the version of Python you use.
NetBSD
------
scikit-learn is available via `pkgsrc-wip <http://pkgsrc-wip.sourceforge.net/>`_:
http://pkgsrc.se/wip/py-scikit_learn
Fedora
------
The Fedora package is called ``python-scikit-learn`` for the Python 2 version
and ``python3-scikit-learn`` for the Python 3 version. Both versions can
be installed using ``yum``::
$ sudo yum install python-scikit-learn
or::
$ sudo yum install python3-scikit-learn
Building on Windows
-------------------
To build scikit-learn on Windows you need a working C/C++ compiler in
addition to numpy, scipy and setuptools.
Picking the right compiler depends on the version of Python (2 or 3)
and the architecture of the Python interpreter, 32-bit or 64-bit.
You can check the Python version by running the following in a ``cmd`` or
``powershell`` console::
python --version
and the architecture with::
python -c "import struct; print(struct.calcsize('P') * 8)"
The above commands assume that you have the Python installation folder in your
PATH environment variable.
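Both checks can also be combined in one small script. This is an illustrative sketch using only the standard library; the function name ``interpreter_summary`` is hypothetical.

```python
import platform
import struct
import sys


def interpreter_summary():
    """Return the Python version, major version and pointer width in bits."""
    return {
        "version": platform.python_version(),
        "major": sys.version_info[0],
        # Size of a C pointer ('P') in bytes, times 8 bits per byte.
        "bits": struct.calcsize("P") * 8,
    }


if __name__ == "__main__":
    summary = interpreter_summary()
    print("Python %s (%d-bit)" % (summary["version"], summary["bits"]))
```

The pointer size reflects the interpreter build, not the operating system: a 32-bit Python running on 64-bit Windows still reports 32.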
32-bit Python
-------------
For 32-bit Python it is possible to use the standalone installers for
`Microsoft Visual C++ Express 2008 <http://download.microsoft.com/download/A/5/4/A54BADB6-9C3F-478D-8657-93B3FC9FE62D/vcsetup.exe>`_
for Python 2 or Microsoft Visual C++ Express 2010 for Python 3.
Once installed you should be able to build scikit-learn without any
particular configuration by running the following command in the scikit-learn
folder::
python setup.py install
64-bit Python
-------------
For the 64-bit architecture, you either need the full Visual Studio or
the free Windows SDKs that can be downloaded from the links below.
The Windows SDKs include the MSVC compilers both for 32 and 64-bit
architectures. They come as a ``GRMSDKX_EN_DVD.iso`` file that can be mounted
as a new drive with a ``setup.exe`` installer in it.
- For Python 2 you need SDK **v7.0**: `MS Windows SDK for Windows 7 and .NET
Framework 3.5 SP1
<https://www.microsoft.com/en-us/download/details.aspx?id=18950>`_
- For Python 3 you need SDK **v7.1**: `MS Windows SDK for Windows 7 and .NET
Framework 4
<https://www.microsoft.com/en-us/download/details.aspx?id=8442>`_
Both SDKs can be installed in parallel on the same host. To use the Windows
SDKs, you need to set up the environment of a ``cmd`` console launched with the
following flags (at least for SDK v7.0)::
cmd /E:ON /V:ON /K
Then configure the build environment with::
SET DISTUTILS_USE_SDK=1
SET MSSdk=1
"C:\Program Files\Microsoft SDKs\Windows\v7.0\Setup\WindowsSdkVer.exe" -q -version:v7.0
"C:\Program Files\Microsoft SDKs\Windows\v7.0\Bin\SetEnv.cmd" /x64 /release
Finally you can build scikit-learn in the same ``cmd`` console::
python setup.py install
Replace ``v7.0`` with ``v7.1`` in the above commands to do the same for
Python 3 instead of Python 2.
Replace ``/x64`` with ``/x86`` to build for 32-bit Python instead of 64-bit
Python.
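The compiler choices described for 32-bit and 64-bit Python can be summarized as a small lookup. The sketch below is only an illustration of that mapping: the names ``TOOLCHAINS`` and ``suggested_toolchain`` are hypothetical, and the table lists the free options named above, not the full Visual Studio alternative.

```python
import struct
import sys

# Toolchains named in the sections above, keyed by (Python major, bits).
TOOLCHAINS = {
    (2, 32): "Microsoft Visual C++ Express 2008",
    (3, 32): "Microsoft Visual C++ Express 2010",
    (2, 64): "Windows SDK v7.0 (SetEnv.cmd /x64)",
    (3, 64): "Windows SDK v7.1 (SetEnv.cmd /x64)",
}


def suggested_toolchain(major, bits):
    """Look up the toolchain for a Python major version and architecture."""
    key = (major, bits)
    if key not in TOOLCHAINS:
        raise ValueError("no toolchain listed for Python %d, %d-bit" % key)
    return TOOLCHAINS[key]


if __name__ == "__main__":
    major = sys.version_info[0]
    bits = struct.calcsize("P") * 8
    print("Suggested toolchain:", suggested_toolchain(major, bits))
```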
Building binary packages and installers
---------------------------------------
The ``.whl`` package and ``.exe`` installers can be built with::
pip install wheel
python setup.py bdist_wheel bdist_wininst -b doc/logos/scikit-learn-logo.bmp
The resulting packages are generated in the ``dist/`` folder.
Using an alternative compiler
-----------------------------
It is possible to use `MinGW <http://www.mingw.org>`_ (a port of GCC to Windows
OS) as an alternative to MSVC for 32-bit Python. Note that extensions built with
mingw32 cannot be redistributed as reusable packages, as they depend on GCC
runtime libraries typically not installed in end users' environments.
To force the use of a particular compiler, pass the ``--compiler`` flag to the
build step::
python setup.py build --compiler=my_compiler install
where ``my_compiler`` should be one of ``mingw32`` or ``msvc``.
.. _install_bleeding_edge:
Bleeding Edge
=============
See section :ref:`git_repo` on how to get the development version. Then follow
the previous instructions to build from source depending on your platform.
You will also require Cython >=0.23 in order to build the development version.
.. _testing:
Testing
=======
Testing scikit-learn once installed
-----------------------------------
Testing requires having the `nose
<https://somethingaboutorange.com/mrl/projects/nose/>`_ library. After
installation, the package can be tested by executing *from outside* the
source directory::
$ nosetests -v sklearn
Under Windows, it is recommended to use the following command (adjust the path
to the ``python.exe`` program), as using the ``nosetests.exe`` program can
interact badly with tests that use ``multiprocessing``::
C:\Python34\python.exe -c "import nose; nose.main()" -v sklearn
This should give you a lot of output (and some warnings) but
eventually should finish with a message similar to::
Ran 3246 tests in 260.618s
OK (SKIP=20)
Otherwise, please consider opening an issue on the `bug tracker
<https://github.com/scikit-learn/scikit-learn/issues>`_ or posting to the
:ref:`mailing_lists`, including the traceback of the individual failures
and errors. Please include your operating system, your version of NumPy, SciPy
and scikit-learn, and how you installed scikit-learn.
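The environment details requested above can be gathered with a few lines of Python. This is a minimal sketch, not an official reporting tool: the function name ``collect_environment`` is hypothetical, and it assumes each package exposes a ``__version__`` attribute.

```python
import importlib
import platform
import sys


def collect_environment(modules=("numpy", "scipy", "sklearn")):
    """Gather OS, Python and package versions as a list of text lines."""
    lines = [
        "Platform: " + platform.platform(),
        "Python: " + sys.version.replace("\n", " "),
    ]
    for name in modules:
        try:
            module = importlib.import_module(name)
            version = getattr(module, "__version__", "unknown")
        except ImportError:
            version = "not installed"
        lines.append("%s: %s" % (name, version))
    return lines


if __name__ == "__main__":
    # Paste this output into the bug report, along with the traceback.
    print("\n".join(collect_environment()))
```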
Testing scikit-learn from within the source folder
--------------------------------------------------
Scikit-learn can also be tested without having the package
installed. For this you must compile the sources inplace from the
source directory::
python setup.py build_ext --inplace
Tests can now be run using nosetests::
nosetests -v sklearn/
This is automated by the commands::
make in
and::
make test
You can also install a symlink named ``site-packages/scikit-learn.egg-link``
to the development folder of scikit-learn with::
pip install --editable .