File: dev_start_guide.txt

package info (click to toggle)
theano 1.0.3%2Bdfsg-1
  • links: PTS, VCS
  • area: main
  • in suites: buster
  • size: 30,752 kB
  • sloc: python: 141,182; ansic: 9,505; makefile: 259; sh: 214; pascal: 81
file content (604 lines) | stat: -rw-r--r-- 20,566 bytes parent folder | download | duplicates (2)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
.. _dev_start_guide:

=====================
Developer Start Guide
=====================

Contributing
============

You want to contribute to Theano? That is great! This page explain our
workflow and some resource for doing so.

Looking for an idea for a first contribution? Check the `github issues
<https://github.com/Theano/Theano/issues?q=is%3Aopen+is%3Aissue+label%3A%22Easy+fix%22>`_
with a label ``easy fix``. They are good starter. It is recommended
that you write on the issue you want to work on it. This help make
sure it is up to date and see if nobody else is working on it. Also,
we can sometimes provides more information about it.  There is also
the label `NeedSomeoneToFinish
<https://github.com/Theano/Theano/labels/NeedSomeoneToFinish>`_ that is
interesting to check. The difficulty level is variable.

Resources
=========

See :ref:`theano_community` for a list of Theano resources. The
following groups/mailing-lists are especially useful to Theano
contributors: `theano-dev`_, `theano-buildbot`_, and `theano-github`_.

.. _theano-dev: https://groups.google.com/group/theano-dev
.. _theano-github: https://groups.google.com/group/theano-github
.. _theano-buildbot: https://groups.google.com/group/theano-buildbot


To get up to speed, you'll need to

- Learn some non-basic Python to understand what's going on in some of the
  trickier files (like tensor.py).
- Go through the `NumPy documentation`_.
- Learn to write reStructuredText_ for Sphinx_.
- Learn about how unittest_ and nose_ work

.. _Sphinx: http://sphinx.pocoo.org/
.. _reStructuredText: http://docutils.sourceforge.net/rst.html
.. _Allowed docstring sections in Napoleon: https://sphinxcontrib-napoleon.readthedocs.org/en/latest/#docstring-sections
.. _NumPy documentation: http://docs.scipy.org/numpy/
.. _unittest: http://docs.python.org/library/unittest.html
.. _nose: http://nose.readthedocs.org/en/latest/


.. _quality_contributions:

Requirements for Quality Contributions
======================================

* All the code should be properly tested. 

* The code should be compatible with Python 2.7 and above, as well as
  Python 3.4 and above (using `six` if needed).

* All the code should respect the 
  `PEP8 Code Style Guide <http://www.python.org/dev/peps/pep-0008>`_.

* The docstrings of all the classes and functions should respect the
  `PEP257 <https://www.python.org/dev/peps/pep-0257/>`_ rules and follow the 
  `Numpy docstring standard
  <https://github.com/numpy/numpy/blob/master/doc/HOWTO_DOCUMENT.rst.txt>`_.

Each point will be referred to more in detail in the following.

Unit tests
----------

When you submit a pull request, your changes will automatically be
tested via Travis-CI. This will post the results of the tests with a
little icon next to your commit. A yellow circle means the tests are
running.  A red X means the tests failed and a green circle means the
tests passed.

Just because the tests run automatically does not mean you shouldn't
run them yourself to make sure everything is all right.  You can run
only the portion you are modifying to go faster and have Travis to
make sure there are no global impacts.

Also, if you are changing GPU code, Travis doesn't test that, because
there are no GPUs on the test nodes.

To run the test suite with the default options, see 
:ref:`test_theano`.

Each night we execute all the unit tests automatically, with several
sets of options. The result is sent by email to the `theano-buildbot`_
mailing list.

For more detail, see :ref:`metadocumentation_nightly_build`.

To run all the tests with the same configuration as the buildbot, run
this script:

.. code-block:: bash

   theano/misc/do_nightly_build

This script accepts arguments that it forwards to nosetests. You can
run only some tests or enable pdb by giving the equivalent nosetests
parameters.

Setting up your Editor for PEP8
-------------------------------

Here are instructions for :ref:`Vim <vim_pep8>` and :ref:`Emacs
<emacs_pep8>`. If you have similar instructions for other text editors
or IDE, please let us know and we will update this documentation.

.. _vim_pep8:

Vim
~~~

Detection of warnings and errors is done by the `pep8`_ script
(or `flake8`_, that also checks for other things, like syntax
errors). Syntax highlighting and general integration into Vim is done by
the `Syntastic`_ plugin for Vim.

To setup VIM:

#. Install flake8 (if not already installed) with::

    pip install "flake8<3"

   .. warning:: Starting version 3.0.0, flake8 changed its dependencies and
      moved its Python API to a legacy module, breaking Theano's flake8 tests.
      We recommend using a version prior to 3.  

   .. note:: You can use ``easy_install`` instead of ``pip``, and ``pep8``
      instead of ``flake8`` if you prefer. The important thing is that the
      ``flake8`` or ``pep8`` executable ends up in your ``$PATH``.

#. Install vundle with::

    git clone https://github.com/VundleVim/Vundle.vim.git ~/.vim/bundle/Vundle.vim

#. Edit ``~/.vimrc`` and add the lines:

    .. code-block:: python

        set nocompatible              " be iMproved, required
        filetype off                  " required
        " set the runtime path to include Vundle and initialize
        set rtp+=~/.vim/bundle/Vundle.vim
        call vundle#begin()

        Plugin 'gmarik/Vundle.vim' " let Vundle manage Vundle (required!)
        Plugin 'scrooloose/syntastic'
        Plugin 'jimf/vim-pep8-text-width'

        call vundle#end()

        " Syntastic settings
        " You can run checkers explicitly by calling :SyntasticCheck <checker
        let g:syntastic_python_checkers = ['flake8'] "use one of the following checkers:
                                                     " flake8, pyflakes, pylint, python (native checker)
        let g:syntastic_enable_highlighting = 1  "highlight errors and warnings
        let g:syntastic_style_error_symbol = ">>" "error symbol
        let g:syntastic_warning_symbol = ">>" "warning symbol
        let g:syntastic_check_on_open = 1
        let g:syntastic_auto_jump = 0  "do not jump to errors when detected

#. Open a new vim and run ``:PluginInstall`` to automatically install the
   plugins. When the installation is done, close the installation "window" 
   with ``:q``. 
   From now on Vim will check for PEP8 errors and highlight them whenever a 
   file is saved.

A few useful commands
"""""""""""""""""""""

* Open the list of errors: ``:lopen``, that can be abbreviated in ``:lop``
  (denoted ``:lop[en]``).
* Close that list: ``:lcl[ose]``.
* Next error: ``:lne[xt]``.
* Previous error: ``:lp[revious]``.

Once you fix errors, messages and highlighting will still appear in the
fixed file until you save it again.

We can also configure the ``~/.vimrc`` to make it easier to work with Syntastic.
For instance, to add a summary in the status bar, you can add::

    set statusline+=%{SyntasticStatuslineFlag()}

To bind F2 and F3 to navigate to previous and next error, you can add::

    map <F2> :lprevious<CR>
    map <F3> :lnext<CR>

You can prefix those by ``autocmd FileType python`` if you want these
bindings to work only on Python files.

.. _pep8: http://pypi.python.org/pypi/pep8
.. _flake8: http://pypi.python.org/pypi/flake8
.. _Syntastic: https://github.com/scrooloose/syntastic/
.. _pathogen.vim: https://github.com/tpope/vim-pathogen
.. _quickfix: http://vimdoc.sourceforge.net/htmldoc/quickfix.html#quickfix

.. _emacs_pep8:

Emacs
~~~~~

There is an **excellent** system to configure emacs for Python:
`emacs-for-python
<https://github.com/gabrielelanaro/emacs-for-python>`_. It gathers many
emacs config into one, and modifies them to behave together nicely. You
can use it to check for pep8 compliance and for Python syntax errors.

To install it on Linux, you can do like this:

.. code-block:: bash

   cd
   git clone https://github.com/gabrielelanaro/emacs-for-python.git ~/.emacs.d/emacs-for-python


Then in your ``~/.emacs`` file, add this:

.. code-block:: common-lisp

   ;; Mandatory
   (load-file "~/.emacs.d/emacs-for-python/epy-init.el")
   (add-to-list 'load-path "~/.emacs.d/emacs-for-python/") ;; tell where to load the various files

   ;; Each of them enables different parts of the system.
   ;; Only the first two are needed for pep8, syntax check.
   (require 'epy-setup) ;; It will setup other loads, it is required!
   (require 'epy-python) ;; If you want the python facilities [optional]
   (require 'epy-completion) ;; If you want the autocompletion settings [optional]
   (require 'epy-editing) ;; For configurations related to editing [optional]
   ;; [newer version of emacs-for-python]
   (require 'epy-nose) ;; For shortcut to call nosetests [optional]

   ;; Define f10 to previous error
   ;; Define f11 to next error
   (require 'epy-bindings) ;; For my suggested keybindings [optional]

   ;; Some shortcut that do not collide with gnome-terminal,
   ;; otherwise, "epy-bindings" define f10 and f11 for them.
   (global-set-key [f2] 'flymake-goto-prev-error)
   (global-set-key [f3] 'flymake-goto-next-error)

   ;; Next two lines are the checks to do. You can add more if you wish.
   (epy-setup-checker "pyflakes %f") ;; For python syntax check
   (epy-setup-checker "pep8 -r %f") ;; For pep8 check

.. note::

   The script highlights problematic lines. This can make part of the
   line not readable depending on the background. To replace the line
   highlight by an underline, add this to your emacs configuration
   file:

   ;; Make lines readable when there is an warning [optional]
   (custom-set-faces
   '(flymake-errline ((((class color)) (:underline "red"))))
   '(flymake-warnline ((((class color)) (:underline "yellow")))))

Documentation and docstrings
----------------------------
* The documentation and the API documentation are generated using `Sphinx`_.

* The documentation should be written in `reStructuredText`_ and the
  docstrings of all the classes and functions should respect the
  `PEP257 <https://www.python.org/dev/peps/pep-0257/>`_ rules and follow the 
  `Numpy docstring standard
  <https://github.com/numpy/numpy/blob/master/doc/HOWTO_DOCUMENT.rst.txt>`_.

* Split the docstrings in sections, according to the `Allowed docstring
  sections in Napoleon`_

* To cross-reference other objects (e.g. reference other classes or methods) in 
  the docstrings, use the
  `cross-referencing objects <http://www.sphinx-doc.org/en/stable/domains.html#cross-referencing-python-objects>`_ 
  syntax. ``:py`` can be omitted, see e.g. this 
  `stackoverflow answer <http://stackoverflow.com/a/7754189>`_.

* See :ref:`metadocumentation`, for some information on how to generate the 
  documentation.


A Docstring Example
~~~~~~~~~~~~~~~~~~~
Here is an example on how to add a docstring to a class.

.. testcode:: python

    import theano

    class DoubleOp(theano.Op):
        """
        Double each element of a tensor.

        Parameters
        ----------
        x : tensor
            Input tensor

        Returns
        -------
        tensor
            a tensor of the same shape and dtype as the input with all
            values doubled.

        Notes
        -----
        this is a test note

        See Also
        --------
        :class:`~theano.tensor.elemwise.Elemwise` : You can use this to replace
        this example.  Just execute `x * 2` with x being a Theano variable.


        .. versionadded:: 0.6
        """

This is how it will show up for files that we auto-list in the library
documentation:

.. automodule:: theano.misc.doubleop
    :members:

Installation and configuration
==============================

To obtain developer access: register with `GitHub
<http://www.github.com/>`_ and create a fork of `Theano
<http://www.github.com/Theano/Theano>`_.

This will create your own Theano project on GitHub, referred later
as "YourProfile/Theano", or "origin", from which you will be able to
contribute to the original Theano/Theano, also called "central".


Create a local copy
-------------------

Clone your fork locally with

.. code-block:: bash

    git clone git@github.com:YOUR_GITHUB_LOGIN/Theano.git

For this URL to work, you must set your public ssh keys inside your
`github account setting <https://github.com/settings/ssh>`_.

From your local repository, your own fork on GitHub will be called "origin".

Then, add a reference to the original ("central") Theano repository with

.. code-block:: bash

    git remote add central git://github.com/Theano/Theano.git

You can choose another name than "central" to reference Theano/Theano
(for instance, NumPy uses "upstream"), but this documentation will stick
to "central."

You can then test your installation of Theano by following the steps of
:ref:`test_theano`.


Using your local copy
---------------------

To update your library to the latest revision, you should have a local branch
that tracks central/master. You can add one (named "trunk" here) with:

.. code-block:: bash

    git fetch central
    git branch trunk central/master

Once you have such a branch, in order to update it, do:

.. code-block:: bash

    git checkout trunk
    git pull

Keep in mind that this branch should be "read-only": if you want to
patch Theano, you should work in another branch, like described in the
:ref:`dev_workflow` section below.


Configure Git
-------------

On your local machine, you need to configure git with basic information:

.. code-block:: bash

   git config --global user.email you@yourdomain.example.com
   git config --global user.name "Your Name Comes Here"


You can also instruct git to use color in diff. For this, you need to
add those lines in the file ~/.gitconfig

.. code-block:: cfg

    [color]
       branch = auto
       diff = auto
       interactive = auto
       status = auto


.. _dev_workflow:

Development Workflow
====================

Start a new local branch
------------------------

When working on a new feature in your own fork, start from an up-to-date copy
of the `master` branch (the principal one) of the central repository
(Theano/Theano on GitHub):

.. code-block:: bash

    git fetch central
    git checkout -b my_shiny_feature central/master

.. note::

    This last line is a shortcut for:

    .. code-block:: bash

        git branch my_shiny_feature central/master
        git checkout my_shiny_feature


Submit your changes to the central repository
---------------------------------------------

Once your code is ready for others to review, you need to commit all the changes and then push your
branch to your github fork first:

.. code-block:: bash
    
    git commit -a -m "your message here"

.. code-block:: bash

    git push -u origin my_shiny_feature

Then, go to your fork's github page on the github website, select your
feature branch and hit the "Pull Request" button in the top right
corner.  This will signal the maintainers that you wish to submit your
changes for inclusion in central/master.
If you don't get any feedback, bug us on the theano-dev mailing list.


Address reviewer comments
-------------------------

Your pull request will be reviewed by members of the core development
team. If your branch is not directly accepted, the reviewers will use
GitHub's system to add "notes", either general (on the entire commit),
or "line notes", relative to a particular line of code.
In order to have the pull request accepted, you may have to answer
the reviewer's questions, you can do that on GitHub.

You may also have to edit your code to address their concerns. Some
of the usual requests include fixing typos in comments, adding or
correcting comments, adding unit tests in the test suite.  In order to
do that, you should continue your edits in the same branch you used (in
this example, "my_shiny_feature").  For instance, if you changed your
working branch, you should first:

.. code-block:: bash

    git checkout my_shiny_feature

Then, edit your code, and test it appropriately (see
:ref:`quality_contributions` below), and push it again to your GitHub
fork, like the first time (except the ``-u`` option is only needed the
first time):

.. code-block:: bash

    git push origin my_shiny_feature

The pull request to the central repository will then be automatically
updated by GitHub. However, the reviewers will not be automatically
notified of your revision, so it is advised to reply to the comments on
GitHub, to let them know that you have submitted a fix.


More Advanced Git Usage
=======================

You can find information and tips in the `numpy development
<http://docs.scipy.org/doc/numpy/dev/gitwash/development_workflow.html>`_
page. Here are a few.


Cleaning up branches
--------------------

When your pull request has been merged, you can delete the branch from
your GitHub fork's list of branches. This is useful to avoid having too
many branches staying there. Deleting this remote branch is achieved
with:

.. code-block:: bash

   git push origin :my_shiny_feature

This lines pushes to the "origin" repository (your fork of Theano on
GitHub), into the branch "my_shiny_feature", an empty content (that's
why there is nothing before the colon), effectively removing it.

The branch will still be present in your local clone of the repository.
If you want to delete it from there, too, you can run:

.. code-block:: bash

   git branch -d my_shiny_feature


Amending a submitted pull request
---------------------------------

If you want to fix a commit already submitted within a pull request
(e.g. to fix a small typo), before the pull request is accepted, you can
do it like this to keep history clean:

.. code-block:: bash

   git checkout my_shiny_feature
   git commit --amend
   git push origin my_shiny_feature:my_shiny_feature

Do not abuse that command, and please use it only when there are only
small issues to be taken care of. Otherwise, it becomes difficult to
match the comments made by reviewers with the new modifications.
In the general case, you should stick with the approach described above.


Cleaning up history
-------------------

Sometimes you may have commits in your feature branch that
are not needed in the final pull request. There is a `page
<http://sandofsky.com/blog/git-workflow.html>`_ that talks about
this. In summary:

* Commits to the trunk should be a lot cleaner than commits to your
  feature branch; not just for ease of reviewing but also
  because intermediate commits can break blame (the bisecting tool).
* `git merge --squash` will put all of the commits from your feature branch into one commit.
* There are other tools that are useful if your branch is too big for one squash.


Add another distant repository
------------------------------

To collaborate with another user on some feature he is developing, and
that is not ready for inclusion in central, the easiest way is to use a
branch of their Theano fork (usually on GitHub).

Just like we added Theano/Theano as a remote repository, named
"central", you can add (on your local machine) a reference to their fork
as a new remote repository. REPO_NAME is the name you choose to name
this fork, and GIT_REPO_PATH is the URL of the fork in question.

.. code-block:: bash

    git remote add REPO_NAME GIT_REPO_PATH

Then, you can create a new local branch (LOCAL_BRANCH_NAME) based on
a specific branch (REMOTE_BRANCH_NAME) from the remote repository
(REPO_NAME):

.. code-block:: bash

   git checkout -b LOCAL_BRANCH_NAME REPO_NAME/REMOTE_BRANCH_NAME


Other tools that can help you
=============================

 * `cProfile <http://docs.python.org/library/profile.html>`_: time profiler that work at function level.
 * `Yep <http://pypi.python.org/pypi/yep>`_: A module for profiling compiled extensions.
 * `autopep8 <http://pypi.python.org/pypi/autopep8/>`_: A tool that automatically formats Python code to conform to the PEP 8 style guide.
 * `line_profiler <http://pypi.python.org/pypi/line_profiler/>`_: Line-by-line profiler.
 * `memory_profiler <http://fseoane.net/blog/2012/line-by-line-report-of-memory-usage/>`_: memory profiler
 * `runsnake <http://www.vrplumber.com/programming/runsnakerun/>`_: Gui for cProfile(time profiler) and Meliae(memory profiler)
 * `Guppy <https://pypi.python.org/pypi/guppy/>`_: Supports object and heap memory sizing, profiling and debugging.
 * `hub <https://github.com/defunkt/hub>`_: A tool that adds github commands to the git command line.
 * `git pull-requests <http://www.splitbrain.org/blog/2011-06/19-automate_github_pull_requests>`_: Another tool for git/github command line.