1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192
|
Getting Started with RPython
============================
.. contents::
.. warning::
Please `read this FAQ entry`_ first!
.. _`read this FAQ entry`: http://rpython.readthedocs.org/en/latest/faq.html#do-i-have-to-rewrite-my-programs-in-rpython
RPython is a subset of Python that can be statically compiled. The PyPy
interpreter is written mostly in RPython (with pieces in Python), while
the RPython compiler is written in Python. The hard to understand part
is that Python is a meta-programming language for RPython, that is,
"being valid RPython" is a question that only makes sense on the
live objects **after** the imports are done.
This might require more explanation. You start writing RPython from
``entry_point``, a good starting point is
:source:`rpython/translator/goal/targetnopstandalone.py`. This does not do all that
much, but is a start. Now if code analyzed (in this case ``entry_point``)
calls some functions, those calls will be followed. Those followed calls
have to be RPython themselves (and everything they call etc.), however not
entire module files. To show how you can use metaprogramming, we can do
a silly example (note that closures are not RPython)::
def generator(operation):
if operation == 'add':
def f(a, b):
return a + b
else:
def f(a, b):
return a - b
return f
add = generator('add')
sub = generator('sub')
def entry_point(argv):
print add(sub(int(argv[1]), 3) 4)
return 0
In this example ``entry_point`` is RPython, ``add`` and ``sub`` are RPython,
however, ``generator`` is not.
The following introductory level articles are available:
* Laurence Tratt -- `Fast Enough VMs in Fast Enough Time`_.
* `How to write interpreters in RPython`_ and `part 2`_ by Andrew Brown.
.. _Fast Enough VMs in Fast Enough Time: http://tratt.net/laurie/tech_articles/articles/fast_enough_vms_in_fast_enough_time
.. _How to write interpreters in RPython: http://morepypy.blogspot.com/2011/04/tutorial-writing-interpreter-with-pypy.html
.. _part 2: http://morepypy.blogspot.com/2011/04/tutorial-part-2-adding-jit.html
.. _try-out-the-translator:
Trying out the translator
-------------------------
The translator is a tool based on the PyPy interpreter which can translate
sufficiently static RPython programs into low-level code (in particular it can
be used to translate the `full Python interpreter`_). To be able to experiment with it
you need to download and install the usual (CPython) version of:
* Pygame_
* `Dot Graphviz`_
.. _Pygame: http://www.pygame.org/
.. _Dot Graphviz: http://www.graphviz.org/
To start the interactive translator shell do::
cd rpython
python bin/translatorshell.py
Test snippets of translatable code are provided in the file
:source:`rpython/translator/test/snippet.py`, which is imported under the name
``snippet``. For example::
>>> t = Translation(snippet.is_perfect_number, [int])
>>> t.view()
After that, the graph viewer pops up, that lets you interactively inspect the
flow graph. To move around, click on something that you want to inspect.
To get help about how to use it, press 'H'. To close it again, press 'Q'.
Trying out the type annotator
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
We have a type annotator that can completely infer types for functions like
``is_perfect_number`` (as well as for much larger examples)::
>>> t.annotate()
>>> t.view()
Move the mouse over variable names (in red) to see their inferred types.
Translating the flow graph to C code
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
The graph can be turned into C code::
>>> t.rtype()
>>> lib = t.compile_c()
The first command replaces the operations with other low level versions that
only use low level types that are available in C (e.g. int). The compiled
version is now in a ``.so`` library. You can run it say using ctypes:
>>> f = get_c_function(lib, snippet.is_perfect_number)
>>> f(5)
0
>>> f(6)
1
A slightly larger example
~~~~~~~~~~~~~~~~~~~~~~~~~
There is a small-to-medium demo showing the translator and the annotator::
python bin/rpython --view --annotate translator/goal/bpnn.py
This causes ``bpnn.py`` to display itself as a call graph and class
hierarchy. Clicking on functions shows the flow graph of the particular
function. Clicking on a class shows the attributes of its instances. All
this information (call graph, local variables' types, attributes of
instances) is computed by the annotator.
To turn this example to C code (compiled to the executable ``bpnn-c``),
type simply::
python bin/rpython translator/goal/bpnn.py
Translating Full Programs
~~~~~~~~~~~~~~~~~~~~~~~~~
To translate full RPython programs, there is the script ``bin/rpython`` in
:source:`rpython/bin/`. Examples for this are a slightly changed version of
Pystone::
python bin/rpython translator/goal/targetrpystonedalone
This will produce the executable "targetrpystonedalone-c".
The largest example of this process is to translate the `full Python
interpreter`_. There is also an FAQ about how to set up this process for :ref:`your
own interpreters <compile-own-interpreters>`.
There are several environment variables you can find useful while playing with the RPython:
``PYPY_USESSION_DIR``
RPython uses temporary session directories to store files that are generated during the
translation process(e.g., translated C files). ``PYPY_USESSION_DIR`` serves as a base directory for these session
dirs. The default value for this variable is the system's temporary dir.
``PYPY_USESSION_KEEP``
By default RPython keeps only the last ``PYPY_USESSION_KEEP`` (defaults to 3) session dirs inside ``PYPY_USESSION_DIR``.
Increase this value if you want to preserve C files longer (useful when producing lots of lldebug builds).
Sources
-------
* :source:`rpython/translator` contains the code analysis and generation stuff.
Start reading from translator.py, from which it should be easy to follow
the pieces of code involved in the various translation phases.
* :source:`rpython/annotator` contains the data model for the type annotation that
can be inferred about a graph. The graph "walker" that uses this is in
:source:`rpython/annotator/annrpython.py`.
* :source:`rpython/rtyper` contains the code of the RPython typer. The typer transforms
annotated flow graphs in a way that makes them very similar to C code so
that they can be easy translated. The graph transformations are controlled
by the code in :source:`rpython/rtyper/rtyper.py`. The object model that is used can
be found in :source:`rpython/rtyper/lltypesystem/lltype.py`. For each RPython type
there is a file rxxxx.py that contains the low level functions needed for
this type.
* :source:`rpython/rlib` contains the :doc:`RPython standard library <rlib>`, things that you can
use from rpython.
.. _full Python interpreter: http://pypy.readthedocs.org/en/latest/getting-started-python.html
|