Use the built-in 'rbench' module. This module contains geninterpreted versions of pystone and richards, so it is useful to measure the interpretation overhead of the various pypy-\*.