Writing Tests for Biopython modules
                  ***********************************
                    Brad Chapman (chapmanb@uga.edu)
                    ===============================
                                    
  

Table of Contents
*=*=*=*=*=*=*=*=*

   
  

1  Purpose and Justifications
*=*=*=*=*=*=*=*=*=*=*=*=*=*=*

  
  So you want to write a Test for a Biopython module? Great! Providing
comprehensive tests for modules is one of the most important parts of
keeping code up to date and working the way you expect it to. It also 
tends to be one of the most undervalued aspects of contributing, so this
document is designed to make writing good test code as easy as possible.
  We start off with the simple assumption that there is a module you
wrote (or which doesn't already have tests), and you want to test it
out. We'll call it `MyModule', for lack of a better name, from now on.
  

2  Understanding the Biopython test system
*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=

  
  Biopython tests are found in `biopython/Tests' and each test will have
two important files and directories involved with it:
  
   
 1. `test_MyModule.py' -- The actual test code for your module.  
 2. `MyModule' -- A directory where any necessary input files  will be
   located. Any output files that will be generated should also  be
   written here (and preferrably cleaned up after the tests are  done)
   to prevent clogging up the main Tests directory. 
  
  The main engine that runs all the tests is `run_tests.py'. If you name
your file with a `test_' prefix and put it in the `Tests' directory,
`run_tests.py' will find it and run it.
  

3  Writing the Tests
*=*=*=*=*=*=*=*=*=*=

  
  Okay, now that we're oriented with the important parts of the test
framework, we'll get down to writing tests. We will use the unittest
framework, included with Python since version 2.1, and documented in the
python Library Reference (which I know you are keeping under your
pillow, as recommended).
  To get started, here's a framework to copy and paste:
<<
  import os
  import sys
  import unittest
  
  def run_tests(argv):
      test_suite = testing_suite()
      runner = unittest.TextTestRunner(sys.stdout, verbosity = 2)
      runner.run(test_suite)
  
  def testing_suite():
      """Generate the suite of tests.
      """
      test_suite = unittest.TestSuite()
  
      test_loader = unittest.TestLoader()
      test_loader.testMethodPrefix = 't_'
      tests = [MyModuleTestOne]
      
      for test in tests:
          cur_suite = test_loader.loadTestsFromTestCase(test)
          test_suite.addTest(cur_suite)
  
      return test_suite
  
  class MyModuleTestOne(unittest.TestCase):
      def setUp(self):
          pass
  
      def tearDown(self):
          pass
  
  if __name__ == "__main__":
      sys.exit(run_tests(sys.argv))
>>
  
  The two functions `run_tests' and `testing_suite' just do all of the
unittest setup -- so you don't really need to understand everything
about the framework to write the tests. All you'll need to do is write
classes, like MyModuleTestOne, and add them to the lists of test classes
in the `testing_suite' function and you'll be set.
  Here's an example of a Test class, which will lead into a description
of the different parts which go into making one:
<<
  class MyModuleTestOne(unittest.TestCase):
      def setUp(self):
          self.handle = open("MyModule/input_file.txt")
          self.output_file = "MyModule/output.txt"
  
      def tearDown(self):
          self.handle.close()
          if os.path.exists(self.output_file):
              os.remove(self.output_file)
  
      def t_simple_parsing(self):
          """Test to be sure that MyModule can parse input files.
          """
          parser = MyModule.RecordParser()
          rec = parser.parse(self.handle)
          assert rec.id = "TheExpectedID"
  
      def t_output(self):
          """Ensure that we can write proper output files.
          """
          parser = MyModule.RecordParser()
          rec = parser.parse(self.handle)
          output_handle = open(self.output_file, "w")
          rec.write_to_file(output_handle)
          output_handle.close()
>>
  
  We'll cover the important parts of this class one at a time:
  
   
 - It should derive from `unittest.TestCase' and should cover  one basic
   aspect of your code (parsing into Record objects, parsing  into
   Sequence objects...)
 
 - `setUp' -- code that you want to run before running each  test in the
   class. This might set up expected files or open files --  it just
   prevents having to rewrite the same code over and over in  each test.
 
 - `tearDown' -- code to clean up after your test is done.  This will
   close up handles you were using and clean up produced  files that you
   don't need any longer.
 
 - The tests are prefixed with `t_' and each test should cover  one
   specific part of what you are trying to test. You can have as  many
   tests as you want in a class.
 
 - The documentation strings of the tests are used when running  them --
   so write something useful that will provide help if a test  is
   failing.
 
 - You should test expected parts of your output using standard  python
   assertions. This makes explicit the things you think are  important
   enough to remain the same over multiple runs of the tests  and
   revisions of the code base. 
  
  When you are done, the tests should run happily by doing  `python
test_MyModule.py' and you should expect output like:
<<
  Test to be sure that MyModule can parse input files. ... ok
  Ensure that we can write proper output files. ... ok
  
  -------------------------------------------------------------------
  Ran 2 tests in 0.225s
>>
  
  You should continue like this until you feel you've suitable tested
the parts of the code so that someone running this test later on can
tell if they've broken anything essential. You have to draw a line
between testing everything in the world, and getting tests written in a
reasonable amount of time. Tests are important, but you'd much rather be
writing code, right? So get them done and you'll be happy.
  

4  Getting the test integrated in the testing framework
*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*

  
  Well, you've written all your tests and they all pass and your are
feeling very happy. There is just one last step to get things 
integrated with Biopython. Every time someone runs  `python
run_tests.py' each test is executed, and then checked against the
expected output (all 'ok's) to make sure nothing has broken. Well, you
need to make sure the tests knows what the expected output is.
  To do this takes just a second:
<<
  python run_tests.py -g test_MyModule.py
>>
  
  This will run your test, and write the output to
`output/test_MyModule'. You should go in and check this file to make
sure everything is as expected. Then, you're all set.
  So, now you should be able to run all of the Biopython tests, see your
test show up, and get a happy ok from the testing framework.
  Good job -- now you either need to check your tests in if you have CVS
access, or just send your `test_MyModule.py' and contents of `MyModule'
to someone with CVS priviledges.
  Thanks for the tests. 
-----------------------------------------------------------------------
  
   
              This document was translated from LaTeX by HeVeA
              (http://pauillac.inria.fr/~maranget/hevea/index.html).