From Biopython
Revision as of 04:43, 17 November 2010 by Mdehoon (Talk | contribs)
Jump to: navigation, search
Author Matthew Zapala, Ondrej Libiger and Nicholas Schork
Description MDMR is a Multivariate Distance Matrix Regression program that performs linear regression on a distance matrix of data and associates multiple independent variables to that matrix.

Author Frank Kauff
Description A python wrapper for RAxML-VI-HPC. This is a python script that reads nexus data files and prepares the necessary input files and command line options for RAxML.

Author Chih-Chung Chang and Chih-Jen Lin
Description LIBSVM is an integrated software for support vector classification, regression and distribution estimation. It includes a python interface and excellent documentation about SVMs.

Name PyML
Author Asa Ben-Hur
Description PyML is a flexible Python framework for using various classification methods including Support Vector Machines (SVM). It provides tools for model selection and feature selection.

Name PyMOL
Author Warren DeLano
Description PyMOL is an open-source molecular modeling program written in the Python language. A number of Python scripts are available which link PyMOL to standard structure analysis programs.

Name GenomeDiagram
Author Leighton Pritchard
Description A genomic and biological sequence schematic drawing package. Creates publication-quality genome schematics in several vector and bitmap formats.

Name Python Macromolecular Library (mmLib)
Author Jay Painter
Description The Python Macromolecular Library (mmLib) is a software toolkit and library of routines for the analysis and manipulation of macromolecular structural models, implemented in the Python programming language. It is accessed via a layered, object-oriented application programming interface, and provides a range of useful software components for parsing mmCIF, PDB, and MTZ files, a library of atomic elements and monomers, an object-oriented data structure describing biological macromolecules, and an OpenGL molecular viewer.

Name pyzerg
Author Leighton Pritchard
Description A Python wrapper for the Zerg BLAST parser, a very fast BLAST parser library written in C.

Name Pise Biopython client
Author Catherine Letondal
Description This package runs remote analyses on a Pise Web server. There are currently about 300 programs available, including EMBOSS and Phylip, several phylogeny, sequence comparison, 3D structure, pattern discovery, gene prediction and DNA, protein or RNA analyses programs. The package automatically chain programs to extract results as file, string or filehandle for Biopython modules. It also accepts Biopython sequences and alignment class as input data.

Name Pycluster
Author Michiel de Hoon
Description This is a Python C extension module for clustering gene expression data. The exact same package is available as part of Biopython (see Bio.Cluster).

Author Andrew Dalke
Description A Python script that reads the PDB format documentation and generates code for a given language to parse and/or generate column delimited record format used by the PDB.

Name Dinu Gherman's alignment code
Author Dinu Gherman
Description Code for pairwise sequence alignments.

Name Open Infrastructure for Outcomes (OIO)
Author Andrew P. Ho
URL http://www.TxOutcome.Org/
Description OIO is a free (as in GPL) web-based research and clinical data system that provides user-extensible plug-and-play components and data mining tools. We use it at Harbor-UCLA for health/treatment outcomes data. Forms/metadata can be exported+imported as XML and exchanged via the online OIO Library at www.TxOutcome.Org. It is written in Zope/Python and uses the PostgreSQL database backend. It has not been used to manage gene sequence/annotation data to my knowledge but it would be trivial to extend it for those functions.

Name Arne Mueller's BLAST parser
Author Arne Mueller
Description A parser for BLAST and PSI-BLAST written in and for python.

Name The Nilges group's python scripts
Author Michael Nilges and the structural bioinformatics research group
Description Python code for structural bioinformatics and data transformation.

Name PySAT
Author Chenna Ramu
Description Python Sequence Analysis Tools consists of modules to parse and iterate through various flat file databases (EMBL, GENBANK, SWISSPROT, PDBFINDER, ENZYME etc.) and does a few other things.

Name PyPhy
Author Thomas Sicheritz-Ponten
Description PyPhy is a set of python scripts and modules for automatic, large-scale reconstructions of phylogenetic relationships of complete microbial genomes. PyPhy consists of AutoTree which automatically generates phylogenetic trees for each amino acid sequence in a FASTA file, and Xphylome which generates and visualizes the Phylome Maps for a microbial genome.

Name Scripps Molecular Graphics Labs Python Software
Author Michel Sanner
Description This site contains code dealing with structural bioinformatics and molecular visualization. This includes MolKit, which reads molecules from a number of file formats; PyBabel, which builds up molecular structures; MSLib, which wraps up a molecular surface calculation library; Python Molecular Viewer (PMV), which provides a complete viewer; AutoDockTools, which provides a GUI to set up ligand to protein docking experiments; and many more.

Name Konrad Hinsen's Python Page - MMTK and ScientificPython
Author Konrad Hinsen
URL and
Description Contains the Molecular Modeling Toolkit (MMTK), an open source program library for molecular simulation applications. Additionally, Konrad has ScientificPython, which collects a number of modules that are useful in scientific computing, including code for statistics, basic geometry, etc.

Name Paul Magwene's Python Page
Author Paul Magwene
Description This collects Paul's modules for doing different tasks (and also has links and a nice essay about why he likes python). A particularly interesting module is disipyl, which provides an object oriented interface to the Dislin plotting library.

Name Noah Hoffman's Python Page
Author Noah Hoffman and Wolfgang Resch
Description A collection of python modules and scripts originally intended to process large collections of HIV sequences. Now contains general utilities for sequence analysis, manipulation, classification, and formatting. Plus other useful stuff.

Name Vienna RNA package
Author Ivo Hofacker, Institut für theoretische Chemie, University of Vienna, Austria
Description This is a package for RNA secondary structure prediction and comparison. While it was not specifically written for Python, you can get the Python wrappers for it using the SWIG file that is included with the software package. The easiest way is to first compile the Vienna RNA package as described in its documentation (you don't need to actually install it). Then, run "swig -python RNA.i" in the ViennaRNA-1.8.4/Perl directory. This will create a new RNA_wrap.c specifically for Python, as well as

Next, you should compile RNA_wrap.c. You can use this script:

#!/usr/bin/env python
from distutils.core import setup, Extension
import os
import sys
old_filename = os.path.join("Perl", "")
new_filename = os.path.join("Perl", "")
if os.path.exists(old_filename):
    os.rename(old_filename, new_filename)
extra_link_args = []
if sys.platform != 'darwin':
extension = Extension("_RNA",
      description="Vienna RNA",
      author="Ivo Hofacker, Institute for Theoretical Chemistry, University of Vienna",
      package_dir = {'RNA':'Perl'},
      packages = ['RNA'],

Put this in the ViennaRNA-1.8.4 directory, and run "python build; python install" as usual.

To try the package, use

>>> import RNA
['(((((...)))))', -7.5]
Personal tools