Active projects

Revision as of 17:44, 24 September 2009

This page provides a central location to collect references to active projects. This is a good place to start if you are interested in contributing to Biopython and want to find larger projects in progress. For developers, use this to reference git branches or other projects which you will be working on for an extended period of time. Please keep it up to date as projects are finished and integrated into Biopython.

Current projects

Population Genetics development

Giovanni and Tiago are working on expanding population genetics code in Biopython. See the PopGen development page for more details.

GFF parser

Brad is working on a Biopython GFF parser. Source code is available from GitHub. Documentation is in progress at GFF Parsing. See the blog posts on the initial implementation and the MapReduce parallel version.
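
For readers new to the format: a GFF3 feature line has nine tab-separated columns (seqid, source, type, start, end, score, strand, phase, attributes). The snippet below is only a minimal sketch of reading one such line in plain Python. It is not Brad's parser and does not use any Biopython API; the names GFFFeature and parse_gff3_line are hypothetical and exist only for this illustration.

```python
from typing import Dict, NamedTuple, Optional
from urllib.parse import unquote


class GFFFeature(NamedTuple):
    """One GFF3 feature line (hypothetical container for this sketch)."""
    seqid: str
    source: str
    type: str
    start: int
    end: int
    score: Optional[float]
    strand: str
    phase: Optional[int]
    attributes: Dict[str, str]


def parse_gff3_line(line: str) -> Optional[GFFFeature]:
    """Parse a single GFF3 line; return None for blank lines and comments."""
    line = line.rstrip("\n")
    if not line or line.startswith("#"):
        return None
    cols = line.split("\t")
    if len(cols) != 9:
        raise ValueError("expected 9 tab-separated columns, got %d" % len(cols))
    seqid, source, ftype, start, end, score, strand, phase, attrs = cols
    # Column 9 holds semicolon-separated key=value pairs, URL-escaped.
    attributes = {}
    for pair in attrs.split(";"):
        if "=" in pair:
            key, value = pair.split("=", 1)
            attributes[unquote(key.strip())] = unquote(value.strip())
    return GFFFeature(
        seqid, source, ftype, int(start), int(end),
        None if score == "." else float(score),  # "." means no score
        strand,
        None if phase == "." else int(phase),    # "." means no phase
        attributes,
    )


feature = parse_gff3_line("chr1\texample\tgene\t1000\t2000\t.\t+\t.\tID=gene0001;Name=EDEN")
```

A real parser also has to handle directives, nested parent/child features, and multi-valued attributes, which this sketch ignores.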

PhyloXML driver (GSoC)

Eric is working on supporting the PhyloXML format, as a project for Google Summer of Code 2009. Brad is mentoring this project. The code lives on a branch in GitHub, and you can see a timeline and other info about ongoing development here. The new module is being documented on this wiki as PhyloXML.

Biogeography (GSoC)

Nick is working on developing a Biogeography module for Biopython. This work is funded by Google Summer of Code 2009 through NESCent's Phyloinformatics Summer of Code 2009. See the project proposal at: Biogeographical Phylogenetics for BioPython. The mentors are Stephen Smith (primary), Brad Chapman, and David Kidd. The code currently lives in the Bio/Geography directory of the Geography fork of the nmatzke branch on GitHub, and you can see a timeline and other info about ongoing development here. The new module is being documented on this wiki as BioGeography.

Roche 454 SFF parsing in Bio.SeqIO

See Bug 2837, based on code from Jose Blanca.

Open Enhancement Bugs

This Bugzilla search lists all open enhancement bugs. Those filed by core developers are fairly likely to be integrated; others are just wish list entries.

Project ideas

Please add any ideas or proposals for new additions to Biopython. Bugs and enhancements for current code should be discussed through our Bugzilla interface.

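  • Build a general tool to filter sequences containing ambiguous or low quality bases. Chris Fields from BioPerl is interested in coordinating the BioPerl/Biopython implementations. See these threads on the mailing lists for discussion: http://lists.open-bio.org/pipermail/biopython/2009-July/005355.html, http://lists.open-bio.org/pipermail/biopython/2009-July/005342.html
  • Use SQLAlchemy, an object relational mapper, for BioSQL internals. This would add an external dependency to Biopython, but it provides ready support for additional databases like SQLite. It would also provide a raw object interface to BioSQL databases when the SeqRecord-like interface is not sufficient. Brad and Kyle have some initial code for this.
  • Revamp the GEO SOFT parser, drawing on the ideas used in Sean Davis' GEOquery parser in R/Bioconductor (http://www.bioconductor.org/packages/bioc/html/GEOquery.html). See also this page: http://www.warwick.ac.uk/go/peter_cock/r/geo/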
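
As a rough illustration of the sequence-filtering idea (dropping reads that contain ambiguous or low quality bases), here is a minimal sketch in plain Python. It is not the design discussed on the mailing list and it uses no Biopython or BioPerl API: the function name filter_reads, the (name, sequence, qualities) tuple layout, and the default thresholds are all assumptions made for this example.

```python
from typing import Iterable, Iterator, List, Tuple

# Hypothetical record layout for this sketch:
# (identifier, sequence, per-base Phred quality scores).
Read = Tuple[str, str, List[int]]

UNAMBIGUOUS_DNA = frozenset("ACGT")


def filter_reads(
    reads: Iterable[Read],
    max_ambiguous: int = 0,
    min_quality: int = 20,
) -> Iterator[Read]:
    """Yield reads with at most max_ambiguous non-ACGT bases
    and no base whose quality is below min_quality."""
    for name, seq, quals in reads:
        ambiguous = sum(1 for base in seq.upper() if base not in UNAMBIGUOUS_DNA)
        if ambiguous > max_ambiguous:
            continue  # too many ambiguous bases (e.g. N)
        if quals and min(quals) < min_quality:
            continue  # at least one low quality base
        yield name, seq, quals


reads = [
    ("read1", "ACGTACGT", [30] * 8),
    ("read2", "ACGTNCGT", [30] * 8),                        # contains N
    ("read3", "ACGTACGT", [30, 30, 5, 30, 30, 30, 30, 30]),  # one poor base
]
kept = [name for name, _, _ in filter_reads(reads)]
```

Writing the filter as a generator keeps memory use flat on large read sets. A real tool would probably also offer trimming rather than only rejecting whole reads.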

Enhancement list

Maintaining software involves incremental improvements to handle format changes and to fix bugs. Please see our Bugzilla page for a current list. Post to the developer mailing list if you are interested in tackling any open issues.
