Vous êtes sur la page 1sur 5

Software review

Bioinformatics software
resources
Abstract
This review looks at internet archives, repositories and lists for obtaining popular and useful
biology and bioinformatics software. Resources include collections of free software, services
for the collaborative development of new programs, software news media and catalogues of
links to bioinformatics software and web tools. Problems with such resources arise from needs

Downloaded from bib.oxfordjournals.org by guest on September 28, 2010


for continued curator effort to collect and update these, combined with less than optimal
Keywords: bioinformatics
software archives, web community support, funding and collaboration. Despite some problems, the available software
hyperlink catalogues, repositories provide needed public access to many tools that are a foundation for analyses in
bioinformatics news bioscience research efforts.

INTRODUCTION new and updated software are announced;


When the time comes to analyse results of and publications. Table 1 summarises a
a new experiment, bioscientists now selection of these resources, which is by
supplement their tool set of spreadsheet, no means comprehensive. These were
commercial bioinformatics and statistics selected to represent some of the more
programs, and word processor with comprehensive and/or actively managed
bioinformatics tools located through the resources for biology software.
web. This is essential for many biology Archives or repositories maintain
fields including sequence analysis, bio- collections of software from several
data management, phylogenetics, gene authors. These serve both as long-term
expression and proteomics. One can save libraries of useful tools, and a reference for
time and reach more reliable conclusions new software authors who want to avoid
by choosing the most appropriate analysis reinventing wheels. Bioinformatics
tools. Free bioinformatics tools are widely programs, including older ones especially
available, but it is not always easy to find with source code, are widely read and
the relevant ones, or even those that were used as reference works and building
available a few years ago. Web searches blocks by other bioinformaticians in
via Google, Yahoo and similar general development of new software. Long-term
search systems can miss the best of archives provide continuing access when
biology’s unique and focused resources. existence of authors’ web or ftp sites is
This review highlights some of biology’s often limited to a few years.1
current clearing-houses of informatics
tools. RESOURCES
Internet resources that one can use to Archives and web tools
find suitable biology software fall into When one wants to run software on a
four general groupings: resource sites workstation, stock a bioinformatics
(archives, bioinformatics service service centre with tools, or refer to
organisations); web lists and catalogues related programs when developing new
pointing to other sources; news and software, one needs to browse the web
discussion groups where information on for programs. Large collections of

300 & HENRY STEWART PUBLICATIONS 1467-5463. B R I E F I N G S I N B I O I N F O R M A T I C S . VOL 5. NO 3. 300–304. SEPTEMBER 2004
Software review

Table 1: Sources for biology software

Source Web URL Notes

Selected bioinformatics resource sites


Bioinformatics.org http://bioinformatics.org/ Bioinformatics developer repository
BioPortal, Weizmann Institute http://bioportal.weizmann.ac.il/ Software and data archive, web tools
BioWeb, Pasteur Institute http://bioweb.pasteur.fr/ Software and data archive, web tools
Canadian Bioinformatics Resource (CBR) http://cbr-rbc.nrc-cnrc.gc.ca/ Web tools
European Bioinformatics Institute http://www.ebi.ac.uk/ Web tools, data and software archive, catalogue
IUBio Archive, Indiana University http://iubio.bio.indiana.edu/ Software and data archive, catalogue, web tools
RFSB, Bioinformatics Centre, IMT http://imtech.res.in/pdsb/ Software archive
SourceForge.net http://sourceforge.net/ General developer repository with bioinformatics
section
Selected web lists and catalogues
BioHunt, ExPASy http://www.expasy.org/BioHunt/ Automatic robot updated list
Bioinformatics.ca http://www.bioinformatics.ca/links_directory Curated list
Bioinformatics.net http://www.bioinformatics.net/ Curated list
Bioinformatik.de http://www.bioinformatik.de/ Curated list
BioNetbook, Pasteur Institute http://www.pasteur.fr/recherche/BNB/bnb-en.html Semi-automatic updates
CSM Molecular Biology Resource, SDSC http://restools.sdsc.edu/ Curated list
GenomeWeb http://www.hgmp.mrc.ac.uk/GenomeWeb/ Curated list

Downloaded from bib.oxfordjournals.org by guest on September 28, 2010


Open Directory Project http://dmoz.org/Science/Biology/Bioinformatics/ Submitted links
SouthWest Biotechnology and Informatics Center http://www.swbic.org/ Curated list
News and discussion groups
BIOSCI/Bionet http://www.bio.net/ Biologist and bioinformatics focus
Bioinformatics.org http://bioinformatics.org/ Bioinformatics focus
Bioinformatics.net http://bioinformatics.net/ Biologist focus
Selected bioinformatics publications for software tools
BioInform http://www.bioinform.com/ Bioinformatics news briefs
Bioinformatics http://bioinformatics.oupjournals.org/ Bioinformatics focus, see application notes for
biologists
BMC Bioinformatics http://www.biomedcentral.com/bmcbioinformatics/ Bioinformatics focus
Briefings in Bioinformatics http://www.henrystewart.com/journals/bib/ Biologist and bioinformatics focus

programs are available through BioWeb at biology software.3 This resource also
Pasteur, BioPortal at Weizmann Institute, offers long-term access to many popular
European Bioinformatics Institute (EBI), biology programs.
IUBio Archive at Indiana University and Among those resources offering
RFSB at IMT India. The project bioinformatics web tools, some of the
repositories at Bioinformatics.org and more comprehensive include EBI,
SourceForge.net also offer bio-software BioWeb Pasteur and Canadian
with source code, many in active Bioinformatics Resource. There are a
development. number of common bioinformatics
IUBio Archive maintained by this analyses one can perform at these sites,
author is one such long-term archive, including BLAST and sequence analyses,
serving public biology and bioinformatics primer tools and phylogenetics tree
software since 1989. It houses over 500 construction. EMBOSS sequence analysis
software titles, many of which have been package and SRS bio-database access are
added in recent years. The addition of among the widely useful web tools
EPrints.org2 self-archiving web database available at these and other resource sites.
for promoting author-contributed
software will help expand this archive Web lists and catalogues
with minimal support. The Repository of There are numerous web lists of
Free Software in Biology (RFSB) at the bioinformatics resources, with many
Institute of Microbial Technology, India, aimed at the biologist looking for
collects and archives free, academic software. These are successors to last

& HENRY STEWART PUBLICATIONS 1467-5463. B R I E F I N G S I N B I O I N F O R M A T I C S . VOL 5. NO 3. 300–304. SEPTEMBER 2004 301
Software review

decade’s popular bioinformatics lists by biology-focused search engine proves


Keith Robison4 and Pedro Coutinho.5 especially useful in finding that tool or
Some of these, such as Bioinformatics.net, resource most relevant to one’s research.
include discussion forums on the use of This project also has implemented link
biology software. These are useful for maintenance by using semi-automatic
biologists, as well as bioinformatics scanning of internet news and resources
engineers looking for tools related to their (robot-like) to update the catalogue. A
work, or to be used at service centres. similar project is BioHunt, which uses
Many of these share a similar organisation internet robot technology to search and
by functional categories, with many of the update molecular biology resources.
same links. It is useful to compare these BioHunt maintains current entries (it
for their different editorial perspectives, eg shows update times of this review month
genomics/molecular biology or for several searches), making it especially
proteomics/biochemistry, as well as effort useful to find new or updated tools that
to update and remove obsolete links. one has heard of, but lacks curated
General resources such as Google, cataloguing of these to make it easy to
Amazon’s Alexa and Open Directory find by subject matter.

Downloaded from bib.oxfordjournals.org by guest on September 28, 2010


Project at Mozilla.org include biology and Bioinformatics.net is a catalogue of
bioinformatics categories in their online biology resources, specialising in
directories. These directories are bioinformatics tools. Its focus is towards
populated by robots or from submissions; the needs of molecular biologists and life
they tend to lack the comprehensiveness science professionals, more than for
of biologist-maintained lists. bioinformaticians, and includes discussion
Bioinformatics.ca provides a curated list and help forums on the use of software
of links that are well organised in and bioscience topics. Jonathan Rees,
categories, with main sections that include who developed this resource, also curates
human genome and model organisms, biology lists in the Open Directory
sequences, gene expression, education Project. This service is supported in part
and computer-related resources. Most or by advertising, as are others reviewed
all of these include useful editorial here, one of the limited options available
comments on the content and value of to maintain such services.
the linked resources, making this list Bioinformatik.de offers a similar directory
especially useful in learning about style collection of curated bioinformatics
resources. The GenomeWeb at MRC, and biology resource links. The CMS
UK, offers a similar very useful catalogue molecular biology resource is an extensive
of links with editorial abstracts. An catalogue of biology resources, including
interesting function at Bioinformatics.ca is software tools. The SouthWest
provided by an XML standard for web Biotechnology Center also maintains a
news called RSS, for sharing useful catalogue covering a broad range of
bioinformatics links. This allows biology resources.
customers and other web sites to have Bioinformatics.org and
computable access to this catalogue. For SourceForge.net are resources that
instance, you can use an RSS program to support software developers and
notify you of additions and changes to this bioinformatics engineers, but are also
catalogue. useful to biologists looking for tools.
The BioNetbook project at Pasteur Open-source software development in
Institute provides an example of resource bioinformatics and other fields is being
lists that are searchable by several invigorated through agencies such as
bioinformatics criteria: Biological these. The number of active, widely used
Domain, eg sequence analysis or structural and valuable bioinformatics projects at
biology, Resource type, eg database or these services is growing, including
online analysis tools, and Organism. This Generic Model Organism Database, Gene

302 & HENRY STEWART PUBLICATIONS 1467-5463. B R I E F I N G S I N B I O I N F O R M A T I C S . VOL 5. NO 3. 300–304. SEPTEMBER 2004
Software review

Ontology, GeneX Gene Expression pharmaceutical industry issues. BMC


Database and Staden Package for Bioinformatics is an electronic journal that
sequence analysis. These agencies allow requires no subscriber fees. It publishes a
for software archiving, but the primary range of original reports on new
attractions to software developers are bioinformatics software, with open access
infrastructure and tools that enable for their redistribution.
collaborative software development. A
historical archive or catalogue service of DISCUSSION
bioinformatics software is limited, and There are numerous, well-developed
maintenance of software releases is left to bioinformatics resources that bioscientists
developers using this service. can use to find the best available research
tools. All of those discussed here have
News and publications particular values; the interested scientist
Sources for announcements of software will find time well spent browsing
include the 15-year-old BIOSCI/Bionet through some of these to find a particular
public forums of bionet.software, tool. The most useful are those that have a

Downloaded from bib.oxfordjournals.org by guest on September 28, 2010


bionet.software.www, high level of editorial comment and
bionet.biology.computational and curation by biologists; these also tend to
bionet.announce news groups. The become out of date as funding for
Bionet groups have been overshadowed maintenance declines. The BioHunt and
in recent years by web discussion forums, BioNetbook projects show that robot
such as those at Bioinformatics.org and automation can add significant currency
Bioinformatics.net. They have a core and reduce curation effort to maintaining
value by carrying openly accessible and bioinformatics catalogues and search
widely distributed biology software news services.
and discussion, including through web Aside from the publications listed in
portals such as www.bio.net and Google Table 1, these resources are non-
groups. Other sources include the commercial, usually maintained by
growing number of bioinformatics paper bioscientists and bioinformaticians, and
and electronic publications, and web list/ are generally supported by an institution.
forum sites devoted to biology and A mixture of government support and
bioinformatics. Distinctions made growing biotechnology industry
between biologist and bioinformatics advertising supplements these. Although
focus are fuzzy, but suggest the tendency several of these have been in existence for
of resources to address needs of those with many years, they share with a larger group
one or the other field of primary training. of such resources an uncertain future. As
This journal Briefings in Bioinformatics Wren1 noted in his survey of the
covers a range of new software and longevity of access to URLs published in
bioinformatics projects, generally with a Medline abstracts (many of these for
focus that spans the interests of bioinformatics-related publications), 20
bioscientists and bioinformaticians. The per cent or more disappear in less than a
journal Bioinformatics, originally CABios in decade. With a yearly loss in access, older
the 1980s, is the longest-running software URLs are likely to be
publication for original papers in this unavailable. Often bioinformatics tools
field. It includes a useful section of short remain useful beyond the support an
application notes, where many new individual author or group can provide,
programs are announced. The weekly especially those designed for specific
newsletter BioInform has a short section of problems that do not attract new
recent new releases and updates to development. Many of these are used by
software and database projects in biology, other bioinformaticians as reference
along with in-depth news articles, often works for development of new software.
focused on biotechnology and One major problem in maintaining

& HENRY STEWART PUBLICATIONS 1467-5463. B R I E F I N G S I N B I O I N F O R M A T I C S . VOL 5. NO 3. 300–304. SEPTEMBER 2004 303
Software review

software archives is collecting the these functions among several institutions,


software. All of the noted biology and limited funding, all must be addressed
collections have been the effort of a few by any project to improve biology
archivists, rather than author software publication and access.
contributions. For several years, the
journal Bioinformatics recommended Acknowledgments
IUBio Archive as a repository for This work is supported in part by NSF grant
published software, but this did not lead 0090782 and NIH grant 1R01HG002733-01 to
to any notable increase in author D. Gilbert.
contributions. The issue of long-term
software archiving is one that might be Don Gilbert
better handled with an institutional library Biology Department,
paradigm such as maintains books, Indiana University,
journals and other science reference Bloomington, Indiana 47405 USA
material. The current archives are Tel: +1 812 855 0587
dependent on enthusiasm and dedication E-mail: gilbertd@indiana.edu
of those individuals who see value in

Downloaded from bib.oxfordjournals.org by guest on September 28, 2010


contributing their time in the face of References
limited institutional and agency support. 1. Wren, J. D. (2004), ‘404 not found: The
Bioinformatics is maturing as an stability and persistence of URLs published in
academic discipline, and the field’s MEDLINE’, Bioinformatics, Vol. 20, pp. 668–
672 (DOI: 10.1093/bioinformatics/btg465).
journals are catching up to other
electronic media for timely release of 2. EPrints.org, Self-Archiving and Open Archives
organization (URL: http://www.eprints.org/).
news and software announcements, in
addition to becoming the preferred route 3. Raghava, G. P. S. (2001), ‘PDSB: Public
domain software in biology’, Biotech Software
of authors for such announcements. Internet Rep., Vol. 2, pp. 154–156.
Merging into one venue these functions
4. Robison, K. (1993), ‘WWW Virtual Library for
of software archiving, news and Biosciences’ (URL: http://web.archive.org/
discussion, along with author publication web/*/http://golgi.harvard.edu/biopages.html;
reports, would make much sense for see also http://mcb.harvard.edu/BioLinks.html,
http://vlib.org/Biosciences.html).
future bioinformatics software
publication. This may be forthcoming, 5. Coutinho, P. (1995), ‘Pedro’s Biomolecular
Research Tools’ (URL: http://
though a growing influence of traditional www.public.iastate.edu/pedro/
academic publishing modes, dispersion of research_tools.html).

304 & HENRY STEWART PUBLICATIONS 1467-5463. B R I E F I N G S I N B I O I N F O R M A T I C S . VOL 5. NO 3. 300–304. SEPTEMBER 2004

Vous aimerez peut-être aussi