Debian Med Project
Help us to see Debian used by medical practitioners and biomedical researchers! Join us on the Alioth page.
Summary
Biology
Debian Med micro-biology packages

This metapackage will install Debian packages related to molecular biology, structural biology and bioinformatics for use in life sciences.

The list to the right includes various software projects which are of some interest to the Debian Med Project. Currently, only a few of them are available as Debian packages. It is our goal, however, to include all software in Debian Med which can sensibly add to a high quality Debian Pure Blend.

For a better overview of the project's availability as a Debian package, each head row has a color code according to this scheme:

If you discover a project which looks like a good candidate for Debian Med to you, or if you have prepared an unofficial Debian package, please do not hesitate to send a description of that project to the Debian Med mailing list

Links to other tasks

Debian Med Biology packages

Official Debian packages with high relevance

Abacas
Algorithm Based Automatic Contiguation of Assembled Sequences
Versions of package abacas
ReleaseVersionArchitectures
wheezy1.3.1-1all
jessie1.3.1-1all
sid1.3.1-1all
Debtags of package abacas:
roleprogram
Popcon: 8 users (9 upd.)*
Versions and Archs
License: DFSG free
Svn

ABACAS is intended to rapidly contiguate (align, order, orientate), visualize and design primers to close gaps on shotgun assembled contigs based on a reference sequence.

ABACAS uses MUMmer to find alignment positions and identify syntenies of assembled contigs against the reference. The output is then processed to generate a pseudomolecule taking overlapping contigs and gaps in to account. ABACAS generates a comparision file that can be used to visualize ordered and oriented contigs in ACT. Synteny is represented by red bars where colour intensity decreases with lower values of percent identity between comparable blocks. Information on contigs such as the orientation, percent identity, coverage and overlap with other contigs can also be visualized by loading the outputted feature file on ACT.

Please cite: Samuel Assefa, Thomas M. Keane, Thomas D. Otto, Chris Newbold and Matthew Berriman: ABACAS: algorithm-based automatic contiguation of assembled sequences. (PubMed,eprint) Bioinformatics 25(15):1968-1969 (2009)
Acedb-other
retrieval of DNA or protein sequences
Versions of package acedb-other
ReleaseVersionArchitectures
wheezy4.9.39+dfsg.01-5amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie4.9.39+dfsg.01-5amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid4.9.39+dfsg.01-5amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package acedb-other:
biologynuceleic-acids
fieldbiology, biology:bioinformatics
roleprogram
scopeutility
Popcon: 6 users (9 upd.)*
Versions and Archs
License: DFSG free
Svn

This package collects all those smallish applications that acedb collects under its 'other' target of its Makefile.

efetch: presumably short for 'entry fetch' collects sequence information from common DNA and protein databases.

Please cite: L. D. Stein and J. Thierry-Mieg: AceDB: a genome database management system. Computing in Science and Engineering 1(3):44-52 (1999)
Acedb-other-belvu
multiple sequence alignment editor
Versions of package acedb-other-belvu
ReleaseVersionArchitectures
wheezy4.9.39+dfsg.01-5amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie4.9.39+dfsg.01-5amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid4.9.39+dfsg.01-5amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package acedb-other-belvu:
roleprogram
uitoolkitgtk, ncurses
Popcon: 6 users (8 upd.)*
Versions and Archs
License: DFSG free
Svn

For the analysis of biological sequences, a general principle is to corresponding regions between related proteins, RNA or DNA. Written next to each other, corresponding positions above each other, one has prepared an alignment.

Belvu is best known for its perfect implementation of the Stockholm format of multiple sequence alignments, since upstream is maintaining that. That is for instance used in the Pfam and Rfam databases.

Please cite: L. D. Stein and J. Thierry-Mieg: AceDB: a genome database management system. Computing in Science and Engineering 1(3):44-52 (1999)
Acedb-other-dotter
visualisation of sequence similarity
Versions of package acedb-other-dotter
ReleaseVersionArchitectures
wheezy4.9.39+dfsg.01-5amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie4.9.39+dfsg.01-5amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid4.9.39+dfsg.01-5amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package acedb-other-dotter:
roleprogram
uitoolkitgtk, ncurses
Popcon: 5 users (7 upd.)*
Versions and Archs
License: DFSG free
Svn

For the analysis of biological sequences, a general principle is to corresponding regions between related proteins, RNA or DNA.

Dotter displays graphically the similarity of DNA or protein sequence to itself or another sequence.

Please cite: L. D. Stein and J. Thierry-Mieg: AceDB: a genome database management system. Computing in Science and Engineering 1(3):44-52 (1999)
Adun.app
Molecular Simulator for GNUstep
Versions of package adun.app
ReleaseVersionArchitectures
squeeze0.81-4amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy0.81-5amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie0.81-5amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.81-5amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package adun.app:
fieldbiology, biology:structural
interfacex11
roleprogram
scopeapplication
suitegnustep
uitoolkitgnustep
useanalysing, organizing, viewing
works-with3dmodel, db
x11application
Popcon: 23 users (22 upd.)*
Versions and Archs
License: DFSG free
Svn

Adun is a biomolecular simulator that also includes data management and analysis capabilities. It was developed at the Computational Biophysics and Biochemistry Laboratory, a part of the Research Unit on Biomedical Informatics of the UPF.

Please cite: Michael A. Johnston, Ignacio Fdez. Galván and Jordi Villà-Freixa: Framework-based design of a new all-purpose molecular simulation application: The Adun simulator. (PubMed) J. Comp. Chem. 26(15):1647-1659 (2005)
Screenshots of package adun.app
Alien-hunter
Interpolated Variable Order Motifs to identify horizontally acquired DNA
Versions of package alien-hunter
ReleaseVersionArchitectures
squeeze1.7-1all
wheezy1.7-1all
jessie1.7-1all
sid1.7-1all
Debtags of package alien-hunter:
fieldbiology, biology:structural
roleprogram
scopeutility
useanalysing
Popcon: 9 users (10 upd.)*
Versions and Archs
License: DFSG free
Svn

Alien_hunter is an application for the prediction of putative Horizontal Gene Transfer (HGT) events with the implementation of Interpolated Variable Order Motifs (IVOMs). An IVOM approach exploits compositional biases using variable order motif distributions and captures more reliably the local composition of a sequence compared to fixed-order methods. Optionally the predictions can be parsed into a 2-state 2nd order Hidden Markov Model (HMM), in a change-point detection framework, to optimize the localization of the boundaries of the predicted regions. The predictions (embl format) can be automatically loaded into Artemis genome viewer freely available at: http://www.sanger.ac.uk/Software/Artemis/.

The manuscript describing the alien_hunter algorithm is available from Bioinformatics: Interpolated variable order motifs for identification of horizontally acquired DNA: revisiting the Salmonella pathogenicity islands. Vernikos GS, Parkhill J Bioinformatics. 2006;. PMID: 16837528

Please cite: Georgios S. Vernikos and Julian Parkhill: Interpolated variable order motifs for identification of horizontally acquired DNA: revisiting the Salmonella pathogenicity islands. (PubMed,eprint) Bioinformatics 22(18):2196-2203 (2006)
Altree
program to perform phylogeny-based association and localization analysis
Versions of package altree
ReleaseVersionArchitectures
squeeze1.0.1-3amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
sid1.0.1-7hurd-i386
wheezy1.2.1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.2.1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.2.1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package altree:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram, shared-lib
scopeutility
useanalysing, comparing
works-with-formatplaintext
Popcon: 12 users (14 upd.)*
Versions and Archs
License: DFSG free
Svn

ALTree was designed to perform association detection and localization of susceptibility sites using haplotype phylogenetic trees: first, it allows the detection of an association between a candidate gene and a disease, and second, it enables to make hypothesis about the susceptibility loci.

Please cite: Claire Bardel, Vincent Danjean and Emmanuelle Genin: ALTree: association detection and localization of susceptibility sites using haplotype phylogenetic trees. (PubMed,eprint) Bioinformatics 22(11):1402-1403 (2006)
Amap-align
Protein multiple alignment by sequence annealing
Versions of package amap-align
ReleaseVersionArchitectures
squeeze2.2-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2.2-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.2-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.2-3amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package amap-align:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
useanalysing, comparing
works-with-formatplaintext
Popcon: 30 users (36 upd.)*
Versions and Archs
License: DFSG free
Svn

AMAP is a command line tool to perform multiple alignment of peptidic sequences. It utilizes posterior decoding, and a sequence-annealing alignment, instead of the traditional progressive alignment method. It is the only alignment program that allows one to control the sensitivity / specificity tradeoff. It is based on the ProbCons source code, but uses alignment metric accuracy and eliminates the consistency transformation.

The java visualisation tool of AMAP 2.2 is not yet packaged in Debian.

Please cite: Ariel S. Schwartz and Lior Pachter: Multiple alignment by sequence annealing. (eprint) Bioinformatics 23(2):e24-e29 (2007)
Remark of Debian Med team: Dead upstream

The homepage of this project vanished as well as the Download area. An old unmaintained version remained at code.google.com. Please drop the maintainer a note if you have any news of this project.

Ampliconnoise
Programs for the removal of noise from 454 sequenced PCR amplicons
Versions of package ampliconnoise
ReleaseVersionArchitectures
wheezy1.25-1amd64,armel,armhf,i386,kfreebsd-amd64,kfreebsd-i386,powerpc,sparc
jessie1.25-1amd64,armel,armhf,i386,kfreebsd-amd64,kfreebsd-i386,powerpc,sparc
sid1.25-1amd64,armel,armhf,hurd-i386,i386,kfreebsd-amd64,kfreebsd-i386,powerpc,sparc
upstream1.28
Debtags of package ampliconnoise:
roleprogram
Popcon: 7 users (7 upd.)*
Newer upstream!
License: DFSG free
Svn

AmpliconNoise is a package of applications to clean up high-throughput sequence data. It consists of three main parts:

Pyronoise - does flowgram-based clustering to spot misreads SeqNoise - removes PCR point mutations Perseus - removes PCR chimeras without the need for a set of reference sequences

Previously there was a standalone "Pyronoise" by the same authors and this package includes an updated version. There is also a "Denoiser" in Qiime which is related but distinct.

Removing Noise From Pyrosequenced Amplicons Christopher Quince, Anders Lanzen, Russell J Davenport and Peter J Turnbaugh BMC Bioinformatics 2011, 12:38doi:10.1186/1471-2105-12-38

Please cite: Christopher Quince, Anders Lanzen, Russell J Davenport and Peter J Turnbaugh: Removing Noise From Pyrosequenced Amplicons. (PubMed,eprint) BMC Bioinformatics 12:38 (2011)
Aragorn
tRNA and tmRNA detection in nucleotide sequences
Versions of package aragorn
ReleaseVersionArchitectures
jessie1.2.36-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.2.36-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package aragorn:
biologynuceleic-acids
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
works-withbiological-sequence
Popcon: 2 users (0 upd.)*
Versions and Archs
License: DFSG free
Svn

The program employs heuristic algorithms to predict tRNA secondary structure, based on homology with recognized tRNA consensus sequences and ability to form a base-paired cloverleaf. tmRNA genes are identified using a modified version of the BRUCE program.

Please cite: Dean Laslett and Bjorn Canback: ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences.. (PubMed,eprint) Nucleic Acids Research 32(1):11-16 (2004)
Autodock
analysis of ligand binding to protein structure
Versions of package autodock
ReleaseVersionArchitectures
squeeze4.2.3-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy4.2.3-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie4.2.3-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid4.2.3-2s390,s390x,sparc
experimental4.2.5.1-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid4.2.5.1-3amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc
Debtags of package autodock:
fieldbiology, biology:structural
interfacecommandline
roleprogram
scopeutility
useanalysing
works-with3dmodel
Popcon: 19 users (15 upd.)*
Versions and Archs
License: DFSG free
Svn

AutoDock is a prime representative of the programs addressing the simulation of the docking of fairly small chemical ligands to rather big protein receptors. Earlier versions had all flexibility in the ligands while the protein was kept rather ridgid. This latest version 4 also allows for a flexibility of selected sidechains of surface residues, i.e., takes the rotamers into account.

The AutoDock program performs the docking of the ligand to a set of grids describing the target protein. AutoGrid pre-calculates these grids.

The package is enhanced by the following packages: autogrid
Please register by following this link if you are using autodock.
Please cite: Garrett M. Morris, Ruth Huey, William Lindstrom, Michel F. Sanner, Richard K. Belew, David S. Goodsell and Arthur J. Olson: AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility. (PubMed) Journal of Computational Chemistry 30(16):2785-2791 (2009)
Screenshots of package autodock
Autodock-vina
docking of small molecules to proteins
Versions of package autodock-vina
ReleaseVersionArchitectures
wheezy1.1.2-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.1.2-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.1.2-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package autodock-vina:
fieldbiology, biology:structural
interfacex11
roleprogram
works-with3dmodel
x11application
Popcon: 9 users (11 upd.)*
Versions and Archs
License: DFSG free
Svn

AutoDock Vina is a program to support drug discovery, molecular docking and virtual screening of compound libraries. It offers multi-core capability, high performance and enhanced accuracy and ease of use.

The same institute also developed autodock, which is widely used.

O. Trott, A. J. Olson, AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization and multithreading, Journal of Computational Chemistry 31 (2010) 455-461

Please cite: Oleg Trott and Arthur J. Olson: AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. (eprint) Journal of Computational Chemistry 31(2):455-461 (2010)
Autogrid
pre-calculate binding of ligands to their receptor
Versions of package autogrid
ReleaseVersionArchitectures
squeeze4.2.3-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy4.2.3-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie4.2.3-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid4.2.3-2s390,s390x,sparc
experimental4.2.5.1-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid4.2.5.1-3amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc
Debtags of package autogrid:
fieldbiology, biology:structural
interfacecommandline
roleprogram
scopeutility
useanalysing
works-with3dmodel
Popcon: 14 users (10 upd.)*
Versions and Archs
License: DFSG free
Svn

The AutoDockSuite addresses the molecular analysis of the docking of a smaller chemical compounds to their receptors of known three-dimensional structure.

The AutoGrid program performs pre-calculations for the docking of a ligand to a set of grids that describe the effect that the protein has on point charges. The effect of these forces on the ligand is then analysed by the AutoDock program.

Please cite: Garrett M. Morris, Ruth Huey, William Lindstrom, Michel F. Sanner, Richard K. Belew, David S. Goodsell and Arthur J. Olson: AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility. (PubMed) Journal of Computational Chemistry 30(16):2785-2791 (2009)
Ballview
free molecular modeling and molecular graphics tool
Versions of package ballview
ReleaseVersionArchitectures
squeeze1.3.2-2amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy1.4.1+20111206-4amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.4.1+20111206-4amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.4.1+20111206-4amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package ballview:
interfacex11
roleprogram
uitoolkitqt
x11application
Popcon: 18 users (32 upd.)*
Versions and Archs
License: DFSG free
Git

BALLView provides fast OpenGL-based visualization of molecular structures, molecular mechanics methods (minimization, MD simulation using the AMBER, CHARMM, and MMFF94 force fields), calculation and visualization of electrostatic properties (FDPB) and molecular editing features.

BALLView can be considered a graphical user interface on the basis of BALL (Biochemical Algorithms Library) with a focus on the most common demands of protein chemists and biophysicists in particular. It is developed in the groups of Hans-Peter Lenhof (Saarland University, Saarbruecken, Germany) and Oliver Kohlbacher (University of Tuebingen, Germany). BALL is an application framework in C++ that has been specifically designed for rapid software development in Molecular Modeling and Computational Molecular Biology. It provides an extensive set of data structures as well as classes for Molecular Mechanics, advanced solvation methods, comparison and analysis of protein structures, file import/export, and visualization.

Please cite: Andreas Moll, Andreas Hildebrandt, Hans-Peter Lenhof and Oliver Kohlbacher: BALLView: a tool for research and education in molecular modeling. (PubMed,eprint) Bioinformatics 22(3):365-366 (2006)
Screenshots of package ballview
Bedtools
suite of utilities for comparing genomic features
Versions of package bedtools
ReleaseVersionArchitectures
wheezy2.16.1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.17.0-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.17.0-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package bedtools:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopesuite
useanalysing, comparing, converting, filtering
works-withbiological-sequence
Popcon: 25 users (46 upd.)*
Versions and Archs
License: DFSG free
Git

The BEDTools utilities allow one to address common genomics tasks such as finding feature overlaps and computing coverage. The utilities are largely based on four widely-used file formats: BED, GFF/GTF, VCF, and SAM/BAM. Using BEDTools, one can develop sophisticated pipelines that answer complicated research questions by streaming several BEDTools together.

The groupBy utility is distribued in the filo package.

Please cite: Aaron R. Quinlan and Ira M. Hall: BEDTools: a flexible suite of utilities for comparing genomic features. (PubMed,eprint) Bioinformatics 26(6):841-842 (2010)
Biomaj
biological data-bank updater
Versions of package biomaj
ReleaseVersionArchitectures
wheezy1.2.1-1all
jessie1.2.1-2all
sid1.2.1-2all
Debtags of package biomaj:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
Popcon: 5 users (7 upd.)*
Versions and Archs
License: DFSG free
Svn

BioMAJ downloads remote data banks, checks their status and applies transformation workflows, with consistent state, to provide ready-to-use data for biologists and bioinformaticians. For example, it can transform original FASTA files into BLAST indexes. It is very flexible and its post-processing facilities can be extended very easily.

Please cite: Olivier Filangi, Yoann Beausse, Anthony Assi, Ludovic Legrand, Jean-Marc Larré, Véronique Martin, Olivier Collin, Christophe Caron, Hugues Leroy and David Allouche: BioMAJ: a flexible framework for databanks synchronization and processing. (PubMed,eprint) Oxford Journals Bioinformatics 24(16):1823-1825 (2008)
Blast2
Basic Local Alignment Search Tool
Versions of package blast2
ReleaseVersionArchitectures
squeeze2.2.21.20090809-2amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2.2.26.20120620-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.2.26.20120620-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.2.26.20120620-2armhf,mips,s390,sparc
sid2.2.26.20120620-3amd64,armel,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mipsel,powerpc,s390x
squeeze2.2.21.20090809-2amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2.2.26.20120620-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.2.26.20120620-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.2.26.20120620-2armhf,mips,s390,sparc
sid2.2.26.20120620-3amd64,armel,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mipsel,powerpc,s390x
Debtags of package blast2:
biologynuceleic-acids, peptidic
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
sciencecalculation
scopeutility
usesearching
works-withbiological-sequence
biologynuceleic-acids, peptidic
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
sciencecalculation
scopeutility
usesearching
works-withbiological-sequence
Popcon: 69 users (52 upd.)*
Versions and Archs
License: DFSG free
Git

The famous sequence alignment program. This is "official" NCBI version, #2. The blastall executable allows you to give a nucleotide or protein sequence to the program. It is compared against databases and a summary of matches is returned to the user.

Note that databases are not included in Debian; they must be retrieved manually.

The package is enhanced by the following packages: mcl
Bowtie
Ultrafast memory-efficient short read aligner
Versions of package bowtie
ReleaseVersionArchitectures
wheezy0.12.7-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,powerpc,s390,s390x,sparc
jessie1.0.0-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.0.0-5amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package bowtie:
biologynuceleic-acids
fieldbiology:bioinformatics
interfacecommandline
roleprogram
sciencecalculation
scopeutility
useanalysing, comparing
works-withbiological-sequence
Popcon: 23 users (44 upd.)*
Versions and Archs
License: DFSG free
Svn

This package addresses the problem to interpret the results from the latest (2010) DNA sequencing technologies. Those will yield fairly short stretches and those cannot be interpreted directly. It is the challenge for tools like Bowtie to give a chromosomal location to the short stretches of DNA sequenced per run.

Bowtie aligns short DNA sequences (reads) to the human genome at a rate of over 25 million 35-bp reads per hour. Bowtie indexes the genome with a Burrows-Wheeler index to keep its memory footprint small: typically about 2.2 GB for the human genome (2.9 GB for paired-end).

The package is enhanced by the following packages: bowtie-examples
Please cite: Ben Langmead, Cole Trapnell, Mihai Pop and Steven L Salzberg: Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. (eprint) Genome Biology 10:R25 (2009)
Bowtie2
ultrafast memory-efficient short read aligner
Versions of package bowtie2
ReleaseVersionArchitectures
wheezy2.0.0-beta6-3amd64,i386,kfreebsd-amd64,kfreebsd-i386
jessie2.1.0-1amd64,i386,kfreebsd-amd64,kfreebsd-i386
sid2.1.0-1amd64,hurd-i386,i386,kfreebsd-amd64,kfreebsd-i386
Debtags of package bowtie2:
biologynuceleic-acids
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
sciencecalculation
scopeutility
useanalysing
works-withbiological-sequence, file
Popcon: 13 users (20 upd.)*
Versions and Archs
License: DFSG free
Git

is an ultrafast and memory-efficient tool for aligning sequencing reads to long reference sequences. It is particularly good at aligning reads of about 50 up to 100s or 1,000s of characters, and particularly good at aligning to relatively long (e.g. mammalian) genomes.

Bowtie 2 indexes the genome with an FM Index to keep its memory footprint small: for the human genome, its memory footprint is typically around 3.2 GB. Bowtie 2 supports gapped, local, and paired-end alignment modes

The package is enhanced by the following packages: bowtie2-examples
Boxshade
Pretty-printing of multiple sequence alignments
Versions of package boxshade
ReleaseVersionArchitectures
squeeze3.3.1-4amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy3.3.1-7+wheezy1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie3.3.1-7+wheezy1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid3.3.1-7+wheezy1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
experimental3.3.1-8~experimental1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package boxshade:
biologyformat:aln, nuceleic-acids, peptidic
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
usetypesetting
works-with-formathtml, plaintext, postscript, tex
Popcon: 18 users (15 upd.)*
Versions and Archs
License: DFSG free
Svn

Boxshade is a program for creating good looking printouts from multiple-aligned protein or DNA sequences. The program does not perform the alignment by itself and requires as input a file that was created by a multiple alignment program or manually edited with respective tools.

Boxshade reads multiple-aligned sequences from either PILEUP-MSF, CLUSTAL-ALN, MALIGNED-data and ESEE-save files (limited to a maximum of 150 sequences with up to 10000 elements each). Various kinds of shading can be applied to identical/similar residues. Output is written to screen or to a file in the following formats: ANSI/VT100, PS/EPS, RTF, HPGL, ReGIS, LJ250-printer, ASCII, xFIG, PICT, HTML

Bwa
Burrows-Wheeler Aligner
Versions of package bwa
ReleaseVersionArchitectures
squeeze0.5.8c-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy0.6.2-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie0.6.2-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.6.2-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream0.7.5a
Debtags of package bwa:
biologynuceleic-acids, peptidic
fieldbiology, biology:bioinformatics
interfacecommandline, text-mode
roleprogram
useanalysing, comparing
Popcon: 33 users (26 upd.)*
Newer upstream!
License: DFSG free
Git

Burrows-Wheeler Aligner (BWA) is a program that aligns relatively short nucleotide sequences against a long reference sequence such as the human genome. It implements two algorithms, bwa-short and BWA-SW. The former works for query sequences shorter than 200 bp and the latter for longer sequences up to around 100 kbp. Both algorithms do gapped alignment. They are usually more accurate and faster on queries with low error rates.

Please cite: Heng Li and Richard Durbin: Fast and accurate short read alignment with Burrows-Wheeler transform. (PubMed,eprint) Bioinformatics 25(14):1754-1760 (2009)
Cd-hit
suite of programs designed to quickly group sequences
Versions of package cd-hit
ReleaseVersionArchitectures
wheezy4.6-2012-04-25-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie4.6-2012-04-25-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid4.6-2012-04-25-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream4.6.1-2012-08-27
Popcon: 7 users (10 upd.)*
Newer upstream!
License: DFSG free
Svn

cd-hit contains a number of programs designed to quickly group sequences. cd-hit groups proteins into clusters that meet a user-defined similarity threshold. cd-hit-est is similar to cd-hit, but designed to group nucleotide sequences (without introns). cd-hit-est-2d is similar to cd-hit-2d but designed to compare two nucleotide datasets. A number of other related programs are also in this package. Please see the cd-hit user manual, also part of this package, for further information.

Cdbfasta
Constant DataBase indexing and retrieval tools for multi-FASTA files
Versions of package cdbfasta
ReleaseVersionArchitectures
jessie0.99-20100722-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.99-20100722-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 1 users (0 upd.)*
Versions and Archs
License: DFSG free
Svn

CDB (Constant DataBase) can be used for creating indices for quick retrieval of any particular sequences from large multi-FASTA files. It has the option to compress data records in order to save space.

Clonalframe
inference of bacterial microevolution using multilocus sequence data
Versions of package clonalframe
ReleaseVersionArchitectures
wheezy1.2-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.2-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.2-3amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package clonalframe:
roleprogram
Popcon: 4 users (6 upd.)*
Versions and Archs
License: DFSG free
Svn

ClonalFrame identifies the clonal relationships between the members of a sample, while also estimating the chromosomal position of homologous recombination events that have disrupted the clonal inheritance.

ClonalFrame can be applied to any kind of sequence data, from a single fragment of DNA to whole genomes. It is well suited for the analysis of MLST data, where 7 gene fragments have been sequenced, but becomes progressively more powerful as the sequenced regions increase in length and number up to whole genomes. However, it requires the sequences to be aligned. If you have genomic data that is not aligned, we recommend using Mauve which produces alignment of whole bacterial genomes in exactly the format required for analysis with ClonalFrame.

Please cite: Xavier Didelot and Daniel Falush: Inference of Bacterial Microevolution Using Multilocus Sequence Data. (PubMed,eprint) Genetics Advance 175:1251-1266 (2006)
Clustalo
General purpose multiple sequence alignment program for proteins
Versions of package clustalo
ReleaseVersionArchitectures
wheezy1.1.0-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.1.0-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.1.0-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream1.1.1
Popcon: 8 users (13 upd.)*
Newer upstream!
License: DFSG free
Svn

Clustal-Omega is a general purpose multiple sequence alignment (MSA) program for proteins. It produces high quality MSAs and is capable of handling data-sets of hundreds of thousands of sequences in reasonable time.

Please cite: Fabian Sievers, Andreas Wilm, David Dineen, Toby J Gibson, Kevin Karplus, Weizhong Li, Rodrigo Lopez, Hamish McWilliam, Michael Remmert, Johannes Söding, Julie D Thompson and Desmond G Higgins: Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. (PubMed) Molecular Systems Biology 7(539) (2011)
Clustalw
global multiple nucleotide or peptide sequence alignment
Versions of package clustalw
ReleaseVersionArchitectures
squeeze2.0.12-1 (non-free)amd64,armel,i386,ia64,mips,mipsel,powerpc,s390,sparc
wheezy2.1+lgpl-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.1+lgpl-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.1+lgpl-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package clustalw:
biologyformat:aln, nuceleic-acids, peptidic
fieldbiology, biology:bioinformatics
interfacecommandline, text-mode
roleprogram
scopeutility
usecomparing
works-with-formatplaintext
Popcon: 42 users (47 upd.)*
Versions and Archs
License: DFSG free
Git

This program performs an alignment of multiple nucleotide or amino acid sequences. It recognizes the format of input sequences and whether the sequences are nucleic acid (DNA/RNA) or amino acid (proteins). The output format may be selected from in various formats for multiple alignments such as Phylip or FASTA. Clustal W is very well accepted.

The output of Clustal W can be edited manually but preferably with an alignment editor like SeaView or within its companion Clustal X. When building a model from your alignment, this can be applied for improved database searches. The Debian package hmmer creates such in form of an HMM.

The package is enhanced by the following packages: clustalw-mpi
Please cite: M. A. Larkin, G. Blackshields, N. P. Brown, R. Chenna, P. A. McGettigan, H. McWilliam, F. Valentin, I.M. Wallace, A. Wilm, R. Lopez, J. D. Thompson, T. J. Gibson and D. G. Higgins: Clustal W and Clustal X version 2.0. (PubMed,eprint) Bioinformatics 23(21):2947-2948 (2007)
Clustalx
Multiple alignment of nucleic acid and protein sequences (graphical interface)
Versions of package clustalx
ReleaseVersionArchitectures
squeeze1.83-4 (non-free)amd64,armel,i386,ia64,mips,mipsel,powerpc,s390,sparc
wheezy2.1+lgpl-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.1+lgpl-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.1+lgpl-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package clustalx:
biologyformat:aln, nuceleic-acids, peptidic
fieldbiology, biology:bioinformatics
interfacex11
roleprogram
scopeutility
uitoolkitmotif
useanalysing, comparing, viewing
works-with-formatplaintext
x11application
Popcon: 80 users (5 upd.)*
Versions and Archs
License: DFSG free
Git

This package offers a GUI interface for the Clustal multiple sequence alignment program. It provides an integrated environment for performing multiple sequence- and profile-alignments to analyse the results. The sequence alignment is displayed in a window on the screen. A versatile coloring scheme has been incorporated to highlight conserved features in the alignment. For professional presentations, one should use the texshade LaTeX package or boxshade.

The pull-down menus at the top of the window allow you to select all the options required for traditional multiple sequence and profile alignment. You can cut-and-paste sequences to change the order of the alignment; you can select a subset of sequences to be aligned; you can select a sub-range of the alignment to be realigned and inserted back into the original alignment.

An alignment quality analysis can be performed and low-scoring segments or exceptional residues can be highlighted.

Please cite: M.A. Larkin, G. Blackshields, N.P. Brown, R. Chenna, P.A. McGettigan, H. McWilliam, F. Valentin, I.M. Wallace, A. Wilm, R. Lopez, J.D. Thompson, T.J. Gibson and D.G. Higgins: Clustal W and Clustal X version 2.0. (PubMed,eprint) Bioinformatics 23(21):2947-2948 (2007)
Concavity
predictor of protein ligand binding sites from structure and conservation
Versions of package concavity
ReleaseVersionArchitectures
jessie0.1-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.1-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 2 users (1 upd.)*
Versions and Archs
License: DFSG free
Svn

ConCavity predicts protein ligand binding sites by combining evolutionary sequence conservation and 3D structure.

ConCavity takes as input a PDB format protein structure and optionally files that characterize the evolutionary sequence conservation of the chains in the structure file.

The following result files are produced by default:

  • Residue ligand binding predictions for each chain (*.scores).
  • Residue ligand binding predictions in a PDB format file (residue scores placed in the temp. factor field, *_residue.pdb).
  • Pocket prediction locations in a DX format file (*.dx).
  • PyMOL script to visualize the predictions (*.pml).

ConCavity has many features. The default run of concavity is equivalent to ConCavity^L in the paper: 'Capra JA, Laskowski RA, Thornton JM, Singh M, and Funkhouser TA(2009) Predicting Protein Ligand Binding Sites by Combining Evolutionary Sequence Conservation and 3D Structure. PLoS Comput Biol, 5(12).'.

score_conservation(1) from the conservation-code package can be used to calculate evolutionary sequence conservation for concavity.

The package is enhanced by the following packages: conservation-code
Please cite: John A. Capra, Roman A. Laskowski, Janet M. Thornton, Mona Singh and Thomas A. Funkhouser: Predicting Protein Ligand Binding Sites by Combining Evolutionary Sequence Conservation and 3D Structure. (PubMed) PLoS Computational Biology 5(12):e1000585 (2009)
Conservation-code
protein sequence conservation scoring tool
Versions of package conservation-code
ReleaseVersionArchitectures
jessie20110309.0-1all
sid20110309.0-3all
Popcon: 1 users (1 upd.)*
Versions and Archs
License: DFSG free
Svn

This package provides score_conservation(1), a tool to score protein sequence conservation.

The following conservation scoring methods are implemented:

  • sum of pairs
  • weighted sum of pairs
  • Shannon entropy
  • Shannon entropy with property groupings (Mirny and Shakhnovich 1995, Valdar and Thornton 2001)
  • relative entropy with property groupings (Williamson 1995)
  • von Neumann entropy (Caffrey et al 2004)
  • relative entropy (Samudrala and Wang 2006)
  • Jensen-Shannon divergence (Capra and Singh 2007)

A window-based extension that incorporates the estimated conservation of sequentially adjacent residues into the score for each column is also given. This window approach can be applied to any of the conservation scoring methods.

The program accepts alignments in the CLUSTAL and FASTA formats.

The sequence-specific output can be used as the conservation input for concavity.

Conservation is highly predictive in identifying catalytic sites and residues near bound ligands.

Please cite: John A. Capra and Mona Singh: Predicting functionally important residues from sequence conservation. (PubMed) Bioinformatics 23(15):1875-82 (2007)
Dialign
Segment-based multiple sequence alignment
Versions of package dialign
ReleaseVersionArchitectures
squeeze2.2.1-3amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2.2.1-5amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.2.1-5amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.2.1-5amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package dialign:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
usecomparing
works-with-formatplaintext
Popcon: 32 users (43 upd.)*
Versions and Archs
License: DFSG free
Svn

DIALIGN2 is a command line tool to perform multiple alignment of protein or DNA sequences. It constructs alignments from gapfree pairs of similar segments of the sequences. This scoring scheme for alignments is the basic difference between DIALIGN and other global or local alignment methods. Note that DIALIGN does not employ any kind of gap penalty.

Please cite: Burkhard Morgenstern: DIALIGN 2: improvement of the segment-to-segment approach to multiple sequence alignment. (PubMed,eprint) Bioinformatics 15(3):211-218 (1999)
Dialign-tx
Segment-based multiple sequence alignment
Versions of package dialign-tx
ReleaseVersionArchitectures
squeeze1.0.2-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy1.0.2-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.0.2-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.0.2-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package dialign-tx:
fieldbiology, biology:bioinformatics
roleprogram
scopeutility
usecomparing
works-with-formatplaintext
Popcon: 13 users (11 upd.)*
Versions and Archs
License: DFSG free
Svn

DIALIGN-TX is a command line tool to perform multiple alignment of protein or DNA sequences. It is a complete reimplementation of the segment-base approach including several new improvements and heuristics that significantly enhance the quality of the output alignments compared to DIALIGN 2.2 and DIALIGN-T. For pairwise alignment, DIALIGN-TX uses a fragment-chaining algorithm that favours chains of low-scoring local alignments over isolated high-scoring fragments. For multiple alignment, DIALIGN-TX uses an improved greedy procedure that is less sensitive to spurious local sequence similarities.

Please cite: Amarendran R. Subramanian, Michael Kaufmann and Burkhard Morgenstern: DIALIGN-TX: greedy and progressive approaches for segment-based multiple sequence alignment. (PubMed) Algorithms for Molecular Biology 3(1):6 (2008)
Disulfinder
cysteines disulfide bonding state and connectivity predictor
Versions of package disulfinder
ReleaseVersionArchitectures
wheezy1.2.11-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.2.11-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.2.11-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package disulfinder:
roleprogram
Popcon: 6 users (8 upd.)*
Versions and Archs
License: DFSG free
Svn

'disulfinder' is for predicting the disulfide bonding state of cysteines and their disulfide connectivity starting from sequence alone. Disulfide bridges play a major role in the stabilization of the folding process for several proteins. Prediction of disulfide bridges from sequence alone is therefore useful for the study of structural and functional properties of specific proteins. In addition, knowledge about the disulfide bonding state of cysteines may help the experimental structure determination process and may be useful in other genomic annotation tasks.

'disulfinder' predicts disulfide patterns in two computational stages: (1) the disulfide bonding state of each cysteine is predicted by a BRNN-SVM binary classifier; (2) cysteines that are known to participate in the formation of bridges are paired by a Recursive Neural Network to obtain a connectivity pattern.

Please cite: A. Ceroni, A. Passerini, A. Vullo and P. Frasconi: DISULFIND: a disulfide bonding state and cysteine connectivity prediction server.. (PubMed) Nucleic Acids Res 34(Web Server issue):W177-81 (2006)
Dotur
distance based OTU and richness estimation
Versions of package dotur
ReleaseVersionArchitectures
wheezy1.53-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.53-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.53-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package dotur:
roleprogram
Popcon: 5 users (7 upd.)*
Versions and Archs
License: DFSG free
Svn

DOTUR is a computer program that takes a distance matrix describing the genetic distance between DNA sequence data and assigns sequences to operational taxonomic units (OTUs) using either the furthest, average, or nearest neighbor algorithms for all possible distances that can be described using the distance matrix. Using the OTU composition data, DOTUR constructs collector's and rarefaction curves for sampling intensity, richness estimators, and diversity indices.

Dssp
protein secondary structure assignment based on 3D structure
Versions of package dssp
ReleaseVersionArchitectures
wheezy2.0.4-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.1.0-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.1.0-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 8 users (15 upd.)*
Versions and Archs
License: DFSG free
Svn

DSSP is an application you use to assign the secondary structure of a protein based on its solved three dimensional (3D) structure.

This version (2) of DSSP is a rewrite that produces the same output as the original DSSP, but deals better with exceptions in PDB files and is much faster.

Edtsurf
triangulated mesh surfaces for protein structures
Maintainer: Steffen Moeller
Versions of package edtsurf
ReleaseVersionArchitectures
jessie0.2009-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.2009-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 2 users (0 upd.)*
Versions and Archs
License: DFSG free
Svn

EDTSurf is a open source program to construct triangulated surfaces for macromolecules. It generates three major macromolecular surfaces: van der Waals surface, solvent-accessible surface and molecular surface (solvent-excluded surface). EDTsurf also identifies cavities which are inside of macromolecules.

Embassy-domainatrix
Extra EMBOSS commands to handle domain classification file
Versions of package embassy-domainatrix
ReleaseVersionArchitectures
wheezy0.1.0+20110714-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie0.1.0+20110714-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.1.0+20110714-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream0.1.650
Debtags of package embassy-domainatrix:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
useanalysing, converting, editing, searching
works-with-formatplaintext
Popcon: 5 users (6 upd.)*
Newer upstream!
License: DFSG free
Svn

The DOMAINATRIX programs were developed by Jon Ison and colleagues at MRC HGMP for their protein domain research. They are included as an EMBASSY package as a work in progress.

Applications in the current domainatrix release are cathparse (generates DCF file from raw CATH files), domainnr (removes redundant domains from a DCF file), domainreso (removes low resolution domains from a DCF file), domainseqs (adds sequence records to a DCF file), domainsse (adds secondary structure records to a DCF file), scopparse (generates DCF file from raw SCOP files) and ssematch (searches a DCF file for secondary structure matches).

Embassy-domalign
Extra EMBOSS commands for protein domain alignment
Versions of package embassy-domalign
ReleaseVersionArchitectures
wheezy0.1.0+20110714-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie0.1.0+20110714-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.1.0+20110714-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream0.1.650
Debtags of package embassy-domalign:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
useanalysing, comparing, editing
works-with-formatplaintext
Popcon: 7 users (6 upd.)*
Newer upstream!
License: DFSG free
Svn

The DOMALIGN programs were developed by Jon Ison and colleagues at MRC HGMP for their protein domain research. They are included as an EMBASSY package as a work in progress.

Applications in the current domalign release are allversusall (sequence similarity data from all-versus-all comparison), domainalign (generates alignments (DAF file) for nodes in a DCF file), domainrep (reorders DCF file to identify representative structures) and seqalign (extend alignments (DAF file) with sequences (DHF file)).

Embassy-domsearch
Extra EMBOSS commands to search for protein domains
Versions of package embassy-domsearch
ReleaseVersionArchitectures
wheezy0.1.0+20110714-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie0.1.0+20110714-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.1.0+20110714-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream0.1.650
Debtags of package embassy-domsearch:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
useanalysing
Popcon: 5 users (6 upd.)*
Newer upstream!
License: DFSG free
Svn

The DOMSEARCH programs were developed by Jon Ison and colleagues at MRC HGMP for their protein domain research. They are included as an EMBASSY package as a work in progress.

Applications in this DOMSEARCH release are seqfraggle (removes fragment sequences from DHF files), seqnr (removes redundancy from DHF files), seqsearch (generates PSI-BLAST hits (DHF file) from a DAF file), seqsort (Remove ambiguous classified sequences from DHF files) and seqwords (Generates DHF files from keyword search of UniProt).

Emboss
european molecular biology open software suite
Versions of package emboss
ReleaseVersionArchitectures
squeeze6.1.0-5amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy6.4.0-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie6.4.0-4amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid6.4.0-4amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream6.5.7
Debtags of package emboss:
fieldbiology, biology:bioinformatics, biology:molecular
interfacecommandline
roleprogram
scopesuite
useanalysing, comparing, converting, editing, organizing, searching, text-formatting, typesetting, viewing
works-withdb
works-with-formatplaintext
Popcon: 113 users (39 upd.)*
Newer upstream!
License: DFSG free
Git

EMBOSS is a free Open Source software analysis package specially developed for the needs of the molecular biology (e.g. EMBnet) user community. The software automatically copes with data in a variety of formats and even allows transparent retrieval of sequence data from the web. Also, as extensive libraries are provided with the package, it is a platform to allow other scientists to develop and release software in true open source spirit. EMBOSS also integrates a range of currently available packages and tools for sequence analysis into a seamless whole. EMBOSS breaks the historical trend towards commercial software packages.

The package is enhanced by the following packages: clustalw primer3
Screenshots of package emboss
Exonerate
generic tool for pairwise sequence comparison
Versions of package exonerate
ReleaseVersionArchitectures
squeeze2.2.0-2amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2.2.0-6amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.2.0-6amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.2.0-6amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package exonerate:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
usesearching
works-with-formatplaintext
Popcon: 29 users (34 upd.)*
Versions and Archs
License: DFSG free
Svn

Exonerate allows you to align sequences using a many alignment models, using either exhaustive dynamic programming, or a variety of heuristics. Much of the functionality of the Wise dynamic programming suite was reimplemented in C for better efficiency. Exonerate is an intrinsic component of the building of the Ensembl genome databases, providing similarity scores between RNA and DNA sequences and thus determining splice variants and coding sequences in general.

An In-silico PCR Experiment Simulation System (see the ipcress man page) is packaged with exonerate.

This package also comes with a selection of utilities for performing simple manipulations quickly on fasta files beyond 2Gb

Please cite: Guy C. Slater and Ewan Birney: Automated generation of heuristics for biological sequence comparison. (PubMed,eprint) BMC Bioinformatics 6(1):31 (2005)
Fastdnaml
Tool for construction of phylogenetic trees of DNA sequences
Versions of package fastdnaml
ReleaseVersionArchitectures
squeeze1.2.2-9amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy1.2.2-10amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.2.2-10amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.2.2-10amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package fastdnaml:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
useanalysing, comparing
works-with-formatplaintext
Popcon: 13 users (10 upd.)*
Versions and Archs
License: DFSG free
Svn

fastDNAml is a program derived from Joseph Felsenstein's version 3.3 DNAML (part of his PHYLIP package). Users should consult the documentation for DNAML before using this program.

fastDNAml is an attempt to solve the same problem as DNAML, but to do so faster and using less memory, so that larger trees and/or more bootstrap replicates become tractable. Much of fastDNAml is merely a recoding of the PHYLIP 3.3 DNAML program from PASCAL to C.

Note that the homepage of this program is not available any more and so this program will probably not see any further updates.

Please cite: Gary J. Olsen, Hideo Matsuda, Ray Hagstrom and Ross Overbeek: fastDNAml: a tool for construction of phylogenetic trees of DNA sequences using maximum likelihood. (PubMed,eprint) Comput Appl Biosci 10(1):41-48 (1994)
Fastlink
faster version of pedigree programs of Linkage
Versions of package fastlink
ReleaseVersionArchitectures
squeeze4.1P-fix95-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy4.1P-fix95-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie4.1P-fix95-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid4.1P-fix95-3amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package fastlink:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
useanalysing, comparing
Popcon: 13 users (10 upd.)*
Versions and Archs
License: DFSG free
Svn

Genetic linkage analysis is a statistical technique used to map genes and find the approximate location of disease genes. There was a standard software package for genetic linkage called LINKAGE. FASTLINK is a significantly modified and improved version of the main programs of LINKAGE that runs much faster sequentially, can run in parallel, allows the user to recover gracefully from a computer crash, and provides abundant new documentation. FASTLINK has been used in over 1000 published genetic linkage studies.

This package contains the following programs:

 ilink:    GEMINI optimization procedure to find a locally
           optimal value of the theta vector of recombination
           fractions
 linkmap:  calculates location scores of one locus against a
           fixed map of other loci
 lodscore: compares likelihoods at locally optimal theta
 mlink:    calculates lod scores and risk with two of more loci
 unknown:  identify possible genotypes for unknowns
Please cite: R. W. Cottingham Jr., R. M. Idury and A. A. Schaffer: Faster Sequential Genetic Linkage Computations. (PubMed,eprint) American Journal of Human Genetics 53(1):252-263 (1993)
Fastqc
quality control for high throughput sequence data
Versions of package fastqc
ReleaseVersionArchitectures
sid0.10.1+dfsg-1all
Popcon: 0 users (0 upd.)*
Versions and Archs
License: DFSG free
Svn

FastQC aims to provide a simple way to do some quality control checks on raw sequence data coming from high throughput sequencing pipelines. It provides a modular set of analyses which you can use to give a quick impression of whether your data has any problems of which you should be aware before doing any further analysis.

The main functions of FastQC are

  • Import of data from BAM, SAM or FastQ files (any variant)
  • Providing a quick overview to tell you in which areas there may be problems
  • Summary graphs and tables to quickly assess your data
  • Export of results to an HTML based permanent report
  • Offline operation to allow automated generation of reports without running the interactive application
Fasttree
phylogenetic trees from alignments of nucleotide or protein sequences
Versions of package fasttree
ReleaseVersionArchitectures
wheezy2.1.4-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.1.7-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.1.7-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 8 users (17 upd.)*
Versions and Archs
License: DFSG free
Svn

FastTree infers approximately-maximum-likelihood phylogenetic trees from alignments of nucleotide or protein sequences. It handles alignments with up to a million of sequences in a reasonable amount of time and memory. For large alignments, FastTree is 100-1,000 times faster than PhyML 3.0 or RAxML 7.

FastTree is more accurate than PhyML 3 with default settings, and much more accurate than the distance-matrix methods that are traditionally used for large alignments. FastTree uses the Jukes-Cantor or generalized time-reversible (GTR) models of nucleotide evolution and the JTT (Jones-Taylor-Thornton 1992) model of amino acid evolution. To account for the varying rates of evolution across sites, FastTree uses a single rate for each site (the "CAT" approximation). To quickly estimate the reliability of each split in the tree, FastTree computes local support values with the Shimodaira-Hasegawa test (these are the same as PhyML 3's "SH-like local supports").

This package contains a single threaded version (fasttree) and a parallel version which uses OpenMP (fasttreMP).

Please cite: Morgan N. Price, Paramvir S. Dehal and Adam P. Arkin: FastTree 2 -- Approximately Maximum-Likelihood Trees for Large Alignments.. (PubMed,eprint) PLoS ONE 5(3):e9490 (2010)
Fastx-toolkit
FASTQ/A short nucleotide reads pre-processing tools
Versions of package fastx-toolkit
ReleaseVersionArchitectures
wheezy0.0.13.2-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie0.0.13.2-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.0.13.2-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package fastx-toolkit:
roleprogram
Popcon: 11 users (6 upd.)*
Versions and Archs
License: DFSG free
Git

The FASTX-Toolkit is a collection of command line tools for preprocessing short nucleotide reads in FASTA and FASTQ formats, usually produced by Next-Generation sequencing machines. The main processing of such FASTA/FASTQ files is mapping (aligning) the sequences to reference genomes or other databases using specialized programs like BWA, Bowtie and many others. However, it is sometimes more productive to preprocess the FASTA/FASTQ files before mapping the sequences to the genome—manipulating the sequences to produce better mapping results. The FASTX-Toolkit tools perform some of these preprocessing tasks.

Ffindex
simple index/database for huge amounts of small files
Versions of package ffindex
ReleaseVersionArchitectures
wheezy0.9.6.1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie0.9.9.1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.9.9.1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 4 users (20 upd.)*
Versions and Archs
License: DFSG free
Svn

FFindex is a very simple index/database for huge amounts of small files. The files are stored concatenated in one big data file, separated by '\0'. A second file contains a plain text index, giving name, offset and length of the small files. The lookup is currently done with a binary search on an array made from the index file.

This package provides the executables.

Figtree
graphical phylogenetic tree viewer
Versions of package figtree
ReleaseVersionArchitectures
wheezy1.3.1-1all
jessie1.3.1-2all
sid1.3.1-2all
upstream1.4
Popcon: 9 users (17 upd.)*
Newer upstream!
License: DFSG free
Svn

FigTree is designed as a graphical viewer of phylogenetic trees and as a program for producing publication-ready figures. In particular it is designed to display summarized and annotated trees produced by BEAST.

Screenshots of package figtree
Filo
FILe and stream Operations
Versions of package filo
ReleaseVersionArchitectures
wheezy1.1+2011020401.2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.1+2011020401.2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.1+2011020401.2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 10 users (7 upd.)*
Versions and Archs
License: DFSG free
Git

The following tools are available as part of the filo package:

groupBy – mimics the “groupBy” clause in database systems.

shuffle – randomize the order of lines in a file.

stats – computes descriptive statistic on a given column of a tab-delimited file or stream.

Because their name is too generic, ‘shuffle’ and ‘stats’ are relocated in /usr/lib/filo.

Flexbar
flexible barcode and adapter removal for sequencing platforms
Versions of package flexbar
ReleaseVersionArchitectures
sid2.32-1amd64,ia64
upstream2.34
Popcon: 1 users (0 upd.)*
Newer upstream!
License: DFSG free
Svn

Flexbar preprocesses high-throughput sequencing data efficiently. It demultiplexes barcoded runs and removes adapter sequences. Moreover, trimming and filtering features are provided. Flexbar increases mapping rates and improves genome and transcriptome assemblies. It supports next-generation sequencing data in fasta/q and csfasta/q format from Illumina, Roche 454, and the SOLiD platform.

Parameter names changed in Flexbar. Please review scripts. The recent months, default settings were optimised, several bugs were fixed and various improvements were made, e.g. revamped command-line interface, new trimming modes as well as lower time and memory requirements.

Gamgi
General Atomistic Modelling Graphic Interface (GAMGI)
Maintainer: Debichem Team
Versions of package gamgi
ReleaseVersionArchitectures
squeeze0.14.8-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy0.15.8-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie0.16.4-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.16.5-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package gamgi:
roleprogram
uitoolkitgtk
Popcon: 20 users (53 upd.)*
Versions and Archs
License: DFSG free
Svn

The General Atomistic Modelling Graphic Interface (GAMGI) provides a graphical interface to build, view and analyze atomic structures. The program is aimed at the scientific community and provides a graphical interface to study atomic structures and to prepare images for presentations, and for teaching the atomic structure of matter.

The package is enhanced by the following packages: gamgi-data gamgi-doc
Screenshots of package gamgi
Garlic
A visualization program for biomolecules
Versions of package garlic
ReleaseVersionArchitectures
squeeze1.6-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy1.6-1.1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.6-1.1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.6-1.1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package garlic:
fieldbiology, chemistry
interfacex11
roleprogram
scopeutility
uitoolkitxlib
useviewing
x11application
Popcon: 22 users (31 upd.)*
Versions and Archs
License: DFSG free
Svn

Garlic is written for the investigation of membrane proteins. It may be used to visualize other proteins, as well as some geometric objects. This version of garlic recognizes PDB format version 2.1. Garlic may also be used to analyze protein sequences.

It only depends on the X libraries, no other libraries are needed.

Features include:

  • The slab position and thickness are visible in a small window.
  • Atomic bonds as well as atoms are treated as independent drawable objects.
  • The atomic and bond colors depend on position. Five mapping modes are available (as for slab).
  • Capable to display stereo image.
  • Capable to display other geometric objects, like membrane.
  • Atomic information is available for atom covered by the mouse pointer. No click required, just move the mouse pointer over the structure!
  • Capable to load more than one structure.
  • Capable to draw Ramachandran plot, helical wheel, Venn diagram, averaged hydrophobicity and hydrophobic moment plot.
  • The command prompt is available at the bottom of the main window. It is able to display one error message and one command string.
Gbrowse
GMOD Generic Genome Browser
Versions of package gbrowse
ReleaseVersionArchitectures
wheezy2.48~dfsg-1all
jessie2.54+dfsg-1all
sid2.54+dfsg-1all
sid2.54+dfsg-2all
Debtags of package gbrowse:
fieldbiology, biology:bioinformatics
interfaceweb
roleprogram
useanalysing, viewing
webapplication, cgi
Popcon: 38 users (3 upd.)*
Versions and Archs
License: DFSG free
Svn

Generic Genome Browser is a simple but highly configurable web-based genome browser. It is a component of the Generic Model Organism Systems Database project (GMOD). Some of its features:

  • Simultaneous bird's eye and detailed views of the genome;
  • Scroll, zoom, center;
  • Attach arbitrary URLs to any annotation;
  • Order and appearance of tracks are customizable by administrator and end-user;
  • Search by annotation ID, name, or comment;
  • Supports third party annotation using GFF formats;
  • Settings persist across sessions;
  • DNA and GFF dumps;
  • Connectivity to different databases, including BioSQL and Chado;
  • Multi-language support;
  • Third-party feature loading;
  • Customizable plug-in architecture (e.g. run BLAST, dump & import many formats, find oligonucleotides, design primers, create restriction maps, edit features).
The package is enhanced by the following packages: libbio-samtools-perl
Please cite: Maureen J. Donlin: Using the Generic Genome Browser (GBrowse). (eprint) Department of Biochemistry and Molecular Biology and Department of Molecular Microbiology and Immunology, Saint Louis University School of Medicine (2009)
Gdpc
visualiser of molecular dynamic simulations
Versions of package gdpc
ReleaseVersionArchitectures
squeeze2.2.5-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2.2.5-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.2.5-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.2.5-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package gdpc:
fieldbiology, biology:structural, chemistry, physics
interfacex11
roleprogram
scopeapplication
uitoolkitgtk
useviewing
works-with3dmodel, image, video
works-with-formatjpg, png
x11application
Popcon: 18 users (20 upd.)*
Versions and Archs
License: DFSG free
Svn

gpdc is a graphical program for visualising output data from molecular dynamics simulations. It reads input in the standard xyz format, as well as other custom formats, and can output pictures of each frame in JPG or PNG format.

Other screenshots of package gdpc
VersionURL
2.2.5-1http://screenshots.debian.net/screenshots/g/gdpc/7394_large.png
Screenshots of package gdpc
Genometools
versatile genome analysis toolkit
Versions of package genometools
ReleaseVersionArchitectures
jessie1.5.1-1amd64,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x
sid1.5.1-1amd64,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x
Debtags of package genometools:
uitoolkitncurses
Popcon: 2 users (0 upd.)*
Versions and Archs
License: DFSG free
Svn

The GenomeTools contains a collection of useful tools for biological sequence analysis and -presentation combined into a single binary.

The toolkit contains binaries for sequence and annotation handling, sequence compression, index structure generation and access, annotation visualization, and much more.

Gentle
suite to plan genetic cloning
Versions of package gentle
ReleaseVersionArchitectures
squeeze1.9+cvs20100605+dfsg-2 (contrib)amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy1.9+cvs20100605+dfsg1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.9+cvs20100605+dfsg1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.9+cvs20100605+dfsg1-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package gentle:
biologynuceleic-acids, peptidic
fieldbiology, biology:bioinformatics
interfacex11
roleprogram
uitoolkitwxwidgets
Popcon: 8 users (10 upd.)*
Versions and Archs
License: DFSG free
Svn

GENtle is a software for DNA and amino acid editing, database management, plasmid maps, restriction and ligation, alignments, sequencer data import, calculators, gel image display, PCR, and much more.

Please cite: Magnus Manske: GENtle, a free multi-purpose molecular biology tool. (eprint) (2006)
Gff2aplot
pair-wise alignment-plots for genomic sequences in PostScript
Versions of package gff2aplot
ReleaseVersionArchitectures
squeeze2.0-5amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2.0-7amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.0-7amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.0-7amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package gff2aplot:
fieldbiology, biology:bioinformatics
interfacecommandline, shell
roleprogram
scopeutility
useconverting, viewing
works-withimage:vector
works-with-formatplaintext, postscript
Popcon: 13 users (9 upd.)*
Versions and Archs
License: DFSG free
Svn

A program to visualize the alignment of two genomic sequences together with their annotations. From GFF-format input files it produces PostScript figures for that alignment. The following menu lists many features of gff2aplot:

  • Comprehensive alignment plots for any GFF-feature. Attributes are defined separately so you can modify only whatsoever attributes for a given file or share same customization across different data-sets.
  • All parameters are set by default within the program, but it can be also fully configured via gff2ps-like flexible customization files. Program can handle several of such files, summarizing all the settings before producing the corresponding figure. Moreover, all customization parameters can be set via command-line switches, which allows users to play with those parameters before adding any to a customization file.
  • Source order is taken from input files, if you swap file order you can visualize alignment and its annotation with the new input arrangement.
  • All alignment scores can be visualized in a PiP box below gff2aplot area, using grey-color scale, user-defined color scale or score-dependent gradients.
  • Scalable fonts, which can also be chosen among the basic PostScript default fonts. Feature and group labels can be rotated to improve readability in both annotation axes.
  • The program is still defined as a Unix filter so it can handle data from files, redirections and pipes, writing output to standard-output and warnings to standard error.
  • gff2aplot is able to manage many physical page formats (from A0 to A10, and more -see available page sizes in its manual-), including user-defined ones. This allows, for instance, the generation of poster size genomic maps, or the use of a continuous-paper supporting plotting device, either in portrait or landscape.
  • You can draw different alignments on same alignment plot and distinguish them by using different colors for each.
  • Shape dictionary has been expanded, so that further feature shapes are now available (see manual).
  • Annotation projections through alignment plots (so called ribbons) emulate transparencies via complementary color fill patterns. This feature allows to show color pseudo-blending when horizontal and vertical ribbons overlap.
Please cite: J. F. Abril, R. Guigó and T. Wiehe: gff2aplot: Plotting sequence comparisons. (PubMed,eprint) Bioinformatics 19(18):2477-2479 (2003)
Gff2ps
produces PostScript graphical output from GFF-files
Versions of package gff2ps
ReleaseVersionArchitectures
squeeze0.98d-3all
wheezy0.98d-4all
jessie0.98d-4all
sid0.98d-4all
Debtags of package gff2ps:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
useconverting, viewing
works-withimage:vector
works-with-formatpostscript
Popcon: 12 users (10 upd.)*
Versions and Archs
License: DFSG free
Svn

gff2ps is a script program developed with the aim of converting gff-formatted records into high quality one-dimensional plots in PostScript. Such plots maybe useful for comparing genomic structures and to visualizing outputs from genome annotation programs. It can be used in a very simple way, because it assumes that the GFF file itself carries enough formatting information, but it also allows through a number of options and/or a configuration file, for a great degree of customization.

Please cite: J. F. Abril and R. Guigó: gff2ps: visualizing genomic annotations.. (PubMed,eprint) Bioinformatics 16(8):743-744 (2000)
Ghemical
GNOME molecular modelling environment
Versions of package ghemical
ReleaseVersionArchitectures
squeeze2.99.2-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy3.0.0-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie3.0.0-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid3.0.0-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package ghemical:
fieldchemistry
interface3d, x11
roleprogram
suitegnome
uitoolkitgtk
useediting, learning, viewing
works-with3dmodel
x11application
Popcon: 58 users (51 upd.)*
Versions and Archs
License: DFSG free
Svn

Ghemical is a computational chemistry software package written in C++. It has a graphical user interface and it supports both quantum- mechanics (semi-empirical) models and molecular mechanics models. Geometry optimization, molecular dynamics and a large set of visualization tools using OpenGL are currently available.

Ghemical relies on external code to provide the quantum-mechanical calculations. Semi-empirical methods MNDO, MINDO/3, AM1 and PM3 come from the MOPAC7 package (Public Domain), and are included in the package. The MPQC package is used to provide ab initio methods: the methods based on Hartree-Fock theory are currently supported with basis sets ranging from STO-3G to 6-31G**.

Screenshots of package ghemical
Glam2
gapped protein motifs from unaligned sequences
Versions of package glam2
ReleaseVersionArchitectures
squeeze1064-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy1064-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1064-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1064-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package glam2:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
useanalysing, comparing, searching
works-with-formatplaintext
Popcon: 12 users (6 upd.)*
Versions and Archs
License: DFSG free
Svn

GLAM2 is a software package for finding motifs in sequences, typically amino-acid or nucleotide sequences. A motif is a re-occurring sequence pattern: typical examples are the TATA box and the CAAX prenylation motif. The main innovation of GLAM2 is that it allows insertions and deletions in motifs.

The package includes these programs:

 glam2:       discovering motifs shared by a set of sequences;
 glam2scan:   finding matches, in a sequence database, to a motif discovered
              by glam2;
 glam2format: converting glam2 motifs to  standard alignment formats;
 glam2mask:   masking glam2 motifs out of sequences, so that weaker motifs
              can be found;
 glam2-purge: removing highly similar members of a set of sequences.

In this package, the fast Fourier algorithm (FFT) was enabled for glam2.

If you use GLAM2, please cite: MC Frith, NFW Saunders, B Kobe, TL Bailey (2008) Discovering sequence motifs with arbitrary insertions and deletions, PLoS Computational Biology (in press).

Please cite: Martin C. Frith, Neil F. W. Saunders, Bostjan Kobe and Timothy L. Bailey: Discovering Sequence Motifs with Arbitrary Insertions and Deletions. (PubMed) PLoS Computational Biology 4(5):e1000071 (2008)
Screenshots of package glam2
Grinder
Versatile omics shotgun and amplicon sequencing read simulator
Versions of package grinder
ReleaseVersionArchitectures
wheezy0.4.5-1all
jessie0.4.5-1all
sid0.4.5-1all
upstream0.5.3
Popcon: 6 users (6 upd.)*
Newer upstream!
License: DFSG free
Git

Grinder is a versatile program to create random shotgun and amplicon sequence libraries based on DNA, RNA or proteic reference sequences provided in a FASTA file.

Grinder can produce genomic, metagenomic, transcriptomic, metatranscriptomic, proteomic, metaproteomic shotgun and amplicon datasets from current sequencing technologies such as Sanger, 454, Illumina. These simulated datasets can be used to test the accuracy of bioinformatic tools under specific hypothesis, e.g. with or without sequencing errors, or with low or high community diversity. Grinder may also be used to help decide between alternative sequencing methods for a sequence-based project, e.g. should the library be paired-end or not, how many reads should be sequenced.

Please cite: Florent E. Angly, Dana Willner, Forest Rohwer, Philip Hugenholtz and Gene W. Tyson: Grinder: a versatile amplicon and shotgun sequence simulator. (PubMed,eprint) Nucleic Acids Research Epub ahead of print (2012)
Gromacs
Molecular dynamics simulator, with building and analysis tools
Versions of package gromacs
ReleaseVersionArchitectures
squeeze4.0.7-3amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy4.5.5-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie4.5.5-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid4.5.5-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
experimental4.6-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream4.6.2
Debtags of package gromacs:
fieldbiology, biology:structural, chemistry
interfacecommandline, x11
roleprogram
uitoolkitxlib
x11application
Popcon: 61 users (56 upd.)*
Newer upstream!
License: DFSG free
Svn

GROMACS is a versatile package to perform molecular dynamics, i.e. simulate the Newtonian equations of motion for systems with hundreds to millions of particles.

It is primarily designed for biochemical molecules like proteins and lipids that have a lot of complicated bonded interactions, but since GROMACS is extremely fast at calculating the nonbonded interactions (that usually dominate simulations) many groups are also using it for research on non- biological systems, e.g. polymers.

GROMACS offers entirely too many features for a brief description to do it justice. A more complete listing is available at http://www.gromacs.org/content/view/12/176/.

Please cite: Berk Hess, Carsten Kutzner, David van der Spoel and Erik Lindahl: GROMACS 4: Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation. (eprint) J. Chem. Theory Comput. 4(3):435-447 (2008)
Hhsuite
sensitive protein sequence searching based on HMM-HMM alignment
Versions of package hhsuite
ReleaseVersionArchitectures
wheezy2.0.15-1amd64
jessie2.0.16-1amd64
sid2.0.16-1amd64
Popcon: 5 users (6 upd.)*
Versions and Archs
License: DFSG free
Svn

HH-suite is an open-source software package for sensitive protein sequence searching based on the pairwise alignment of hidden Markov models (HMMs).

This package contains HHsearch and HHblits among other programs and utilities.

HHsearch takes as input a multiple sequence alignment (MSA) or profile HMM and searches a database of HMMs (e.g. PDB, Pfam, or InterPro) for homologous proteins. HHsearch is often used for protein structure prediction to detect homologous templates and to build highly accurate query-template pairwise alignments for homology modeling.

HHblits can build high-quality MSAs starting from single sequences or from MSAs. It transforms these into a query HMM and, using an iterative search strategy, adds significantly similar sequences from the previous search to the updated query HMM for the next search iteration. Compared to PSI-BLAST, HHblits is faster, up to twice as sensitive and produces more accurate alignments.

Please cite: Michael Remmert, Andreas Biegert, Andreas Hauser and Johannes Söding: HHblits: Lightning-fast iterative protein sequence searching by HMM-HMM alignment.. (PubMed) Nat. Methods 9(2):173-175 (2011)
Hmmer
profile hidden Markov models for protein sequence analysis
Versions of package hmmer
ReleaseVersionArchitectures
squeeze2.3.2-5amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy3.0-4amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie3.0-4amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid3.0-4amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package hmmer:
biologyformat:aln, peptidic
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
usesearching
works-withdb
works-with-formatplaintext
Popcon: 36 users (36 upd.)*
Versions and Archs
License: DFSG free
Svn

HMMER is an implementation of profile hidden Markov model methods for sensitive searches of biological sequence databases using multiple sequence alignments as queries.

Given a multiple sequence alignment as input, HMMER builds a statistical model called a "hidden Markov model" which can then be used as a query into a sequence database to find (and/or align) additional homologues of the sequence family.

Please cite: S. R. Eddy: Profile hidden Markov models. (PubMed,eprint) Bioinformatics 14(9):755-763 (1998)
Screenshots of package hmmer
Hmmer2
profile hidden Markov models for protein sequence analysis
Versions of package hmmer2
ReleaseVersionArchitectures
jessie2.3.2-6amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.3.2-6amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 4 users (1 upd.)*
Versions and Archs
License: DFSG free
Svn

HMMER is an implementation of profile hidden Markov model methods for sensitive searches of biological sequence databases using multiple sequence alignments as queries.

Given a multiple sequence alignment as input, HMMER builds a statistical model called a "hidden Markov model" which can then be used as a query into a sequence database to find (and/or align) additional homologues of the sequence family.

Please cite: Eddy, SR: Profile hidden Markov models. Bioinformatics 14(9):755-763 (1998)
Remark of Debian Med team: This older version of HMMER is used in some applications

While Debian has HMMER 3 since some time there are users of HMMER 2 interested in having this old version available and thus the package is reintroduced.

Infernal
inference of RNA secondary structural alignments
Versions of package infernal
ReleaseVersionArchitectures
squeeze1.0.2-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy1.0.2-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.0.2-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.0.2-2armel,armhf,ia64,mips,mipsel,powerpc,s390,s390x,sparc
sid1.1~rc2-1amd64,hurd-i386,i386,kfreebsd-amd64,kfreebsd-i386
Debtags of package infernal:
biologynuceleic-acids
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
useanalysing
Popcon: 22 users (34 upd.)*
Versions and Archs
License: DFSG free
Svn

Infernal ("INFERence of RNA ALignment") searches DNA sequence databases for RNA structure and sequence similarities. It provides an implementation of a special variant of profile stochastic context-free grammars called covariance models (CMs). A CM is like a sequence profile, but it scores a combination of sequence consensus and RNA secondary structure consensus, so in many cases, it is more capable of identifying RNA homologs that conserve their secondary structure more than their primary sequence.

The tool is an integral component of the Rfam database.

Please cite: Eric P. Nawrocki, Diana L. Kolbe and Sean R. Eddy: Infernal 1.0: inference of RNA alignments. (PubMed,eprint) Bioinformatics 25(10):1335-1337 (2009)
Jalview
multiple alignment editor
Maintainer: Vincent Fourmond
Versions of package jalview
ReleaseVersionArchitectures
wheezy2.7.dfsg-2all
jessie2.7.dfsg-2all
sid2.7.dfsg-2all
Popcon: 8 users (7 upd.)*
Versions and Archs
License: DFSG free
Svn

JalView is a Java alignement editor that can work with sequence alignement produced by programs implementing alignment algorithms such as clustalw, kalign and t-coffee.

It has lots of features, is actively developed, and will compare advantageously to BioEdit, while being free as in free speech !

Screenshots of package jalview
Jellyfish
count k-mers in DNA sequences
Versions of package jellyfish
ReleaseVersionArchitectures
wheezy1.1.5-1amd64,kfreebsd-amd64
jessie1.1.5-1amd64,kfreebsd-amd64
sid1.1.5-1amd64,kfreebsd-amd64
upstream1.1.10
Popcon: 4 users (5 upd.)*
Newer upstream!
License: DFSG free
Git

JELLYFISH is a tool for fast, memory-efficient counting of k-mers in DNA. A k-mer is a substring of length k, and counting the occurrences of all such substrings is a central step in many analyses of DNA sequence. JELLYFISH can count k-mers using an order of magnitude less memory and an order of magnitude faster than other k-mer counting packages by using an efficient encoding of a hash table and by exploiting the "compare-and-swap" CPU instruction to increase parallelism.

JELLYFISH is a command-line program that reads FASTA and multi-FASTA files containing DNA sequences. It outputs its k-mer counts in an binary format, which can be translated into a human-readable text format using the "jellyfish dump" command.

Jmol
Molecular Viewer
Versions of package jmol
ReleaseVersionArchitectures
wheezy12.2.32+dfsg2-1all
jessie12.2.32+dfsg2-1all
sid12.2.32+dfsg2-1all
upstream13.0.15
Debtags of package jmol:
fieldchemistry
roleprogram
scopeutility
useviewing
Popcon: 30 users (20 upd.)*
Newer upstream!
License: DFSG free
Svn

Jmol is a Java molecular viewer for three-dimensional chemical structures. Features include reading a variety of file types and output from quantum chemistry programs, and animation of multi-frame files and computed normal modes from quantum programs. It includes with features for chemicals, crystals, materials and biomolecules. Jmol might be useful for students, educators, and researchers in chemistry and biochemistry.

File formats read by Jmol include PDB, XYZ, CIF, CML, MDL Molfile, Gaussian, GAMESS, MOPAC, ABINIT, ACES-II, Dalton and VASP.

Screenshots of package jmol
Kalign
Global and progressive multiple sequence alignment
Versions of package kalign
ReleaseVersionArchitectures
wheezy2.03+20110620-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.03+20110620-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.03+20110620-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
squeeze2.04-2amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
Debtags of package kalign:
biologyformat:aln, nuceleic-acids, peptidic
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
usecomparing
works-with-formatplaintext
Popcon: 23 users (26 upd.)*
Versions and Archs
License: DFSG free
Svn

Kalign is a command line tool to perform multiple alignment of biological sequences. It employs the Muth-Manber string-matching algorithm, to improve both the accuracy and speed of the alignment. It uses global, progressive alignment approach, enriched by employing an approximate string-matching algorithm to calculate sequence distances and by incorporating local matches into the otherwise global alignment.

Please cite: Timo Lassmann, Oliver Frings and Erik L. L. Sonnhammer: Kalign2: high-performance multiple alignment of protein and nucleotide sequences allowing external features. (PubMed,eprint) Nucl. Acids Res. 37(3):858-865 (2009)
Last-align
genome-scale comparison of biological sequences
Versions of package last-align
ReleaseVersionArchitectures
squeeze128-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy199-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie199-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid199-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package last-align:
fieldbiology, biology:bioinformatics
roleprogram
Popcon: 12 users (11 upd.)*
Versions and Archs
License: DFSG free
Svn

LAST is software for comparing and aligning sequences, typically DNA or protein sequences. LAST is similar to BLAST, but it copes better with very large amounts of sequence data. Here are two things LAST is good at:

  • Comparing large (e.g. mammalian) genomes.
  • Mapping lots of sequence tags onto a genome.

The main technical innovation is that LAST finds initial matches based on their multiplicity, instead of using a fixed size (e.g. BLAST uses 10-mers). This allows one to map tags to genomes without repeat-masking, without becoming overwhelmed by repetitive hits. To find these variable-sized matches, it uses a suffix array (inspired by Vmatch). To achieve high sensitivity, it uses a discontiguous suffix array, analogous to spaced seeds.

Librg-utils-perl
parsers and format conversion utilities used by (e.g.) profphd
Versions of package librg-utils-perl
ReleaseVersionArchitectures
wheezy1.0.43-1all
jessie1.0.43-1all
sid1.0.43-1all
Debtags of package librg-utils-perl:
devellang:perl, library
Popcon: 7 users (7 upd.)*
Versions and Archs
License: DFSG free
Svn

This package contributes to the PredictProtein server for the automated structural annotation of protein sequences. It features as series of conversion tools like:

  • blast2saf.pl
  • blastpgp_to_saf.pl
  • conv_hssp2saf.pl
  • copf.pl
  • hssp_filter.pl
  • safFilterRed.pl

which are supported by the modules:

  • RG:Utils::Conv_hssp2saf
  • RG:Utils::Copf
  • RG:Utils::Hssp_filter
Logol-bin
Pattern matching tool using Logol language
Versions of package logol-bin
ReleaseVersionArchitectures
jessie1.6.1-1amd64,i386,powerpc
sid1.6.2-1amd64,i386,powerpc
Debtags of package logol-bin:
uitoolkitncurses
Popcon: 0 users (1 upd.)*
Versions and Archs
License: DFSG free
Svn

This package contains the Prolog binaries used by Logol to parse the sequence and match the grammar.

Logol is a pattern matching tool using the Logol language. It searches with a specific grammar a pattern in small or large sequence (dna, rna, protein). It provides complete result matching with the original grammar in the results.

Loki
MCMC linkage analysis on general pedigrees
Versions of package loki
ReleaseVersionArchitectures
squeeze2.4.7.4-4amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2.4.7.4-4amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.4.7.4-4amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.4.7.4-4amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package loki:
fieldbiology
interfacecommandline
roleprogram
scopeutility
useanalysing
Popcon: 13 users (6 upd.)*
Versions and Archs
License: DFSG free
Svn

Performs Markov chain Monte Carlo multipoint linkage analysis on large, complex pedigrees. The current package supports analyses on quantitative traits only, although this restriction will be lifted in later versions. Joint estimation of QTL number, position and effects uses Reversible Jump MCMC. It is also possible to perform affected only IBD sharing analyses.

The homepage of this project used to be at http://loki.homeunix.net but the project is dead now and the homepage vanished. The Homepage field above points to the web archive.

The package is enhanced by the following packages: loki-doc
Please cite: Simon C. Heath: Markov chain Monte Carlo segregation and linkage analysis for oligogenic models. (PubMed,eprint) American Journal of Human Genetics 61(3):748-60 (1997)
Ltrsift
postprocessing and classification of LTR retrotransposons
Versions of package ltrsift
ReleaseVersionArchitectures
jessie1.0.2-1amd64,armhf,i386,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x
sid1.0.2-1amd64,armhf,hurd-i386,i386,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x
Debtags of package ltrsift:
fieldbiology, biology:bioinformatics
interfacex11
roleprogram
scopeutility
uitoolkitgtk
useanalysing, converting
x11application
Popcon: 1 users (0 upd.)*
Versions and Archs
License: DFSG free
Svn

LTRsift is a graphical desktop tool for semi-automatic postprocessing of de novo predicted LTR retrotransposon annotations, such as the ones generated by LTRharvest and LTRdigest. Its user-friendly interface displays LTR retrotransposon candidates, their putative families and their internal structure in a hierarchical fashion, allowing the user to "sift" through the sometimes large results of de novo prediction software. It also offers customizable filtering and classification functionality.

Screenshots of package ltrsift
Macs
Model-based Analysis of ChIP-Seq on short reads sequencers
Versions of package macs
ReleaseVersionArchitectures
jessie2.0.9.1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.0.9.1-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 2 users (0 upd.)*
Versions and Archs
License: DFSG free
Svn

MACS empirically models the length of the sequenced ChIP fragments, which tends to be shorter than sonication or library construction size estimates, and uses it to improve the spatial resolution of predicted binding sites. MACS also uses a dynamic Poisson distribution to effectively capture local biases in the genome sequence, allowing for more sensitive and robust prediction. MACS compares favorably to existing ChIP-Seq peak-finding algorithms, is publicly available open source, and can be used for ChIP-Seq with or without control samples.

Mafft
Multiple alignment program for amino acid or nucleotide sequences
Versions of package mafft
ReleaseVersionArchitectures
squeeze6.815-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy6.864-1amd64,armel,armhf,i386,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie6.940-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid6.940-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream7.043
Debtags of package mafft:
biologyformat:aln, nuceleic-acids, peptidic
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
usecomparing
works-with-formatplaintext
Popcon: 34 users (51 upd.)*
Newer upstream!
License: DFSG free
Svn

MAFFT is a multiple sequence alignment program which offers three accuracy-oriented methods:

  • L-INS-i (probably most accurate; recommended for <200 sequences; iterative refinement method incorporating local pairwise alignment information),
  • G-INS-i (suitable for sequences of similar lengths; recommended for <200 sequences; iterative refinement method incorporating global pairwise alignment information),
  • E-INS-i (suitable for sequences containing large unalignable regions; recommended for <200 sequences), and five speed-oriented methods:

  • FFT-NS-i (iterative refinement method; two cycles only),

  • FFT-NS-i (iterative refinement method; max. 1000 iterations),
  • FFT-NS-2 (fast; progressive method),
  • FFT-NS-1 (very fast; recommended for >2000 sequences; progressive method with a rough guide tree),
  • NW-NS-PartTree-1 (recommended for ∼50,000 sequences; progressive method with the PartTree algorithm).
Please cite: Kazutaka Katoh and Hiroyuki Toh: Recent developments in the MAFFT multiple sequence alignment program. (PubMed) Brief Bioinform 9(4):286-298 (2008)
Maq
maps short fixed-length polymorphic DNA sequence reads to reference sequences
Versions of package maq
ReleaseVersionArchitectures
squeeze0.7.1-3amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy0.7.1-5amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie0.7.1-5amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.7.1-5amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package maq:
biologynuceleic-acids
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
useanalysing, comparing, searching
works-with-formatplaintext
Popcon: 19 users (24 upd.)*
Versions and Archs
License: DFSG free
Svn

Maq (short for Mapping and Assembly with Quality) builds mapping assemblies from short reads generated by the next-generation sequencing machines. It was particularly designed for Illumina-Solexa 1G Genetic Analyzer, and has a preliminary functionality to handle ABI SOLiD data. Maq is previously known as mapass2.

Developmemt of Maq stopped in 2008. Its successors are BWA and SAMtools.

Please cite: Heng Li, Jue Ruan and Richard Durbin: Mapping short DNA sequencing reads and calling variants using mapping quality scores. (PubMed,eprint) Genome Research 18(11):1851-1858 (2008)
Maqview
graphical read alignment viewer for short gene sequences
Versions of package maqview
ReleaseVersionArchitectures
wheezy0.2.5-4amd64,armel,armhf,i386,ia64,mips,mipsel,powerpc,s390,s390x,sparc
sid0.2.5-4hurd-i386
jessie0.2.5-5amd64,armel,armhf,i386,ia64,mips,mipsel,powerpc,s390,s390x,sparc
sid0.2.5-5amd64,armel,armhf,i386,ia64,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 4 users (16 upd.)*
Versions and Archs
License: DFSG free
Svn

Maqview is graphical read alignment viewer. It is specifically designed for the Maq alignment file and allows you to see the mismatches, base qualities and mapping qualities. Maqview is nothing fancy as Consed or GAP, but just a simple viewer for you to see what happens in a particular region.

In comparison to tgap-maq, the text-based read alignment viewer writen by James Bonfield, Maqview is faster and takes up much less memory and disk space in indexing. This is possibly because tgap aims to be a general-purpose viewer but Maqview fully makes use of the fact that a Maq alignment file has already been sorted. Maqview is also efficient in viewing and provides a command-line tool to quickly retrieve any region in an Maq alignment file.

Please cite: Heng Li, Jue Ruan and Richard Durbin: Mapping short DNA sequencing reads and calling variants using mapping quality scores. (PubMed,eprint) Genome Research 18(11):1851-1858 (2008)
Massxpert
linear polymer mass spectrometry software
Versions of package massxpert
ReleaseVersionArchitectures
squeeze2.3.6-1squeeze1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy3.2.3-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie3.4.0-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid3.4.0-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package massxpert:
biologynuceleic-acids, peptidic
fieldbiology, chemistry
interfacex11
roleprogram
uitoolkitqt
useanalysing, simulating
works-withbiological-sequence
works-with-formatxml
x11application
Popcon: 21 users (37 upd.)*
Versions and Archs
License: DFSG free
Git

massXpert is a program to simulate and analyse mass spectrometric data obtained on linear (bio-)polymers. It is the successor of GNU polyxmass.

Four modules allow:

  • making brand new polymer chemistry definitions;
  • using the definitions to perform easy calculations in a desktop calculator-like manner;
  • performing sophisticated polymer sequence editing and simulations;
  • perform m/z list comparisons;

Chemical simulations encompass cleavage (either chemical or enzymatic), gas-phase fragmentations, chemical modification of any monomer in the polymer sequence, cross-linking of monomers in the sequence, arbitrary mass searches, calculation of the isotopic pattern...

Other screenshots of package massxpert
VersionURL
2.0.3http://screenshots.debian.net/screenshots/m/massxpert/2268_large.png
Screenshots of package massxpert
Melting
compute the melting temperature of nucleic acid duplex
Versions of package melting
ReleaseVersionArchitectures
squeeze4.3c-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy4.3c-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie4.3c-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid4.3c-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream4.31
Debtags of package melting:
biologynuceleic-acids
fieldbiology, biology:molecular
interfacecommandline
roleprogram
scopeutility
suitegnu
useanalysing
works-with-formatplaintext
Popcon: 11 users (9 upd.)*
Newer upstream!
License: DFSG free
Svn

This program computes, for a nucleic acid duplex, the enthalpy, the entropy and the melting temperature of the helix-coil transitions. Three types of hybridisation are possible: DNA/DNA, DNA/RNA, and RNA/RNA. The program first computes the hybridisation enthalpy and entropy from the elementary parameters of each Crick's pair by the nearest-neighbor method. Then the melting temperature is computed. The set of thermodynamic parameters can be easily changed, for instance following an experimental breakthrough.

Please cite: Le Novère, Nicolas: MELTING, computing the melting temperature of nucleic acid duplex. (PubMed,eprint) Bioinformatics 17(12):1226-1227 (2001)
Metastudent
predictor of Gene Ontology terms from protein sequence
Versions of package metastudent
ReleaseVersionArchitectures
sid1.0.8-2all
Popcon: 0 users (1 upd.)*
Versions and Archs
License: DFSG free
Svn

Often, only the sequence of a protein is known, but not its functions. Metastudent will try to predict missing functional annotations through homology searches (BLAST).

All predicted functions correspond to Gene Ontology (GO) terms from the Molecular Function Ontology (MFO) and the Biological Process Ontology (BPO) and are associated with a reliability score.

Mipe
Tools to store PCR-derived data
Versions of package mipe
ReleaseVersionArchitectures
squeeze1.1-3all
wheezy1.1-4all
jessie1.1-4all
sid1.1-4all
Debtags of package mipe:
fieldbiology, biology:bioinformatics, biology:molecular
interfacecommandline
roledocumentation, program
scopeutility
useorganizing
works-with-formatxml
Popcon: 10 users (9 upd.)*
Versions and Archs
License: DFSG free
Svn

MIPE provides a standard format to exchange and/or storage of all information associated with PCR experiments using a flat text file. This will:

  • allow for exchange of PCR data between researchers/laboratories
  • enable traceability of the data
  • prevent problems when submitting data to dbSTS or dbSNP
  • enable the writing of standard scripts to extract data (e.g. a list of PCR primers, SNP positions or haplotypes for different animals)

Although this tool can be used for data storage, it's primary focus should be data exchange. For larger repositories, relational databases are more appropriate for storage of these data. The MIPE format could then be used as a standard format to import into and/or export from these databases.

Please cite: Jan Aerts and T. Veenendaal: MIPE - a XML-format to facilitate the storage and exchange of PCR-related data. Online Journal of Bioinformatics 6(2):114-120 (2005)
Mira-assembler
??? missing short description for package mira-assembler :-(
Versions of package mira-assembler
ReleaseVersionArchitectures
wheezy3.4.0.1-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie3.4.0.1-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid3.4.0.1-3amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream3.9.17
Debtags of package mira-assembler:
roleprogram
Popcon: 7 users (8 upd.)*
Newer upstream!
License: DFSG free
Svn
??? Missing long description for package mira-assembler
Mlv-smile
Find statistically significant patterns in sequences
Versions of package mlv-smile
ReleaseVersionArchitectures
wheezy1.47-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.47-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.47-3amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 6 users (6 upd.)*
Versions and Archs
License: DFSG free
Svn

Smile determines sequence motifs on the basis of a set of DNA, RNA or protein sequences.

  • No hard limit on the number of combinations of motifs to describe subsets of sequences.
  • The sequence alphabet may be specified.
  • The use of wildcards is supported.
  • Better determination of significance of motifs by simulation.
  • Introduction of a set of sequences with negative controls that should not match automatically determined motifs.
Mothur
sequence analysis suite for research on microbiota
Versions of package mothur
ReleaseVersionArchitectures
wheezy1.24.1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.24.1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.24.1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream1.31.1
Debtags of package mothur:
roleprogram
Popcon: 7 users (6 upd.)*
Newer upstream!
License: DFSG free
Svn

Mothur seeks to develop a single piece of open-source, expandable software to fill the bioinformatics needs of the microbial ecology community. It has incorporated the functionality of dotur, sons, treeclimber, s-libshuff, unifrac, and much more. In addition to improving the flexibility of these algorithms, a number of other features including calculators and visualization tools were added.

Please cite: Patrick D Schloss, Sarah L Westcott, Thomas Ryabin, Justine R Hall, Martin Hartmann, Emily B Hollister, Ryan A Lesniewski, Brian B Oakley, Donovan H Parks, Courtney J Robinson, Jason W Sahl, Blaz Stres, Gerhard G Thallinger, David J Van Horn and Carolyn F Weber: Introducing mothur: Open-source, platform-independent, community-supported software for describing and comparing microbial communities. (PubMed) Appl Environ Microbiol 75(23):7537-7541 (2009)
Mrbayes
Bayesian Inference of Phylogeny
Versions of package mrbayes
ReleaseVersionArchitectures
wheezy3.2.1+dfsg-1amd64,armel,armhf,i386,ia64,mips,mipsel,powerpc,s390,s390x,sparc
jessie3.2.1+dfsg-2amd64,armel,armhf,i386,ia64,mips,mipsel,powerpc,s390,s390x,sparc
sid3.2.1+dfsg-2amd64,armel,armhf,i386,ia64,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 13 users (22 upd.)*
Versions and Archs
License: DFSG free
Svn

Bayesian inference of phylogeny is based upon a quantity called the posterior probability distribution of trees, which is the probability of a tree conditioned on the observations. The conditioning is accomplished using Bayes's theorem. The posterior probability distribution of trees is impossible to calculate analytically; instead, MrBayes uses a simulation technique called Markov chain Monte Carlo (or MCMC) to approximate the posterior probabilities of trees.

The package is enhanced by the following packages: mrbayes-doc
Please cite: Fredrik Ronquist, Maxim Teslenko, Paul van der Mark, Daniel L. Ayres, Aaron Darling, Sebastian Höhna, Bret Larget, Liang Liu, Marc A. Suchard and John P. Huelsenbeck: MrBayes 3.2: Efficient Bayesian Phylogenetic Inference and Model Choice across a Large Model Space. (PubMed,eprint) Systematic Biology (2012)
Mummer
Efficient sequence alignment of full genomes
Versions of package mummer
ReleaseVersionArchitectures
squeeze3.22~dfsg-2amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy3.23~dfsg-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie3.23~dfsg-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid3.23~dfsg-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package mummer:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
usecomparing
works-with-formatplaintext
Popcon: 19 users (14 upd.)*
Versions and Archs
License: DFSG free
Svn

MUMmer is a system for rapidly aligning entire genomes, whether in complete or draft form. For example, MUMmer 3.0 can find all 20-basepair or longer exact matches between a pair of 5-megabase genomes in 13.7 seconds, using 78 MB of memory, on a 2.4 GHz Linux desktop computer. MUMmer can also align incomplete genomes; it handles the 100s or 1000s of contigs from a shotgun sequencing project with ease, and will align them to another set of contigs or a genome using the NUCmer program included with the system. If the species are too divergent for DNA sequence alignment to detect similarity, then the PROmer program can generate alignments based upon the six-frame translations of both input sequences.

Please cite: Stefan Kurtz, Adam Phillippy, Arthur L. Delcher, Michael Smoot, Martin Shumway, Corina Antonescu and Steven L. Salzberg: Versatile and open software for comparing large genomes. (PubMed) Genome Biology 5(2):R12 (2004)
Muscle
Multiple alignment program of protein sequences
Versions of package muscle
ReleaseVersionArchitectures
wheezy3.8.31-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie3.8.31-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid3.8.31-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
squeeze3.70+fix1-2amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
Debtags of package muscle:
biologyformat:aln, nuceleic-acids, peptidic
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
usecomparing
works-with-formatplaintext
Popcon: 38 users (35 upd.)*
Versions and Archs
License: DFSG free
Svn

MUSCLE is a multiple alignment program for protein sequences. MUSCLE stands for multiple sequence comparison by log-expectation. In the authors tests, MUSCLE achieved the highest scores of all tested programs on several alignment accuracy benchmarks, and is also one of the fastest programs out there.

Please cite: Robert C. Edgar: MUSCLE: multiple sequence alignment with high accuracy and high throughput. (PubMed,eprint) Nucleic Acids Research 32(5):1792-1797 (2004)
Screenshots of package muscle
Mustang
multiple structural alignment of proteins
Versions of package mustang
ReleaseVersionArchitectures
squeeze3.2.1-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy3.2.1-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie3.2.1-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid3.2.1-3amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package mustang:
biologypeptidic
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
useanalysing, comparing
works-with-formatplaintext
Popcon: 10 users (9 upd.)*
Versions and Archs
License: DFSG free
Svn

Mustang is an algorithm to align multiple protein structures. Given a set of PDB files, the program uses the spatial information in the Calpha atoms of the set to produce a sequence alignment. Based on a progressive pairwise heuristic the algorithm then proceeds through a number of refinement passes. Mustang reports the multiple sequence alignment and the corresponding superposition of structures.

The package is enhanced by the following packages: mustang-testdata
Please cite: Arun S. Konagurthu, James C. Whisstock, Peter J. Stuckey and Arthur M. Lesk: MUSTANG: A multiple structural alignment algorithm. (PubMed) Proteins: Structure, Function, and Bioinformatics 64(3):559-574 (2006)
Ncbi-blast+
next generation suite of BLAST sequence search tools
Versions of package ncbi-blast+
ReleaseVersionArchitectures
wheezy2.2.26-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.2.27-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.2.27-3armel,armhf,mips,mipsel,s390,s390x,sparc
sid2.2.28-1amd64,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,powerpc
Debtags of package ncbi-blast+:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
useanalysing
works-withbiological-sequence
Popcon: 23 users (28 upd.)*
Versions and Archs
License: DFSG free
Svn

The Basic Local Alignment Search Tool (BLAST) is the most widely used sequence similarity tool. There are versions of BLAST that compare protein queries to protein databases, nucleotide queries to nucleotide databases, as well as versions that translate nucleotide queries or databases in all six frames and compare to protein databases or queries. PSI-BLAST produces a position-specific-scoring-matrix (PSSM) starting with a protein query, and then uses that PSSM to perform further searches. It is also possible to compare a protein or nucleotide query to a database of PSSM’s. The NCBI supports a BLAST web page at blast.ncbi.nlm.nih.gov as well as a network service.

Ncbi-epcr
Tool to test a DNA sequence for the presence of sequence tagged sites
Versions of package ncbi-epcr
ReleaseVersionArchitectures
squeeze2.3.12-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2.3.12-1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.3.12-1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.3.12-1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package ncbi-epcr:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
usechecking, searching
works-with-formatplaintext
Popcon: 18 users (11 upd.)*
Versions and Archs
License: DFSG free
Svn

Electronic PCR (e-PCR) is computational procedure that is used to identify sequence tagged sites(STSs), within DNA sequences. e-PCR looks for potential STSs in DNA sequences by searching for subsequences that closely match the PCR primers and have the correct order, orientation, and spacing that could represent the PCR primers used to generate known STSs.

The new version of e-PCR implements a fuzzy matching strategy. To reduce likelihood that a true STS will be missed due to mismatches, multiple discontigous words may be used instead of a single exact word. Each of this word has groups of significant positions separated by 'wildcard' positions that are not required to match. In addition, it is also possible to allow gaps in the primer alignments.

The main motivation for implementing reverse searching (called Reverse e-PCR) was to make it feasible to search the human genome sequence and other large genomes. The new version of e-PCR provides a search mode using a query sequence against a sequence database.

Please cite: Schuler, Gregory D.: Sequence Mapping by Electronic PCR. (PubMed,eprint) Genome Research 7(5):541-550 (1997)
Ncbi-seg
tool to mask segments of low compositional complexity in amino acid sequences
Versions of package ncbi-seg
ReleaseVersionArchitectures
jessie0.0.20000620-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.0.20000620-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 2 users (3 upd.)*
Versions and Archs
License: DFSG free
Svn

ncbi-seg (a.k.a. SEG) is a program for identifying and masking segments of low compositional complexity in amino acid sequences.

ncbi-seg divides sequences into contrasting segments of low-complexity and high-complexity. Low-complexity segments defined by the algorithm represent "simple sequences" or "compositionally-biased regions".

This program is inappropriate for masking nucleotide sequences and, in fact, may strip some nucleotide ambiguity codes from nt. sequences as they are being read.

Please cite: John C. Wootton and Scott Federhen: Statistics of local complexity in amino acid sequences and sequence databases.. Computers & Chemistry 17:149-163 (1993)
Ncbi-tools-bin
NCBI libraries for biology applications (text-based utilities)
Versions of package ncbi-tools-bin
ReleaseVersionArchitectures
squeeze6.1.20090809-2amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy6.1.20120620-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie6.1.20120620-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid6.1.20120620-2armhf,mips,s390,sparc
sid6.1.20120620-3amd64,armel,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mipsel,powerpc,s390x
Debtags of package ncbi-tools-bin:
biologynuceleic-acids, peptidic
fieldbiology, biology:bioinformatics
interfacecommandline
networkclient
roleprogram
sciencecalculation
scopeutility
useanalysing, calculating, converting, searching
works-withbiological-sequence
works-with-formatplaintext, xml
Popcon: 22 users (23 upd.)*
Versions and Archs
License: DFSG free
Git

This package includes various utilities distributed with the NCBI C SDK, including the development tools asntool and errhdr (formerly of libncbi6-dev). None of the programs in this package require X; you can find the X-based utilities in the ncbi-tools-x11 package. BLAST and related tools are in a separate package (blast2).

The package is enhanced by the following packages: mcl
Ncbi-tools-x11
NCBI libraries for biology applications (X-based utilities)
Versions of package ncbi-tools-x11
ReleaseVersionArchitectures
squeeze6.1.20090809-2amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy6.1.20120620-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie6.1.20120620-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid6.1.20120620-2armhf,mips,s390,sparc
sid6.1.20120620-3amd64,armel,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mipsel,powerpc,s390x
Debtags of package ncbi-tools-x11:
biologynuceleic-acids, peptidic
fieldbiology, biology:bioinformatics, biology:structural
interface3d, x11
networkclient
roleprogram
sciencevisualisation
scopeutility
uitoolkitmotif
useanalysing, calculating, editing, searching, viewing
x11application
Popcon: 112 users (11 upd.)*
Versions and Archs
License: DFSG free
Git

This package includes some X-based utilities distributed with the NCBI C SDK: Cn3D, Network Entrez, Sequin, ddv, and udv. These programs are not part of ncbi-tools-bin because they depend on several additional library packages.

Screenshots of package ncbi-tools-x11
Ncoils
coiled coil secondary structure prediction
Versions of package ncoils
ReleaseVersionArchitectures
squeeze2002-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2002-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2002-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2002-3amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 11 users (11 upd.)*
Versions and Archs
License: DFSG free
Svn

The program predicts the coiled coil secondary structure predictions from protein sequences. The algorithm was published in Lupas, van Dyke & Stock, Predicting coiled coils from protein sequences Science, 252, 1162-1164, 1991.

Please cite: Andrei Lupas, Marc Van Dyke and Jeff Stock: Predicting coiled coils from protein sequences. (PubMed) Science 252:1162-1164 (1991)
Neobio
computes alignments of amino acid and nucleotide sequences
Versions of package neobio
ReleaseVersionArchitectures
wheezy0.0.20030929-1all
jessie0.0.20030929-1all
sid0.0.20030929-1.1all
Popcon: 5 users (9 upd.)*
Versions and Archs
License: DFSG free
Svn

Library and graphical user interface for pairwise sequence alignments. Implementation of the dynamic programming methods of Needleman & Wunsch (global alignment) and Smith & Waterman (local alignment).

Njplot
phylogenetic tree drawing program
Versions of package njplot
ReleaseVersionArchitectures
squeeze2.3-2amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2.4-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.4-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.4-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package njplot:
fieldbiology, biology:bioinformatics
interfacex11
roleprogram
scopeutility
uitoolkitmotif
useanalysing, editing, organizing, printing, viewing
works-withbiological-sequence
works-with-formatplaintext
x11application
Popcon: 94 users (9 upd.)*
Versions and Archs
License: DFSG free
Svn

NJplot is able to draw any dendrogram expressed in the Newick standard phylogenetic tree format (e.g., the format used by the Phylip package). NJplot is especially convenient for rooting the unrooted trees obtained from parsimony, distance or maximum likelihood tree-building methods.

Please cite: G. Perrière and M. Gouy: WWW-query: An on-line retrieval system for biological sequence banks. (PubMed) Biochimie 78(5):364–369 (1996)
Screenshots of package njplot
Norsnet
tool to identify unstructured loops in proteins
Versions of package norsnet
ReleaseVersionArchitectures
jessie1.0.16-1all
sid1.0.16-1all
Popcon: 2 users (2 upd.)*
Versions and Archs
License: DFSG free
Svn

NORSnet can distinguish between very long contiguous segments with non-regular secondary structure (NORS regions) and well-folded proteins.

NORSnet was trained on predicted information rather than on experimental data. This allows NORSnet to reach into regions in sequence space that are not covered by specialized disorder predictors. One disadvantage of this approach is that it is not optimal for the identification of the "average" disordered region.

NORSnet takes the following input, further described on norsnet(1):

  • a protein sequence in a FASTA file
  • secondary structure and solvent accessibility prediction by prof(1)
  • an HSSP file
  • flexible/rigid residues prediction by profbval(1)
Please cite: Avner Schlessinger, Jinfeng Liu and Burkhard Rost: Natively unstructured loops differ from other loops.. (PubMed,eprint) PLoS Comput Biol. 3:e140 (2007)
Norsp
predictor of non-regular secondary structure
Versions of package norsp
ReleaseVersionArchitectures
jessie1.0.5-1all
sid1.0.5-1all
Popcon: 2 users (2 upd.)*
Versions and Archs
License: DFSG free
Svn

NORSp is a publicly available predictor for disordered regions in proteins. Specifically, it predicts long regions with no regular secondary structure. Upon submission of a protein sequence, NORSp analyses the protein about its secondary structure, the presence of transmembrane helices and coiled-coils. It then returns the presence and position of disordered regions.

NORSp can be useful for biologists in several ways. For example, crystallographers can check whether their proteins contain NORS regions and make the decision about whether to proceed with the experiments since NORS proteins may be difficult to crystallise, as demonstrated by the their low occurrence in PDB. Biologists interested in protein structure-function relationship may also find it interesting to verify whether the protein-protein interaction sites coincide with NORS regions.

Please cite: Jinfeng Liu and Burkhard Rost: NORSp: Predictions of long regions without regular secondary structure.. (PubMed,eprint) Nucleic Acids Res 31(13):3833-3835 (2003)
Perlprimer
Graphical design of primers for PCR
Versions of package perlprimer
ReleaseVersionArchitectures
squeeze1.1.19-1all
wheezy1.1.21-1all
jessie1.1.21-1all
sid1.1.21-1all
Debtags of package perlprimer:
biologyformat:aln, nuceleic-acids, peptidic
fieldbiology, biology:molecular
interfacex11
networkclient
roleprogram
scopeutility
uitoolkittk
useanalysing
works-with-formatplaintext
x11application
Popcon: 97 users (5 upd.)*
Versions and Archs
License: DFSG free
Git

PerlPrimer is a free, open-source GUI application written in Perl that designs primers for standard Polymerase Chain Reaction (PCR), bisulphite PCR, real-time PCR (QPCR) and sequencing. It aims to automate and simplify the process of primer design.

If operated online, the tool nicely communicates with the Ensembl project for further insights into the gene structure, i.e., allowing for taking the location of exons and introns into account for the design of the primers. The sequences themselves can be retrieved, too.

Please cite: Marshall, Owen J.: PerlPrimer: cross-platform, graphical primer design for standard, bisulphite and real-time PCR. (PubMed,eprint) Bioinformatics 20(15):2471-2472 (2004)
Screenshots of package perlprimer
Phyml
Phylogenetic estimation using Maximum Likelihood
Versions of package phyml
ReleaseVersionArchitectures
squeeze20100123-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy20110919-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie20110919-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid20110919-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream20130513
Debtags of package phyml:
biologypeptidic
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
useanalysing, comparing
works-withbiological-sequence
Popcon: 27 users (28 upd.)*
Newer upstream!
License: DFSG free
Svn

PhyML is a software that estimates maximum likelihood phylogenies from alignments of nucleotide or amino acid sequences. It provides a wide range of options that were designed to facilitate standard phylogenetic analyses. The main strengths of PhyML lies in the large number of substitution models coupled to various options to search the space of phylogenetic tree topologies, going from very fast and efficient methods to slower but generally more accurate approaches. It also implements two methods to evaluate branch supports in a sound statistical framework (the non-parametric bootstrap and the approximate likelihood ratio test).

PhyML was designed to process moderate to large data sets. In theory, alignments with up to 4,000 sequences 2,000,000 character-long can be analyzed. In practice however, the amount of memory required to process a data set is proportional of the product of the number of sequences by their length. Hence, a large number of sequences can only be processed provided that they are short. Also, PhyML can handle long sequences provided that they are not numerous. With most standard personal computers, the “comfort zone” for PhyML generally lies around 3 to 500 sequences less than 2,000 character long.

This pakcage also includes PhyTime.

Please cite: Stephane Guindon and Olivier Gascuel: A Simple, Fast, and Accurate Algorithm to Estimate Large Phylogenies by Maximum Likelihood. (PubMed,eprint) Syst Biol 52(5):696-704 (2003)
Picard-tools
Command line tools to manipulate SAM and BAM files
Versions of package picard-tools
ReleaseVersionArchitectures
squeeze1.27-1all
wheezy1.46-1all
jessie1.82-2all
sid1.90-2all
upstream1.92
Popcon: 21 users (49 upd.)*
Newer upstream!
License: DFSG free
Git

SAM (Sequence Alignment/Map) format is a generic format for storing large nucleotide sequence alignments. Picard Tools includes these utilities to manipulate SAM and BAM files: BamToBfq IlluminaBasecallsToSam BuildBamIndex MarkDuplicates CalculateHsMetrics MeanQualityByCycle CleanSam MergeBamAlignment CollectAlignmentSummaryMetrics MergeSamFiles CollectGcBiasMetrics NormalizeFasta CollectInsertSizeMetrics QualityScoreDistribution CollectRnaSeqMetrics ReplaceSamHeader CompareSAMs RevertSam CreateSequenceDictionary SamFormatConverter ExtractIlluminaBarcodes SamToFastq EstimateLibraryComplexity SortSam FastqToSam ValidateSamFile FixMateInformation ViewSam

Plasmidomics
draw plasmids and vector maps with PostScript graphics export
Versions of package plasmidomics
ReleaseVersionArchitectures
squeeze0.2.0-2all
wheezy0.2.0-2all
jessie0.2.0-2all
sid0.2.0-2all
Debtags of package plasmidomics:
fieldbiology, biology:molecular
interfacex11
roleprogram
scopeutility
uitoolkittk
works-withimage:vector
works-with-formatpostscript
x11application
Popcon: 13 users (8 upd.)*
Versions and Archs
License: DFSG free
Svn

Plasmidomics is written for easy drawing of plasmids and vector maps to use them in theses, presentations or other forms of publications. It natively supports PostScript as output format.

Screenshots of package plasmidomics
Plink
whole-genome association analysis toolset
Versions of package plink
ReleaseVersionArchitectures
squeeze1.07-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mipsel,powerpc,s390,sparc
wheezy1.07-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.07-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.07-3amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package plink:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
Popcon: 11 users (13 upd.)*
Versions and Archs
License: DFSG free
Svn

plink expects as input the data from SNP (single nucleotide polymorphism) chips of many individuals and their phenotypical description of a disease. It finds associations of single or pairs of DNA variations with a phenotype and can retrieve SNP annotation from an online source.

SNPs can evaluated individually or as pairs for their association with the disease phenotypes. The joint investigation of copy number variations is supported. A variety of statistical tests have been implemented.

Please note: The executable was renamed to p-link because of a name clash. Please read more about this in /usr/share/doc/README.Debian.

Please cite: Shaun Purcell, Benjamin Neale, Kathe Todd-Brown, Lori Thomas, Manuel A. R. Ferreira, David Bender, Julian Maller, Pamela Sklar, Paul I. W. de Bakker, Mark J. Daly and Pak C. Sham: PLINK: a toolset for whole-genome association and population-based linkage analysis. (PubMed) American Journal of Human Genetics 81(3):559-75 (2007)
Poa
Partial Order Alignment for multiple sequence alignment
Versions of package poa
ReleaseVersionArchitectures
squeeze2.0+20060928-2amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2.0+20060928-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.0+20060928-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.0+20060928-3amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package poa:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
works-with-formatplaintext
Popcon: 13 users (11 upd.)*
Versions and Archs
License: DFSG free
Svn

POA is Partial Order Alignment, a fast program for multiple sequence alignment (MSA) in bioinformatics. Its advantages are speed, scalability, sensitivity, and the superior ability to handle branching / indels in the alignment. Partial order alignment is an approach to MSA, which can be combined with existing methods such as progressive alignment. POA optimally aligns a pair of MSAs and which therefore can be applied directly to progressive alignment methods such as CLUSTAL. For large alignments, Progressive POA is 10-30 times faster than CLUSTALW.

Populations
population genetic software
Versions of package populations
ReleaseVersionArchitectures
wheezy1.2.33+svn0120106-2.1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.2.33+svn0120106-2.1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.2.33+svn0120106-2.1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package populations:
interfacex11
roleprogram
uitoolkitqt
x11application
Popcon: 4 users (6 upd.)*
Versions and Archs
License: DFSG free

Populations is a population genetic software. It computes genetic distances between populations or individuals. It builds phylogenetic trees (NJ or UPGMA) with bootstrap values.

Predictnls
prediction and analysis of protein nuclear localization signals
Versions of package predictnls
ReleaseVersionArchitectures
jessie1.0.20-1all
sid1.0.20-1all
Popcon: 2 users (2 upd.)*
Versions and Archs
License: DFSG free
Svn

predictnls is a method for the prediction and analysis of protein nuclear localization signals (NLS). In addition to reporting the positions of NLSs found, predictnls also gives short statistics.

Please cite: Murat Cokol, Rajesh Nair and Burkhard Rost: Finding nuclear localization signals.. (PubMed,eprint) EMBO reports 1(5):411-415 (2000)
Predictprotein
suite of protein sequence analysis tools
Versions of package predictprotein
ReleaseVersionArchitectures
jessie1.0.90-1all
sid1.0.90-1all
upstream1.0.91
Popcon: 2 users (2 upd.)*
Newer upstream!
License: DFSG free
Svn

PredictProtein is a sequence analysis suite providing prediction of protein structure and function.

PredictProtein takes a protein sequence as input and provides the following per-residue, or whole protein annotations:

  • secondary structure
  • solvent accessibility
  • multiple sequence alignments
  • PROSITE sequence motifs
  • low-complexity regions
  • nuclear localisation signals
  • regions lacking regular structure (NORS)
  • unstructured loops
  • transmembrane helices
  • transmembrane beta barrels
  • coiled-coil regions
  • disulfide-bonds
  • disordered regions
  • B-value flexibility
  • protein-protein interaction sites
Please cite: Burkhrd Rost, Guy Yachdav and Jinfeng Liu: The PredictProtein server. (PubMed,eprint) Nucleic Acids Research 32(2):W321-W326 (2004)
Primer3
Tool to design flanking oligo nucleotides for DNA amplification
Versions of package primer3
ReleaseVersionArchitectures
squeeze1.1.4-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2.2.3-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.3.5-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.3.5-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package primer3:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
works-with-formatplaintext
Popcon: 38 users (76 upd.)*
Versions and Archs
License: DFSG free
Svn

Primer3 picks primers for Polymerase Chain Reactions (PCRs), considering as criteria oligonucleotide melting temperature, size, GC content and primer-dimer possibilities, PCR product size, positional constraints within the source sequence, and miscellaneous other constraints. All of these criteria are user-specifiable as constraints, and some are specifiable as terms in an objective function that characterizes an optimal primer pair.

Please cite: Steve Rozen and Helen J. Skaletsky: Primer3 on the WWW for general users and for biologist programmers. (PubMed,eprint) Methods Mol Biol. 132(3):365-86 (2000)
Proalign
Probabilistic multiple alignment program
Versions of package proalign
ReleaseVersionArchitectures
jessie0.603-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.603-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 0 users (2 upd.)*
Versions and Archs
License: DFSG free
Svn

ProAlign performs probabilistic sequence alignments using hidden Markov models (HMM). It includes a graphical interface (GUI) allowing to (i) perform alignments of nucleotide or amino-acid sequences, (ii) view the quality of solutions, (iii) filter the unreliable alignment regions and (iv) export alignments to other software.

ProAlign uses a progressive method, such that multiple alignment is created stepwise by performing pairwise alignments in the nodes of a guide tree. Sequences are described with vectors of character probabilities, and each pairwise alignment reconstructs the ancestral (parent) sequence by computing the probabilities of different characters according to an evolutionary model.

Please cite: Ari Löytynoja and Michel C Milinkovitch: A hidden Markov model for progressive multiple alignment. (PubMed,eprint) Bioinformatics 19(12):1505-13 (2003)
Probalign
multiple sequence alignment using partition function posterior probabilities
Versions of package probalign
ReleaseVersionArchitectures
wheezy1.4-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.4-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.4-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 5 users (7 upd.)*
Versions and Archs
License: DFSG free
Svn

Probalign uses partition function posterior probability estimates to compute maximum expected accuracy multiple sequence alignments. It performs statistically significantly better than the leading alignment programs Probcons v1.1, MAFFT v5.851, and MUSCLE v3.6 on BAliBASE 3.0, HOMSTRAD, and OXBENCH benchmarks. Probalign improvements are largest on datasets containing N/C terminal extensions and on datasets with long and heterogeneous length sequences. On heteregeneous length datasets containing repeats Probalign alignment accuracy is 10% and 15% than the other three methods when standard deviation of length is at least 300 and 400.

Please cite: Usman Roshan and Dennis R. Livesay: Probalign: multiple sequence alignment using partition function posterior probabilities. (PubMed,eprint) Bioinformatics 22(22):2715-21 (2006)
Probcons
PROBabilistic CONSistency-based multiple sequence alignment
Versions of package probcons
ReleaseVersionArchitectures
squeeze1.12-4amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy1.12-9amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.12-9amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.12-9amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package probcons:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
usecomparing
works-with-formatplaintext
Popcon: 25 users (27 upd.)*
Versions and Archs
License: DFSG free
Svn

Tool for generating multiple alignments of protein sequences. Using a combination of probabilistic modeling and consistency-based alignment techniques, PROBCONS has achieved the highest accuracies of all alignment methods to date. On the BAliBASE benchmark alignment database, alignments produced by PROBCONS show statistically significant improvement over current programs, containing an average of 7% more correctly aligned columns than those of T-Coffee, 11% more correctly aligned columns than those of CLUSTAL W, and 14% more correctly aligned columns than those of DIALIGN.

Please cite: Chuong B. Do, Mahathi S.P. Mahabhashyam, Michael Brudno and Serafim Batzoglou: ProbCons: Probabilistic consistency-based multiple sequence alignment. (PubMed,eprint) Genome Research 15(2):330-340 (2005)
Proda
multiple alignment of protein sequences
Versions of package proda
ReleaseVersionArchitectures
squeeze1.0-7amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy1.0-8amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.0-8amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.0-8amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package proda:
biologynuceleic-acids, peptidic
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
useanalysing, comparing
works-with-formatplaintext
Popcon: 13 users (10 upd.)*
Versions and Archs
License: DFSG free
Svn

ProDA is a system for automated detection and alignment of homologous regions in collections of proteins with arbitrary domain architectures. Given an input set of unaligned sequences, ProDA identifies all homologous regions appearing in one or more sequences, and returns a collection of local multiple alignments for these regions.

Please cite: Tu Minh Phuong, Chuong B. Do, Robert C. Edgar and Serafim Batzoglou: Multiple alignment of protein sequences with repeats and rearrangements. (PubMed,eprint) Nucl. Acids Res. 34(20):5932-5942 (2006)
Profbval
predictor of flexible/rigid protein residues from sequence
Versions of package profbval
ReleaseVersionArchitectures
jessie1.0.22-1all
sid1.0.22-1all
Popcon: 3 users (1 upd.)*
Versions and Archs
License: DFSG free
Svn

PROFbval can be useful for both protein structure and function predictions. For instance, a biologist can locate potentially antigenic determinants by identifying the most flexible residues on the protein surface. Additionally, a crystallographer can locate residues that potentially have high experimental B-values.

PROFbval takes the following input, further described on profbval(1):

  • a protein sequence in a FASTA file
  • secondary structure and solvent accessibility prediction by prof(1)
  • an HSSP file

Background: the mobility of a given residue on the protein surface is related to its functional role. Therefore, identification of extremely rigid or flexible residues on the protein surface is helpful for identifying functionally important residues in proteins. A common measure of atom mobility in proteins is B-value data from x-ray crystallography structures. PROFbval is the first tool to predict normalized backbone B-values from amino-acid sequence.

Please cite: Avner Schlessinger, Guy Yachdav and Burkhard Rost: PROFbval: predict flexible and rigid residues in proteins.. (PubMed) Bioinformatics 22(7):891-3 (2006)
Profisis
prediction of protein-protein interaction sites from sequence
Versions of package profisis
ReleaseVersionArchitectures
jessie1.0.11-1all
sid1.0.11-1all
Popcon: 2 users (2 upd.)*
Versions and Archs
License: DFSG free
Svn

Profisis (ISIS) identifies interacting protein residues in protein-protein interfaces from sequence alone.

The strongest predictions of the method reached over 90% accuracy in a cross-validation experiment.

Please cite: Yanay Ofran and Burkhard Rost: ISIS: interaction sites identified from sequence. (PubMed,eprint) Bioinformatics 23(2):e13-e16 (2007)
Profnet-bval
neural network architecture for profbval
Versions of package profnet-bval
ReleaseVersionArchitectures
wheezy1.0.21-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
wheezy-proposed-updates1.0.21-1+wheezy1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.0.22-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.0.22-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 8 users (14 upd.)*
Versions and Archs
License: DFSG free
Svn

Profnet is a component of the prediction methods that make up the Predict Protein service by the lab of Burkhard Rost. It provides the neural network component to a variety of predictors that perform protein feature prediction directly from sequence. This neural network implementation has to be compiled for every different network architecture. This package contains the neural network architecture for profbval.

Profnet-chop
neural network architecture for profchop
Versions of package profnet-chop
ReleaseVersionArchitectures
wheezy1.0.21-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
wheezy-proposed-updates1.0.21-1+wheezy1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.0.22-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.0.22-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 6 users (14 upd.)*
Versions and Archs
License: DFSG free
Svn

Profnet is a component of the prediction methods that make up the Predict Protein service by the lab of Burkhard Rost. It provides the neural network component to a variety of predictors that perform protein feature prediction directly from sequence. This neural network implementation has to be compiled for every different network architecture. This package contains the neural network architecture for profchop.

Profnet-con
neural network architecture for profcon
Versions of package profnet-con
ReleaseVersionArchitectures
wheezy1.0.21-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
wheezy-proposed-updates1.0.21-1+wheezy1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.0.22-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.0.22-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 5 users (14 upd.)*
Versions and Archs
License: DFSG free
Svn

Profnet is a component of the prediction methods that make up the Predict Protein service by the lab of Burkhard Rost. It provides the neural network component to a variety of predictors that perform protein feature prediction directly from sequence. This neural network implementation has to be compiled for every different network architecture. This package contains the neural network architecture for profcon.

Profnet-isis
neural network architecture for profisis
Versions of package profnet-isis
ReleaseVersionArchitectures
wheezy1.0.21-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
wheezy-proposed-updates1.0.21-1+wheezy1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.0.22-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.0.22-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 7 users (16 upd.)*
Versions and Archs
License: DFSG free
Svn

Profnet is a component of the prediction methods that make up the Predict Protein service by the lab of Burkhard Rost. It provides the neural network component to a variety of predictors that perform protein feature prediction directly from sequence. This neural network implementation has to be compiled for every different network architecture. This package contains the neural network architecture for profisis.

Profnet-md
neural network architecture for metadisorder
Versions of package profnet-md
ReleaseVersionArchitectures
wheezy1.0.21-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
wheezy-proposed-updates1.0.21-1+wheezy1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.0.22-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.0.22-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 5 users (15 upd.)*
Versions and Archs
License: DFSG free
Svn

Profnet is a component of the prediction methods that make up the Predict Protein service by the lab of Burkhard Rost. It provides the neural network component to a variety of predictors that perform protein feature prediction directly from sequence. This neural network implementation has to be compiled for every different network architecture. This package contains the neural network architecture for metadisorder.

Profnet-norsnet
neural network architecture for norsnet
Versions of package profnet-norsnet
ReleaseVersionArchitectures
wheezy1.0.21-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
wheezy-proposed-updates1.0.21-1+wheezy1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.0.22-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.0.22-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 6 users (16 upd.)*
Versions and Archs
License: DFSG free
Svn

Profnet is a component of the prediction methods that make up the Predict Protein service by the lab of Burkhard Rost. It provides the neural network component to a variety of predictors that perform protein feature prediction directly from sequence. This neural network implementation has to be compiled for every different network architecture. This package contains the neural network architecture for norsnet.

Profnet-prof
neural network architecture for profacc
Versions of package profnet-prof
ReleaseVersionArchitectures
wheezy1.0.21-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
wheezy-proposed-updates1.0.21-1+wheezy1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.0.22-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.0.22-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 10 users (17 upd.)*
Versions and Archs
License: DFSG free
Svn

Profnet is a component of the prediction methods that make up the Predict Protein service by the lab of Burkhard Rost. It provides the neural network component to a variety of predictors that perform protein feature prediction directly from sequence. This neural network implementation has to be compiled for every different network architecture. This package contains the neural network architecture for profsec and profacc.

Profnet-snapfun
neural network architecture for snapfun
Versions of package profnet-snapfun
ReleaseVersionArchitectures
wheezy1.0.21-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
wheezy-proposed-updates1.0.21-1+wheezy1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.0.22-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.0.22-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 6 users (15 upd.)*
Versions and Archs
License: DFSG free
Svn

Profnet is a component of the prediction methods that make up the Predict Protein service by the lab of Burkhard Rost. It provides the neural network component to a variety of predictors that perform protein feature prediction directly from sequence. This neural network implementation has to be compiled for every different network architecture. This package contains the neural network architecture for snapfun.

Profphd
secondary structure and solvent accessibility predictor
Versions of package profphd
ReleaseVersionArchitectures
wheezy1.0.39-1all
jessie1.0.39-1all
sid1.0.39-1all
upstream1.0.40
Popcon: 7 users (7 upd.)*
Newer upstream!
License: DFSG free
Svn

This package provides prof(1), the protein secondary structure, accessibility and transmembrane helix predictor from Burkhard Rost. Prediction is either done from protein sequence alone or from an alignment - the latter should be used for optimal performance.

How well does prof(1) perform?

  • Secondary structure is predicted at an expected average accuracy > 72% for the three states helix, strand and loop.

  • Solvent accessibility is predicted at a correlation coefficient (correlation between experimentally observed and predicted relative solvent accessibility) of 0.54

  • Transmembrane helix prediction has an expected per-residue accuracy of about 95%. The number of false positives, i.e., transmembrane helices predicted in globular proteins, is about 2%.

Please cite: B. Rost and C. Sander: Combining evolutionary information and neural networks to predict protein secondary structure.. (PubMed) Proteins 19(1):55-72 (1994)
Screenshots of package profphd
Profphd-net
neural network architecture for profphd
Versions of package profphd-net
ReleaseVersionArchitectures
wheezy1.0.21-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
wheezy-proposed-updates1.0.21-1+wheezy1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.0.22-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.0.22-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 10 users (17 upd.)*
Versions and Archs
License: DFSG free
Svn

Profnet is a component of the prediction methods that make up the Predict Protein service by the lab of Burkhard Rost. It provides the neural network component to a variety of predictors that perform protein feature prediction directly from sequence. This neural network implementation has to be compiled for every different network architecture. This package contains the neural network architecture for profphd.

Profphd-utils
profphd helper utilities convert_seq and filter_hssp
Versions of package profphd-utils
ReleaseVersionArchitectures
wheezy1.0.9-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.0.10-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.0.10-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 10 users (16 upd.)*
Versions and Archs
License: DFSG free
Svn

The package provides the following binary utilities: convert_seq, filter_hssp. These are used by prof from the profphd package: a secondary structure, accessibility and transmembrane helix predictor from Burkhard Rost.

Proftmb
per-residue prediction of bacterial transmembrane beta barrels
Versions of package proftmb
ReleaseVersionArchitectures
wheezy1.1.10-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.1.12-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.1.12-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 6 users (15 upd.)*
Versions and Archs
License: DFSG free
Svn

proftmb predicts transmembrane beta-barrel (TMB) proteins in Gram-negative bacteria.

For each query protein, proftmb provides both a Z-value indicating that the protein actually contains a membrane barrel, and a four-state per-residue labeling of upward- and downward-facing strands, periplasmic hairpins and extracellular loops.

The package is enhanced by the following packages: proftmb-dbg
Please cite: H. Bigelow and B. Rost: PROFtmb: a web server for predicting bacterial transmembrane beta barrel proteins.. (PubMed) Nucleic Acids Res 34(Web Server issue):W186-8 (2006)
Pymol
Molecular Graphics System
Versions of package pymol
ReleaseVersionArchitectures
squeeze1.2r2-1.1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy1.5.0.1-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.5.0.1-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.5.0.1-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream1.6~beta1
Debtags of package pymol:
fieldbiology:structural, chemistry
interface3d, x11
roleprogram
scopeutility
uitoolkittk
uselearning, viewing
works-withimage
x11application
Popcon: 462 users (59 upd.)*
Newer upstream!
License: DFSG free
Svn

PyMOL is a molecular graphics system targeted at medium to large biomolecules like proteins. It can generate high-quality publication-ready molecular graphics images and animations.

Features include:

  • Visualization of molecules, molecular trajectories and surfaces of crystallography data or orbitals
  • Molecular builder and sculptor
  • Internal raytracer and movie generator
  • Fully extensible and scriptable via a Python interface

File formats PyMOL can read include PDB, XYZ, CIF, MDL Molfile, ChemDraw, CCP4 maps, XPLOR maps and Gaussian cube maps.

Other screenshots of package pymol
VersionURL
1.2r2-1http://screenshots.debian.net/screenshots/p/pymol/4276_large.png
Screenshots of package pymol
Pynast
alignment of short DNA sequences
Versions of package pynast
ReleaseVersionArchitectures
wheezy1.1-3all
jessie1.1-4all
sid1.1-4all
Popcon: 6 users (10 upd.)*
Versions and Archs
License: DFSG free
Svn

The package provices a reimplementation of the Nearest Alignment Space Termination tool in Python. It was prepared for next generation sequencers.

Given a set of sequences and a template alignment, PyNAST will align the input sequences against the template alignment, and return a multiple sequence alignment which contains the same number of positions (or columns) as the template alignment. This facilitates the analysis of new sequences in the context of existing alignments, and additional data derived from existing alignments such as phylogenetic trees. Because any protein or nucleic acid sequences and template alignments can be provided, PyNAST is not limited to the analysis of 16s rDNA sequences.

Python-cogent
framework for genomic biology
Versions of package python-cogent
ReleaseVersionArchitectures
squeeze1.4.1-1.2 (non-free)amd64,armel,i386,ia64,mips,mipsel,powerpc,s390,sparc
wheezy1.5.1-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.5.1-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.5.3-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package python-cogent:
biologypeptidic
devellang:python
fieldbiology
roledevel-lib
useanalysing, comparing
Popcon: 8 users (12 upd.)*
Versions and Archs
License: DFSG free
Svn

PyCogent is a software library for genomic biology. It is a fully integrated and thoroughly tested framework for:

  • controlling third-party applications,
  • devising workflows; querying databases,
  • conducting novel probabilistic analyses of biological sequence evolution, and
  • generating publication quality graphics. It is distinguished by many unique built-in capabilities (such as true codon alignment) and the frequent addition of entirely new methods for the analysis of genomic data.
Please cite: Rob Knight, Peter Maxwell, Amanda Birmingham, Jason Carnes, J Gregory Caporaso, Brett C Easton, Michael Eaton, Micah Hamady, Helen Lindsay, Zongzhi Liu, Catherine Lozupone, Daniel McDonald, Michael Robeson, Raymond Sammut, Sandra Smit, Matthew J Wakefield, Jeremy Widmann, Shandy Wikman, Stephanie Wilson, Hua Ying and Gavin A Huttley: PyCogent: a toolkit for making sense from sequence. (PubMed,eprint) Genome Biology 8(8):R171 (2007)
Qiime
Quantitative Insights Into Microbial Ecology
Versions of package qiime
ReleaseVersionArchitectures
wheezy1.4.0-2amd64,armel,armhf,i386,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.4.0-2amd64,armel,armhf,i386,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.5.0-2amd64,armel,armhf,hurd-i386,i386,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package qiime:
roleprogram
Popcon: 7 users (10 upd.)*
Versions and Archs
License: DFSG free
Svn

QIIME (canonically pronounced ‘Chime’) is a pipeline for performing microbial community analysis that integrates many third party tools which have become standard in the field. A standard QIIME analysis begins with sequence data from one or more sequencing platforms, including

  • Sanger,
  • Roche/454, and
  • Illumina GAIIx. With all the underlying tools installed, of which not all are yet available in Debian (or any other Linux distribution), QIIME can perform

  • library de-multiplexing and quality filtering;

  • denoising with PyroNoise;
  • OTU and representative set picking with uclust, cdhit, mothur, BLAST, or other tools;
  • taxonomy assignment with BLAST or the RDP classifier;
  • sequence alignment with PyNAST, muscle, infernal, or other tools;
  • phylogeny reconstruction with FastTree, raxml, clearcut, or other tools;
  • alpha diversity and rarefaction, including visualization of results, using over 20 metrics including Phylogenetic Diversity, chao1, and observed species;
  • beta diversity and rarefaction, including visualization of results, using over 25 metrics including weighted and unweighted UniFrac, Euclidean distance, and Bray-Curtis;
  • summarization and visualization of taxonomic composition of samples using pie charts and histograms and many other features.

QIIME includes parallelization capabilities for many of the computationally intensive steps. By default, these are configured to utilize a mutli-core environment, and are easily configured to run in a cluster environment. QIIME is built in Python using the open-source PyCogent toolkit. It makes extensive use of unit tests, and is highly modular to facilitate custom analyses.

R-bioc-cummerbund
tool for analysis of Cufflinks RNA-Seq output
Versions of package r-bioc-cummerbund
ReleaseVersionArchitectures
wheezy1.2.0-1all
jessie1.2.0-1all
sid1.2.0-2all
upstream2.2.0
Popcon: 12 users (19 upd.)*
Newer upstream!
License: DFSG free
Git

Allows for persistent storage, access, exploration, and manipulation of Cufflinks high-throughput sequencing data. In addition, provides numerous plotting functions for commonly used visualizations.

Please cite: L. Goff and C. Trapnell: cummeRbund: Analysis, exploration, manipulation, and visualization of Cufflinks high-throughput sequencing data (2012)
R-bioc-hilbertvis
GNU R package to visualise long vector data
Versions of package r-bioc-hilbertvis
ReleaseVersionArchitectures
squeeze1.5.0-2amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy1.14.0-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.18.0-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.18.0-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package r-bioc-hilbertvis:
biologynuceleic-acids
fieldbiology, biology:bioinformatics
useanalysing
Popcon: 20 users (27 upd.)*
Versions and Archs
License: DFSG free
Svn

This tool allows one to display very long data vectors in a space-efficient manner, by organising it along a 2D Hilbert curve. The user can then visually judge the large scale structure and distribution of features simultaenously with the rough shape and intensity of individual features.

In bioinformatics, a typical use case is ChIP-Chip and ChIP-Seq, or basically all the kinds of genomic data, that are conventionally displayed as quantitative track ("wiggle data") in genome browsers such as those provided by Ensembl or UCSC.

Please cite: Simon Anders: Visualization of genomic data with the Hilbert curve. (PubMed,eprint) Bioinformatics 25(10):1231-1235 (2009)
Remark of Debian Med team: It would be interesting to package HilbertVisGUI (see below) as well.
R-cran-genabel
GNU R package for genome-wide SNP association analysis
Versions of package r-cran-genabel
ReleaseVersionArchitectures
wheezy1.7-0-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.7-4-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.7-4-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream1.7-6
Popcon: 22 users (41 upd.)*
Newer upstream!
License: DFSG free
Svn

The package offers the R library GenABEL for the hunt of genetic contributions to a disease (or any other pheonypical trait) by so called genome-wide association analysis. Additional input commonly comes from DNA mircoarray experiments, performed on every individual, that determine differences (polymorphisms) in the population. GenABEL finds associations between quantitative or binary traits and single-nucleiotide polymorphisms (SNPs).

When publishing work based to which this package has contributed please cite: Yurii S. Aulchenko, Stephan Ripke, Aaron Isaacs, Cornelia M. van Duijn (2007) "GenABEL: an R library for genome-wide association analysis" Bioinformatics 2007 23(10):1294-1296.

Please cite: Yurii S. Aulchenko, Stephan Ripke, Aaron Isaacs and Cornelia M. van Duijn: GenABEL: an R library for genome-wide association analysis. (PubMed,eprint) Bioinformatics 23(10):1294-1296 (2007)
R-cran-qtl
GNU R package for genetic marker linkage analysis
Versions of package r-cran-qtl
ReleaseVersionArchitectures
squeeze1.16-6-2amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy1.23-16-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.25-15-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.27-10-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package r-cran-qtl:
devellang:r, library
fieldbiology, statistics
roleapp-data
suitegnu
Popcon: 50 users (115 upd.)*
Versions and Archs
License: DFSG free
Svn

R/qtl is an extensible, interactive environment for mapping quantitative trait loci (QTLs) in experimental crosses. It is implemented as an add-on-package for the freely available and widely used statistical language/software R (see http://www.r-project.org).

The development of this software as an add-on to R allows one to take advantage of the basic mathematical and statistical functions, and powerful graphics capabilities, that are provided with R. Further, the user will benefit by the seamless integration of the QTL mapping software into a general statistical analysis program. The goal is to make complex QTL mapping methods widely accessible and allow users to focus on modeling rather than computing.

A key component of computational methods for QTL mapping is the hidden Markov model (HMM) technology for dealing with missing genotype data. The main HMM algorithms were implemented, with allowance for the presence of genotyping errors, for backcrosses, intercrosses, and phase-known four-way crosses.

The current version of R/qtl includes facilities for estimating genetic maps, identifying genotyping errors, and performing single-QTL genome scans and two-QTL, two-dimensional genome scans, by interval mapping (with the EM algorithm), Haley-Knott regression, and multiple imputation. All of this may be done in the presence of covariates (such as sex, age or treatment). One may also fit higher-order QTL models by multiple imputation.

Please cite: Karl W. Broman, Hao Wu, Saunak Sen and Gary A. Churchill: R/qtl: QTL mapping in experimental crosses. (PubMed,eprint) Bioinformatics 19:889-890 (2003)
R-cran-vegan
Community Ecology Package for R
Versions of package r-cran-vegan
ReleaseVersionArchitectures
wheezy2.0-3-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.0-7-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.0-7-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 17 users (19 upd.)*
Versions and Archs
License: DFSG free
Git

R package for community ecologists. It contains most multivariate analysis needed in analysing ecological communities, and tools for diversity analysis. Most diversity methods assume that data are counts of individuals.

These tools are sometimes used outside the field of ecology, for instance to study populations of white blood cells or RNA molecules.

R-other-bio3d
GNU R package for biological structure analysis
Versions of package r-other-bio3d
ReleaseVersionArchitectures
squeeze1.0-8-1all
wheezy1.1-4-1all
jessie1.1-5-1all
sid1.1-5-1all
Debtags of package r-other-bio3d:
fieldbiology, biology:bioinformatics, biology:structural
interfacecommandline, x11
roleshared-lib
scopeapplication
useanalysing, comparing, viewing
works-with3dmodel
works-with-formatplaintext
Popcon: 22 users (38 upd.)*
Versions and Archs
License: DFSG free
Svn

The bio3d package contains utilities to process, organize and explore protein structure, sequence and dynamics data. Features include the ability to read and write structure, sequence and dynamic trajectory data, perform atom summaries, atom selection, re-orientation, superposition, rigid core identification, clustering, torsion analysis, distance matrix analysis, structure and sequence conservation analysis, and principal component analysis (PCA). In addition, various utility functions are provided to enable the statistical and graphical power of the R environment to work with biological sequence and structural data.

Please cite: Barry J. Grant, Ana P. C. Rodrigues, Karim M. ElSawy, J. Andrew McCammon and Leo S. D. Caves: Bio3d: an R package for the comparative analysis of protein structures. (PubMed,eprint) Bioinformatics 22(21):2695-2696 (2006)
R-other-mott-happy
GNU R package for fine-mapping complex diseases
Versions of package r-other-mott-happy
ReleaseVersionArchitectures
squeeze2.1-4amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2.1-7amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.1-7amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.1-7amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream2.3
Debtags of package r-other-mott-happy:
fieldbiology, biology:bioinformatics
useanalysing
Popcon: 19 users (18 upd.)*
Newer upstream!
License: DFSG free
Svn

Happy is an R interface into the HAPPY C package for fine-mapping Quantitative Trait Loci (QTL) in Heterogenous Stocks (HS). An HS is an advanced intercross between (usually eight) founder inbred strains of mice. HS are suitable for fine-mapping QTL. It uses a multipoint analysis which offers significant improvements in statistical power to detect QTLs over that achieved by single-marker association.

The happy package is an extension of the original C program happy; it uses the C code to compute the probability of descent from each of the founders, at each locus position, but the happy packager allows a much richer range of models to be fit to the data.

Read /usr/share/doc/r-other-mott-happy/README.Debian for a more detailed explanation.

Please cite: Richard Mott, Christopher J. Talbot, Maria G. Turri, Allan C. Collins and Jonathan Flint: A method for fine mapping quantitative trait loci in outbred animal stocks. (PubMed,eprint) Proc. Natl. Acad. Sci. USA 97(23):12649-12654 (2000)
Remark of Debian Med team: Do not upgrade this package

Steffen Möller is scientifically collaborating with upstream and nothing shall happen to that package until authors give their confirmation.

Rasmol
Visualize biological macromolecules
Versions of package rasmol
ReleaseVersionArchitectures
squeeze2.7.5-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2.7.5.2-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.7.5.2-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.7.5.2-1sparc
sid2.7.5.2-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x
Debtags of package rasmol:
fieldchemistry
interfacex11
roleprogram
scopeutility
uitoolkitgtk
uselearning, viewing
x11application
Popcon: 389 users (50 upd.)*
Versions and Archs
License: DFSG free
Git

RasMol is a molecular graphics program intended for the visualisation of proteins, nucleic acids and small molecules. The program is aimed at display, teaching and generation of publication quality images.

The program reads in a molecule coordinate file and interactively displays the molecule on the screen in a variety of colour schemes and molecule representations. Currently available representations include depth-cued wireframes, 'Dreiding' sticks, spacefilling (CPK) spheres, ball and stick, solid and strand biomolecular ribbons, atom labels and dot surfaces.

Supported input file formats include Protein Data Bank (PDB), Tripos Associates' Alchemy and Sybyl Mol2 formats, Molecular Design Limited's (MDL) Mol file format, Minnesota Supercomputer Center's (MSC) XYZ (XMol) format, CHARMm format, CIF format and mmCIF format files.

This package installs two versions of RasMol, rasmol-gtk has a modern GTK-based user interface and rasmol-classic is the version with the old Xlib GUI.

The package is enhanced by the following packages: rasmol-doc
Screenshots of package rasmol
Raster3d
tools for generating images of proteins or other molecules
Versions of package raster3d
ReleaseVersionArchitectures
squeeze2.9-1-1 (non-free)amd64,armel,i386,ia64,mips,mipsel,powerpc,s390,sparc
wheezy3.0-2-4amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie3.0-2-4amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid3.0-2-4amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package raster3d:
fieldbiology, biology:structural
interfacecommandline
roleprogram
scopeapplication
useconverting, viewing
works-with3dmodel, image, image:raster
works-with-formatjpg, png
Popcon: 27 users (11 upd.)*
Versions and Archs
License: DFSG free
Svn

Raster3D is a set of tools for generating high quality raster images of proteins or other molecules. The core program renders spheres, triangles, cylinders, and quadric surfaces with specular highlighting, Phong shading, and shadowing. It uses an efficient software Z-buffer algorithm which is independent of any graphics hardware. Ancillary programs process atomic coordinates from PDB files into rendering descriptions for pictures composed of ribbons, space-filling atoms, bonds, ball+stick, etc. Raster3D can also be used to render pictures composed in other programs such as Molscript in glorious 3D with highlights, shadowing, etc. Output is to pixel image files with 24 bits of color information per pixel.

Please cite: E.A. Merritt and D.J. Bacon: Raster3D Photorealistic Molecular Graphics. (PubMed) Methods in Enzymology 277:505-524 (1997)
Raxml
Randomized Axelerated Maximum Likelihood of phylogenetic trees
Versions of package raxml
ReleaseVersionArchitectures
wheezy7.2.8-2amd64,i386,kfreebsd-amd64,kfreebsd-i386
jessie7.2.8-2amd64,i386,kfreebsd-amd64,kfreebsd-i386
sid7.2.8-2amd64,i386,kfreebsd-amd64,kfreebsd-i386
Debtags of package raxml:
fieldbiology
roleprogram
Popcon: 8 users (7 upd.)*
Versions and Archs
License: DFSG free
Svn

RAxML is a program for sequential and parallel Maximum Likelihood-based inference of large phylogenetic trees. It has originally been derived from fastDNAml.

Please cite: Alexandros Stamatakis: RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. (PubMed,eprint) Bioinformatics 22(21):2688-2690 (2006)
Ray
parallel genome assemblies for parallel DNA sequencing
Versions of package ray
ReleaseVersionArchitectures
jessie2.1.0-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.1.0-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream2.2.0
Popcon: 2 users (0 upd.)*
Newer upstream!
License: DFSG free
Git

Ray is a parallel software that computes de novo genome assemblies with next-generation sequencing data. Ray is written in C++ and can run in parallel on numerous interconnected computers using the message-passing interface (MPI) standard. Included:

  • Ray de novo assembly of single genomes
  • Ray Méta de novo assembly of metagenomes
  • Ray Communities microbe abundance + taxonomic profiling
  • Ray Ontologies gene ontology profiling
Please cite: Sébastien Boisvert, François Laviolette and Jacques Corbeil: Ray: Simultaneous Assembly of Reads from a Mix of High-Throughput Sequencing Technologies. (PubMed,eprint) Journal of Computational Biology 17(11):1519-1533 (2010)
Readseq
Conversion between sequence formats
Versions of package readseq
ReleaseVersionArchitectures
squeeze1-8amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy1-9amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1-9amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1-9amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package readseq:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
useconverting
works-with-formatplaintext
Popcon: 15 users (17 upd.)*
Versions and Archs
License: DFSG free
Svn

Reads and writes nucleic/protein sequences in various formats. Data files may have multiple sequences. Readseq is particularly useful as it automatically detects many sequence formats, and converts between them.

Please cite: Don Gilbert: Sequence file format conversion with command-line readseq. (PubMed,eprint) Current Protocols in Bioinformatics Appendix 1:E (2003)
Reprof
protein secondary structure and accessibility predictor
Versions of package reprof
ReleaseVersionArchitectures
wheezy1.0.1-1all
jessie1.0.1-1all
sid1.0.1-1all
Popcon: 5 users (7 upd.)*
Versions and Archs
License: DFSG free
Svn

'reprof' is an improved implementation of 'prof', a popular protein secondary structure and accessibility predictor. Prediction is either done from protein sequence alone or from an alignment - the latter should be used for optimal performance.

This package provides the 'reprof' command. It is only a command line interface to the functionality provided by the modules in librg-reprof-bundle-perl.

Rnahybrid
Fast and effective prediction of microRNA/target duplexes
Versions of package rnahybrid
ReleaseVersionArchitectures
squeeze2.1-2amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2.1.1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.1.1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.1.1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package rnahybrid:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
useanalysing
Popcon: 9 users (9 upd.)*
Versions and Archs
License: DFSG free
Svn

RNAhybrid is a tool for finding the minimum free energy hybridisation of a long and a short RNA. The hybridisation is performed in a kind of domain mode, ie. The short sequence is hybridised to the best fitting part of the long one. The tool is primarily meant as a means for microRNA target prediction.

Please cite: Marc Rehmsmeier, Peter Steffen, Matthias Höchsmann and Robert Giegerich: Fast and effective prediction of microRNA/target duplexes. (PubMed,eprint) RNA 10(10):1507-1517 (2004)
Samtools
processing sequence alignments in SAM and BAM formats
Versions of package samtools
ReleaseVersionArchitectures
squeeze0.1.8-1amd64,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390
wheezy0.1.18-1amd64,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390
jessie0.1.19-1amd64,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x
sid0.1.19-1amd64,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x
Debtags of package samtools:
fieldbiology
interfacecommandline
networkclient
roleprogram
scopeutility
uitoolkitncurses
useanalysing, calculating, filtering
works-withbiological-sequence
Popcon: 52 users (64 upd.)*
Versions and Archs
License: DFSG free
Git

Samtools is a set of utilities that manipulate nucleotide sequence alignments in the binary BAM format. It imports from and exports to the ascii SAM (Sequence Alignment/Map) format, does sorting, merging and indexing, and allows to retrieve reads in any regions swiftly. It is designed to work on a stream, and is able to open a BAM (not SAM) file on a remote FTP or HTTP server.

The package is enhanced by the following packages: libbio-samtools-perl
Sibsim4
align expressed RNA sequences on a DNA template
Versions of package sibsim4
ReleaseVersionArchitectures
squeeze0.20-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy0.20-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie0.20-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.20-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package sibsim4:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
usecomparing, searching
works-with-formatplaintext
Popcon: 11 users (6 upd.)*
Versions and Archs
License: DFSG free
Svn

The SIBsim4 project is based on sim4, which is a program designed to align an expressed DNA sequence with a genomic sequence, allowing for introns. SIBsim4 is a fairly extensive rewrite of the original code with the following goals:

  • speed improvement;
  • allow large, chromosome scale, DNA sequences to be used;
  • provide more detailed output about splice types;
  • provide more detailed output about polyA sites;
  • misc code cleanups and fixes.
Sigma-align
Simple greedy multiple alignment of non-coding DNA sequences
Versions of package sigma-align
ReleaseVersionArchitectures
squeeze1.1.3-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy1.1.3-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.1.3-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.1.3-3amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package sigma-align:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
useanalysing, comparing
works-with-formatplaintext
Popcon: 12 users (10 upd.)*
Versions and Archs
License: DFSG free
Svn

Sigma (“Simple greedy multiple alignment”) is an alignment program. It's algorithm and scoring scheme are designed specifically for non-coding DNA sequence.

It uses a strategy of seeking the best possible gapless local alignments. This happens at each step making the best possible alignment consistent with existing alignments. It scores the significance of the alignment based on the lengths of the aligned fragments and a background model. These may be supplied or estimated from an auxiliary file of intergenic DNA.

Please cite: Siddharthan, Rahul: Sigma: multiple alignment of weakly-conserved non-coding DNA sequence. (PubMed) BMC Bioinformatics 7(1):143 (2006)
Sim4
tool for aligning cDNA and genomic DNA
Versions of package sim4
ReleaseVersionArchitectures
squeeze0.0.20030921-3amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy0.0.20030921-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie0.0.20030921-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.0.20030921-3amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream0.0.20121010
Debtags of package sim4:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
usecomparing, searching
works-with-formatplaintext
Popcon: 27 users (30 upd.)*
Newer upstream!
License: DFSG free
Svn

sim4 is a similarity-based tool for aligning an expressed DNA sequence (EST, cDNA, mRNA) with a genomic sequence for the gene. It also detects end matches when the two input sequences overlap at one end (i.e., the start of one sequence overlaps the end of the other).

sim4 employs a blast-based technique to first determine the basic matching blocks representing the "exon cores". In this first stage, it detects all possible exact matches of W-mers (i.e., DNA words of size W) between the two sequences and extends them to maximal scoring gap-free segments. In the second stage, the exon cores are extended into the adjacent as-yet-unmatched fragments using greedy alignment algorithms, and heuristics are used to favor configurations that conform to the splice-site recognition signals (GT-AG, CT-AC). If necessary, the process is repeated with less stringent parameters on the unmatched fragments.

Soapdenovo
short-read assembly method to build de novo draft assembly
Versions of package soapdenovo
ReleaseVersionArchitectures
jessie1.05-1amd64,ia64,kfreebsd-amd64
sid1.05-1amd64,ia64,kfreebsd-amd64
upstream2.2.3
Debtags of package soapdenovo:
fieldbiology
roleprogram
Popcon: 1 users (0 upd.)*
Newer upstream!
License: DFSG free
Svn

SOAPdenovo is a novel short-read assembly method that can build a de novo draft assembly for the human-sized genomes. The program is specially designed to assemble Illumina GA short reads.

It creates new opportunities for building reference sequences and carrying out accurate analyses of unexplored genomes in a cost effective way.

Please cite: Ruiqiang Li, Hongmei Zhu, Jue Ruan, Wubin Qian, Xiaodong Fang, Zhongbin Shi, Yingrui Li, Shengting Li, Gao Shan, Karsten Kristiansen, Songgang Li, Huanming Yang, Jian Wang and Jun Wang: De novo assembly of human genomes with massively parallel short read sequencing. (PubMed,eprint) Genome Research 20(2):265-72 (2009)
Squizz
Converter for genetic sequences and alignments
Versions of package squizz
ReleaseVersionArchitectures
wheezy0.99a-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie0.99b+dfsg-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.99b+dfsg-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package squizz:
fieldbiology:bioinformatics
interfacecommandline
roleprogram
scopeutility
useconverting
Popcon: 7 users (15 upd.)*
Versions and Archs
License: DFSG free
Svn

Squizz is a sequence/alignment format checker, but it has some conversion capabilities too. Most common sequence and alignment formats are supported :

  • EMBL, FASTA, GCG, GDE, GENBANK, IG, NBRF, PIR (codata), RAW, and SWISSPROT.
  • CLUSTAL, FASTA, MEGA, MSF, NEXUS, PHYLIP (interleaved and sequential) and STOCKHOLM.
Sra-toolkit
utilities for the NCBI Sequence Read Archive
Versions of package sra-toolkit
ReleaseVersionArchitectures
wheezy2.1.7a-1amd64,i386,kfreebsd-amd64,kfreebsd-i386
jessie2.1.7a-1amd64,i386,kfreebsd-amd64,kfreebsd-i386
sid2.1.7a-1amd64,i386,kfreebsd-amd64,kfreebsd-i386
upstream2.3.2-4
Popcon: 8 users (9 upd.)*
Newer upstream!
License: DFSG free
Git

Tools for reading the SRA archive, generally by converting individual runs into some commonly used format such as fastq.

The textual dumpers "sra-dump" and "vdb-dump" are provided in this release as an aid in visual inspection. It is likely that their actual output formatting will be changed in the near future to a stricter, more formalized representation[s]. PLEASE DO NOT RELY UPON THE OUTPUT FORMAT SEEN IN THIS RELEASE.

The "help" information will be improved in near future releases, and the tool options will become standardized across the set. More documentation will also be provided documentation on the NCBI web site.

Tool options may change in the next release. Version 1 tool options will remain supported wherever possible in order to preserve operation of any existing scripts.

Please cite: Rasko Leinonen, Ruth Akhtar, Ewan Birney, James Bonfield, Lawrence Bower, Matt Corbett, Ying Cheng, Fehmi Demiralp, Nadeem Faruque, Neil Goodgame, Richard Gibson, Gemma Hoad, Christopher Hunter, Mikyung Jang, Steven Leonard, Quan Lin, Rodrigo Lopez, Michael Maguire, Hamish McWilliam, Sheila Plaister, Rajesh Radhakrishnan, Siamak Sobhany, Guy Slater, Petra Ten Hoopen, Franck Valentin, Robert Vaughan, Vadim Zalunin, Daniel Zerbino and Guy Cochrane: Improvements to services at the European Nucleotide Archive. (PubMed,eprint) Nucleic Acids Research 38(Database issue):D39-45 (2010)
Ssake
genomics application for assembling millions of very short DNA sequences
Versions of package ssake
ReleaseVersionArchitectures
squeeze3.5-1all
wheezy3.8-2all
jessie3.8-2all
sid3.8-2all
Debtags of package ssake:
biologynuceleic-acids
fieldbiology
interfaceshell
roleprogram
scopeutility
useanalysing
Popcon: 9 users (10 upd.)*
Versions and Archs
License: DFSG free
Svn

The Short Sequence Assembly by K-mer search and 3′ read Extension (SSAKE) is a genomics application for aggressively assembling millions of short nucleotide sequences by progressively searching for perfect 3′-most k-mers using a DNA prefix tree. SSAKE is designed to help leverage the information from short sequences reads by stringently clustering them into contigs that can be used to characterize novel sequencing targets.

Please cite: Rene L. Warren, Granger G. Sutton, Steven J. M. Jones and Robert A. Holt: Assembling millions of short DNA sequences using SSAKE. (PubMed,eprint) Bioinformatics 23(4):500-501 (2007)
Staden-io-lib-utils
programs for maniuplating DNA sequencing files
Versions of package staden-io-lib-utils
ReleaseVersionArchitectures
squeeze1.12.4-1amd64,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mipsel,s390
wheezy1.12.4-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.12.4-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.12.4-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream1.13.1
Debtags of package staden-io-lib-utils:
biologynuceleic-acids, peptidic
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
useanalysing
Popcon: 11 users (6 upd.)*
Newer upstream!
License: DFSG free
Svn

The io_lib from the Staden package is a library of file reading and writing code to provide a general purpose trace file (and Experiment File) reading interface. It has been compiled and tested on a variety of unix systems, MacOS X and MS Windows.

This package contains the programs that are distributed with the Staden io_lib for manipulating and converting sequencing data files, and in particular files to maniuplate short reads generated by second and third generation sequencers and stored in SRF format.

T-coffee
Multiple Sequence Alignment
Versions of package t-coffee
ReleaseVersionArchitectures
squeeze8.84-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy9.02.r1228-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie9.02.r1228-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid9.02.r1228-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream9.03.r1318
Debtags of package t-coffee:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
useanalysing, comparing
works-with-formatplaintext
Popcon: 32 users (40 upd.)*
Newer upstream!
License: DFSG free
Svn

T-Coffee is a multiple sequence alignment package. Given a set of sequences (Proteins or DNA), T-Coffee generates a multiple sequence alignment. Version 2.00 and higher can mix sequences and structures.

T-Coffee allows the combination of a collection of multiple/pairwise, global or local alignments into a single model. It also allows to estimate the level of consistency of each position within the new alignment with the rest of the alignments. See the pre-print for more information

T-Coffee has a special called M-Coffee that makes it possible to combine the output of many multiple sequence alignment packages. In its published version, it uses MUSCLE, PROBCONS, POA, DiAlign-TS, MAFFT, Clustal W, PCMA and T-Coffee. A special version has been made for Debian, DM-Coffee, that uses only free software by replacing Clustal W by Kalign. Using the 8 Methods of M-Coffee can sometimes be a bit heavy. You can use a subset of your favorite methods if you prefer.

The package is enhanced by the following packages: clustalw kalign mafft tm-align dialign-tx probcons muscle ncbi-blast+ poa
Please cite: Cédric Notredame, Desmond G. Higgins and Jaap Heringa: T-coffee: a novel method for fast and accurate multiple sequence alignment. (PubMed) Journal of Molecular Biology 302(1):205-217 (2000)
Tabix
generic indexer for TAB-delimited genome position files
Versions of package tabix
ReleaseVersionArchitectures
wheezy0.2.6-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie0.2.6-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.2.6-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package tabix:
fieldbiology
interfacecommandline
networkclient
roleprogram
scopeutility
usecompressing, filtering
works-withtext
Popcon: 13 users (7 upd.)*
Versions and Archs
License: DFSG free
Git

Tabix indexes files where some columns indicate sequence coordinates: name (usually a chromosme), start and stop. The input data file must be position sorted and compressed by bgzip (provided in this package), which has a gzip like interface. After indexing, tabix is able to quickly retrieve data lines by chromosomal coordinates. Fast data retrieval also works over network if an URI is given as a file name.

Please cite: Li, Heng: Tabix: Fast retrieval of sequence features from generic TAB-delimited files. (PubMed,eprint) Bioinformatics 27(5):718-9 (2011)
Theseus
superimpose macromolecules using maximum likelihood
Versions of package theseus
ReleaseVersionArchitectures
squeeze1.4.3-2amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy1.6.2-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.6.2-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.6.2-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
experimental2.0.1-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream2.0.3
Debtags of package theseus:
biologypeptidic
fieldbiology, biology:bioinformatics, biology:structural
interfacecommandline
roleprogram
useanalysing, comparing
works-with-formatplaintext
Popcon: 10 users (10 upd.)*
Newer upstream!
License: DFSG free
Svn

Theseus is a program that simultaneously superimposes multiple macromolecular structures. Theseus finds the optimal solution to the superposition problem using the method of maximum likelihood. By down-weighting variable regions of the superposition and by correcting for correlations among atoms, the ML superposition method produces very accurate structural alignments.

When macromolecules with different residue sequences are superimposed, other programs and algorithms discard residues that are aligned with gaps. Theseus, however, uses a novel superimposition algorithm that includes all of the data.

Please cite: Douglas L. Theobald and Deborah S. Wuttke: THESEUS: maximum likelihood superpositioning and analysis of macromolecular structures. (eprint) Bioinformatics 22(17):2171-2172 (2006)
Tigr-glimmer
Gene detection in archea and bacteria
Versions of package tigr-glimmer
ReleaseVersionArchitectures
squeeze3.02-2amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy3.02-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie3.02-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid3.02-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream302b
Debtags of package tigr-glimmer:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
usesearching
works-with-formatplaintext
Popcon: 23 users (20 upd.)*
Newer upstream!
License: DFSG free
Svn

Developed by the TIGR institute this software detects coding sequences in bacteria and archea.

Glimmer is a system for finding genes in microbial DNA, especially the genomes of bacteria and archaea. Glimmer (Gene Locator and Interpolated Markov Modeler) uses interpolated Markov models (IMMs) to identify the coding regions and distinguish them from noncoding DNA.

Please cite: Steven L. Salzberg, Arthur L. Delcher, S. Kasif and O. White: Microbial gene identification using interpolated Markov models. (PubMed,eprint) Nucleic Acids Research 26(2):544-8 (1998)
Tm-align
structual alignment of proteins
Versions of package tm-align
ReleaseVersionArchitectures
wheezy20120507-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie20120507-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid20120507-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
experimental20120707-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream20130511
Debtags of package tm-align:
roleprogram
Popcon: 18 users (22 upd.)*
Newer upstream!
License: DFSG free
Svn

TM-align is a computer algorithm for protein structure alignment using dynamic programming. The scoring is performed by the TM-score rotation matrix. This is similar to the RMSD in that unaligned portions of the structure influence the scoring less than the more structurally conserved regions.

Tophat
fast splice junction mapper for RNA-Seq reads
Versions of package tophat
ReleaseVersionArchitectures
jessie2.0.8-1amd64,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x
sid2.0.8-1i386
sid2.0.8b-1amd64,armhf,hurd-i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x
Popcon: 3 users (2 upd.)*
Versions and Archs
License: DFSG free
Git

TopHat aligns RNA-Seq reads to mammalian-sized genomes using the ultra high-throughput short read aligner Bowtie, and then analyzes the mapping results to identify splice junctions between exons. TopHat is a collaborative effort between the University of Maryland Center for Bioinformatics and Computational Biology and the University of California, Berkeley Departments of Mathematics and Molecular and Cell Biology.

The package is enhanced by the following packages: cufflinks
Please cite: Cole Trapnell, Lior Pachter and Steven L. Salzberg: TopHat: discovering splice junctions with RNA-Seq. (PubMed,eprint) Bioinformatics 25(9):1105-1111 (2009)
Transtermhp
find rho-independent transcription terminators in bacterial genomes
Versions of package transtermhp
ReleaseVersionArchitectures
wheezy2.09-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.09-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.09-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 6 users (6 upd.)*
Versions and Archs
License: DFSG free
Svn

TransTermHP finds rho-independent transcription terminators in bacterial genomes. Each terminator found by the program is assigned a confidence value that estimates its probability of being a true terminator. TransTermHP is the successor of TransTerm which was using very different search and scoring algorithms.

Please cite: Carleton L Kingsford, Kunmi Ayanbule and Steven L Salzberg: Rapid, accurate, computational discovery of Rho-independent transcription terminators illuminates their relationship to DNA uptake. (PubMed,eprint) Genome Biology 8(2):R22 (2007)
Tree-ppuzzle
Parallelized reconstruction of phylogenetic trees by maximum likelihood
Versions of package tree-ppuzzle
ReleaseVersionArchitectures
squeeze5.2-5amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy5.2-7amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,powerpc,sparc
jessie5.2-7amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,powerpc,sparc
sid5.2-7amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,powerpc,sparc
Debtags of package tree-ppuzzle:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
useanalysing, comparing
works-with-formatplaintext
Popcon: 3 users (2 upd.)*
Versions and Archs
License: DFSG free
Svn

TREE-PUZZLE (the new name for PUZZLE) is an interactive console program that implements a fast tree search algorithm, quartet puzzling, that allows analysis of large data sets and automatically assigns estimations of support to each internal branch. TREE-PUZZLE also computes pairwise maximum likelihood distances as well as branch lengths for user specified trees. Branch lengths can also be calculated under the clock-assumption. In addition, TREE-PUZZLE offers a novel method, likelihood mapping, to investigate the support of a hypothesized internal branch without computing an overall tree and to visualize the phylogenetic content of a sequence alignment.

This is the parallelized version of tree-puzzle.

Please cite: Heiko A. Schmidt, Korbinian Strimmer, Martin Vingron and Arndt von Haeseler: TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. (PubMed,eprint) Bioinformatics 18(3):502-504 (2002)
Tree-puzzle
Reconstruction of phylogenetic trees by maximum likelihood
Versions of package tree-puzzle
ReleaseVersionArchitectures
squeeze5.2-5amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy5.2-7amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,powerpc,sparc
jessie5.2-7amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,powerpc,sparc
sid5.2-7amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,powerpc,sparc
Debtags of package tree-puzzle:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
useanalysing, comparing
works-with-formatplaintext
Popcon: 14 users (17 upd.)*
Versions and Archs
License: DFSG free
Svn

TREE-PUZZLE (the new name for PUZZLE) is an interactive console program that implements a fast tree search algorithm, quartet puzzling, that allows analysis of large data sets and automatically assigns estimations of support to each internal branch. TREE-PUZZLE also computes pairwise maximum likelihood distances as well as branch lengths for user specified trees. Branch lengths can also be calculated under the clock-assumption. In addition, TREE-PUZZLE offers a novel method, likelihood mapping, to investigate the support of a hypothesized internal branch without computing an overall tree and to visualize the phylogenetic content of a sequence alignment.

Please cite: Heiko A. Schmidt, Korbinian Strimmer, Martin Vingron and Arndt von Haeseler: TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. (PubMed,eprint) Bioinformatics 18(3):502-504 (2002)
Treeviewx
Displays and prints phylogenetic trees
Versions of package treeviewx
ReleaseVersionArchitectures
squeeze0.5.1-7amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy0.5.1+20100823-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie0.5.1+20100823-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.5.1+20100823-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package treeviewx:
fieldbiology, biology:bioinformatics
interfacex11
roleprogram
scopeutility
uitoolkitwxwidgets
useviewing
works-with-formatpdf, plaintext, postscript, svg
x11application
Popcon: 104 users (9 upd.)*
Versions and Archs
License: DFSG free
Svn

TreeView X is an open source and multi-platform program to display phylogenetic trees. It can read and display NEXUS and Newick format tree files (such as those output by PAUP*, ClustalX, TREE-PUZZLE, and other programs). It allows one to order the branches of the trees, and to export the trees in SVG format.

Please cite: Page, Roderic D. M.: TreeView: an application to display phylogenetic trees on personal computers. (PubMed) Comput. Appl. Biosci. 12(4):357-8 (1996)
Screenshots of package treeviewx
Uc-echo
error correction algorithm designed for short-reads from NGS
Versions of package uc-echo
ReleaseVersionArchitectures
jessie1.12-1amd64,i386,kfreebsd-amd64,powerpc,s390,s390x,sparc
sid1.12-1amd64,i386,kfreebsd-amd64,powerpc,s390,s390x,sparc
Popcon: 1 users (0 upd.)*
Versions and Archs
License: DFSG free
Svn

ECHO is an error correction algorithm designed for short-reads from next-generation sequencing platforms such as Illumina's Genome Analyzer II. The algorithm uses a Bayesian framework to improve the quality of the reads in a given data set by employing maximum a posteriori estimation.

Vcftools
Collection of tools to work with VCF files
Versions of package vcftools
ReleaseVersionArchitectures
wheezy0.1.9-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie0.1.10+dfsg-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid0.1.10+dfsg-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package vcftools:
roleprogram
Popcon: 13 users (21 upd.)*
Versions and Archs
License: DFSG free
Svn

VCFtools is a program package designed for working with VCF files, such as those generated by the 1000 Genomes Project. The aim of VCFtools is to provide methods for working with VCF files: validating, merging, comparing and calculate some basic population genetic statistics.

Please cite: Petr Danecek, Adam Auton, Goncalo Abecasis, Cornelis A. Albers, Eric Banks, Mark A. DePristo, Robert E. Handsaker, Gerton Lunter, Gabor T. Marth, Stephen T. Sherry, Gilean McVean and Richard Durbin: The variant call format and VCFtools. (PubMed,eprint) Bioinformatics 27(15):2156-8 (2011)
Velvet
Nucleic acid sequence assembler for very short reads
Versions of package velvet
ReleaseVersionArchitectures
squeeze1.0.02~nozlibcopy-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy1.2.03~nozlibcopy-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.2.03~nozlibcopy-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.2.03~nozlibcopy-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream1.2.09
Debtags of package velvet:
biologynuceleic-acids
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
useanalysing
Popcon: 15 users (11 upd.)*
Newer upstream!
License: DFSG free
Svn

Velvet is a de novo genomic assembler specially designed for short read sequencing technologies, such as Solexa or 454, developed by Daniel Zerbino and Ewan Birney at the European Bioinformatics Institute (EMBL-EBI), near Cambridge, in the United Kingdom.

Velvet currently takes in short read sequences, removes errors then produces high quality unique contigs. It then uses paired read information, if available, to retrieve the repeated areas between contigs.

Please cite: Daniel R. Zerbino and Ewan Birney: Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. (PubMed,eprint) Genome Research 18(5):821-829 (2008)
Wise
comparison of biopolymers, commonly DNA and protein sequences
Versions of package wise
ReleaseVersionArchitectures
squeeze2.4.1-8amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
wheezy2.4.1-10amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.4.1-12amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.4.1-12amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package wise:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
usecomparing
Popcon: 38 users (61 upd.)*
Versions and Archs
License: DFSG free
Svn

Wise2 is a package focused on comparisons of biopolymers, commonly DNA and protein sequences. There are many other packages which do this, probably the best known being BLAST package (from NCBI) and the Fasta package (from Bill Pearson). There are other packages, such as the HMMER package (Sean Eddy) or SAM package (UC Santa Cruz) focused on hidden Markov models (HMMs) of biopolymers.

Wise2's particular forte is the comparison of DNA sequence at the level of its protein translation. This comparison allows the simultaneous prediction of say gene structure with homology based alignment.

Wise2 also contains other algorithms, such as the venerable Smith-Waterman algorithm, or more modern ones such as Stephen Altschul's generalised gap penalties, or even experimental ones developed in house, such as dba. The development of these algorithms is due to the ease of developing such algorithms in the environment used by Wise2.

Wise2 has also been written with an eye for reuse and maintainability. Although it is a pure C package you can access its functionality directly in Perl. Parts of the package (or the entire package) can be used by other C or C++ programs without namespace clashes as all externally linked variables have the unique identifier Wise2 prepended.

Please cite: Ewan Birney, Michele Clamp and Richard Durbin: GeneWise and Genomewise. (PubMed,eprint) Genome Research 14(5):988-95 (2004)
Zalign
parallel local alignment of biological sequences
Versions of package zalign
ReleaseVersionArchitectures
jessie0.9.1-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,powerpc,sparc
sid0.9.1-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,powerpc,sparc
Popcon: 3 users (1 upd.)*
Versions and Archs
License: DFSG free
Svn

zAlign is a local sequence aligner, especially intended for use with large biological DNA sequences, with more than 1Mbp (Millions of base pairs). It uses the Smith-Waterman exact algorithm with affine gap cost function to perform this task.

zAlign can be used both in distributed (clusters, for example) or standalone environments. Currently it has been tested on Linux and Sun Solaris, using both the MPICH (http://www.mcs.anl.gov/research/projects/mpi/mpich1/) and OpenMPI (http://www.open-mpi.org/) implementations. Ports for other Unix-like environments are highly considered.

Please cite: Rodolfo Bezerra Batista, Azzedine Boukerche and Alba Cristina Magalhaes Alves de Melo: A parallel strategy for biological sequence alignment in restricted memory space. Journal of Parallel and Distributed Computing 68(4):548-561 (2008)

Official Debian packages with lower relevance

Cain
simulations of chemical reactions
Versions of package cain
ReleaseVersionArchitectures
wheezy1.9-4all
jessie1.9-6all
sid1.9-6all
Popcon: 7 users (6 upd.)*
Versions and Archs
License: DFSG free
Svn

Cain performs stochastic and deterministic simulations of chemical reactions. It can spawn multiple simulation processes to utilize multi-core computers. It stores models, methods, and simulation output (populations and reaction counts) in an XML format. In addition, SBML models can be imported and exported. The models and methods can be read from input files or edited within the program.

The GUI (Graphical User Interface) is written in Python and uses the wxPython toolkit. Most of the solvers are implemented as command line executables, written in C++, which are driven by Cain. This makes it easy to launch batch jobs. It also simplifies the process of adding new solvers. Cain offers a variety of solvers:

  • Gillespie's direct method.
  • Gillespie's first reaction method.
  • Gibson and Bruck's next reaction method.
  • Tau-leaping.
  • Hybrid direct/tau-leaping.
  • ODE integration.

This package provides the architecture independent files for cain

Please cite: Sean Mauch and Mark Stalzer: Efficient Formulations for Exact Stochastic Simulation of Chemical Systems. IEEE/ACM Trans. Comput. Biol. Bioinformatics 8(1):27-35 (2011)
Circos
plotter for visualizing data
Versions of package circos
ReleaseVersionArchitectures
wheezy0.61-3all
jessie0.64-1all
sid0.64-1all
Debtags of package circos:
fieldbiology:bioinformatics
roleprogram
useviewing
Popcon: 5 users (24 upd.)*
Versions and Archs
License: DFSG free
Svn

Circos visualizes data in a circular layout — ideal for exploring relationships between objects or positions, and creating highly informative publication-quality graphics.

This package provides the Circos plotting engine, which is command-line driven (like gnuplot) and fully scriptable.

Please cite: Martin I Krzywinski, Jacqueline E Schein, Inanc Birol, Joseph Connors, Randy Gascoyne, Doug Horsman, Steven J Jones and Marco A Marra: Circos: An information aesthetic for comparative genomics. (PubMed,eprint) Genome Research 19(9):1639-45 (2009)
Emboss-explorer
web-based GUI to EMBOSS
Versions of package emboss-explorer
ReleaseVersionArchitectures
squeeze2.2.0-7all
wheezy2.2.0-7all
jessie2.2.0-7all
sid2.2.0-7all
Debtags of package emboss-explorer:
biologyemboss
fieldbiology, biology:bioinformatics
interfaceweb
roleplugin
webapplication, cgi
Popcon: 2 users (1 upd.)*
Versions and Archs
License: DFSG free
Svn

EMBOSS explorer is a web-based graphical user interface to the EMBOSS suite of bioinformatics tools. It is written in Perl.

If you use the Apache HTTP server, you will at most have to restart it before using EMBOSS explorer. For other web servers, you will have to do the configuration by yourself.

Maude
high-performance logical framework
Versions of package maude
ReleaseVersionArchitectures
wheezy2.6-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie2.6-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid2.6-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package maude:
uitoolkitncurses
Popcon: 4 users (1 upd.)*
Versions and Archs
License: DFSG free
Svn

Maude is a high-performance reflective language and system supporting both equational and rewriting logic specification and programming for a wide range of applications. Maude has been influenced in important ways by the OBJ3 language, which can be regarded as an equational logic sublanguage. Besides supporting equational specification and programming, Maude also supports rewriting logic computation.

Rewriting logic is a logic of concurrent change that can naturally deal with state and with concurrent computations. It has good properties as a general semantic framework for giving executable semantics to a wide range of languages and models of concurrency. In particular, it supports very well concurrent object-oriented computation. The same reasons making rewriting logic a good semantic framework make it also a good logical framework, that is, a metalogic in which many other logics can be naturally represented and executed.

Maude supports in a systematic and efficient way logical reflection. This makes Maude remarkably extensible and powerful, supports an extensible algebra of module composition operations, and allows many advanced metaprogramming and metalanguage applications. Indeed, some of the most interesting applications of Maude are metalanguage applications, in which Maude is used to create executable environments for different logics, theorem provers, languages, and models of computation.

Maude is of interest to the biomedical community for modeling and analysis of biological systems.

Please cite: M. Matsumoto and T. Nishimura: Mersenne Twister: A 623-Dimensionally Equidistributed Uniform Pseudo-Random Number Generator. ACM Transactions on Modeling and Computer Simulation 8(1):3-30 (1998)
Melting-gui
graphical interface to compute the melting temperature of nucleic acid duplex
Versions of package melting-gui
ReleaseVersionArchitectures
squeeze4.3c-1all
wheezy4.3c-2all
jessie4.3c-2all
sid4.3c-2all
upstream4.31
Debtags of package melting-gui:
fieldbiology, biology:molecular
interfacex11
roleprogram
scopeutility
uitoolkittk
useanalysing
x11application
Popcon: 4 users (1 upd.)*
Newer upstream!
License: DFSG free
Svn

This package provides a graphical user interface for the “melting” program, which computes for a nucleic acid duplex the enthalpy, the entropy and the melting temperature of the helix-coil transitions.

Please cite: Le Novère, Nicolas: MELTING, computing the melting temperature of nucleic acid duplex. (PubMed,eprint) Bioinformatics 17(12):1226-1227 (2001)
Other screenshots of package melting-gui
VersionURL
4.2h-3http://screenshots.debian.net/screenshots/m/melting-gui/2032_large.png
Screenshots of package melting-gui
Mobyle
Web portal that provides web forms for command-line software
Versions of package mobyle
ReleaseVersionArchitectures
wheezy1.0.6~dfsg-1all
jessie1.5.0+dfsg-1all
sid1.5.0+dfsg-1all
sid1.5.0+dfsg-2all
Debtags of package mobyle:
develweb
fieldbiology, biology:bioinformatics
interfaceweb
roleprogram
webapplication, cgi
Popcon: 0 users (2 upd.)*
Versions and Archs
License: DFSG free
Svn

Mobyle is a framework and web portal specifically aimed at the integration of bioinformatics software and databanks. It generates a web interface from an xml description for command-line based tools. It can submit jobs locally or using DRM systems.

This package will restart Apache after enabling the ‘rewrite’ and ‘headers’ modules.

Please cite: Bertrand Neron, Herve Menager, Corinne Maufrais, Nicolas Joly, Julien Maupetit, Sebastien Letort, Sebastien Carrere, Pierre Tuffery and Catherine Letondal: Mobyle: a new full web bioinformatics framework. (PubMed,eprint) Bioinformatics 25(22):3005-11 (2009)
Screenshots of package mobyle
Mozilla-biofox
extension of bioinformatics tools to Iceape and Iceweasel browsers
Versions of package mozilla-biofox
ReleaseVersionArchitectures
squeeze1.1.5-1all
wheezy1.6-1all
jessie1.6-1all
sid1.6-1all
Popcon: 12 users (1 upd.)*
Versions and Archs
License: DFSG free
Svn

Code bioFOX aims at implementing various bioinformatics tools as an extension on the Iceape and Iceweasel browsers. Analysis of your favorite gene(s) usually require(s) retrieving it from a database like NCBI or Swiss-Prot and then performing one or more tasks including but not limited to:

  • Translation of a nucleotide sequence;
  • Blast search (eg. blastn, blastp etc.) of the desired nucleotide/protein sequence;
  • Calculation of properties (like PI, charge, molecular weight, AT/GC content etc.) of a protein/nucleotide sequence;
  • Conversion between formats (Genbank, Fasta, Swiss-Prot etc.);
  • Prediction of sequence for sub-cellular localization (PREDOTAR, TargetP, pSORT etc).
Pdb2pqr
Preparation of protein structures for electrostatics calculations
Maintainer: Debian Med (Manuel Prinz)
Versions of package pdb2pqr
ReleaseVersionArchitectures
wheezy1.8-1amd64,armel,armhf,i386,ia64,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.8-1amd64,armel,armhf,i386,ia64,mips,mipsel,powerpc,s390,s390x,sparc
sid1.8-1amd64,armel,armhf,i386,ia64,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 12 users (17 upd.)*
Versions and Archs
License: DFSG free
Svn

PDB2PQR is a Python software package that automates many of the common tasks of preparing structures for continuum electrostatics calculations. It thus provides a platform-independent utility for converting protein files in PDB format to PQR format. These tasks include:

  • Adding a limited number of missing heavy atoms to biomolecular structures
  • Determining side-chain pKas
  • Placing missing hydrogens
  • Optimizing the protein for favorable hydrogen bonding
  • Assigning charge and radius parameters from a variety of force fields

This package also includes PropKa, a tool to modify the protonation state of protein structures in the Protein Data Bank (PDB) format to match a given pKa value. It can also be used to refine NMR structures, which often yield inaccurate pKa values for some residues.

Please cite: Todd J Dolinsky, Paul Czodrowski, Hui Li, Jens E Nielsen, Jan H Jensen, Gerhard Klebe and Nathan A Baker: PDB2PQR: Expanding and upgrading automated preparation of biomolecular structures for molecular simulations. (PubMed,eprint) Nucleic Acids Research 35:W522-5 (2007)
Python-rdkit
Collection of cheminformatics and machine-learning software
Maintainer: Debichem Team
Versions of package python-rdkit
ReleaseVersionArchitectures
wheezy201203-3amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie201212-1amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid201303-1amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 4 users (1 upd.)*
Versions and Archs
License: DFSG free
Svn

RDKit is a Python/C++ based cheminformatics and machine-learning software environment. Features Include:

  • Chemical reaction handling and transforms
  • Substructure searching with SMARTS
  • Canonical SMILES
  • Molecule-molecule alignment
  • Large number of descriptors
  • Fragmentation using RECAP rules
  • 2D coordinate generation and depiction
  • 3D coordinate generation using geometry embedding
  • UFF forcefield
  • Calculation of (R/S) stereochemistry codes
  • Pharmacophore searching
  • Calculation of shape similarity
  • Atom pairs and topological torsions fingerprints
  • Feature maps and feature-maps vectors
  • Machine-learning algorithms
  • Gasteiger-Marsili partial charge calculation

File formats RDKit supports include MDL Mol, SDF, TDT, SMILES and RDKit binary format.

R-cran-boolnet
tools for assembling, analyzing and visualizing Boolean networks
Versions of package r-cran-boolnet
ReleaseVersionArchitectures
jessie1.61-2amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.61-2amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Popcon: 0 users (4 upd.)*
Versions and Archs
License: DFSG free
Svn

BoolNet is an R package that provides tools for assembling, analyzing and visu- alizing synchronous and asynchronous Boolean networks as well as probabilistic Boolean networks.

Please cite: Christoph Muessel, Martin Hopfensitz and Hans A. Kestler: BoolNet -- an R package for generation, reconstruction and analysis of Boolean networks. (eprint) Bioinformatics 26(6):1378-1380 (2010)
Seqan-apps
C++ library for the analysis of biological sequences
Versions of package seqan-apps
ReleaseVersionArchitectures
squeeze1.2-1amd64,armel,i386,ia64,mips,mipsel,powerpc,s390,sparc
wheezy1.3.1-1amd64,armhf,i386,ia64,mips,mipsel,powerpc,s390,s390x
jessie1.3.1-1amd64,armhf,i386,ia64,mips,mipsel,powerpc,s390,s390x
sid1.3.1-1amd64,armhf,i386,ia64,mips,mipsel,powerpc,s390,s390x
Debtags of package seqan-apps:
devellibrary
roledevel-lib
Popcon: 1 users (0 upd.)*
Versions and Archs
License: DFSG free
Svn

SeqAn is a C++ template library of efficient algorithms and data structures for the analysis of sequences with the focus on biological data. This library applies a unique generic design that guarantees high performance, generality, extensibility, and integration with other libraries. SeqAn is easy to use and simplifies the development of new software tools with a minimal loss of performance. This package contains the applications dfi, pair_align, micro_razers, seqan_tcoffee, seqcons, razers and tree_recon.

Please cite: Andreas Doring, David Weese, Tobias Rausch and Knut Reinert: SeqAn An efficient, generic C++ library for sequence analysis. (PubMed,eprint) BMC Bioinformatics 9(1):11 (2008)

Debian packages in contrib or non-free

Abyss
de novo, parallel, sequence assembler for short reads
Versions of package abyss
ReleaseVersionArchitectures
wheezy1.3.4-3 (non-free)amd64
jessie1.3.4-3 (non-free)amd64
sid1.3.4-3 (non-free)amd64
upstream1.3.5
Debtags of package abyss:
roleprogram
Popcon: 3 users (2 upd.)*
Newer upstream!
License: non-free
Git

ABySS is a de novo, parallel, sequence assembler that is designed for short reads. It may be used to assemble genome or transcriptome sequence data. Parallelization is achieved using MPI, OpenMP and pthread.

To cite your use of ABySS, please reference ABySS: A parallel assembler for short read sequence data. Simpson JT, Wong K, Jackman SD, Schein JE, Jones SJ, Birol I. Genome Research, 2009-June.

Please cite: Jared T. Simpson, Kim Wong, Shaun D. Jackman, Jacqueline E. Schein, Steven J.M. Jones and İnanç Birol: ABySS: A parallel assembler for short read sequence data. (PubMed,eprint) Genome Research 19(6):1117-23 (2009)
Arb
Integrated package for sequence database handling and analysis
Versions of package arb
ReleaseVersionArchitectures
jessie5.5-1 (non-free)amd64,i386
sid5.5-1 (non-free)amd64,i386
Popcon: 2 users (0 upd.)*
Versions and Archs
License: non-free
Svn

The ARB software is a graphically oriented package comprising various tools for sequence database handling and data analysis. A central database of processed (aligned) sequences and any type of additional data linked to the respective sequence entries is structured according to phylogeny or other user defined criteria.

The ARB project (latin, "arbor"=tree) is a joint initiative of the Lehrstuhl fuer Mikrobiologie http://www.mikro.biologie.tu-muenchen.de/ and the Lehrstuhl fuer Rechnertechnik und Rechnerorganisation http://wwwbode.informatik.tu-muenchen.de/ of the Technical University of Munich.

Please cite: Wolfgang Ludwig, Oliver Strunk, Ralf Westram, Lothar Richter, Harald Meier, Arno Buchner, Tina Lai, Susanne Steppi, Gangolf Jobb, Wolfram Forster, Igor Brettske, Stefan Gerber, Anton W Ginhart, Oliver Gross, Silke Grumann, Stefan Hermann, Ralf Jost, Andreas Konig, Thomas Liss, Ralph Lussmann, Michael May, Bjorn Nonhoff, Boris Reichel, Robert Strehlow, Alexandros Stamatakis, Norbert Stuckmann, Alexander Vilbig, Michael Lenke, Thomas Ludwig, Arndt Bode and Karl-Heinz Schleifer: ARB: a software environment for sequence data. (PubMed,eprint) Nucl. Acids Res. 32(4):1363-1371 (2004)
Autodocktools
GUI to help set up, launch and analyze AutoDock dockings
Versions of package autodocktools
ReleaseVersionArchitectures
wheezy1.5.6~rc3~cvs.20120206-1 (non-free)all
jessie1.5.6~rc3~cvs.20120206-1 (non-free)all
sid1.5.7~rc1~cvs.20130519-1 (non-free)all
Debtags of package autodocktools:
fieldbiology, biology:structural, chemistry
interfacex11
roleprogram
sciencemodelling
scopesuite
uitoolkittk
useanalysing, configuring, converting, simulating, viewing
works-with3dmodel
x11application
Popcon: 11 users (4 upd.)*
Versions and Archs
License: non-free
Svn

This package is part of the mgltools set of Python libraries which provide an infrastructure for the analysis of protein structures and their docking of chemical compounds.

AutoDock is a well established package for the automated screening of libraries of formal representation of chemical compounds that putatively bind to a particular protein at hand. This package provides a graphical user interface that is helping with the preparation of the protein for such analyses.

The tool AutoLigand, which may help to constrain the location of the binding pocket for yet undescribed proteins, is accompanying the main executable 'runAdt'.

Beast-mcmc
Bayesian MCMC phylogenetic inference
Versions of package beast-mcmc
ReleaseVersionArchitectures
sid1.6.2-3 (contrib)all
sid1.7.5-1 (contrib)all
upstream1.7.5
Popcon: 0 users (0 upd.)*
Newer upstream!
License: DFSG free, but needs non-free components
Svn

BEAST is a cross-platform program for Bayesian MCMC analysis of molecular sequences. It is entirely orientated towards rooted, time-measured phylogenies inferred using strict or relaxed molecular clock models. It can be used as a method of reconstructing phylogenies but is also a framework for testing evolutionary hypotheses without conditioning on a single tree topology. BEAST uses MCMC to average over tree space, so that each tree is weighted proportional to its posterior probability. Included is a simple to use user-interface program for setting up standard analyses and a suit of programs for analysing the results.

The package is enhanced by the following packages: beast-mcmc-examples beast-mcmc-doc
Blimps-utils
blocks database improved searcher
Versions of package blimps-utils
ReleaseVersionArchitectures
wheezy3.9-1 (non-free)amd64
jessie3.9-1 (non-free)amd64
sid3.9-1 (non-free)amd64
Popcon: 1 users (1 upd.)*
Versions and Archs
License: non-free
Svn

BLIMPS (BLocks IMProved Searcher) is a searching tool that scores a protein sequence against blocks or a block against sequences.

This package contains the binaries.

The package is enhanced by the following packages: blimps-examples
Clustalw-mpi
MPI-distributed global sequence alignment with ClustalW
Versions of package clustalw-mpi
ReleaseVersionArchitectures
squeeze0.15-1 (non-free)amd64,armel,i386,ia64,mips,mipsel,powerpc,s390,sparc
wheezy0.15-2 (non-free)amd64
jessie0.15-2 (non-free)amd64
sid0.15-2 (non-free)amd64
Debtags of package clustalw-mpi:
fieldbiology
interfacecommandline, text-mode
roleprogram
scopeutility
usecomparing
works-with-formatplaintext
Popcon: 3 users (3 upd.)*
Versions and Archs
License: non-free
Svn

ClustalW is a popular tool for multiple sequence alignment. The alignment is achieved via three steps: pairwise alignment, guide-tree generation and progressive alignment. ClustalW-MPI is an MPI implementation of ClustalW. Based on version 1.82 of the original ClustalW, both the pairwise and progressive alignments are parallelized with MPI, a popular message passing programming standard. The pairwise alignments can be easily parallelized since the many alignments are time independent on each other. However the progressive alignments are essentially not parallelizable because of the time dependencies between each alignment.

Here the recursive parallelism paradigm is applied to the linear space profile-profile alignment algorithm. This approach is more time efficient on computers with distributed memory architecture. Traditional approach that relies on precomputing the profile-profile score matrix has also been implemented. Results shown the latter is indeed more appropriate for shared memory multiprocessor computer.

ClustalX is suggested for its support for local realignments, seaview is a versatile editor of alignments.

The original ClustalW/ClustalX can be found at URL: http://www.clustal.org/download/pre-2/

Please cite: Kuo-Bin Li: ClustalW-MPI: ClustalW Analysis Using Distributed and Parallel Computing. (PubMed) Bioinformatics 19(12):1585-1586 (2003)
Cluster3
Reimplementation of the Eisen-clustering software
Versions of package cluster3
ReleaseVersionArchitectures
jessie1.50-1 (non-free)amd64
sid1.50-1 (non-free)amd64
Popcon: 1 users (2 upd.)*
Versions and Archs
License: non-free
Svn

The open source clustering software available here contains clustering routines that can be used to analyze gene expression data. Routines for hierarchical (pairwise simple, complete, average, and centroid linkage) clustering, k-means and k-medians clustering, and 2D self-organizing maps are included. The routines are available in the form of a C clustering library, an extension module to Python, a module to Perl, as well as an enhanced version of Cluster, which was originally developed by Michael Eisen of Berkeley Lab. The C clustering library and the associated extension module for Python was released under the Python license. The Perl module was released under the Artistic License. Cluster 3.0 is covered by the original Cluster/TreeView license.

This package only contains the command line and motif gui versions of Cluster 3.0.

Cufflinks
Transcript assembly, differential expression and regulation for RNA-Seq
Versions of package cufflinks
ReleaseVersionArchitectures
wheezy1.3.0-2 (non-free)amd64
jessie2.1.1-2 (non-free)amd64
sid2.1.1-2 (non-free)amd64
Debtags of package cufflinks:
fieldbiology
interfacecommandline
roleprogram
scopeutility
useanalysing
works-withbiological-sequence
Popcon: 4 users (3 upd.)*
Versions and Archs
License: non-free
Git

Cufflinks assembles transcripts, estimates their abundances, and tests for differential expression and regulation in RNA-Seq samples. It accepts aligned RNA-Seq reads and assembles the alignments into a parsimonious set of transcripts. Cufflinks then estimates the relative abundances of these transcripts based on how many reads support each one.

Embassy-phylip
EMBOSS conversions of the programs in the phylip package
Versions of package embassy-phylip
ReleaseVersionArchitectures
wheezy3.69+20110714-1 (non-free)amd64
jessie3.69+20110714-1 (non-free)amd64
sid3.69+20110714-1 (non-free)amd64
upstream3.69.650
Popcon: 3 users (0 upd.)*
Newer upstream!
License: non-free
Svn

This package is the adaptation of the PHYLIP package in which its programs can operate with the biological sequence formats and databases of the European Molecular Biology Open Software Suite (EMBOSS). The software packages adapted for EMBOSS are called EMBASSY.

PHYLIP (the PHYLogeny Inference Package) is a package of programs for inferring phylogenies (evolutionary trees). Methods that are available in the package include parsimony, distance matrix, and likelihood methods, including bootstrapping and consensus trees. Data types that can be handled include molecular sequences, gene frequencies, restriction sites and fragments, distance matrices, and discrete characters.

The EMBASSY PHYLIP programs all have the prefix "f" to distinguish them from the original programs and avoid namespace conflict.

Gmap
spliced and SNP-tolerant alignment for mRNA and short reads
Versions of package gmap
ReleaseVersionArchitectures
squeeze2010-07-21-1 (non-free)i386
wheezy2012-06-12-1 (non-free)amd64
jessie2012-06-12-1 (non-free)amd64
sid2012-06-12-1 (non-free)amd64
upstream2013-05-09
Debtags of package gmap:
fieldbiology, biology:bioinformatics, biology:structural
roleprogram
useanalysing
Popcon: 3 users (0 upd.)*
Newer upstream!
License: non-free
Git

This package contains the programs GMAP and GSNAP as well as utilities to manage genome databases in GMAP/GSNAP format. GMAP (Genomic Mapping and Alignment Program) is a tool for aligning EST, mRNA and cDNA sequences. GSNAP (Genomic Short-read Nucleotide Alignment Program) is a tool for aligning single-end and paired-end transcriptome reads. Both tools can use a database of known splice sites and identify novel splice sites. known single-nucleotide polymorphisms (SNPs). GSNAP can align bisulfite-treated DNA.

Please cite: Thomas D. Wu, Colin K. Watanabe: GMAP: a genomic mapping and alignment program for mRNA and EST sequences. (eprint) Bioinformatics 21(9):1859-1875 (2005)
Igv
??? missing short description for package igv :-(
Versions of package igv
ReleaseVersionArchitectures
squeeze1.5.14-1 (non-free)all
wheezy2.0.30-1 (non-free)all
jessie2.0.30-1 (non-free)all
sid2.0.30-1 (non-free)all
experimental2.3.2+dfsg-1 (non-free)all
upstream2.3.8
Debtags of package igv:
fieldbiology
interfacex11
networkclient
roleprogram
scopeutility
useviewing
works-withbiological-sequence
Popcon: 2 users (1 upd.)*
Newer upstream!
License: non-free
Git
??? Missing long description for package igv
Paml
Phylogenetic Analysis by Maximum Likelihood (PAML)
Versions of package paml
ReleaseVersionArchitectures
wheezy4.5-1 (non-free)amd64
jessie4.5-1 (non-free)amd64
sid4.5-1 (non-free)amd64
upstream4.7
Popcon: 3 users (0 upd.)*
Newer upstream!
License: non-free
Svn

PAML is a package of programs for phylogenetic analyses of DNA or protein sequences using maximum likelihood.

Please cite: Ziheng Yang: PAML 4: phylogenetic analysis by maximum likelihood. (PubMed,eprint) Molecular Biology and Evolution 24(8):1586-91 (2007)
Phylip
package of programs for inferring phylogenies
Versions of package phylip
ReleaseVersionArchitectures
squeeze3.69-1 (non-free)amd64,armel,i386,ia64,mips,mipsel,powerpc,s390,sparc
wheezy3.69-1 (non-free)amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie3.69-1 (non-free)amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid3.69-1 (non-free)amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream3.695
Debtags of package phylip:
fieldbiology, biology:bioinformatics
interfacecommandline
roleprogram
scopeutility
useanalysing, comparing
works-with-formatplaintext
Popcon: 42 users (16 upd.)*
Newer upstream!
License: non-free
Svn

The PHYLogeny Inference Package is a package of programs for inferring phylogenies (evolutionary trees) from sequences. Methods that are available in the package include parsimony, distance matrix, and likelihood methods, including bootstrapping and consensus trees. Data types that can be handled include molecular sequences, gene frequencies, restriction sites, distance matrices, and 0/1 discrete characters.

Raccoon
preparation of in silico drug screening projects
Maintainer: Steffen Moeller
Versions of package raccoon
ReleaseVersionArchitectures
wheezy1.0-1 (contrib)all
jessie1.0-1 (contrib)all
sid1.0-1 (contrib)all
upstream1.0b
Popcon: 1 users (0 upd.)*
Newer upstream!
License: DFSG free, but needs non-free components
Svn

The field of computational biology is all about modeling physiochemical entities. Structural biology is about how those entities look in 3D and behave. And we have a drug when we can change that behaviour in a way that we want it to be changed - and only that behaviour.

Some good part of the characterisation of promising drug-like compounds for their interaction with a larger protein can be done on computers. This package helps with getting collections of small ligands prepared to be fitted against a particular protein of known structure. That docking itself is then to be performed by autodock or autodock-vina.

Raccoon may not find the drug, but it may well find a lead to it. To have this package with Debian shall help smaller biochemistry labs and grants an opportunity for the general public to educate itself and/or actively join in to help the world .... just a bit.

Please cite: Sandro Cosconati, Stefano Forli, Alex L Perryman, Rodney Harris, David S Goodsell and Arthur J Olson: Virtual screening with AutoDock: theory and practice. (PubMed,eprint) Expert Opinion on Drug Discovery 5(6):597–607 (2010)
Seaview
Multiplatform interface for sequence alignment and phylogeny
Versions of package seaview
ReleaseVersionArchitectures
squeeze4.2.5-1 (non-free)amd64,i386
wheezy4.3.3-3 (non-free)amd64,armel,i386
jessie4.4.0-1 (non-free)amd64,armel,i386
sid4.4.0-1 (non-free)amd64,armel,i386
upstream4.4.1
Debtags of package seaview:
fieldbiology, biology:bioinformatics
interfacex11
networkclient
roleprogram
scopeutility
uitoolkitfltk
usecomparing, editing, printing, viewing
works-withbiological-sequence
works-with-formatplaintext
x11application
Popcon: 47 users (11 upd.)*
Newer upstream!
License: non-free
Svn

SeaView reads and writes various file formats (NEXUS, MSF, CLUSTAL, FASTA, PHYLIP, MASE, Newick) of DNA and protein sequences and of phylogenetic trees. Alignments can be manually edited. It drives the programs Muscle or Clustal Omega for multiple sequence alignment, and also allows one to use any external alignment algorithm able to read and write FASTA-formatted files. It computes phylogenetic trees by parsimony using PHYLIP's dnapars/protpars algorithm, by distance with NJ or BioNJ algorithms on a variety of evolutionary distances, or by maximum likelihood using the program PhyML 3.0. SeaView draws phylogenetic trees on screen or PostScript files, and allows one to download sequences from EMBL/GenBank/UniProt using the Internet.

The package is enhanced by the following packages: muscle
Please cite: Manolo Gouy, Stephane Guindon and Olivier Gascuel: SeaView version 4: a multiplatform graphical user interface for sequence alignment and phylogenetic tree building. (PubMed,eprint) Mol Biol Evol 27(2):221-224 (2010)
Seq-gen
simulate the evolution of nucleotide or amino acid sequences
Versions of package seq-gen
ReleaseVersionArchitectures
wheezy1.3.3-1 (non-free)amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
jessie1.3.3-1 (non-free)amd64,armel,armhf,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
sid1.3.3-1 (non-free)amd64,armel,armhf,hurd-i386,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
Debtags of package seq-gen:
roleprogram
Popcon: 4 users (0 upd.)*
Versions and Archs
License: non-free
Svn

Seq-Gen is a program that will simulate the evolution of nucleotide or amino acid sequences along a phylogeny, using common models of the substitution process. A range of models of molecular evolution are implemented including the general reversible model. State frequencies and other parameters of the model may be given and site-specific rate heterogeneity may also be incorporated in a number of ways. Any number of trees may be read in and the program will produce any number of data sets for each tree. Thus large sets of replicate simulations can be easily created. It has been designed to be a general purpose simulator that incorporates most of the commonly used (and computationally tractable) models of molecular sequence evolution.

Please cite: A. Rambaut and N. C. Grassly: Seq-Gen: An application for the Monte Carlo simulation of DNA sequence evolution along phylogenetic trees. (PubMed,eprint) Comput. Appl. Biosci. 13(3):235-238 (1997)
Sift
predicts if a substitution in a protein has a phenotypic effect
Versions of package sift
ReleaseVersionArchitectures
wheezy4.0.3b-3 (non-free)amd64
jessie4.0.3b-4 (non-free)amd64
sid4.0.3b-4 (non-free)amd64
Popcon: 2 users (3 upd.)*
Versions and Archs
License: non-free
Svn

SIFT is a sequence homology-based tool that sorts intolerant from tolerant amino acid substitutions and predicts whether an amino acid substitution in a protein will have a phenotypic effect. SIFT is based on the premise that protein evolution is correlated with protein function. Positions important for function should be conserved in an alignment of the protein family, whereas unimportant positions should appear diverse in an alignment.

Please cite: Prateek Kumar, Steven Henikoff and Pauline C Ng: Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. (PubMed,eprint) Nature Protocols 4(7):1073-81 (2009)
Spread-phy
analyze and visualize phylogeographic reconstructions
Versions of package spread-phy
ReleaseVersionArchitectures
jessie1.0.5+dfsg-1 (contrib)all
sid1.0.5+dfsg-1 (contrib)all
Popcon: 1 users (0 upd.)*
Versions and Archs
License: DFSG free, but needs non-free components
Svn

SPREAD is a user-friendly application to analyze and visualize phylogeographic reconstructions resulting from Bayesian inference of spatio-temporal diffusion.

There is a tutorial for SPREAD online at http://www.kuleuven.be/aidslab/phylogeography/tutorial/spread_tutorial.html

Originally this program is named "spread". However, there is just such a package inside Debian and thus a 'phy' for phylogeny was prepended.

Please cite: Filip Bielejec, Andrew Rambaut, Marc A. Suchard and Philippe Lemey: SPREAD: spatial phylogenetic reconstruction of evolutionary dynamics. (PubMed,eprint) Bioinformatics 27(20):2910-2912 (2011)
Treeview
Java re-implementation of Michael Eisen's TreeView
Versions of package treeview
ReleaseVersionArchitectures
jessie1.1.6.2+dfsg-1 (contrib)all
sid1.1.6.2+dfsg-1 (contrib)all
Popcon: 2 users (3 upd.)*
Versions and Archs
License: DFSG free, but needs non-free components
Svn

TreeView creates a matrix-like display of expression data, known as Eisen clustering. The original implementation was a Windows program named TreeView by Michael Eisen. This TreeView package, sometimes also referred to as jTreeView, was rewritten in Java under a free license.

Java TreeView is an extensible viewer for microarray data in PCL or CDT format.

Debian packages in experimental

Eigensoft
reduction of population bias for genetic analyses
Versions of package eigensoft
ReleaseVersionArchitectures
experimental4.2-1 (non-free)amd64
Popcon: 0 users (0 upd.)*
Versions and Archs
License: non-free
Svn

The EIGENSOFT package combines functionality from the group's population genetics methods (Patterson et al. 2006) and their EIGENSTRAT stratification method (Price et al. 2006). The EIGENSTRAT method uses principal components analysis to explicitly model ancestry differences between cases and controls along continuous axes of variation; the resulting correction is specific to a candidate marker's variation in frequency across ancestral populations, minimizing spurious associations while maximizing power to detect true associations. The EIGENSOFT package has a built-in plotting script and supports multiple file formats and quantitative phenotypes.

Snap
location of genes from DNA sequence with hidden markov model
Maintainer: Steffen Moeller
Versions of package snap
ReleaseVersionArchitectures
experimental2010-07-28-1amd64,armel,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,sparc
Popcon: 0 users (2 upd.)*
Versions and Archs
License: DFSG free
Svn

SNAP is a general purpose gene finding program suitable for both eukaryotic and prokaryotic genomes. SNAP is an acroynm for Semi-HMM-based Nucleic Acid Parser.

Please cite: Ian Korf: Gene finding in novel Genomes. (PubMed,eprint) BMC Bioinformatics 5:59 (2004)
Ugene
integrated bioinformatics toolkit
Versions of package ugene
ReleaseVersionArchitectures
experimental1.9.8+repack-1amd64,i386,ia64,kfreebsd-amd64,kfreebsd-i386,mips,mipsel,powerpc,s390,s390x,sparc
upstream1.11.5
Debtags of package ugene:
uitoolkitqt
Popcon: 1 users (0 upd.)*
Newer upstream!
License: DFSG free
Svn

Unipro UGENE is a cross-platform visual environment for DNA and protein sequence analysis. UGENE integrates the most important bioinformatics computational algorithms and provides an easy-to-use GUI for performing complex analysis of the genomic data. One of the main features of UGENE is a designer for custom bioinformatics workflows.

Screenshots of package ugene

Packaging has started and developers might try the packaging code in VCS

Amos-assembler
modular whole genome assembler
License: Artistic
Debian package not available
Svn
Version: 3.1.0-1

The AMOS consortium is committed to the development of open-source whole genome assembly software. The project acronym (AMOS) represents our primary goal - to produce A Modular, Open-Source whole genome assembler. Open-source so that everyone is welcome to contribute and help build outstanding assembly tools, and modular in nature so that new contributions can be easily inserted into an existing assembly pipeline. This modular design will foster the development of new assembly algorithms and allow the AMOS project to continually grow and improve in hopes of eventually becoming a widely accepted and deployed assembly infrastructure. In this sense, AMOS is both a design philosophy and a software system.

Anfo
Short Read Aligner/Mapper from MPG
License: free
Debian package not available
Svn
Version: 0.98-1ubuntu4

Anfo is a mapper in the spirit of Soap/Maq/Bowtie, but its implementation takes more after BLAST/BLAT. It's most useful for the alignment of sequencing reads where the DNA sequence is somehow modified (think ancient DNA or bisulphite treatment) and/or there is more divergence between sample and reference than what fast mappers will handle gracefully (say the reference genome is missing and a related species is used instead).

Apollo
genome annotation viewer and editor
License: Artistic
Debian package not available
Svn
Version: 1.11.6-1

Apollo is a genome annotation viewer and editor. It was developed as a collaboration between the Berkeley Drosophila Genome Project (part of the FlyBase consortium) and The Sanger Institute in Cambridge, UK. Apollo allows researchers to explore genomic annotations at many levels of detail, and to perform expert annotation curation, all in a graphical environment. It was used by the FlyBase biologists to construct the Release 3 annotations on the finished Drosophila melanogaster genome, and is also a primary vehicle for sharing these annotations with the community. The Generic Model Organism Database (GMOD) project, which aims to provide a complete ready-to-use toolkit for analyzing whole genomes, has adopted Apollo as its annotation workbench.

Axparafit - wnpp
optimized statistical analysis of host-parasite coevolution
License: GPL-2+
Debian package not available
Svn
Version: 1.0-1

AxParafit is a highly optimized version of Pierre Legendre's Parafit program for statistical analysis of host-parasite coevolution. AxParafit has been parallelized with MPI (Message Passing Interface) for compute clusters and was used to carry out the largest co-evolutionary analysis to date for the paper describing the software.

Axpcoords - wnpp
highly optimized and parallelized porting of pcoords
License: free
Debian package not available
Svn
Version: 1.0-1

AxPcoords is an highly optimized versions of Pierre Legendre's DistPCoA program for statistical analysis of host-parasite coevolution.

AxPcoords is a fast, LAPACK-based implementation of DistPCoA (see http://www.bio.umontreal.ca/Casgrain/en/labo/distpcoa.html) which is another program by Pierre Legendre, it conducts a principal coordinates analysis. This program is required for the pipeline that conducts a full host-parasite co-phylogenetic analysis in combination with AxParafit.

Bagpipe
genomewide LD mapping
License: GPL3+
Debian package not available
Svn
Version: 2012.02.15-1

Bagpipe is a program for performing genomewide linkage disequilibrium mapping of quantitative trait loci in populations whose genome structure can be accommodated in the HAPPY framework [Mott00]. This includes most diploid crosses where the founders of the individuals have known genotypes.

  • Bagpipe is a simplified and streamlined version of Bagphenotype that does not currently include resample model averaging (RMA) capabilities.
  • Bagpipe can help fit single locus regression models (with or without random effects) to marker intervals whose genetic ancestry is inferred using the HAPPY software.
  • Bagpipe cannot help you decide what is a sensible model to fit.
  • Bagpipe does not currently accommodate populations with significant population structure, except through the specification of simple random intercepts based on unpatterned covariance matrices.
  • Bagpipe is named after the Scottish wind instrument "the bagpipes" and after Bagphenotype, which in turn was a PIPEline for BAGging-based multiple QTL analysis of phenoTYPEs. Bagphenotype was in turn based on software written by Richard Mott and William Valdar to analyze heterogeneous stock mice in [Valdar06].
  • Bagpipe is experimental software, is provided free of charge subject to copyleft restrictions, and comes with no guarantees whatsoever.

[Mott00] Mott R, Talbot CJ, Turri MG, Collins AC, Flint, J (2000) A method for fine mapping quantitative trait loci in outbred animal stocks. Proceedings of the National Academy of Sciences of the United States of America, 97(23), 12649-54. [Valdar06] Valdar W, Solberg LC, Gaugier D, Burnett S, Klenerman P, Cookson WO, Taylor M, Rawlins JNP, Mott R, Flint J (2006) Genome-wide genetic association of complex traits in outbred mice. Nature Genetics 38(8):879-87. PMID:16832355

Bamtools - wnpp
C++ API for manipulating BAM (genome alignment) files
License: Expat License
Debian package not available
Git
Version: 2.2.3-1

BamTools facilitates research analysis and data management using BAM files. It copes with the enormous amount of data produced by current sequencing technologies that is typically stored in compressed, binary formats that are not easily handled by the text-based parsers commonly used in bioinformatics research.

BamTools provides both a C++ API for BAM file support as well as a command-line toolkit.

This is the bamtools command-line toolkit

Available bamtools commands: convert Converts between BAM and a number of other formats count Prints number of alignments in BAM file(s) coverage Prints coverage statistics from the input BAM file filter Filters BAM file(s) by user-specified criteria header Prints BAM header information index Generates index for BAM file merge Merge multiple BAM files into single file random Select random alignments from existing BAM file(s), intended more as

         a testing tool.
resolve  Resolves paired-end reads (marking the IsProperPair flag as needed)

revert Removes duplicate marks and restores original base qualities sort Sorts the BAM file according to some criteria split Splits a BAM file on user-specified property, creating a new BAM

         output file for each value found
stats    Prints some basic statistics from input BAM file(s)
Bigsdb - wnpp
Bacterial Isolate Genome Sequence Database
License: GPL2+
Debian package not available
Svn
Version: 1.5.0-1

The Bacterial Isolate Genome Sequence Database (BIGSdb) is a scalable, web-accessible database system designed to store and analyse linked phenotypic and genotypic information in a computationally efficient manner. Sequence data can range from single sequence reads to multiple contigs generated by whole genome sequencing technologies. The system incorporates the capacity to define and identify any number of loci and genetic variants at those loci within the stored nucleotide sequences. These loci can be further organised into schemes for isolate characterisation or for evolutionary or functional analyses.

Bitseq
Bayesian Inference of Transcripts from Sequencing Data
Responsible: Tim Booth (Tim Booth)
License: Artistic 2
Debian package not available
Svn
Version: 0.4.3-0ubuntu2

BitSeq is an application for inferring expression levels of individual transcripts from sequencing (RNA-Seq) data and estimating differential expression (DE) between conditions. An advantage of this approach is the ability to account for both technical uncertainty and intrinsic biological variance in order to avoid false DE calls. The technical contribution to the uncertainty comes both from finite read-depth and the possibly ambiguous mapping of reads to multiple transcripts.

Chimeraslayer
detects likely chimeras in PCR amplified DNA
License: BSD-like
Debian package not available
Svn
Version: 20101212-1ubuntu6

ChimeraSlayer is from the microbiomeutil package.

http://microbiomeutil.sourceforge.net/#A_CS

Cinema
multi-sequence alignment editor and viewer.
License: LGPL
Debian package not available
Svn
Version: 3.0.23-1

It has been designed to be as extensible as possible. Notes of this extensibility can be found in "EXTENDING_CINEMA", and the "cinema-module" sub-directory.

Cinema currently has limited support for various sequence formats, although its easy to add new ones. A large number of alignments in the appropriate format can be found as part of the align compendium at

Clonalorigin
inference of homologous recombination in bacteria using whole genome sequences
License: GPL-v3
Debian package not available
Svn
Version: 0.0.20101011svn-r29-1

Bacteria, unlike us, can reproduce on their own. They do however have mechanisms that transfer DNA between organisms, a process more formally known as recombination. The mechanisms by which recombination takes place have been studied extensively in the laboratory but much remains to be understood concerning how, when and where recombination takes place within natural populations of bacteria and how it helps them to adapt to new environments. ClonalOrigin performs a comparative analysis of the sequences of a sample of bacterial genomes in order to reconstruct the recombination events that have taken place in their ancestry.

Coot
model building program for macromolecular crystallography
License: free
Debian package not available
Git
Version: 0.7-1

Coot is a program for constructing atomic models of macromolecules from x-ray diffraction data. Coot displays electron density maps and molecular models and allows model manipulations such as idealization, refinement, manual rotation/translation, rigid-body fitting, ligand search, solvation, mutations, rotamers. Validation tools such as Ramachandran and geometry plots are available to the user. This package provides a Coot build with embedded Python support.

Crossbow
Genotyping from short reads using cloud computing
License: Artistic
Debian package not available
Svn
Version: 1.1.2-1

Crossbow is a scalable software pipeline for whole genome resequencing analysis. It combines Bowtie, an ultrafast and memory efficient short read aligner, and SoapSNP, an accurate genotyper, within Hadoop to distribute and accelerate the computation with many nodes. The pipeline can accurately analyze over 35x coverage of a human genome in one day on a 10-node local cluster, or in 3 hours for about $100 using a 40-node, 320-core cluster rented from Amazon's EC2 utility computing service.

Cytoscape - wnpp
visualizing molecular interaction networks
Responsible: Ivo Maintz (Ivo Maintz)
License: LGPL-2.1
Debian package not available
Svn
Version: 2.8.4-1

Cytoscape is an open source bioinformatics software platform for visualizing molecular interaction networks and biological pathways and integrating these networks with annotations, gene expression profiles and other state data. Although Cytoscape was originally designed for biological research, now it is a general platform for complex network analysis and visualization. Cytoscape core distribution provides a basic set of features for data integration and visualization.

Dazzle
Java-based DAS server
Responsible: Steffen Moeller (DMPT)
License: LGP-2+
Debian package not available
Svn
Version: 1.01-1

Dazzle is a general purpose server for the Distributed Annotation System (DAS) protocol. It is implemented as a Java servlet, using the BioJava APIs. Dazzle is a modular system which uses small "datasource" plugins to provide access to a range of databases. Several general-purpose plugins are included in the package, and it it straightforward to develop new plugins to connect to your own databases.

Information on DAS is available from http://www.biodas.org/

Ecell - wnpp
Concept and environment for constructing virtual cells on computers
License: GPL
Debian package not available
Svn
Version: 3.2.2-1

The E-Cell Project is an international research project aiming at developing necessary theoretical supports, technologies and software platforms to allow precise whole cell simulation.

The E-Cell System is an object-oriented software suite for modeling, simulation, and analysis of large scale complex systems such as biological cells, architected by Kouichi Takahashi and written by a team of developers.

The core part of the system, E-Cell Simulation Environment version 3, allows many components driven by multiple algorithms with different timescales to coexist.

E-Cell System consists of the following three major parts:

  • E-Cell Simulation Environment (or E-Cell SE)
  • E-Cell Modeling Environment (or E-Cell ME)
  • E-Cell Analysis Toolkit

This package contains all these parts, only the documentation is distributed separately.

Ensembl - wnpp
basic Ensembl genome browser
License: non-free
Svn
Version: 65-1

Ensembl is a joint project of the Sanger Center and the European Bioinformatics Institute, an outstation of the European Molecular Biology Laboratory, (EMBL-EBI) that are sharing a campus in Hinxton near Cambridge, UK. It presents the sequence data for the yet available complete genomes of many vertebrates and is helped by many sister-projects to cover also plants, invertebrates and bacteria.

This package provides a basic installation of Ensembl. It comprises a full copy of the public Ensembl website, minus Blast and SSAHA, and minus BioMart. It uses UniSearch instead of the engine used on the public site for searching by keyword. It connects directly to the public databases hosted by the EBI/Sanger.

This is meant as an easy way to get a basic Ensembl installation working on Debian. It can then be customised to local requirements.

Note that Ensembl has two odd dependencies: bioperl1.2.3 and libparallel-useragent-perl. Those are not required for routine browsing, but the bioperl1.2.3 library performs the parsing of BLAST outputs. Version 1.2.3 is in conflict with any other existing bioperl installation and forces you to effectively downgrade.

libwww-perl5.808 will conflict with the latest libwww-perl installation and thus force a downgrade to 5.808, which will disable many other tools on your system. Therefore it is advisable NOT to install this package in parallel with any other software, and/or use a virtual machine or dedicated machine.

WARNING: Requires internet connection both to install and to run, as it connects to the Sanger/EBI database servers during both installation and at runtime.

Remark of Debian Med team: Ensembl was removed from Debian due #645487

Ensembl used to be in Debian experimental branch but was removed for formal reasons which are explained in http://bugs.debian.org/645487

Forester - wnpp
Graphical vizualiation tool Archaeopteryx
License: LGPL 2.1+
Debian package not available
Svn
Version: 0.971-1

Archaeopteryx is a software tool for the visualization, analysis, and editing of potentially large and highly annotated phylogenetic trees. It can be used both as applet (ArchaeopteryxA and ArchaeopteryxE) and as a standalone application.

Remark of Debian Med team: This package ships with BioLinux http://envgen.nox.ac.uk/biolinux.html
Gassst
Global alignment short sequence search tool
License: CeCILL
Debian package not available
Svn
Version: 1.28-2

GASSST finds global alignments of short DNA sequences against large DNA banks. Its strong point is its ability to perform fast gapped alignments. The program takes as inputs banks in FASTA format and finds best alignment.

Please cite: Guillaume Rizk and Dominique Lavanier: GASSST: global alignment short sequence search tool. Bioinformatics 20(36) (2010)
Grogui
graphical user interface for popular molecular dynamics package GROMACS
License: GPL-2+
Debian package not available
Svn
Version: 0.5.2-1

Gromacs GUI is a Qt4-based graphical user interface for widely used molecular dynamics package, GROMACS.

 1. File browsing and management with customizable right-click pop up menu.
 2. Graphical interfaces for GROMACS commands (currently 21 commands have their own interfaces).
 3. Plot drawing tool which can export plots to pdf.
 4. A simple built-in console.
 5. Built-in GROMACS manual viewer.
 6. Built-in file editor with syntax highlighting for some GROMACS file formats (currently only mdp format is supported).
 7. MDP Writer section to easily create your mdp files.
 8. File icons based on their types.
Haploview - wnpp
Analysis and visualization of LD and haplotype maps
License: MIT
Debian package not available
Svn
Version: 4.1-1

This tools assists in the analysis of the nucleotide variation in a population. Such investigations are performed to determine genes and genetic pathways that are associated with diseases. This is an early stage in the quest for new drugs.

Hawkeye
Interactive Visual Analytics Tool for Genome Assemblies
License: Artistic
Debian package not available
Svn
Version: 3.1.0-1

Genome assembly remains an inexact science. Even when accomplished with the best software available, the assembly of a genome often contains numerous errors, both small and large. Hawkeye is a visual analytics tool for genome assembly analysis and validation, designed to aid in identifying and correcting assembly errors. Hawkeye blends the best practices from information and scientific visualization to facilitate inspection of large-scale assembly data while minimizing the time needed to detect mis-assemblies and make accurate judgments of assembly quality.

All levels of the assembly data hierarchy are made accessible to users, along with summary statistics and common assembly metrics. A ranking component guides investigation towards likely mis-assemblies or interesting features to support the task at hand. Wherever possible, high-level overviews, dynamic filtering, and automated clustering are leveraged to focus attention and highlight anomalies in the data. Hawkeyes effectiveness has been proven on several genome projects, where it has been used both to improve quality and to validate the correctness of complex genomes.

Hawkeye is compatible with most widely used assemblers, including Phrap, ARACHNE, Celera Assembler, Newbler, AMOS, and assemblies deposited in the NCBI Assembly Archive.

Jbrowse
The JBrowse Genome Browser
License: GPL-3.0+
Debian package not available
Svn
Version: 1.2.1-1

JBrowse is a fast, modern genome browser written primarily in JavaScript. To see it in action, see our Demonstrations page.

JBrowse is tested to work with the following web browsers:

  • Mozilla Firefox
  • Google Chrome
  • Apple Safari
  • Microsoft Internet Explorer

Server-side code, which is used only to pre-generate static data files (no CGI), requires only BioPerl and a couple other CPAN modules. The result is a cross-platform AJAX genome browser that is easy to install, embed and customize.

Kempbasu
Significance tests for comparing digital gene expression profiles
License: GPL-3
Debian package not available
Svn
Version: 0.9.1-1

This package implements the significance tests for comparing digital gene profiles described in the article:

Varuzza et al. "Significance tests for comparing digital gene expression profiles"

They provide two programs: kemp for the frequentist test and basu for the Bayesian test, and some auxiliary scripts.

Lagan
highly parametrizable pairwise global alignment program
License: GPL-v3
Debian package not available
Svn
Version: 2.0-1

Lagan takes local alignments generated by CHAOS as anchors, and limits the search area of the Needleman-Wunsch algorithm around these anchors.

Multi-LAGAN is a generalization of the pairwise algorithm to multiple sequence alignment. M-LAGAN performs progressive pairwise alignments, guided by a user-specified phylogenetic tree. Alignments are aligned to other alignments using the sum-of-pairs metric.

Logomat-m
visualize central aspects of Profile Hidden Markov Models (pHMMs)
License: Artistic
Debian package not available
Svn
Version: 0.0.20091119-1

Profile Hidden Markov Models (pHMMs) are a widely used tool for protein family research. We present a method to visualize all of their central aspects graphically, thus generalizing the concept of sequence logos introduced by Schneider and Stephens. For each emitting state of the pHMM, we display a stack of letters. As for sequence logos, the stack height is determined by the deviation of the position's letter emission frequencies from the background frequencies of the letters. As a new feature, the stack width now visualizes both the probability of reaching the state (the hitting probability) and the expected number of letters the state emits during a pass through the model (the expected contribution).

Mach-haplotyper
Markov Chain based SNP haplotyper
License: non-free
Debian package not available
Svn
Version: 1.0.18-1

Recent advancements in chip-based DNA genotyping allow to infer DNA variants that are not part of the chip but known to be associated with a combination of SNPs that are measured.

Mage2tab - wnpp
MAGE-MLv1 converter and visualiser
License: CBIL-1.0
Debian package not available
Svn
Version: 0.9-1

This tool-kit is part of MR_T, a framework for import or export various of MAGE (MicroArray Gene Expression) documents (MAGE-MLv1, MAGE-TAB, SOFT, MINiML) from or into databases like GUS (the Genomics Unified Schema, www.gusdb.org).

This package provides the following programs:

 mage2tab     — MAGE-MLv1 to MAGE-TAB converter
 mage2graph   — GraphViz-based mage data visualisation tool
 mage-checker — Validation tool
Martj
distributed data integration system for biological data
Responsible: Steffen Moeller (DMPT)
License: LGPL-2+
Debian package not available