Summary
Biology
Debian Med bioinformatics packages
This metapackage will install Debian packages for use in molecular biology,
structural biology and other biological sciences.
Description
For a better overview of the project's availability as a Debian package, each head row has a color code according to this scheme:
If you discover a project which looks like a good candidate for Debian Med
to you, or if you have prepared an unofficial Debian package, please do not hesitate to
send a description of that project to the Debian Med mailing list
Links to other tasks
|
Debian Med Biology packages
Official Debian packages with high relevance
abacas
close gaps in genomic alignments from short reads
|
Versions of package abacas |
Release | Version | Architectures |
buster | 1.3.1-5 | all |
trixie | 1.3.1-9 | all |
jessie | 1.3.1-2 | all |
bookworm | 1.3.1-9 | all |
bullseye | 1.3.1-9 | all |
sid | 1.3.1-9 | all |
stretch | 1.3.1-3 | all |
Debtags of package abacas: |
role | program |
|
License: DFSG free
|
ABACAS (Algorithm Based Automatic Contiguation of Assembled Sequences)
intends to rapidly contiguate (align, order, orientate), visualize and
design primers to close gaps on shotgun assembled contigs based on a
reference sequence.
ABACAS uses MUMmer to find alignment positions and identify syntenies
of assembled contigs against the reference. The output is then processed
to generate a pseudomolecule taking overlapping contigs and gaps in to
account. ABACAS generates a comparison file that can be used to
visualize ordered and oriented contigs in ACT. Synteny is represented by
red bars where colour intensity decreases with lower values of percent
identity between comparable blocks. Information on contigs such as the
orientation, percent identity, coverage and overlap with other contigs
can also be visualized by loading the outputted feature file on ACT.
Topics: Probes and primers
|
|
abpoa
adaptive banded Partial Order Alignment
|
Versions of package abpoa |
Release | Version | Architectures |
bookworm | 1.4.1-3 | amd64,arm64,ppc64el |
sid | 1.5.3-1 | amd64,arm64,ppc64el |
trixie | 1.5.3-1 | amd64,arm64,ppc64el |
|
License: DFSG free
|
abPOA is an extended version of Partial Order Alignment (POA) that performs
adaptive banded dynamic programming (DP) with an SIMD implementation. abPOA
can perform multiple sequence alignment (MSA) on a set of input sequences and
generate a consensus sequence by applying the heaviest bundling algorithm to
the final alignment graph.
abPOA can generate high-quality consensus sequences from error-prone long
reads and offer significant speed improvement over existing tools.
abPOA supports three alignment modes (global, local, extension) and flexible
scoring schemes that allow linear, affine and convex gap penalties. It right
now supports SSE2/SSE4.1/AVX2 vectorization.
For more information please refer to the paper1 published in Bioinformatics.
|
|
abyss
de novo, parallel, sequence assembler for short reads
|
Versions of package abyss |
Release | Version | Architectures |
sid | 2.3.9-1 | amd64,arm64,mips64el,ppc64el,riscv64 |
stretch | 2.0.2-2 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 1.5.2-1 (non-free) | amd64 |
stretch-backports | 2.1.5-7~bpo9+1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 2.2.5+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 2.1.5-7 | amd64,arm64,armhf,i386 |
trixie | 2.3.9-1 | amd64,arm64,mips64el,ppc64el,riscv64 |
bookworm | 2.3.5+dfsg-2 | amd64,arm64,mips64el,ppc64el,s390x |
upstream | 2.3.10 |
Debtags of package abyss: |
role | program |
|
License: DFSG free
|
ABySS is a de novo, parallel, sequence assembler that is designed for
short reads. It may be used to assemble genome or transcriptome
sequence data. Parallelization is achieved using MPI, OpenMP and
pthread.
Please cite:
Shaun D. Jackman, Benjamin P. Vandervalk, Hamid Mohamadi, Justin Chu, Sarah Yeo, S. Austin Hammond, Golnaz Jahesh, Hamza Khan, Lauren Coombe, Rene L. Warren and İnanç Birol:
"ABySS 2.0: resource-efficient assembly of large genomes using a Bloom filter".
(PubMed,eprint)
Genome Research
27(5):768-777
(2017)
Topics: Sequence assembly
|
|
acedb-other
retrieval of DNA or protein sequences
|
Versions of package acedb-other |
Release | Version | Architectures |
bullseye | 4.9.39+dfsg.02-5 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 4.9.39+dfsg.02-7 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 4.9.39+dfsg.01-5 | amd64,armel,armhf,i386 |
sid | 4.9.39+dfsg.02-9 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 4.9.39+dfsg.02-4 | amd64,arm64,armhf,i386 |
stretch | 4.9.39+dfsg.02-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
Debtags of package acedb-other: |
biology | nuceleic-acids |
field | biology, biology:bioinformatics |
role | program |
scope | utility |
|
License: DFSG free
|
This package collects all those smallish applications that acedb collects
under its 'other' target of its Makefile.
efetch: presumably short for 'entry fetch' collects sequence information
from common DNA and protein databases.
|
|
adapterremoval
rapid adapter trimming, identification, and read merging of gene sequences
|
Versions of package adapterremoval |
Release | Version | Architectures |
bullseye | 2.3.1-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 2.2.0-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
sid | 2.3.3-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 2.3.3-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 2.3.3-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 2.2.3-1 | amd64,arm64,armhf,i386 |
upstream | 2.3.4 |
|
License: DFSG free
|
This program searches for and removes remnant adapter sequences from High-
Throughput Sequencing (HTS) data and (optionally) trims low quality
bases from the 3' end of reads following adapter removal. AdapterRemoval
can analyze both single end and paired end data, and can be used to
merge overlapping paired-ended reads into (longer) consensus sequences.
Additionally, the AdapterRemoval may be used to recover a consensus
adapter sequence for paired-ended data, for which this information is
not available.
The package is enhanced by the following packages:
multiqc
|
|
adun-core
|
Versions of package adun-core |
Release | Version | Architectures |
stretch | 0.81-9 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
trixie | 0.81-14 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 0.81-14 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 0.81-14 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 0.81-14 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 0.81-13 | amd64,arm64,armhf,i386 |
|
License: DFSG free
|
Adun is a biomolecular simulator that also includes data management and
analysis capabilities. It was developed at the Computational Biophysics
and Biochemistry Laboratory, a part of the Research Unit on Biomedical
Informatics of the UPF.
This package contains the AdunCore program and the Adun server. If
you want the graphical UI frontend, install the adun.app package.
|
|
aegean
integrated genome analysis toolkit
|
Versions of package aegean |
Release | Version | Architectures |
trixie | 0.16.0+dfsg-4 | amd64,arm64,armel,armhf,i386,ppc64el,s390x |
bullseye | 0.16.0+dfsg-2 | amd64,arm64,armel,armhf,i386,mipsel,ppc64el,s390x |
buster | 0.16.0+dfsg-1 | amd64,arm64,armhf,i386 |
stretch | 0.15.2+dfsg-1 | amd64,arm64,armel,armhf,i386,mipsel,ppc64el,s390x |
bookworm | 0.16.0+dfsg-2 | amd64,arm64,armel,armhf,i386,mipsel,ppc64el,s390x |
sid | 0.16.0+dfsg-4 | amd64,arm64,armel,armhf,i386,ppc64el,s390x |
|
License: DFSG free
|
The AEGeAn Toolkit is designed for the Analysis and Evaluation of Genome
Annotations. The toolkit includes a variety of analysis programs, e.g. for
comparing distinct sets of gene structure annotations (ParsEval), computation
of gene loci (LocusPocus) and more.
|
|
aevol
digital genetics model to run Evolution Experiments in silico
|
Versions of package aevol |
Release | Version | Architectures |
buster | 5.0-2 | amd64,arm64,armhf,i386 |
jessie | 4.4-1 | amd64,armel,armhf,i386 |
sid | 5.0+ds-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 5.0+ds-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 5.0+ds-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 5.0+ds-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 4.4-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
Aevol is a digital genetics model: populations of digital organisms are
subjected to a process of selection and variation, which creates a
Darwinian dynamics.
By modifying the characteristics of selection (e.g. population size,
type of environment, environmental variations) or variation (e.g.
mutation rates, chromosomal rearrangement rates, types of
rearrangements, horizontal transfer), one can study experimentally the
impact of these parameters on the structure of the evolved organisms.
In particular, since Aevol integrates a precise and realistic model of
the genome, it allows for the study of structural variations of the
genome (e.g. number of genes, synteny, proportion of coding sequences).
The simulation platform comes along with a set of tools for analysing
phylogenies and measuring many characteristics of the organisms and
populations along evolution.
|
|
alien-hunter
Interpolated Variable Order Motifs to identify horizontally acquired DNA
|
Versions of package alien-hunter |
Release | Version | Architectures |
bookworm | 1.7-10 | all |
jessie | 1.7-3 | all |
stretch | 1.7-5 | all |
buster | 1.7-7 | all |
bullseye | 1.7-8 | all |
trixie | 1.7-10 | all |
sid | 1.7-10 | all |
Debtags of package alien-hunter: |
field | biology, biology:structural |
role | program |
scope | utility |
use | analysing |
|
License: DFSG free
|
Alien_hunter is an application for the prediction of putative
Horizontal Gene Transfer (HGT) events with the implementation of
Interpolated Variable Order Motifs (IVOMs). An IVOM approach
exploits compositional biases using variable order motif distributions
and captures more reliably the local composition of a sequence compared
to fixed-order methods. Optionally the predictions can be parsed into a
2-state 2nd order Hidden Markov Model (HMM), in a change-point detection
framework, to optimize the localization of the boundaries of the
predicted regions. The predictions (embl format) can be automatically
loaded into Artemis genome viewer freely available at:
http://www.sanger.ac.uk/Software/Artemis/.
|
|
alter-sequence-alignment
genomic sequences ALignment Transformation EnviRonment
|
Versions of package alter-sequence-alignment |
Release | Version | Architectures |
bullseye | 1.3.4-4 | all |
stretch | 1.3.3+dfsg-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 1.3.4-2 | all |
trixie | 1.3.4-8 | all |
sid | 1.3.4-8 | all |
bookworm | 1.3.4-6 | all |
|
License: DFSG free
|
ALTER (ALignment Transformation EnviRonment) is a tool to transform
between multiple sequence alignment formats. ALTER focuses on the
specifications of mainstream alignment and analysis programs rather than
on the conversion among more or less specific formats.
|
|
altree
program to perform phylogeny-based association and localization analysis
|
Versions of package altree |
Release | Version | Architectures |
stretch | 1.3.1-4 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 1.3.1-2 | amd64,armel,armhf,i386 |
bullseye | 1.3.1-10 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.3.2-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.3.2-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 1.3.2-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 1.3.1-7 | amd64,arm64,armhf,i386 |
Debtags of package altree: |
field | biology, biology:bioinformatics |
interface | commandline |
role | program, shared-lib |
scope | utility |
use | analysing, comparing |
works-with-format | plaintext |
|
License: DFSG free
|
ALTree was designed to perform association detection and localization of
susceptibility sites using haplotype phylogenetic trees: first, it allows the
detection of an association between a candidate gene and a disease, and second,
it enables to make hypothesis about the susceptibility loci.
|
|
amap-align
Protein multiple alignment by sequence annealing
|
Versions of package amap-align |
Release | Version | Architectures |
trixie | 2.2+git20080214.600fc29+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 2.2-4 | amd64,armel,armhf,i386 |
stretch | 2.2-6 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 2.2+git20080214.600fc29+dfsg-1 | amd64,arm64,armhf,i386 |
bullseye | 2.2+git20080214.600fc29+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 2.2+git20080214.600fc29+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 2.2+git20080214.600fc29+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
Debtags of package amap-align: |
field | biology, biology:bioinformatics |
interface | commandline |
role | program |
scope | utility |
use | analysing, comparing |
works-with-format | plaintext |
|
License: DFSG free
|
AMAP is a command line tool to perform multiple alignment of peptidic
sequences. It utilizes posterior decoding, and a sequence-annealing
alignment, instead of the traditional progressive alignment method. It is
the only alignment program that allows one to control the sensitivity /
specificity tradeoff. It is based on the ProbCons source code, but
uses alignment metric accuracy and eliminates the consistency
transformation.
The Java visualisation tool of AMAP 2.2 is not yet packaged in Debian.
|
|
ampliconnoise
removal of noise from 454 sequenced PCR amplicons
|
Versions of package ampliconnoise |
Release | Version | Architectures |
bookworm | 1.29-10 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.29-9 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 1.29-2 | amd64,armel,armhf,i386 |
stretch | 1.29-6 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
sid | 1.29-13 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 1.29-8 | amd64,arm64,armhf,i386 |
Debtags of package ampliconnoise: |
role | program |
|
License: DFSG free
|
AmpliconNoise is a package of applications to clean up high-throughput
sequence data. It consists of three main parts:
Pyronoise - does flowgram-based clustering to spot misreads
SeqNoise - removes PCR point mutations
Perseus - removes PCR chimeras without the need for a set of reference
sequences
Previously there was a standalone "Pyronoise" by the same authors and
this package includes an updated version. There is also a "Denoiser"
in Qiime which is related but distinct.
|
|
andi
Efficient Estimation of Evolutionary Distances
|
Versions of package andi |
Release | Version | Architectures |
trixie | 0.14-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 0.10-2 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 0.12-4 | amd64,arm64,armhf,i386 |
bullseye | 0.13-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 0.14-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 0.14-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
This is the andi program for estimating the evolutionary distance
between closely related genomes. These distances can be used to rapidly
infer phylogenies for big sets of genomes. Because andi does not compute
full alignments, it is so efficient that it scales even up to thousands
of bacterial genomes.
|
|
anfo
Short Read Aligner/Mapper from MPG
|
Versions of package anfo |
Release | Version | Architectures |
bullseye | 0.98-8 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 0.98-7 | amd64,arm64,armhf,i386 |
stretch | 0.98-5 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 0.98-4 | amd64,armel,armhf,i386 |
sid | 0.98-9 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,s390x |
|
License: DFSG free
|
Anfo is a mapper in the spirit of Soap/Maq/Bowtie, but its implementation takes
more after BLAST/BLAT. It's most useful for the alignment of sequencing reads
where the DNA sequence is somehow modified (think ancient DNA or bisulphite
treatment) and/or there is more divergence between sample and reference than
what fast mappers will handle gracefully (say the reference genome is missing
and a related species is used instead).
|
|
any2fasta
convert various sequence formats to FASTA
|
Versions of package any2fasta |
Release | Version | Architectures |
trixie | 0.4.2-2 | all |
bookworm | 0.4.2-2 | all |
sid | 0.4.2-2 | all |
|
License: DFSG free
|
Established tools like readseq and seqret from EMBOSS, both create mangled
IDs containing | or . characters, and there is no way to fix this behaviour.
This resultes in inconsitences between .gbk and .fna versions of files in
pipelines.
This script uses only core Perl modules, has no other dependencies like
Bioperl or Biopython, and runs very quickly.
It supports the following input formats:
1. Genbank flat file, typically .gb, .gbk, .gbff (starts with LOCUS)
2. EMBL flat file, typically .embl, (starts with ID)
3. GFF with sequence, typically .gff, .gff3 (starts with ##gff)
4. FASTA DNA, typically .fasta, .fa, .fna, .ffn (starts with >)
5. FASTQ DNA, typically .fastq, .fq (starts with @)
6. CLUSTAL alignments, typically .clw, .clu (starts with CLUSTAL or MUSCLE)
7. STOCKHOLM alignments, typically .sth (starts with # STOCKHOLM)
8. GFA assembly graph, typically .gfa (starts with ^[A-Z]\t)
Files may be compressed with:
1. gzip, typically .gz
2. bzip2, typically .bz2
3. zip, typically .zip
|
|
aragorn
tRNA and tmRNA detection in nucleotide sequences
|
Versions of package aragorn |
Release | Version | Architectures |
buster | 1.2.38-2 | amd64,arm64,armhf,i386 |
sid | 1.2.41-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 1.2.36-4 | amd64,armel,armhf,i386 |
trixie | 1.2.41-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.2.38-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.2.38-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 1.2.38-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
The program employs heuristic algorithms to predict tRNA secondary structure,
based on homology with recognized tRNA consensus sequences and ability to form
a base-paired cloverleaf. tmRNA genes are identified using a modified version
of the BRUCE program.
Topics: Functional, regulatory and non-coding RNA
|
|
arden
specificity control for read alignments using an artificial reference
|
Versions of package arden |
Release | Version | Architectures |
buster | 1.0-4 | all |
stretch | 1.0-3 | all |
jessie | 1.0-1 | amd64,armel,armhf,i386 |
bookworm | 1.0-5 | all |
bullseye | 1.0-5 | all |
sid | 1.0-6 | all |
trixie | 1.0-6 | all |
|
License: DFSG free
|
ARDEN (Artificial Reference Driven Estimation of false positives in NGS
data) is a novel benchmark that estimates error rates based on real
experimental reads and an additionally generated artificial reference
genome. It allows the computation of error rates specifically for a
dataset and the construction of a ROC-curve. Thereby, it can be used to
optimize parameters for read mappers, to select read mappers for a
specific problem or also to filter alignments based on quality
estimation.
|
|
ariba
Antibiotic Resistance Identification By Assembly
|
Versions of package ariba |
Release | Version | Architectures |
bullseye | 2.14.6+ds-1 | amd64,arm64,mips64el,ppc64el |
bookworm | 2.14.6+ds-5 | amd64,arm64,mips64el,ppc64el |
sid | 2.14.7+ds-2 | amd64,arm64,mips64el,ppc64el,riscv64 |
buster | 2.13.3+ds-1 | amd64 |
stretch | 2.6.1+ds-1 | amd64 |
trixie | 2.14.7+ds-2 | amd64,arm64,mips64el,ppc64el,riscv64 |
stretch-backports | 2.13.3+ds-1~bpo9+1 | amd64 |
|
License: DFSG free
|
ARIBA is a tool that identifies antibiotic resistance genes by running local
assemblies.
The input is a FASTA file of reference genes and paired sequencing reads. ARIBA
reports which of the reference genes were found, plus detailed information on
the quality of the assemblies and any variants between the sequencing reads
and the reference genes.
|
|
art-nextgen-simulation-tools
simulation tools to generate synthetic next-generation sequencing reads
|
Versions of package art-nextgen-simulation-tools |
Release | Version | Architectures |
bullseye | 20160605+dfsg-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 20160605+dfsg-2 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
sid | 20160605+dfsg-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 20160605+dfsg-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 20160605+dfsg-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 20160605+dfsg-3 | amd64,arm64,armhf,i386 |
|
License: DFSG free
|
ART is a set of simulation tools to generate synthetic next-generation
sequencing reads. ART simulates sequencing reads by mimicking real
sequencing process with empirical error models or quality profiles
summarized from large recalibrated sequencing data. ART can also
simulate reads using user own read error model or quality profiles. ART
supports simulation of single-end, paired-end/mate-pair reads of three
major commercial next-generation sequencing platforms: Illumina's
Solexa, Roche's 454 and Applied Biosystems' SOLiD. ART can be used to
test or benchmark a variety of method or tools for next-generation
sequencing data analysis, including read alignment, de novo assembly,
SNP and structure variation discovery. ART was used as a primary tool
for the simulation study of the 1000 Genomes Project . ART is
implemented in C++ with optimized algorithms and is highly efficient in
read simulation. ART outputs reads in the FASTQ format, and alignments
in the ALN format. ART can also generate alignments in the SAM
alignment or UCSC BED file format. ART can be used together with genome
variants simulators (e.g. VarSim) for evaluating variant calling tools
or methods.
|
|
artemis
genome browser and annotation tool
|
Versions of package artemis |
Release | Version | Architectures |
stretch | 16.0.17+dfsg-1 | all |
trixie | 18.2.0+dfsg-4 | all |
sid | 18.2.0+dfsg-4 | all |
bookworm | 18.2.0+dfsg-3 | all |
bullseye | 18.1.0+dfsg-3 | amd64 |
buster | 17.0.1+dfsg-2 | amd64 |
|
License: DFSG free
|
Artemis is a genome browser and annotation tool that allows visualisation of
sequence features, next generation data and the results of analyses within the
context of the sequence, and also its six-frame translation.
This package includes the Artemis genome browser, the Artemis Comparison
Tool (ACT), and the DNAplotter and BamView utilities.
|
|
artfastqgenerator
outputs artificial FASTQ files derived from a reference genome
|
Versions of package artfastqgenerator |
Release | Version | Architectures |
buster | 0.0.20150519-3 | all |
stretch | 0.0.20150519-2 | all |
trixie | 0.0.20150519-5 | all |
bookworm | 0.0.20150519-4 | all |
sid | 0.0.20150519-5 | all |
bullseye | 0.0.20150519-4 | all |
|
License: DFSG free
|
ArtificialFastqGenerator takes the reference genome (in FASTA format) as
input and outputs artificial FASTQ files in the Sanger format. It can
accept Phred base quality scores from existing FASTQ files, and use them
to simulate sequencing errors. Since the artificial FASTQs are derived
from the reference genome, the reference genome provides a gold-standard
for calling variants (Single Nucleotide Polymorphisms (SNPs) and
insertions and deletions (indels)). This enables evaluation of a Next
Generation Sequencing (NGS) analysis pipeline which aligns reads to the
reference genome and then calls the variants.
|
|
assembly-stats
get assembly statistics from FASTA and FASTQ files
|
Versions of package assembly-stats |
Release | Version | Architectures |
sid | 1.0.1+ds-6 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 1.0.1+ds-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.0.1+ds-6 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.0.1+ds-6 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
Get statistics from a list of files.
Detection of FASTA or FASTQ format of each file is automatic from the
file contents, so file names and extensions are irrelevant.
The default output format is human readable. You can change the output
format and ignore sequences shorter than a given length.
|
|
assemblytics
detect and analyze structural variants from a genome assembly
|
Versions of package assemblytics |
Release | Version | Architectures |
sid | 1.2.1+dfsg-2 | all |
buster | 1.0+ds-1 | all |
bullseye | 1.0+ds-2 | all |
trixie | 1.2.1+dfsg-2 | all |
bookworm | 1.2.1+dfsg-1 | all |
|
License: DFSG free
|
Assemblytics incorporates a unique anchor filtering approach to increase
robustness to repetitive elements, and identifies six classes of variants
based on their distinct alignment signatures. Assemblytics can be applied
both to comparing aberrant genomes, such as human cancers, to a reference,
or to identify differences between related species.
|
|
atac
genome assembly-to-assembly comparison
|
Versions of package atac |
Release | Version | Architectures |
stretch | 0~20150903+r2013-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bookworm | 0~20150903+r2013-8 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 0~20150903+r2013-8 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 0~20150903+r2013-9 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 0~20150903+r2013-9 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 0~20150903+r2013-6 | amd64,arm64,armhf,i386 |
|
License: DFSG free
|
atac computes a one-to-one pairwise alignment of large DNA sequences. It first
finds the unique k-mers in each sequence, chains them to larger blocks, and
fills in spaces between blocks. It was written primarily to transfer
annotations between different assemblies of the human genome.
The output is a set of ungapped 'matches', and a set of gapped 'runs' formed
from the matches. Each match or run associates one sequence with the other
sequence. The association is 'unique', in that there is no other (sizeable)
associations for either sequence. Thus, large repeats and duplications are not
present in the output - they appear as unmapped regions.
Though the output is always pairwise, atac can cache intermediate results to
speed a comparisons of multiple sequences.
This package is part of the Kmer suite.
|
|
ataqv
ATAC-seq QC and visualization
|
Versions of package ataqv |
Release | Version | Architectures |
bookworm | 1.3.0+ds-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el |
bullseye | 1.2.1+ds-1 | amd64,arm64,armhf,i386,mips64el,mipsel,ppc64el |
sid | 1.3.1+ds-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64 |
trixie | 1.3.1+ds-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64 |
|
License: DFSG free
|
A toolkit for measuring and comparing ATAC-seq results, made in the
Parker lab at the University of Michigan. They wrote it to help
understand how well their ATAC-seq assays had worked, and to make it
easier to spot differences that might be caused by library prep or
sequencing.
|
|
atropos
NGS read trimming tool that is specific, sensitive, and speedy
|
Versions of package atropos |
Release | Version | Architectures |
bookworm | 1.1.31+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el |
bullseye | 1.1.29+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el |
trixie | 1.1.32+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64 |
sid | 1.1.32+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64 |
|
License: DFSG free
|
Atropos is tool for specific, sensitive, and speedy trimming of NGS
reads. It is a fork of the venerable Cutadapt read trimmer, with the
primary improvements being:
1. Multi-threading support, including an extremely fast "parallel
write" mode.
2. Implementation of a new insert alignment-based trimming algorithm
for paired-end reads that is substantially more sensitive and
specific than the original Cutadapt adapter alignment-based
algorithm. This algorithm can also correct mismatches between the
overlapping portions of the reads.
3. Options for trimming specific types of data (miRNA, bisulfite-seq).
4. A new command ('detect') that will detect adapter sequences and
other potential contaminants.
5. A new command ('error') that will estimate the sequencing error
rate, which helps to select the appropriate adapter- and quality-
trimming parameter values.
6. A new command ('qc') that generates read statistics similar to
FastQC. The trim command can also compute read statistics both
before and after trimming (using the '--stats' option).
7. Improved summary reports, including support for serialization
formats (JSON, YAML, pickle), support for user-defined templates
(via the optional Jinja2 dependency), and integration with MultiQC.
8. The ability to merge overlapping reads (this is experimental and
the functionality is limited).
9. The ability to write the summary report and log messages to
separate files.
10. The ability to read SAM/BAM files and read/write interleaved
FASTQ files.
11. Direct trimming of reads from an SRA accession.
12. A progress bar, and other minor usability enhancements.
|
|
augur
pipeline components for real-time virus analysis
|
Versions of package augur |
Release | Version | Architectures |
bullseye | 11.0.0-1 | all |
sid | 24.4.0-1 | all |
trixie | 24.4.0-1 | all |
bookworm | 20.0.0-1 | all |
buster-backports | 6.4.2-2~bpo10+1 | all |
upstream | 26.0.0 |
|
License: DFSG free
|
The nextstrain project is an attempt to make flexible informatic
pipelines and visualization tools to track ongoing pathogen evolution as
sequence data emerges. The nextstrain project derives from nextflu,
which was specific to influenza evolution.
nextstrain is comprised of three components:
- fauna: database and IO scripts for sequence and serological data
- augur: informatic pipelines to conduct inferences from raw data
- auspice: web app to visualize resulting inferences
|
|
augustus
gene prediction in eukaryotic genomes
|
Versions of package augustus |
Release | Version | Architectures |
bookworm | 3.5.0+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 3.4.0+dfsg2-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 3.2.3+dfsg-1 | amd64,arm64,armel,i386,mips64el,mipsel,ppc64el |
trixie | 3.5.0+dfsg-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 3.3.2+dfsg-2 | amd64,arm64,armhf |
sid | 3.5.0+dfsg-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
AUGUSTUS is a software for gene prediction in eukaryotic genomic sequences
that is based on a generalized hidden Markov model (HMM), a probabilistic
model of a sequence and its gene structure. After learning gene structures
from a reference annotation, AUGUSTUS uses the HMM to recognize genes in a new
sequence and annotates it with the regions of identified genes. External hints,
e.g. from RNA sequencing, EST or protein alignments etc. can be used to guide
and improve the gene finding process. The result is the set of most likely gene
structures that comply with all given user constraints, if such gene
structures exist.
AUGUSTUS already includes prebuilt HMMs for many species, as well as scripts
to train custom models using annotated genomes.
Topics: Gene transcripts; Gene and protein families
|
|
autodock
analysis of ligand binding to protein structure
|
Versions of package autodock |
Release | Version | Architectures |
stretch | 4.2.6-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 4.2.6-6 | amd64,arm64,armhf,i386 |
jessie | 4.2.6-2 | amd64,armel,armhf,i386 |
bullseye | 4.2.6-8 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 4.2.6-9 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 4.2.6-9 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 4.2.6-9 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
Debtags of package autodock: |
field | biology, biology:structural |
interface | commandline |
role | program |
scope | utility |
use | analysing |
works-with | 3dmodel |
|
License: DFSG free
|
AutoDock is a prime representative of the programs addressing the
simulation of the docking of fairly small chemical ligands to rather big
protein receptors. Earlier versions had all flexibility in the ligands
while the protein was kept rather ridgid. This latest version 4 also
allows for a flexibility of selected sidechains of surface residues,
i.e., takes the rotamers into account.
The AutoDock program performs the docking of the ligand to a set of
grids describing the target protein. AutoGrid pre-calculates these grids.
The package is enhanced by the following packages:
autogrid
|
|
autodock-vina
docking of small molecules to proteins
|
Versions of package autodock-vina |
Release | Version | Architectures |
jessie | 1.1.2-3 | amd64,armel,armhf,i386 |
bookworm | 1.2.3-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.2.5-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 1.1.2-6 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 1.2.5-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 1.1.2-5 | amd64,arm64,armhf,i386 |
stretch | 1.1.2-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
AutoDock Vina is a program to support drug discovery, molecular
docking and virtual screening of compound libraries. It offers
multi-core capability, high performance and enhanced accuracy
and ease of use.
The same institute also developed autodock, which is widely used.
O. Trott, A. J. Olson, AutoDock Vina: improving the speed and accuracy
of docking with a new scoring function, efficient optimization and
multithreading, Journal of Computational Chemistry 31 (2010) 455-461
|
|
autogrid
pre-calculate binding of ligands to their receptor
|
Versions of package autogrid |
Release | Version | Architectures |
trixie | 4.2.6-9 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 4.2.6-8 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 4.2.6-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
sid | 4.2.6-9 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 4.2.6-6 | amd64,arm64,armhf,i386 |
jessie | 4.2.6-2 | amd64,armel,armhf,i386 |
bookworm | 4.2.6-9 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package autogrid: |
field | biology, biology:structural |
interface | commandline |
role | program |
scope | utility |
use | analysing |
works-with | 3dmodel |
|
License: DFSG free
|
The AutoDockSuite addresses the molecular analysis of the docking of
a smaller chemical compounds to their receptors of known three-dimensional
structure.
The AutoGrid program performs pre-calculations for the docking of a
ligand to a set of grids that describe the effect that the protein has
on point charges. The effect of these forces on the ligand is then
analysed by the AutoDock program.
|
|
avogadro
Molecular Graphics and Modelling System
|
Versions of package avogadro |
Release | Version | Architectures |
bookworm | 1.97.0-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.93.0-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.99.0-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 1.99.0-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 1.2.0-4 | amd64,arm64,armhf,i386 |
stretch | 1.2.0-1+deb9u1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 1.0.3-10.1 | amd64,armel,armhf,i386 |
Debtags of package avogadro: |
field | chemistry |
role | program |
uitoolkit | qt |
use | viewing |
|
License: DFSG free
|
Avogadro is a molecular graphics and modelling system targeted at molecules
and biomolecules. It can visualize properties like molecular orbitals or
electrostatic potentials and features an intuitive molecular builder.
Features include:
- Molecular modeller with automatic force-field based geometry optimization
- Molecular Mechanics including constraints and conformer searches
- Visualization of molecular orbitals and general isosurfaces
- Visualization of vibrations and plotting of vibrational spectra
- Support for crystallographic unit cells
- Input generation for the Gaussian, GAMESS and MOLPRO quantum chemistry
packages
- Flexible plugin architecture and Python scripting
File formats Avogadro can read include PDB, XYZ, CML, CIF, Molden, as well as
Gaussian, GAMESS and MOLPRO output.
|
|
axe-demultiplexer
Trie-based DNA sequencing read demultiplexer
|
Versions of package axe-demultiplexer |
Release | Version | Architectures |
stretch | 0.3.2+dfsg1-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
sid | 0.3.3+dfsg-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 0.3.3+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 0.3.3+dfsg-1 | amd64,arm64,armhf,i386 |
bookworm | 0.3.3+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 0.3.3+dfsg-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
Axe very rapidly selects the optimal barcode present in a sequence read, even
in the presence of sequencing errors. The algorithm is able to handle
combinatorial barcoding, barcodes of differing length, and several mismatches
per barcode.
|
|
baitfisher
software package for designing hybrid enrichment probes
|
Versions of package baitfisher |
Release | Version | Architectures |
buster | 1.2.7+git20180107.e92dbf2+dfsg-1 | amd64,arm64,armhf,i386 |
sid | 1.2.7+git20211020.de26d5c+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.2.7+git20211020.de26d5c+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.2.7+git20190123.241d060+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.2.7+git20211020.de26d5c+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
The BaitFisher package consists of two programs: BaitFisher and BaitFilter.
BaitFisher was been designed to construct hybrid enrichment baits from
multiple sequence alignments (MSAs) or annotated features in MSAs. The main
goal of BaitFisher is to avoid redundancy in the construction of baits by
designing fewer baits in conserved regions of the MSAs and designing more baits
in variable regions. This makes use of the fact that hybrid enrichment baits
can differ to some extends from the target region, which they should capture
in the enrichment procedure.
By specifying the allowed distance between baits and the sequences in the MSAs
the user can control the allowed bait-to-target distance and the degree of
reduction in the number of baits that are designed.
See the BaitFisher paper for details.
BaitFilter was designed (i) to determine whether baits bind unspecifically to
a reference genome, (ii) to filter baits that only have partial length matches
to a reference genome, (iii) to determine the optimal bait region in a MSA and
to convert baits to a format that can be uploaded at a bait constructing
company. The optimal bait region can be the most conserved region in the MSA
or the region with the highest number of sequences without gaps or ambiguous
nucleotides.
Please cite:
Christoph Mayer, Manuela Sann, Alexander Donath, Martin Meixner, Lars Podsiadlowski, Ralph S. Peters, Malte Petersen, Karen Meusemann, Karsten Liere, Johann-Wolfgang Wägele, Bernhard Misof, Christoph Bleidorn, Michael Ohl and Oliver Niehuis:
BaitFisher: A Software Package for Multispecies Target DNA Enrichment Probe Design.
(PubMed,eprint)
Mol. Biol. Evol.
33(7):1875-1886
(2016)
|
|
bali-phy
Bayesian Inference of Alignment and Phylogeny
|
Versions of package bali-phy |
Release | Version | Architectures |
bullseye | 3.6.0+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 3.6.1+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 3.6.1+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 3.6.1+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 3.4+dfsg-1 | amd64,arm64,armhf,i386 |
experimental | 4.0~beta2+dfsg-1 | armhf,mips64el |
experimental | 4.0~beta13+dfsg-1 | amd64,arm64,i386,ppc64el,s390x |
|
License: DFSG free
|
BAli-Phy estimates multiple sequence alignments and evolutionary trees
from unaligned DNA, amino acid, or codon sequences. BAli-Phy uses MCMC
to estimate evolutionary trees, positive selection, and branch lengths
while averaging over alternative alignments. BAli-Phy can display
alignment ambiguity graphically in an alignment uncertainty (AU) plot.
BAli-Phy can also estimate phylogenies from a fixed alignment (like MrBayes
and BEAST) using substitution models like GTR+gamma. BAli-Phy automatically
estimates relative rates for each gene.
|
|
ballview
free molecular modeling and molecular graphics tool
|
Versions of package ballview |
Release | Version | Architectures |
bookworm | 1.5.0+git20180813.37fc53c-11 | amd64,arm64,i386,mips64el,mipsel,ppc64el,s390x |
buster | 1.5.0+git20180813.37fc53c-3 | amd64,arm64,i386 |
sid | 1.5.0+git20180813.37fc53c-11 | amd64,arm64,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 1.5.0+git20180813.37fc53c-6 | amd64,arm64,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 1.4.2+20140406-1 | amd64,armel,armhf,i386 |
stretch | 1.4.3~beta1-3 | amd64,arm64,armel,armhf,i386,mips,ppc64el,s390x |
upstream | 1.5.0+git20220524.d85d2dd |
Debtags of package ballview: |
interface | x11 |
role | program |
uitoolkit | qt |
x11 | application |
|
License: DFSG free
|
BALLView provides fast OpenGL-based visualization of molecular structures,
molecular mechanics methods (minimization, MD simulation using the
AMBER, CHARMM, and MMFF94 force fields), calculation and visualization
of electrostatic properties (FDPB) and molecular editing features.
BALLView can be considered a graphical user interface on the basis of
BALL (Biochemical Algorithms Library) with a focus on the most common
demands of protein chemists and biophysicists in particular. It is
developed in the groups of Hans-Peter Lenhof (Saarland University,
Saarbruecken, Germany) and Oliver Kohlbacher (University of Tuebingen,
Germany). BALL is an application framework in C++ that has been
specifically designed for rapid software development in Molecular
Modeling and Computational Molecular Biology. It provides an extensive
set of data structures as well as classes for Molecular Mechanics,
advanced solvation methods, comparison and analysis of protein
structures, file import/export, and visualization.
|
|
bamclipper
Remove gene-specific primer sequences from SAM/BAM alignments
|
Versions of package bamclipper |
Release | Version | Architectures |
bookworm | 1.0.0-3 | all |
sid | 1.0.0-3 | all |
trixie | 1.0.0-3 | all |
|
License: DFSG free
|
Remove gene-specific primer sequences from SAM/BAM alignments of PCR
amplicons by soft-clipping.
bamclipper.sh soft-clips gene-specific primers from BAM alignment file based
on genomic coordinates of primer pairs in BEDPE format.
|
|
bamkit
tools for common BAM file manipulations
|
Versions of package bamkit |
Release | Version | Architectures |
sid | 0.0.1+git20170413.ccd079d-3 | all |
trixie | 0.0.1+git20170413.ccd079d-3 | all |
bookworm | 0.0.1+git20170413.ccd079d-3 | all |
bullseye | 0.0.1+git20170413.ccd079d-2 | all |
|
License: DFSG free
|
This package provides some Python3 tools for common BAM file
manipulations.
|
|
bamtools
toolkit for manipulating BAM (genome alignment) files
|
Versions of package bamtools |
Release | Version | Architectures |
bookworm | 2.5.2+dfsg-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 2.3.0+dfsg-2 | amd64,armel,armhf,i386 |
buster | 2.5.1+dfsg-3 | amd64,arm64,armhf,i386 |
sid | 2.5.2+dfsg-6 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 2.5.1+dfsg-9 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 2.5.2+dfsg-6 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 2.4.1+dfsg-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
BamTools facilitates research analysis and data management using BAM
files. It copes with the enormous amount of data produced by current
sequencing technologies that is typically stored in compressed, binary
formats that are not easily handled by the text-based parsers commonly
used in bioinformatics research.
BamTools provides both a C++ API for BAM file support as well as a
command-line toolkit.
This is the bamtools command-line toolkit.
Available bamtools commands:
convert Converts between BAM and a number of other formats
count Prints number of alignments in BAM file(s)
coverage Prints coverage statistics from the input BAM file
filter Filters BAM file(s) by user-specified criteria
header Prints BAM header information
index Generates index for BAM file
merge Merge multiple BAM files into single file
random Select random alignments from existing BAM file(s), intended more
as a testing tool.
resolve Resolves paired-end reads (marking the IsProperPair flag as needed)
revert Removes duplicate marks and restores original base qualities
sort Sorts the BAM file according to some criteria
split Splits a BAM file on user-specified property, creating a new BAM
output file for each value found
stats Prints some basic statistics from input BAM file(s)
The package is enhanced by the following packages:
multiqc
|
|
bandage
Bioinformatics Application for Navigating De novo Assembly Graphs Easily
|
Versions of package bandage |
Release | Version | Architectures |
sid | 0.9.0-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 0.9.0-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.9.0-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 0.8.1-1 | amd64,arm64,armhf,i386 |
bullseye | 0.8.1-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
Bandage is a GUI program that allows users to interact with the assembly
graphs made by de novo assemblers such as Velvet, SPAdes, MEGAHIT and
others.
De novo assembly graphs contain not only assembled contigs but
also the connections between those contigs, which were previously not
easily accessible. Bandage visualises assembly graphs, with connections,
using graph layout algorithms. Nodes in the drawn graph, which represent
contigs, can be automatically labelled with their ID, length or depth.
Users can interact with the graph by moving, labelling and colouring
nodes. Sequence information can also be extracted directly from the
graph viewer. By displaying connections between contigs, Bandage opens
up new possibilities for analysing and improving de novo assemblies that
are not possible by looking at contigs alone.
More information and download links are on the Bandage website:
rrwick.github.io/Bandage
The package is relevant to the field of genome assembly.
|
|
barrnap
rapid ribosomal RNA prediction
|
Versions of package barrnap |
Release | Version | Architectures |
bullseye | 0.9+dfsg-2 | all |
bookworm | 0.9+dfsg-3 | all |
stretch | 0.7+dfsg-2 | all |
trixie | 0.9+dfsg-4 | all |
sid | 0.9+dfsg-4 | all |
buster | 0.9+dfsg-1 | all |
|
License: DFSG free
|
Barrnap (BAsic Rapid Ribosomal RNA Predictor) predicts the location of
ribosomal RNA genes in genomes. It supports bacteria (5S,23S,16S), archaea
(5S,5.8S,23S,16S), mitochondria (12S,16S) and eukaryotes (5S,5.8S,28S,18S).
It takes FASTA DNA sequence as input, and writes GFF3 as output. It uses the
NHMMER tool that comes with HMMER 3.1 for HMM searching in RNA:DNA style.
Multithreading is supported and one can expect roughly linear speed-ups
with more CPUs.
Topics: Functional, regulatory and non-coding RNA
|
|
bbmap
BBTools genomic aligner and other tools for short sequences
|
Versions of package bbmap |
Release | Version | Architectures |
bullseye | 38.90+dfsg-1 | all |
buster-backports | 38.63+dfsg-1~bpo10+1 | all |
trixie | 39.09+dfsg-1 | all |
bookworm | 39.01+dfsg-2 | all |
sid | 39.09+dfsg-1 | all |
upstream | 39.10 |
|
License: DFSG free
|
The BBTools are a collection of small programs to solve recurrent
tasks for the creative handling of short biological RNA/DNA sequences.
This suite may be best known for its mapper, which is also the name of
the project on sourceforge, but several tools have been added over time.
All tools are multi-threaded, implemented platform-independently in Java:
BBMap: Short read aligner for DNA and RNA-seq data. Capable of handling
arbitrarily large genomes with millions of scaffolds. Handles Illumina,
PacBio, 454, and other reads; very high sensitivity and tolerant of
errors and numerous large indels.
BBNorm: Kmer-based error-correction and normalization tool.
Dedupe: Simplifies assemblies by removing duplicate or contained
subsequences that share a target percent identity.
Reformat: Reformats reads between fasta/fastq/scarf/fasta+qual/sam,
interleaved/paired, and ASCII-33/64, at over 500 MB/s.
BBDuk: Filters, trims, or masks reads with kmer matches to an
artifact/contaminant file.
The package is enhanced by the following packages:
multiqc
|
|
bcalm
de Bruijn compaction in low memory
|
Versions of package bcalm |
Release | Version | Architectures |
bullseye | 2.2.3-1 | amd64,arm64,i386,mips64el,ppc64el,s390x |
bookworm | 2.2.3-4 | amd64,arm64,mips64el,ppc64el |
trixie | 2.2.3-5 | amd64,arm64,mips64el,ppc64el,riscv64 |
sid | 2.2.3-5 | amd64,arm64,mips64el,ppc64el,riscv64 |
|
License: DFSG free
|
A bioinformatics tool for constructing the compacted de Bruijn graph
from sequencing data.
This is the parallel version of the BCALM software using gatb-core
library.
|
|
bcftools
genomic variant calling and manipulation of VCF/BCF files
|
Versions of package bcftools |
Release | Version | Architectures |
stretch-backports | 1.8-1~bpo9+1 | amd64,arm64,armel,armhf,mips64el,mipsel,ppc64el |
bullseye | 1.11-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el |
stretch | 1.3.1-1 | amd64,arm64,armel,mips64el,mipsel,ppc64el |
buster | 1.9-1 | amd64,arm64,armhf |
bookworm | 1.16-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el |
sid | 1.20-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 1.20-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
upstream | 1.21 |
|
License: DFSG free
|
BCFtools is a set of utilities that manipulate variant calls in the
Variant Call Format (VCF) and its binary counterpart BCF. All commands work
transparently with both VCFs and BCFs, both uncompressed and BGZF-compressed.
The package is enhanced by the following packages:
multiqc
|
|
beads
2-DE electrophoresis gel image spot detection
|
Versions of package beads |
Release | Version | Architectures |
bookworm | 1.1.22-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.1.20-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 1.1.22-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 1.1.18+dfsg-3 | amd64,arm64,armhf,i386 |
|
License: DFSG free
|
Beads is a program for spot detection on 2-D gel images. It is based on
an analogy with beads flowing uphill on the surface of the gel image and
on the analysis of their paths (Langella & Zivy, 2008).
|
|
beagle
Genotype calling, genotype phasing and imputation of ungenotyped markers
|
Versions of package beagle |
Release | Version | Architectures |
bookworm | 220722-1 | all |
sid | 220722-1 | all |
buster | 5.0-180928+dfsg-1+deb10u1 | all |
stretch | 4.1~160727-86a+dfsg-1 | all |
bullseye | 5.1-200518+dfsg-1 | all |
trixie | 220722-1 | all |
upstream | 241029 |
|
License: DFSG free
|
Beagle performs genotype calling, genotype phasing, imputation of ungenotyped
markers, and identity-by-descent segment detection. Genotypic imputation works
on phased haplotypes using a Li and Stephens haplotype frequency model.
Beagle also implements the Refined IBD algorithm for detecting
homozygosity-by-descent (HBD) and identity-by-descent (IBD) segments.
The package is enhanced by the following packages:
beagle-doc
|
|
beast-mcmc
Bayesian MCMC phylogenetic inference
|
Versions of package beast-mcmc |
Release | Version | Architectures |
jessie | 1.8.0-1 (contrib) | all |
stretch | 1.8.4+dfsg.1-1 | all |
buster | 1.10.4+dfsg-1 | all |
bullseye | 1.10.4+dfsg-2 | all |
bookworm | 1.10.4+dfsg-5 | all |
trixie | 1.10.4+dfsg-5 | all |
sid | 1.10.4+dfsg-5 | all |
|
License: DFSG free
|
BEAST is a cross-platform program for Bayesian MCMC analysis of molecular
sequences. It is entirely orientated towards rooted, time-measured
phylogenies inferred using strict or relaxed molecular clock models. It
can be used as a method of reconstructing phylogenies but is also a
framework for testing evolutionary hypotheses without conditioning on a
single tree topology. BEAST uses MCMC to average over tree space, so that
each tree is weighted proportional to its posterior probability. Included
is a simple to use user-interface program for setting up standard
analyses and a suit of programs for analysing the results.
|
|
beast2-mcmc
Bayesian MCMC phylogenetic inference
|
Versions of package beast2-mcmc |
Release | Version | Architectures |
buster | 2.5.1+dfsg-2 | all |
sid | 2.7.6+dfsg-1 | all |
trixie | 2.7.6+dfsg-1 | all |
bookworm | 2.7.3+dfsg-1 | all |
bullseye | 2.6.3+dfsg-2 | all |
stretch | 2.4.4+dfsg-1 | all |
upstream | 2.7.7 |
|
License: DFSG free
|
BEAST is a cross-platform program for Bayesian MCMC analysis of molecular
sequences. It is entirely orientated towards rooted, time-measured
phylogenies inferred using strict or relaxed molecular clock models. It
can be used as a method of reconstructing phylogenies but is also a
framework for testing evolutionary hypotheses without conditioning on a
single tree topology. BEAST uses MCMC to average over tree space, so that
each tree is weighted proportional to its posterior probability. Included
is a simple to use user-interface program for setting up standard
analyses and a suit of programs for analysing the results.
This is no new upstream version of beast-mcmc (1.x) but rather a rewritten
version.
|
|
bedops
high-performance genomic feature operations
|
Versions of package bedops |
Release | Version | Architectures |
trixie | 2.4.41+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch-backports | 2.4.35+dfsg-1~bpo9+1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bookworm | 2.4.41+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 2.4.39+dfsg1-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 2.4.41+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 2.4.35+dfsg-1 | amd64,arm64,armhf,i386 |
|
License: DFSG free
|
BEDOPS is a suite of tools to address common questions raised in genomic
studies, mostly with regard to overlap and proximity relationships between
data sets.
It aims to be scalable and flexible, facilitating the efficient and accurate
analysis and management of large-scale genomic data.
Please cite:
Shane Neph, M. Scott Kuehn, Alex P. Reynolds, Eric Haugen, Robert E. Thurman, Audra K. Johnson, Eric Rynes, Matthew T. Maurano, Jeff Vierstra, Sean Thomas, Richard Sandstrom, Richard Humbert and John A. Stamatoyannopoulos:
BEDOPS: high-performance genomic feature operations.
(PubMed,eprint)
28(14):1919-1920
(2012)
|
|
bedtools
suite of utilities for comparing genomic features
|
Versions of package bedtools |
Release | Version | Architectures |
jessie | 2.21.0-1 | amd64,armhf,i386 |
sid | 2.31.1+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 2.31.1+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 2.30.0+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 2.30.0+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 2.27.1+dfsg-4 | amd64,arm64,armhf |
stretch | 2.26.0+dfsg-3 | amd64,arm64,armel,i386,mips64el,mipsel,ppc64el |
Debtags of package bedtools: |
field | biology, biology:bioinformatics |
interface | commandline |
role | program |
scope | suite |
use | analysing, comparing, converting, filtering |
works-with | biological-sequence |
|
License: DFSG free
|
The BEDTools utilities allow one to address common genomics tasks such as
finding feature overlaps and computing coverage. The utilities are largely
based on four widely-used file formats: BED, GFF/GTF, VCF, and SAM/BAM. Using
BEDTools, one can develop sophisticated pipelines that answer complicated
research questions by streaming several BEDTools together.
The groupBy utility is distributed in the filo package.
|
|
belvu
multiple sequence alignment viewer and phylogenetic tool
|
Versions of package belvu |
Release | Version | Architectures |
buster | 4.44.1+dfsg-3 | amd64,arm64,armhf,i386 |
trixie | 4.44.1+dfsg-7.1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 4.44.1+dfsg-7 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 4.44.1+dfsg-6 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 4.44.1+dfsg-7.1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
Belvu is a multiple sequence alignment viewer and phylogenetic tool with
an extensive set of user-configurable modes to color residues.
- View multiple sequence alignments.
- Residues can be coloured by conservation, with user-configurable
cutoffs and colours.
- Residues can be coloured by residue type (user-configurable).
- Colour schemes can be imported or exported.
- Swissprot (or PIR) entries can be fetched by double clicking.
- The position in the alignment can be easily tracked.
- Manual deletion of rows and columns.
- Automatic editing of rows and columns based on customisable criteria:
- removal of all-gap columns;
- removal of all gaps;
- removal of redundant sequences;
- removal of a column by a user-specified percentage of gaps;
- filtering of sequences by percent identity;
- removal of sequences by a user-specified percentage of gaps;
- removal of partial sequences (those starting or ending with
gaps); and
- removal of columns by conservation (with user-specified
upper/lower cutoffs).
- The alignment can be saved in Stockholm, Selex, MSF or FASTA format.
- Distance matrices between sequences can be generated using a variety
of distance metrics.
- Distance matrices can be imported or exported.
- Phylogenetic trees can be constructed based on various distance-based
tree reconstruction algorithms.
- Trees can be saved in New Hampshire format.
- Belvu can perform bootstrap phylogenetic reconstruction.
|
|
berkeley-express
Streaming quantification for high-throughput sequencing
|
Versions of package berkeley-express |
Release | Version | Architectures |
buster | 1.5.2+dfsg-1 | amd64,arm64,armhf,i386 |
sid | 1.5.3+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 1.5.3+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 1.5.1-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.5.3+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.5.3+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
eXpress is a streaming tool for quantifying the abundances of a set of
target sequences from sampled subsequences. Example applications include
transcript-level RNA-Seq quantification, allele-specific/haplotype
expression analysis (from RNA-Seq), transcription factor binding
quantification in ChIP-Seq, and analysis of metagenomic data. It is
based on an online-EM algorithm that results in space (memory)
requirements proportional to the total size of the target sequences and
time requirements that are proportional to the number of sampled
fragments. Thus, in applications such as RNA-Seq, eXpress can accurately
quantify much larger samples than other currently available tools
greatly reducing computing infrastructure requirements. eXpress can be
used to build lightweight high-throughput sequencing processing
pipelines when coupled with a streaming aligner (such as Bowtie), as
output can be piped directly into eXpress, effectively eliminating the
need to store read alignments in memory or on disk.
In an analysis of the performance of eXpress for RNA-Seq data, it was
observed that this efficiency does not come at a cost of accuracy.
eXpress is more accurate than other available tools, even when limited
to smaller datasets that do not require such efficiency. Moreover, like
the Cufflinks program, eXpress can be used to estimate transcript
abundances in multi-isoform genes. eXpress is also able to resolve
multi-mappings of reads across gene families, and does not require a
reference genome so that it can be used in conjunction with de novo
assemblers such as Trinity, Oases, or Trans-ABySS. The underlying model
is based on previously described probabilistic models developed for
RNA-Seq but is applicable to other settings where target sequences are
sampled, and includes parameters for fragment length distributions,
errors in reads, and sequence-specific fragment bias.
eXpress can be used to resolve ambiguous mappings in other
high-throughput sequencing based applications. The only required inputs
to eXpress are a set of target sequences and a set of sequenced
fragments multiply-aligned to them. While these target sequences will
often be gene isoforms, they need not be. Haplotypes can be used as the
reference for allele-specific expression analysis, binding regions for
ChIP-Seq, or target genomes in metagenomics experiments. eXpress is
useful in any analysis where reads multi-map to sequences that differ in
abundance.
|
|
bifrost
parallel construction, indexing and querying of de Bruijn graphs
|
Versions of package bifrost |
Release | Version | Architectures |
sid | 1.3.1-1 | amd64,arm64,mips64el,ppc64el,riscv64,s390x |
trixie | 1.3.1-1 | amd64,arm64,mips64el,ppc64el,riscv64,s390x |
upstream | 1.3.5 |
|
License: DFSG free
|
Bifrost is a command-line tool for sequencing that features a broad
range of functions, such as indexing, editing, and querying the graph,
and includes a graph coloring method that maps each k-mer of the graph
to the genomes it occurs in.
- Build, index, color and query the compacted de Bruijn graph
- No need to build the uncompacted de Bruijn graph
- Reads or assembled genomes as input
- Output graph in GFA (can be visualized with Bandage), FASTA or binary
- Graph cleaning: short tip clipping, etc.
- Multi-threaded
- No parameters to estimate with other tools
- Exact or approximate k-mer search of queries
|
|
bio-eagle
Haplotype phasing within a genotyped cohort or using a phased reference panel
|
Versions of package bio-eagle |
Release | Version | Architectures |
stretch | 2.3-3 | amd64,i386 |
buster | 2.4.1-1 | amd64 |
bullseye | 2.4.1-3 | amd64,i386 |
bookworm | 2.4.1-3 | amd64,i386 |
trixie | 2.4.1-3 | amd64,i386 |
sid | 2.4.1-3 | amd64,i386 |
|
License: DFSG free
|
Eagle estimates haplotype phase either within a genotyped cohort or using a
phased reference panel. The basic idea of the Eagle1 algorithm is to harness
identity-by-descent among distant relatives—which is pervasive at very large
sample sizes but rare among smaller numbers of samples—to rapidly call phase
using a fast scoring approach. In contrast, the Eagle2 algorithm analyzes a
full probabilistic model similar to the diploid Li-Stephens model used by
previous HMM-based methods.
Please note: The executable was renamed to bio-eagle because of a name clash.
Please read more about this in /usr/share/doc/bio-eagle/README.Debian.
|
|
bio-rainbow
clustering and assembling short reads for bioinformatics
|
Versions of package bio-rainbow |
Release | Version | Architectures |
bookworm | 2.0.4+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 2.0.4+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 2.0.4+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 2.0.4-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
trixie | 2.0.4+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 2.0.4+dfsg-1 | amd64,arm64,armhf,i386 |
|
License: DFSG free
|
Efficient tool for clustering and assembling short reads,
especially for RAD.
Rainbow is developed to provide an ultra-fast and memory-efficient
solution to clustering and assembling short reads produced by RAD-seq.
First, Rainbow clusters reads using a spaced seed method. Then, Rainbow
implements a heterozygote calling like strategy to divide potential
groups into haplotypes in a top-down manner. long a guided tree, it
iteratively merges sibling leaves in a bottom-up manner if they are
similar enough. Here, the similarity is defined by comparing the 2nd
reads of a RAD segment. This approach tries to collapse heterozygote
while discriminate repetitive sequences. At last, Rainbow uses a greedy
algorithm to locally assemble merged reads into contigs. Rainbow not
only outputs the optimal but also suboptimal assembly results. Based on
simulation and a real guppy RAD-seq data, it is shown that Rainbow is
more competent than the other tools in dealing with RAD-seq data.
|
|
bio-tradis
analyse the output from TraDIS analyses of genomic sequences
|
Versions of package bio-tradis |
Release | Version | Architectures |
bullseye | 1.4.5+dfsg2-1 | all |
trixie | 1.4.5+dfsg2-2 | all |
sid | 1.4.5+dfsg2-2 | all |
buster | 1.4.1+dfsg-1 | all |
stretch-backports | 1.3.3+dfsg-3~bpo9+1 | all |
bookworm | 1.4.5+dfsg2-1 | all |
|
License: DFSG free
|
Bio-Tradis contains a set of tools to analyse the output from
TraDIS analyses.
The Bio-Tradis analysis pipeline is implemented as an extensible Perl
library which can either be used as is, or as a basis for the
development of more advanced analysis tools.
Please note: You need to manually install BioConductor Edger which can
not be distributed by Debian in recent version since it is using
non-distributable code locfit.
|
|
bio-vcf
domain specific language (DSL) for processing the VCF format
|
Versions of package bio-vcf |
Release | Version | Architectures |
bookworm | 0.9.5-3 | all |
trixie | 0.9.5-3 | all |
bullseye | 0.9.5-2 | all |
sid | 0.9.5-3 | all |
|
License: DFSG free
|
Bio-vcf provides a domain specific language (DSL) for processing the
VCF format. Record named fields can be queried with regular
expressions, e.g.
sample.dp>20 and rec.filter !~ /LowQD/ and rec.tumor.bcount[rec.alt]>4
Bio-vcf is a new generation VCF parser, filter and converter. Bio-vcf
is not only very fast for genome-wide (WGS) data, it also comes with a
really nice filtering, evaluation and rewrite language and it can
output any type of textual data, including VCF header and contents in
RDF and JSON.
|
|
bioawk
extension of awk for biological sequence analysis
|
Versions of package bioawk |
Release | Version | Architectures |
sid | 1.0-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.0-4+deb12u1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.0-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
Bioawk is an extension to Brian Kernighan's awk, adding the support of
several common biological data formats, including optionally gzip'ed BED, GFF,
SAM, VCF, FASTA/Q and TAB-delimited formats with column names. It also adds a
few built-in functions and an command line option to use TAB as the
input/output delimiter. When the new functionality is not used, bioawk is
intended to behave exactly the same as the original BWK awk.
|
|
biobambam2
tools for early stage alignment file processing
|
Versions of package biobambam2 |
Release | Version | Architectures |
bookworm | 2.0.185+ds-1 | amd64,arm64,i386,ppc64el |
sid | 2.0.185+ds-2 | amd64,i386,mips64el,ppc64el,riscv64 |
bullseye | 2.0.179+ds-1 | amd64,arm64,i386,ppc64el |
trixie | 2.0.185+ds-2 | amd64,i386,mips64el,ppc64el,riscv64 |
upstream | 2.0.185-release-20221211202123 |
|
License: DFSG free
|
This package contains some tools for processing BAM files, including
bamsormadup: parallel sorting and duplicate marking
bamcollate2: reads BAM and writes BAM reordered such that alignment
or collated by query name
bammarkduplicates: reads BAM and writes BAM with duplicate alignments
marked using the BAM flags field
bammaskflags: reads BAM and writes BAM while masking (removing) bits
from the flags column
bamrecompress: reads BAM and writes BAM with a defined compression
setting. This tool is capable of multi-threading.
bamsort: reads BAM and writes BAM resorted by coordinates or
query name
bamtofastq: reads BAM and writes FastQ; output can be collated
or uncollated by query name
The package is enhanced by the following packages:
multiqc
|
|
biosyntax
Syntax Highlighting for Computational Biology (metapackage)
|
Versions of package biosyntax |
Release | Version | Architectures |
bookworm | 1.0.0b-4 | all |
buster | 1.0.0b-1 | all |
sid | 1.0.0b-6 | all |
trixie | 1.0.0b-6 | all |
bullseye | 1.0.0b-2 | all |
|
License: DFSG free
|
Syntax highlighting for computational biology to bring you intuitively close
to your data. BioSyntax supports .sam, .flagstat, .vcf, .fasta, .fastq, .faidx
, .clustal, .pdb, .gtf, .bed files & more.
This is a metapackage depending on all bioSyntax plugins.
|
|
bitseq
Bayesian Inference of Transcripts from Sequencing Data
|
Versions of package bitseq |
Release | Version | Architectures |
bullseye | 0.7.5+dfsg-5 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 0.7.5+dfsg-4 | amd64,arm64,armhf,i386 |
sid | 0.7.5+dfsg-6 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 0.7.5+dfsg-6 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.7.5+dfsg-6 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
BitSeq is an application for inferring expression levels of individual
transcripts from sequencing (RNA-Seq) data and estimating differential
expression (DE) between conditions. An advantage of this approach is the
ability to account for both technical uncertainty and intrinsic biological
variance in order to avoid false DE calls. The technical contribution to the
uncertainty comes both from finite read-depth and the possibly ambiguous
mapping of reads to multiple transcripts.
|
|
blasr
mapping single-molecule sequencing reads
|
Versions of package blasr |
Release | Version | Architectures |
sid | 5.3.5+dfsg-6 | amd64,arm64,mips64el,ppc64el,riscv64 |
bullseye | 5.3.3+dfsg-5 | amd64,arm64,mips64el,ppc64el |
bookworm | 5.3.5+dfsg-6 | amd64,arm64,mips64el,ppc64el |
stretch | 5.3+0-1 | amd64,arm64,mips64el,ppc64el |
buster | 5.3.2+dfsg-1.1 | amd64,arm64 |
|
License: DFSG free
|
Basic local alignment with successive refinement (BLASR) is a method
for mapping single-molecule sequencing reads against a reference genome.
Such reads are thousands of bases long, with divergence between them
and the genome being dominated by insertion and deletion error.
|
|
blixem
interactive browser of sequence alignments
|
Versions of package blixem |
Release | Version | Architectures |
sid | 4.44.1+dfsg-7.1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 4.44.1+dfsg-6 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 4.44.1+dfsg-3 | amd64,arm64,armhf,i386 |
bookworm | 4.44.1+dfsg-7 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 4.44.1+dfsg-7.1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
Blixem is an interactive browser of sequence alignments that have been
stacked up in a "master-slave" multiple alignment; it is not a 'true'
multiple alignment but a 'one-to-many' alignment.
- Overview section showing the positions of genes and alignments around
the alignment window
- Detail section showing the actual alignment of protein or nucleotide
sequences to the genomic DNA sequence.
- View alignments against both strands of the reference sequence.
- View sequences in nucleotide or protein mode; in protein mode, Blixem
will display the three-frame translation of the reference sequence.
- Residues are highlighted in different colours depending on whether
they are an exact match, conserved substitution or mismatch.
- Gapped alignments are supported, with insertions and deletions being
highlighted in the match sequence.
- Matches can be sorted and filtered.
- SNPs and other variations can be highlighted in the reference
sequence.
- Poly(A) tails can be displayed and poly(A) signals highlighted in the
reference sequence.
|
|
bolt-lmm
Efficient large cohorts genome-wide Bayesian mixed-model association testing
|
Versions of package bolt-lmm |
Release | Version | Architectures |
trixie | 2.4.1+dfsg-2 | amd64,i386,ppc64el |
buster | 2.3.2+dfsg-3 | amd64 |
bullseye | 2.3.4+dfsg-3 | amd64,i386,ppc64el |
bookworm | 2.4.0+dfsg-1 | amd64,i386,ppc64el |
sid | 2.4.1+dfsg-2 | amd64,i386,ppc64el |
|
License: DFSG free
|
The BOLT-LMM software package currently consists of two main algorithms, the
BOLT-LMM algorithm for mixed model association testing, and the BOLT-REML
algorithm for variance components analysis (i.e., partitioning of
SNP-heritability and estimation of genetic correlations).
The BOLT-LMM algorithm computes statistics for testing association between
phenotype and genotypes using a linear mixed model. By default, BOLT-LMM
assumes a Bayesian mixture-of-normals prior for the random effect attributed
to SNPs other than the one being tested. This model generalizes the standard
infinitesimal mixed model used by previous mixed model association methods,
providing an opportunity for increased power to detect associations while
controlling false positives. Additionally, BOLT-LMM applies algorithmic
advances to compute mixed model association statistics much faster than
eigendecomposition-based methods, both when using the Bayesian mixture model
and when specialized to standard mixed model association.
The BOLT-REML algorithm estimates heritability explained by genotyped SNPs and
genetic correlations among multiple traits measured on the same set of
individuals. BOLT-REML applies variance components analysis to perform these
tasks, supporting both multi-component modeling to partition SNP-heritability
and multi-trait modeling to estimate correlations. BOLT-REML applies a Monte
Carlo algorithm that is much faster than eigendecomposition-based methods for
variance components analysis at large sample sizes.
Please cite:
Po-Ru Loh, George Tucker, Brendan K Bulik-Sullivan, Bjarni J Vilhjálmsson, Hilary K Finucane, Rany M Salem, Daniel I Chasman, Paul M Ridker, Benjamin M Neale, Bonnie Berger, Nick Patterson and Alkes L Price:
Efficient Bayesian mixed-model analysis increases association power in large cohorts.
Nature Genetics
(2015)
|
|
bowtie
Ultrafast memory-efficient short read aligner
|
Versions of package bowtie |
Release | Version | Architectures |
buster | 1.2.2+dfsg-4 | amd64,arm64 |
bullseye | 1.3.0+dfsg1-1 | amd64,arm64,mips64el,ppc64el,s390x |
jessie | 1.1.1-2 | amd64 |
stretch | 1.1.2-6 | amd64,arm64,mips64el,ppc64el,s390x |
sid | 1.3.1-3 | amd64,arm64,mips64el,ppc64el,riscv64,s390x |
trixie | 1.3.1-3 | amd64,arm64,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.3.1-1 | amd64,arm64,mips64el,ppc64el,s390x |
Debtags of package bowtie: |
biology | nuceleic-acids |
field | biology:bioinformatics |
interface | commandline |
role | program |
science | calculation |
scope | utility |
use | analysing, comparing |
works-with | biological-sequence |
|
License: DFSG free
|
This package addresses the problem to interpret the results from the
latest (2010) DNA sequencing technologies. Those will yield fairly
short stretches and those cannot be interpreted directly. It is the
challenge for tools like Bowtie to give a chromosomal location to the
short stretches of DNA sequenced per run.
Bowtie aligns short DNA sequences (reads) to the human genome at a rate
of over 25 million 35-bp reads per hour. Bowtie indexes the genome with
a Burrows-Wheeler index to keep its memory footprint small: typically
about 2.2 GB for the human genome (2.9 GB for paired-end).
|
|
bowtie2
ultrafast memory-efficient short read aligner
|
Versions of package bowtie2 |
Release | Version | Architectures |
stretch | 2.3.0-2 | amd64 |
bookworm | 2.5.0-3 | amd64,arm64,mips64el,ppc64el |
bullseye | 2.4.2-2 | amd64,arm64,mips64el,ppc64el |
trixie | 2.5.4-1 | amd64,arm64,mips64el,ppc64el,riscv64 |
sid | 2.5.4-1 | amd64,arm64,mips64el,ppc64el,riscv64 |
jessie | 2.2.4-1 | amd64 |
buster | 2.3.4.3-1 | amd64 |
|
License: DFSG free
|
is an ultrafast and memory-efficient tool for aligning sequencing reads
to long reference sequences. It is particularly good at aligning reads
of about 50 up to 100s or 1,000s of characters, and particularly good
at aligning to relatively long (e.g. mammalian) genomes.
Bowtie 2 indexes the genome with an FM Index to keep its memory footprint
small: for the human genome, its memory footprint is typically
around 3.2 GB. Bowtie 2 supports gapped, local, and paired-end alignment modes
|
|
boxshade
Pretty-printing of multiple sequence alignments
|
Versions of package boxshade |
Release | Version | Architectures |
jessie | 3.3.1-8 | amd64,armel,armhf,i386 |
buster | 3.3.1-12 | amd64,arm64,armhf,i386 |
bullseye | 3.3.1-14 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 3.3.1-14 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 3.3.1-14 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 3.3.1-14 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 3.3.1-10 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
Debtags of package boxshade: |
biology | format:aln, nuceleic-acids, peptidic |
field | biology, biology:bioinformatics |
interface | commandline |
role | program |
scope | utility |
use | typesetting |
works-with-format | html, plaintext, postscript, tex |
|
License: DFSG free
|
Boxshade is a program for creating good looking printouts from
multiple-aligned protein or DNA sequences. The program does not perform
the alignment by itself and requires as input a file that was created
by a multiple alignment program or manually edited with respective tools.
Boxshade reads multiple-aligned sequences from either PILEUP-MSF,
CLUSTAL-ALN, MALIGNED-data and ESEE-save files (limited to a maximum
of 150 sequences with up to 10000 elements each). Various kinds of
shading can be applied to identical/similar residues. Output is written
to screen or to a file in the following formats: ANSI/VT100, PS/EPS,
RTF, HPGL, ReGIS, LJ250-printer, ASCII, xFIG, PICT, HTML
|
|
bppphyview
Bio++ Phylogenetic Viewer
|
Versions of package bppphyview |
Release | Version | Architectures |
trixie | 0.6.1-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 0.3.0-1 | amd64,armel,armhf,i386 |
stretch | 0.3.0-1.1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
sid | 0.6.1-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.6.1-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 0.6.1-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 0.6.1-1 | amd64,arm64,armhf,i386 |
Debtags of package bppphyview: |
role | program |
uitoolkit | qt |
|
License: DFSG free
|
A phylogenetic tree editor developed using Bio++ and Qt. Phyview allows one
to visualize, edit, print and output phylogenetic trees and associated data.
|
|
bppsuite
|
Versions of package bppsuite |
Release | Version | Architectures |
sid | 2.4.1-7 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 2.2.0-0.1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 2.4.1-1 | amd64,arm64,armhf,i386 |
bullseye | 2.4.1-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 2.4.1-6 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 2.4.1-7 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
Debtags of package bppsuite: |
role | program |
|
License: DFSG free
|
The Bio++ Program Suite is a package of programs using the Bio++
libraries and dedicated to Phylogenetics and Molecular Evolution. All
programs are independent, but can be combined to perform rather complex
analyses. These programs use the interface helper tools of the
libraries, and hence share the same syntax. They also have several
options in common, which may also be shared by third-party software.
The following programs are included:
- BppML for maximum likelihood analysis,
- BppSeqGen for sequences simulation,
- BppAncestor for ancestral states reconstruction,
- BppDist for distance methods,
- BppPars for parsimony analysis,
- BppSeqMan for file conversion and sequence manipulation,
- BppConsense for building consensus tree and computing
bootstrap values,
- BppReRoot for tree rerooting.
- BppTreeDraw for tree drawing.
- BppAlnScore for comparing alignments and computing alignment scores.
- BppMixedLikelihoods for computing site per site likelihoods of
components of mixture models.
- BppPopGen for population genetics analyses.
|
|
brig
BLAST Ring Image Generator
|
Versions of package brig |
Release | Version | Architectures |
trixie | 0.95+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 0.95+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.95+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 0.95+dfsg-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 0.95+dfsg-2 | amd64,arm64,armhf,i386 |
bullseye | 0.95+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
BRIG can display circular comparisons between a large number of genomes,
with a focus on handling genome assembly data.
- Images show similarity between a central reference sequence and other
sequences as concentric rings.
- BRIG will perform all BLAST comparisons and file parsing automatically
via a simple GUI.
- Contig boundaries and read coverage can be displayed for draft genomes;
customized graphs and annotations can be displayed.
- Using a user-defined set of genes as input, BRIG can display gene
presence, absence, truncation or sequence variation in a set of
complete genomes, draft genomes or even raw, unassembled sequence data.
- BRIG also accepts SAM-formatted read-mapping files enabling genomic
regions present in unassembled sequence data from multiple samples to
be compared simultaneously
|
|
btllib-tools
Bioinformatics Technology Lab common code library tools
|
Versions of package btllib-tools |
Release | Version | Architectures |
trixie | 1.4.10+dfsg-1 | amd64,arm64,mips64el,ppc64el,riscv64 |
bookworm | 1.4.10+dfsg-1 | amd64,arm64,mips64el,ppc64el |
sid | 1.4.10+dfsg-1 | amd64,arm64,mips64el,ppc64el,riscv64 |
upstream | 1.7.3 |
|
License: DFSG free
|
Bioinformatics Technology Lab common code library in C++ with
Python wrappers.
This package contains the tool indexlr.
|
|
busco
benchmarking sets of universal single-copy orthologs
|
Versions of package busco |
Release | Version | Architectures |
trixie | 5.5.0-2 | amd64,arm64,i386 |
bookworm | 5.4.4-1 | amd64,i386 |
bullseye | 5.0.0-1 | all |
sid | 5.5.0-2 | amd64,arm64,i386 |
|
License: DFSG free
|
Assessing genome assembly and annotation completeness with Benchmarking
Universal Single-Copy Orthologs (BUSCO).
- Automated selection of lineages issued from https://www.orthodb.org/
- Automated download of all necessary files and datasets to conduct a run
- Use prodigal for non-eukaryotic genomes
The package is enhanced by the following packages:
multiqc
|
|
bustools
program for manipulating BUS files for single cell RNA-Seq datasets
|
Versions of package bustools |
Release | Version | Architectures |
trixie | 0.43.2+dfsg-1 | amd64,arm64,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.42.0+dfsg-1 | amd64,arm64,mips64el,ppc64el,s390x |
bullseye | 0.40.0-4 | amd64,arm64,mips64el,ppc64el,s390x |
sid | 0.43.2+dfsg-1 | amd64,arm64,mips64el,ppc64el,riscv64,s390x |
upstream | 0.44.1 |
|
License: DFSG free
|
This package contains BUStools program, it can be used to error correct
barcodes, collapse UMIs, produce gene count or transcript compatibility
count matrices
|
|
bwa
|
Versions of package bwa |
Release | Version | Architectures |
sid | 0.7.18-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.7.17-7 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 0.7.17-6 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 0.7.17-3 | amd64 |
stretch-backports | 0.7.17-1~bpo9+1 | amd64 |
stretch | 0.7.15-2+deb9u1 | amd64 |
jessie | 0.7.10-1 | amd64 |
trixie | 0.7.18-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
Debtags of package bwa: |
biology | nuceleic-acids, peptidic |
field | biology, biology:bioinformatics |
interface | commandline, text-mode |
role | program |
use | analysing, comparing |
|
License: DFSG free
|
BWA is a software package for mapping low-divergent sequences against
a large reference genome, such as the human genome. It consists of
three algorithms: BWA-backtrack, BWA-SW and BWA-MEM. The first
algorithm is designed for Illumina sequence reads up to 100bp, while
the rest two for longer sequences ranged from 70bp to 1Mbp. BWA-MEM
and BWA-SW share similar features such as long-read support and split
alignment, but BWA-MEM, which is the latest, is generally recommended
for high-quality queries as it is faster and more accurate. BWA-MEM
also has better performance than BWA-backtrack for 70-100bp Illumina
reads.
|
|
canu
single molecule sequence assembler for genomes
|
Versions of package canu |
Release | Version | Architectures |
sid | 2.2+dfsg-5 | amd64,arm64,mips64el,ppc64el,riscv64,s390x |
buster | 1.8+dfsg-2 | amd64 |
stretch-backports | 1.7.1+dfsg-1~bpo9+1 | amd64 |
bullseye | 2.0+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 2.0+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
Canu is a fork of the Celera Assembler, designed for high-noise
single-molecule sequencing (such as the PacBio RS II or Oxford
Nanopore MinION).
Canu is a hierarchical assembly pipeline which runs in four steps:
- Detect overlaps in high-noise sequences using MHAP
- Generate corrected sequence consensus
- Trim corrected sequences
- Assemble trimmed corrected sequences
|
|
cassiopee
index and search tool in genomic sequences
|
Versions of package cassiopee |
Release | Version | Architectures |
sid | 1.0.9-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 1.0.5-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 1.0.1+dfsg-3 | amd64,armel,armhf,i386 |
trixie | 1.0.9-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 1.0.9-2 | amd64,arm64,armhf,i386 |
bookworm | 1.0.9-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.0.9-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
Cassiopee index and search library C implementation.
It is a complete rewrite of the ruby Cassiopee gem. It scans an input
genomic sequence (dna/rna/protein) and search for a subsequence with
exact match or allowing substitutions (Hamming distance) and/or
insertion/deletions.
This package contains the cassiopee and cassiopeeknife tools.
|
|
cat-bat
taxonomic classification of contigs and metagenome-assembled genomes (MAGs)
|
Versions of package cat-bat |
Release | Version | Architectures |
sid | 5.3-2 | amd64,arm64,ppc64el,riscv64,s390x |
trixie | 5.3-2 | amd64,arm64,ppc64el,riscv64,s390x |
bookworm | 5.2.3-2 | amd64,arm64,ppc64el,s390x |
bullseye | 5.2.2-1 | amd64,arm64,ppc64el,s390x |
upstream | 6.0.1 |
|
License: DFSG free
|
Contig Annotation Tool (CAT) and Bin Annotation Tool (BAT) are pipelines
for the taxonomic classification of long DNA sequences and metagenome
assembled genomes (MAGs/bins) of both known and (highly) unknown
microorganisms, as generated by contemporary metagenomics studies. The
core algorithm of both programs involves gene calling, mapping of
predicted ORFs against the nr protein database, and voting-based
classification of the entire contig / MAG based on classification of the
individual ORFs. CAT and BAT can be run from intermediate steps if files
are formatted appropriately.
|
|
cct
visually comparing bacterial, plasmid, chloroplast, or mitochondrial sequences
|
Versions of package cct |
Release | Version | Architectures |
stretch-backports | 20170919+dfsg-1~bpo9+1 | all |
bullseye | 1.0.0-1 | all |
trixie | 1.0.3-1 | all |
bookworm | 1.0.3-1 | all |
sid | 1.0.3-1 | all |
buster | 20170919+dfsg-1 | all |
|
License: DFSG free
|
The CGView Comparison Tool (CCT) is a package for visually comparing
bacterial, plasmid, chloroplast, or mitochondrial sequences of interest
to existing genomes or sequence collections. The comparisons are
conducted using BLAST, and the BLAST results are presented in the form
of graphical maps that can also show sequence features, gene and protein
names, COG category assignments, and sequence composition
characteristics. CCT can generate maps in a variety of sizes, including
400 Megapixel maps suitable for posters. Comparisons can be conducted
within a particular species or genus, or all available genomes can be
used. The entire map creation process, from downloading sequences to
redrawing zoomed maps, can be completed easily using scripts included
with the CCT. User-defined features or analysis results can be included
on maps, and maps can be extensively customized. To simplify program
setup, a CCT virtual machine that includes all dependencies preinstalled
is available. Detailed tutorials illustrating the use of CCT are
included with the CCT documentation.
|
|
cd-hit
suite of programs designed to quickly group sequences
|
Versions of package cd-hit |
Release | Version | Architectures |
stretch | 4.6.6-2 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 4.6.8-2 | amd64,arm64,armhf,i386 |
bullseye | 4.8.1-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 4.8.1-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 4.8.1-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 4.6.1-2012-08-27-2 | amd64,armel,armhf,i386 |
bookworm | 4.8.1-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
cd-hit contains a number of programs designed to quickly group
sequences. cd-hit groups proteins into clusters that meet a user-defined
similarity threshold. cd-hit-est is similar to cd-hit, but designed to
group nucleotide sequences (without introns). cd-hit-est-2d is similar
to cd-hit-2d but designed to compare two nucleotide datasets. A number
of other related programs are also in this package. Please see the
cd-hit user manual, also part of this package, for further information.
|
|
cdbfasta
Constant DataBase indexing and retrieval tools for multi-FASTA files
|
Versions of package cdbfasta |
Release | Version | Architectures |
bullseye | 1.00+git20181005.014498c+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 1.00+git20230710.da8f5ba+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 1.00+git20230710.da8f5ba+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.00+git20181005.014498c+dfsg-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 0.99-20100722-5 | amd64,arm64,armhf,i386 |
stretch | 0.99-20100722-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 0.99-20100722-1 | amd64,armel,armhf,i386 |
|
License: DFSG free
|
CDB (Constant DataBase) can be used for creating indices for quick
retrieval of any particular sequences from large multi-FASTA files.
It has the option to compress data records in order to save space.
|
|
centrifuge
rapid and memory-efficient system for classification of DNA sequences
|
Versions of package centrifuge |
Release | Version | Architectures |
bookworm | 1.0.3-11 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el |
bullseye | 1.0.3-8 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el |
trixie | 1.0.4.1-1 | amd64,arm64,mips64el,ppc64el,riscv64 |
sid | 1.0.4.2-1 | amd64,arm64,mips64el,ppc64el,riscv64 |
buster | 1.0.3-2 | amd64 |
|
License: DFSG free
|
Centrifuge is a very rapid and memory-efficient system for the
classification of DNA sequences from microbial samples, with better
sensitivity than and comparable accuracy to other leading systems. The
system uses a novel indexing scheme based on the Burrows-Wheeler
transform (BWT) and the Ferragina-Manzini (FM) index, optimized
specifically for the metagenomic classification problem. Centrifuge
requires a relatively small index (e.g., 4.3 GB for ~4,100 bacterial
genomes) yet provides very fast classification speed, allowing it to
process a typical DNA sequencing run within an hour. Together these
advances enable timely and accurate analysis of large metagenomics data
sets on conventional desktop computers.
|
|
cgview
|
Versions of package cgview |
Release | Version | Architectures |
buster | 0.0.20100111-4 | amd64,arm64,armhf,i386 |
stretch | 0.0.20100111-2 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bookworm | 0.0.20100111-7 | all |
trixie | 0.0.20100111-7 | all |
bullseye | 0.0.20100111-7 | all |
sid | 0.0.20100111-7 | all |
|
License: DFSG free
|
CGView is a Java package for generating high quality, zoomable maps of
circular genomes. Its primary purpose is to serve as a component of
sequence annotation pipelines, as a means of generating visual output
suitable for the web. Feature information and rendering options are
supplied to the program using an XML file, a tab delimited file, or an
NCBI ptt file. CGView converts the input into a graphical map (PNG, JPG,
or Scalable Vector Graphics format), complete with labels, a title,
legends, and footnotes. In addition to the default full view map, the
program can generate a series of hyperlinked maps showing expanded
views. The linked maps can be explored using any web browser, allowing
rapid genome browsing, and facilitating data sharing. The feature labels
in maps can be hyperlinked to external resources, allowing CGView maps
to be integrated with existing web site content or databases.
In addition to the CGView application, an API is available for
generating maps from within other Java applications, using the
cgview package.
CGView can be used for any of the following:
- Bacterial genome visualization and browsing - CGView can be
incorporated into bacterial genome annotation pipelines, as a means
of generating web content for data visualization and navigation. The
PNG and image map content does not require Java applets or special
browser plugins.
- Genome poster generation - CGView can generate poster-sized images of
circular genomes in rasterized image formats or in Scalable Vector
Graphics format.
- Sequence analysis visualization - CGView can be used to display the
output of sequence analysis programs in a circular context.
CGView features:
- Images can be generated in PNG, JPG, or SVG format. See the
CGView gallery.
- Static or interactive maps can be generated. The interactive maps
make use of standard PNG images and HTML image maps. Scalable Vector
Graphics output is included in the interactive maps (see example).
- The XML input allows complete control over the appearance of the map.
- Tab delimited input files and NCBI ptt files can be used as an
alternative to the XML format.
- The CGView API can be used to incorporate CGView into Java
applications.
- The CGView applet can be used to incorporate zoomable maps into web
pages (see example).
- The CGView Server can be used to generate maps online.
|
|
changeo
Repertoire clonal assignment toolkit (Python 3)
|
Versions of package changeo |
Release | Version | Architectures |
sid | 1.3.0-2 | all |
buster | 0.4.5-1 | all |
bullseye | 1.0.2-1 | all |
bookworm | 1.3.0-1 | all |
trixie | 1.3.0-2 | all |
|
License: DFSG free
|
Change-O is a collection of tools for processing the output of V(D)J
alignment tools, assigning clonal clusters to immunoglobulin (Ig)
sequences, and reconstructing germline sequences.
Dramatic improvements in high-throughput sequencing technologies now
enable large-scale characterization of Ig repertoires, defined as the
collection of trans-membrane antigen-receptor proteins located on the
surface of B cells and T cells. Change-O is a suite of utilities to
facilitate advanced analysis of Ig and TCR sequences following germline
segment assignment. Change-O handles output from IMGT/HighV-QUEST
and IgBLAST, and provides a wide variety of clustering methods for
assigning clonal groups to Ig sequences. Record sorting, grouping,
and various database manipulation operations are also included.
This package installs the library for Python 3.
Please cite:
Namita T. Gupta, Jason A. Vander Heiden, Mohamed Uduman, Daniel Gadala-Maria, Gur Yaari and Steven H. Kleinstein:
Link
to publication
(PubMed,eprint)
Bioinformatics
31(20):3356-3358
(2015)
|
|
chimeraslayer
detects likely chimeras in PCR amplified DNA
|
Versions of package chimeraslayer |
Release | Version | Architectures |
bookworm | 20101212+dfsg1-5 | all |
bullseye | 20101212+dfsg1-4 | all |
buster | 20101212+dfsg1-2 | all |
trixie | 20101212+dfsg1-6 | all |
stretch | 20101212+dfsg1-1 | all |
jessie | 20101212+dfsg-1 | all |
sid | 20101212+dfsg1-6 | all |
Debtags of package chimeraslayer: |
biology | format:aln, nuceleic-acids |
field | biology, biology:molecular |
role | program |
scope | utility |
|
License: DFSG free
|
ChimeraSlayer is a chimeric sequence detection utility, compatible with
near-full length Sanger sequences and shorter 454-FLX sequences (~500bp).
Chimera Slayer involves the following series of steps that operate to
flag chimeric 16S rRNA sequences:
1. the ends of a query sequence are searched against an included
database of reference chimera-free 16S sequences to identify potential
parents of a chimera
2. candidate parents of a chimera are selected as those that form a
branched best scoring alignment to the NAST-formatted query sequence
3. the NAST alignment of the query sequence is improved in a
‘chimera-aware’ profile-based NAST realignment to the selected
reference parent sequences
4. an evolutionary framework is used to flag query sequences found to
exhibit greater sequence homology to an in silico chimera formed
between any two of the selected reference parent sequences.
To run Chimera Slayer, you need NAST-formatted sequences generated by
the nast-ier utility.
ChimeraSlayer is part of the microbiomeutil suite.
Please cite:
Brian J. Haas, Dirk Gevers, Ashlee M. Earl, Mike Feldgarden, Doyle V. Ward, Georgia Giannoukos, Dawn Ciulla, Diana Tabbaa, Sarah K. Highlander, Erica Sodergren, Barbara Methé, Todd Z. DeSantis, The Human Microbiome Consortium, Joseph F. Petrosino, Rob Knight and Bruce W. Birren:
Chimeric 16S rRNA sequence formation and detection in Sanger and 454-pyrosequenced PCR amplicons.
(PubMed,eprint)
Genome Research
21(3):494-504
(2011)
|
|
chromhmm
Chromatin state discovery and characterization
|
Versions of package chromhmm |
Release | Version | Architectures |
sid | 1.25+dfsg-1 | all |
bullseye | 1.21+dfsg-1 | all |
buster | 1.18+dfsg-1 | all |
bookworm | 1.24+dfsg-1 | all |
trixie | 1.25+dfsg-1 | all |
|
License: DFSG free
|
ChromHMM is software for learning and characterizing chromatin states.
ChromHMM can integrate multiple chromatin datasets such as ChIP-seq data of
various histone modifications to discover de novo the major re-occuring
combinatorial and spatial patterns of marks. ChromHMM is based on a
multivariate Hidden Markov Model that explicitly models the presence or
absence of each chromatin mark. The resulting model can then be used to
systematically annotate a genome in one or more cell types. By automatically
computing state enrichments for large-scale functional and annotation datasets
ChromHMM facilitates the biological characterization of each state. ChromHMM
also produces files with genome-wide maps of chromatin state annotations that
can be directly visualized in a genome browser.
|
|
chromimpute
Large-scale systematic epigenome imputation
|
Versions of package chromimpute |
Release | Version | Architectures |
bullseye | 1.0.3+dfsg-2 | all |
buster | 1.0.3+dfsg-1 | all |
trixie | 1.0.3+dfsg-5 | all |
bookworm | 1.0.3+dfsg-4 | all |
sid | 1.0.3+dfsg-5 | all |
|
License: DFSG free
|
ChromImpute takes an existing compendium of epigenomic data and uses it to
predict signal tracks for mark-sample combinations not experimentally mapped
or to generate a potentially more robust version of data sets that have been
mapped experimentally. ChromImpute bases its predictions on features from
signal tracks of other marks that have been mapped in the target sample and
the target mark in other samples with these features combined using an
ensemble of regression trees.
|
|
cif-tools
Suite of tools to manipulate, validate and query mmCIF files
|
Versions of package cif-tools |
Release | Version | Architectures |
bullseye | 1.0.0-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.0.7-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 1.0.12-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 1.0.12-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
This package contains a suite of tools for the manipulation of mmCIF files.
The structure of macro molecules is nowadays recorded in mmCIF files. Until
recently however the ancient PDB file format was used by many programs but
that format has since long been deprecated.
This package provides two tools, pdb2cif and cif2pdb, that can convert files
from one format into the other, provided that data fits of course.
Other tools are cif-validate, cif-grep, cif-diff, cif-merge and mmCQL. The
latter can be used to manipulate an mmCIF file as if it were a SQL like
database using SELECT, UPDATE, INSERT and DELETE commands.
This package depends on libcifpp.
|
|
circlator
circularize genome assemblies
|
Versions of package circlator |
Release | Version | Architectures |
stretch | 1.4.1-1 | all |
buster | 1.5.5-3 | amd64 |
bullseye | 1.5.6-5 | amd64 |
sid | 1.5.6-11 | all |
bookworm | 1.5.6-7 | amd64 |
|
License: DFSG free
|
Circlator is a tool to automate assembly circularization for bacterial and
small eukaryotic genomes and produce accurate linear representations of
circular sequences.
|
|
circos
plotter for visualizing data
|
Versions of package circos |
Release | Version | Architectures |
bullseye | 0.69.9+dfsg-2 | all |
stretch | 0.69.4+dfsg-1 | all |
buster | 0.69.6+dfsg-2 | all |
jessie | 0.66-1 | all |
trixie | 0.69.9+dfsg-2 | all |
sid | 0.69.9+dfsg-2 | all |
bookworm | 0.69.9+dfsg-2 | all |
Debtags of package circos: |
field | biology:bioinformatics |
role | program |
use | viewing |
|
License: DFSG free
|
Circos visualizes data in a circular layout — ideal for exploring
relationships between objects or positions, and creating highly
informative publication-quality graphics.
This package provides the Circos plotting engine, which is command-line
driven (like gnuplot) and fully scriptable.
|
|
clearcut
extremely efficient phylogenetic tree reconstruction
|
Versions of package clearcut |
Release | Version | Architectures |
jessie | 1.0.9-1 | amd64,armel,armhf,i386 |
sid | 1.0.9+git20211013.b799afe-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 1.0.9+git20211013.b799afe-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.0.9+git20211013.b799afe-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.0.9-6 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 1.0.9-3 | amd64,arm64,armhf,i386 |
stretch | 1.0.9-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
Clearcut is the reference implementation for the Relaxed Neighbor Joining (RNJ)
algorithm by J. Evans, L. Sheneman, and J. Foster from the Initiative
for Bioinformatics and Evolutionary Studies (IBEST) at the University of
Idaho.
|
|
clonalframe
inference of bacterial microevolution using multilocus sequence data
|
Versions of package clonalframe |
Release | Version | Architectures |
jessie | 1.2-3 | amd64,armel,armhf,i386 |
stretch | 1.2-5 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 1.2-9 | amd64,arm64,armhf,i386 |
bullseye | 1.2-10 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.2-11 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.2-11 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 1.2-11 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
Debtags of package clonalframe: |
role | program |
|
License: DFSG free
|
ClonalFrame identifies the clonal relationships between the members of
a sample, while also estimating the chromosomal position of homologous
recombination events that have disrupted the clonal inheritance.
ClonalFrame can be applied to any kind of sequence data, from a single
fragment of DNA to whole genomes. It is well suited for the analysis of
MLST data, where 7 gene fragments have been sequenced, but becomes
progressively more powerful as the sequenced regions increase in length
and number up to whole genomes. However, it requires the sequences to be
aligned. If you have genomic data that is not aligned, it is recommend to
use Mauve which produces alignment of whole bacterial genomes in
exactly the format required for analysis with ClonalFrame.
|
|
clonalframeml
Efficient Inference of Recombination in Whole Bacterial Genomes
|
Versions of package clonalframeml |
Release | Version | Architectures |
bullseye | 1.12-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 1.11-3 | amd64,arm64,armhf,i386 |
sid | 1.13-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 1.13-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.12-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
ClonalFrameML is a software package that performs efficient inference of
recombination in bacterial genomes. ClonalFrameML was created by Xavier
Didelot and Daniel Wilson. ClonalFrameML can be applied to any type of
aligned sequence data, but is especially aimed at analysis of whole
genome sequences. It is able to compare hundreds of whole genomes in a
matter of hours on a standard Desktop computer. There are three main
outputs from a run of ClonalFrameML: a phylogeny with branch lengths
corrected to account for recombination, an estimation of the key
parameters of the recombination process, and a genomic map of where
recombination took place for each branch of the phylogeny.
ClonalFrameML is a maximum likelihood implementation of the Bayesian
software ClonalFrame which was previously described by Didelot and
Falush (2007). The recombination model underpinning ClonalFrameML is
exactly the same as for ClonalFrame, but this new implementation is a
lot faster, is able to deal with much larger genomic dataset, and does
not suffer from MCMC convergence issues
|
|
clonalorigin
inference of homologous recombination in bacteria using whole genome sequences
|
Versions of package clonalorigin |
Release | Version | Architectures |
sid | 1.0-8 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 1.0-8 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 1.0-3 | amd64,arm64,armhf,i386 |
bookworm | 1.0-6 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.0-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
Bacteria, unlike us, can reproduce on their own. They do however have
mechanisms that transfer DNA between organisms, a process more formally
known as recombination. The mechanisms by which recombination takes
place have been studied extensively in the laboratory but much remains
to be understood concerning how, when and where recombination takes
place within natural populations of bacteria and how it helps them to
adapt to new environments. ClonalOrigin performs a comparative analysis
of the sequences of a sample of bacterial genomes in order to
reconstruct the recombination events that have taken place in their
ancestry.
|
|
clustalo
General-purpose multiple sequence alignment program for proteins
|
Versions of package clustalo |
Release | Version | Architectures |
bullseye | 1.2.4-7 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 1.2.4-2 | amd64,arm64,armhf,i386 |
stretch | 1.2.4-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.2.4-7 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.2.4-8 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 1.2.4-8 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 1.2.1-1 | amd64,armel,armhf,i386 |
|
License: DFSG free
|
Clustal Omega is a general-purpose multiple sequence alignment (MSA)
program, primarily for amino-acid sequences. It produces high quality MSAs
and is capable of handling data sets of hundreds of thousands of sequences
in reasonable time, using multiple processors where available.
Please cite:
Fabian Sievers, Andreas Wilm, David Dineen, Toby J Gibson, Kevin Karplus, Weizhong Li, Rodrigo Lopez, Hamish McWilliam, Michael Remmert, Johannes Söding, Julie D Thompson and Desmond G Higgins:
Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega.
(PubMed,eprint)
Molecular Systems Biology
7:539
(2011)
Topics: Sequence analysis
|
|
clustalw
global multiple nucleotide or peptide sequence alignment
|
Versions of package clustalw |
Release | Version | Architectures |
sid | 2.1+lgpl-7 | amd64,arm64,mips64el,ppc64el,riscv64,s390x |
jessie | 2.1+lgpl-4 | amd64,armel,armhf,i386 |
trixie | 2.1+lgpl-7 | amd64,arm64,mips64el,ppc64el,riscv64,s390x |
bookworm | 2.1+lgpl-7 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 2.1+lgpl-5 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 2.1+lgpl-6 | amd64,arm64,armhf,i386 |
bullseye | 2.1+lgpl-7 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package clustalw: |
biology | format:aln, nuceleic-acids, peptidic |
field | biology, biology:bioinformatics |
interface | commandline, text-mode |
role | program |
scope | utility |
use | comparing |
works-with-format | plaintext |
|
License: DFSG free
|
This program performs an alignment of multiple nucleotide or amino acid
sequences. It recognizes the format of input sequences and whether the
sequences are nucleic acid (DNA/RNA) or amino acid (proteins). The output
format may be selected from in various formats for multiple alignments such as
Phylip or FASTA. Clustal W is very well accepted.
The output of Clustal W can be edited manually but preferably with an
alignment editor like SeaView or within its companion Clustal X. When building
a model from your alignment, this can be applied for improved database
searches. The Debian package hmmer creates such in form of an HMM.
The package is enhanced by the following packages:
clustalw-mpi
Please cite:
M. A. Larkin, G. Blackshields, N. P. Brown, R. Chenna, P. A. McGettigan, H. McWilliam, F. Valentin, I.M. Wallace, A. Wilm, R. Lopez, J. D. Thompson, T. J. Gibson and D. G. Higgins:
Clustal W and Clustal X version 2.0.
(PubMed,eprint)
Bioinformatics
23(21):2947-2948
(2007)
Topics: Sequence analysis
|
|
clustalx
Multiple alignment of nucleic acid and protein sequences (graphical interface)
|
Versions of package clustalx |
Release | Version | Architectures |
sid | 2.1+lgpl-9 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 2.1+lgpl-9 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 2.1+lgpl-9 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 2.1+lgpl-5 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bullseye | 2.1+lgpl-9 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 2.1+lgpl-8 | amd64,arm64,armhf,i386 |
jessie | 2.1+lgpl-3 | amd64,armel,armhf,i386 |
Debtags of package clustalx: |
biology | format:aln, nuceleic-acids, peptidic |
field | biology, biology:bioinformatics |
interface | x11 |
role | program |
scope | utility |
uitoolkit | motif |
use | analysing, comparing, viewing |
works-with-format | plaintext |
x11 | application |
|
License: DFSG free
|
This package offers a GUI interface for the Clustal multiple sequence
alignment program. It provides an integrated environment for performing
multiple sequence- and profile-alignments to analyse the results.
The sequence alignment is displayed in a window on the screen.
A versatile coloring scheme has been incorporated to highlight conserved
features in the alignment. For professional presentations, one should
use the texshade LaTeX package or boxshade.
The pull-down menus at the top of the window allow you to select all the
options required for traditional multiple sequence and profile alignment.
You can cut-and-paste sequences to change the order of the alignment; you can
select a subset of sequences to be aligned; you can select a sub-range of the
alignment to be realigned and inserted back into the original alignment.
An alignment quality analysis can be performed and low-scoring segments or
exceptional residues can be highlighted.
Please cite:
M.A. Larkin, G. Blackshields, N.P. Brown, R. Chenna, P.A. McGettigan, H. McWilliam, F. Valentin, I.M. Wallace, A. Wilm, R. Lopez, J.D. Thompson, T.J. Gibson and D.G. Higgins:
Clustal W and Clustal X version 2.0.
(PubMed,eprint)
Bioinformatics
23(21):2947-2948
(2007)
Topics: Sequence analysis
|
|
cnvkit
Copy number variant detection from targeted DNA sequencing
|
Versions of package cnvkit |
Release | Version | Architectures |
buster | 0.9.5-3 | amd64 |
experimental | 0.9.10-3~0exp0 | all |
sid | 0.9.10-2 | all |
trixie | 0.9.10-2 | all |
bookworm | 0.9.9-2 | amd64,arm64,ppc64el |
bullseye | 0.9.8-1 | amd64,arm64,ppc64el |
upstream | 0.9.11 |
|
License: DFSG free
|
A command-line toolkit and Python library for detecting copy number variants
and alterations genome-wide from targeted DNA sequencing. It is designed for
use with hybrid capture, including both whole-exome and custom target panels,
and short-read sequencing platforms such as Illumina and Ion Torrent.
|
|
codonw
Correspondence Analysis of Codon Usage
|
Versions of package codonw |
Release | Version | Architectures |
buster | 1.4.4-4 | amd64,arm64,armhf,i386 |
trixie | 1.4.4-8 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 1.4.4-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.4.4-6 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.4.4-7 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 1.4.4-8 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
CodonW is a package for codon usage analysis. It was designed to simplify
Multivariate Analysis (MVA) of codon usage. The MVA method employed in
CodonW is correspondence analysis (COA) (the most popular MVA method for
codon usage analysis). CodonW can generate a COA for codon usage, relative
synonymous codon usage or amino acid usage. Additional analyses of codon
usage include investigation of optimal codons, codon and dinucleotide bias,
and/or base composition. CodonW analyses sequences encoded by genetic codes
other than the universal code.
Topics: Sequence composition, complexity and repeats
|
|
comet-ms
Tandem mass spectrometry (MS/MS) search engine
|
Versions of package comet-ms |
Release | Version | Architectures |
bullseye | 2019015+cleaned1-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 2014022-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bookworm | 2019015+cleaned1-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 2019015+cleaned1-4.1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 2018012-1 | amd64,arm64,armhf,i386 |
sid | 2019015+cleaned1-4.1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
upstream | 2021010 |
|
License: DFSG free
|
Comet is an open source tandem mass spectrometry (MS/MS) sequence
database search engine. It identifies peptides by searching MS/MS
spectra against sequences present in protein sequence databases.
This package ships a binary that does MS/MS database
searches. Supported input formats are mzXML, mzML, and ms2
files. Supported output formats are .out, SQT, and pepXML.
|
|
concavity
predictor of protein ligand binding sites from structure and conservation
|
Versions of package concavity |
Release | Version | Architectures |
jessie | 0.1-2 | amd64,armel,armhf,i386 |
sid | 0.1+dfsg.1-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 0.1+dfsg.1-5 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 0.1+dfsg.1-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.1+dfsg.1-5 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 0.1+dfsg.1-4 | amd64,arm64,armhf,i386 |
stretch | 0.1+dfsg.1-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
ConCavity predicts protein ligand binding sites by combining evolutionary
sequence conservation and 3D structure.
ConCavity takes as input a PDB format protein structure and optionally
files that characterize the evolutionary sequence conservation of the chains
in the structure file.
The following result files are produced by default:
- Residue ligand binding predictions for each chain (*.scores).
- Residue ligand binding predictions in a PDB format file (residue
scores placed in the temp. factor field, *_residue.pdb).
- Pocket prediction locations in a DX format file (*.dx).
- PyMOL script to visualize the predictions (*.pml).
|
|
conservation-code
protein sequence conservation scoring tool
|
Versions of package conservation-code |
Release | Version | Architectures |
stretch | 20110309.0-5 | all |
bookworm | 20110309.0-8 | all |
sid | 20110309.0-8 | all |
jessie | 20110309.0-3 | all |
bullseye | 20110309.0-8 | all |
buster | 20110309.0-7 | all |
trixie | 20110309.0-8 | all |
|
License: DFSG free
|
This package provides score_conservation(1), a tool to score protein sequence
conservation.
The following conservation scoring methods are implemented:
- sum of pairs
- weighted sum of pairs
- Shannon entropy
- Shannon entropy with property groupings (Mirny and Shakhnovich 1995,
Valdar and Thornton 2001)
- relative entropy with property groupings (Williamson 1995)
- von Neumann entropy (Caffrey et al 2004)
- relative entropy (Samudrala and Wang 2006)
- Jensen-Shannon divergence (Capra and Singh 2007)
A window-based extension that incorporates the estimated conservation of
sequentially adjacent residues into the score for each column is also given.
This window approach can be applied to any of the conservation scoring
methods.
The program accepts alignments in the CLUSTAL and FASTA formats.
The sequence-specific output can be used as the conservation input for
concavity.
Conservation is highly predictive in identifying catalytic sites and
residues near bound ligands.
|
|
coot
model building program for macromolecular crystallography
|
Versions of package coot |
Release | Version | Architectures |
sid | 1.1.09+dfsg-2 | amd64,arm64,armhf,ppc64el |
upstream | 1.1.10 |
|
License: DFSG free
|
This is a program for constructing atomic models of macromolecules
from x-ray diffraction data. Coot displays electron density maps and
molecular models and allows model manipulations such as idealization,
refinement, manual rotation/translation, rigid-body fitting, ligand
search, solvation, mutations, rotamers. Validation tools such as
Ramachandran and geometry plots are available to the user. This
package provides a Coot build with embedded Python support.
|
|
covtobed
convert the coverage track from a BAM file into a BED file
|
Versions of package covtobed |
Release | Version | Architectures |
bullseye | 1.2.0+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.3.5+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.3.5+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 1.3.5+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
Reads one (or more) alignment files (sorted BAM) and prints a BED with
the coverage. It will join consecutive bases with the same coverage, and
can be used to only print a BED file with the regions having a specific
coverage range.
|
|
crac
integrated RNA-Seq read analysis
|
Versions of package crac |
Release | Version | Architectures |
buster | 2.5.0+dfsg-3 | amd64,arm64 |
sid | 2.5.2+dfsg-6 | amd64,arm64,mips64el,ppc64el,riscv64 |
trixie | 2.5.2+dfsg-6 | amd64,arm64,mips64el,ppc64el,riscv64 |
bookworm | 2.5.2+dfsg-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el |
stretch | 2.5.0+dfsg-1 | amd64 |
bullseye | 2.5.2+dfsg-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el |
|
License: DFSG free
|
CRAC is a tool to analyze High Throughput Sequencing (HTS) data in
comparison to a reference genome. It is intended for transcriptomic
and genomic sequencing reads. More precisely, with transcriptomic
reads as input, it predicts point mutations, indels, splice junction,
and chimeric RNAs (ie, non colinear splice junctions). CRAC can also
output positions and nature of sequence error that it detects in the
reads. CRAC uses a genome index. This index must be computed before
running the read analysis. For this sake, use the command "crac-index"
on your genome files. You can then process the reads using the command
crac. See the man page of CRAC (help file) by typing "man crac". CRAC
requires large amount of main memory on your computer. For processing
against the Human genome, say 50 million reads of 100 nucleotide each,
CRAC requires about 40 gigabytes of main memory. Check whether the
system of your computing server is equipped with sufficient amount of
memory before launching an analysis.
|
|
csb
Computational Structural Biology Toolbox (CSB)
|
Versions of package csb |
Release | Version | Architectures |
sid | 1.2.5+dfsg-10 | all |
buster | 1.2.5+dfsg-3 | all |
bookworm | 1.2.5+dfsg-8 | all |
trixie | 1.2.5+dfsg-10 | all |
bullseye | 1.2.5+dfsg-5 | all |
|
License: DFSG free
|
Computational Structural Biology Toolbox (CSB) is a Python class
library for reading, storing and analyzing biomolecular structures
in a variety of formats with rich support for statistical analyses.
CSB is designed for reusability and extensibility and comes with a clean,
well-documented API following good object-oriented engineering practice.
This package contains some user executable tools.
|
|
ctffind
fast and accurate defocus estimation from electron micrographs
|
Versions of package ctffind |
Release | Version | Architectures |
trixie | 4.1.14-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 4.1.14-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
upstream | 5.0.2 |
|
License: DFSG free
|
This is a widely-used program for the estimation of objective lens defocus
parameters from transmission electron micrographs. Defocus parameters are
estimated by fitting a model of the microscope's contrast transfer function
(CTF) to an image's amplitude spectrum.
|
|
cutadapt
Clean biological sequences from high-throughput sequencing reads
|
Versions of package cutadapt |
Release | Version | Architectures |
buster | 1.18-1 | all |
trixie | 4.7-2 | all |
bullseye | 3.2-2 | all |
bookworm | 4.2-1 | all |
stretch | 1.12-2 | all |
sid | 4.7-2 | all |
upstream | 4.9 |
|
License: DFSG free
|
Cutadapt helps with biological sequence clean tasks by finding the adapter
or primer sequences in an error-tolerant way.
It can also modify and filter reads in various ways.
Adapter sequences can contain IUPAC wildcard characters.
Also, paired-end reads and even colorspace data is supported.
If you want, you can also just demultiplex your input data, without removing
adapter sequences at all.
This package contains the user interface.
The package is enhanced by the following packages:
multiqc
|
|
cutesv
comprehensive discovery of structural variations of genomic sequences
|
Versions of package cutesv |
Release | Version | Architectures |
trixie | 2.1.0-2 | all |
sid | 2.1.1-1 | all |
bookworm | 2.0.2-1 | all |
|
License: DFSG free
|
Long-read sequencing enables the comprehensive discovery of structural
variations (SVs). However, it is still non-trivial to achieve high
sensitivity and performance simultaneously due to the complex SV
characteristics implied by noisy long reads.
cuteSV is a sensitive, fast and scalable long-read-based SV detection
approach. cuteSV uses tailored methods to collect the signatures of
various types of SVs and employs a clustering-and-refinement method to
analyze the signatures to implement sensitive SV detection. Benchmarks
on real Pacific Biosciences (PacBio) and Oxford Nanopore Technology
(ONT) datasets demonstrate that cuteSV has better yields and scalability
than state-of-the-art tools.
|
|
daligner
local alignment discovery between long nucleotide sequencing reads
|
Versions of package daligner |
Release | Version | Architectures |
trixie | 1.0+git20240119.335105d-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 1.0+git20240119.335105d-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 1.0+git20180524.fd21879-1 | amd64,arm64,armhf,i386 |
stretch | 1.0+20161119-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.0+git20200727.ed40ce5-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.0+git20221215.bd26967-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
These tools permit one to find all significant local alignments between
reads encoded in a Dazzler database. The assumption is that the reads are
from a Pacific Biosciences RS II long read sequencer. That is, the reads
are long and noisy, up to 15% on average.
|
|
damapper
long read to reference genome mapping tool
|
Versions of package damapper |
Release | Version | Architectures |
trixie | 0.0+git20240314.b025cf9-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 0.0+git20240314.b025cf9-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.0+git20210330.ab45103-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 0.0+git20200322.b2c9d7f-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
Recognised as the Damapper Library, this is a long read to reference
genome mapping command line tool.
For a given reference database 'X' and read block 'Y', damapper produces
the single file 'Y.X.las'. Each output file is sorted in order of the
A-reads, and if a match is a chain of local alignments, then the LA's in
the chain occur in increasing order of A-coordinates.
HPC.damapper writes a UNIX shell script to the standard output that maps
every read in blocks to of database to a
reference sequence [. If is missing then only the single
block is mapped, and if is also missing then all blocks
of the database are mapped.]
This package contains the damapper and HPC.damapper binaries.
|
|
datamash
statistics tool for command-line interface
|
Versions of package datamash |
Release | Version | Architectures |
jessie | 1.0.6-2 | amd64,armel,armhf,i386 |
buster | 1.4-1 | amd64,arm64,armhf,i386 |
bookworm | 1.7-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.8-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 1.7-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 1.8-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 1.0.7-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
GNU Datamash is a command-line program which performs basic numeric,
textual and statistical operations on input textual data files. It is
designed to be portable and reliable, and aid researchers to easily
automate analysis pipelines, without writing code or even short scripts.
|
|
dawg
simulate the evolution of recombinant DNA sequences
|
Versions of package dawg |
Release | Version | Architectures |
bookworm | 1.2-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 1.2-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 1.2-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 1.2-2 | amd64,arm64,armhf,i386 |
bullseye | 1.2-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.2-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
DNA Assembly with Gaps (Dawg) is an application designed to simulate the
evolution of recombinant DNA sequences in continuous time based on the robust
general time reversible model with gamma and invariant rate heterogeneity and
a novel length-dependent model of gap formation. The application accepts
phylogenies in Newick format and can return the sequence of any node,
allowing for the exact evolutionary history to be recorded at the discretion
of users. Dawg records the gap history of every lineage to produce the true
alignment in the output. Many options are available to allow users to
customize their simulations and results.
|
|
dazzdb
manage nucleotide sequencing read data
|
Versions of package dazzdb |
Release | Version | Architectures |
buster | 1.0+git20180908.0bd5e07-1 | amd64,arm64,armhf,i386 |
sid | 1.0+git20240115.be65e59-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.0+git20221215.aad3a46-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.0+git20201103.8d98c37-1+deb11u1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.0+git20240115.be65e59-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 1.0+20161112-2 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
To facilitate the multiple phases of the dazzler assembler, all the read data
is organized into what is effectively a database of the
reads and their meta-information. The design goals for this data base
are as follows:
- The database stores the source Pacbio read information in such a
way that it can re-create the original input data, thus permitting
a user to remove the (effectively redundant) source files. This
avoids duplicating the same data, once in the source file and once
in the database.
- The data base can be built up incrementally, that is new sequence
data can be added to the data base over time.
- The data base flexibly allows one to store any meta-data desired for
reads. This is accomplished with the concept of tracks that
implementors can add as they need them.
- The data is held in a compressed form equivalent to the .dexta and
.dexqv files of the data extraction module. Both the .fasta and
.quiva information for each read is held in the data base and can be
recreated from it. The .quiva information can be added separately and
later on if desired.
- To facilitate job parallel, cluster operation of the phases of the
assembler, the database has a concept of a current partitioning in
which all the reads that are over a given length and optionally
unique to a well, are divided up into blocks containing roughly a
given number of bases, except possibly the last block which may have
a short count. Often programs can be run on blocks or pairs of blocks
and each such job is reasonably well balanced as the blocks are all
the same size. One must be careful about changing the partition
during an assembly as doing so can void the structural validity of
any interim block-based results.
|
|
deblur
deconvolution for Illumina amplicon sequencing
|
Versions of package deblur |
Release | Version | Architectures |
sid | 1.1.1-2 | all |
trixie | 1.1.1-2 | all |
|
License: DFSG free
|
Deblur is a greedy deconvolution algorithm for amplicon sequencing
based on Illumina Miseq/Hiseq error profiles. The authors recommend
using Deblur via the QIIME2 plugin q2-deblur. Examples of its use can be
found within the plugin itself. However, Deblur itself does not depend
on QIIME2.
The input to Deblur workflow is a directory of FASTA or FASTQ files
(1 per sample) or a single demultiplexed FASTA or FASTQ file. These
files can be gzip'd. The output directory will contain three BIOM
tables in which the observation IDs are the Deblurred sequences. The
outputs are contingent on the reference databases used and a more
focused discussion on them is in the subsequent README section titled
"Positive and Negative Filtering." The output files are as follows:
-
reference-hit.biom : contains only Deblurred reads matching the
positive filtering database. By default, a reference composed of 16S
sequences is used, and this resulting table will contain only those
reads which recruit at a coarse level to it will be retained. Reads
are also filtered against the negative reference, which by default
will remove any read which appears to be PhiX or adapter.
-
reference-hit.seqs.fa : a fasta file containing all the sequences
in reference-hit.biom
-
reference-non-hit.biom : contains only Deblurred reads that did not
align to the positive filtering database. Negative filtering is also
appied to this table, so by default, PhiX and adapter are removed.
-
reference-non-hit.seqs.fa : a fasta file containing all the
sequences in reference-non-hit.biom
-
all.biom : contains all Deblurred reads. This file represents the
union of the "reference-hit.biom" and "reference-non-hit.biom" tables.
-
all.seqs.fa : a fasta file containing all the sequences in all.biom
Deblur uses two types of filtering on the sequences:
-
Negative mode - removes known artifact sequences (i.e. sequences
aligning to PhiX or Adapter with >=95% identity and coverage).
-
Positive mode - keeps only sequences similar to a reference database
(by default known 16S sequences). SortMeRNA is used, and any sequence
with an e-value <= 10 is retained. Deblur also outputs a BIOM table
without this positive filtering step (named all.biom).
The FASTA files for both of these filtering steps can be supplied via
the --neg-ref-fp and --pos-ref-fp options. By default, the negative
database is composed of PhiX and adapter sequence and the positive
database of known 16S sequences.
Deblur uses negative mode filtering to remove known artifact (i.e. PhiX
and Adapter sequences) prior to denoising. The output of Deblur contains
three files: all.biom, which includes all sOTUs, reference-hit.biom,
which contains the output of positive filtering of the sOTUs (default
only sOTUs similar to 16S sequences), and reference-non-hit.biom,
which contains only sOTUs failing the positive filtering (default only
non-16S sOTUs).
|
|
deepnano
alternative basecaller for MinION reads of genomic sequences
|
Versions of package deepnano |
Release | Version | Architectures |
bullseye | 0.0+git20170813.e8a621e-3.1 | amd64,arm64,armhf,i386,ppc64el,s390x |
buster | 0.0+git20170813.e8a621e-3 | amd64,arm64,i386 |
|
License: DFSG free
|
DeepNano is alternative basecaller for Oxford Nanopore MinION reads
based on deep recurrent neural networks.
Currently it works with SQK-MAP-006 and SQK-MAP-005 chemistry and as a
postprocessor for Metrichor.
|
|
delly
Structural variant discovery by read analysis
|
Versions of package delly |
Release | Version | Architectures |
bookworm | 1.1.6-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 1.1.8-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 1.1.8-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 0.8.1-2 | amd64,arm64,armhf |
bullseye | 0.8.7-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
upstream | 1.3.1 |
|
License: DFSG free
|
Delly performs Structural variant discovery by integrated paired-end and
split-read analysis. It discovers, genotypes and visualizes deletions,
tandem duplications, inversions and translocations at single-nucleotide
resolution in short-read massively parallel sequencing data. It uses
paired-ends, split-reads and read-depth to sensitively and accurately
delineate genomic rearrangements throughout the genome.
|
|
density-fitness
Calculates per-residue electron density scores
|
Versions of package density-fitness |
Release | Version | Architectures |
sid | 1.0.12-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 1.0.0-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.0.12-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.0.8-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
The program density-fitness calculates electron density metrics,
for main- (includes Cβ atom) and side-chain atoms of individual residues.
For this calculation, the program uses the structure model in either PDB
or mmCIF format and the electron density from the 2mFo-DFc and mFo-DFc maps.
If these maps are not readily available, the MTZ file and model can be used
to calculate maps clipper. Density-fitness support both X-ray and electron
diffraction data.
This program is essentially a reimplementation of edstats, a program
available from the CCP4 suite. However, the output now contains only the
RSR, SRSR and RSCC fields as in edstats with the addition of EDIAm
and OPIA and no longer requires pre-calculated map coefficients.
|
|
dextractor
(d)extractor and compression command library
|
|
License: DFSG free
|
Dextractor commands allow one to pull exactly and only the
information needed for assembly and reconstruction from the source HDF5
files produced by the PacBio RS II sequencer, or from the source BAM
files produced by the PacBio Sequel sequencer.
For each of the three extracted file types -- fasta, quiva, and
arrow -- the library contains commands to compress the given file
type, and to decompress it, which is a reversible process delivering
the original uncompressed file. The compressed .fasta files, with the
extension .dexta, consume 1/4 byte per base. The compressed .quiva
files, with the extension .dexqv, consume 1.5 bytes per base on
average, and the compressed .arrow files, with the extension .dexar,
consume 1/4 byte per base
For more information, please view the available documentation at
https://github.com/thegenemyers/DEXTRACTOR
|
|
dialign
Segment-based multiple sequence alignment
|
Versions of package dialign |
Release | Version | Architectures |
bookworm | 2.2.1-11 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 2.2.1-13 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 2.2.1-7 | amd64,armel,armhf,i386 |
stretch | 2.2.1-8 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 2.2.1-10 | amd64,arm64,armhf,i386 |
bullseye | 2.2.1-11 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 2.2.1-13 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
Debtags of package dialign: |
biology | format:aln, nuceleic-acids, peptidic |
field | biology, biology:bioinformatics |
interface | commandline |
role | program |
scope | utility |
use | analysing, comparing |
works-with-format | plaintext |
|
License: DFSG free
|
DIALIGN2 is a command line tool to perform multiple alignment of
protein or DNA sequences. It constructs alignments from gapfree pairs
of similar segments of the sequences. This scoring scheme for
alignments is the basic difference between DIALIGN and other global or
local alignment methods. Note that DIALIGN does not employ any kind of
gap penalty.
|
|
dialign-tx
Segment-based multiple sequence alignment
|
Versions of package dialign-tx |
Release | Version | Architectures |
bookworm | 1.0.2-14 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 1.0.2-12 | amd64,arm64,armhf,i386 |
trixie | 1.0.2-15 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 1.0.2-15 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 1.0.2-7 | amd64,armel,armhf,i386 |
stretch | 1.0.2-9 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.0.2-13 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package dialign-tx: |
field | biology, biology:bioinformatics |
role | program |
scope | utility |
use | comparing |
works-with-format | plaintext |
|
License: DFSG free
|
DIALIGN-TX is a command line tool to perform multiple alignment of protein or
DNA sequences. It is a complete reimplementation of the segment-base approach
including several new improvements and heuristics that significantly enhance
the quality of the output alignments compared to DIALIGN 2.2 and DIALIGN-T.
For pairwise alignment, DIALIGN-TX uses a fragment-chaining algorithm that
favours chains of low-scoring local alignments over isolated high-scoring
fragments. For multiple alignment, DIALIGN-TX uses an improved greedy
procedure that is less sensitive to spurious local sequence similarities.
|
|
diamond-aligner
accelerated BLAST compatible local sequence aligner
|
Versions of package diamond-aligner |
Release | Version | Architectures |
sid | 2.1.9-1 | amd64,arm64,ppc64el,riscv64,s390x |
bullseye | 2.0.7-1 | amd64,arm64,ppc64el,s390x |
trixie | 2.1.9-1 | amd64,arm64,ppc64el,riscv64,s390x |
bookworm | 2.1.3-1 | amd64,arm64,ppc64el,s390x |
stretch-backports | 0.9.22+dfsg-2~bpo9+1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 0.9.24+dfsg-1 | amd64 |
upstream | 2.1.10 |
|
License: DFSG free
|
DIAMOND is a sequence aligner for protein and translated DNA searches
and functions as a drop-in replacement for the NCBI BLAST software
tools. It is suitable for protein-protein search as well as DNA-protein
search on short reads and longer sequences including contigs and
assemblies, providing a speedup of BLAST ranging up to x20,000.
|
|
discosnp
discovering Single Nucleotide Polymorphism from raw set(s) of reads
|
Versions of package discosnp |
Release | Version | Architectures |
jessie | 1.2.5-1 | amd64,armel,armhf,i386 |
buster | 2.3.0-2 | amd64,arm64,i386 |
bookworm | 2.6.2-2 | amd64,arm64,mips64el,ppc64el |
trixie | 2.6.2-3 | amd64,arm64,mips64el,ppc64el,riscv64 |
sid | 2.6.2-3 | amd64,arm64,mips64el,ppc64el,riscv64 |
stretch | 1.2.6-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bullseye | 4.4.4-1 | amd64,arm64,i386,mips64el,ppc64el,s390x |
|
License: DFSG free
|
Software discoSnp is designed for discovering Single Nucleotide
Polymorphism (SNP) from raw set(s) of reads obtained with Next Generation
Sequencers (NGS).
Note that number of input read sets is not constrained, it can be one, two,
or more. Note also that no other data as reference genome or annotations
are needed.
The software is composed by two modules. First module, kissnp2, detects SNPs
from read sets. A second module, kissreads, enhance the kissnp2 results by
computing per read set and for each found SNP:
1) its mean read coverage
2) the (phred) quality of reads generating the polymorphism.
This program is superseded by DiscoSnp++.
|
|
disulfinder
cysteines disulfide bonding state and connectivity predictor
|
Versions of package disulfinder |
Release | Version | Architectures |
bullseye | 1.2.11-10 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 1.2.11-6 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.2.11-12 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.2.11-12 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 1.2.11-4 | amd64,armel,armhf,i386 |
sid | 1.2.11-12 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 1.2.11-8 | amd64,arm64,armhf,i386 |
Debtags of package disulfinder: |
role | program |
|
License: DFSG free
|
'disulfinder' is for predicting the disulfide bonding state of cysteines
and their disulfide connectivity starting from sequence alone. Disulfide
bridges play a major role in the stabilization of the folding process for
several proteins. Prediction of disulfide bridges from sequence alone is
therefore useful for the study of structural and functional properties
of specific proteins. In addition, knowledge about the disulfide bonding
state of cysteines may help the experimental structure determination
process and may be useful in other genomic annotation tasks.
'disulfinder' predicts disulfide patterns in two computational stages:
(1) the disulfide bonding state of each cysteine is predicted by a
BRNN-SVM binary classifier; (2) cysteines that are known to participate
in the formation of bridges are paired by a Recursive Neural Network
to obtain a connectivity pattern.
|
|
dnaclust
tool for clustering millions of short DNA sequences
|
Versions of package dnaclust |
Release | Version | Architectures |
bookworm | 3-7 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 3-7 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 3-7 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 3-2 | amd64,armel,armhf,i386 |
trixie | 3-7 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 3-4 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 3-6 | amd64,arm64,armhf,i386 |
|
License: DFSG free
|
dnaclust is a tool for clustering large number of short DNA sequences.
The clusters are created in such a way that the "radius" of each
clusters is no more than the specified threshold.
The input sequences to be clustered should be in Fasta format. The id
of each sequence is based on the first word of the seqeunce in the Fasta
format. The first word is the prefix of the header up to the first
occurrence of white space characters in the header.
|
|
dnarrange
Method to find rearrangements in long DNA reads relative to a genome seq
|
Versions of package dnarrange |
Release | Version | Architectures |
sid | 1.5.3-1 | all |
bookworm | 1.5.3-1 | all |
trixie | 1.5.3-1 | all |
upstream | 1.6.2 |
|
License: DFSG free
|
This package provides utilities to align the reads
to the genome, find rearrangements and draw pictures of rearranged groups
|
|
dotter
detailed comparison of two genomic sequences
|
Versions of package dotter |
Release | Version | Architectures |
buster | 4.44.1+dfsg-3 | amd64,arm64,armhf,i386 |
bullseye | 4.44.1+dfsg-6 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 4.44.1+dfsg-7.1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 4.44.1+dfsg-7 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 4.44.1+dfsg-7.1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
Dotter is a graphical dot-matrix program for detailed comparison of two
sequences.
- Every residue in one sequence is compared to every residue in the
other, and a matrix of scores is calculated.
- One sequence is plotted on the x-axis and the other on the y-axis.
- Noise is filtered out so that alignments appear as diagonal lines.
- Pairwise scores are averaged over a sliding window to make the score
matrix more intelligible.
- The averaged score matrix forms a three-dimensional landscape,
with the two sequences in two dimensions and the height of the
peaks in the third. This landscape is projected onto two
dimensions using a grey-scale image - the darker grey of a peak,
the higher the score is.
- The contrast and threshold of the grey-scale image can be adjusted
interactively, without having to recalculate the score matrix.
- An Alignment Tool is provided to examine the sequence alignment that
the grey-scale image represents.
- Known high-scoring pairs can be loaded from a GFF file and overlaid
onto the plot.
- Gene models can be loaded from GFF and displayed alongside the
relevant axis.
- Compare a sequence against itself to find internal repeats.
- Find overlaps between multiple sequences by making a dot-plot of all
sequences versus themselves.
- Run Dotter in batch mode to create large, time-consuming dot-plots as
a background process.
|
|
drop-seq-tools
|
Versions of package drop-seq-tools |
Release | Version | Architectures |
bullseye | 2.4.0+dfsg-6 | all |
sid | 3.0.2+dfsg-1 | all |
trixie | 3.0.2+dfsg-1 | all |
bookworm | 2.5.2+dfsg-1 | all |
|
License: DFSG free
|
This software provide for core computational analysis of Drop-seq data,
which shows you how to transform raw sequence data into an expression
measurement for each gene in each individual cell.
|
|
dssp
protein secondary structure assignment based on 3D structure
|
Versions of package dssp |
Release | Version | Architectures |
jessie | 2.2.1-2 | amd64,armel,armhf,i386 |
stretch | 2.2.1-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 3.0.0-3 | amd64,arm64,armhf,i386 |
bullseye | 4.0.0-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 4.4.10-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 4.4.10-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 4.2.2-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
DSSP is an application you use to assign the secondary structure of a protein
based on its solved three dimensional (3D) structure.
This version (4.2) of DSSP is a rewrite that writes annotated mmCIF files
by default but can still produce the older dssp format. New is also the
support of PP helices.
|
|
dwgsim
short sequencing read simulator
|
Versions of package dwgsim |
Release | Version | Architectures |
bookworm | 0.1.14-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 0.1.14-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 0.1.11-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
trixie | 0.1.14-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 0.1.12-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 0.1.12-2 | amd64,arm64,armhf |
|
License: DFSG free
|
DWGSIM simulates short sequencing reads from modern sequencing platforms.
DWGSIM generates base error rates using a parametric model, allowing a more
realisic error profile. It was originally developed for use in evaluating
short read aligners.
|
|
e-mem
Efficient computation of Maximal Exact Matches for very large genomes
|
Versions of package e-mem |
Release | Version | Architectures |
sid | 1.0.1-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 1.0.1-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 1.0.1-2 | amd64,arm64,armhf,i386 |
bullseye | 1.0.1-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.0.1-5 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
E-MEM enables efficient computation of Maximal Exact Matches (MEMs) that
does not use full text indexes. The algorithm uses much less space and
is highly amenable to parallelization. It can compute all MEMs of
minimum length 100 between the whole human and mouse genomes on a 12
core machine in 10 min and 2 GB of memory; the required memory can be as
low as 600 MB. It can run efficiently genomes of any size. Extensive
testing and comparison with currently best algorithms is provided.
Mummer has many different scripts where one of the key program is MEM
computation. In all the scripts, the MEM computation program can be
replaced with e-mem with ease for better performance.
|
|
ea-utils
command-line tools for processing biological sequencing data
|
Versions of package ea-utils |
Release | Version | Architectures |
trixie | 1.1.2+dfsg-9 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 1.1.2+dfsg-9 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.1.2+dfsg-9 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.1.2+dfsg-6 | amd64,arm64,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 1.1.2+dfsg-5 | amd64,arm64,armhf,i386 |
stretch | 1.1.2+dfsg-4 | amd64,arm64,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
Ea-utils provides a set of command-line tools for processing biological
sequencing data, barcode demultiplexing, adapter trimming, etc.
Primarily written to support an Illumina based pipeline - but should work with
any FASTQs.
Main Tools are:
-
fastq-mcf
Scans a sequence file for adapters, and, based on a log-scaled threshold,
determines a set of clipping parameters and performs clipping. Also does
skewing detection and quality filtering.
-
fastq-multx
Demultiplexes a fastq. Capable of auto-determining barcode id's based on a
master set fields. Keeps multiple reads in-sync during demultiplexing. Can
verify that the reads are in-sync as well, and fail if they're not.
-
fastq-join
Similar to audy's stitch program, but in C, more efficient and supports some
automatic benchmarking and tuning. It uses the same "squared distance for
anchored alignment" as other tools.
-
varcall
Takes a pileup and calculates variants in a more easily parameterized manner
than some other tools.
|
|
ecopcr
estimate PCR barcode primers quality
|
Versions of package ecopcr |
Release | Version | Architectures |
stretch | 0.5.0+dfsg-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
trixie | 1.0.1+dfsg-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 1.0.1+dfsg-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 1.0.1+dfsg-1 | amd64,arm64,armhf,i386 |
bookworm | 1.0.1+dfsg-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.0.1+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
DNA barcoding is a tool for characterizing the species origin using a
short sequence from a standard position and agreed upon position in the
genome. To be used as a DNA barcode, a genome locus should vary among
individuals of the same species only to a minor degree and it should
vary among species very quickly. From a practical point of view, a
barcode locus should be flanked by two conserved regions to design PCR
primers. Several manually discovered barcode loci like COI, rbcL, 18S,
16S and 23S rDNA, or trnH-ps are routinely used today, but no objective
function has been described to measure their quality in terms of
universality (barcode coverage, Bc ) or in terms of taxonomical
discrimination capacity (barcode specificity, Bs ).
ecoPCR is an electronic PCR software developed by LECA and
Helix-Project. It helps to estimate Barcode primers quality. In
conjunction with OBITools you can postprocess ecoPCR output to compute
barcode coverage and barcode specificity. New barcode primers can be
developed using the ecoPrimers software
|
|
edtsurf
triangulated mesh surfaces for protein structures
|
Versions of package edtsurf |
Release | Version | Architectures |
stretch | 0.2009-4 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
sid | 0.2009-10 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 0.2009-10 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.2009-10 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 0.2009-10 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 0.2009-6 | amd64,arm64,armhf,i386 |
jessie | 0.2009-3 | amd64,armel,armhf,i386 |
|
License: DFSG free
|
EDTSurf is a open source program to construct triangulated surfaces
for macromolecules. It generates three major macromolecular surfaces:
van der Waals surface, solvent-accessible surface and molecular surface
(solvent-excluded surface). EDTsurf also identifies cavities which are
inside of macromolecules.
|
|
eigensoft
reduction of population bias for genetic analyses
|
Versions of package eigensoft |
Release | Version | Architectures |
trixie | 8.0.0+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 6.1.4+dfsg-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
sid | 8.0.0+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 7.2.1+dfsg-1 | amd64,arm64,armhf,i386 |
bullseye | 7.2.1+dfsg-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 8.0.0+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
The EIGENSOFT package combines functionality from the group's population
genetics methods (Patterson et al. 2006) and their EIGENSTRAT stratification
method (Price et al. 2006). The EIGENSTRAT method uses principal components
analysis to explicitly model ancestry differences between cases and
controls along continuous axes of variation; the resulting correction is
specific to a candidate marker's variation in frequency across ancestral
populations, minimizing spurious associations while maximizing power to
detect true associations. The EIGENSOFT package has a built-in plotting
script and supports multiple file formats and quantitative phenotypes.
|
|
elph
DNA/protein sequence motif finder
|
Versions of package elph |
Release | Version | Architectures |
buster | 1.0.1-2 | amd64,arm64,armhf,i386 |
bullseye | 1.0.1-5 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.0.1-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 1.0.1-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.0.1-5 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
ELPH (Estimated Locations of Pattern Hits) is a general-purpose
Gibbs sampler for finding motifs in a set of DNA or protein sequences.
The program takes as input a set containing anywhere from a few dozen to
thousands of sequences, and searches through them for the most common
motif, assuming that each sequence contains one copy of the motif. ELPH
was used to find patterns such as ribosome binding sites (RBSs) and exon
splicing enhancers (ESEs).
|
|
embassy-domainatrix
Extra EMBOSS commands to handle domain classification file
|
Versions of package embassy-domainatrix |
Release | Version | Architectures |
stretch | 0.1.660-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bullseye | 0.1.660-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 0.1.650-1 | amd64,armel,armhf,i386 |
sid | 0.1.660-5 | amd64,arm64,mips64el,ppc64el,riscv64 |
bookworm | 0.1.660-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 0.1.660-5 | amd64,arm64,mips64el,ppc64el,riscv64 |
buster | 0.1.660-3 | amd64,arm64,armhf,i386 |
Debtags of package embassy-domainatrix: |
field | biology, biology:bioinformatics |
interface | commandline |
role | program |
scope | utility |
use | analysing, converting, editing, searching |
works-with-format | plaintext |
|
License: DFSG free
|
The DOMAINATRIX programs were developed by Jon Ison and colleagues at MRC HGMP
for their protein domain research. They are included as an EMBASSY package as
a work in progress.
Applications in the current domainatrix release are cathparse (generates DCF
file from raw CATH files), domainnr (removes redundant domains from a DCF
file), domainreso (removes low resolution domains from a DCF file), domainseqs
(adds sequence records to a DCF file), domainsse (adds secondary structure
records to a DCF file), scopparse (generates DCF file from raw SCOP files) and
ssematch (searches a DCF file for secondary structure matches).
|
|
embassy-domalign
Extra EMBOSS commands for protein domain alignment
|
Versions of package embassy-domalign |
Release | Version | Architectures |
sid | 0.1.660-5 | amd64,arm64,mips64el,ppc64el,riscv64 |
jessie | 0.1.650-1 | amd64,armel,armhf,i386 |
bookworm | 0.1.660-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 0.1.660-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 0.1.660-3 | amd64,arm64,armhf,i386 |
stretch | 0.1.660-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
trixie | 0.1.660-5 | amd64,arm64,mips64el,ppc64el,riscv64 |
Debtags of package embassy-domalign: |
field | biology, biology:bioinformatics |
interface | commandline |
role | program |
scope | utility |
use | analysing, comparing, editing |
works-with-format | plaintext |
|
License: DFSG free
|
The DOMALIGN programs were developed by Jon Ison and colleagues at MRC HGMP
for their protein domain research. They are included as an EMBASSY package as
a work in progress.
Applications in the current domalign release are allversusall (sequence
similarity data from all-versus-all comparison), domainalign (generates
alignments (DAF file) for nodes in a DCF file), domainrep (reorders DCF file
to identify representative structures) and seqalign (extend alignments (DAF
file) with sequences (DHF file)).
|
|
embassy-domsearch
Extra EMBOSS commands to search for protein domains
|
Versions of package embassy-domsearch |
Release | Version | Architectures |
bookworm | 0.1.660-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 0.1.660-3 | amd64,arm64,armhf,i386 |
trixie | 0.1.660-4 | amd64,arm64,mips64el,ppc64el,riscv64 |
sid | 0.1.660-4 | amd64,arm64,mips64el,ppc64el,riscv64 |
bullseye | 0.1.660-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 0.1.660-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
jessie | 0.1.650-1 | amd64,armel,armhf,i386 |
Debtags of package embassy-domsearch: |
field | biology, biology:bioinformatics |
interface | commandline |
role | program |
scope | utility |
use | analysing |
|
License: DFSG free
|
The DOMSEARCH programs were developed by Jon Ison and colleagues at MRC HGMP
for their protein domain research. They are included as an EMBASSY package as
a work in progress.
Applications in this DOMSEARCH release are seqfraggle (removes fragment
sequences from DHF files), seqnr (removes redundancy from DHF files), seqsearch
(generates PSI-BLAST hits (DHF file) from a DAF file), seqsort (Remove
ambiguous classified sequences from DHF files) and seqwords (Generates DHF
files from keyword search of UniProt).
|
|
emboss
European molecular biology open software suite
|
Versions of package emboss |
Release | Version | Architectures |
buster | 6.6.0+dfsg-7 | amd64,arm64,armhf,i386 |
jessie | 6.6.0+dfsg-1 | amd64,armel,armhf,i386 |
sid | 6.6.0+dfsg-15 | amd64,arm64,mips64el,ppc64el,riscv64 |
stretch | 6.6.0+dfsg-6 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
trixie | 6.6.0+dfsg-15 | amd64,arm64,mips64el,ppc64el,riscv64 |
bookworm | 6.6.0+dfsg-12 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 6.6.0+dfsg-9 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package emboss: |
field | biology, biology:bioinformatics, biology:molecular |
interface | commandline |
role | program |
scope | suite |
use | analysing, comparing, converting, editing, organizing, searching, text-formatting, typesetting, viewing |
works-with | db |
works-with-format | plaintext |
|
License: DFSG free
|
EMBOSS is a free Open Source software analysis package specially developed for
the needs of the molecular biology (e.g. EMBnet) user community. The software
automatically copes with data in a variety of formats and even allows
transparent retrieval of sequence data from the web. Also, as extensive
libraries are provided with the package, it is a platform to allow other
scientists to develop and release software in true open source spirit. EMBOSS
also integrates a range of currently available packages and tools for sequence
analysis into a seamless whole. EMBOSS breaks the historical trend towards
commercial software packages.
|
|
emmax
genetic mapping considering population structure
|
Versions of package emmax |
Release | Version | Architectures |
bookworm | 0~beta.20100307-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 0~beta.20100307-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 0~beta.20100307-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 0~beta.20100307-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
EMMAX is a statistical test for large scale human or model organism
association mapping accounting for the sample structure. In addition
to the computational efficiency obtained by EMMA algorithm, EMMAX takes
advantage of the fact that each locus explains only a small fraction of
complex traits, which allows one to avoid repetitive variance component
estimation procedure, resulting in a significant amount of increase in
computational time of association mapping using mixed model.
|
|
estscan
ORF-independent detector of coding DNA sequences
|
Versions of package estscan |
Release | Version | Architectures |
trixie | 3.0.3-6 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 3.0.3-5 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 3.0.3-3 | amd64,arm64,armhf,i386 |
bookworm | 3.0.3-6 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 3.0.3-6 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
ESTScan is a program that can detect coding regions in DNA sequences,
even if they are of low quality. ESTScan will also detect and correct
sequencing errors that lead to frameshifts. ESTScan is not a gene
prediction program , nor is it an open reading frame detector. In fact,
its strength lies in the fact that it does not require an open reading
frame to detect a coding region. As a result, the program may miss a
few translated amino acids at either the N or the C terminus, but will
detect coding regions with high selectivity and sensitivity.
ESTScan takes advantages of the bias in hexanucleotide usage found in
coding regions relative to non-coding regions. This bias is formalized
as an inhomogeneous 3-periodic fifth-order Hidden Markov Model
(HMM). Additionally, the HMM of ESTScan has been extended to allows
insertions and deletions when these improve the coding region statistics.
Please cite:
C. Lottaz, C. Iseli, CV. Jongeneel and Philipp Bucher:
Modeling sequencing errors by combining Hidden Markov models
Bioinformatics
19:103-112
(2003)
|
|
examl
Exascale Maximum Likelihood (ExaML) code for phylogenetic inference
|
Versions of package examl |
Release | Version | Architectures |
bookworm | 3.0.22-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 3.0.21-2 | amd64,i386 |
trixie | 3.0.22-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 3.0.22-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 3.0.22-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
Exascale Maximum Likelihood (ExaML) is a code for phylogenetic inference
using MPI. This code implements the popular RAxML search algorithm for
maximum likelihood based inference of phylogenetic trees.
ExaML is a strapped-down light-weight version of RAxML for phylogenetic
inference on huge datasets. It can only execute some very basic
functions and is intended for computer-savvy users that can write little
perl-scripts and have experience using queue submission scripts for
clusters. ExaML only implements the CAT and GAMMA models of rate
heterogeneity for binary, DNA, and protein data.
ExaML uses a radically new MPI parallelization approach that yields
improved parallel efficiency, in particular on partitioned multi-gene or
whole-genome datasets. It also implements a new load balancing algorithm
that yields better parallel efficiency.
It is up to 4 times faster than its predecessor RAxML-Light and scales
to a larger number of processors.
|
|
exonerate
generic tool for pairwise sequence comparison
|
Versions of package exonerate |
Release | Version | Architectures |
buster | 2.4.0-4 | amd64,arm64,armhf,i386 |
trixie | 2.4.0-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 2.4.0-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 2.4.0-5 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 2.4.0-2 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bullseye | 2.4.0-5 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 2.2.0-6 | amd64,armel,armhf,i386 |
Debtags of package exonerate: |
field | biology, biology:bioinformatics |
interface | commandline |
role | program |
scope | utility |
use | searching |
works-with-format | plaintext |
|
License: DFSG free
|
Exonerate allows you to align sequences using a many alignment models, using
either exhaustive dynamic programming, or a variety of heuristics. Much of
the functionality of the Wise dynamic programming suite was reimplemented in C
for better efficiency. Exonerate is an intrinsic component of the building of
the Ensembl genome databases, providing similarity scores between RNA and DNA
sequences and thus determining splice variants and coding sequences in
general.
An In-silico PCR Experiment Simulation System (see the ipcress man page) is
packaged with exonerate.
This package also comes with a selection of utilities for performing
simple manipulations quickly on fasta files beyond 2Gb
|
|
fasta3
tools for searching collections of biological sequences
|
Versions of package fasta3 |
Release | Version | Architectures |
buster | 36.3.8g-1 (non-free) | amd64 |
bookworm | 36.3.8i.14-Nov-2020-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 36.3.8i.14-Nov-2020-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 36.3.8h.2020-02-11-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 36.3.8i.14-Nov-2020-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
The FASTA programs find regions of local or global similarity between
Protein or DNA sequences, either by searching Protein or DNA databases,
or by identifying local duplications within a sequence. Other
programs provide information on the statistical significance of an
alignment. Like BLAST, FASTA can be used to infer functional and
evolutionary relationships between sequences as well as help identify
members of gene families.
- Protein
- Protein-protein FASTA
- Protein-protein Smith-Waterman (ssearch)
- Global Protein-protein (Needleman-Wunsch) (ggsearch)
- Global/Local protein-protein (glsearch)
- Protein-protein with unordered peptides (fasts)
-
Protein-protein with mixed peptide sequences (fastf)
-
Nucleotide
- Nucleotide-Nucleotide (DNA/RNA fasta)
- Ordered Nucleotides vs Nucleotide (fastm)
-
Un-ordered Nucleotides vs Nucleotide (fasts)
-
Translated
- Translated DNA (with frameshifts, e.g. ESTs)
vs Proteins (fastx/fasty)
- Protein vs Translated DNA (with frameshifts)
(tfastx/tfasty)
-
Peptides vs Translated DNA (tfasts)
-
Statistical Significance
- Protein vs Protein shuffle (prss)
- DNA vs DNA shuffle (prss)
-
Translated DNA vs Protein shuffle (prfx)
-
Local Duplications
- Local Protein alignments (lalign)
- Plot Protein alignment "dot-plot" (plalign)
- Local DNA alignments (lalign)
- Plot DNA alignment "dot-plot" (plalign)
This software is often used via a web service at the
EBI with readily indexed reference databases at
http://www.ebi.ac.uk/Tools/fasta/.
|
|
fastahack
utility for indexing and sequence extraction from FASTA files
|
Versions of package fastahack |
Release | Version | Architectures |
buster | 0.0+git20160702.bbc645f+dfsg-6 | amd64,arm64,armhf,i386 |
stretch | 0.0+20160702-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el |
bullseye | 1.0.0+dfsg-7 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.0.0+dfsg-10 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.0.0+dfsg-11 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 1.0.0+dfsg-11 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
fastahack is a small application for indexing and extracting sequences and
subsequences from FASTA files. The included Fasta.cpp library provides a FASTA
reader and indexer that can be embedded into applications which would benefit
from directly reading subsequences from FASTA files. The library automatically
handles index file generation and use.
Features:
- FASTA index (.fai) generation for FASTA files
- Sequence extraction
- Subsequence extraction
- Sequence statistics (currently only entropy is provided)
Sequence and subsequence extraction use fseek64 to provide fastest-possible
extraction without RAM-intensive file loading operations. This makes fastahack
a useful tool for bioinformaticists who need to quickly extract many
subsequences from a reference FASTA sequence.
|
|
fastani
Fast alignment-free computation of whole-genome Average Nucleotide Identity
|
Versions of package fastani |
Release | Version | Architectures |
sid | 1.33-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 1.33-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.33-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
ANI is defined as mean nucleotide identity of orthologous gene pairs
shared between two microbial genomes. FastANI supports pairwise comparison
of both complete and draft genome assemblies.
|
|
fastaq
FASTA and FASTQ file manipulation tools
|
Versions of package fastaq |
Release | Version | Architectures |
trixie | 3.17.0-8 | all |
sid | 3.17.0-8 | all |
bookworm | 3.17.0-5 | all |
bullseye | 3.17.0-3 | all |
jessie | 1.5.0-1 | all |
buster | 3.17.0-2 | all |
stretch | 3.14.0-1 | all |
|
License: DFSG free
|
Fastaq represents a diverse collection of scripts that perform useful and
common FASTA/FASTQ manipulation tasks, such as filtering, merging, splitting,
sorting, trimming, search/replace, etc. Input and output files can be gzipped
(format is automatically detected) and individual Fastaq commands can be piped
together.
|
|
fastdnaml
Tool for construction of phylogenetic trees of DNA sequences
|
Versions of package fastdnaml |
Release | Version | Architectures |
buster | 1.2.2-14 | amd64,arm64,armhf,i386 |
jessie | 1.2.2-10 | amd64,armel,armhf,i386 |
stretch | 1.2.2-11 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
sid | 1.2.2-17 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 1.2.2-17 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.2.2-15 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.2.2-15 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package fastdnaml: |
field | biology, biology:bioinformatics |
interface | commandline |
role | program |
scope | utility |
use | analysing, comparing |
works-with-format | plaintext |
|
License: DFSG free
|
fastDNAml is a program derived from Joseph Felsenstein's version 3.3 DNAML
(part of his PHYLIP package). Users should consult the documentation for
DNAML before using this program.
fastDNAml is an attempt to solve the same problem as DNAML, but to do so
faster and using less memory, so that larger trees and/or more bootstrap
replicates become tractable. Much of fastDNAml is merely a recoding of the
PHYLIP 3.3 DNAML program from PASCAL to C.
Note that the homepage of this program is not available any more and so
this program will probably not see any further updates.
|
|
fastlink
faster version of pedigree programs of Linkage
|
Versions of package fastlink |
Release | Version | Architectures |
sid | 4.1P-fix100+dfsg-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 4.1P-fix100+dfsg-2 | amd64,arm64,armhf,i386 |
bullseye | 4.1P-fix100+dfsg-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 4.1P-fix100+dfsg-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 4.1P-fix95-3 | amd64,armel,armhf,i386 |
stretch | 4.1P-fix100+dfsg-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bookworm | 4.1P-fix100+dfsg-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package fastlink: |
field | biology, biology:bioinformatics |
interface | commandline |
role | program |
scope | utility |
use | analysing, comparing |
|
License: DFSG free
|
Genetic linkage analysis is a statistical technique used to map
genes and find the approximate location of disease genes. There
was a standard software package for genetic linkage called
LINKAGE. FASTLINK is a significantly modified and improved
version of the main programs of LINKAGE that runs much faster
sequentially, can run in parallel, allows the user to recover
gracefully from a computer crash, and provides abundant new
documentation. FASTLINK has been used in over 1000 published
genetic linkage studies.
This package contains the following programs:
ilink: GEMINI optimization procedure to find a locally
optimal value of the theta vector of recombination
fractions
linkmap: calculates location scores of one locus against a
fixed map of other loci
lodscore: compares likelihoods at locally optimal theta
mlink: calculates lod scores and risk with two of more loci
unknown: identify possible genotypes for unknowns
|
|
fastml
maximum likelihood ancestral amino-acid sequence reconstruction
|
Versions of package fastml |
Release | Version | Architectures |
buster | 3.1-4 | amd64,arm64,armhf,i386 |
bullseye | 3.11-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 3.11-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 3.1-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
trixie | 3.11-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 3.11-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
FastML is a bioinformatics tool for the reconstruction of ancestral
sequences based on the phylogenetic relations between homologous
sequences. FastML runs several algorithms that reconstruct the ancestral
sequences with emphasis on an accurate reconstruction of both indels and
characters. For character reconstruction the previously described FastML
algorithms are used to efficiently infer the most likely ancestral
sequences for each internal node of the tree. Both joint and the
marginal reconstructions are provided. For indels reconstruction the
sequences are first coded according to the indel events detected within
the multiple sequence alignment (MSA) and then a state-of-the-art
likelihood model is used to reconstruct ancestral indels states. The
results are the most probable sequences, together with posterior
probabilities for each character and indel at each sequence position for
each internal node of the tree. FastML is generic and is applicable for
any type of molecular sequences (nucleotide, protein, or codon
sequences).
|
|
fastp
Ultra-fast all-in-one FASTQ preprocessor
|
Versions of package fastp |
Release | Version | Architectures |
trixie | 0.23.4+dfsg-1 | amd64,arm64,armel,armhf,mips64el,ppc64el,riscv64,s390x |
sid | 0.23.4+dfsg-1 | amd64,arm64,armel,armhf,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.23.2+dfsg-2 | amd64,arm64,armel,armhf,mips64el,mipsel,ppc64el,s390x |
buster | 0.19.6+dfsg-1 | amd64,arm64,armhf,i386 |
bullseye | 0.20.1+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
All-in-one FASTQ preprocessor, fastp provides functions including quality
profiling, adapter trimming, read filtering and base correction. It supports
both single-end and paired-end short read data and also provides basic support
for long-read data.
The package is enhanced by the following packages:
multiqc
|
|
fastq-pair
Rewrites paired end fastq so all reads have a mate to separate out singletons
|
Versions of package fastq-pair |
Release | Version | Architectures |
trixie | 1.0-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.0-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 1.0-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
This package rewrites the fastq files with the sequences in order,
with matching files for the two files provided on the command line,
and then any single reads that are not matched are place in two separate
files, one for each original file.
This code is designed to be fast and memory efficient, and works with large
fastq files. It does not store the whole file in memory, but rather just stores
the locations of each of the indices in the first file provided in memory.
|
|
fastqc
quality control for high throughput sequence data
|
Versions of package fastqc |
Release | Version | Architectures |
stretch | 0.11.5+dfsg-6 | all |
jessie | 0.11.2+dfsg-3 | all |
sid | 0.12.1+dfsg-4 | all |
trixie | 0.12.1+dfsg-4 | all |
bookworm | 0.11.9+dfsg-6 | all |
bullseye | 0.11.9+dfsg-4 | all |
buster | 0.11.8+dfsg-2 | all |
|
License: DFSG free
|
FastQC aims to provide a simple way to do some quality control checks on
raw sequence data coming from high throughput sequencing pipelines. It
provides a modular set of analyses which you can use to give a quick
impression of whether your data has any problems of which you should
be aware before doing any further analysis.
The main functions of FastQC are
- Import of data from BAM, SAM or FastQ files (any variant)
- Providing a quick overview to tell you in which areas there may
be problems
- Summary graphs and tables to quickly assess your data
- Export of results to an HTML based permanent report
- Offline operation to allow automated generation of reports without
running the interactive application
The package is enhanced by the following packages:
multiqc
|
|
fastqtl
Quantitative Trait Loci (QTL) mapper in cis for molecular phenotypes
|
Versions of package fastqtl |
Release | Version | Architectures |
buster | 2.184+dfsg-6 | amd64,arm64,armhf |
bookworm | 2.184+v7+dfsg-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 2.184+dfsg-7 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 2.184+v7+dfsg-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 2.184+dfsg-5 | amd64,arm64,armel,i386,mips64el,mipsel,ppc64el |
sid | 2.184+v7+dfsg-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
The goal of FastQTL is to identify single-nucleotide polymorphisms (SNPs)
which are significantly associated with various molecular phenotypes
(i.e. expression of known genes, cytosine methylation levels, etc).
It performs scans for all possible phenotype-variant pairs in cis
(i.e. variants located within a specific window around a phenotype).
FastQTL implements a new permutation scheme (Beta approximation) to accurately
and rapidly correct for multiple-testing at both the genotype and phenotype
levels.
The package is enhanced by the following packages:
fastqtl-doc
|
|
fasttree
phylogenetic trees from alignments of nucleotide or protein sequences
|
Versions of package fasttree |
Release | Version | Architectures |
bookworm | 2.1.11-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 2.1.11-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 2.1.7-2 | amd64,armel,armhf,i386 |
stretch | 2.1.9-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 2.1.10-2 | amd64,arm64,armhf,i386 |
sid | 2.1.11-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 2.1.11-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
FastTree infers approximately-maximum-likelihood phylogenetic trees from
alignments of nucleotide or protein sequences. It handles alignments
with up to a million of sequences in a reasonable amount of time and
memory. For large alignments, FastTree is 100-1,000 times faster than
PhyML 3.0 or RAxML 7.
FastTree is more accurate than PhyML 3 with default settings, and much
more accurate than the distance-matrix methods that are traditionally
used for large alignments. FastTree uses the Jukes-Cantor or generalized
time-reversible (GTR) models of nucleotide evolution and the JTT
(Jones-Taylor-Thornton 1992) model of amino acid evolution. To account
for the varying rates of evolution across sites, FastTree uses a single
rate for each site (the "CAT" approximation). To quickly estimate the
reliability of each split in the tree, FastTree computes local support
values with the Shimodaira-Hasegawa test (these are the same as PhyML 3's
"SH-like local supports").
This package contains a single threaded version (fasttree) and a
parallel version which uses OpenMP (fasttreMP).
|
|
ffindex
simple index/database for huge amounts of small files
|
Versions of package ffindex |
Release | Version | Architectures |
stretch | 0.9.9.7-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 0.9.9.9-2 | amd64,arm64,armhf,i386 |
bullseye | 0.9.9.9-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 0.9.9.9-6.1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 0.9.9.9-6.1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.9.9.9-6 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 0.9.9.3-2 | amd64,armel,armhf,i386 |
|
License: DFSG free
|
FFindex is a very simple index/database for huge amounts of small files. The
files are stored concatenated in one big data file, separated by '\0'. A
second file contains a plain text index, giving name, offset and length of
the small files. The lookup is currently done with a binary search on an
array made from the index file.
This package provides the executables.
|
|
figtree
graphical phylogenetic tree viewer
|
Versions of package figtree |
Release | Version | Architectures |
bookworm | 1.4.4-5 | all |
stretch | 1.4.2+dfsg-2 | all |
trixie | 1.4.4-6 | all |
buster | 1.4.4-3 | all |
sid | 1.4.4-6 | all |
bullseye | 1.4.4-5 | all |
jessie | 1.4-2 | all |
|
License: DFSG free
|
FigTree is designed as a graphical viewer of phylogenetic trees and as
a program for producing publication-ready figures. In particular it is
designed to display summarized and annotated trees produced by BEAST.
|
|
filtlong
quality filtering tool for long reads of genome sequences
|
Versions of package filtlong |
Release | Version | Architectures |
trixie | 0.2.1-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 0.2.1-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 0.2.0-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 0.2.1-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
Filtlong is a tool for filtering long reads by quality. It can take a
set of long reads and produce a smaller, better subset. It uses both
read length (longer is better) and read identity (higher is better) when
choosing which reads pass the filter.
|
|
fitgcp
fitting genome coverage distributions with mixture models
|
Versions of package fitgcp |
Release | Version | Architectures |
bookworm | 0.0.20150429-5 | all |
trixie | 0.0.20150429-5 | all |
bullseye | 0.0.20150429-4 | all |
sid | 0.0.20150429-5 | all |
jessie | 0.0.20130418-2 | amd64,i386 |
buster | 0.0.20150429-2 | amd64,arm64 |
stretch | 0.0.20150429-1 | amd64,arm64,mips64el,ppc64el |
|
License: DFSG free
|
Genome coverage, the number of sequencing reads mapped to a position in
a genome, is an insightful indicator of irregularities within sequencing
experiments. While the average genome coverage is frequently used within
algorithms in computational genomics, the complete information available
in coverage profiles (i.e. histograms over all coverages) is currently
not exploited to its full extent. Thus, biases such as fragmented or
erroneous reference genomes often remain unaccounted for. Making this
information accessible can improve the quality of sequencing experiments
and quantitative analyses.
fitGCP is a framework for fitting mixtures of probability distributions
to genome coverage profiles. Besides commonly used distributions, fitGCP
uses distributions tailored to account for common artifacts. The mixture
models are iteratively fitted based on the Expectation-Maximization
algorithm.
|
|
flash
Fast Length Adjustment of SHort reads
|
Versions of package flash |
Release | Version | Architectures |
trixie | 1.2.11-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.2.11-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 1.2.11-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 1.2.11-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
FLASH (Fast Length Adjustment of SHort reads) is a very fast and
accurate software tool to merge paired-end reads from next-generation
sequencing experiments. FLASH is designed to merge pairs of reads when
the original DNA fragments are shorter than twice the length of reads.
The resulting longer reads can significantly improve genome assemblies.
They can also improve transcriptome assembly when FLASH is used to merge
RNA-seq data.
The package is enhanced by the following packages:
multiqc
|
|
flexbar
flexible barcode and adapter removal for sequencing platforms
|
Versions of package flexbar |
Release | Version | Architectures |
stretch | 2.50-2 | amd64,arm64,armhf,i386,mips,mips64el,mipsel,ppc64el |
bookworm | 3.5.0-5 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 3.5.0-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 3.4.0-2 | amd64,arm64,armhf,i386 |
trixie | 3.5.0-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 3.5.0-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 2.50-1 | amd64,armhf,i386 |
|
License: DFSG free
|
Flexbar preprocesses high-throughput sequencing data efficiently. It
demultiplexes barcoded runs and removes adapter sequences. Moreover,
trimming and filtering features are provided. Flexbar increases mapping
rates and improves genome and transcriptome assemblies. It supports
next-generation sequencing data in fasta/q and csfasta/q format from
Illumina, Roche 454, and the SOLiD platform.
Parameter names changed in Flexbar. Please review scripts. The recent
months, default settings were optimised, several bugs were fixed and
various improvements were made, e.g. revamped command-line interface,
new trimming modes as well as lower time and memory requirements.
The package is enhanced by the following packages:
multiqc
|
|
flye
de novo assembler for single molecule sequencing reads using repeat graphs
|
Versions of package flye |
Release | Version | Architectures |
bookworm | 2.9.1+dfsg-1 | amd64,arm64,mips64el,ppc64el,s390x |
trixie | 2.9.5+dfsg-1 | amd64,arm64,mips64el,ppc64el,riscv64,s390x |
sid | 2.9.5+dfsg-1 | amd64,arm64,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
Flye is a de novo assembler for single molecule sequencing reads, such
as those produced by PacBio and Oxford Nanopore Technologies. It is
designed for a wide range of datasets, from small bacterial projects to
large mammalian-scale assemblies. The package represents a complete
pipeline: it takes raw PacBio / ONT reads as input and outputs polished
contigs. Flye also has a special mode for metagenome assembly.
|
|
fml-asm
tool for assembling Illumina short reads in small regions
|
Versions of package fml-asm |
Release | Version | Architectures |
sid | 0.1+git20190320.b499514-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.1+git20190320.b499514-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 0.1+git20190320.b499514-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
experimental | 0.1+git20190320.b499514-2~0exp | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 0.1-2 | amd64 |
trixie | 0.1+git20190320.b499514-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch-backports | 0.1-4~bpo9+1 | amd64 |
buster | 0.1-5 | amd64 |
upstream | 0.1+git20221215.85f159e |
|
License: DFSG free
|
Fml-asm is a command-line tool for assembling Illumina short reads in regions
from 100bp to 10 million bp in size, based on the fermi-lite library.
It is largely a light-weight in-memory version of fermikit without
generating any intermediate files. It inherits the performance, the relatively
small memory footprint and the features of fermikit. In particular, fermi-lite
is able to retain heterozygous events and thus can be used to assemble diploid
regions for the purpose of variant calling.
|
|
freebayes
Bayesian haplotype-based polymorphism discovery and genotyping
|
Versions of package freebayes |
Release | Version | Architectures |
experimental | 1.3.7-1~exp | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,s390x |
stretch-backports | 1.2.0-1~bpo9+1 | amd64 |
buster | 1.2.0-2 | amd64 |
bookworm | 1.3.6-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.3.5-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 1.3.7-1 | amd64,arm64,mips64el,ppc64el,riscv64 |
upstream | 1.3.8-pre3 |
|
License: DFSG free
|
FreeBayes is a Bayesian genetic variant detector designed to find
small polymorphisms, specifically SNPs (single-nucleotide
polymorphisms), indels (insertions and deletions), MNPs
(multi-nucleotide polymorphisms), and complex events (composite
insertion and substitution events) smaller than the length of a
short-read sequencing alignment.
|
|
freecontact
fast protein contact predictor
|
Versions of package freecontact |
Release | Version | Architectures |
sid | 1.0.21-14 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.0.21-13 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 1.0.21-3 | amd64,armel,armhf,i386 |
stretch | 1.0.21-5 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 1.0.21-7 | amd64,arm64,armhf,i386 |
bullseye | 1.0.21-9 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.0.21-14 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
FreeContact is a protein residue contact predictor optimized for speed.
Its input is a multiple sequence alignment. FreeContact can function as an
accelerated drop-in for the published contact predictors
EVfold-mfDCA of DS. Marks (2011) and
PSICOV of D. Jones (2011).
FreeContact is accelerated by a combination of vector instructions, multiple
threads, and faster implementation of key parts.
Depending on the alignment, 8-fold or higher speedups are possible.
A sufficiently large alignment is required for meaningful results.
As a minimum, an alignment with an effective (after-weighting) sequence count
bigger than the length of the query sequence should be used. Alignments with
tens of thousands of (effective) sequences are considered good input.
jackhmmer(1) from the hmmer package, or hhblits(1) from hhsuite
can be used to generate the alignments, for example.
This package contains the command line tool freecontact(1).
Topics: Structure prediction; Sequence analysis
|
|
fsa
Fast Statistical Alignment of protein, RNA or DNA sequences
|
Versions of package fsa |
Release | Version | Architectures |
buster | 1.15.9+dfsg-4 | amd64,arm64,armhf,i386 |
stretch | 1.15.9+dfsg-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.15.9+dfsg-6 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.15.9+dfsg-6 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 1.15.9+dfsg-6 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.15.9+dfsg-6 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
FSA is a probabilistic multiple sequence alignment algorithm which uses
a "distance-based" approach to aligning homologous protein, RNA or DNA
sequences. Much as distance-based phylogenetic reconstruction methods
like Neighbor-Joining build a phylogeny using only pairwise divergence
estimates, FSA builds a multiple alignment using only pairwise
estimations of homology. This is made possible by the sequence annealing
technique for constructing a multiple alignment from pairwise
comparisons, developed by Ariel Schwartz.
FSA brings the high accuracies previously available only for
small-scale analyses of proteins or RNAs to large-scale problems such as
aligning thousands of sequences or megabase-long sequences. FSA
introduces several novel methods for constructing better alignments:
- FSA uses machine-learning techniques to estimate gap and
substitution parameters on the fly for each set of input sequences.
This "query-specific learning" alignment method makes FSA very robust:
it can produce superior alignments of sets of homologous sequences
which are subject to very different evolutionary constraints.
- FSA is capable of aligning hundreds or even thousands of sequences
using a randomized inference algorithm to reduce the computational
cost of multiple alignment. This randomized inference can be over ten
times faster than a direct approach with little loss of accuracy.
- FSA can quickly align very long sequences using the "anchor
annealing" technique for resolving anchors and projecting them with
transitive anchoring. It then stitches together the alignment between
the anchors using the methods described above.
- The included GUI, MAD (Multiple Alignment Display), can display the
intermediate alignments produced by FSA, where each character is
colored according to the probability that it is correctly aligned
Please cite:
Robert K. Bradley, Adam Roberts, Michael Smoot, Sudeep Juvekar, Jaeyoung Do, Colin Dewey, Ian Holmes and Lior Pachter:
Fast Statistical Alignment.
(PubMed,eprint)
PLoS Comput Biol.
5(5):e1000392
(2009)
|
|
fsm-lite
frequency-based string mining (lite)
|
Versions of package fsm-lite |
Release | Version | Architectures |
bullseye | 1.0-5 | amd64,arm64,mips64el,ppc64el,s390x |
sid | 1.0-8 | amd64,arm64,mips64el,ppc64el,riscv64,s390x |
trixie | 1.0-8 | amd64,arm64,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.0-8 | amd64,arm64,mips64el,ppc64el,s390x |
buster | 1.0-3 | amd64,arm64 |
stretch | 1.0-2 | amd64,arm64,mips64el,ppc64el,s390x |
|
License: DFSG free
|
A singe-core implementation of frequency-based substring mining used in
bioinformatics to extract substrings that discriminate two (or more)
datasets inside high-throughput sequencing data.
|
|
gamgi
General Atomistic Modelling Graphic Interface (GAMGI)
|
Versions of package gamgi |
Release | Version | Architectures |
sid | 0.17.5-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.17.5-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
stretch | 0.17.1-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 0.17.3-2 | amd64,arm64,armhf,i386 |
jessie | 0.17.1-1 | amd64,armel,armhf,i386 |
bullseye | 0.17.3-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package gamgi: |
role | program |
uitoolkit | gtk |
|
License: DFSG free
|
The General Atomistic Modelling Graphic Interface (GAMGI) provides
a graphical interface to build, view and analyze atomic structures.
The program is aimed at the scientific community and provides a
graphical interface to study atomic structures and to prepare images
for presentations, and for teaching the atomic structure of matter.
|
|
garli
phylogenetic analysis of molecular sequence data using maximum-likelihood
|
Versions of package garli |
Release | Version | Architectures |
bullseye | 2.1-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 2.1-3 | amd64,arm64,armhf,i386 |
sid | 2.1-7 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 2.1-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bookworm | 2.1-7 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
GARLI, Genetic Algorithm for Rapid Likelihood Inference is a program for
inferring phylogenetic trees. Using an approach similar to a classical
genetic algorithm, it rapidly searches the space of evolutionary trees
and model parameters to find the solution maximizing the likelihood
score. It implements nucleotide, amino acid and codon-based models of
sequence evolution, and runs on all platforms. The latest version adds
support for partitioned models and morphology-like datatypes.
|
|
garlic
visualization program for biomolecules
|
Versions of package garlic |
Release | Version | Architectures |
bookworm | 1.6-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 1.6-1.1 | amd64,armel,armhf,i386 |
stretch | 1.6-1.1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 1.6-3 | amd64,arm64,armhf,i386 |
bullseye | 1.6-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.6-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 1.6-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
Debtags of package garlic: |
field | biology, chemistry |
interface | x11 |
role | program |
scope | utility |
uitoolkit | xlib |
use | viewing |
x11 | application |
|
License: DFSG free
|
Garlic is written for the investigation of membrane proteins. It may be
used to visualize other proteins, as well as some geometric objects.
This version of garlic recognizes PDB format version 2.1. Garlic may
also be used to analyze protein sequences.
It only depends on the X libraries, no other libraries are needed.
Features include:
- The slab position and thickness are visible in a small window.
- Atomic bonds as well as atoms are treated as independent drawable
objects.
- The atomic and bond colors depend on position. Five mapping modes
are available (as for slab).
- Capable to display stereo image.
- Capable to display other geometric objects, like membrane.
- Atomic information is available for atom covered by the mouse
pointer. No click required, just move the mouse pointer over the
structure!
- Capable to load more than one structure.
- Capable to draw Ramachandran plot, helical wheel, Venn diagram,
averaged hydrophobicity and hydrophobic moment plot.
- The command prompt is available at the bottom of the main window.
It is able to display one error message and one command string.
Please cite:
Damir Zucic and Davor Juretic:
Precise Annotation of Transmembrane Segments with Garlic - a Free Molecular Visualization Program
(eprint)
Croatica Chemica Acta
77(1-2):397-401
(2004)
|
|
gasic
genome abundance similarity correction
|
Versions of package gasic |
Release | Version | Architectures |
stretch | 0.0.r19-1 | amd64 |
sid | 0.0.r19-8 | all |
trixie | 0.0.r19-8 | all |
bookworm | 0.0.r19-8 | all |
bullseye | 0.0.r19-7 | all |
buster | 0.0.r19-4 | amd64 |
jessie | 0.0.r18-2 | amd64 |
|
License: DFSG free
|
One goal of sequencing based metagenomic analysis is the quantitative
taxonomic assessment of microbial community compositions. However, the
majority of approaches either quantify at low resolution (e.g. at phylum
level) or have severe problems discerning highly similar species. Yet,
accurate quantification on species level is desirable in applications
such as metagenomic diagnostics or community comparison. GASiC is a
method to correct read alignment results for the ambiguities imposed by
similarities of genomes. It has superior performance over existing
methods.
|
|
gatb-core
Genome Analysis Toolbox with de-Bruijn graph
|
Versions of package gatb-core |
Release | Version | Architectures |
buster | 1.4.1+git20181225.44d5a44+dfsg-3 | amd64,arm64,i386 |
sid | 1.4.2+dfsg-13 | amd64,arm64,mips64el,ppc64el,riscv64 |
bookworm | 1.4.2+dfsg-11 | amd64,arm64,mips64el,ppc64el |
bullseye | 1.4.2+dfsg-6 | amd64,arm64,i386,mips64el,ppc64el,s390x |
trixie | 1.4.2+dfsg-13 | amd64,arm64,mips64el,ppc64el,riscv64 |
|
License: DFSG free
|
The GATB-CORE project provides a set of highly efficient
algorithms to analyse NGS data sets. These methods enable
the analysis of data sets of any size on multi-core desktop
computers, including very huge amount of reads data coming
from any kind of organisms such as bacteria, plants,
animals and even complex samples (e.g. metagenomes).
Read more about GATB at https://gatb.inria.fr/.
By itself GATB-CORE is not an NGS data analysis tool.
However, it can be used to create such tools. There already
exist a set of ready-to-use tools relying on GATB-CORE
library: see https://gatb.inria.fr/software/
Please cite:
Erwan Drezen, Guillaume Rizk, Rayan Chikhi, Charles Deltel, Claire Lemaitre, Pierre Peterlongo and Dominique Lavenier:
GATB: Genome Assembly & Analysis Tool Box.
Bioinformatics
30(20):2959-2961
(2014)
|
|
gbrowse
GMOD Generic Genome Browser
|
Versions of package gbrowse |
Release | Version | Architectures |
buster | 2.56+dfsg-4 | all |
bullseye | 2.56+dfsg-8 | all |
bookworm | 2.56+dfsg-11 | all |
trixie | 2.56+dfsg-12 | all |
sid | 2.56+dfsg-12 | all |
jessie | 2.54+dfsg-3 | all |
stretch | 2.56+dfsg-2 | all |
Debtags of package gbrowse: |
field | biology, biology:bioinformatics |
interface | web |
role | program |
use | analysing, viewing |
web | application, cgi |
|
License: DFSG free
|
Generic Genome Browser is a simple but highly
configurable web-based genome browser. It is a component of the
Generic Model Organism Systems Database project (GMOD).
Some of its features:
- Simultaneous bird's eye and detailed views of the genome;
- Scroll, zoom, center;
- Attach arbitrary URLs to any annotation;
- Order and appearance of tracks are customizable by administrator and
end-user;
- Search by annotation ID, name, or comment;
- Supports third party annotation using GFF formats;
- Settings persist across sessions;
- DNA and GFF dumps;
- Connectivity to different databases, including BioSQL and Chado;
- Multi-language support;
- Third-party feature loading;
- Customizable plug-in architecture (e.g. run BLAST, dump & import many
formats, find oligonucleotides, design primers, create restriction maps,
edit features).
|
|
gdpc
visualiser of molecular dynamic simulations
|
Versions of package gdpc |
Release | Version | Architectures |
trixie | 2.2.5-16 | amd64,arm64,mips64el,ppc64el,riscv64 |
bookworm | 2.2.5-15 | amd64,arm64,mips64el,ppc64el |
bullseye | 2.2.5-14 | amd64,arm64,mips64el,ppc64el |
sid | 2.2.5-16 | amd64,arm64,mips64el,ppc64el,riscv64 |
jessie | 2.2.5-3 | amd64,armel,armhf,i386 |
buster | 2.2.5-9 | amd64,arm64,armhf,i386 |
stretch | 2.2.5-6 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
Debtags of package gdpc: |
field | biology, biology:structural, chemistry, physics |
interface | x11 |
role | program |
scope | application |
uitoolkit | gtk |
use | viewing |
works-with | 3dmodel, image, video |
works-with-format | jpg, png |
x11 | application |
|
License: DFSG free
|
gpdc is a graphical program for visualising output data from
molecular dynamics simulations. It reads input in the standard xyz
format, as well as other custom formats, and can output pictures of
each frame in JPG or PNG format.
|
|
gemma
Genome-wide Efficient Mixed Model Association
|
Versions of package gemma |
Release | Version | Architectures |
trixie | 0.98.5+dfsg-2 | amd64,arm64,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 0.98.5+dfsg-2 | amd64,arm64,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 0.98.4+dfsg-4 | amd64,arm64,armhf,i386,mips64el,ppc64el,s390x |
bookworm | 0.98.5+dfsg-2 | amd64,arm64,armhf,i386,mips64el,ppc64el,s390x |
buster | 0.98.1+dfsg-1 | amd64,arm64,armhf,i386 |
|
License: DFSG free
|
GEMMA is the software implementing the Genome-wide Efficient Mixed
Model Association algorithm for a standard linear mixed model and some
of its close relatives for genome-wide association studies (GWAS):
- It fits a univariate linear mixed model (LMM) for marker association
tests with a single phenotype to account for population stratification
and sample structure, and for estimating the proportion of variance in
phenotypes explained (PVE) by typed genotypes (i.e. "chip heritability").
- It fits a multivariate linear mixed model (mvLMM) for testing marker
associations with multiple phenotypes simultaneously while controlling
for population stratification, and for estimating genetic correlations
among complex phenotypes.
- It fits a Bayesian sparse linear mixed model (BSLMM) using Markov
chain Monte Carlo (MCMC) for estimating PVE by typed genotypes,
predicting phenotypes, and identifying associated markers by jointly
modeling all markers while controlling for population structure.
- It estimates variance component/chip heritability, and partitions
it by different SNP functional categories. In particular, it uses HE
regression or REML AI algorithm to estimate variance components when
individual-level data are available. It uses MQS to estimate variance
components when only summary statisics are available.
GEMMA is computationally efficient for large scale GWAS and uses freely
available open-source numerical libraries.
Please cite:
Xiang Zhou and Matthew Stephens:
Genome-wide efficient mixed-model analysis for association studies
Nature Genetics
44:821-824
(2012)
|
|
genometester
toolkit for performing set operations on k-mer lists
|
Versions of package genometester |
Release | Version | Architectures |
sid | 4.0+git20200511.91cecb5+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 4.0+git20200511.91cecb5+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 4.0+git20200511.91cecb5+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 4.0+git20200511.91cecb5+dfsg-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 4.0+git20180508.a9c14a6+dfsg-1 | amd64,arm64,armhf,i386 |
upstream | 4.0+git20221122.71e6625 |
|
License: DFSG free
|
Toolkit for performing set operations - union, intersection and
complement - on k-mer lists.
GenomeTester4 toolkit, which contains a novel tool GListCompare for
performing union, intersection and complement (difference) set
operations on k-mer lists. It contains examples of how these
general operations can be combined to solve a variety of biological
analysis tasks.
|
|
genomethreader
software tool to compute gene structure predictions
|
Versions of package genomethreader |
Release | Version | Architectures |
bookworm | 1.7.3+dfsg-7 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 1.7.3+dfsg-10 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 1.7.3+dfsg-5 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.7.3+dfsg-10 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
GenomeThreader is a software tool to compute gene structure predictions.
The gene structure predictions are calculated using a similarity-based
approach where additional cDNA/EST and/or protein sequences are used to
predict gene structures via spliced alignments. GenomeThreader was motivated
by disabling limitations in GeneSeqer, a popular gene prediction program
which is widely used for plant genome annotation.
|
|
genometools
versatile genome analysis toolkit
|
Versions of package genometools |
Release | Version | Architectures |
bullseye | 1.6.1+ds-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm-backports | 1.6.5+ds-2~bpo12+1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye-backports-sloppy | 1.6.5+ds-2~bpo11+1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 1.6.5+ds-2.2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 1.6.5+ds-2.2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
jessie | 1.5.3-2 | amd64,armel,armhf,i386 |
bookworm | 1.6.2+ds-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster-backports | 1.6.1+ds-3~bpo10+1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
stretch | 1.5.9+ds-4 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 1.5.10+ds-3 | amd64,arm64,armhf,i386 |
Debtags of package genometools: |
biology | nuceleic-acids |
field | biology, biology:bioinformatics |
interface | commandline |
role | program |
uitoolkit | ncurses |
|
License: DFSG free
|
The GenomeTools contains a collection of useful tools for biological
sequence analysis and -presentation combined into a single binary.
The toolkit contains binaries for sequence and annotation handling, sequence
compression, index structure generation and access, annotation visualization,
and much more.
|
|
genomicsdb-tools
sparse array storage library for genomics (tools)
|
Versions of package genomicsdb-tools |
Release | Version | Architectures |
sid | 1.5.4-1 | amd64 |
sid | 1.5.3-3 | mips64el |
bookworm | 1.4.4-3 | amd64,mips64el |
|
License: DFSG free
|
GenomicsDB is built on top of a htslib fork and an internal array storage
system for importing, querying and transforming variant data. Variant data is
sparse by nature (sparse relative to the whole genome) and using sparse array
data stores is a perfect fit for storing such data.
The GenomicsDB stores variant data in a 2D array where:
- Each column corresponds to a genomic position (chromosome + position);
- Each row corresponds to a sample in a VCF (or CallSet in the GA4GH
terminology);
- Each cell contains data for a given sample/CallSet at a given position;
data is stored in the form of cell attributes;
- Cells are stored in column major order - this makes accessing cells with
the same column index (i.e. data for a given genomic position over all
samples) fast.
- Variant interval/gVCF interval data is stored in a cell at the start of the
interval. The END is stored as a cell attribute. For variant intervals
(such as deletions and gVCF REF blocks), an additional cell is stored at
the END value of the variant interval. When queried for a given genomic
position, the query library performs an efficient sweep to determine all
intervals that intersect with the queried position.
This package contains some tools to be run as executable files.
|
|
gentle
??? missing short description for package gentle :-(
|
Versions of package gentle |
Release | Version | Architectures |
sid | 1.9.5~alpha2+dfsg1-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.9+cvs20100605+dfsg1-10 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.9.5~alpha2+dfsg1-2 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 1.9+cvs20100605+dfsg1-9 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 1.9+cvs20100605+dfsg1-3 | amd64,armel,armhf,i386 |
stretch | 1.9+cvs20100605+dfsg1-5 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 1.9+cvs20100605+dfsg1-7 | amd64,arm64,armhf,i386 |
Debtags of package gentle: |
biology | nuceleic-acids, peptidic |
field | biology, biology:bioinformatics |
interface | x11 |
role | program |
uitoolkit | wxwidgets |
|
License: DFSG free
|
|
|
gff2aplot
pair-wise alignment-plots for genomic sequences in PostScript
|
Versions of package gff2aplot |
Release | Version | Architectures |
bullseye | 2.0-13 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 2.0-7 | amd64,armel,armhf,i386 |
sid | 2.0-15 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 2.0-11 | amd64,arm64,armhf,i386 |
bookworm | 2.0-14 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 2.0-15 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 2.0-8 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
Debtags of package gff2aplot: |
field | biology, biology:bioinformatics |
interface | commandline, shell |
role | program |
scope | utility |
use | converting, viewing |
works-with | image:vector |
works-with-format | plaintext, postscript |
|
License: DFSG free
|
A program to visualize the alignment of two genomic sequences together with
their annotations. From GFF-format input files it produces PostScript figures
for that alignment.
The following menu lists many features of gff2aplot:
- Comprehensive alignment plots for any GFF-feature. Attributes are defined
separately so you can modify only whatsoever attributes for a given file or
share same customization across different data-sets.
- All parameters are set by default within the program, but it can be also
fully configured via gff2ps-like flexible customization files. Program can
handle several of such files, summarizing all the settings before producing
the corresponding figure. Moreover, all customization parameters can be set
via command-line switches, which allows users to play with those parameters
before adding any to a customization file.
- Source order is taken from input files, if you swap file order you can
visualize alignment and its annotation with the new input arrangement.
- All alignment scores can be visualized in a PiP box below gff2aplot area,
using grey-color scale, user-defined color scale or score-dependent
gradients.
- Scalable fonts, which can also be chosen among the basic PostScript default
fonts. Feature and group labels can be rotated to improve readability in
both annotation axes.
- The program is still defined as a Unix filter so it can handle data from
files, redirections and pipes, writing output to standard-output and
warnings to standard error.
- gff2aplot is able to manage many physical page formats (from A0 to A10, and
more -see available page sizes in its manual-), including user-defined ones.
This allows, for instance, the generation of poster size genomic maps, or
the use of a continuous-paper supporting plotting device, either in portrait
or landscape.
- You can draw different alignments on same alignment plot and distinguish
them by using different colors for each.
- Shape dictionary has been expanded, so that further feature shapes are now
available (see manual).
- Annotation projections through alignment plots (so called ribbons) emulate
transparencies via complementary color fill patterns. This feature allows
one to show color pseudo-blending when horizontal and vertical ribbons
overlap.
|
|
gff2ps
produces PostScript graphical output from GFF-files
|
Versions of package gff2ps |
Release | Version | Architectures |
bullseye | 0.98l-4 | all |
sid | 0.98l-6 | all |
trixie | 0.98l-6 | all |
bookworm | 0.98l-6 | all |
buster | 0.98l-2 | all |
stretch | 0.98d-5 | all |
jessie | 0.98d-4 | all |
Debtags of package gff2ps: |
field | biology, biology:bioinformatics |
interface | commandline |
role | program |
scope | utility |
use | converting, viewing |
works-with | image:vector |
works-with-format | postscript |
|
License: DFSG free
|
gff2ps is a script program developed with the aim of converting gff-formatted
records into high quality one-dimensional plots in PostScript. Such plots
maybe useful for comparing genomic structures and to visualizing outputs from
genome annotation programs.
It can be used in a very simple way, because it assumes that the GFF file
itself carries enough formatting information, but it also allows through a
number of options and/or a configuration file, for a great degree of
customization.
|
|
gffread
GFF/GTF format conversions, region filtering, FASTA sequence extraction
|
Versions of package gffread |
Release | Version | Architectures |
trixie | 0.12.7-8 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 0.12.7-8 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 0.12.1-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 0.12.7-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
Gffread is a GFF/GTF parsing utility providing format conversions,
region filtering, FASTA sequence extraction and more.
|
|
ggd-utils
|
Versions of package ggd-utils |
Release | Version | Architectures |
bullseye | 0.0.7+ds-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bookworm | 1.0.0+ds-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 1.0.0+ds-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
trixie | 1.0.0+ds-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
|
License: DFSG free
|
Takes a genome file and (currently) a .vcf.gz or
a .bed.gz and checks that:
* a .tbi is present
* the VCF has ""##fileformat=VCF" as the first
line
* the VCF has a #CHROM header
* the chromosome are in the order specified by
the genome file (and present)
* the positions are sorted
* the positions are <= the chromosome lengths
defined in the genome file.
As a result, any new genome going into GGD will have
a .genome file that will dictate the sort order
and presence or absence of the 'chr' prefix for chromosomes
|
|
ghemical
GNOME molecular modelling environment
|
Versions of package ghemical |
Release | Version | Architectures |
buster | 3.0.0-4 | amd64,arm64,armhf,i386 |
stretch | 3.0.0-2 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
sid | 3.0.0-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,s390x |
bookworm | 3.0.0-5 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 3.0.0-5 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
Debtags of package ghemical: |
field | chemistry |
interface | 3d, x11 |
role | program |
suite | gnome |
uitoolkit | gtk |
use | editing, learning, viewing |
works-with | 3dmodel |
x11 | application |
|
License: DFSG free
|
Ghemical is a computational chemistry software package written in C++.
It has a graphical user interface and it supports both quantum-
mechanics (semi-empirical) models and molecular mechanics models.
Geometry optimization, molecular dynamics and a large set of
visualization tools using OpenGL are currently available.
Ghemical relies on external code to provide the quantum-mechanical
calculations. Semi-empirical methods MNDO, MINDO/3, AM1 and PM3 come
from the MOPAC7 package (Public Domain), and are included in the
package. The MPQC package is used to provide ab initio methods: the
methods based on Hartree-Fock theory are currently supported with
basis sets ranging from STO-3G to 6-31G**.
|
|
ghmm
General Hidden-Markov-Model library - tools
|
Versions of package ghmm |
Release | Version | Architectures |
trixie | 0.9~rc3-9 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 0.9~rc3-9 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 0.9~rc3-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 0.9~rc3-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 0.9~rc3-2 | amd64,arm64,armhf,i386 |
|
License: DFSG free
|
The General Hidden Markov Model Library (GHMM) is a C library with
additional Python3 bindings implementing a wide range of types of
Hidden Markov Models and algorithms: discrete, continuous emissions,
basic training, HMM clustering, HMM mixtures.
This package contains some tools using the library.
|
|
glam2
gapped protein motifs from unaligned sequences
|
Versions of package glam2 |
Release | Version | Architectures |
jessie | 1064-3 | amd64,armel,armhf,i386 |
bookworm | 1064-9 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 1064-9 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 1064-5 | amd64,arm64,armhf,i386 |
stretch | 1064-3 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bullseye | 1064-9 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1064-9 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
Debtags of package glam2: |
field | biology, biology:bioinformatics |
interface | commandline |
role | program |
scope | utility |
use | analysing, comparing, searching |
works-with-format | plaintext |
|
License: DFSG free
|
GLAM2 is a software package for finding motifs in sequences, typically
amino-acid or nucleotide sequences. A motif is a re-occurring sequence
pattern: typical examples are the TATA box and the CAAX prenylation motif. The
main innovation of GLAM2 is that it allows insertions and deletions in motifs.
This package includes programs for discovering motifs shared by a set of
sequences and finding matches to these motifs in a sequence database, as well
as utilities for converting glam2 motifs to standard alignment formats,
masking glam2 motifs out of sequences so that weaker motifs can be found, and
removing highly similar members of a set of sequences.
The package includes these programs:
glam2: discovering motifs shared by a set of sequences;
glam2scan: finding matches, in a sequence database, to a motif discovered
by glam2;
glam2format: converting glam2 motifs to standard alignment formats;
glam2mask: masking glam2 motifs out of sequences, so that weaker motifs
can be found;
glam2-purge: removing highly similar members of a set of sequences.
In this binary package, the fast Fourier algorithm (FFT) was enabled for the
glam2 program.
|
|
gmap
spliced and SNP-tolerant alignment for mRNA and short reads
|
Versions of package gmap |
Release | Version | Architectures |
jessie | 2014-10-22-1 (non-free) | amd64 |
bullseye | 2021-02-22+ds-1 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el |
stretch | 2017-01-14-1 (non-free) | amd64 |
buster | 2019-01-24-1 (non-free) | amd64 |
trixie | 2024-10-10+ds-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64 |
bookworm | 2021-12-17+ds-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el |
sid | 2024-10-10+ds-1 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64 |
upstream | 2024-10-20 |
Debtags of package gmap: |
field | biology, biology:bioinformatics, biology:structural |
role | program |
use | analysing |
|
License: DFSG free
|
This package contains the programs GMAP and GSNAP as well as
utilities to manage genome databases in GMAP/GSNAP format.
GMAP (Genomic Mapping and Alignment Program) is a tool for aligning
EST, mRNA and cDNA sequences.
GSNAP (Genomic Short-read Nucleotide Alignment Program) is a tool for
aligning single-end and paired-end transcriptome reads.
Both tools can use a database of
- known splice sites and identify novel splice sites.
- known single-nucleotide polymorphisms (SNPs).
GSNAP can align bisulfite-treated DNA.
|
|
grabix
wee tool for random access into BGZF files
|
Versions of package grabix |
Release | Version | Architectures |
bullseye | 0.1.7-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el |
bookworm | 0.1.7-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 0.1.7-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 0.1.7-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 0.1.7-1 | amd64,arm64,armhf |
|
License: DFSG free
|
In biomedical research it is increasing practice to study
the genetic basis of disease. This now frequently comprises
the sequencing of human sequences. The output of the machine
however is redundant, and the real sequence is the best
sequence to explain the redundancy. The exchange of data
happens only with compressed files - to huge and redundant
to perform otherwise. One should avoid uncompression whenever
possible.
grabix leverages the fantastic BGZF library of the samtools
package to provide random access into text files that have
been compressed with bgzip. grabix creates it's own index
(.gbi) of the bgzipped file. Once indexed, one can extract
arbitrary lines from the file with the grab command. Or
choose random lines with the, well, random command.
|
|
graphlan
circular representations of taxonomic and phylogenetic trees
|
Versions of package graphlan |
Release | Version | Architectures |
bookworm | 1.1.3-4 | all |
sid | 1.1.3-6 | all |
bullseye | 1.1.3-2 | all |
trixie | 1.1.3-6 | all |
buster | 1.1.3-1 | all |
stretch | 1.1-2 | all |
|
License: DFSG free
|
GraPhlAn is a software tool for producing high-quality circular
representations of taxonomic and phylogenetic trees. It focuses on
concise, integrative, informative, and publication-ready representations
of phylogenetically- and taxonomically-driven investigation.
|
|
grinder
Versatile omics shotgun and amplicon sequencing read simulator
|
Versions of package grinder |
Release | Version | Architectures |
stretch | 0.5.4-1 | all |
bullseye | 0.5.4-6 | all |
bookworm | 0.5.4-6 | all |
sid | 0.5.4-6 | all |
trixie | 0.5.4-6 | all |
jessie | 0.5.3-3 | all |
buster | 0.5.4-5 | all |
|
License: DFSG free
|
Grinder is a versatile program to create random shotgun and amplicon sequence
libraries based on DNA, RNA or proteic reference sequences provided in a
FASTA file.
Grinder can produce genomic, metagenomic, transcriptomic, metatranscriptomic,
proteomic, metaproteomic shotgun and amplicon datasets from current
sequencing technologies such as Sanger, 454, Illumina. These simulated
datasets can be used to test the accuracy of bioinformatic tools under
specific hypothesis, e.g. with or without sequencing errors, or with low or
high community diversity. Grinder may also be used to help decide between
alternative sequencing methods for a sequence-based project, e.g. should the
library be paired-end or not, how many reads should be sequenced.
|
|
gromacs
Molecular dynamics simulator, with building and analysis tools
|
Versions of package gromacs |
Release | Version | Architectures |
trixie | 2024.3-2 | amd64,arm64,mips64el,ppc64el,riscv64,s390x |
bookworm | 2022.5-2 | amd64,arm64,mips64el,ppc64el,s390x |
stretch | 2016.1-2 | amd64,arm64,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bullseye | 2020.6-2 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
jessie | 5.0.2-1 | amd64,armel,armhf,i386 |
buster | 2019.1-1 | amd64,arm64,armhf,i386 |
sid | 2024.3-2 | amd64,arm64,mips64el,ppc64el,riscv64,s390x |
upstream | 2024.4 |
Debtags of package gromacs: |
field | biology, biology:structural, chemistry |
interface | commandline, x11 |
role | program |
uitoolkit | xlib |
x11 | application |
|
License: DFSG free
|
GROMACS is a versatile package to perform molecular dynamics, i.e. simulate
the Newtonian equations of motion for systems with hundreds to millions of
particles.
It is primarily designed for biochemical molecules like proteins and lipids
that have a lot of complicated bonded interactions, but since GROMACS is
extremely fast at calculating the nonbonded interactions (that usually
dominate simulations) many groups are also using it for research on non-
biological systems, e.g. polymers.
This package contains variants both for execution on a single machine, and
using the MPI interface across multiple machines.
|
|
gsort
|
Versions of package gsort |
Release | Version | Architectures |
bookworm | 0.1.4-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 0.1.4-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
sid | 0.1.4-3 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bullseye | 0.1.4-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
gsort is a tool to sort genomic files according to a genomefile.
For example, to sort VCF to have order:
X, Y, 2, 1, 3, ... and the header needs to be kept at the top.
As a more likely example, if a file nneds to be sorted to match GATK
order (1 ... X, Y, MT) which is not possible with any other sorting
tool. With gsort one can simply place MT as the last chrom in the
".genome" file.
It will also be useful for getting files ready for use in bedtools.
|
|
gubbins
phylogenetic analysis of genome sequences
|
Versions of package gubbins |
Release | Version | Architectures |
trixie | 3.3.5-1 | amd64,i386 |
sid | 3.3.5-1 | amd64,i386 |
stretch | 2.2.0-1 | amd64,i386 |
bullseye | 2.4.1-4 | amd64,i386 |
bookworm | 2.4.1-5 | amd64,i386 |
buster | 2.3.4-1 | amd64,i386 |
|
License: DFSG free
|
Gubbins supports rapid phylogenetic analysis of large samples of
recombinant bacterial whole genome sequences.
Gubbins (Genealogies Unbiased By recomBinations In Nucleotide
Sequences) is an algorithm that iteratively identifies loci containing
elevated densities of base substitutions while concurrently constructing
a phylogeny based on the putative point mutations outside of these
regions. Simulations demonstrate the algorithm generates highly accurate
reconstructions under realistic models of short-term bacterial
evolution, and can be run in only a few hours on alignments of hundreds
of bacterial genome sequences.
|
|
gwama
Genome-Wide Association Meta Analysis
|
Versions of package gwama |
Release | Version | Architectures |
trixie | 2.2.2+dfsg-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 2.2.2+dfsg-5 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
sid | 2.2.2+dfsg-5 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
buster | 2.2.2+dfsg-2 | amd64,arm64,armhf,i386 |
stretch | 2.2.2+dfsg-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
bullseye | 2.2.2+dfsg-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
|
License: DFSG free
|
GWAMA (Genome-Wide Association Meta Analysis) software performs
meta-analysis of the results of GWA studies of binary or
quantitative phenotypes. Fixed- and random-effect meta-analyses are
performed for both directly genotyped and imputed SNPs using
estimates of the allelic odds ratio and 95% confidence interval for
binary traits, and estimates of the allelic effect size and standard
error for quantitative phenotypes. GWAMA can be used for analysing
the results of all different genetic models (multiplicative,
additive, dominant, recessive). The software incorporates error
trapping facilities to identify strand alignment errors and allele
flipping, and performs tests of heterogeneity of effects between studies.
|
|
harvest-tools
archiving and postprocessing for reference-compressed genomic multi-alignments
|
Versions of package harvest-tools |
Release | Version | Architectures |
sid | 1.3-8 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
bookworm | 1.3-8 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
bullseye | 1.3-6 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
trixie | 1.3-8 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 1.3-1 | amd64,arm64,armel,armhf,i386,mips,mips64el,mipsel,ppc64el,s390x |
buster | 1.3-4 | amd64,arm64,armhf,i386 |
|
License: DFSG free
|
HarvestTools is a utility for creating and interfacing with Gingr files,
which are efficient archives that the Harvest Suite uses to store
reference-compressed multi-alignments, phylogenetic trees, filtered
variants and annotations. Though designed for use with Parsnp and Gingr,
HarvestTools can also be used for generic conversion between standard
bioinformatics file formats.
|
|
hhsuite
sensitive protein sequence searching based on HMM-HMM alignment
|
Versions of package hhsuite |
Release | Version | Architectures |
bookworm | 3.3.0+ds-7 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el |
stretch | 3.0~beta2+dfsg-3 | amd64 |
trixie | 3.3.0+ds-8 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64 |
bullseye | 3.3.0+ds-4 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el |
sid | 3.3.0+ds-8 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64 |
buster | 3.0~beta3+dfsg-3 | amd64 |
jessie | 2.0.16-5 | amd64 |
|
License: DFSG free
|
HH-suite is an open-source software package for sensitive protein sequence
searching based on the pairwise alignment of hidden Markov models (HMMs).
This package contains HHsearch and HHblits among other programs and utilities.
HHsearch takes as input a multiple sequence alignment (MSA) or profile HMM
and searches a database of HMMs (e.g. PDB, Pfam, or InterPro) for homologous
proteins. HHsearch is often used for protein structure prediction to detect
homologous templates and to build highly accurate query-template pairwise
alignments for homology modeling.
HHblits can build high-quality MSAs starting from single sequences or from
MSAs. It transforms these into a query HMM and, using an iterative search
strategy, adds significantly similar sequences from the previous search to
the updated query HMM for the next search iteration. Compared to PSI-BLAST,
HHblits is faster, up to twice as sensitive and produces more accurate
alignments.
|
|
hilive
realtime alignment of Illumina reads
|
Versions of package hilive |
Release | Version | Architectures |
bookworm | 2.0a-3 | amd64,arm64,armel,armhf,i386,mips64el,mipsel,ppc64el,s390x |
buster | 1.1-2 | amd64,arm64,armhf |
sid | 2.0a-4 | amd64,arm64,armel,armhf,i386,mips64el,ppc64el,riscv64,s390x |
stretch | 0.3-2 | amd64,arm64,armel,i386,mips64el,mipsel,ppc64el |
bullseye | 2.0a-3 | amd64,arm64,armel,armh | | |