TestPage

	Type	Url
.NET BIO/URL 0
/URL 0
4peaks/URL 0
4peaks/URL 1
A5/URL 0
A5/URL 1
AB Large Indel Tool/URL 0
AB Small Indel Tool/URL 0
AB Small Indel Tool/URL 1
AB Small Indel Tool/URL 2
ABBA/URL 0
ABMapper/URL 0
ABySS/URL 0
ABySS/URL 2
ABySS/URL 4
ABySS/URL 5
ACE/URL 0
ADTEx/URL 0
ADTEx/URL 1
ADTEx/URL 2
ADTEx/URL 3
ADTEx/URL 4
ADTEx/URL 5
AFG/URL 0
AGE/URL 0
AGILE/URL 0
AGP/URL 0
ALEXA-Seq/URL 0
ALLPATHS/URL 0
AMOS/URL 0
ANCHOR/URL 0
ANNOVAR/URL 0
ANNOVAR/URL 1
ANNOVAR/URL 2
AREM/URL 0
ASC/URL 0
ATAC/URL 0
ATAC/URL 1
ATAC/URL 2
Adapter Removal/URL 0
Adapter Removal/URL 1
Agp2amos/URL 0
Alcovna/URL 0
Alta-Cyclic/URL 0
AnCorr/URL 0
Anno-J/URL 0
Arachne/URL 0
Arf/URL 0
Array Suite (Array Studio/Server)/URL 0
ArrayExpressHTS/URL 0
ArrayExpressHTS/URL 1
ArrayStar/URL 0
Atlas Suite/URL 0
Atlas Suite/URL 1
Atlas-SNP2/URL 0
Atlas-SNP2/URL 1
Avadis NGS/URL 0
Avadis NGS/URL 1
Avadis NGS/URL 2
BAM/URL 0
BAMseek/URL 0
BAMseek/URL 1
BBMap/URL 0
BBSeq/URL 0
BEAP/URL 0
BEDTools/URL 0
BFAST/URL 0
BFAST/URL 1
BFCounter/URL 0
BFCounter/URL 1
BING/URL 0
BLAST Ring Image Generator/URL 0
BLAST Ring Image Generator/URL 1
BLAST/URL 0
BLAST/URL 1
BLAT/URL 0
BLAT/URL 1
BOAT/URL 0
BOW/URL 0
BRAT/URL 0
BRCA-diagnostic/URL 0
BS Seeker/URL 0
BS-Seq/URL 0
BSMAP/URL 0
BSSim/URL 0
BWA-SW/URL 0
BWA/URL 0
BWA/URL 1
Baa.pl/URL 0
BamTools/URL 1
BamView/URL 0
Bambino/URL 0
Bambus/URL 0
Barcode generator/URL 0
Barcode generator/URL 1
Barcrawl Bartab/URL 0
BarraCUDA/URL 0
BarraCUDA/URL 1
Batman/URL 0
BaySeq/URL 0
BayesCall/URL 0
BayesPeak/URL 0
BayesPeak/URL 1
Bcbio-nextgen/URL 0
Bedutils/URL 0
Belvu/URL 0
BiQ Analyzer HT/URL 0
BiQ Analyzer/URL 0
BigBWA/URL 0
BigBWA/URL 1
BioJava/URL 0
BioJava/URL 1
BioNumerics/URL 0
BioPHP/URL 0
BioPerl/URL 0
BioRuby/URL 0
BioRuby/URL 1
BioSmalltalk/URL 0
Bionimbus/URL 0
Biopieces/URL 0
Bis-SNP/URL 0
Bismark/URL 0
Bison/URL 0
Blixem/URL 0
Bowtie/URL 0
BreakDancer/URL 0
BreakDancer/URL 1
BreakSeq/URL 0
BreakTrans/URL 0
Breakpointer/URL 0
Breakway/URL 0
Btrim/URL 0
CABOG/URL 0
CAF/URL 0
CANGS/URL 0
CANGS/URL 1
CANGS/URL 2
CARPET/URL 0
CASHX/URL 0
CATCH/URL 0
CEQer/URL 0
CEQer/URL 1
CGA Tools/URL 0
CHiCAGO/URL 0
CHiCAGO/URL 1
CHiCAGO/URL 2
CHiPSeq/URL 0
CLCbio Genomics Workbench/URL 0
CLCbio Genomics Workbench/URL 1
CLEVER/URL 0
CLEVER/URL 1
CNANorm/URL 0
CNAseg/URL 0
CNVator/URL 0
CNVer/URL 0
CNVkit/URL 0
CNVkit/URL 1
CNVnator/URL 0
CNVseq/URL 0
CORAL (Contig Ordering Algorithm)/URL 0
CPTRA/URL 0
CRAC/URL 0
CRAVAT/URL 0
CRISP/URL 0
CUDA-EC/URL 0
CatchAll/URL 0
CexoR/URL 0
ChIP-Seq (application)/URL 0
ChIP-Seq (application)/URL 1
ChIPMunk/URL 0
ChIPMunk/URL 1
ChIPMunk/URL 2
ChIPmeta/URL 0
ChIPmeta/URL 1
ChIPseqR/URL 0
ChimeraScan/URL 0
Chipster/URL 0
Chipster/URL 1
Chipster/URL 2
Chipster/URL 3
ChromHMM/URL 0
Circos/URL 0
Circos/URL 1
Circos/URL 2
Circos/URL 3
Cistrome/URL 0
Clean reads/URL 0
CleaveLand/URL 0
ClipCrop/URL 0
CloudAligner/URL 0
CloudAligner/URL 1
CloudBurst/URL 0
ClustDB/URL 0
Cluster Flow/URL 0
Cluster Flow/URL 1
Cluster Flow/URL 2
Cluster Flow/URL 3
CnD/URL 0
CnvHMM/URL 0
CoNAn-SNV/URL 0
CompreheNGSive/URL 0
ConDeTri/URL 0
ContEst/URL 0
ContEst/URL 1
Contra/URL 0
Contra/URL 1
Contrail/URL 0
Contrail/URL 1
CopySeq/URL 0
CopySeq/URL 1
Coral/URL 0
Cortex/URL 0
Crossbow/URL 0
Cufflinks/URL 0
CummeRbund/URL 0
Curtain/URL 0
Custom Bismark output format/URL 0
Cutadapt/URL 0
Cutadapt/URL 1
DCLIP/URL 0
DEGseq/URL 0
DEGseq/URL 1
DESeq/URL 0
DESeq/URL 1
DESeq/URL 2
DESeq/URL 3
DIAL/URL 0
DNA Baser/URL 0
DNA Baser/URL 1
DNA Baser/URL 2
DNA Baser/URL 3
DNA Baser/URL 4
DNA Baser/URL 5
DNA Baser/URL 6
DNA Chromatogram Explorer/URL 0
DNAA/URL 0
DNAzip/URL 0
DNaseR/URL 1
DSAP/URL 0
DSGseq/URL 0
DSRC/URL 0
DeFuse/URL 0
DecGPU/URL 0
DeconSeq/URL 0
DeconSeq/URL 1
DeepTools/URL 0
DeepTools/URL 1
DeepTools/URL 2
DeepTools/URL 3
DiBayes/URL 0
DiffBind/URL 0
DiffBind/URL 1
DiffBind/URL 2
DiffBind/URL 3
Diffreps/URL 0
Dindel/URL 0
DiscoSnp/URL 0
DrFAST/URL 0
E-miR/URL 0
EBCall/URL 0
ECHO/URL 0
EDENA/URL 0
ERANGE/URL 0
ERDS/URL 0
ERGO Genome Analysis and Discovery System/URL 0
ERNE/URL 0
ESTcalc/URL 0
EULER/URL 0
Ea-utils/URL 0
Ea-utils/URL 1
EaSeq/URL 0
EaSeq/URL 1
EaSeq/URL 2
EagleView genome viewer/URL 0
EagleView/URL 0
EagleView/URL 1
Easyfig/URL 0
EdgeR/URL 0
EdgeR/URL 1
EdgeR/URL 2
EpiGRAPH/URL 0
Epigenome/URL 0
Error Correction Evaluation Toolkit/URL 0
ExomeCNV/URL 0
ExomeCopy/URL 0
ExomeCopy/URL 1
ExomePicks/URL 0
Exonerate/URL 0
FAAST/URL 0
FACS/URL 0
FAST: Fast Analysis of Sequences Toolbox/URL 0
FAST: Fast Analysis of Sequences Toolbox/URL 1
FASTA/URL 0
FDM/URL 0
FHiTINGS/URL 0
FHiTINGS/URL 1
FLASH/URL 0
FPS/URL 0
FREEC/URL 0
FaBox/URL 0
Falcon/URL 0
FastQ Screen/URL 0
FastQC/URL 0
FastQC/URL 1
FastQC/URL 2
FastQValidator/URL 0
FastQValidator/URL 1
FeatureCounts/URL 0
Figaro/URL 0
Figaro/URL 1
Filter/URL 0
Filter/URL 1
FindPeaks 3.1/URL 0
FindPeaks 4.0 (Vancouver Short Read Package)/URL 0
Flexbar/URL 0
Flexbar/URL 1
Flexbar/URL 2
FlowSim/URL 0
Flower/URL 0
Flux/URL 0
Forge/URL 0
FragGeneScan/URL 0
FrameDP/URL 0
FreClu/URL 0
Freebayes/URL 0
Freebayes/URL 1
FusionAnalyser/URL 0
FusionAnalyser/URL 1
FusionCatcher/URL 0
FusionCatcher/URL 1
FusionHunter/URL 0
FusionMap/URL 0
FusionSeq/URL 0
FusionSeq/URL 1
FusionSeq/URL 2
FusionSeq/URL 3
Fuzzypath/URL 0
G-Mo.R-Seq/URL 0
G-SQZ/URL 0
GAMES/URL 0
GAR/URL 0
GASSST/URL 0
GASV/URL 0
GATK/URL 0
GBrowse/URL 0
GEM library/URL 0
GEM/URL 0
GENALICE MAP/URL 0
GENALICE MAP/URL 1
GENE-Counter/URL 0
GENE-Counter/URL 1
GFF3/URL 0
GMAP/URL 0
GPS/URL 0
GPSeq/URL 0
GRIDSS/URL 0
GRIDSS/URL 1
GRS/URL 0
GSNAP/URL 0
Galaxy/URL 0
Galaxy/URL 1
Galign/URL 0
GeeFu/URL 0
GenVision/URL 0
GeneProf/URL 0
GeneTalk/URL 0
Genedata Expressionist/URL 0
Genedata Expressionist/URL 1
Geneious/URL 0
Geneious/URL 1
GenoMiner/URL 0
GenoMiner/URL 1
GenoMiner/URL 2
GenoREAD/URL 0
GenoViewer/URL 0
GenoViewer/URL 1
Genomatix Mining Station (GMS)/URL 0
Genome Trax/URL 0
GenomeBrowse/URL 0
GenomeJack/URL 0
GenomeMapper/URL 0
GenomeTools/URL 0
GenomeTools/URL 1
GenomeView/URL 0
GenomeView/URL 1
GenomeView/URL 2
GenomeView/URL 3
Genomedata/URL 0
Genomedata/URL 1
Genomedata/URL 2
Genomedata/URL 3
Genomedata/URL 4
Genomedata/URL 5
Genometa/URL 0
GenomicTools/URL 0
GensearchNGS/URL 0
GensearchNGS/URL 1
GensearchNGS/URL 2
Geoseq/URL 0
GigaBayes/URL 0
GimmeMotifs/URL 0
GimmeMotifs/URL 1
Girafe/URL 0
Gk arrays/URL 0
Gk arrays/URL 1
Gk arrays/URL 2
GlobalSeq/URL 0
GlobalSeq/URL 1
GlobalSeq/URL 5
Gnumap/URL 0
Gnumap/URL 1
Goby framework/URL 0
Goby framework/URL 1
Goby framework/URL 2
Golden Helix/URL 0
Goseq/URL 0
Gowinda/URL 0
Gowinda/URL 1
Graphviz dot/URL 0
Graphviz dot/URL 1
HI/URL 0
HINT/URL 0
HINT/URL 1
HINT/URL 2
HMMSplicer/URL 0
HPeak/URL 0
HTSeq/URL 0
Hairpin Annotation/URL 0
Hairpin Annotation/URL 1
Haplowser/URL 0
HawkEye/URL 0
HeliSphere/URL 0
HiPipe/URL 0
HiTEC/URL 0
Hicup/URL 0
Hybrid-SHREC/URL 0
IBD2/URL 0
ICORN/URL 0
IDBA/URL 0
IDBA/URL 1
IGV/URL 0
IOmics/URL 0
IQSeq/URL 0
ISAAC/URL 0
ISAAC/URL 1
ISSAKE/URL 0
Ibis/URL 0
Illuminate/URL 0
Illuminate/URL 1
Illuminator/URL 0
InGAP/URL 0
Inchworm/URL 0
Ingenuity Variant Analysis/URL 0
Integrated Genome Browser/URL 0
Iobio/URL 0
Isas/URL 0
IsoEM/URL 0
JBrowse/URL 0
JBrowse/URL 1
Jellyfish/URL 0
JointSLM/URL 0
KARMA/URL 0
KBASE/URL 0
KNIME/URL 0
Kismeth/URL 0
Kissnp/URL 0
Kmergenie/URL 0
Knime4Bio/URL 0
Krona/URL 0
LAST/URL 0
LASTZ/URL 0
LOCAS/URL 0
LSKB, Life Science Knowledge Bank/URL 0
LSKB, Life Science Knowledge Bank/URL 1
Lab7/URL 0
Lab7/URL 1
Lasergene/URL 0
LobSTR/URL 0
LookSeq/URL 0
MACS google group/URL 0
MACS/URL 0
MAQ/URL 0
MAQGene/URL 0
MAXIMUS/URL 0
MAYDAY/URL 0
MEGAN/URL 0
METAGENassist/URL 0
MG-RAST/URL 0
MG-RAST/URL 1
MG-RAST/URL 2
MIP Scaffolder/URL 0
MIRA/URL 0
MIRA/URL 1
MISO/URL 0
MMSEQ/URL 0
MOM/URL 0
MOSAIK/URL 0
MPscan/URL 0
MPscan/URL 1
MPscan/URL 2
... further results

Name	Summary	Bio Tags	Meth Tags	Features	Language	Licence	OS
.NET BIO	".NET Bio is an open source library of common bioinformatics functions, intended to simplify the creation of life science applications. The core library implements a range of file parsers and formatters for common file types, connectors to commonly-used web services such as NCBI BLAST, and standard algorithms for the comparison and assembly of DNA, RNA and protein sequences. Sample tools and code snippets are also included."	Sequence analysis			C#		Windows Linux
4peaks	Allows viewing sequencing trace files, motif searching trimming, BLAST and exporting sequences.	Sequencing	Sequence analysis			Freeware	Mac OS X
A5	A5 is an integrative pipeline for genome assembly that automates sequence data cleaning, error correction, assembly, and quality control by chaining a number of programs together with additional custom algorithms.	Sequence assembly	De-novo assembly			GPLv3	Linux Mac OS X
AB Large Indel Tool	Identifies deviations in clone insert size that indicate intra-chromosomal structural variations compared to a reference genome.	Indel detection Sequencing	Mapping		Perl	GPL	Linux 64
AB Small Indel Tool	The SOLiD™ Small Indel Tool processes the indel evidences found in the pairing step of the SOLiD™ System Analysis pipeline Tool (Corona Lite).	Indel detection Sequencing	Read mapping Sequence alignment		Perl C++	GPL	Linux 64
ABBA	Assembly Boosted By Amino acid sequence is a comparative gene assembler, which uses amino acid sequences from predicted proteins to help build a better assembly	Sequence assembly	Sequence assembly Scaffolding			Artistic License	Linux
ABMapper	Maps RNA-Seq reads to target genome considering possible multiple mapping locations and splice junctions	Genomics Transcriptomics	Read mapping Sequence alignment		C++ Perl	GPLv3	Linux
ABySS	ABySS is a de novo sequence assembler designed for short reads and large genomes.	Sequence assembly (de novo assembly)	Sequence assembly Sequence assembly (de-novo assembly)	MPI OpenMP	C++	Commercial Freeware	POSIX Linux Mac OS X
Adapter Removal (software)	Removes adaptor fragments from raw short read sequence data and outputs data to FASTA format.	Workflows	Adapter removal	Sequence trimming	Java	Custom Licence	Linux 64 Windows Mac OS X
ADTEx	Aberration Detection in Tumour Exome (ADTEx) is a tool for copy number variation (CNV) detection for whole-exome data from paired tumour/matched normal samples.	Copy number estimation Exome analysis Cancer biology Next Generation Sequencing	Statistical calculation	Copy number analysis	Python R	GPLv3	GNU/Linux
AGE	AGE is a tool that implements an algorithm for optimal alignment of sequences with SVs.	Structural variation	Sequence alignment			Creative Commons license (Attribution-NonCommerical).
AGILE	A hash table based high throughput sequence mapping algorithm for longer 4A54 reads that uses diagonal multiple seed-match criteria, customized q-gram filtering and a dynamic incremental search approach among other heuristics to optimize every step of the mapping process		Mapping		C
Agp2amos	missing		Formatting				Windows Linux
Alcovna	ALgorithms for COmparing and Visualizing Non Assembled data	SNP detection			Java
ALEXA-Seq	Alternative Expression Analysis by massively parallel RNA sequencing	RNA-Seq quantification Alternative splicing			Perl	GPLv3
ALLPATHS	De novo assembly of whole-genome shotgun microreads.	Sequence assembly (de novo assembly)	Sequence assembly Sequence assembly (de-novo assembly)
Alta-Cyclic	Alta-Cyclic is a Illumina Genome-Analyzer (Solexa) base caller.		Base-calling
AMOS	AMOS is a Modular, Open-Source whole genome assembler.		Sequence assembly Sequence assembly validation Integrated solution Formatting Sequence assembly visualisation		C Perl		Linux
ANCHOR	Post-processing tools for de novo assemblies	De-novo assembly	Sequence assembly		C++ Python	BCCA (academic use)	Linux
AnCorr	Webserver/tool for evaluation coordinate and ordinal correlations between genomic tracks and/or expression or protein binding profiles.	Genomics	Statistical calculation Correlation	Easy-to-use point-and-click web interface with context help	Perl C CGI PHP JavaScript	free for non-commercial academic research	Cross-Platform
Anno-J	Annotation Browsing 2.0	Sequencing	Visualisation			Creative Commons - Attribution-NonCommercial-ShareAlike
ANNOVAR	ANNOVAR: Functional annotation of genetic variants from high-throughput sequencing data	Genomics Genetics	Annotation Variant prioritisation	Gene-based annotation region-based annotation filter-based annotation	Perl	Commercial Freeware	Linux Windows Mac OS X
Arachne	ARACHNE is a program for assembling data from whole genome shotgun sequencing experiments.		Sequence assembly
AREM	AREM: Aligning Short Reads from ChIP-sequencing by Expectation Maximisation	ChIP-seq	Read mapping Peak calling		Python		Linux
Arf	arf is a genetic analysis program for sequencing data.
Array Suite (Array Studio/Server)	Array Studio is a complete analysis and visualization package for NextGen sequencing data, as well as other -OMIC data types. Array Server is a backend enterprise server for storage and analysis of -OMIC and NextGen sequencing data.	Genomics SNP detection Indel detection	Read mapping Gene expression profiling	Data Visualisation Variant annotation and analysis coverage analysis Read mapping	C#	Commercial	Windows
ArrayExpressHTS	R-based pipeline for RNA-Seq data analysis.	RNA-Seq RNA-Seq quantification			R
ArrayStar	ArrayStar is an easy-to-use gene expression analysis software package that offers powerful visualization and statistical tools to help you analyze your microarray data.	Gene expression analysis	Statistical calculation Differential expression analysis Ontology comparison Genetic variation analysis			Commercial	Windows 7 64-bit or Higher Mac OS X 10.7 10.8 or 10.9 with Parallels Desktop
ASC	Empirical Bayes method to detect differential expression.	RNA-Seq quantification	Statistical calculation and probability
ATAC	ATAC is a computational process for comparative mapping between two genome assemblies, or between two different genomes.		Sequence alignment Sequence assembly validation				Linux
Atlas Suite	Atlas is a suite of variant analysis tools specializing in the separation of true SNPs and insertions and deletions (indels) from sequencing and mapping errors in Whole Exome Capture Sequecing (WECS) data. SNPs may be called using the Atlas-SNP2 application and indels may be called using the Atlas-Indel2 application.	SNP detection Indel detection	Variant calling		Ruby C	BSD	POSIX
Atlas-SNP2	Atlas-SNP2 is a SNP detection tool developed for next generation sequencing platforms	SNP detection			Ruby	Freeware	UNIX
Avadis NGS	Strand NGS formerly Avadis NGS is a desktop software platform for alignment, analysis, visualization, and management of data generated by next-generation sequencing (NGS) platforms. It supports workflows for RNA-Seq, DNA-Seq, small RNA-Seq, ChIP-seq, and Methyl-Seq data analysis. Strand NGS is designed with the biologist in mind.	ChIP-seq RNA-Seq Sequencing DNA-Seq Small RNA-Seq RNA Methyl-Seq MeDIP-Seq Pathway or network analysis	Sequence alignment Visualisation Sequencing quality control Sequence analysis Analysis Biological interpretation	Rich Visualisation Identify effects of SNPs on transcripts Identify Structural Variants from Paired Reads (Insertions Deletions Translocations Inversions) Identify binding site peaks in ChIP-seq data Identify motifs around binding sites Determine gene expression levels and identify differentially expressed genes De-convolve transcript expression levels and identify differential splice variants Identify Novel Exons Identify Novel Splice Junctions Identify Fusion Genes Perform QC on Reads determine on-and off-target reads and filter anomalous reads Determine Enriched GO Terms Determine Significant Pathways	Java R Python	Commercial	Windows Linux Mac OS X
Baa.pl	use transcripts to assess a de novo assembly	Sequence assembly (de novo assembly)	Sequence assembly validation Sequence alignment analysis		Perl	GPL	any
Bambino	Variant detector and graphical alignment viewer for SAM/BAM format data.	SNP detection Genetic variation			Java
Bambus	Bambus is a general purpose scaffolder		Scaffolding
BAMseek	BAMseek is a large file viewer for BAM and SAM alignment files.	Genomics Transcriptomics	Sequence alignment visualisation		Java	GPLv3	Cross-Platform
BamTools	BamTools provides a fast, flexible C++ API & toolkit for reading, writing, and managing BAM files.		Sequence alignment analysis		C++	MIT	Cross-Platform
BamView	Interactive Java application for visualising the large amounts of data stored for sequence reads which are aligned against a reference genome sequence		Visualisation		Java	GPL	Mac OS X UNIX Windows
Barcode generator	Generator of sequence barcodes suitable for Illumina sequencing.		DNA barcoding		Python
Barcrawl Bartab	Barcrawl facilitates the design of barcoded primers, for multiplexed high-throughput sequencing.		DNA barcoding			GPL
BarraCUDA	Barracuda is a high-speed sequence aligner based on BWA and uses the latest Nvidia CUDA architecture for accelerating alignments of sequence reads generated by the next-generation sequencers.	Sequence analysis	Read mapping Sequence alignment	Gapped and ungapped alignment paired-end mapping GPGPU parallel execution	C C++ CUDA	GPLv3 MIT	Linux
Batman	Bayesian tool for methylation analysis (Batman) for analyzing methylated DNA immunoprecipitation (MeDIP) profiles	DNA methylation			Java	LGPL
BayesCall	Bayesian basecaller	Sequencing	Base-calling		C++ Python	GPLv3
BayesPeak	A Bayesian hidden Markov model to detect enriched locations in ChIP-seq data.	ChIP-seq Simulation experiment	Statistical calculation	Multicore	R	GPL
BaySeq	Identify differential expressed genes	RNA-Seq quantification	Differential expression analysis		R
BBMap	BBMap is a fast splice-aware aligner for RNA and DNA. It is faster than almost all short-read aligners, yet retains unrivaled sensitivity and specificity, particularly for reads with many errors and indels.	Sequence alignment Whole genome resequencing RNA-Seq alignment SNP detection Metagenomics Phylogenetics Alternative splicing Resequencing Quality control Read binning	Read mapping Sequence alignment Sequence contamination filtering Sequence trimming RNA-Seq analysis	Multithreaded. Faster and more accurate than competing aligners. Splice-aware.	Java 7	BSD	Windows NIX Mac OS X all supporting JVM
BBSeq	Tool for analyzing RNA-Seq data to analyze gene expression	RNA-Seq quantification			R
Bcbio-nextgen	Python scripts and modules for automated next gen sequencing analysis. These provide a fully automated pipeline for taking sequencing results from an Illumina sequencer, converting them to standard Fastq format, aligning to a reference genome, doing SNP calling, and producing a summary PDF of results.	Workflows	Read mapping Sequence alignment Peak calling Sequence motif recognition Genotyping Sequencing quality control Differential expression analysis Sequence trimming Filtering Genomic region matching		Python	MIT	platform-independent
BEADS	ChIP-seq data normalization for Illumina	ChIP-seq	Standardisation and normalisation
BEAP	The Blast Extension and Assembly Program (BEAP) uses a short starting DNA fragment to recursively blast nucleotide databases to obtain all sequences that overlaps to construct the a "full length" sequence.		Read mapping
BEDTools	BEDTools is an extensive suite of utilities for comparing genomic features in BED format.	Genomics	Mapping	Feature overlaps UNIX pipes coverage split-alignments BAM support	C++	GPLv2	Linux Mac OS X
Bedutils	NGSUtils is a suite of software tools for working with next-generation sequencing datasets. Staring in 2009, we (Liu Lab @ Indiana University School of Medicine) starting working with next-generation sequencing data. We initially started doing custom coding for each project in a one-off manner. It quickly became apparent that this was an inefficient manner to work, so we started assembling smaller utilities that could be adapted into larger, more complicated, workflows. We have used them for Illumia, SOLiD and 454 sequencing data. We have used them for DNA and RNA resequcing, ChIP-seq, CLIP-Seq, and targeted resequencing (Agilent exome capture and PCR targeting). These tools are also used heavily in our in-house DNA and RNA mapping pipelines. These tools have of great use within our lab group, and so we are happy to make them available to the greater community. NGSUtils is made up of 50+ programs, mainly written in Python. These are separated into modules based on the type of file that is to be analyzed. There are four modules:
Belvu	An X-windows viewer for multiple sequence alignments		Sequence alignment visualisation				Linux
BFAST	Blat-like Fast Accurate Search Tool.	Whole genome resequencing	Read mapping Sequence alignment Genome indexing	parallel execution command line	C	GPL	Solaris UNIX
BFCounter	BFCounter is a program for counting k-mers in DNA sequence data.		K-mer counting		C++	GPLv3
BigBWA	Tool to run the Burrows-Wheeler Aligner-BWA on a Hadoop cluster. It supports the algorithms BWA-MEM, BWA-ALN, and BWA-SW, working with paired and single reads. It implies an important reduction in the computational time when running in a Hadoop cluster, adding scalability and fault-tolerancy.	Whole genome resequencing Genomics Exome capture Sequencing Resequencing Exome and whole genome variant detection Exome analysis Exome	Read mapping Sequence alignment Mapping	Gapped alignment paired-end mapping	Java	GPLv3	Linux 64
BING	biomedical informatics pipeline (BING) for the analysis of NGS data that offers several novel computational approaches to 1. image alignment, 2. signal correlation, compensation, separation, and pixel-based cluster registration, 3. signal measurement and base calling, 4. quality control and accuracy measurement.		Base-calling Sequencing quality control
BioJava	"BioJava is an open-source project dedicated to providing a Java framework for processing biological data. It provides analytical and statistical routines, parsers for common file formats and allows the manipulation of sequences and 3D structures. The goal of the biojava project is to facilitate rapid application development for bioinformatics. "	Genomics			Java	LGPL 2.1
Bionimbus	Cloud environment for analysis of microarray and second generation sequencing data.						Linux Amazon EC2 cloud
BioNumerics	BioNumerics can be used for the analysis of all major applications in bioinformatics	Sequence assembly Whole genome resequencing Genotyping Sequence analysis Comparative genomics Quality control Workflows Data handling Microbial Surveillance Epidemiology	Sequence assembly Read mapping Sequence alignment Annotation Genome visualisation Variant calling Comparative genomics Workflows			Commercial	Windows 7 or Higher
BioPerl	"BioPerl, a community effort to produce Perl code which is useful in biology. "	Genomics			Perl		Cross-Platform
BioPHP	biology tools for php.	Genomics			PHP	GPLv2
Biopieces	The Biopieces are a collection of bioinformatics tools that can be pieced together in a very easy and flexible manner to perform both simple and complex tasks. The Biopieces work on a data stream in such a way that the data stream can be passed through several different Biopieces, each performing one specific task: modifying or adding records to the data stream, creating plots, or uploading data to databases and web services.	Genomics	Sequence alignment Visualisation Sequencing quality control Sequence analysis		Perl Python Ruby C	GPLv2
Biopython	Biopython provides a tool kit for writing bioinformatics and computational molecular biology software in Python.	Sequence analysis Phylogenetics Population genetics Protein structure analysis	Sequence parsing	Various	Python	Biopython License (MIT/BSD style)	Linux Windows Mac OS X
BioRuby	"BioRuby comes with a comprehensive set of free development tools and libraries for bioinformatics and molecular biology, for the Ruby programming language. BioRuby has components for sequence analysis, pathway analysis, protein modelling and phylogenetic analysis; it supports many widely used data formats and provides easy access to databases, external programs and public web services, including BLAST, KEGG, GenBank, MEDLINE and GO."	Genomics			Ruby		Cross-Platform
BioSmalltalk	BioSmalltalk provides an environment to build bioinformatics scripts and applications using the most powerful object technology as of today, the Smalltalk programming environment	Sequence analysis Phylogenetics Population genetics Protein structure analysis	Sequence parsing	Various	Smalltalk		Linux Windows Mac OS X
BiQ Analyzer	BiQ Analyzer is a software tool for easy visualization and quality control of DNA methylation data. With more than 2,000 downloads so far, BiQ Analyzer has become a standard tool for processing DNA methylation data from bisulfite sequencing.	DNA methylation Epigenomics			Java		Windows Linux Mac OS X Solaris
BiQ Analyzer HT	BiQ Analyzer HT is an enhanced version of BiQ Analyzer that provides extensive support for high-throughput bisulfite sequencing. BiQ Analyzer HT facilitates the processing, quality control and initial analysis of single-basepair resolution DNA methylation data. It was developed for deep bisulfite sequencing of one or more loci using the Roche 454 platform, but it easily extends to other sequencing platforms. BiQ Analyzer HT features a biologist-friendly graphical user interface, a fast alignment algorithm and a variety of ways to visualize DNA methylation data.	DNA methylation Sequencing Epigenetics			Java		Windows Linux Mac OS X Solaris
Bis-SNP	BisSNP is a package based on the Genome Analysis Toolkit (GATK) map-reduce framework for genotyping in bisulfite treated massively parallel sequencing (Bisulfite-seq, NOMe-seq and RRBS) on Illumina platform. It uses bayesian inference with either manually specified or automatically estimated methylation probabilities of different cytosine context(not only CpG, CHH, CHG in Bisulfite-seq, but also GCH et.al. in other bisulfite treated sequencing) to determine genotypes and methylation levels simultaneously.	DNA methylation SNP detection Sequencing Genotyping Epigenetics	Bisulfite mapping SNP calling Methylation calling	Accurate SNP and methylation calling in Bisulfite-seq/NOMe-seq/RRBS	Java Perl	MIT	Linux Mac OS X
Bismark	Bismark is a tool to map bisulfite treated sequencing reads and perform methylation calling in a quick and easy-to-use fashion.	Genomics DNA methylation Epigenomics	Read mapping Bisulfite mapping Methylation calling	fast and convenient Bisulfite-Seq output very flexible	Perl	GPLv3	Linux Mac OS X Windows
Bison	Bison allows users with access to a computer cluster to rapidly align whole-genome bisulfite sequencing or RRBS reads. It can align both directional and non-directional libraries and uses bowtie2.	DNA methylation Sequencing Epigenetics	Read mapping Bisulfite mapping Methylation calling	BAM support Bisulfite sequencing MPI	C		Unix-like Linux Mac OS X
BLAST	BLAST finds regions of similarity between biological sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance.		Sequence analysis				Linux
BLAST Ring Image Generator	"BRIG is a cross-platform (Windows/Mac/Unix) application that can display circular comparisons between a large number of genomes, with a focus on handling genome assembly data. "	Comparative genomics	Visualisation Sequence assembly visualisation				Cross-Platform
BLAT	Fast, accurate spliced alignment of DNA sequences		Read mapping Sequence alignment	Read mapping	C	Freeware	Linux Mac OS X
Blixem	a graphical blast viewer	Sequence analysis Phylogenetics	Sequence alignment visualisation			GPL	Linux
BOAT	Can accurately and efficiently map sequencing reads back to the reference genome.		Read mapping			GPL
Bort	Bort parses Blast output and quantifies hits by contig and read counts.	RNA-Seq quantification			Perl		any
BOW	BOW - Bioinformatics On Windows is essentially a windows port of BWA and SAMTOOLS
Bowtie	Bowtie is an ultrafast, memory-efficient short read aligner.		Read mapping Genome indexing (Burrows-Wheeler)				Mac OS X Linux Windows
BRAT	accurate and efficient tool for mapping short reads obtained from the Illumina Genome Analyzer following sodium bisulfite conversion. Both single and paired ends are supported.	DNA methylation Epigenomics	Bisulfite mapping Mapping			GPLv3
BRCA-diagnostic	Computational screening test for BRCA1/2 mutants in human genomic DNA	Personalised medicine			Perl
BreakDancer	BreakDancer is an application for detecting structural rearrangements and indels in short read sequencing data	Genomics Indel detection Structural variation			Perl C++	GPLv3
Breakpointer	Breakpointer is a fast tool for locating sequence breakpoints from the alignment of single end reads (SE) produced by next generation sequencing (NGS). It adopts a heuristic method in searching for local mapping signatures created by insertion/deletions (indels) or more complex structural variants(SVs). With current NGS single-end sequencing data, the output regions by Breakpoint mainly contain the approximate breakpoints of indels and a limited number of large SVs.	Indel detection Exome and whole genome variant detection	Statistical calculation		C++ Perl	GPL
BreakSeq	Database of known human breakpoint junctions and software to search short reads against them.	Structural variation	Read mapping
BreakTrans	BreakTrans is a computer program that maps predicted gene fusions to genomic structural rearrangements so as to validate both types of events.		Post-analysis
Breakway	Breakway is a suite of programs that take aligned genomic data and report structural variation breakpoints.	Whole genome resequencing Genomics Indel detection Structural variation Genetic variation	SNP calling Sequence analysis	Fast specific UNIX pipes	Perl	GPL	Linux Mac OS X Windows
BS Seeker	Mapping tool for bisulfite treated reads	Epigenomics	Bisulfite mapping		Python
BS-Seq	The source code and data for the "Shotgun Bisulphite Sequencing of the Arabidopsis Genome Reveals DNA methylation Patterning" Nature paper by Cokus et al. (Steve Jacobsen's lab at UCLA). POSIX.	Epigenomics	Bisulfite mapping
BSMAP	short reads mapping software for bisulfite sequencing	DNA methylation	Read mapping Bisulfite mapping	Bisulfite sequencing	C++	GPLv3	Linux 64
BSSim	BSSim: Bisulfite sequencing simulator for next-generation sequencing.	DNA methylation Sequencing Epigenetics	Modelling and simulation	BSSim can allow users to mimic various methylation level.	Python	GPLv3	UNIX Linux Mac OS X Windows
Btrim	Btrim is a fast and lightweight software to trim adapters and low quality regions in reads.		Sequence trimming				Linux
BWA	Fast, accurate, memory-efficient aligner for short and long sequencing reads	Mapping	Read mapping	Gapped alignment paired-end mapping	C	GPLv3 MIT	UNIX
BWA-SW	Fast, accurate, memory-efficient aligner for long sequencing reads	Mapping	Read mapping	Gapped alignment Local alignment	C	GPLv3 MIT	UNIX
CABOG	Celera Assembler is scientific software for DNA research.	Sequence assembly (de novo assembly)	Sequence assembly	Robust to homopolymer run length			Linux
CANGS	CANGS is a flexible and user-friendly utility to trim sequences, filter low quality sequences, and produce input files for further downstream analyses for 454 sequences. CANGS can be used to assign the taxonomic grouping based on similarity with sequences from the NCBI database	Metagenomics Phylogenetics	Sequencing quality control Sequence trimming Primer removal		Perl
CARPET	A web‐based package for the analysis of ChIP‐chip and expression tiling data	ChIP-on-chip	Genotyping		C++
CASHX	Parse, map, quantify and manage large quantities of short-read sequence data.	Transcriptomics	Read mapping
CATCH	A tool for exploring patterns in ChIP profiling data.	ChIP-seq ChIP-on-chip	Sequence alignment Clustering	parallel execution graphical browsing of results	Java	Open Source
CatchAll	Estimate ecological diversity with both parametric and non-parametric estimators.	Metagenomics Population genetics
CEQer	CEQer (Comparative Exome Quantification analyzer) is a graphical, event-driven tool for copy number abnormalities/allelic-imbalance coupled analysis of whole-exome sequencing data. By using case-control matched exome data, CEQer performs a comparative digital exonic quantification to generate CNA data and couples this information with exome-wide LOH and allelic imbalance detection.	Copy number estimation Genetic variation analysis Exome analysis		Copy number and allelic imbalance analyses from matched exomes.	C#	GPLv3	Windows Mac Linux
CexoR	Strand specific peak-pair calling in ChIP-exo data	ChIP-exo	Peak calling	R/Bioconductor package can run on major computer platforms	R	GPL-2 + file LICENSE	Linux Mac OS X Windows
CGA Tools	Tools for viewing, manipulating and converting data from Complete Genomics		File reformatting		C++	Apache License 2.0	Linux UNIX Mac OS X
CHiCAGO	CHiCAGO (Capture HiC Analysis of Genomic Organization) is a set of tools for calling significant interactions in Capture HiC data, such as Promoter Capture HiC.	Epigenomics	Peak calling		R Python	Artistic-2.0	Linux OSX
ChimeraScan	Identifies chimaeric transcripts in RNA-Seq data	Gene structure
ChIP-Seq (application)	The ChIP-seq web server provides access to a set of useful tools performing common ChIP-seq data analysis tasks, including positional correlation analysis, peak detection, and genome partitioning into signal-rich and signal-poor regions. It is an open system designed to allow interoperability with other resources, in particular the motif discovery programs from the Signal Search Analysis (SSA) server.	ChIP-seq	Read mapping Peak calling		C Perl	GPL	Linux Mac OS X
ChIPmeta	Combining data from ChIP-seq and ChIP-chip.	ChIP-seq ChIP-on-chip Transcription factors and regulatory sites	Peak calling
ChIPMunk	ChIPMunk is a fast heuristic DNA motif digger based a on greedy approach accompanied by bootstrapping. ChIPMunk identifies the strong motif with the maximum Kullback Discrete Information Content in a given set of DNA sequences.	ChIP-seq Sequence motifs	Sequence motif comparison Sequence motif discovery	efficient motif discovery for huge datasets up to tens of thousands of sequences; multi-core CPU support; usage of the ChIP-seq base coverage peak data	Java	Freeware	platform-independent
CHiPSeq	From Science Johnson, 2007	ChIP-seq	Peak calling
ChIPseqR	ChIP-seq qanalysis tool	ChIP-seq			R
Chipster	User-friendly NGS data analysis software with built-in genome browser and workflow functionality. Chipster includes tools for ChIP-seq, RNA-seq, miRNA-seq and MeDIP-seq analysis, and functionality for exome-seq and CGH-seq will soon be added.	DNA methylation ChIP-seq RNA-Seq Immunoprecipitation experiment	Read mapping Peak calling Sequence motif recognition Genome visualisation Sequencing quality control Differential expression analysis Sequence trimming Pathway or network analysis Methylation analysis		Java R	GPLv3	platform-independent
ChromaSig	An unsupervised learning method, which finds, in an unbiased fashion, commonly occurring chromatin signatures in both tiling microarray and sequencing data.	ChIP-on-chip Chromatin	Sequence motif recognition		Perl C
ChromHMM	ChromHMM is software for learning and characterizing chromatin states.	Epigenomics	Statistical calculation		Java	GPLv3
Circos	Circos is tool for visualizing data in a circular format. It was developed for genomic data but can work for many other kinds of data as well.	Comparative genomics	Visualisation		Perl		Windows Linux
CisGenome	An integrated tool for tiling array, ChIP-seq, genome and cis-regulatory element analysis	ChIP-seq ChIP-on-chip Sequence motifs	Sequence motif recognition Gibbs sampling Integrated solution Data retrieval		C C++		UNIX Windows
Cistrome	Galaxy-based web service for analysis of ChIP data	ChIP-seq ChIP-on-chip			Python
CLCbio Genomics Workbench	De novo and reference assembly SNP and small indel detection and annotation.	Sequence assembly (de novo assembly) Whole genome resequencing Genomics Transcriptomics ChIP-seq SNP detection Indel detection RNA-Seq Regulatory RNA Mapping	Sequence assembly Sequence assembly (de-novo assembly) Read mapping Sequence alignment Ab-initio gene prediction Adapter removal Annotation Bisulfite mapping SNP calling Heat map generation Sequence assembly validation	Advanced and user-friendly analyses of genomic transcriptomic and epigenomic NGS data in a graphical user-interface. Wizard driven tools and a freely available developer toolkit SIMD implementation multi-threading hybrid assembly Integrated solution	Java C++	Commercial	Windows Mac OS X Linux
Clean reads	clean_reads cleans NGS (Sanger, 454, Illumina and solid) reads.		Sequencing quality control Sequence trimming		Python
CleaveLand	A pipeline for using degradome data to find cleaved small RNA targets.	Regulatory RNA			Perl R	Freeware
CLEVER	CLEVER is a tool to discover structural variations such as (larger) insertions and deletions in genomes from paired-end sequencing reads.	Genomics Structural variation Copy number estimation		command line	C++ Python	GPLv3	any
ClipCrop	a new method and implementation named ClipCrop for detecting SVs with single-base resolution	Whole-genome sequencing
CloudAligner	Hadoop-based short read aligner		Read mapping Hadoop		Java	GPL	cloud
CloudBurst	CloudBurst is a parallel read-mapping algorithm optimized for mapping next-generation sequence data to the human genome and other reference genomes.	SNP detection Genotyping Personalised medicine	Read mapping	parallel execution Hadoop Academic Cloud Computing Initiative	Java
ClustDB	A powerful tool for exact sequence matching						Linux
Cluster Flow	A command-line pipeline tool which uses common cluster managers to run bioinformatics analysis pipelines.		Analysis pipeline		Perl	GPLv3	Linux
CNANorm	A normalization method for Copy Number Aberration in cancer samples.	Genomics Copy number estimation Cancer biology	Peak calling Standardisation and normalisation		R Perl	GPLv2	Linux Mac OS X Windows
CNAseg	We present a novel approach, called CNAseg, to identify CNAs from second-generation sequencing data. It uses depth of coverage to estimate copy number states and flowcell-to-flowcell variability in cancer and normal samples to control the false positive rate.	Structural variation			R
CNB MetaGenomics tools	A number of tools and meta-tools developed at CNB/CSIC for the analysis of metagenomics data (some rely on QIIME).	Metagenomics Sequencing			Bash Perl Python C	EU-GPL	Linux Unix-like POSIX
CnD	Program to detect copy number variation in inbred mouse strains	Copy number estimation			D	GPL
CNVer	CNVer is a method for CNV detection that supplements the depth-of-coverage with paired-end mapping information, where matepairs mapping discordantly to the reference serve to indicate the presence of variation. CNVer combines this information within a unified computational framework called the donor graph, allowing it to better mitigate the sequencing biases that cause uneven local coverage. CNVer can also reconstruct the absolute copy counts of segments of the donor genome, and work with low coverage datasets.	Structural variation Copy number estimation			Perl C++
CnvHMM	WashU copy number variant (CNV) detection algorithm for Illumina/Solexa data.	Structural variation					Linux
CNVkit	CNVkit is a software toolkit to infer and visualize copy number from targeted DNA sequencing data.	Structural variation Copy number estimation	Variant calling	Copy number analysis data visualization	Python	BSD	GNU/Linux Mac OS X
CNVnator	CNV discovery and genotyping from read-depth analysis of personal genome sequencing	Genotyping Copy number estimation
CNVseq	CNV-seq, a method to detect copy number variation using high-throughput sequencing. pubmed	Copy number estimation			Perl R
CompreheNGSive	compreheNGSive is an interactive visualization of the end results of the next-generation sequencing pipeline.	Next Generation Sequencing	Visualisation		Python Qt	LGPL	Mac OS X Linux
CoNAn-SNV	CoNAn-SNV is a probabilistic framework for the discovery of single nucleotide variants in WGSS data. This software explicitly integrates information about copy number state of different genomic segments into the inference of single nucleotide variants.	SNP detection			C
ConDeTri	ConDeTri is a content dependent read trimming software for Illumina/Solexa sequencing data	Genomics RNA-Seq Sequencing	Sequence trimming		Perl
ContEst	GATK tool to estimate amount of cross-individual contaminating sequence in a dataset	Metagenomic sequencing	Sequencing quality control		Java	BSD
Contra	Copy number analysis for exome-sequencing / targeted-resequencing. Two methods of analysis available: Case vs Control, or Case vs Baseline. Function available for creating a baseline from multiple samples.	Genomics Sequencing Copy number estimation Cancer biology		Copy number analysis baseline (pseudo-control) creation	Python R	GPLv3	Linux 64 Linux
Contrail	A Hadoop based genome assembler for assembling large genomes in the clouds	Sequence assembly (de novo assembly)	Sequence assembly Sequence assembly (de-novo assembly) Hadoop
CopySeq	CopySeq analyzes the depth-of-coverage of whole genome resequencing data to predict CNVs and to infer quantitative locus copy-number genotypes.	Structural variation Genotyping Personalised medicine Copy number estimation			Java R		Mac OS X Linux
Coral	Corrects sequencing errors in short read data via multiple alignments		Sequence error correction		C++
CORAL (Contig Ordering Algorithm)	An algorithm has been developed to order fingerprinted clones within contigs.		Sequence error correction		Java
Cortex	Cortex is an efficient and low-memory software framework for analysis of genomes using sequence data. Cortex allows de novo assembly of variants without having to do a consensus assembly first. Also allows comparison of genomes without using consensus, and alignment of sequence data to a de Bruijn graph	Genomics	Sequence assembly Variant calling		C	GPLv3
CPTRA	Integrated transcriptome analysis from Sanger, 454, Solexa, SOLiD, etc reads	RNA-Seq alignment RNA-Seq quantification			Python
CRAC	CRAC is a mapping software specialized for RNA-Seq data. It detects mutations, indels, splice or fusion junctions in each single read.	RNA-Seq alignment SNP detection Indel detection RNA-Seq quantification Alternative splicing Gene structure	Read mapping Genome indexing (Burrows-Wheeler)		C++	CeCILL	Linux Linux 64 Mac OS X
CRAVAT	CRAVAT is a web-based resource for cancer-related analysis of genomic variants (single base substitutions and indels). CRAVAT provides scoring, sorting, filtering, and interactive visualizations to assist with identification of important variants.	Genomics Genetic variation Genetics	Annotation Variant prioritisation SNP annotation Variant classification Sequence annotation	Gene-based annotation Detailed Reports Easy-to-use point-and-click web interface GUI HTML5 canvas graphics Identify effects of SNPs on transcripts Interactive Multiple replicates information used; automated pipeline; easy genomic annotation; finding hotspots; SNP annotation User-friendly User Management Variant annotation and analysis			Web Linux
CRISP	Identifies rare and common variants in pooled sequencing data	SNP detection			Python
Crossbow	Crossbow is a cloud-computing software tool that combines the aligner BOWTIE and the SNP caller SOAPsnp.	SNP detection	Read mapping
CUDA-EC	A scalable parallel algorithm for correcting sequencing errors in high-throughput short-read data so that error-free reads can be available before DNA fragment assembly.		Sequencing quality control	read error correction	C
Cufflinks	Cufflinks assembles transcripts and estimates their abundances in RNA-Seq samples. It accepts aligned RNA-Seq reads and assembles the alignments into a parsimonious set of transcripts. Cufflinks then estimates the relative abundances of these transcripts based on how many reads support each one.	Transcriptomics RNA-Seq alignment RNA-Seq RNA-Seq quantification Alternative splicing	Read mapping Transcriptome assembly Differential expression analysis			Boost
CummeRbund	Allows for persistent storage, access, exploration, and manipulation of Cufflinks high-throughput sequencing data. In addition, provides numerous plotting functions for commonly used visualizations.	RNA-Seq quantification	Visualisation
Curtain	Curtain is a Java wrapper around next-generation assemblers such as Velvet which allows the incremental introduction of read-pair information into the assembly process. This enables the assembly of larger genomes than would otherwise be possible within existing memory constraints.	Sequence assembly (de novo assembly)	Sequence assembly Sequence assembly (de-novo assembly)			Apache License 2.0
Cutadapt	remove adapter sequences from high-throughput sequencing data using alignment				Python Cython	MIT
DCLIP	dCLIP is a Perl program for discovering differential binding regions in two comparative CLIP-Seq (HITS-CLIP, PAR-CLIP or iCLIP) experiments.	CLIP-Seq HITS-CLIP PAR-CLIP ICLIP	Sequence alignment analysis		Perl C		UNIX Unix-like
DecGPU	Parallel and distributed error correction algorithm for high-throughput short reads.	Sequence assembly (de novo assembly)	Sequence error correction		C++	GPLv3	Linux
DeconSeq	DeconSeq can be used to automatically detect and efficiently remove any type of sequence contamination from metagenomic datasets, including human or other host sequences. The tool uses a modified version of the BWA-SW aligner and can be applied to longer-read datasets (150+bp read length). DeconSeq is available as both standalone and web-based versions.	Genomics Transcriptomics Metagenomics	Sequence contamination filtering		Perl C	GPLv3	UNIX Mac OS X
DeepTools	User-friendly tools for the normalization and visualization of deep-sequencing data.	Genomics ChIP-seq	Visualisation Conversion Standardisation and normalisation	Data Visualisation coverage analysis File reformatting normalization GC plot analysis	Python	GPLv3	Linux Mac OS X
DeFuse	deFuse is a software package for gene fusion discovery using RNA-Seq data. The software uses clusters of discordant paired end alignments to inform a split read alignment analysis for finding fusion boundaries. The software also employs a number of heuristic filters in an attempt to reduce the number of false positives and produces a fully annotated output for each predicted fusion	RNA-Seq Gene structure
DEGseq	an R package to identify differentially expressed genes or isoforms for RNA-seq data from different samples	RNA-Seq quantification	Differentially expressed gene identification		R
DESeq	DESeq is an R package to analyse count data from high-throughput sequencing assays such as RNA-Seq and test for differential expression. The latest version is DESeq2 (released April 2013).	ChIP-seq RNA-Seq quantification	Sequencing quality control Statistical testing		R	GPLv3	UNIX Windows Mac OS X
DIAL	A computational pipeline for identifying single-base substitutions between two closely related genomes without the help of a reference genome.	SNP detection Comparative genomics			C Python	MIT	Linux
DiBayes	Bayesian identification of SNPs in color space (SOLiD) data	SNP detection				GPL
DiffBind	Differential Binding Analysis of ChIP-seq peak data Compute differentially bound sites from multiple ChIP-seq experiments using affinity (quantitative) data. Also enables occupancy (overlap) analysis and plotting functions.	ChIP-seq	Differential binding sites	Multiple replicates information used; automated pipeline; finding hotspots;	R	Artistic-2.0	Linux Mac OS X Windows
Diffreps	diffReps is developed to find different peaks in ChIP-seq. It scans the whole genome using a sliding window, performing millions of statistical tests and report the significant hits. diffReps takes into account the biological variations within a group of samples and uses that information to enhance the statistical power. Considering biological variation is of high importance, especiallly for in vivo brain tissues.	ChIP-seq	Differential Binding	Multiple replicates information used; automated pipeline; easy genomic annotation; finding hotspots;	Perl	GPLv3	Linux Windows Mac OS X
Dindel	Calls small indels from short-read sequence data	Indel detection	Localised reassembly
DiscoSnp	discoSnps : qualitative de-novo SNP caller. Extremely low memory and time efficient. No reference genome needed. Call both homozygous and heterozygous SNPs.	Sequence assembly (de novo assembly) Sequencing Genotyping Comparative genomics High-throughput sequencing Population Genomics	Sequence assembly (de-novo assembly) Read depth analysis	de novo (reference free) SNP calling	C++	CeCILL	Unix-like iOS
DNA Baser	Tool for manual and automatic sequence assembly, analysis, editing, sample processing, metadata integration, file format conversion and mutation detection.	SNP detection Structural variation	Sequence assembly Sequence analysis Sequence assembly editing	Portable. Does not require installation. Can run from USB stick. Only 3MB.		Commercial Freeware	Windows
DNA Chromatogram Explorer	DNA Chromatogram Explorer is a Windows Explorer clone dedicated to DNA sequence analysis and manipulation.	Sequence analysis	File reformatting Chromatogram visualisation	Portable. Does not require installation. Can run from USB stick. Only 1MB.		Freeware	Windows
DNAA	DNAA (DNA Analysis) software for analysis of Next-Generation Sequencing data.	DNA methylation SNP detection Structural variation	Sequencing quality control Modelling and simulation Statistical calculation			GPL
DNaseR	DNase I footprinting analysis of DNase-seq data in R	DNase-seq	Peak calling Nucleic acid sequence feature detection	R/Bioconductor package can run on major computer platforms	R	GPL-2 + file LICENSE	Linux Mac OS X Windows
DNAzip	A series of techniques that in combination reduces a single genome to a size small enough to be sent as an email attachment.		File reformatting		C++
DrFAST	Fast mapper for dibase encoded data.
DSAP	Automated multiple-task web service designed to provide a total solution to analyzing deep-sequencing small RNA datasets generated by next-generation sequencing technology	Transcriptomics Regulatory RNA					browser based
DSGseq	This program aims to identify differentially spliced genes from two groups of RNA-seq samples.	RNA-Seq Alternative splicing Gene expression	Statistical calculation Differential expression analysis RNA-Seq analysis		C R	Commercial Freeware	Linux Windows Mac OS X
DSRC	Compression algorithm for genomic data in FASTQ format		File reformatting
E-miR	Perl tools for processing miRNA sequencing data	Transcriptomics Regulatory RNA
Ea-utils	FASTQ processing utilities		Sequencing quality control Sequence trimming		C++	MIT
EagleView	EagleView is an information-rich genome assembler viewer with data integration capability.		Sequence assembly visualisation
EagleView genome viewer	EagleView is an information-rich genome assembler viewer with data integration capability.		Visualisation				Linux Windows Mac OS X
EaSeq	EaSeq is developed for user-friendly exploration, visualization and analysis of genome-wide single-read sequencing data (mainly ChIP-seq). Both individual genomic loci and populations of loci can be visualized e.g. as plots of average signals, scatter diagrams, or clustered heatmaps. The underlying loci can then be inspected just by selecting them in the plots - or they can be 'gated out' for further analysis. EaSeq also integrates more than 20 tools for analysis, including peak-finding, quantitation, normalization, clustering, distance analysis, randomization, scoring, and normalization. Finally, it automatically generates legends and descriptions of the handling and can store plots together with underlying data and these descriptions as a single compact session file.	ChIP-Seq	Visualisation Peak calling Viewer Clustering Integrated solution Genome visualisation Analysis Biological interpretation Regression analysis Correlation Filtering Genomic region matching Format conversion Read summarisation Enrichment Global test Visualisation Quality assessement And more...	Interactive User-friendly GUI WYSIWYG ChIPseq analysis Data Visualisation Data analysis Genome Viewer Integrated solution Memory efficient Multicore Integrated tutorials Automatic documentation Integrated chat forum		Freeware Custom Licence	Windows 7 or Higher
Easyfig	Genome comparison figure generator	Comparative genomics	Comparative genomics		Python	GPLv3	Windows Mac OS X GNU/Linux
EBCall	EBCall is a software package for somatic mutation detection (including InDels). EBCall uses not only paired tumor/normal sequence data of a target sample, but also multiple non-paired normal reference samples for evaluating distribution of sequencing errors, which leads to an accurate mutaiton detection even in case of low sequencing depths and low allele frequencies.
ECHO	Reference-free short read error correction from diploid genomes, with explicit modeling of heterozygous sites.	SNP detection Indel detection	Sequence error correction		Python C++	BSD
EDENA	An assembler dedicated to process the millions of very short reads produced by the Illumina Genome Analyzer.		Sequence assembly
EdgeR	edgeR is an R/Bioconductor software package for statistical analysis of replicated count data. Methods are designed for assessing differential expression in comparative RNA-Seq experiments, but are generally applicable to count data from other genome-scale platforms (ChIP-seq, MeDIP-Seq, Tag-Seq, SAGE-Seq etc).	DNA methylation ChIP-seq RNA-Seq RNA-Seq quantification Gene expression analysis	Statistical calculation		R	LGPL	Windows Mac OS X UNIX
ELAND	Efficient Large-Scale Alignment of Nucleotide Databases. Whole genome alignments to a reference genome. Written by Illumina author Anthony J. Cox for the Solexa 1G machine.		Sequence alignment			Commercial
EMBF	Frequency-based, de novo short-read clustering method that organizes erroneous short sequences originating in a single abundant sequence into a tree structure; in this structure, each “child” sequence is considered to be stochastically derived from its more abundant “parent” sequence with one mutation through sequencing errors.		Mapping
Epigenome	A bioinformatic pipeline that scores epigenetic alterations according to strength and significance and links them to potentially affected genes.	Epigenomics	Bisulfite mapping		R Python
EpiGRAPH	EpiGRAPH enables biologists to analyze genome and epigenome datasets with powerful statistical and machine learning methods. In a typical workflow, the user uploads a set of genomic regions of interest (e.g. experimentally mapped enhancers, hotspots of epigenetic regulation or sites exhibiting disease-specific alterations), and EpiGRAPH searches a large database of (epi-) genomic attributes for significant overlap and correlation with the regions in the input dataset. Furthermore, EpiGRAPH can predict the status of genomic regions that were not included in the input dataset.	Epigenomics Machine learning	Statistical calculation				browser based
ERANGE	ERANGE is a Python package for doing RNA-seq and ChIP-seq.	ChIP-seq RNA-Seq alignment RNA-Seq quantification DNA transcription		RNAseq analysis Chipseq analysis	Python
ERDS	ERDS is a free, open-source software, designed for detection of copy number variants (CNVs) on human genomes from next generation sequence data. It uses paired Hidden Markov models (PHMM) based on the expected distribution of read depth of short reads and the presence of heterozygous sites. ERDS is NOT good for whole exome data.	Copy number estimation
ERGO Genome Analysis and Discovery System	ERGO provides a systems-biology informatics toolkit centered on comparative genomics to capture, query and visualize sequenced genomes. Building upon the most comprehensive genomic database available anywhere integrated with the largest collection of microbial metabolic and non-metabolic pathways and using Igenbio's proprietary algorithms, ERGO assigns functions to genes, integrates genes into pathways, and identifies previously unknown or mischaracterized genes, cryptic pathways and gene products.	Transcriptomics Metagenomics Transcription factors and regulatory sites Phylogenetics Comparative genomics SNP Functional genomics Gene structure Exome analysis Genome Wide Association Studies Metabolic pathways	Sequence alignment SNP detection SNP annotation Gene expression analysis Pathway or network analysis Sequence annotation Sequence functional annotation Metabolic network modelling			Commercial	Web
ERNE	Extended Randomized Numerical alignEr for accurate alignment of NGS reads. It can map bisulfite-treated reads.	Sequence alignment Genomics DNA methylation Sequencing Epigenetics	Read mapping Bisulfite mapping	Bisulfite sequencing sequence alignment	C++	GPLv3	Linux Mac OS X Windows
Error Correction Evaluation Toolkit	Evaluation of error correction results	Data handling	Sequencing quality control		Python Perl		POSIX
Est2assembly	Processes raw sequence data from Sanger or 454 sequencing into a hybrid de-novo assembly, annotates it and produces GMOD compatible output, including a SeqFeature database suitable for GBrowse.	Genomics RNA-Seq alignment	Sequence assembly
ESTcalc	Estimation of project costs for RNA-Seq study.	RNA-Seq			Perl
EULER	EULER-SR is a program for de novo assembly of reads. Contrary to the overlap-layout approach, EULER-SR uses a de Bruijn graph to construct an assembly. The assembly of a genome corresponds to an Eulerian path in the de Bruijn graph. Long (possibly erroneous) reads, and mate-pairs are used to determine parts of the correct Eulerian traversal in the assembly.		Sequence assembly Sequence assembly (de-novo assembly)		C++ Perl		Linux
ExomeCNV	Identifies copy number variation from targeted exome sequencing data	Exome capture Copy number estimation			R
ExomeCopy	CNV detection from exome sequencing read depth	Copy number estimation Exome and whole genome variant detection Exome analysis		simultaneous normalization and segmentation	R	GPL 2.0+	Linux Windows Mac OS X
ExomePicks	ExomePicks is a program that suggests individuals to be sequenced in a large pedigree.
Exonerate	Various forms of alignment (including Smith-Waterman-Gotoh) of DNA/protein against a reference. Authors are Guy St C Slater and Ewan Birney from EMBL. C for POSIX.		Sequence alignment		C	GPL	Linux
FAAST	Flowspace Assisted Alignment Search Tool		Read mapping				Linux
FaBox	Tools for splitting, joining and otherwise manipulating FASTA format sequence files.	Sequence assembly Phylogenetics
FACS	Rapid and accurate classification of sequences as belonging or not belonging to a reference sequence.	Metagenomics			Perl f	GPLv2	Linux
Falcon	FALCON: experimental PacBio diploid assembler	Sequence assembly	Sequence assembly
FAST: Fast Analysis of Sequences Toolbox	The FAST Analysis of Sequences Toolbox (FAST) is a set of Unix tools (for example fasgrep, fascut, fashead and fastr) for sequence bioinformatics modeled after the Unix textutils (such as grep, cut, head, tr, etc).	Genomics Sequence analysis	Sequence analysis Sequence parsing				GNU/Linux Mac OS X
FastQ Screen	FastQ Screen provides a simple way to screen a library of short reads against a set of reference libraries. Its most common use is as part of a QC pipeline to confirm that a library comes from the expected source, and to help identify any sources of contamination.	Genomics Transcriptomics	Read mapping Sequencing quality control	Summarises the mapping of a library against a series of reference sequences	Perl	GPLv3	Linux Mac OS X Windows
FastQC	FastQC aims to provide a simple way to do some quality control checks on raw sequence data coming from high throughput sequencing pipelines.		Sequencing quality control		Java	GPLv3	UNIX Windows
FastQValidator	Checking that FastQ files are follows standards	Quality control	Sequencing quality control		C++
FDM	Detects differential transcription in RNA-Seq data	RNA-Seq quantification
FeatureCounts	featureCounts is a very efficient read quantifier. It can be used to summarize RNA-seq reads and gDNA-seq reads to a variety of genomic features such as genes, exons, promoters, gene bodies and genomic bins. It is included in the Bioconductor Rsubread package and also in the SourceForge Subread package.	Next Generation Sequencing	Read summarisation	read summarisation	R C	GPLv3	Linux 64 Mac OS X Mac OS X; x86 64
FHiTINGS	"FHiTINGS is designed for use in rapidly identifying, classifying, and parsing internal transcribed spacer (ITS) DNA sequences after a BLASTn search. This software is useful for fungal ecology studies using next generation sequencing (NGS)."	Metagenomics Comparative genomics	Sequence alignment		Python		Cross-Platform
Figaro	Figaro is a software tool for identifying and removing the vector from raw DNA sequence data without prior knowledge of the vector sequence.	Sequencing	K-mer counting	AMOS	C++ Perl	Artistic License	UNIX
Filter	Produces a filtered version of an sRNA dataset, controlled by several user-defined criteria, including sequence length, abundance, complexity, transfer and ribosomal RNA removal.	Workflows	Sequence contamination filtering	multi-threading	Java	Custom Licence	Linux 64 Windows Mac OS X
FindPeaks 3.1	Findpeaks was developed to perform analysis of ChIP-seq experiments.	ChIP-seq	Peak calling			GPLv3
FindPeaks 4.0 (Vancouver Short Read Package)	The Vancouver Short Read Analysis Package (VSRAP) contains the FindPeaks application for Chip-Seq and RNA-Seq analysis, as well as utilities for SNP finding, working with aligned sequence files and a nascent database for storing SNPs across multiple libraries.	Genomics SNP detection	Peak calling Formatting Sequence alignment analysis	command line	Java	GPL	Linux Windows Mac OS X
FLASH	Identifies paired-end reads which overlap in the middle, converting them to single long reads		Sequence assembly Read pre-processing	combining forward and reverse reads	C	Open Source	Linux 64
Flexbar	Flexible barcode and adapter processing for next-generation sequencing platforms	Genomics	Adapter removal Sequence trimming DNA barcoding Quality control Read pre-processing	Paired read support separate barcode reads multi-threading	C++	GPLv3	Linux Windows Mac OS X
Flower	Tool for reformatting SFF files into other formats or tab-delimited				Haskell
FlowSim	Tool for simulating errors in 454 sequencing data		Sequence error correction Modelling and simulation		Haskell
Flux	FluxCapacitor s a computer program to predict splice form abundancies from reads of an RNA-seq experiment. FluxSimulator can generate simulated data for testing RNA-seq pipelines	RNA-Seq	Modelling and simulation
Forge	De novo assembly using a combination of next-generation and Sanger reads	Sequence assembly (de novo assembly) Genomics	Sequence assembly
FragGeneScan	Application for finding (fragmented) genes in short reads	Genomics Metagenomics			C Perl	GPL
FrameDP	Sensitive peptide detection on noisy matured sequences. A self-training integrative pipeline for predicting CDS in transcripts which can adapt itself to different levels of sequence qualities.	RNA-Seq
FreClu	a frequency-based, de novo short-read clustering method that organizes erroneous short sequences originating in a single abundant sequence into a tree structure; in this structure, each “child” sequence is considered to be stochastically derived from its more abundant “parent” sequence with one mutation through sequencing errors. The root node is the most frequently observed sequence that represents all erroneous reads in the entire tree, allowing the alignment of the reliable representative read to the genome without the risk of mapping erroneous reads to false-positive positions.	RNA-Seq alignment	Read mapping
Freebayes	Bayesian genetic variant detector (SNPs, indels, MNPs)	Genomics				MIT
FREEC	A tool for control-free copy number alteration (CNA) detection using deep-sequencing data, particularly useful for cancer studies.	Copy number estimation					Linux Linux 64 Windows
FusionAnalyser	FusionAnalyser is a new graphical, event-driven tool dedicated to the identification of driver fusion rearrangements in human cancer through the analysis of paired-end high-throughput transcriptome sequencing data. Tested on Illumina. Requires short, paired-end sequences.	Gene structure High-throughput sequencing		Advanced and user-friendly analysis of RNA-seq data for fusion discovery	C#	GPLv3	Windows Linux
FusionCatcher	FusionCatcher searches for novel/known somatic fusion genes, translocations, and chimeras in RNA-seq data (paired-end or single-end reads from Illumina NGS platforms like Solexa/HiSeq/NextSeq/MiSeq) from diseased samples).	RNA-Seq Gene structure	Sequence alignment		Python	GPLv3	NIX
FusionHunter	Identifies gene fusions in RNA-Seq data	RNA-Seq			Perl C		Linux Linux 64
FusionMap	Detects fusion events in both single- and paired-end datasets from either RNA-Seq or gDNA-Seq studies and characterize fusion junctions at base-pair resolution.	Gene structure	Split-read mapping		C#	Commercial Freeware	Windows Linux Linux 64
FusionSeq	Identifies fusion transcripts from paired end RNA-Seq data.	RNA-Seq Gene structure	Sequence alignment analysis		C	Creative Commons - Attribution; Non-commercial 2.5	Mac OS X UNIX Linux
Fuzzypath	Assembler	Genomics	Sequence assembly Sequence assembly (de-novo assembly)
G-Mo.R-Seq	G-Mo.R-Se is a method aimed at using RNA-Seq short reads to build de novo gene models.	RNA-Seq alignment				CeCILL	Linux
G-SQZ	Huffman coding-based sequencing-reads specific representation scheme that compresses data without altering the relative order.		File reformatting		C++
Galaxy	"Galaxy is an open, web-based platform for data intensive biomedical research. Whether on the free public server or your own instance, you can perform, reproduce, and share complete analyses. "	Sequence assembly Whole genome resequencing Genomics Comparative genomics Functional genomics	Sequence assembly Sequence alignment Visualisation Sequencing quality control		Python		Cross-Platform
Galign	Identifies polymorphisms between sequence reads obtained using Illumina/Solexa technology and a reference genome	SNP detection	Read mapping			GPL
Gambit	A cross-platform GUI for sequence visualization and analysis.		Visualisation			GPL 2.0+ Commercial
GAMES	GAMES (Genomic Analysis of Mutations Extracted by Sequencing) is a tool for mining and prediction of functional effect of mutation.	SNP	SNP detection Indel detection SNP annotation		Perl		Linux
GASSST	Fast and accurate aligner for short an long reads		Sequence alignment Mapping	Gapped alignment short and long reads	C++	CeCILL	Linux
GASV	Software for classification and comparison of structural variants measured via paired-end sequencing and/or array-CGH.	Structural variation				GPLv3
GATK	The Genome Analysis Toolkit (GATK) is a structured programming framework designed to enable rapid development of efficient and robust analysis tools for next-generation DNA sequencers. The GATK solves the data management challenge by separating data access patterns from analysis algorithms, using the functional programming philosophy of Map/Reduce	SNP detection	Localised reassembly Statistical calculation		Java Python
GBrowse	Genome Viewer		Visualisation	Genome Viewer	Perl	Open Source	Linux Mac OS X Windows
GeeFu	Database tool for genomic assembly and feature data	Genomics	Sequence assembly		Ruby
GEM	GEM is a java software tool to analyze transcription factor binding ChIP-seq/ChIP-exo data. It predicts binding events, performs de novo motif discovery and use the motif to improve the binding event calling. It calls binding events right at (or very close to ) the motif positions, deconvolves closely spaced homotypic binding events and accurately discovers binding motifs.	ChIP-seq	Peak calling Sequence motif discovery	probabilistic mixture model motif prior multi-threading	Java	Commercial Freeware	Cross-Platform
GEM library	A set of very optimized tools for indexing/querying huge genomes/files. Provided so far: a very fast exact mapper, and an unconstrained split-mapper		Read mapping		C Python OCaml	GPLv3
GENALICE MAP	From FASTQ to VCF in 30 min or less. Ultra-fast Next-Generation Sequencing (NGS) read alignment and variant calling solution.	Genomics	Read mapping Variant calling	ultrafast alignment and variant calling		Commercial	Linux
GENE-Counter	GENE-counter is a computational pipeline for analyzing RNA-Sequencing (RNA-Seq) data for differential gene expression	RNA-Seq					Linux Mac OS X
Genedata Expressionist	Genedata Expressionist performs efficient and quality compliant analysis of next generation sequencing (NGS) and other genomic profiling data	Genomics Transcriptomics DNA methylation ChIP-seq SNP detection Indel detection RNA-Seq Sequencing Laboratory information management Copy number estimation Epigenetics Gene structure	Read mapping Sequence alignment Annotation Bisulfite mapping Clustering Genome visualisation Sequencing quality control Sequence analysis Statistical calculation Gene expression analysis RNA-Seq analysis	Data processing Data management Data analysis Data storage Downstream analysis Pipeline Management Workflows Reporting Audit Trail User Management Data Visualisation	Java	Commercial	Windows Linux
Geneious	Search, organize and analyze genomic and protein information of any size via desktop program that provides publication ready images to enhance the impact of your research.	Sequence assembly (de novo assembly) Genomics RNA-Seq Metagenomics Epigenomics Structural variation Sequence analysis Phylogenetics Population genetics	Sequence assembly Read mapping Sequence alignment Visualisation Annotation Sequence assembly validation Genome visualisation Variant calling DNA barcoding Sequence motif discovery		Java	Commercial	Windows Mac OS X Linux Solaris
GeneProf	GeneProf is a web-based, graphical software suite and database resource for high-throughput-sequencing experiments (RNA-seq and ChIP-seq).	ChIP-seq RNA-Seq	Read mapping Visualisation Peak calling Sequencing quality control Differential expression analysis	User-friendly wizards tutorials examples very flexible reproducible transparent extensible API	Java JavaScript	Commercial Freeware	browser based
GeneTalk	GeneTalk, a web-based platform, that can filter, reduce and prioritize human sequence variants from NGS data and assist in the interpretation of personal variants in clinical context.	Structural variation	Annotation Sequence contamination filtering Sequence analysis Variant calling Genetic variation analysis Plotting and rendering Exome analysis Variant classification	Easy-to-use point-and-click web interface data visualization data filtering Fast SNP annotation SNP calling Variant annotation and analysis variant counting	Ruby JavaScript	Freemium
Genomatix Mining Station (GMS)	The Genomatix Mining Station (GMS) offers mapping of NGS reads onto genomes, transcriptomes and splice-junction libraries. It is a client-server based solution and can be controlled through an intuitive GUI or via command-line. It covers different tasks such as, as genomic positioning, SNP detection, splice analyses and genomic enrichments.	ChIP-seq SNP detection RNA-Seq	Sequence assembly Read mapping SNP calling Correlation	Client-server based system allows for command-line and web-based access. Grid engine is used for job scheduling and mapping is run on multiple cores. Can be combined with a Genomatix Genome Analyzer (GGA) for a fully integrated NGS solution.	C++ Java Flash	Commercial	Windows Mac OS X Linux
Genome Trax	Genome Trax™ enables you to identify human genome variations of functional significance by mapping your NGS data to known elements such as disease mutations and regulatory sites.	SNP detection Indel detection Structural variation Regulatory genomics				Commercial
GenomeBrowse	A free genome browser for exploring sequencing pile-up and coverage data with numerous annotation tracks hosted on the cloud.	Sequence assembly (de novo assembly) Sequence alignment Whole genome resequencing Genomics Sequencing Sequence analysis Genetics Exome and whole genome variant detection Exome analysis Next Generation Sequencing	Visualisation Sequence assembly visualisation Sequence alignment visualisation				Windows Linux Mac OS X
Genomedata	Genomedata is a format for efficient storage of multiple tracks of numeric data anchored to a genome. The format allows fast random access to hundreds of gigabytes of data, while retaining a small disk space footprint.	Storage	Signal		Python C	GPL	Linux Mac OS X
GenomeJack	GenomeJack is a genome browser specialized in next-generation sequencing data. Advantages are intuitive interface and smooth drag'n drop response.	Genomics Personalised medicine	Visualisation		Java	Freeware	Windows Mac OS X Linux
GenomeMapper	GenomeMapper is a short read mapping tool designed for accurate read alignments. It quickly aligns millions of reads either with ungapped or gapped alignments. It can be used to align against multiple genomes simulanteously or against a single reference.		Read mapping Sequence alignment
Genometa	Genometa is a Java based local bioinformatics program which allows rapid analysis of metagenomic short read datasets. Millions of short reads can be accurately analysed within minutes and visualised in the browser component. A large database of diverse bacteria and archaea has been constructed as a reference sequence.	Genomics Metagenomics	Read mapping Visualisation	Read mapping Data Visualisation	Java		Linux Mac OS X Windows
GenomeTools	The GenomeTools genome analysis system is a free collection of bioinformatics tools for genome informatics.1.3.6	Genomics		Integrated solution	C	BSD	POSIX Linux Mac OS X OpenBSD Windows (Cygwin) UNIX
GenomeView	GenomeView is a next-generation stand-alone genome browser and editor initiated in the BSB group at VIB and currently developed at Broad Institute. It provides interactive visualization of sequences, annotation, multiple alignments, syntenic mappings, short read alignments and more. Many standard file formats are supported and new functionality can be added using a plugin system.	Genomics Transcriptomics Sequencing Sequence analysis Comparative genomics Quality control	Visualisation Data retrieval Genome visualisation Sequence alignment visualisation Plotting and rendering	Visualisation of a multitude of genomics data	Java	GPL	platform-independent
GenomicTools	GenomicTools is a platform for the analysis and manipulation of high-throughput sequencing data such as RNA-seq and ChIP-seq.	Genomics ChIP-seq RNA-Seq	Peak calling Heat map generation Genome comparison	create custom pipelines feature overlaps identify binding site peaks in ChIP-seq data create read profiles create read heatmaps	C C++	GPLv2
GenoMiner	A proprietary NGS analysis solution. Powerful hardware comes with preinstalled software, organized in workflows.	Sequence assembly Sequence assembly (de novo assembly) ChIP-seq RNA-Seq	Sequence assembly Sequence alignment Peak calling Sequence error correction Plotting and rendering Mutation detection Gene expression profiling	GenoMiner provide workflows for Reference assembly De novo assembly ChIPSeq RNASeq and more. You upload your files at the beginning and you get the results at the end while you can choose from various tools to use for analysis.	Java	Commercial	Linux
GenoREAD	GenoREAD is a web-based, sequence verification software that can be used to compare Sanger sequencing trace files against a reference sequence. Users can either submit their sequencing results one clone at a time, or they can submit a series of clones (as a project) to run at once. Results can be viewed online or downloaded.	Sequencing Clone verification	Sequence assembly Read mapping Sequence alignment		Perl PHP JavaScript		Linux 64
GenoViewer	A feature rich NGS assembly viewer/browser.		Viewer	large file loading multicontig handling SNP/InDel/Read Error display and search mutation table generation and export consensus sequence generation and export	Java	Freeware	platform-independent
GensearchNGS	A user friendly framework for re-sequencing in a diagnostics context: searching for mutations/variants, especially on well known genes.	Exome capture	Sequence alignment Sequence alignment visualisation Read alignment Variant prioritisation Mutation detection	Plugin framework Cafe Variome submission	Java	Commercial	UNIX Windows
GenVision	GenVision is a genomic visualization software package that is fully integrated with Lasergene and is designed to support easy generation of publication quality graphics and maps.	Genomics	Visualisation			Commercial	Windows Mac OS X
Geoseq	Instead of mapping the reads to reference genomes or sequences, Geoseq maps a reference sequence against the sequencing data. It is web-based, and holds pre-computed data from public libraries.	Resequencing	Read mapping
GigaBayes	A short-read SNP and short-INDEL discovery program.	Genomics SNP detection	SNP calling
GimmeMotifs	GimmeMotifs is a de novo motif prediction pipeline, especially suited for ChIP-seq datasets. It incorporates several existing motif prediction algorithms in an ensemble method to predict motifs and clusters these motifs using the WIC similarity scoring metric.	ChIP-seq Epigenomics Gene regulation	Sequence motif comparison		Python	MIT	Linux
Girafe	The R/Bioconductor package girafe facilitates the functional exploration of alignments of sequence reads from next-generation sequencing data to a genome. It allows users to investigate the genomic intervals together with the aligned reads and to work with, visualise and export these intervals.		Sequence alignment		R
Gk arrays	Gk-arrays are a data structure to index the k-mers in a collection of reads.	Genomics Transcriptomics Metagenomics	Sequence assembly Read mapping Sequence error correction	programming library	C++	CeCILL-C license	Linux Linux 64 Mac OS X any
GlobalSeq	Testing for association between RNA-Seq and high-dimensional data	RNA-Seq ChIP-Seq MicroRNA-Seq Meth-Seq	Omnibus test Global test Joint significance test		R	GPL-3	cross-platform
GMAP	GMAP (Genomic Mapping and Alignment Program) for mRNA and EST Sequences.		Read mapping Sequence alignment		C Bourne shell		UNIX
Gnumap	The Genomic Next-generation Universal MAPper (gnumap) is a program designed to accurately map sequence data obtained from next-generation sequencing machines (specifically that of Solexa/Illumina) back to a genome of any size. Currently, gnumap is designed to be used with the _int.txt data received from the Solexa/Illumina machine.		Read mapping		C++
Goby framework	Goby is a next-gen data management framework designed to facilitate the implementation of efficient next-gen data analysis pipelines.	RNA-Seq	File reformatting		Java	GPLv2
Golden Helix	Golden Helix is a bioinformatic software provider and analytic service provider.	Genomics SNP detection Epigenomics Sequencing Copy number estimation Data handling	Annotation Genome visualisation Sequencing quality control Sequence contamination filtering Variant calling Statistical calculation Collapsing methods Variant classification				Windows Linux Mac OS X
Goseq	An R package to detect Gene Ontology (GO) categories and other categories of genes (such as KEGG pathways) that are over/under represented in an RNA-seq data.	RNA-Seq quantification	Gene set testing		R	LGPL	UNIX Windows
Gowinda	Gowinda: unbiased analysis of gene set enrichment for Genome Wide Association Studies	Genomics Sequencing Population genetics Population Genomics	Ontology comparison Functional enrichment Genome-wide association study	Multicore	Java	Mozilla Public License	Mac OS X Linux Windows
GPS	GPS is a high spatial resolution peak detection algorithm for ChIP-seq data.	Genomics ChIP-seq Epigenomics Transcription factors and regulatory sites	Peak calling	multi-threading	Java	Commercial Freeware	Cross-Platform
GPSeq	Analyze RNA-seq data to estimate gene and exon expression, identify differentially expressed genes, and differentially spliced exons	RNA-Seq quantification			R C
GRIDSS	GRIDSS is a structural variant caller which combines whole genome breakend assembly with read pair and split read support using a probabilistic model.	Genomics Structural variation Genetic variation analysis	Sequence assembly Split-read mapping Read alignment Assembly Whole genome breakend assembly		Java	GPLv3	Cross-Platform
GRS	Reference-based data compression for storage of resequencing data		File reformatting	sequence compression	C Bourne shell	Commercial Freeware	Linux Linux 64
GSNAP	GSNAP can align both single-end and paired-end reads as short as 14 nt and of arbitrarily long length. It can detect short- and long-distance splicing, including interchromosomal splicing, in individual reads using probabilistic models or a database of known splice sites. Our program also permits SNP-tolerant alignment to a reference space of all possible combinations of major and minor alleles, and can align reads from bisulfite treated DNA for the study of methylation state.	DNA methylation RNA-Seq alignment	Read mapping Bisulfite mapping		C Perl
Hairpin Annotation	Generates a secondary structure from an RNA sequence and highlights regions of interest using RNAplot	Workflows			Java	Custom Licence	Linux 64 Windows Mac OS X
Haplowser	Haplowser: comparative haplotype browser for personal genome and metagenome		Visualisation Haplotype inference		Java	GPL
HawkEye	An interactive visual analytics tool for genome assemblies.		Sequence assembly visualisation	Sequence assembly visualisation	C++	Artistic License	Linux Mac OS X
HeliSphere	Open-source LINUX software package intended for use in analyzing data produced by the HeliScope Single Molecule Sequencer.	Whole genome resequencing Genomics SNP detection RNA-Seq	Mapping			Freeware	Linux
HI	Program for haplotype reconstruction from paired-end reads.		Haplotype reconstruction		Java
Hicup	A mapping pipeline for HiC interaction data. Performs independent mapping on each end of the interaction pair and removes commonly found artefacts.	Epigenomics	Read mapping		Perl	GPLv3	UNIX Linux Mac OS X
HINT	HMM-based Identification of TF Footprints	Epigenomics Transcription factors and regulatory sites Regulatory genomics	Peak calling Nucleic acid sequence feature detection	Digital Genomic Footprinting	Python	GPLv3	Unix-like
HiPipe	HiPipe is to make NGS data analysis quick and easy with high performance pipelines and intuitive web GUI.	Genomics	Read mapping Variant calling Workflows		JavaScript Java Bash		platform-independent
HiTEC	An algorithm which provides a highly accurate, robust, and fully automated method to correct reads produced by high-throughput sequencing methods.		Sequence error correction		C++	GPLv3	Linux
HMMSplicer	Splice junction discovery in RNA-Seq data	RNA-Seq alignment			Python
HPeak	Hidden Markov model (HMM)-based Peak-finding algorithm for analyzing ChIP-seq data to identify protein-interacting genomic regions.	ChIP-seq
HTSeq	Python framework to process and analyse high-throughput sequencing (HTS) data				Python	GPLv3
Hybrid-SHREC	Improves sequence data quality using information from multiple platforms.		Sequence error correction		Java
IBD2	Our algorithm uses a non-homogeneous hidden Markov model (HMM) that employs local recombination rates to identify chromosomal regions that are identical by descent (IBD=2) in children of consanguineous or non-consanguineous parents solely based on genotype data of siblings derived from high-throughput sequencing platforms.	Exome capture			R Java
Ibis	Ibis (Improved base identification system), is an accurate, fast and easy-to-use base caller for the Illumina sequencing system, which significantly reduces the error rate and increases the output of usable reads. Ibis is faster and makes fewer assumptions about chemistry and technology	Sequencing	Base-calling	Statistical learning of base calling parameters and calibrated quality scoring	Python C C++	Non-commercial	Linux Windows (Cygwin)
ICORN	Iteratively aligns deep coverage of short sequencing reads to correct errors in reference genome sequences and evaluate their accuracy.		Sequence assembly Sequencing quality control
IDBA	IDBA (Iterative De Bruijn graph short read Assembler) is a short read assembler based on iterative De Bruijn graph. It is developed under 64-bit Linux, but should be suitable for all unix-like system	Sequence assembly (de novo assembly)	Sequence assembly				POSIX Linux Linux 64
IGV	The Integrative Genomics Viewer (IGV) is a high-performance visualization tool for interactive exploration of large, integrated datasets. It supports a wide variety of data types and format, including short-read alignments in the SAM/BAM format. Data can be viewed from local files or over the web via http.	Genomics	Visualisation		Java	LGPL	Windows Mac OS X Linux
Illuminate	Analytics toolkit in Python for Illumina HiSeq and MiSeq metrics	Genomics	Sequencing quality control	object-oriented access to results of binary parsing some command line support	Python	MIT	Unix-like
Illuminator	Software for machines running Windows to identify variants in Illumina short read data.	SNP detection Indel detection
IMAGE	“Iterative Mapping and Assembly for Gap Elimination”. IMAGE closes gaps in a draft assembly using Illumina paired-end reads.		Sequence assembly editing
Inchworm	Employs the Kmer graph method to reconstruct (in many cases full-length) transcripts from Illumina RNA-Seq (preferrably strand-specific) reads.	Sequence assembly (de-novo assembly) RNA-Seq Transcriptome assembly
InGAP	inGAP is an integrated platform for next-generation sequencing project, the core function of which is to detect SNPs and indels using a Bayesian algorithm.	SNP detection	Read mapping Sequence assembly visualisation
Ingenuity Variant Analysis	Ingenuity Variant Analysis is a web application that helps researchers studying human disease to identify causal variants	Whole genome resequencing Genomics SNPs Exome	Variant classification
Integrated Genome Browser	Visualisation software for next-generation genomics	Genomics	Visualisation		Java	Open Source	platform-independent
Iobio	Real-time Genomic Analysis iobio uses immediate visual feedback to make understanding complex genomic datasets more intuitive, and analysis more interactive	Workflows	Visualisation
IOmics	iOmics is a cloud based workflow analysis framework for managing, analyzing and visualizing NGS data.	Genomics Transcriptomics ChIP-seq RNA-Seq Regulatory RNA Exome capture Epigenomics	Sequence assembly Sequence alignment Ab-initio gene prediction Variant calling MiRNA analysis Exome analysis			Commercial	cloud
IQSeq	Integrated Isoform Quantiﬁcation Analysis based on A Partial Sampling Framework	RNA-Seq quantification Alternative splicing			C++
ISAAC	ISAAC comprises of genome aligner and variant caller, by Illumina.			Runtime Speed	C++		Linux 64
Isas	Fast aligner for color and base space short read data.		Sequence alignment				Linux
IsoEM	Expectation maximization algorithm for estimating alternative splicing isoform frequencies	Alternative splicing	Expectation Maximisation		Java
ISSAKE	Short Sequence Assembly by K-mer search and 3' read Extension, Immunology version (iSSAKE)	Metagenomics	Sequence assembly		Perl Python	GPLv2
JBrowse	Slick, speedy genome browser with a responsive and dynamic AJAX interface for visualization of genome data. Being developed by the GMOD project as a successor to GBrowse.		Visualisation		Perl JavaScript	Open Source	browser based
Jellyfish	Fast, memory-efficient k-mer counting algorithm				C++	GPLv3	Linux 64 Mac OS X
JointSLM	Copy number estimation from read depth information	Copy number estimation			R
KARMA	K-tuple Alignment with Rapid Matching Algorithm	DNA methylation Sequencing Epigenetics	Read mapping
KBASE	"KBase provides a computational framework and tools for integrating and analyzing large, diverse datasets generated by the scientific community to advance predictive understanding, manipulation, and design of biological processes in an environmental context. The purpose of KBase is to enable users to integrate a wide spectrum of genomics and systems biology data, models, and information for microbes, microbial communities, and plants. Powerful tools within KBase will allow users to analyze and simulate data to predict biological behavior, generate and test hypotheses, design new biological functions, and propose new experiments. "	Comparative genomics	Annotation				Linux
Kismeth	Web-based tool for bisulfite sequencing analysis	DNA methylation Epigenomics	Bisulfite mapping
Kissnp	kisSnp compares two sets of NGS raw reads, detecting Single Nucleotide Polymorphism occurring between the two sets. The two sets typically come from the sequencing of two individuals from the same species or from closely related species.	Transcriptomics SNP detection Indel detection Comparative genomics	Sequence assembly Sequence assembly (de-novo assembly) Data retrieval	SNP calling	C	CeCILL	Linux
Kmergenie	KmerGenie estimates the best k-mer length for genome de novo assembly. Given a set of reads, KmerGenie first computes the k-mer abundance histogram for many values of k. Then, for each value of k, it predicts the number of distinct genomic k-mers in the dataset, and returns the k-mer length which maximizes this number. Experiments show that KmerGenie's choices lead to assemblies that are close to the best possible over all k-mer lengths.	Sequence assembly	Sequence assembly		C++ Python R
KNIME	Software for organizing bioinformatic workflows		Workflows			GPLv3	Windows Mac OS X Linux
Knime4Bio	custom nodes for the interpretation of Next Generation Sequencing data with KNIME.	Genomics Transcription factors and regulatory sites Gene regulation	Data retrieval	KNIME	Java	GPLv3	any
Krona	Krona creates interactive HTML5 charts of hierarchical data (such as taxonomic abundance in a metagenome).	Metagenomics	Visualisation	Interactive Animation HTML5 canvas graphics	JavaScript Perl		Linux UNIX Mac OS X
Lab7	Data workflow management platform to streamline NGS analyses	Genomics Laboratory information management	Workflows		Python JavaScript	Commercial	Mac OS X Linux
Lasergene	Lasergene is a comprehensive DNA and protein sequence analysis software suite comprised of seven applications which include functions ranging from sequence assembly and SNP detection, to automated virtual cloning and primer design.	Sequence assembly (de novo assembly) Genomics SNP detection Indel detection Mapping Transcription factors and regulatory sites Sequence analysis Phylogenetics Protein structure analysis	Sequence assembly Read mapping Sequence alignment Peak calling Annotation Sequence analysis Scaffolding Sequence alignment analysis Chromatogram visualisation PCR primer design			Commercial	Windows XP Windows 7 or Higher Mac OS X 10.7 10 8 or 10.9
LAST	Short read alignment program incorporating quality scores	Genomics Comparative genomics	Sequence alignment		C++	GPLv3
LASTZ	A tool for (1) aligning two DNA sequences, and (2) inferring appropriate scoring parameters automatically	Genomics	Read mapping Sequence alignment				Mac OS X Linux
LobSTR	lobSTR is an alignment and genotyping tool for profiling short tandem repeats from next generation sequencing data	Sequencing	Repeat sequence organisation analysis	Fast Scalable sequence alignment Gapped alignment	C++ R Python	Freeware	UNIX
LOCAS	LOCAS low-coverage short-read assembler		Sequence assembly		C++		Linux
LookSeq	AJAX-based browser for deep sequencing data		Sequence assembly visualisation
LSKB, Life Science Knowledge Bank	LSKB Life Science Knowledge Bank, is a comprehensive drug discovery and genomic research workbench, knowledgebase, & data management system.	Genomics	Annotation SNP calling Sequence analysis Differential expression analysis Pathway or network analysis Classification Analysis pipeline Functional analysis
MACS	Model-based Analysis of ChIP-seq data.	ChIP-seq	Peak calling		Python	Artistic License	platform-independent
MagicViewer	Large-scale short reads and sequencing depth visualization.	Sequence assembly (de novo assembly) Exome capture Genetic variation	Visualisation SNP calling		Java		platform-independent
MapDamage	Identifies and quantifies DNA damage patterns in ancient DNA	Sequencing DNA	Sequencing quality control Statistical calculation		Python R		Linux Mac OS X
MapNext	MapNext provides four mainly analysis: (i) unspliced alignment and clustering of reads, (ii) spliced alignment of transcriptomic reads, (iii) SNP detection and calculation of SNP frequency from population sequences, and (iv) storage of result data into database to make it available for more flexible query and further analyses.	RNA-Seq alignment SNP detection	Sequence alignment		C++ Perl
Mapsembler	Mapsembler is a targeted assembly software. It takes as input a set of NGS raw reads and a set of input sequences (starters). It first determines if each starter is read-coherent, e.g. whether reads confirm the presence of each starter in the original sequence. Then for each read-coherent starter, Mapsembler outputs its sequence neighborhood as a linear sequence or as a graph, depending on the user choice.	Transcriptomics Metagenomics Exome capture RNA-Seq quantification Sequencing	Sequence assembly Read mapping	De novo assembly Identify Novel Exons Remove contaminants Detect enzymes in metagenomics NGS reads	C	CeCILL	Linux
MapSplice	We introduce a second generation splice detection algorithm, MapSplice, whose focus is high sensitivity and specificity in the detection of splices as well as CPU and memory efficiency. MapSplice can be applied to both short (<75 bp) and long reads (75 bp). MapSplice is not dependent on splice site features or intron length, consequently it can detect novel canonical as well as non-canonical splices. MapSplice leverages the quality and diversity of read alignments of a given splice to increase accuracy.	RNA-Seq alignment	Mapping		C++ Python		Linux
MapView	Visualisation of short reads alignment on desktop computer		Visualisation				Linux Windows
MAQ	Mapping and Assembly with Qualities (renamed from MAPASS2). Particularly designed for Illumina-Solexa 1G Genetic Analyzer, and has preliminary functions to handle ABI SOLiD data.	Genomics SNP detection	Read mapping		C++ Perl	GPL
MAQGene	Complete pipeline for mutant discovery, with web front end	SNP detection	Read mapping Integrated solution
MARGARITA	SNP detection and genotyping from low-coverage sequencing data	SNP detection Genotyping
Mason	A fast, feature-rich and hackable read simulator for the simulation of NGS and Sanger data.	Genomics	Sequence assembly Mapping Modelling and simulation	Empirical or simple model for position dependent errors can write out sample position and extensive information about the sampled infix haplotype simulation through mutation of reference sequence.	C++	GPLv3	UNIX Windows
Mauve	Mauve Genome Alignment software, for comparing two or more draft or finished genomes	Genomics Transcriptomics	Visualisation Sequence assembly validation Sequence alignment comparison		C++ Java	GPL	Mac OS X Windows Linux
MAXIMUS	Hybrid reference and de novo assembly pipeline	Genomics	Sequence assembly
MAYDAY	Extensible platform for visual data exploration and interactive analysis and provides many methods for dissecting complex transcriptome datasets.	RNA-Seq	Visualisation
Meerkat	Meerkat is designed to identify structure variations (SVs) from paired end high throughput sequencing data.	Structural variation
MEGAN	Metagenome Analysis Software - MEGAN (âMEtaGenome ANalyzerâ) is a new computer program that allows laptop analysis of large metagenomic datasets. In a preprocessing step, the set of DNA reads (or contigs) is compared against databases of known sequences using BLAST or another comparison tool. MEGAN can then be used to compute and interactively explore the taxonomical content of the dataset, employing the NCBI taxonomy to summarize and order the results.	Metagenomics		metagenomic analysis functional classification		Commercial Freeware
Megraft	Megraft is a software tool to graft ribosomal small subunit (16S/18S) fragments from metagenomes onto full-length SSU sequences, enabling accurate diversity estimates from fragmentary and non-overlapping sequence data.	Metagenomics Sequence analysis Phylogenetics Community analysis Rarefaction	Sequence analysis		Perl	GPLv3	Linux UNIX Mac OS X
Meraculous	De novo genome assembler from short reads		Sequence assembly	De novo assembly scaffolding	Perl C
METAGENassist	User-friendly, web-based analytical pipeline for comparative metagenomic studies. Input can be derived from either 16S rRNA data or NextGen shotgun sequencing.	Metagenomics Machine learning	Visualisation Statistical calculation Sequence clustering	Easy-to-use point-and-click web interface; data visualization; publication-quality graphs and charts; wide variety of statistical methods; taxon-to-phenotype mapping; data filtering and normalization; supports many common input formats
Metagenome@Kin	Metagenome@Kin is an automated high throughput sequencing data analysis tool for the 16S/28S/ITS rRNA genes. It measures the composition and diversity of microbial and fungal species in natural environments.	Metagenomics	Species frequency estimation Statistical analysis
MetaSim	The software can be used to generate collections of synthetic reads.	Genomics Metagenomics	Sequence assembly Mapping Modelling and simulation		Java	Commercial Freeware
Metaxa	Metaxa uses Hidden Markov Models to identify, extract and classify small-subunit (SSU) rRNA sequences (12S/16S/18S) of bacterial, archaeal, eukaryotic, chloroplast and mitochondrial origin in metagenomes and other large sequence sets	Metagenomics Sequence analysis Phylogenetics Community analysis	Sequence analysis Hidden Markov Model		Perl	GPLv3	Linux UNIX Mac OS X
MethMarker	MethMarker facilitates the design of DNA methylation assays for COBRA, bisulfite SNuPE, bisulfite pyrosequencing, MethyLight and MSP. It also implements a systematic workflow for design, optimization and (computational) validation of DNA methylation biomarkers. This workflow starts from a preselected differentially methylated region (DMR) and results in an optimized DNA methylation assay that is ready to be tested in a large-scale clinical trial.	DNA methylation Epigenomics			Java		Windows Linux Mac OS X Solaris
Methpipe	The MethPipe software package is a computational pipeline for analyzing bisulfite sequencing data (BS-seq, WGBS and RRBS). MethPipe provides tools for mapping bisulfite sequencing read and estimating methylation levels at individual cytosine sites. Additionally, MethPipe includes tools for identifying higher-level methylation features, such as hypo-methylated regions (HMR), partially methylated domains (PMD), hyper-methylated regions (HyperMR), and allele-specific methylated regions (AMR).	DNA methylation Sequencing Epigenetics	Bisulfite mapping		C++	GPL (>= 3)	Linux Mac OS X
MethylCoder	Pipeline for fast, simple processing of BiSulfite-treated reads into methylation data. Includes scripts for analysis and visualization. In addition to a binary output, the direct output of methylcoder is a text file that indicates per-nucleotide methylation context (CG/CHG/CHH) and methylation levels (both coverage and C-T conversions)	Genomics DNA methylation Epigenomics Sequencing	Read mapping Bisulfite mapping		Python C	BSD	Linux Linux 64 Mac OS X
MetMap	Produces corrected site-specific methylation states from MethylSeq experiments and annotates unmethylated islands across the genome.	DNA methylation
MeV	Visualisation of genomic data, Differential Gene Expression based on DEGseq, DESeq and edgeR	RNA-Seq	Visualisation Differential expression analysis Sequence clustering Classification			Artistic License
MG-RAST	MG-RAST is a fully-automated service for annotating metagenome samples providing analysis tools for comparison	Metagenomics Phylogenetics Metabolic reconstruction	Annotation		Perl C GO JavaScript Python		UNIX
MicroRazerS	MicroRazerS is a tool optimized for mapping short RNAs onto a reference genome.		Read mapping		C++		Linux
Microsoft Biology Foundation	C#/.NET library for biological applications.				C#
MICSA	Combines positional information with information on motif occurrences to better predict binding sites of transcription factors (TFs)	ChIP-seq Sequence motifs	Sequence motif comparison
Minia	De novo assembly of human genomes on a desktop computer	Sequence assembly (de novo assembly)	Sequence assembly	Memory efficient and fast	C++	CeCILL	Linux Mac OS X
MIP Scaffolder	MIP Scaffolder is a program for scaffolding contigs produced by fragment assemblers using mate pair data.		Scaffolding		C++ Perl		Linux
MIRA	MIRA 3 - Whole Genome Shotgun and EST Sequence Assembler	Sequence assembly (de novo assembly) RNA-Seq alignment SNP detection	Sequence assembly Read mapping Local sequence alignment K-mer counting Graph reduction Learning algorithm		C++	GPL	Linux Mac OS X UNIX
MiRanalyzer	Web-server for identifying and analyzing miRNA in next-gen sequencing experiments	Regulatory RNA		Annotation of micro RNA differential expression	Java Perl		browser based
MiRCat	Predicts mature miRNAs and their precursors from an sRNA dataset and a genome.	Workflows	MicroRNA detection	Detection and prediction of known or novel miRNAs secondary structure generation	Java	Custom Licence	Linux 64 Windows Mac OS X
MiRDeep	Discovering known and novel miRNAs from deep sequencing data	Regulatory RNA			Perl
MiRNAkey	A software pipeline for the analysis of microRNA Deep Sequencing data	Regulatory RNA			Java Perl		Linux Mac OS X
MiRProf	Determines normalised expression levels of sRNAs matching known miRNAs in miRBase.	Workflows	MicroRNA detection		Java	Custom Licence	Linux 64 Windows Mac OS X
MiRspring	missing				Perl JavaScript	GNU
MirTools	Web server for microRNA profiling and discovery based on high-throughput sequencing	Transcriptomics Regulatory RNA			Perl PHP
MirTrios	mirTrios is a web server to accurately detect de novo mutations (DNMs) based on Expectation-maximization (EM) model. mirTrios also surports identification of rare inherited mutations, known diagnostic variants, as well as the prioritisation of novel and promising candidate genes.	Genomics Sequence analysis De novo mutation detection	Read mapping Variant calling De novo mutation detection		Perl PHP JavaScript		Linux
MISO	An alternative to Cufflinks, MISO (Mixture-of-Isoforms) is a probabilistic framework that quantitates the expression level of alternatively spliced genes.	RNA-Seq RNA-Seq quantification Alternative splicing
Mlgt	Processing and analysis of high throughput, long-read (e.g. Roche 454) sequences generated from multiple loci and multiple biological samples. Sequences are assigned to their locus and sample of origin, aligned and trimmed. Where possible, genotypes are called and variants mapped to known alleles.	Exome capture Genotyping Resequencing	Sequence error correction Sequence contamination filtering Sequence analysis Read alignment DNA barcoding	Sequence assignment sequence alignment allignment error correction variant counting genotype calling allele-matching	R	GPL >=2	Windows UNIX Mac OS X
MMSEQ	Pipeline and methodology for simultaneously estimating isoform expression and allelic imbalance in diploid organisms using RNA-seq data.	DNA transcription			C++		Mac OS X Linux 64
MochiView	Hybrid genome browser and motif visualization/analysis/management desktop software.	Genomics ChIP-seq RNA-Seq ChIP-on-chip Sequence motifs	Genome visualisation Sequence motif comparison	Desktop hybrid genome browser and motif visualization/analysis software	Java		Linux Mac OS X Windows
MoDIL	Program to detect small indels in next generation sequencing data	Genomics Indel detection			Python
MOM	Short-read mapping	Genomics	Read mapping
MOSAIK	Reference guided aligner/assembler.		Sequence assembly		C++	Commercial GPLv2	Windows Linux Mac OS X
MotifLab	MotifLab is a general workbench for analysing regulatory sequence regions and discovering transcription factor binding sites and cis-regulatory modules within a rich graphical interface.	ChIP-seq Sequence motifs Transcription factors and regulatory sites Sequence analysis Gene regulation	Visualisation Peak calling Genome visualisation Sequence contamination filtering Sequence motif discovery Format conversion	Complete motif discovery workbench: import DNA sequences and download additional data tracks from UCSC Genome Browser visualize tracks in the integrated genome browser perform de novo motif discovery or search for instances of known motifs use operations to pre- and post-process data tracks explore the data interactively with various analysis tools automatically record all your steps in protocols that can be executed as workflows	Java	Freeware	platform-independent
MPscan	MPscan (multi-pattern scan) is a program for mapping short reads (<30bp) exactly on a set of reference sequences (eg, a genome) without indexing the reference. MPscan performs only exact mapping (no substitution, nor indels), is fast (optimal complexity), and easy to use.	Genomics Transcriptomics	Read mapping		C++		Linux Mac OS X
MrBayes	"MrBayes is a program for Bayesian inference and model choice across a wide range of phylogenetic and evolutionary models. MrBayes uses Markov chain Monte Carlo (MCMC) methods to estimate the posterior distribution of model parameters."	Phylogenetics	Statistical calculation				Cross-Platform
MrCaNaVaR	mrCaNaVaR is a copy number caller that analyzes the next-generation sequence mapping read depth to discover large segmental duplications and deletions. It also has the capability of predicting absolute copy numbers of genomic intervals.	Genomics Personalised medicine Copy number estimation	Read depth analysis		C	Commercial Freeware	POSIX
MrFAST	mrFAST is designed to map short reads generated with the Illumina platform to reference genome assemblies; in a fast and memory-efficient manner.	Genomics Read alignment	Read alignment		C	BSD	UNIX
MrsFAST	mrsFAST is a micro-read substitution-only Fast Alignment Search Tool. mrsFAST is a cache-oblivous short read mapper that optimizes cache usage to get higher performance.	Genomics	Mapping Read alignment		C	BSD	UNIX
MTR	Metagenomics software for clustering at multiple ranks.	Metagenomics			C++ Matlab
MU2A	Genomic variant annotation tool	SNP	SNP annotation		Java	Apache License 2.0	Windows Linux Mac OS X
MultiPSQ	MultiPSQ expedites the analysis and evaluation of multiplex-pyrograms, generated from multiplex pyrosequencing	Phylogenetics Microbial Surveillance Public health and epidemiology	Peak calling Classification Peptide mass fingerprint		C++	LGPLv3	Windows Linux
MultiQC	MultiQC aggregates results from multiple bioinformatics analyses across many samples into a single report. It searches a given directory for analysis logs and compiles a HTML report. It's a general use tool, perfect for summarising the output from numerous bioinformatics tools.	Next Generation Sequencing	Quality control	Data Visualisation	Python	GPLv3	Windows Mac OS X Linux
MUMmer	MUMmer is a modular system for the rapid whole genome alignment of finished or draft sequence. Basically it is a ultra-fast alignment of large-scale DNA and protein sequences	Genomics Transcriptomics	Read mapping Sequence alignment			Artistic License	Linux
MUMmerGPU	MUMmerGPU is a low cost, ultra-fast sequence alignment program designed to handle the increasing volume of data produced by HTS.	Genomics Transcriptomics	Sequence alignment
MuMRescueLite	Probabilistically reincorporates multi-mapping tags into mapped short read data.	Genomics ChIP-seq	Mapping		Python	MIT
MuSICA 2	Assembles millions of short (36-nucleotide) reads collected from a single flow cell lane of Illumina Genome Analyzer to shotgun-sequence ~800 human full-length cDNA clones.	Clone verification	Sequence assembly		Perl
Mutascope	Mutascope is a software suite designed to analyze data from high throughput sequencing of PCR amplicons, with an emphasis on normal-tumor comparison for the accurate and sensitive identification of low prevalence mutations.	Cancer biology	Variant calling Analysis pipeline		Perl		UNIX
MuTect	MuTect is a method developed at the Broad Institute for the reliable and accurate identification of somatic point mutations in next generation sequencing data of cancer genomes.		SNP calling
Myrialign	Software to align short reads produced by a short read genome sequencer to a reference genome. Alignments can contain any number of SNPs, insertions and deletions, up to a user specified cutoff. Myrialign can use a Cell Broadband Engine processor to accelerate alignments if available, for example on a PlayStation 3 running GNU/Linux. Myrialign performs brute force alignment using a variant on the "bitap" algorithm that aligns several thousand reads to a reference in parallel. It uses bit-parallelism, multiple processors, and Cell SPUs if available. Unlike other reference genome alignment software, heuristics and hashtable lookups are not used. Myrialign will find alignments with any number of errors up to a user specified cutoff. The emphasis is on doing a 100% accurate search as fast as is possible.		Read mapping Sequence alignment
Myrna	Myrna is a cloud computing tool for calculating differential gene expression in large RNA-seq datasets.	RNA-Seq alignment RNA-Seq quantification
Mzip	Reference-based sequence data compression tool		File reformatting
NarrowPeaks	Analysis of variation in ChIP-seq using functional PCA	ChIP-seq	Peak calling Differential Binding	R/Bioconductor package can run on major computer platforms	R	Artistic-2.0	Linux Mac OS X Windows
NCBI Genome Workbench	"NCBI Genome Workbench is an integrated application for viewing and analyzing sequence data. With Genome Workbench, you can view data in publically available sequence databases at NCBI, and mix this data with your own private data."	Sequencing Sequence analysis Sequence annotation Next Generation Sequencing	Visualisation Genome visualisation				Cross-Platform
Nephele	NIAID, a cloud based tool for microbiome or metagenomics data analysis. Nephele allows you to process and analyze 16S raw data using pipelines based on QIIME and mothur. It also facilitates WGS functional analyses using bioBakery tools such as MetaPhlAn, HUMMAnN and metagenomic assembly using the a5-miseq/UDBA-UD.	Metagenomics Microbial Ecology Microbiome	Functional analysis Taxonomic profiling Metagenomic Assembly	Pipelines
Nesoni	Nesoni is a high-throughput sequencing data analysis toolset.	RNA-Seq alignment SNP detection Phylogenetics	Sequence alignment	largely for bacterial genomes	Python
Newbler	The assembly/mapping program developed by 454 Life Sciences for of 454 data	Sequence assembly (de novo assembly)	Sequence assembly Read mapping		C++	Unknown	Linux 64
Nexalign	Nexalign is a program to align millions of short reads from next-generation sequencing data sets to reference genomes		Read mapping		C++ R	GPL	UNIX
NextGen Utility Scripts	A collection of links to scripts available for working with data generated by new sequencing technologies.			A collection of many different scripts
NextGENe	de novo and reference assembly of Roche/454, Illumina and SOLiD data. Uses a novel Condensation Assembly Tool approach where reads are joined via "anchors" into mini-contigs before assembly which reduces sequencing errors. Requires Win or MacOS.	Sequence assembly (de novo assembly) SNP detection Indel detection Metagenomics Exome capture		Unique condensation tool Data Visualisation very flexible	C++	Commercial	Windows
Ngs backbone	ngs_backbone is a bioinformatic application created to work on sequence analysis by using NGS (Next Generation Sequencing) and sanger sequences. It is capable of cleaning reads, do de novo assembly or mapping against a reference and annotate SNPs, SSRs, ORFs, GO terms and sequence descriptions.	Genomics SNP detection	Sequence assembly Read mapping			AGPL	UNIX
NGS-DesignTools	Tools to assist in designing deep sequencing experiments for haplotype reconstruction and structural variant breakpoint detection	RNA-Seq quantification Structural variation	Haplotype inference Modelling and simulation
Ngs-pipeline	Complete solution for human re-sequencing projects	SNP detection Indel detection Epigenomics Structural variation Personalised medicine	Read mapping Sequence annotation		Perl	GPLv3	Linux
Ngs.plot	ngs.plot is a program that allows you to easily visualize your next-generation sequencing (NGS) samples at functional genomic regions. The signature advantage of ngs.plot is that it collects a large database of functional elements for many genomes. A user can ask for a functionally important region to be displayed in one command. It handles large sequencing data efficiently and has only modest memory requirement. A web-based version (integrated into Galaxy) is also available for the ones who are allergic to terminals.	Transcriptomics Epigenomics	Visualisation	Data Visualisation	R Python	GPL (>= 3)	All
NGSUtils	NGSUtils is a suite of software tools for working with next-generation sequencing datasets	Genomics Transcriptomics	Formatting Sequencing quality control Sequence contamination filtering Variant calling Read pre-processing		Python	GPL	Linux Mac OS X
NGSView	High-throughput sequencing technologies introduce novel demands on tools available for data analysis. We have developed NGSView, a generally applicable, flexible and extensible next-generation sequence alignment editor. The software allows for visualization and manipulation of millions of sequences simultaneously on a desktop computer, through a graphical interface. NGSView is available under an open source license and can be extended through a well documented API.	Genomics	Sequence assembly visualisation
NOISeq	Next Generation Sequencing (NGS) technologies are increasingly being used for gene expression pro�filing as a replacement for microarrays. The expression level given by these technologies is the number of reads in the library mapping to a given feature (gene, exon, transcript, etc.), i.e., the read counts. Most of the statistical methods for assessment of differential expression using count data rely on parametric assumptions about the distribution of the counts (Poisson, Negative Binomial, …). Moreover, many of them need replicates to work and tend to have problems to evaluate differential expression in features with low counts. NOISeq is a non-parametric approach for the identification of differentially expressed genes from count data. NOISeq empirically models the noise distribution of count changes by contrasting fold-change differences (M) and absolute expression differences (D) for all the features in samples within the same condition. This reference distribution is then used to assess whether the M-D values computed between two conditions for a given gene is likely to be part of the noise or represent a true differential expression. The are two variants of the method: NOISeq-real uses replicates, when available, to compute the noise distribution and, NOISeq-sim simulates them in absence of replication. It should be noted that the NOISeq-sim simulation procedure assimilates to technical replication and does not reproduce biological variability, which is necessary for population inferential analysis.	Gene expression	Differential expression analysis
NovelSeq	A computational framework to discover the content and location of long novel sequence insertions using paired-end sequencing data	Indel detection Structural variation	Sequence assembly Read mapping Variant calling		C	BSD	UNIX
Novocraft	Novoalign is a program for mapping short reads from the Illumina/SOLiD sequencing platform(s) to a reference genome.	Whole genome resequencing Genomics ChIP-seq RNA-Seq alignment Regulatory RNA	Read mapping	Bisulfite sequencing Mate-pair/jumping libraries parallel execution insertions/deletions SAM format output paired-end colourspace MPI	C++	Commercial Freeware	Mac OS X Linux 64
NPS	Identify nucleosome positions given histone-modification ChIP-seq or nucleosome sequencing at the nucleosome level.	ChIP-seq Epigenomics			Python
NucleR	nucleR is a R/Bioconductor package for working with tiling arrays and next generation sequencing. It uses a novel aproach in this field which comprises a deep profile cleaning using Fourier Transform and peak scoring for a quick and flexible nucleosome calling	ChIP-seq Epigenomics ChIP-on-chip DNA packaging	Peak calling Annotation	Multicore Integrated solution	R	LGPLv3	Cross-Platform
Oases	De novo transcriptome assembler for very short reads	Sequence assembly (de-novo assembly) Transcriptome assembly		supports strand specific and paired-end RNA-seq data sets	C	GPLv3
Oasis	Oasis is a web application that allows for the fast and flexible online analysis of small-RNA-seq (sRNA-seq) data. It was designed for the end user in the lab, providing an easy-to-use web frontend including video tutorials, demo data, and best practice step-by-step guidelines on how to analyze sRNA-seq data. Oasis' exclusive selling points are a differential expression module that allows for the multivariate analysis of samples, a classification module for robust biomarker detection, and an API that supports the batch submission of jobs. Both modules include the analysis of novel miRNAs, miRNA targets, and functional analyses including GO and pathway enrichment. Oasis generates downloadable interactive web reports for easy visualization, exploration, and analysis of data on a local system.	RNA	Read mapping Differential expression analysis	differential expression classification API miRNA target prediction	JavaScript Perl Java Python Bash C		all supporting JVM
ODIN	ODIN is an HMM-based approach to detect and analyse differential peaks in pairs of ChIP-seq data.	Genomics Regulatory genomics	Peak calling		Python		tested for Linux (Ubuntu)
OLego	OLego is a program specifically designed for de novo spliced mapping of mRNA-seq reads. OLego adopts a seeding and extension scheme, and does not rely on a separate external mapper. It achieves high sensitivity of junction detection by using very small seeds (12-14 nt), efficiently mapped using Burrows-Wheeler transform (BWT) and FM-index. This also makes it particularly sensitive for discovering small exons. It is implemented in C++ with full support of multiple threading, to allow fast processing of large-scale data.	Genomics RNA-Seq alignment RNA-Seq	Read mapping Sequence alignment	capable of using very small seeds for splice Read mapping but still fast and accurate	C++	GPLv3	Linux Linux 64 Mac OS X
Omixon Variant Toolkit	Omixon Target Standard, Target HLA and Target Pro are designed to help clinical, diagnostic and research labs to efficiently get the maximum accuracy and precision from their targeted NGS data.	SNP detection Indel detection Mapping Sequence analysis Comparative genomics	Sequence assembly Read mapping Sequence alignment	easy to use parameters full documentation also a plugin available in CLCbio and Geneious		Commercial Freeware	interoperable
Optimus Primer	Automated primer design for large-scale resequencing by second generation sequencing	Resequencing	PCR primer design
PacBio conversion tools	Tools to convert from PacBio HDF5 format to other commonly used formats & libraries to read HDF5 from Java & R		File reformatting		Java R Python
PaCGeE	PaCGeE (Parallel Computational Genomics Engine) is a suite of HPC accelerated sequence data analysis tools for assembly and analysis. The tool set comprises of many popular open source and proprietary software for a high performance, high throughput and high quality data analysis. The PaCGeE family of parallel NGS analysis tools are Cloud-MAQ, VELVET-P, EULER, ERANGE, BOWTIE, BFAST, MPI-BLAST, ChIP Seq Peak Finder etc		Read mapping			Commercial
PALMA	We present a novel approach based on large margin learning that combines accurate splice site predictions with common sequence alignment techniques. By solving a convex optimization problem, our algorithm -- called PALMA -- tunes the parameters of the model such that true alignments score higher than other alignments. We study the accuracy of alignments of mRNAs containing artificially generated micro-exons to genomic DNA. In a carefully designed experiment, we show that our algorithm accurately identifies the intron boundaries as well as boundaries of the optimal local alignment. It outperforms all other methods: for 5702 artificially shortened EST sequences from C. elegans and human it correctly identifies the intron boundaries in all except two cases. The best other method is a recently proposed method called exalin which misaligns 37 of the sequences. Our method also demonstrates robustness to mutations, insertions and deletions, retaining accuracy even at high noise levels.	RNA-Seq alignment	Sequence alignment
PALMapper	Fast and Accurate Spliced Alignments of Sequence Reads.		Read mapping		C++	GPLv3
PanGEA	Tool which enables a fast and user-friendly analysis of allele specific gene expression using the 454 technology.	SNP detection RNA-Seq DNA transcription			Perl	Mozilla Public License
PARalyzer	Tool to analyze cross-linking and immunoprecipitation data (CLIP)				Java	Commercial Freeware
Partek Genomics Suite	Easy to use software providing A to Z analysis for all Next Generation Sequencing and Microarray data.	Transcriptomics ChIP-seq SNP detection RNA-Seq quantification DNA transcription Epigenomics Alternative splicing Functional genomics
PASH	Pash 3.0 performs sequence comparison and read mapping and can be employed as a module within diverse configurable analysis pipelines, including ChIP-seq and methylome mapping by whole-genome bisulfite sequencing	DNA methylation Epigenomics	Sequence alignment Bisulfite mapping
PASS	PASS performs fast gapped and ungapped alignments of short DNA sequences onto a reference DNA, typically a genomic sequence. It is designed to handle a huge amount of reads such as those generated by Solexa, SOLiD or 454 technologies. The algorithm is based on a data structure that holds in RAM the index of the genomic positions of seed" words (typically 11-12 bases) as well as an index of the precomputed scores of short words (typically 7-8 bases) aligned against each other.		Sequence alignment		C++		Linux Windows
Patchwork	Patchwork is a bioinformatic tool for analyzing and visualizing allele-specific copy numbers and loss-of-heterozygosity in cancer genomes. The data input is in the format of whole-genome sequencing data which enables characterization of genomic alterations ranging in size from point mutations to entire chromosomes. High quality results are obtained even if samples have low coverage, ~4x, low tumor cell content or are aneuploid. Patchwork is available in two formats. The first, named simply patchwork, takes BAM files as input whereas patchworkCG takes input from CompleteGenomics files. Detailed guides and information regarding these can be found in their respective tabs.	Structural variation Copy number estimation		Allele specific copy numbers.	R		Linux Mac OS X
PatMaN	Patman searches for short patterns in large DNA databases, allowing for approximate matches. It is optimized for searching for many small pattern at the same time, for example microarray probes.		Read mapping
PE-Assembler	A simple 3' extension approach to assembling paired-end reads and capable of parallelization	Sequence assembly (de novo assembly)	Scaffolding		C++
PeakAnalyzer	PeakAnalyzer is a set of applications for processing ChIP signal peaks.	ChIP-seq Functional genomics			Java C++ R
PeakRanger	A multi-purpose, ultrafast ChIP Seq peak caller	ChIP-seq	Peak calling		C++	Artistic License	Linux Mac OS X
PeakSeq	PeakSeq enables systematic scoring of ChIP-seq experiments relative to controls. A methodology for identifying punctate binding sites in ChIP-seq experiments based on their characteristics. publication	ChIP-seq			C Perl
PeakTrace	PeakTrace is an alternative basecaller for improving the quality and read length of Sanger DNA sequencing traces. The PeakTrace basecaller works with trace files produced by the ABI 310, 3700, 3100, 3130, 3730, and 3500 DNA sequencers. MegBACE sequencers are also supported.	Sequencing	Base-calling	DNA basecaller	C	Commercial	Windows Mac OS X Linux 64
PECAN	Alignment method practical for large genomic sequences.		Sequence alignment
PEMer	The package is composed of three modules, PEMer workflow, SV-Simulation and BreakDB. PEMer workflow is a sensitive software for detecting SVs from paired-end sequence reads. SV-Simulation randomly introduces SVs into a given genome and generates simulated paired-end reads from the ‘novel’ genome. Subsequent analysis with PEMer workflow on the simulated reads can facilitate parameterize PEMer workflow. BreakDB is a web accessible database developed to store, annotate and dsplay SV breakpoint events identified by PEMer and from other sources.	Structural variation
PERalign	A probabilistic framework is described to predict the alignment to the genome of all paired-end read transcript fragments in a paired-end read dataset. Starting from possible exonic and spliced alignments of all end reads, our method constructs potential splicing paths connecting paired ends. An expectation maximization method assigns likelihood values to all splice junctions and assigns the most probable alignment for each transcript fragment.	RNA-Seq alignment			C++		Linux
PerM	PerM (Periodic Seed Mapping) uses periodic spaced seeds to significantly improve mapping efficiency for large reference genomes when compared to state-of-the-art programs.	Genomics SNP detection	Read mapping		C++	Apache License 2.0	Linux
Phred	The phred software reads DNA sequencing trace files, calls bases, and assigns a quality value to each called base.		Base-calling		C		Solaris IRIX AIX
Phred Phrap Consed Cross match	The phred software reads DNA sequencing trace files, calls bases, and assigns a quality value to each called base. Phrap is a program for assembling shotgun DNA sequence data. Cross_match is a general purpose utility for comparing any two DNA sequence sets using a 'banded' version of swat. Consed/Autofinish is a tool for viewing, editing, and finishing sequence assemblies created with phrap.		Sequence assembly Sequence alignment Base-calling Local sequence alignment
Phymm	A classifier for metagenomic data, that has been trained on 539 complete, curated genomes and can accurately classify reads as short as 100 base pairs	Metagenomics
PiCall	Identifies short indel polymorphisms in population sequencing data	Indel detection Population genetics			C
PICS	PICS identifies binding event locations by modeling local concentrations of directional reads, and uses DNA fragment length prior information to discriminate closely adjacent binding events via a Bayesian hierarchical t-mixture model.	ChIP-seq			R
PileLine	PileLine is a flexible command-line toolkit for efficient handling, filtering, and comparison of genomic position (GP) files produced by next-generation sequencing experiments. PileLineGUI adds a graphical interface.		Plotting and rendering		Java	LGPL
Pindel	A pattern growth approach to detect break points of large deletions and medium sized insertions from paired end short reads.	Indel detection Structural variation	Read mapping Split-read mapping Localised reassembly		C++		Linux Mac OS X Windows
Pipeline Pilot	Analysis and workflow development of Next Generation Sequencing and gene expression.	Sequence assembly (de novo assembly) Whole genome resequencing Genomics ChIP-seq SNP detection RNA-Seq Sequence analysis Gene expression analysis Next Generation Sequencing	Read mapping Sequence alignment Sequence analysis Variant calling Comparative genomics Gene expression analysis	Integrated solution wrapping custom and third party tools for integration analysis and reporting	C++ Java Perl R Pilot Script	Commercial	Linux Windows
PIQA	PIQA is a quality analysis pipeline designed to examine genomic reads produced by Next Generation Sequencing technology (Illumina G1 Genome Analyzer). It is a set of libraries for R.		Sequencing quality control		R
PoissonSeq	Identify differential expressed genes	Gene expression	Differential expression analysis
PolyBayesShort	A re-incarnation of the PolyBayes SNP detection tool developed by Gabor Marth at Washington University. This version is specifically optimized for the analysis of large numbers (millions) of high-throughput next-generation sequencer reads, aligned to whole chromosomes of model organism or mammalian genomes. Developers at Boston College.	SNP detection					Linux Linux 64
PoolHap	Computational tool for inferring haplotype frequencies from pooled samples when haplotypes are known. In future version, haplotype unknown analysis will be supported.		Read mapping Regression analysis
PoPoolation	Toolbox specifically designed for the population genetic analysis of sequence data from pooled individuals.	Population genetics			Perl R
PoPoolation2	PoPoolation2 allows to compare allele frequencies for SNPs between two or more populations and to identify significant differences. PoPoolation2 requires next generation sequencing data of pooled genomic DNA (Pool-Seq). It may be used for measuring differentiation between populations, for genome wide association studies and for experimental evolution.	Genomics Population genetics			Perl R
Post Assembly Genome Improvement Toolkit	" Tools to generate automatically high quality sequence by ordering contigs, closing gaps, correcting sequence errors and transferring annotation. With the advent of next generation sequencing a lot of effort was put into developing software for mapping or aligning short reads and performing genome assembly. For genome assembly the problem of generating a draft assembly (i.e. a set of unordered contigs) has now been very well addressed - but for users who need high quality assemblies for their analyses there are still unresolved issues: this is where PAGIT is used. "	Sequence assembly (de novo assembly)	Sequence assembly validation				Linux
Precrec	Precrec is a CRAN package that provides accurate computations of ROC and Precision-Recall curves.				R	GPL-3	Windows Mac OS X Unix-like
PRICE	PRICE uses paired-read information to iteratively increase the size of existing contigs.		Sequence assembly		C++
PRINSEQ	PRINSEQ is a sequence processing tool that can be used to filter, reformat and trim genomic and metagenomic sequence data. It generates summary statistics of the input in graphical and tabular formats that can be used for quality control steps. PRINSEQ is available as both standalone and web-based versions.	Genomics Transcriptomics Metagenomics	Sequence contamination filtering Sequence trimming Read pre-processing		Perl	GPLv3	UNIX Mac OS X Windows
ProbeMatch	Matches a large set of oligonucleotide sequences against a genome database using gapped alignments		Mapping				Linux Mac OS X
ProbHD	We present a new strategy for identifying heterozygous sites in a single individual by using a machine learning approach that generates a heterozygosity score for each chromosomal position. Our approach also facilitates the identification of regions with unequal representation of two alleles and other poorly sequenced regions. The availability of confidence scores allows for a principled combination of sequencing results from multiple samples.	SNP detection Population genetics			Perl R Python
Promisec Endpoint Manager	Promisec Endpoint Manager 2.41 uses patented, agentless technology to quickly and remotely inspect yan organisations endpoint environment to discover, analyze, and remediate any abnormalities that then in turn lead to failed audits and uncorrect business intelligence.		Security Endpoint Management Remediation
Proxygenes	We introduce a clustering method which significantly reduces the size of a metagenome dataset while maintaining a faithful representation of its functional and taxonomic content.	Metagenomics	Read mapping Annotation
Pybedtools	Python extension to BEDTools that allows use of all BEDTools programs directly from Python, as well as feature-by-feature manipulation, automatic handling of temporary files, and more.	Genomics	Read mapping	See full description	Python	GPLv2	Windows (Cygwin) Linux Linux 64 Mac OS X
PyroBayes	PyroBayes is a novel base caller for pyrosequences from the 454 Life Sciences sequencing machines.	SNP detection	Base-calling
PyroMap	PyroMap accurately maps pyrosequencing reads onto reference sequences using a selectively weighted Smith-Waterman (SW^2) algorithm to incorporate quality scores into alignment.		Read mapping		Python
PyroNoise	Clustering of pyrosequencing (454) data with noise model (AmpliconNoise) and chimaera removal (Perseus) for sequence diversity analysis.	Metagenomics Phylogenetics
QCALL	SNP detection and genotyping from low-coverage sequencing data on multiple diploid samples	SNP detection
Qpalma	QPalma is an alignment tool targeted to align spliced reads produced by Next Generation sequencing platforms	RNA-Seq alignment	Sequence alignment		Python C++
QRidge	Given a set of RNA-Seq data, QRidge assembles the short reads into long full-length transcripts.	Sequence assembly (de-novo assembly) Transcriptome assembly	Sequence assembly
QSeq	QSeq is DNASTAR's Next-Gen application for RNA-Seq, ChIP-seq, and miRNA alignment and analysis.	ChIP-seq RNA-Seq Regulatory RNA	Sequence alignment Visualisation Peak calling			Commercial	Windows 7 64-bit or Higher Mac OS X 10.7 10.8 or 10.9 with Parallels Desktop
QSRA	Quality-value guided Short Read Assembler, created to take advantage of quality-value scores as a further method of dealing with error. Compared to previous published algorithms, our assembler shows significant improvements not only in speed but also in output quality.	Sequence assembly (de novo assembly)	Sequence assembly
QuadGT	QuadGT is a software package for calling single-nucleotide variants in four sequenced genomes: normal-tumor pairs coupled with parents. Genotypes are inferred using a joint model of parental variant frequencies, de novo germline mutations, and somatic mutations. The model quantifies the descent-by-modification relationships between the unknown genotypes by using a set of parameters in a Bayesian inference setting.	SNPs	SNP calling Variant calling		Java
Quake	Program to detect and correct errors in DNA sequencing reads. Using a maximum likelihood approach incorporating quality values and nucleotide specific miscall rates,		Sequence error correction
QualiMap	Qualimap is a platform-independent application written in Java and R that provides both a Graphical User Inteface (GUI) and a command-line interface to facilitate the quality control of alignment sequencing data.		Sequencing quality control		Java R
QUAST	QUAST stands for QUality ASsessment Tool. It evaluates a quality of genome assemblies by computing various metrics and providing nice reports.	Sequence assembly Data handling	Visualisation Sequence assembly validation	Data Visualisation Assembly Quality Evaluation Detailed Reports	Python C Perl	GPLv2	Linux Mac OS X
QuEST	QuEST is a Kernel Density Estimator-based package for analysis of massively parallel sequencing data from chromatin immunoprecipitations (ChIP-seq or ChIPseq).	ChIP-seq			C++	GPLv2
Quip	Aggressive compression of FASTQ and SAM/BAM files.		File reformatting		C	BSD (3-clause)	any
R2R	R2R is a simple to use package for very sensitive analysis of short read sequence data obtained by NextGen sequencing techniques. R2R was developed in conjunction with data obtained on the Illumina GA platforms. R2R is written in simple Perl script and runs equally well under MS Windows, Mac OS and Linux/Unix operative systems.	SNP detection	Sequence alignment		Perl
R453Plus1Toolbox	Facilitates analysis of data from 454 sequencer in R/Bioconductor.				R
RACA	Reference-Assisted Chromosome Assembly (RACA)
RApiD	Tools for processing restriction site associated DNA sequencing.	SNP detection			Perl C++	GPLv3
RAPSearch	Fast protein similarity search tool for short reads that utilizes a reduced amino acid alphabet and suffix array to detect seeds of flexible length.	Metagenomics	Sequence alignment		C++	GPLv3
RAST	"RAST (Rapid Annotation using Subsystem Technology) is a fully-automated service for annotating complete or nearly complete bacterial and archaeal genomes. It provides high quality genome annotations for these genomes across the whole phylogenetic tree."	Genomics Phylogenetics	Genomics Annotation
Ray	de novo genome assembly is now a challenge because of the overwhelming amount of data produced by sequencers. Ray assembles reads obtained with new sequencing technologies (Illumina, 454, SOLiD) using MPI 2.2 -- a message passing inferface standard.	Sequence assembly (de novo assembly)	Sequence assembly	MPI 2.2 de Bruijn graphs parallel Illumina data	C++	GPL	Linux POSIX
RazerS	RazerS allows the user to align sequencing reads of arbitrary length using either the Hamming distance or the edit distance. The tool can work either lossless or with a user-defined loss rate at higher speeds.	Mapping	Read mapping Local sequence alignment	Gapped alignment paired-end mapping	C++	GPLv3	UNIX Mac OS X Windows
RDiff	rDiff is an open source tool for accurate detection of differential RNA processing from RNA-Seq data. It implements two statistical tests to detect changes of the RNA processing between two samples. rDiff.parametric is a powerful test, which can be applied for well annotated organisms to detect changes in the relative abundance of isoforms. rDiff.nonparametric is an alternative when the annotation is incomplete or missing.	Transcriptomics RNA-Seq Alternative splicing	Sequence alignment Differential expression analysis		Python Matlab	Open Source	Linux Mac OS X
RDP Pyrosequencing Pipeline	The Ribosomal Database Project's Pyrosequencing Pipeline aims to simplify the processing of large 16s rRNA sequence libraries obtained through pyrosequencing. This site processes and converts the data to formats suitable for common ecological and statistical packages such as SPADE, EstimateS, and R.	Metagenomics	Sequence alignment Formatting				browser based
Readaligner	A tool for mapping (short) DNA reads into reference sequences.		Read mapping
ReadDepth	Detects copy number aberrations in deep sequencing data	Copy number estimation			R	Apache License 2.0
REAL	REad ALigner for Next-Generation sequencing reads		Mapping		C++	GPLv3	Linux
Reaper	Reaper is a program for demultiplexing, trimming and filtering short read sequencing data.	Next Generation Sequencing	Adapter removal Sequencing quality control Sequence contamination filtering Sequence trimming DNA barcoding	Memory efficient and fast.	C	GPLv3	Linux UNIX Mac OS X
... further results

CSV

SEQwiki β

TestPage

SEQwiki ^β