Open main menu

SEQwiki β

Changes

DESeq

677 bytes added, 10:18, 11 January 2016
m
Text replace - "ChIP-Seq" to "ChIP-seq"
{{Bioinformatics application
|sw summary=DESeq is an R package to analyse count data from high-throughput sequencing assays such as RNA-Seq and test for differential expression. The latest version is DESeq2 (released April 2013).|bio domain=RNA-Seqquantification, ChIP-seq|bio method=statistical testing, Sequencing quality control,|created by=Anders S|created at=European Molecular Biology Laboratory|maintained=Yes|email address=sanders@fs.tum.de|input format=table with count data|output format=table
|language=R
|licence=GPLv3,
|os=UNIX, Windows, Mac OS X
}}
ESeq DESeq uses a model based on the negative binomial distribution and offers, in brief, the following features: 
Count data is discrete and skewed and is hence not well approximated by a normal distribution. Thus, a test based on the negative binomial distribution, which can reflect these properties, has much higher power to detect differential expression.
 Tests for differential expression between two experimental conditions should take into account both technical and biological variability. Recently, several authors have claimed that the Poisson distribution can be used for this purpose. However, tests based on the Poisson assumption (this includes the binomial test and the chi-squared test) ignore the biological sampling variance, leading to incorrectly optimistic p values. The negative binomial distribution is a generalisation of the Poisson model that allows to model biological variance correctly. In the former two points, DESeq is similar to earlier tools, especially to edgeR. One of the new features of DESeq is the ability to estimate the variance in a local fashion, using different coefficients of variation for different expression strengths. This removes potential selection biases in the hit list of differentially expressed genes, and gives a more balanced and accurate result. 
DESeq's applicability is not limited to RNA-Seq. Rather, it may be used for many kinds of count data derived from high-throughput experiments.
 
Beside from the differential testing functionality, DESeq offers two transformations for stabilizing the variance of count data: the Variance Stabilizing Transformation (VST), and the regularized logarithm (rlog). These can be used for visualization and data exploration, such as for calculating sample-sample distances.
{{Links}}
{{References}}
{{Link box}}