Rsolid

From SEQwiki
Jump to: navigation, search

Application data

Principal bioinformatics method(s) Base-calling
Technology ABI SOLiD
Maintained? Maybe
Input format(s) spch
Output format(s) csfasta
Programming language(s) R, C

Summary: Rsolid implements a version of the quantile normalization algorithm that transforms the intensity values before calling colors

"Error: no local variable "counter" was set." is not a number.

Rsolid is an R package for normalizing fluorescent intensity data from ABI/SOLiD second generation sequencing platform. It has been observed that the color-calls provided by factory software contain technical artifacts, where the proportions of colors called are extremely variable across sequencing cycles. Under the random DNA fragmentation assumption, these proportions should be equal across sequencing cycles and proportional to the dinucleotide frequencies of the sample. Rsolid implements a version of the quantile normalization algorithm that transforms the intensity values before calling colors. Results show that after normalization, the total number of mappable reads increases by around 5%, and number of perfectly mapped reads increases by 10%. Moreover a 2-5% reduction in overall error rates is observed, with a 2-6% reduction in the rate of valid adjacent color mis-matches. The latter is important, since it leads to a decrease in false-positive SNP calls.

The normalization algorithm is computationally efficient. In a test we are able to process 300 million reads in 2 hours using 10 computer cluster nodes. The engine functions of the package are written in C for better performance.

Links


References

  1. . 2010. Nature Methods


To add a reference for Rsolid, enter the PubMed ID in the field below and click 'Add'.

 


Search for "Rsolid" in the SEQanswers forum / BioStar or:

Web Search Wiki Sites Scientific