Difference between revisions of "How-to/SNP detection"

From SEQwiki
< How-to
How-toHow-to/SNP detection
Jump to: navigation, search
(Freebayes)
(Is this really a good idea....)
 
(25 intermediate revisions by 6 users not shown)
Line 1: Line 1:
= SNP detection =
+
<big>
 +
<big>
  
= Decision Helper =
+
'''Note''': This content has been ported to the new WikiBook project:
I want to quickly call SNP versus a reference
+
http://en.wikibooks.org/wiki/Next_Generation_Sequencing_(NGS)/DNA_Variants
=>Freebayes, samtools
 
  
 +
Previous contributors should ensure they take proper credit for their work here:
 +
http://en.wikibooks.org/w/index.php?title=Next_Generation_Sequencing_(NGS)/Authors
  
= Software Packages =
+
</big>
 
+
</big>
== Free Software ==
 
 
 
=== Freebayes ===
 
[[Freebayes]] is the scuccessor of Poly- Giga- and BAMBayes and should be much faster than these. Like these it relies on BAM files. It has also been described in some more detail by its developed on [http://biostar.stackexchange.com/questions/613/what-methods-do-you-use-for-in-del-snp-calling Biostar]
 
 
 
* Pros
 
** very easy to run for simple SNP calling
 
 
 
=== GATK ===
 
The Genome Analysis toolkit [[GATK]] allows multiple steps. The authors used their pipeline for variant calling using the NA12878 exome data set and compared their results to those of Crossbow (which uses SOAPsnp). Based on these results they concluded that crossbow had a lower spcecificity.
 
 
 
One easy way to to run GATK and other tools might be to use this [https://github.com/vlandham/variant_pipeline variant pipeline] mentioned on [http://biostar.stackexchange.com/questions/8260/workflow-or-tutorial-for-snp-calling Biostar]
 
 
 
* Pro
 
** Likely relatively specific
 
* Con
 
** relatively complex pipelines
 
 
 
=== MAQ ===
 
[[MAQ]]
 
 
 
=== samtools ===
 
[[samtools]] using the pilepup or mpileup pipeline
 
http://samtools.sourceforge.net/mpileup.shtml
 
 
 
This thread here describes some potential problems that occur when using the BAQ parameter. (In effect it recommends to turn it off, if one uses e.g. BWA that finds indels.
 
http://seqanswers.com/forums/showthread.php?t=11965
 
 
 
 
 
=== ssahaSNP ===
 
[[ssahaSNP]] - ssahaSNP is a polymorphism detection tool. It detects homozygous SNPs and indels by aligning shotgun reads to the finished genome sequence. Highly repetitive elements are filtered out by ignoring those kmer words with high occurrence numbers. More tuned for ABI Sanger reads. Developers are Adam Spargo and Zemin Ning from the Sanger Centre. Compaq Alpha, Linux-64, Linux-32, Solaris and Mac
 
 
 
 
 
 
 
 
 
=== SOAPsnp ===
 
[[SOAPsnp]]
 
 
 
=== SNVMix ===
 
[[SNVMix]]
 
 
 
== Further Reading Material and References ==
 

Latest revision as of 13:53, 13 November 2012

Note: This content has been ported to the new WikiBook project: http://en.wikibooks.org/wiki/Next_Generation_Sequencing_(NGS)/DNA_Variants

Previous contributors should ensure they take proper credit for their work here: http://en.wikibooks.org/w/index.php?title=Next_Generation_Sequencing_(NGS)/Authors