Difference between revisions of "Kmergenie"

From SEQwiki
Jump to: navigation, search
m (Text replace - "Genomic Assembly" to "Sequence assembly")
 
(4 intermediate revisions by 3 users not shown)
Line 1: Line 1:
 
{{Bioinformatics application
 
{{Bioinformatics application
|sw summary=KmerGenie estimates the best k-mer length for genome de novo assembly. Given a set of reads, KmerGenie first computes the k-mer abundance histogram for many values of k. Then, for each value of k, it predicts the number of distinct genomic k-mers in the da
+
|sw summary=KmerGenie estimates the best k-mer length for genome de novo assembly. Given a set of reads, KmerGenie first computes the k-mer abundance histogram for many values of k. Then, for each value of k, it predicts the number of distinct genomic k-mers in the dataset, and returns the k-mer length which maximizes this number. Experiments show that KmerGenie's choices lead to assemblies that are close to the best possible over all k-mer lengths.
|bio domain=Genomic Assembly,
+
|bio domain=Sequence assembly,
|bio method=Assembly
+
|bio method=Sequence assembly
 
|bio tech=Illumina
 
|bio tech=Illumina
|created by=R. Chikhi, P. Medveedv
+
|created by=R. Chikhi, P. Medvedev
 
|created at=Penn State University
 
|created at=Penn State University
 
|maintained=Yes
 
|maintained=Yes
 
|input format=Fasta/Fastq
 
|input format=Fasta/Fastq
 
|output format=HTML report
 
|output format=HTML report
|language=C++/Python/R
+
|language=C++, Python, R
 
}}
 
}}
 
== Description ==
 
== Description ==

Latest revision as of 17:18, 19 December 2015

Application data

Created by R. Chikhi, P. Medvedev
Biological application domain(s) Sequence assembly
Principal bioinformatics method(s) Sequence assembly
Technology Illumina
Created at Penn State University
Maintained? Yes
Input format(s) Fasta/Fastq
Output format(s) HTML report
Programming language(s) C++, Python, R

Summary: KmerGenie estimates the best k-mer length for genome de novo assembly. Given a set of reads, KmerGenie first computes the k-mer abundance histogram for many values of k. Then, for each value of k, it predicts the number of distinct genomic k-mers in the dataset, and returns the k-mer length which maximizes this number. Experiments show that KmerGenie's choices lead to assemblies that are close to the best possible over all k-mer lengths.

"Error: no local variable "counter" was set." is not a number.

Description

Links


References

none specified


To add a reference for Kmergenie, enter the PubMed ID in the field below and click 'Add'.

 


Search for "Kmergenie" in the SEQanswers forum / BioStar or:

Web Search Wiki Sites Scientific