You do not have permission to edit this page, for the following reason:
The action you have requested is limited to users in the group: Users.
Short description:
Please summarise the application in a few sentences. Avoid links here. bisulfite-treated Read FastQ Simulator
Software version:
Biological application domain(s) (Phylogenetics, Genomics, ...):
Genomics, Sequencing, DNA methylation, Epigenetics
Principal bioinformatics method(s) (Assembly, Mapping, ...):
Modelling and simulation,
Technology (Sanger, Illumina, 454, SOLiD, Ion Torrent, ...):
Illumina, ABI SOLiD,
Interface (Command line, Web UI, Desktop GUI, SOAP WS, HTTP WS, API, QL):
Resource type (Command-line tool, Web application, Desktop application, Script, Suite, Workbench, Database portal, Workflow, Plug-in, Library, Web API, Web service, SPARQL endpoint):
Babraham Bioinformatics, The Babraham Institute,
== Description == <!-- Describe the application in the space below --> Sherman can simulate ungapped high-throughput datasets for bisulfite sequencing (BS-Seq) or standard experiments. It allows the user to introduce various 'contaminants' into the sequences, such as basecall errors, SNPs, adapter fragments etc., in order to evaluate the influence of common problems observed in many Next-Gen Sequencing experiments. Sherman can be obtained from http://www.bioinformatics.babraham.ac.uk/projects/sherman/. Main features: - Generate any number of sequences of any length - Generate either completely random sequences or use genomic sequences (genome can be specified) - Generates single-end or paired-end data - Adjustable bisulfite conversion rate from 0-100% for either all cytosines or cytosines in CH and CG context individually - Generate directional or non-directional libraries - Generate sequences in base-space or SOLiD color-space format - Adjustable default Phred quality score (Sanger encoding, Phred+33 format) - Sequences can have constant Phred qualities throughout the read or can have quality scores following an exponential decay curve, which will eventually result in basecall errors (note that this is handled slightly different for base- and color-space data) - Introduce a variable number of random SNPs into each read - Introduce a fixed amount of adapter sequence at the 3' end of all sequences - Introduce a variable amount of adapter sequence at various positions at the 3' end of reads <!-- -->
Once you save the form, you will have the chance to add links and references.
Summary of edit
This is a minor edit Watch this page
Cancel