RSAT peak-motifs

From SEQwiki
Revision as of 10:33, 11 January 2016 by Krdav (talk | contribs) (Text replace - "ChIP-Seq" to "ChIP-seq")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
RSAT peak-motifsRSAT peak-motifs/URL 0
Jump to: navigation, search

Application data

Created by Jacques van Helden, Morgane Thomas-Chollier, Matthieu Defrance, Carl Herrmann, Denis Thieffry, Olivier Sand
Biological application domain(s) ChIP-seq, Gene regulation, Epigenomics
Principal bioinformatics method(s) Sequence motif discovery, Sequence motif recognition, Sequence motif comparison
Technology Any
Created at Université Libre de Bruxelles, Université de la Méditerrannée, Max Planck Institute for Molecular Genetics, Pasteur, IBENS
Maintained? Yes
Input format(s) Fasta
Output format(s) HTML, text, graphics (png)
Programming language(s) Perl, CGI, Python, C
Software libraries None required for the Web interface. Auto installation script for the stand-alone version of RSAT.
Licence Commercial, Freeware
Operating system(s) UNIX, Mac OS X, Linux

Summary: A workflow combining a series of time- and memory-efficient motif analysis tools to extract motifs from full-size collections of peaks as generated by ChIP-seq, ChIP-chip or other ChIP-X technologies.

"Error: no local variable "counter" was set." is not a number.

Description

The peak-motif workflow is integrated in the software suite "Regulatory Sequence Analysis Tools" (RSAT).

It combines:

  • analysis of sequence composition (peak lengths, global and positional distribution of nucleotide and dinucleotide frequencies);
  • motif discovery, based on complementary criteria: global over-representation of words (oligo-analysis) and spaced pairs (dyad-analysis), heterogeneity in word positional distributions (position-analysis), detection of local enrichment of words in positional windows of centered peaks (local-words);
  • motif comparison: discovered motifs are compared with databases of known motifs (JASPAR, RegulonDB) or with user-supplied motifs;
  • binding site prediction: matching of the discovered motifs against peak sequences;
  • visualization: predicted sites in bed format, suitable for the UCSC genome browser;


The tool presents two modes of utilization:

  • Single set analysis (peak sequences): when a single set of peak sequences is provided, the program discovers motifs that are “intrinsically" over-represented, i.e. more frequent than what can be expected from the sequence composition in olionucleotides. Background models of various stringency can be chosen (higher order Markov models ensure more specificity but reduce sensitivity for small data sets).
  • Differential analysis (test versus control): when two sets of peak sequences are provided, the program discovers motifs that are significantly more frequent in the test than in the control set.


The workflow is implemented in Perl, the Web interface in CGI, motif analysis algorithms in combinations of Perl, python and C.


Links


References

  1. . 2011. Nucleic Acids Research


To add a reference for RSAT peak-motifs, enter the PubMed ID in the field below and click 'Add'.

 


Search for "RSAT peak-motifs" in the SEQanswers forum / BioStar or:

Web Search Wiki Sites Scientific