Difference between revisions of "RSAT peak-motifs"

From SEQwiki
RSAT peak-motifsRSAT peak-motifs/URL 0
Jump to: navigation, search
Line 9: Line 9:
 
|input format=Fasta
 
|input format=Fasta
 
|output format=HTML, text, graphics (png)
 
|output format=HTML, text, graphics (png)
|language=workflow in Perl, Web interface in CGI, motif analysis algorithms in Perl + python + C
+
|language=Perl, CGI, Python, C,
 
|library=None required for the Web interface. Auto installation script for the stand-alone version of RSAT.
 
|library=None required for the Web interface. Auto installation script for the stand-alone version of RSAT.
 
|licence=freeware license for non-commercial and non-military utilization
 
|licence=freeware license for non-commercial and non-military utilization
Line 40: Line 40:
  
  
 
+
The workflow is implemented in Perl, the Web interface in CGI, motif analysis algorithms in combinations of Perl, python and C.
  
 
<!-- -->
 
<!-- -->

Revision as of 12:17, 22 April 2012

Application data

Created by Jacques van Helden, Morgane Thomas-Chollier, Matthieu Defrance, Carl Herrmann, Denis Thieffry, Olivier Sand
Biological application domain(s) ChIP-seq, regulatory genomics, epigenomics
Principal bioinformatics method(s) motif discovery, motif scanning, motif comparison
Technology any
Created at Université Libre de Bruxelles, Université de la Méditerrannée, Max Planck Institute for Molecular Genetics, Pasteur, IBENS
Maintained? Yes
Input format(s) Fasta
Output format(s) HTML, text, graphics (png)
Programming language(s) Perl, CGI, Python, C
Software libraries None required for the Web interface. Auto installation script for the stand-alone version of RSAT.
Licence freeware license for non-commercial and non-military utilization
Operating system(s) UNIX, Mac OS X, Linux

Summary: A workflow combining a series of time- and memory-efficient motif analysis tools to extract motifs from full-size collections of peaks as generated by ChIP-seq, ChIP-chip or other ChIP-X technologies.

"Error: no local variable "counter" was set." is not a number.

Description

The peak-motif workflow is integrated in the software suite "Regulatory Sequence Analysis Tools" (RSAT).

It combines:

  • analysis of sequence composition (peak lengths, global and positional distribution of nucleotide and dinucleotide frequencies);
  • motif discovery, based on complementary criteria: global over-representation of words (oligo-analysis) and spaced pairs (dyad-analysis), heterogeneity in word positional distributions (position-analysis), detection of local enrichment of words in positional windows of centered peaks (local-words);
  • motif comparison: discovered motifs are compared with databases of known motifs (JASPAR, RegulonDB) or with user-supplied motifs;
  • binding site prediction: matching of the discovered motifs against peak sequences;
  • visualization: predicted sites in bed format, suitable for the UCSC genome browser;


The tool presents two modes of utilization:

  • Single set analysis (peak sequences): when a single set of peak sequences is provided, the program discovers motifs that are “intrinsically" over-represented, i.e. more frequent than what can be expected from the sequence composition in olionucleotides. Background models of various stringency can be chosen (higher order Markov models ensure more specificity but reduce sensitivity for small data sets).
  • Differential analysis (test versus control): when two sets of peak sequences are provided, the program discovers motifs that are significantly more frequent in the test than in the control set.


The workflow is implemented in Perl, the Web interface in CGI, motif analysis algorithms in combinations of Perl, python and C.


Links


References

  1. . 2011. Nucleic Acids Research


To add a reference for RSAT peak-motifs, enter the PubMed ID in the field below and click 'Add'.

 


Search for "RSAT peak-motifs" in the SEQanswers forum / BioStar or:

Web Search Wiki Sites Scientific