RSAT peak-motifs

Application data
Created by	Jacques van Helden, Morgane Thomas-Chollier, Matthieu Defrance, Carl Herrmann, Denis Thieffry, Olivier Sand
Biological application domain(s)	ChIP-seq, Gene regulation, Epigenomics
Principal bioinformatics method(s)	Sequence motif discovery, Sequence motif recognition, Sequence motif comparison
Technology	Any
Created at	Université Libre de Bruxelles, Université de la Méditerranée, Max Planck Institute for Molecular Genetics, Pasteur, IBENS
Maintained?	Yes
Input format(s)	Fasta
Output format(s)	HTML, text, graphics (png)
Programming language(s)	Perl, CGI, Python, C
Software libraries	None required for the Web interface. Auto installation script for the stand-alone version of RSAT.
Licence	Commercial, Freeware
Operating system(s)	UNIX, Mac OS X, Linux

Summary: A workflow combining a series of time- and memory-efficient motif analysis tools to extract motifs from full-size collections of peaks as generated by ChIP-seq, ChIP-chip or other ChIP-X technologies.


Description

The peak-motifs workflow is integrated in the software suite "Regulatory Sequence Analysis Tools" (RSAT).

It combines:

analysis of sequence composition (peak lengths, global and positional distribution of nucleotide and dinucleotide frequencies);

motif discovery, based on complementary criteria: global over-representation of words (oligo-analysis) and spaced pairs (dyad-analysis), heterogeneity in word positional distributions (position-analysis), detection of local enrichment of words in positional windows of centered peaks (local-words);

motif comparison: discovered motifs are compared with databases of known motifs (JASPAR, RegulonDB) or with user-supplied motifs;

binding site prediction: matching of the discovered motifs against peak sequences;

visualization: predicted sites are exported in BED format, suitable for the UCSC Genome Browser.
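As an illustration of the first step listed above (sequence composition), the following is a minimal sketch in Python. It is not the RSAT implementation: the function names `parse_fasta` and `composition` and the toy peak sequences are assumptions made for this example only. It computes peak lengths and the global nucleotide and dinucleotide frequencies of a Fasta input.

```python
from collections import Counter

def parse_fasta(text):
    """Return a list of (identifier, sequence) pairs from Fasta-formatted text."""
    records, name, chunks = [], None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                records.append((name, "".join(chunks).upper()))
            # Keep the first token of the header line as identifier.
            name, chunks = (line[1:].split() or [""])[0], []
        else:
            chunks.append(line)
    if name is not None:
        records.append((name, "".join(chunks).upper()))
    return records

def composition(sequences, k):
    """Relative frequencies of overlapping k-mers (k=1: nucleotides, k=2: dinucleotides)."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - k + 1):
            word = seq[i:i + k]
            # Skip words containing ambiguous symbols such as N.
            if set(word) <= set("ACGT"):
                counts[word] += 1
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()} if total else {}

# Hypothetical toy input, not real peak data.
example = """>peak1
ACGTACGT
>peak2
AAAA
"""
records = parse_fasta(example)
sequences = [seq for _, seq in records]
lengths = [len(seq) for seq in sequences]
nt_freq = composition(sequences, 1)
di_freq = composition(sequences, 2)
print(lengths, nt_freq, di_freq)
```

The sketch counts words on a single strand and reports global frequencies only; the positional distributions mentioned above would additionally require binning occurrences by their position relative to the peak centre.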

The tool offers two modes of use:

Single set analysis (peak sequences): when a single set of peak sequences is provided, the program discovers motifs that are "intrinsically" over-represented, i.e. more frequent than expected from the oligonucleotide composition of the sequences themselves. Background models of various stringency can be chosen (higher-order Markov models ensure more specificity but reduce sensitivity for small data sets).
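To make the idea of an "intrinsic" background concrete, here is a minimal sketch, assuming a Markov model estimated from the input sequences themselves and a binomial test on the word count. The function names, the toy sequences and the choice of the binomial statistic are assumptions for illustration; the exact statistics used by the RSAT programs are not described on this page and are not reproduced here.

```python
import math
from collections import Counter

def count_words(sequences, k):
    """Count overlapping k-mers on a single strand."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return counts

def markov_word_probability(word, sequences, order):
    """Probability of `word` under a Markov model of the given order,
    estimated from the sequences themselves (requires len(word) > order)."""
    trans = count_words(sequences, order + 1)
    ctx_totals = Counter()
    for w, c in trans.items():
        ctx_totals[w[:order]] += c
    if order > 0:
        prefixes = count_words(sequences, order)
        total = sum(prefixes.values())
        prob = prefixes[word[:order]] / total if total else 0.0
    else:
        prob = 1.0
    for i in range(order, len(word)):
        ctx = word[i - order:i]
        if ctx_totals[ctx] == 0:
            return 0.0
        # P(next letter | preceding `order` letters)
        prob *= trans[ctx + word[i]] / ctx_totals[ctx]
    return prob

def binomial_upper_tail(n, p, observed):
    """P(X >= observed) for X ~ Binomial(n, p), summed in log space."""
    if observed <= 0:
        return 1.0
    if p <= 0.0:
        return 0.0
    if p >= 1.0:
        return 1.0
    total = 0.0
    for k in range(observed, n + 1):
        log_term = (math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
                    + k * math.log(p) + (n - k) * math.log(1 - p))
        total += math.exp(log_term)
    return min(total, 1.0)

# Hypothetical toy peaks, not real data.
peaks = ["ACGTGACGTGTT", "TTACGTGAAACC", "GGACGTGCCTTA"]
word = "ACGTG"
observed = count_words(peaks, len(word))[word]
positions = sum(max(len(s) - len(word) + 1, 0) for s in peaks)
for order in (0, 1, 2):
    p = markov_word_probability(word, peaks, order)
    print(order, observed, round(p * positions, 3), binomial_upper_tail(positions, p, observed))
```

With a higher order, the model absorbs more of the local composition, so the expected count of a word rises towards its observed count. This is one way to see why higher-order models are more specific but lose sensitivity on small data sets.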

Differential analysis (test versus control): when two sets of peak sequences are provided, the program discovers motifs that are significantly more frequent in the test than in the control set.
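The test-versus-control comparison can be sketched in the same spirit. The example below is an assumption-laden illustration, not the RSAT method: it pools word positions from both sets and uses a hypergeometric upper tail to ask whether a word occurs in the test set more often than its overall frequency would suggest. All function names and sequences are hypothetical.

```python
import math

def count_word(sequences, word):
    """Number of overlapping occurrences of `word` on a single strand."""
    k = len(word)
    return sum(1 for seq in sequences
               for i in range(len(seq) - k + 1) if seq[i:i + k] == word)

def n_positions(sequences, k):
    """Number of positions where a k-mer can start."""
    return sum(max(len(seq) - k + 1, 0) for seq in sequences)

def hypergeom_upper_tail(N, K, n, x):
    """P(X >= x) when drawing n positions out of N, K of which carry the word."""
    denom = math.comb(N, n)
    upper = min(K, n)
    # math.comb returns 0 when the second argument exceeds the first,
    # so impossible configurations contribute nothing to the sum.
    return sum(math.comb(K, i) * math.comb(N - K, n - i)
               for i in range(x, upper + 1)) / denom

def differential_enrichment(word, test_seqs, control_seqs):
    """Return (test count, control count, p-value) for one word."""
    k = len(word)
    x = count_word(test_seqs, word)
    c = count_word(control_seqs, word)
    n = n_positions(test_seqs, k)
    N = n + n_positions(control_seqs, k)
    return x, c, hypergeom_upper_tail(N, x + c, n, x)

# Hypothetical toy sets, not real peak data.
test_set = ["ACGTGACGTG", "TTACGTGAAA"]
control_set = ["TTTTAAAACC", "GGCCTTAAGG"]
result = differential_enrichment("ACGTG", test_set, control_set)
print(result)
```

Unlike the single-set mode, no background model has to be chosen here: the control set plays the role of the background.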

The workflow is implemented in Perl and the Web interface in CGI; the motif analysis algorithms are written in a combination of Perl, Python and C.

Links

RSAT peak-motifs Analysis server


References

. 2011. Nucleic Acids Research
