SNPsea

From SEQwiki
Jump to: navigation, search

Application data

Created by Kamil Slowikowski, Xinli Hu, Soumya Raychaudhuri
Biological application domain(s) Functional genomics, SNP detection, Gene expression analysis
Principal bioinformatics method(s) Enrichment
Created at Harvard University
Maintained? Yes
Programming language(s) C++, R, Python
Licence GPLv3
Operating system(s)
  • NIX
Contact: slowikow@broadinstitute.org

Summary: SNPsea is an algorithm to identify cell types and pathways likely to be affected by risk loci.

"Error: no local variable "counter" was set." is not a number.

Description

SNPsea requires a list of SNP identifiers and a matrix of genes and conditions.

Genome-wide association studies (GWAS) have discovered multiple genomic loci associated with risk for different types of disease. SNPsea provides a simple way to determine the types of cells influenced by genes in these risk loci.

Suppose disease-associated alleles influence a small number of pathogenic cell types. We hypothesize that genes with critical functions in those cell types are likely to be within risk loci for that disease. We assume that a gene’s specificity to a cell type is a reasonable indicator of its importance to the unique function of that cell type.

First, we identify the genes in linkage disequilibrium (LD) with the given trait-associated SNPs and score the gene set for specificity to each cell type. Next, we define a null distribution of scores for each cell type by sampling random SNP sets matched on the number of linked genes. Finally, we evaluate the significance of the original gene set’s specificity by comparison to the null distributions: we calculate an exact permutation p-value.

SNPsea is a general algorithm. You may provide your own:

  1. Continuous gene matrix with gene expression profiles (or other values).
  2. Binary gene annotation matrix with presence/absence 1/0 values.

We provide you with three expression matrices and one annotation matrix. See the Data section of the Manual.

The columns of the matrix may be tissues, cell types, GO annotation codes, or other conditions. Continuous matrices must be normalized before running SNPsea: columns must be directly comparable to each other.


Links


References

  1. . 2014. Bioinformatics (Oxford)


To add a reference for SNPsea, enter the PubMed ID in the field below and click 'Add'.

 


Search for "SNPsea" in the SEQanswers forum / BioStar or:

Web Search Wiki Sites Scientific