Complete data for ''DNA motifs in human and mouse proximal promoters predict tissue specific expression''


Tissue differential promoter sets - this uncompresses to a directory called 'Promoters', which contains 2 FASTA files for each tissue. For example, Liver.Mm.fg and Liver.Mm.bg have the foreground and background mouse liver promoters (mouse liver positive and negative sets) used in the study. [promoters.tgz] (tarred and gzipped archives can be opened with winzip)

Transcript lists corresponding to predictor calls - this uncompresses to a directory called 'TranscriptLists', which contains 4 transcript lists per tissue. For example, LymphNode.Hs.FN, LymphNode.Hs.FP, LymphNode.Hs.TN, and LymphNode.Hs.TP have the transcripts that were called false negative, false positive, true negative, and true positive by the human lymph node predictive model. [transcriptLists.tgz] (tarred and gzipped archives can be opened with winzip)

Top TRANSFAC single motifs - described in Supplementary Section 1.3; see Table 3 or Table 4 for a complete legend. [transfac.pdf]

Top TRANSFAC distinct motif pairs - described in Supplementary Section 1.4; see Table 5 or Table 6 for a complete legend. [transfacPairs.pdf]

Top single motifs - described in Supplementary Section 1.5. These are the single motifs with the top predictive ability. Here we include both TRANSFAC and computationally de novo identified (novel) motifs. [motifs.pdf]

Top distinct motif pairs - described in Supplementary Section 1.6. These pairs are composed of TRANSFAC and computationally de novo identified (novel) motifs. [motifPairs.pdf]

MARS Predictive models - described in Supplementary Section 1.7; see Table 7 and Fig. 5 for an example. [predictors.pdf]