Hardison Lab Datasets and Supplements
e-mail address: rch8@psu.edu
Current Directory Entry
My laboratory is generating various datasets useful in comparative genomics and in erythroid gene regulation. Although every effort is made to deposit these in appropriate databases, sometimes it is most convenient just to provide the data files here.
If you use these datasets, please cite the listed reference in any publications.
Mapping DNA segments occupied by GATA1 and the response in gene expression throughout the the mouse erythroid genome; Distinguishing induction from repression
- Yong Cheng, Weisheng Wu, Swathi Ashok Kumar, Duonan Yu, Wulan Deng, Tamara Tripic, David C. King, Kuan-Bei Chen, Ying Zhang, Daniela Drautz, Belinda Giardine, Stephan C. Schuster, Webb Miller, Francesca Chiaromonte, Yu Zhang, Gerd A. Blobel, Mitchell J. Weiss, and Ross C. Hardison (2009) Erythroid GATA1 function revealed by genome-wide analysis of transcription factor occupancy, histone modifications and mRNA expression. Genome Research, in press. Preprint.
- Supplementary Material.
- To view the data in a browser (PSU customization of UCSC Genome Browser) and to download the data from a "Table Browser" go to the mouse genome section of this browser.
Identification of DNA segments occupied by GATA1 on mouse chromosome 7, and demonstration that binding site motifs in enhancers are subject to evolutionary constraint
- Cheng Y, King DC, Dore LC, Zhang X, Zhou Y, Zhang Y, Dorman C, Abebe D, Kumar SA, Chiaromonte F, Miller W, Green RD, Weiss MJ, and Hardison RC (2008) Transcriptional enhancement by GATA1-occupied DNA segments is strongly associated with evolutionary constraint on the binding site motif. Genome Research 18: 1896-1905 Full text of publication.
- For datasets used in this paper, including occupancy data, enhancement, and phylogenetic conservation of motifs (cladistic motifs), click here.
Predictions of erythroid cis-regulatory modules and results of experimental tests
- Hao Wang, Ying Zhang, Yong Cheng, Yuepin Zhou, David C. King, James Taylor, Francesca Chiaromonte, Jyotsna Kasturi, Hanna Petrykowska, Bryan Gibb, Christine Dorman, Webb Miller, Louis C. Dore, John Welch, Mitchell J. Weiss, Ross C. Hardison (2006) Experimental Validation of Predicted Mammalian Erythroid Cis-Regulatory Modules. Genome Res. 16 : 1480-1492 Full text of publication.
- Predicted erythroid cis-regulatory modules in the mouse and human genomes, plus the intervals tested in Wang et al. 2006. Click here.
- Results for all the 99 tested fragments, pdf file. Click here.
- Results for all the 99 tested fragments, Excel file. Click here.
- DNA sequences of erythroid predicted cis-regulatory modules (preCRMs) in eight selected mouse loci, all tested for function. Click here.
Predictions of cis-regulatory modules genome-wide
- James Taylor, Svitlana Tyekucheva, David C. King, Ross C. Hardison, Webb Miller, and Francesca Chiaromonte (2006) ESPERR: Learning strong and weak signals in genomic sequence alignments to identify functional elements. Genome Res. 16 : 1596-1604. Full text of publication.
- Predicted cis-regulatory modules in the human genome (hg17, May 2004 assembly), from the Taylor et al. work. These have an RP score of at least 0.05 for at least 200 bp and do not overlap exons in the Known Genes set. These can be downloaded in BED format here.
- Blanchette M, Bataille AR, Chen X, Poitras C, Laganiere J, Lefebvre C, Deblois G, Giguere V, Ferretti V, Bergeron D, Coulombe B, Robert F. (2006) Genome-wide computational prediction of transcriptional regulatory modules reveals new insights into human gene expression. Genome Res. 16 : 656-668 publication and website . Our lab had nothing to do with this work, but it is a really good source of CRMs predicted by conservation of binding sites.
- Here are the PReMods from Blanchette et al., for the human genome (hg17, May 2004 assembly) in BED format. Click here.
- Intersection of the High RP segments and the PReMods for the human genome (hg17, May 2004 assembly) in BED format. I call them PRPs. Click here.
Reference sets of known CRMs in the human beta-globin gene complex
- King, David C., James Taylor, Laura Elnitski, Francesca Chiaromonte, Webb Miller, and Ross C. Hardison (2005) Evaluation of regulatory potential and conservation scores for detecting cis-regulatory modules in aligned mammalian genome sequence. Genome Res. 15 : 1051-1060. Reprint
- Known cis-regulatory modules in the human HBB gene complex, hg16 coordinates, custom track for UCSC Genome Browser. Click here.
- Known cis-regulatory modules in the human HBB gene complex, hg17 coordinates, custom track for UCSC Genome Browser. Click here.
- Known cis-regulatory modules in the human HBB gene complex, hg17 coordinates, file in bed format for use in Galaxy. Click here.
Microarray expression results on MEL_RL5 cells induced to mature to hemoglobinized erythroblasts with HMBA.
- Almost 7000 mouse genes, represented by 70-mers (from Qiagen/Oligo) spotted at the PSU/Huck Institute Microarray Facility, were hybridized with labeled cDNA from mRNA at progressive stages of induction, days 0-6. The Excel spreadsheets of the results can be obtained by clicking here.
- Citation for the microarray results is Hao Wang, Ying Zhang, Yong Cheng, Yuepin Zhou, David C. King, James Taylor, Francesca Chiaromonte, Jyotsna Kasturi, Hanna Petrykowska, Bryan Gibb, Christine Dorman, Webb Miller, Louis C. Dore, John Welch, Mitchell J. Weiss, Ross C. Hardison (2006) Experimental Validation of Predicted Mammalian Erythroid Cis-Regulatory Modules. Genome Res. 16 : 1480-1492. Full text of publication.
Conservation and constraint in ENCODE predicted transcriptional regulatory regions
- David C. King, James Taylor, Ying Zhang, Yong Cheng, Heather A. Lawson, Joel Martin, ENCODE groups for Transcriptional Regulation and Multispecies Alignment, Francesca Chiaromonte, Webb Miller, and Ross C. Hardison (2007) Finding cis-regulatory elements using comparative genomics: some lessons from ENCODE data. Genome Research 17 : 775-786. Full text.
- Predicted transcriptional regulatory regions (pTRRs) in ENCODE regions, which are high-probability hits to sites occupied by sequence specific binding proteins supported by data on chromatin alterations or histone modifications. This is a composite set based on ENCODE data as filtered and analyzed in the King, Taylor et al. 2007 paper. The coordinates are for hg17, the May 2004 assembly of the human genome. Click here.
- DNase hypersensitive sites and promoters from the ENCODE transcriptional regulation group. The coordinates are for hg17, the May 2004 assembly of the human genome. Click here.
Co-variation in frequencies of substitution, deletion, transposition and recombination during eutherian evolutions
- Hardison, R.C., K.M. Roskin, S. Yang, M. Diekhans, W.J. Kent, R. Weber, L. Elnitski, J. Li, M. O'Connor, D. Kolbe, S. Schwartz, T.S. Furey, S. Whelan, N. Goldman, A. Smit, W. Miller, F. Chiaromonte and D. Haussler (2003) Co-variation in frequencies of substitution, deletion, transposition and recombination during eutherian evolution. Genome Res. 13 : 13-26. Full text.
- Description of columns in datafiles Click here.
- Various measures of sequence similarity (compared to mouse) and genomic properties were computed in windows across the human genome. The data files for 1 Mb non-overlapping windows are here and the ones for 5 Mb windows with a 1Mb slide (overlapping by 4Mb) are here.
Presentations on using vertebrate genome comparisons to predict and test cis-regulatory modules
- Cold Spring Harbor Biology of Genomes Meeting, May 2005. Click here.
- Seminar at University of Nebraska, Lincoln, February 2007. Click here.
Page created: Thursday, 13-Jan-2005, updated 07-Oct-2009