PhD, University of Washington (2017)
MD, University of Washington School of Medicine (2019)
ERROR! No headcode.htm file found.
Our understanding of the beads-on-a-string arrangement of nucleosomes has been built largely on high-resolution sequence-agnostic imaging methods and sequence-resolved bulk biochemical techniques. To bridge the divide between these approaches, we present the single-molecule adenine methylated oligonucleosome sequencing assay (SAMOSA). SAMOSA is a high-throughput single-molecule sequencing method that combines adenine methyltransferase footprinting and single-molecule real-time DNA sequencing to natively and nondestructively measure nucleosome positions on individual chromatin fibres. SAMOSA data allows unbiased classification of single-molecular 'states' of nucleosome occupancy on individual chromatin fibres. We leverage this to estimate nucleosome regularity and spacing on single chromatin fibres genome-wide, at predicted transcription factor binding motifs, and across both active and silent human epigenomic domains. Our analyses suggest that chromatin is comprised of a diverse array of both regular and irregular single-molecular oligonucleosome patterns that differ subtly in their relative abundance across epigenomic domains. This irregularity is particularly striking in constitutive heterochromatin, which has typically been viewed as a conformationally static entity. Our proof-of-concept study provides a powerful new methodology for studying nucleosome organization at a previously intractable resolution, and offers up new avenues for modeling and visualizing higher-order chromatin structure.
View details for DOI 10.7554/eLife.59404
View details for PubMedID 33263279
Animal and plant centromeres are embedded in repetitive "satellite" DNA, but are thought to be epigenetically specified. To define genetic characteristics of centromeres, we surveyed satellite DNA from diverse eukaryotes and identified variation in <10-bp dyad symmetries predicted to adopt non-B-form conformations. Organisms lacking centromeric dyad symmetries had binding sites for sequence-specific DNA-binding proteins with DNA-bending activity. For example, human and mouse centromeres are depleted for dyad symmetries, but are enriched for non-B-form DNA and are associated with binding sites for the conserved DNA-binding protein CENP-B, which is required for artificial centromere function but is paradoxically nonessential. We also detected dyad symmetries and predicted non-B-form DNA structures at neocentromeres, which form at ectopic loci. We propose that centromeres form at non-B-form DNA because of dyad symmetries or are strengthened by sequence-specific DNA binding proteins. This may resolve the CENP-B paradox and provide a general basis for centromere specification.
View details for DOI 10.1093/molbev/msy010
View details for PubMedID 29365169
View details for PubMedCentralID PMC5889037
Centromeres are the chromosomal sites of assembly for kinetochores, the protein complexes that attach to spindle fibers and mediate separation of chromosomes to daughter cells in mitosis and meiosis. In most multicellular organisms, centromeres comprise a single specific family of tandem repeats-often 100-400 bp in length-found on every chromosome, typically in one location within heterochromatin. Drosophila melanogaster is unusual in that the heterochromatin contains many families of mostly short (5-12 bp) tandem repeats, none of which appear to be present at all centromeres, and none of which are found only at centromeres. Although centromere sequences from a minichromosome have been identified and candidate centromere sequences have been proposed, the DNA sequences at native Drosophila centromeres remain unknown. Here we use native chromatin immunoprecipitation to identify the centromeric sequences bound by the foundational kinetochore protein cenH3, known in vertebrates as CENP-A. In D. melanogaster, these sequences include a few families of 5- and 10-bp repeats; but in closely related D. simulans, the centromeres comprise more complex repeats. The results suggest that a recent expansion of short repeats has replaced more complex centromeric repeats in D. melanogaster.
View details for DOI 10.1534/genetics.117.300620
View details for PubMedID 29305387
View details for PubMedCentralID PMC5844345
Centromeres were familiar to cell biologists in the late 19th century, but for most eukaryotes the basis for centromere specification has remained enigmatic. Much attention has been focused on the cenH3 (CENP-A) histone variant, which forms the foundation of the centromere. To investigate the DNA sequence requirements for centromere specification, we applied a variety of epigenomic approaches, which have revealed surprising diversity in centromeric chromatin properties. Whereas each point centromere of budding yeast is occupied by a single precisely positioned tetrameric nucleosome with one cenH3 molecule, the "regional" centromeres of fission yeast contain unphased presumably octameric nucleosomes with two cenH3s. In Caenorhabditis elegans, kinetochores assemble all along the chromosome at sites of cenH3 nucleosomes that resemble budding yeast point centromeres, whereas holocentric insects lack cenH3 entirely. The "satellite" centromeres of most animals and plants consist of cenH3-containing particles that are precisely positioned over homogeneous tandem repeats, but in humans, different ?-satellite subfamilies are occupied by CENP-A nucleosomes with very different conformations. We suggest that this extraordinary evolutionary diversity of centromeric chromatin architectures can be understood in terms of the simplicity of the task of equal chromosome segregation that is continually subverted by selfish DNA sequences.
View details for DOI 10.1101/sqb.2017.82.033605
View details for PubMedID 29196559
Chromatin endogenous cleavage (ChEC) uses fusion of a protein of interest to micrococcal nuclease (MNase) to target calcium-dependent cleavage to specific genomic loci in vivo. Here we report the combination of ChEC with high-throughput sequencing (ChEC-seq) to map budding yeast transcription factor (TF) binding. Temporal analysis of ChEC-seq data reveals two classes of sites for TFs, one displaying rapid cleavage at sites with robust consensus motifs and the second showing slow cleavage at largely unique sites with low-scoring motifs. Sites with high-scoring motifs also display asymmetric cleavage, indicating that ChEC-seq provides information on the directionality of TF-DNA interactions. Strikingly, similar DNA shape patterns are observed regardless of motif strength, indicating that the kinetics of ChEC-seq discriminates DNA recognition through sequence and/or shape. We propose that time-resolved ChEC-seq detects both high-affinity interactions of TFs with consensus motifs and sites preferentially sampled by TFs during diffusion and sliding.
View details for DOI 10.1038/ncomms9733
View details for PubMedID 26490019
View details for PubMedCentralID PMC4618392
Occupied Regions of Genomes from Affinity-purified Naturally Isolated Chromatin (ORGANIC) is a high-resolution method that can be used to quantitatively map protein-DNA interactions with high specificity and sensitivity. This method uses micrococcal nuclease (MNase) digestion of chromatin and low-salt solubilization to preserve protein-DNA complexes, followed by immunoprecipitation and paired-end sequencing for genome-wide mapping of binding sites. In this unit, we describe methods for isolation of nuclei and MNase digestion of unfixed chromatin, immunoprecipitation of protein-DNA complexes, and high-throughput sequencing to map sites of bound factors.
View details for DOI 10.1002/0471142727.mb2131s110
View details for PubMedID 25827087
View details for PubMedCentralID PMC4410783
Genomic selection (GS) approaches, in combination with reproductive technologies, are revolutionizing the design and implementation of breeding programs in livestock species, particularly in cattle. GS leverages genomic readouts to provide estimates of breeding value early in the life of animals. However, the capacity of these approaches for improving genetic gain in breeding programs is limited by generation interval, the average age of an animal when replacement progeny are born. Here, we present a cost-effective approach that combines GS with reproductive technologies to reduce generation interval by rapidly producing high genetic merit calves.
View details for DOI 10.1038/srep08674
View details for PubMedID 25728468
View details for PubMedCentralID PMC4345332
The intractability of homogeneous ?-satellite arrays has impeded understanding of human centromeres. Artificial centromeres are produced from higher-order repeats (HORs) present at centromere edges, although the exact sequences and chromatin conformations of centromere cores remain unknown. We use high-resolution chromatin immunoprecipitation (ChIP) of centromere components followed by clustering of sequence data as an unbiased approach to identify functional centromere sequences. We find that specific dimeric ?-satellite units shared by multiple individuals dominate functional human centromeres. We identify two recently homogenized ?-satellite dimers that are occupied by precisely positioned CENP-A (cenH3) nucleosomes with two ~100-base pair (bp) DNA wraps in tandem separated by a CENP-B/CENP-C-containing linker, whereas pericentromeric HORs show diffuse positioning. Precise positioning is largely maintained, whereas abundance decreases exponentially with divergence, which suggests that young ?-satellite dimers with paired ~100-bp particles mediate evolution of functional human centromeres. Our unbiased strategy for identifying functional centromeric sequences should be generally applicable to tandem repeat arrays that dominate the centromeres of most eukaryotes.
View details for DOI 10.1126/sciadv.1400234
View details for PubMedID 25927077
View details for PubMedCentralID PMC4410388
View details for Web of Science ID 000346366000014
In this issue of Cancer Cell, Yang et al. describe a causal relationship between gene body methylation and gene expression and a role for genic methylation in response to clinical DNA methylation inhibitors, which suggests that the mechanism of action of these inhibitors includes gene body hypomethylation-induced downregulation of cancer-associated genes.
View details for DOI 10.1016/j.ccell.2014.09.004
View details for PubMedID 25314073
View details for PubMedCentralID PMC4322907
Polycomb-mediated chromatin repression modulates gene expression during development in metazoans. Binding of multiple sequence-specific factors at discrete Polycomb response elements (PREs) is thought to recruit repressive complexes that spread across an extended chromatin domain. To dissect the structure of PREs, we applied high-resolution mapping of nonhistone chromatin proteins in native chromatin of Drosophila cells. Analysis of occupied sites reveal interactions between transcription factors that stabilize Polycomb anchoring to DNA, and implicate the general transcription factor ADF1 as a novel PRE component. By comparing two Drosophila cell lines with differential chromatin states, we provide evidence that repression is accomplished by enhanced Polycomb recruitment both to PREs and to target promoters of repressed genes. These results suggest that the stability of multifactor complexes at promoters and regulatory elements is a crucial aspect of developmentally regulated gene expression.
View details for DOI 10.1101/gr.163642.113
View details for PubMedID 24668908
View details for PubMedCentralID PMC4009610
Sequence-specific DNA-binding proteins including transcription factors (TFs) are key determinants of gene regulation and chromatin architecture. TF profiling is commonly carried out by formaldehyde cross-linking and sonication followed by chromatin immunoprecipitation (X-ChIP). We describe a method to profile TF binding at high resolution without cross-linking. We begin with micrococcal nuclease-digested non-cross-linked chromatin and then perform affinity purification of TFs and paired-end sequencing. The resulting occupied regions of genomes from affinity-purified naturally isolated chromatin (ORGANIC) profiles of Saccharomyces cerevisiae Abf1 and Reb1 provide high-resolution maps that are accurate, as defined by the presence of known TF consensus motifs in identified binding sites, that are not biased toward accessible chromatin and that do not require input normalization. We profiled Drosophila melanogaster GAGA factor and Pipsqueak to test ORGANIC performance on larger genomes. Our results suggest that ORGANIC profiling is a widely applicable high-resolution method for sensitive and specific profiling of direct protein-DNA interactions.
View details for DOI 10.1038/nmeth.2766
View details for PubMedID 24336359
View details for PubMedCentralID PMC3929178
An understanding of developmental processes requires knowledge of transcriptional and epigenetic landscapes at the level of tissues and ultimately individual cells. However, obtaining tissue- or cell-type-specific expression and chromatin profiles for animals has been challenging. Here we describe a method for purifying nuclei from specific cell types of animal models that allows simultaneous determination of both expression and chromatin profiles. The method is based on in vivo biotin-labeling of the nuclear envelope and subsequent affinity purification of nuclei. We describe the use of the method to isolate nuclei from muscle of adult Caenorhabditis elegans and from mesoderm of Drosophila melanogaster embryos. As a case study, we determined expression and nucleosome occupancy profiles for affinity-purified nuclei from C. elegans muscle. We identified hundreds of genes that are specifically expressed in muscle tissues and found that these genes are depleted of nucleosomes at promoters and gene bodies in muscle relative to other tissues. This method should be universally applicable to all model systems that allow transgenesis and will make it possible to determine epigenetic and expression profiles of different tissues and cell types.
View details for DOI 10.1101/gr.131748.111
View details for PubMedID 22219512
View details for PubMedCentralID PMC3317158
We previously identified an alpha1-AR-ERK (alpha1A-adrenergic receptor-extracellular signal-regulated kinase) survival signaling pathway in adult cardiac myocytes. Here, we investigated localization of alpha1-AR subtypes (alpha1A and alpha1B) and how their localization influences alpha1-AR signaling in cardiac myocytes. Using binding assays on myocyte subcellular fractions or a fluorescent alpha1-AR antagonist, we localized endogenous alpha1-ARs to the nucleus in wild-type adult cardiac myocytes. To clarify alpha1 subtype localization, we reconstituted alpha1 signaling in cultured alpha1A- and alpha1B-AR double knockout cardiac myocytes using alpha1-AR-green fluorescent protein (GFP) fusion proteins. Similar to endogenous alpha1-ARs and alpha1A- and alpha1B-GFP colocalized with LAP2 at the nuclear membrane. alpha1-AR nuclear localization was confirmed in vivo using alpha1-AR-GFP transgenic mice. The alpha1-signaling partners Galphaq and phospholipase Cbeta1 also colocalized with alpha1-ARs only at the nuclear membrane. Furthermore, we observed rapid catecholamine uptake mediated by norepinephrine-uptake-2 and found that alpha1-mediated activation of ERK was not inhibited by a membrane impermeant alpha1-blocker, suggesting alpha1 signaling is initiated at the nucleus. Contrary to prior studies, we did not observe alpha1-AR localization to caveolae, but we found that alpha1-AR signaling initiated at the nucleus led to activated ERK localized to caveolae. In summary, our results show that nuclear alpha1-ARs transduce signals to caveolae at the plasma membrane in cardiac myocytes.
View details for DOI 10.1161/CIRCRESAHA.108.176024
View details for PubMedID 18802028
View details for PubMedCentralID PMC2792747