Mol. Cells 2021; 44(11): 805-829
Published online November 12, 2021
https://doi.org/10.14348/molcells.2021.0224
© The Korean Society for Molecular and Cellular Biology
Correspondence to : daeyoup@kaist.ac.kr
This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/.
CCCTC-binding factor (CTCF) critically contributes to 3D chromatin organization by determining topologically associated domain (TAD) borders. Although CTCF primarily binds at TAD borders, there also exist putative CTCF-binding sites within TADs, which are spread throughout the genome by retrotransposition. However, the detailed mechanism responsible for masking the putative CTCF-binding sites remains largely elusive. Here, we show that the ATP-dependent chromatin remodeler, chromodomain helicase DNA-binding 4 (CHD4), regulates chromatin accessibility to conceal aberrant CTCF-binding sites embedded in H3K9me3-enriched heterochromatic B2 short interspersed nuclear elements (SINEs) in mouse embryonic stem cells (mESCs). Upon CHD4 depletion, these aberrant CTCF-binding sites become accessible and aberrant CTCF recruitment occurs within TADs, resulting in disorganization of local TADs. RNA-binding intrinsically disordered domains (IDRs) of CHD4 are required to prevent this aberrant CTCF binding, and CHD4 is critical for the repression of B2 SINE transcripts. These results collectively reveal that a CHD4-mediated mechanism ensures appropriate CTCF binding and associated TAD organization in mESCs.
Keywords 3D chromatin organization, ATP-dependent chromatin remodeler, B2 short interspersed nuclear elements, chromodomain-helicase-DNA binding protein 4, intrinsically disordered domains, topologically associated domains
The mammalian genome is spatially arranged into three-dimensional (3D) chromatin organizations that allow for proper gene regulation; these include chromosome territories (Cremer and Cremer, 2010), compartments (the A and B compartments are enriched for active/open chromatin and inactive/closed chromatin, respectively) (Lieberman-Aiden et al., 2009; Simonis et al., 2006), topologically associated domains (TADs) (Dixon et al., 2012; Nora et al., 2012), contact/loop domains (Rao et al., 2014; 2017), and insulated neighborhoods (Dowen et al., 2014; Ji et al., 2016). Previous studies found that the insulator protein, CCCTC-binding factor (CTCF), coexists with the cohesin complex in chromatin (Wendt et al., 2008) and localizes to the anchors/borders of chromatin loops (Dixon et al., 2012; Nora et al., 2012; Rao et al., 2014; Splinter et al., 2006). Furthermore, removal of the cohesin loader, Nipbl (Schwarzer et al., 2017), loss of cohesin itself (Rao et al., 2017), deletion of CTCF-binding sites (de Wit et al., 2015; Guo et al., 2015; Sanborn et al., 2015), or loss of CTCF itself (Nora et al., 2017) have all been shown to interfere with the maintenance of 3D chromatin organizations, suggesting that these factors all play critical functions. Notably, recent loss-of-function studies using single-cell analysis performed in mammalian cells yielded two interesting observations (Luppino et al., 2020; Szabo et al., 2020). First, the loss of CTCF was found to result in the loss of TAD borders and the subsequent merging of two previously insulated TADs, but did not significantly affect intra-TAD interactions. Second, and in contrast, the loss of the cohesin subunit, RAD21, significantly disrupted intra-TAD interactions but did not significantly affect TAD borders. These findings demonstrated that 3D chromatin organizations are regulated by the combined action of CTCF and cohesin via distinct mechanisms: Cohesin generates intra-TAD contacts, whereas CTCF prevents inter-TAD contacts by determining the TAD borders.
Importantly, in addition to the CTCF-binding sites (or motifs) that exhibit convergent orientation at the border of 3D domains (Rao et al., 2014), there also exist putative CTCF-binding sites at the TAD interiors, which are spread throughout the genome by retrotransposition of short interspersed nuclear element (SINE) retrotransposons (Bourque et al., 2008; Schmidt et al., 2012). From an evolutionary perspective, conserved CTCF-binding sites are enriched at the borders of conserved 3D domains, whereas CTCF-binding sites that diverge between species (divergent CTCF-binding sites) drive local 3D structural changes at the domain interiors (Vietri Rudan et al., 2015). Thus, it is critical to mask the putative CTCF-binding sites that lie within the TADs; otherwise, similar to the divergent CTCF-binding sites, these sites could cause aberrant CTCF recruitment and disrupt the normal 3D chromatin organizations. Although most studies to date have focused on the function of CTCF at the TAD borders, the prevention of inappropriate CTCF binding at TAD interiors is equally essential for the maintenance of 3D chromatin organizations. However, the detailed mechanism responsible for masking these putative CTCF-binding sites remains poorly understood.
To address this issue, we focused on the members of the chromodomain helicase DNA-binding protein (CHD) family, especially the well-known ATP-dependent chromatin remodeler, CHD4. The CHD family is evolutionarily conserved (Hall and Georgel, 2007), and its members contribute to diverse cellular processes by assembling nucleosomes (de Dieuleveult et al., 2016; Gaspar-Maia et al., 2009; Micucci et al., 2015; Skene et al., 2014). Since the majority of CTCF-binding sites are located at nucleosome-free regions (Cuddapah et al., 2009), it has been proposed that chromatin remodelers could govern CTCF recruitment by regulating nucleosome occupancy at certain CTCF-binding sites (Ong and Corces, 2014). Furthermore, the ability of chromatin remodelers to regulate access to DNA through nucleosome positioning may contribute to obscuring the putative CTCF-binding sites within TADs.
Here, we show that CHD4 regulates chromatin accessibility to conceal aberrant CTCF-binding sites within TADs, thereby preventing aberrant CTCF recruitment and securing the 3D chromatin organizations in mESCs. We used various next-generation sequencing (NGS) assays, including
E14Tg2a mESCs were maintained under feeder-free conditions. Briefly, cells were cultured on gelatin-coated cell culture dishes in an mESC culture medium consisting of Glasgow’s minimum essential medium (GMEM) containing 10% knockout serum replacement, 1% non-essential amino acids, 1% sodium pyruvate, 0.1 mM β-mercaptoethanol (all from Gibco, USA), 1% fetal bovine serum, 0.5% antibiotic-antimycotic (both from Hyclone, USA) and 1,000 units/ml LIF (ESG1106; Millipore, USA). mESCs were maintained at 37°C with 5% CO2 in humidified air.
The siRNAs against
Total RNAs were purified from mESCs using the TRIzol reagent (Invitrogen, USA) according to the manufacturer’s protocol. Briefly, mESCs cultured in 6-well plates were harvested and homogenized with 1 ml of TRIzol reagent. Chloroform (200 μl/sample) was added, and the samples were mixed vigorously by hand for 15 s and incubated at 25°C for 2 min. The mixtures were centrifuged at 12,000 rpm for 15 min at 4°C, and 500 μl of each aqueous phase was transferred to a new Eppendorf tube and mixed with the same volume of isopropanol. The mixtures were incubated at 25°C for 10 min to precipitate total RNAs. The samples were centrifuged at 12,000 rpm for 10 min at 4°C, washed with 75% ethanol, and centrifuged again at 10,000 rpm for 5 min at 4°C. The RNA pellets were dried and dissolved in RNase-free water, and 1 μg of DNase-treated total RNA was applied for cDNA synthesis using an Improm-II Reverse transcription system (A3802; Promega, USA) according to the manufacturer’s protocol. For analysis of retrotransposon expression, the cDNA synthesis step was primed with random hexamers.
The generated cDNAs were amplified using a BioFact Real-time PCR kit (BIOFACT, Korea) according to the manufacturer’s manual. The primer sequences used for RT-qPCR are presented in Supplementary Table S2. Briefly, 20-μl reactions containing 1× EvaGreen, 10 mM tetraethylammonium chloride, and 10 pmol of primers were analyzed with a CFX96 system (Bio-Rad, USA) under the following conditions: 95°C for 12 min (initial melting), followed by 40 cycles of 95°C for 20 s (denaturation), 57°C for 30 s (annealing), and 72°C for 30 s (extension). The relative expression levels of Chd family members and retrotransposons were quantified with respect to those of β-actin and the 28S rRNA, respectively.
We employed the auxin-inducible degron (AID) system to the
Cleavage under targets and release using nuclease (CUT&RUN) assays were performed as previously described (Meers et al., 2019; Skene and Henikoff, 2017), with minor modification. Briefly, 4 million mESCs were harvested and washed thrice with 1.5 ml wash buffer (20 mM HEPES, pH 7.5, 150 mM NaCl, 0.5 mM spermidine). Cells were bound to activated concanavalin A-coated magnetic beads (at 25°C for 10 min on a nutator), then permeabilized with antibody buffer (wash buffer containing 0.05% digitonin and 4 mM EDTA). The bead-cell slurry was incubated with 3 μl of the appropriate antibody (see below) in a 150 μl volume at 25°C for 2 h on a nutator. After two washes in 1 ml Dig-wash buffer (wash buffer containing 0.05% digitonin), the beads were resuspended in 150 μl pAG/MNase and incubated at 4°C for 1 h on a nutator. After two washes in 1 ml Dig-wash buffer, the beads were gently vortexed with 100 μl Dig-wash buffer. The tubes were chilled to 0°C for 5 min and ice-cold 2.2 mM CaCl2 was added with gentle vortexing. The tubes were immediately placed on ice and incubated at 4°C for 1 h on a nutator; thereafter, 100 μl 2xSTOP buffer (340 mM NaCl, 20 mM EDTA, 4 mM EGTA, 0.05% digitonin, 0.1 mg/ml RNase A, 50 μg/ml glycogen) was added and the tubes were incubated at 37°C for 30 min on a nutator. The beads were secured using a magnet stand, and the liquid was removed to a fresh tube, combined with 2 μl 10% SDS and 2.5 μl proteinase K (20 mg/ml), and incubated at 50°C for 1 h. DNA was extracted using phenol chloroform as described at https://www.protocols.io/view/cut-amp-run-targeted-in-situ-genome-wide-profiling-zcpf2vn. CUT&RUN libraries were prepared using an Accel-NGS 2S Plus DNA Library Kit (21024; Swift BioSciences, USA), according to the manufacturer’s guidelines. The libraries were then sequenced using an Illumina Novaseq 6000 platform (Illumina, USA). The libraries were generated from two sets of biological replicates.
For Control and Chd4KD cells,
ChIP assays were performed as previously described (Kim et al., 2012), with minor modification. Briefly, mESCs were washed twice with phosphate-buffered saline (PBS) and incubated with 1% formaldehyde for 15 min at 25°C. This crosslinking was quenched with 125 mM glycine at 25°C for 5 min, and the cells were harvested with cold PBS and suspended in SDS lysis buffer (50 mM Tris-HCl, pH 8.0, 1% SDS, 10 mM EDTA). The chromatin was sheared to mono- or dinucleosome sizes using a focused ultrasonicator (S220; Covaris, USA). The sonicates were incubated overnight with the relevant antibodies (see below) and Protein A, G sepharose (17-1279-03 and 17-0618-05; GE Healthcare, USA) at 4°C on a nutator. The immune complexes were washed for 10 min each with the following wash buffers: low-salt wash buffer (20 mM Tris-HCl, pH 8.0, 150 mM NaCl, 0.1% SDS, 1% Triton X-100, 2 mM EDTA), high-salt wash buffer (20 mM Tris-HCl, pH 8.0, 500 mM NaCl, 0.1% SDS, 1% Triton X-100, 2 mM EDTA), and LiCl wash buffer (10 mM Tris, pH 8.0, 250 mM LiCl, 1% NP-40, 1% sodium deoxycholate, 1 mM EDTA). The immune complexes were further washed twice with TE buffer (10 mM Tris-HCl, pH 8.0, 1 mM EDTA). Finally, the immune complexes were eluted with elution buffer (1% SDS, 0.1 M NaHCO3) and reverse-crosslinked overnight at 68°C. The immunoprecipitated DNA was treated with proteinase K and RNase A and recovered by phenol-chloroform-isoamyl alcohol precipitation. ChIP-seq libraries were prepared using a NEXTflex ChIP-seq kit (5143-02; Bioo Scientific, USA) according to the manufacturer’s guidelines. The libraries were then sequenced using an Illumina HiSeq 2500 platform. The libraries were generated from two sets of biological replicates.
Antibodies against CHD4 (ab70469; Abcam [UK] and in-house generated), CTCF (ab37477; Abcam), α-tubulin (2144S; Cell Signaling Technology, USA), V5 (R960-25; Thermo Fisher Scientific), mCherry2 (M11217; Invitrogen), β-actin (SC-47778; Santa Cruz Biotechnology), and FLAG (F7425; Sigma) were used for immunoblotting. Antibodies against CHD4 (ab70469; Abcam and in-house generated) and IgG (12-371; Millipore) were used for immunoprecipitation. Antibodies against CTCF (07-729; Millipore), RAD21 (ab992; Abcam), H3 (ab1791; Abcam), H3K9me3 (ab8898; Abcam), H3K4me1 (in-house generated), H3K4me3 (in-house generated), H3K27ac (ab4729; Abcam), and H3K27me3 (07-449; Millipore) were used for ChIP-seq. Antibodies against CHD4 (ab70469; Abcam and in-house generated), RAD21 (ab992; Abcam), and IgG (12-371; Millipore and in-house generated) were used for CUT&RUN.
E14Tg2a mESCs were transfected with siRNAs against
ATAC-seq libraries were prepared as previously described (Buenrostro et al., 2013; 2015; Choi et al., 2020), with minor modification. Briefly, 50,000 mESCs were harvested, washed with cold PBS, lysed with cold lysis buffer, and immediately centrifuged. The nuclear pellets were resuspended in 25 μl of 2× tagmentation reaction buffer (10 mM Tris, pH 8.0, 5 mM MgCl2, 10% dimethylformamide), 23 μl of nuclease-free water, and 2 μl of Tn5 transposase (in-house generated), and incubated at 37°C for 30 min. The samples were then immediately purified using a QIAquick PCR purification kit (28106; Qiagen, Germany). The libraries were pre-enriched for five cycles using the KAPA HiFi Hotstart ready mix (KK2601; Kapa Biosystems, USA), and the threshold cycle (Ct) was monitored using qPCR to determine the additional enrichment cycles, which were then applied. The final libraries were purified again with a QIAquick PCR purification kit, and sequenced using an Illumina HiSeq 2500 platform. The libraries were generated from two sets of biological replicates.
Cells cultured in 60Ф culture dishes were washed with PBS and incubated with modified PBS (150 mM NaCl, 0.02% Tween-20, 0.02% Triton X-100) at 25°C for 5 min. The cells were then fixed with 1% formaldehyde at 25°C for 10 min, quenched with 125 mM glycine at 25°C for 5 min, washed, and suspended with lysis buffer (18% Ficoll-400, 10 mM KH2PO4, 10 mM K2HPO4, 1 mM MgCl2, 250 nM EGTA). The lysates were centrifuged at 12,000 rpm for 40 min at 4°C, and the pelleted chromatin was resuspended in A buffer (10 mM Tris-Cl, pH 7.4, 150 mM NaCl, 5 mM KCl, 1 mM EDTA). CaCl2 (5 mM) was added to the samples, which were immediately digested with MNase (M0247S; NEB) at 37°C for 60 min. The reaction was inactivated with 1% SDS and 500 mM EDTA, and reverse-crosslinking was performed overnight at 68°C. Proteinase K and RNase were added sequentially to the samples, and DNAs were recovered using phenol-chloroform-isoamyl alcohol extraction. The MNase-digested DNAs were separated on a 1.5% agarose gel, and mono-nucleosomal DNAs were extracted using a QIAEX II gel extraction kit (20021; Qiagen) according to the manufacturer’s manual. The purified mono-nucleosomal DNAs were subjected to sequencing using a TruSeq DNA library prep kit (FC-121-2001; Illumina). The final libraries were sequenced using an Illumina HiSeq 2500 platform. The libraries were generated from two sets of biological replicates.
DNase-treated total RNAs were purified from mESCs and used for total RNA-seq. Total RNA-seq libraries were prepared using a TruSeq stranded RNA kit (RS-122-2301; Illumina) according to the manufacturer’s manual. For mRNA-seq library preparation, mRNAs were isolated from total RNA using a Magnetic mRNA isolation kit (S1550S; NEB), and libraries were prepared using a NEXTflex Rapid directional RNA-seq kit (5138-08; Bioo Scientific). The libraries were sequenced using an Illumina HiSeq 2500 platform. The libraries were generated from two sets of biological replicates.
E14Tg2a mESCs were harvested, washed with cold PBS, and lysed with EBC buffer (50 mM Tris-HCl, pH 8.0, 150 mM NaCl, 0.5% NP-40, 1 mM PMSF). The lysates were incubated for 1 h with the indicated antibodies and Protein A, G sepharose (17-1279-03 and 17-0618-05; GE Healthcare) at 4°C with agitation. The immune complexes were washed three times with EBC buffer (50 mM Tris-HCl, pH 8.0, 150 mM NaCl, 0.5% NP-40, 1 mM PMSF). The lysates were boiled for 5 min with SDS sample buffer, resolved by SDS-PAGE, and subjected to immunoblotting.
RNA immunoprecipitation (RIP) assays were conducted as previously described (Heo and Sung, 2011; Kim et al., 2014), with minor modification. Briefly, cultured mESCs were crosslinked with 1% formaldehyde for 15 min at 25°C and quenched with 125 mM glycine. The cells were harvested, washed with cold PBS, and then lysed with RIPA buffer (50 mM Tris-Cl, pH 8.0, 150 mM NaCl, 0.05% SDS, 1 mM EDTA, 1% NP-40, and 0.05% sodium deoxycholate) containing an RNase inhibitor (M007L; Enzynomics, Korea). The samples were sonicated, treated with DNase I (18068-015; Invitrogen) for 10 min at 37°C, and cleared by centrifugation. The extracts were bound with anti-Chd4 (ab70469; Abcam) for 2 h at 4°C in the presence of RNase inhibitor. The immune complexes were further incubated with Dynabeads Protein A and G (10001D, 10004D; Invitrogen) for 1 h at 4°C, and then washed for 10 min each with the following wash buffers: low-salt wash buffer (20 mM Tris-HCl, pH 8.0, 150 mM NaCl, 0.1% SDS, 1% Triton X-100, 2 mM EDTA), high-salt wash buffer (20 mM Tris-HCl, pH 8.0, 500 mM NaCl, 0.1% SDS, 1% Triton X-100, 2 mM EDTA), and LiCl wash buffer (10 mM Tris, pH 8.0, 250 mM LiCl, 1% NP-40, 1% sodium deoxycholate, 1 mM EDTA). The immune complexes were further washed twice with TE buffer (10 mM Tris-HCl, pH 8.0, 1 mM EDTA), eluted, and reverse-crosslinked with 400 mM NaCl for 2 h at 65°C. RNAs were purified from the eluates using an RNA Clean & ConcentratorTM-5 kit (R1015; Zymo Research, USA) according to the manufacturer’s manual, and then incubated again with DNase I (18068-015; Invitrogen) for 30 min at 37°C. Finally, the DNase I was inactivated and removed using a DNA-freeTM kit (AM1906; Invitrogen) according to the manufacturer’s manual.
The plasmids used for
EMSA was carried out as previously described (Kim et al., 2014), with minor modifications. Briefly, the indicated amount of purified Chd4 protein was incubated with 5 nM of biotinylated RNAs in binding buffer (50 mM Tris-Cl, pH 7.5, 100 mM NaCl, 10 mM β-mercaptoethanol, 5% glycerol) for 40 min at 25°C. Bound protein-RNA complexes were resolved on 5% native polyacrylamide mini gels at 70V and 4°C for 2 h. The complexes were transferred to an Amersham Hybond-N+ nylon membrane (RPN303B; GE Healthcare), UV-crosslinked, and visualized using a Chemiluminescent Nucleic Acid Detection Module kit (89880; Thermo Fisher Scientific) according to the manufacturer’s manual.
Nucleosome reconstitution was performed as previously described (Owen-Hughes et al., 1999). Briefly, two types (lateral and central forms) of a 147-bp DNA fragment were amplified from pGEM-3z/601, and
For the nucleosome-sliding assay, reconstituted nucleosomes (75 ng, 0.36 pmol) were incubated with purified Chd4 (50 ng, 0.23 pmol), varying concentrations of purified RNAs (0 pmol, 0.1 pmol, 0.2 pmol, 0.5 pmol, and 1 pmol), and ATP (0.1 pmol) in sliding buffer (50 mM KCL, 20 mM HEPES, pH 7.9, 2 mM DTT, 0.5 mM PMSF, 0.05% NP40, 10% glycerol, 100 μg/ml BSA, 10 mM MgCl2) for 45 min at 30°C. The reaction was stopped by the addition of 500 ng of plasmid (size > 10 kb) and 0.1 pmol of ATP-γ-S tetralithium salt (10102342001-Roche; Sigma) followed by incubation for 30 min at 30°C. The samples were resolved on 5% native polyacrylamide mini gels at 65 V and 4°C for 2.5 h.
Circularized GRO-seq was performed as previously described (Danko et al., 2013; Lam et al., 2013). Briefly, around 10 million nuclei per sample were purified and used for global run-on and base hydrolysis, which were carried out as previously described (Core et al., 2008). BrU-labeled nascent RNAs were immunoprecipitated twice with anti-BrdU antibody-conjugated agarose (SC-32323ac; Santa Cruz Biotechnology). Between the two immunoprecipitations, the BrU-precipitated RNAs were subjected to polyA tailing with poly(A)-polymerase (P7460L; Enzymatics, USA). The RNAs were subjected to first-strand cDNA synthesis using Superscript III Reverse Transcriptase (18080-044; Invitrogen) and the oNTI223 RT primer (IDT). Excess RT primers were removed with exonuclease I (M0293S; NEB), and the cDNAs were size-selected (120-220 nt) in a 6% polyacrylamide TBE-urea gel. The selected cDNAs were circularized using CircLigase (CL4111K; Epicentre, USA) and re-linearized with ApeI (M0282S; NEB). The single-stranded DNA templates were amplified using Phusion High-Fidelity DNA Polymerase (M0530S; NEB) and Illumina TruSeq small-RNA sample barcoded primers. The generated PCR products were isolated and size-selected (190-290 bp) by electrophoresis on a 6% native polyacrylamide TBE gel. The final libraries were sequenced on an Illumina HiSeq 2500 platform. The libraries were generated from two sets of biological replicates.
For Hi-C analysis, the
For the MNase-seq, ATAC-seq, ChIP-seq, CUT&RUN, and GRO-seq analyses, the raw reads were aligned to the mouse genome (mm10) using Bowtie2 (ver. 2.2.9) with default parameters (Langmead and Salzberg, 2012). For the total RNA-seq and mRNA-seq analyses, the raw reads were aligned to the mouse genome (mm10) using STAR (ver. 2.5.2a) with default parameters (Dobin et al., 2013).
Generally, we used MACS2 (Zhang et al., 2008) to convert the aligned BAM files into bedGraph files, and normalized the data with respect to the total read counts. Then, we used bedGraphToBigWig (Kent et al., 2010) to convert the bedGraph files into bigWig files. The bigWig files were used as input files for bwtool (Pohl and Beato, 2014) (matrix and aggregate option) to quantify the intensity (e.g., using heatmaps or average line plots) of the relevant sequencing data. Further analyses were performed using in-house scripts, MACS2 (Zhang et al., 2008), and HOMER (Heinz et al., 2010). All of our raw data (fastq files) were confirmed to be of good quality using FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/).
For our CUT&RUN and ChIP-seq analyses, we used MACS2 (callpeak option,
For our mRNA-seq and total RNA-seq analyses, we used Cufflinks (Cuffdiff option, fr-firststrand) to assess the expression levels. For our analysis of SINEs, we built a STAR-index exclusively containing the previously identified 1.5 million SINE copies (RepeatMasker in UCSC table browser), and performed STAR and Cufflinks analyses as described above.
For our GRO-seq analysis, we used HOMER (findPeaks option, -style groseq) to examine the number of
Box plots, volcano plots, and other plots were drawn with R (ggplot2) (Wickham, 2009) and heatmaps were drawn with Java TreeView (Saldanha, 2004). The examples of our genome-wide data were visualized using the Integrative Genomics Viewer (IGV) (Robinson et al., 2011).
The publicly released ChIP-seq data were downloaded from NCBI GEO DataSets and the ENCODE Consortium. These data were downloaded as sra or fastq formats, and the sra files were converted to fastq files using the SRA Toolkit (https://trace.ncbi.nlm.nih.gov/Traces/sra/sra.cgi?view=software). Thus, we analyzed the public datasets and our own results using the same methods. Details of the analyzed data are summarized in Supplementary Table S4.
We first determined the expression of
To elucidate the function of CHD4, we first identified CHD4-binding sites by performing CUT&RUN (Skene and Henikoff, 2017) and also by analyzing the public chromatin immunoprecipitation sequencing (ChIP-seq) data against CHD4 (GSE64825, GSE61188, and GSE27844, see also Supplementary Table S4). Then, we analyzed genomic features within the CHD4-binding sites (CUT&RUN and ChIP-seq peaks). Notably, we found that CTCF motifs were highly enriched within CHD4-binding sites relative to CHD1-binding sites and randomly selected sites (Supplementary Fig. S1B, Supplementary Table S5). Interestingly, we also observed that CHD4 localized to the borders of TADs (Fig. 1A, Supplementary Fig. S1C) and the anchors of insulated neighborhoods (Fig. 1B, Supplementary Fig. S1D), where CTCF is known to be localized (Figs. 1A and 1B) and play a boundary function (Dixon et al., 2012; Dowen et al., 2014; Luppino et al., 2020; Nora et al., 2012; Rao et al., 2014; Szabo et al., 2020). Furthermore, 34% of the CTCF ChIP-seq peaks coincided with the CHD4 CUT&RUN peaks (Fig. 1C), suggesting that CHD4 and CTCF resided near one another. To evaluate this possibility, we analyzed the localization of CHD4 near the CTCF ChIP-seq peaks. As expected, we found that CHD4 resided near CTCF (Figs. 1D and 1E). Intriguingly, although the CHD4-binding sites were enriched for CTCF motifs (Supplementary Fig. S1B, Supplementary Table S5), we observed that CHD4 was also present at locations lacking CTCF (Figs. 1C and 1E), suggesting that CHD4 may conceal CTCF motifs (Fig. 1F). We also observed that CTCF localized to exposed DNA sequences surrounded by well-positioned nucleosomes (Figs. 1D and 1E), consistent with a previous report (Cuddapah et al., 2009). The presence of CTCF at nucleosome-free regions flanking nucleosomes and CHD4 (Fig. 1D) may suggest that CHD4 could regulate nucleosome occupancy near CTCF-binding sites, and thereby control interactions between CTCF and DNA sequences harboring CTCF motifs. Taken together, these results strongly indicate that CHD4 and CTCF are closely linked in mESCs. Thus, we hypothesized that CHD4 might control CTCF-mediated 3D chromatin organization by regulating chromatin accessibility near CTCF-binding sites (Fig. 1F).
To test our hypothesis, we first designed and applied siRNAs against
To investigate whether CHD4 depletion disturbed the 3D chromatin organization, we performed
To investigate the role of CHD4 in local 3D chromatin organization, we performed ATA upon CHD4 depletion at four TADs obtained from each cell (Control, Chd4KD, –Aux, and +Aux cells). Notably, we detected genome-wide decreases of Hi-C interactions within the TADs of Control and –Aux cells upon CHD4 depletion (Figs. 2A and 2B, arrow). In contrast, Hi-C interactions within the TADs of Chd4KD and +Aux cells were largely unaltered by CHD4 loss (Figs. 2A and 2B). These results suggest that CHD4 maintains the local interactions within the original TADs of wild-type mESCs.
To address this further, we identified four types of differential TADs seen upon CHD4 depletion: We compared the changes in Hi-C interactions to identify weakened or strengthened TADs, and compared the changes in TAD positions to identify separating and merging TADs (Figs. 2C-2G). Weakened TADs were defined as original TADs of wild-type mESCs (Control or –Aux cells) whose Hi-C interactions were decreased upon CHD4 loss. Strengthened TADs were defined as TADs of CHD4-depleted cells (Chd4KD or +Aux cells) whose Hi-C interactions were increased upon CHD4 loss. Separating TADs were defined as original TADs of wild-type mESCs (Control or –Aux cells) that were split into two or more smaller TADs upon CHD4 loss. Finally, merging TADs were defined as TADs of CHD4-depleted cells (Chd4KD or +Aux cells) that contained two or more original TADs of wild-type mESCs (Control or –Aux cells). The ATA results indicated that Hi-C interactions showed genome-wide decreases at weakened TADs (Fig. 2D) and separating TADs (Fig. 2E), but genome-wide increases at the strengthened TADs (Fig. 2F) and merging TADs (Fig. 2G) upon CHD4 loss. The decreasing TADs (weakened and separating TADs) shared more TADs than the increasing TADs (strengthened and merging TADs) (Supplementary Fig. S2A), indicating that weakened and separating TADs were more closely associated. Interestingly, comparing the numbers of the four types of differential TAD seen upon CHD4 loss, revealed that decreasing TADs occurred more frequently than increasing TADs (Fig. 2H). This was consistent with our above-described observation that Hi-C interactions showed genome-wide decreases within the original TADs of wild-type mESCs (Figs. 2A and 2B, arrow). Thus, we focused on decreasing TADs (weakened and separating TADs).
To elucidate how CHD4 maintains local interactions within TADs, we performed CTCF ChIP-seq upon CHD4 loss. By comparing the CTCF intensity, we defined CTCF signals that were gained or lost upon CHD4 depletion (Fig. 2I). Next, we divided the decreasing TADs (weakened and separating TADs) into borders and interiors (Fig. 2J, top right). We observed that both weakened and separating TADs exhibited similar distribution patterns of gained/lost CTCF at the borders/interiors of TADs (Fig. 2J), suggesting that these two types of decreasing TAD may be fundamentally the same and/or formed similarly upon CHD4 loss. We did not detect any significant difference in the distribution of gained and lost CTCF at TAD borders, whereas gained CTCF was significantly enriched at TAD interiors compared to lost CTCF (Fig. 2J). These results indicate that gained CTCF may be responsible for decreasing interactions within weakened and separating TADs upon CHD4 loss.
To further support this notion, we comprehensively analyzed Hi-C interactions, TAD positions, insulations scores, and CTCF ChIP-seq at specific regions of the genome upon CHD4 loss. Consistent with our genome-wide ATA results (Figs. 2D and 2E), we found a notable decrease in Hi-C interactions within the weakened and separating TADs (Fig. 2K, Supplementary Figs. S2B and S2C, black arrow). Importantly, we observed that the presence of a gained CTCF (marked with *) greatly reduced the local insulation, thereby generating a novel TAD border (separating TAD, Fig. 2K, Supplementary Fig. S2B) and/or markedly decreasing the Hi-C interactions within TADs (weakened TAD, Supplementary Figs. S2B and S2C). In general, we observed the same types of decreasing TAD in the two cell models CHD4 loss (Chd4KD and +Aux cells); at some genomic loci, however, we observed a weakened TAD in Chd4KD cells but a separating TAD in +Aux cells (Supplementary Fig. S2B). This observation supports the idea that these two types of decreasing TAD are closely associated, and that the gained CTCF causes both types of decreasing TAD upon CHD4 depletion. When considering the splitting feature of separating TADs (Fig. 2C), one can speculate the reduced TAD sizes and an elevated number of chromatin loops (peaks) coupled with gained CTCF upon CHD4 loss. As expected, we observed reduced TAD sizes (Supplementary Fig. S2D) and an elevated number of total chromatin loops (peaks) and cell-type-specific peaks (Supplementary Figs. S2E and S2F) upon CHD4 depletion (Supplementary Figs. S2G and S2H). Importantly, the insulation scores at the gained CTCF sites were globally reduced upon CHD4 depletion (Fig. 2L), indicating that the CHD4 depletion-triggered gain of CTCF is directly responsible for generating TAD borders in a genome-wide manner. Furthermore, our APA results showed that Hi-C interactions were increased at the gained CTCF-associated peaks upon CHD4 loss (Supplementary Fig. S2I), supporting the direct relationship between gained CTCF and changes in Hi-C interactions. Collectively, our results strongly imply that the decreasing TADs (weakened and separating TADs) are directly associated with gained CTCF upon CHD4 loss. Therefore, these findings suggest that CHD4 regulates the local 3D chromatin organization by preventing the aberrant recruitment of CTCF to TAD interiors.
To investigate how CHD4 prevents aberrant CTCF binding, we analyzed the genome-wide changes in CTCF upon CHD4 depletion at CHD4 CUT&RUN peaks (or CHD4-binding sites) (Figs. 3A and 3B, Supplementary Figs. S3A and S3B). We classified the CHD4 CUT&RUN peaks into those that coincided with CTCF ChIP-seq peaks of wild-type mESCs (Control or –Aux cells) and those that did not coincide with these peaks. We did not detect any major change in CTCF upon CHD4 depletion at CHD4 peaks that coincided with CTCF peaks (Figs. 3A and 3B, Supplementary Figs. S3A and S3B), but observed aberrant gain of CTCF upon CHD4 depletion at CHD4 peaks that did not coincide with CTCF peaks (Figs. 3A and 3B, Supplementary Figs. S3A and S3B, arrow). Since CHD4 CUT&RUN peaks are highly enriched for CTCF motifs (Supplementary Fig. S1B, Supplementary Table S5), these results indicate that CHD4 conceals putative CTCF motifs (Fig. 1F, left). The observed changes in CTCF binding were not derived from any change in
To elucidate how CHD4 conceals putative CTCF motifs, we comprehensively analyzed CTCF ChIP-seq data obtained from Control and Chd4KD cells. We first determined the CTCF-binding sites (or ChIP-seq peaks) in Control and Chd4KD cells. We then merged the CTCF-binding sites (n = 39,800) and sorted them in descending order of their relative CTCF intensity (Chd4KD/Control) (Figs. 3C and 3F, Supplementary Figs. 3E, S3H-S3J). According to this relative CTCF intensity, we defined two groups of CTCF-binding sites: Group 1 (n = 4,736, 11.9%) represents CTCF-binding sites at which more CTCF was recruited in Chd4KD cells compared to Control cells, while Group 2 (n = 35,064, 88.1%) represents the remainder of the CTCF-binding sites, at which CTCF binding was unchanged or reduced in Chd4KD cells. Consistent with our above-described results (Figs. 2I, 2K, 3A, and 3B, Supplementary Figs. S2B, S2C, S2E, S3A, and S3B), we observed that CHD4 depletion-triggered gain and loss of CTCF binding occurred at the Group 1 and Group 2 CTCF-binding sites, respectively (Fig. 3C). Furthermore, we found that RAD21 was aberrantly gained along with CTCF at the Group 1 sites upon CHD4 depletion (Supplementary Fig. S3E, left). This supports our observation that the CTCF that is aberrantly gained upon CHD4 depletion is directly involved in forming new chromatin loops (Supplementary Fig. S2I), and thereby disrupting local 3D chromatin organizations.
Previously, we observed that CTCF binds to exposed DNA sequences surrounded by well-positioned nucleosomes (Fig. 1D) (Cuddapah et al., 2009), and herein found that CHD4 also localizes near these sites (Fig. 1D). In addition, we found that CTCF was aberrantly gained upon CHD4 depletion at the CHD4 CUT&RUN peaks (Figs. 3A and 3B, Supplementary Figs. S3A and S3B), which harbor CTCF motifs. Considering these results, we hypothesized that CHD4 conceals putative CTCF motifs by regulating chromatin accessibility at specific DNA sequences with embedded CTCF motifs (Fig. 1F, left). To test this hypothesis, we analyzed MNase-seq (chromatin digestion with micrococcal nuclease combined with sequencing), DNase-seq (DNase I hypersensitive site sequencing), and ATAC-seq (assay for transposase-accessible chromatin using sequencing) data and aligned them with the same order as previously defined relative CTCF intensity (Group 1 and 2 sites). In Control cells, CTCF mainly localized to Group 2 sites (Fig. 3C), which were characterized by an accessible chromatin region with depleted nucleosomes, enriched DNase I hypersensitive sites, and enriched ATAC-seq signals (Fig. 3C). In accordance with this, Control cells exhibited far less binding of CTCF at Group 1 sites (Fig. 3C), which were characterized by an inaccessible chromatin region with enriched nucleosomes, depleted DNase I hypersensitive sites, and depleted ATAC-seq signals (Fig. 3C). Importantly, we detected a significant decrease in nucleosomes and a significant increase in ATAC-seq signals upon CHD4 depletion at Group 1 sites (Figs. 3C-3E), indicating that CHD4 depletion caused the Group 1 sites to change from inaccessible chromatin regions to accessible chromatin regions. In contrast, we did not detect any significant change in chromatin accessibility at Group 2 sites (Figs. 3C and 3D).
To examine why nucleosomes were depleted specifically at Group 1 sites, we analyzed the features of these DNA sequences. Notably, we found that nucleosome-repelling sequences (AA-AAAA) and DNA bases (AT) (Kaplan et al., 2009; Segal and Widom, 2009; Tillo and Hughes, 2009; Valouev et al., 2011) were more abundant within Group 1 sites compared to Group 2 sites (Fig. 3C). Moreover, CTCF motifs were also present within Group 1 sites (Fig. 3C), even though these sites were not typically targeted by CTCF in Control cells (Fig. 3C). Consistently, when we performed the same analysis using CTCF ChIP-seq peaks of –Aux and +Aux cells, we observed similar changes in CTCF, RAD21, and ATAC-seq signals upon CHD4 depletion at Group 1* sites (Supplementary Figs. S3F and S3G). The Group 1* sites also exhibited patterns similar to those of Group 1 sites in terms of depleted DNase I hypersensitive sites, enriched nucleosome-repelling sequences, and enriched CTCF motifs (Fig. 3C). Taken together, our results suggest that CHD4 maintains the inaccessible/closed chromatin state and conceals CTCF motifs at Group 1 sites by assembling nucleosomes, thereby preventing aberrant CTCF recruitment. Upon CHD4 depletion, however, nucleosomes are no longer assembled by CHD4 and instead tend to be removed from Group 1 sites due to the high-level enrichment of nucleosome-repelling sequences, resulting in the open/accessible chromatin states, exposure of CTCF motifs, and the mislocalization of CTCF. Thus, our data support the idea that the CHD4-mediated regulation of chromatin accessibility controls the appropriate recruitment of CTCF in mESCs.
We next determined the epigenetic features of the Group 1 sites using ChIP-seq data generated against various factors, including histone modifications. Consistent with our observation of nucleosome enrichment (Fig. 3C), we observed that histone H3.1/2 was enriched at Group 1 sites in wild-type mESCs (Supplementary Fig. S3H). Remarkably, among the various histone modifications, H3K9me3 (Fig. 3F, Supplementary Fig. S3H) and the H3K9 methyltransferases, SUV39h1 and SUV39h2, but not SETDB1 (Supplementary Fig. S3H), were the only marks enriched at Group 1 sites in wild-type mESCs. As expected, given that the chromatin structure changed from the closed (nucleosome-enriched) to open (nucleosome-depleted) chromatin state at Group 1 sites upon CHD4 depletion, H3K9me3 was also diminished at these regions in Chd4KD cells (Fig. 3F); this reflected the absence of core histones, which are the substrates for H3K9 methyltransferases. This reduction was not related to any change in the expression of genes encoding H3K9 methyltransferases (Supplementary Table S7). Whereas CHD4 (Supplementary Fig. S3E) and ChAHP complexes (Supplementary Fig. S3I) were abundant at Group 1 sites, most of the analyzed chromatin remodelers (Supplementary Fig. S3E), mediators, histone variants, histone modifications (Supplementary Fig. S3H), nucleosome remodeling deacetylase (NuRD) complexes, architectural proteins, and transcription factors (Supplementary Fig. S3J) were sparse at the Group 1 sites. Together, these results indicate that CHD4 assembles core histones to conceal putative CTCF motifs at heterochromatic regions.
Lastly, we analyzed the positions of the Group 1 and Group 2 CTCF-binding sites with respect to the TADs. We observed that the Group 2 sites were mostly localized at the TAD borders, whereas the Group 1 sites (equivalent to gained CTCF in Fig. 2I) were not (Fig. 3G). Instead, the latter sites were mainly located at the interior regions of TADs in Control cells (84.8%) and –Aux cells (86.7%) (Fig. 3H), which is consistent with our earlier observations (Figs. 2J and 2K, Supplementary Figs. S2B and S2C). In light of the reduced insulation scores (Fig. 2L) and APA results (Supplementary Fig. S2I), these data collectively imply that CHD4 depletion-triggered gained CTCF (Group 1 sites) directly contributes to disrupting local TAD organizations by forming a TAD borders and/or chromatin loops at TAD interiors.
Our results indicated that CHD4 depletion increased both chromatin accessibility (decreased nucleosome levels and increased ATAC-seq signals) and CTCF occupancy at Group 1 sites (Fig. 3, Supplementary Fig. S3). Since CHD4 is a chromatin remodeler, we speculated that CHD4 depletion would first alter chromatin accessibility, leading to the exposure of CTCF motifs, whereupon CTCF would be newly recruited to the Group 1 sites, resulting in aberrant CTCF binding (Fig. 3, Supplementary Fig. S3, see also Fig. 4A). To confirm this, we carried out two types of temporal experiments to determine the order of events (Figs. 4A and 4H). We first performed temporal depletion of CHD4 by varying the RNAi (siRNA against
To further validate our findings, we performed temporal restoration of CHD4 levels (Fig. 4H). We first confirmed that CHD4 was completely degraded after 6 h of auxin treatment in CHD4-mAID cells (Supplementary Fig. S4B). Then, we removed the auxin by changing the cell culture medium. We found that CHD4 protein was gradually restored during 12 h of auxin withdrawal (Figs. 4I and 4J). To determine whether core histones assembly precedes CTCF eviction (or vice versa) at Group 1 sites upon CHD4 restoration, we varied the auxin withdrawal time (Fig. 4I) and performed ChIP-seq against CTCF and histone H3. Consistent with our earlier results (Figs. 3 and 4D, Supplementary Fig. S3), we found that CHD4 depletion, which is equivalent to 6 h of auxin treatment without auxin withdrawal (“+Aux (6 h), withdraw 0 h”), decreased the H3 level and increased the CTCF level at Group 1 sites (Supplementary Fig. S4C). Upon temporal restoration of CHD4, the CTCF level gradually decreased (gained CTCF) and the H3 level gradually increased (reflecting increased chromatin accessibility) at Group 1 sites (Supplementary Fig. S4C). When we performed the same analysis using the CTCF ChIP-seq peaks of –Aux and +Aux cells (“+Aux (6 h), withdraw 0 h”), we observed the same rescue patterns for CTCF and H3 (i.e., a gradual decrease and increase in the levels of CTCF and H3, respectively) upon temporal restoration of CHD4 at Group 1* sites (Fig. 4K). To quantitatively assess these changes, we calculated the average intensities of CTCF and H3 at Group 1* sites for each time point (Figs. 4L and 4M). Importantly, we found that the H3 level increased between 0.5-3 h of auxin withdrawal (Fig. 4M, red arrow) and the CTCF level thereafter decreased between 3-12 h of auxin withdrawal (Fig. 4L, red arrow). These results were further confirmed when we examined individual loci of Group 1* sites (Fig. 4N, Supplementary Fig. S4D). Thus, CHD4 restoration first rescues chromatin accessibility (open to closed chromatin state), then rescues CTCF binding. Collectively, these results enabled us to determine the order of events and conclude that CHD4 directly regulates chromatin accessibility at Group 1 sites to conceal putative CTCF motifs and thereby prevent aberrant CTCF binding.
We next questioned why CHD4 depletion majorly affects chromatin accessibility at Group 1 sites but has only modest effects at Group 2 sites, even though it binds to both types of site (Supplementary Fig. S3E). The most plausible explanation would be that there is a between-site-type difference in the presence of various factors. Indeed, such differences were seen among chromatin remodelers (Supplementary Fig. S3E), mediators, histone variants, histone modifications (Supplementary Fig. S3H), NuRD complexes, architectural proteins, and transcription factors (Supplementary Fig. S3J). Such factors, particularly the chromatin remodelers (CHD1 and brahma-related gene-1 [BRG1]), may compensate for the loss of CHD4 at Group 2 sites. At Group 1 sites, in contrast, these factors are sparse, while CHD4, ChAHP complex, and SUV39H1/2 are abundant. Together, these differences may explain why CHD4 depletion primarily affects the chromatin states of putative CTCF motifs at Group 1 sites.
Alternatively, differences in the chromatin state between Group 1 and 2 sites may explain the differential effects of CHD4 depletion. As described above, Group 1 sites represent heterochromatic regions, while Group 2 sites represent relatively euchromatic regions (Figs. 3C and 3F, Supplementary Fig. S3H) where RNAs are abundant. Since CHD4 is an RNA-binding protein (He et al., 2016; Hendrickson et al., 2016), we hypothesized that RNAs might inhibit the catalytic activity of CHD4 (given that CHD4 functions majorly at RNA-depleted heterochromatic Group 1 sites in wild-type mESCs), which could explain why CHD4 depletion primarily affects Group 1 sites. We first confirmed that CHD4 antibodies could be used for immunoprecipitation (Supplementary Fig. S5A), and then performed RIP. We found that CHD4 bound the
To investigate the importance of the RNA-binding ability of CHD4 in the regulation of CTCF binding, we first sought to identify the specific RNA-binding domain(s) of CHD4. Many RNA-binding proteins contain IDRs that can bind to RNAs (Lin et al., 2016; Molliex et al., 2015). Here, we used MobiDB (Piovesan et al., 2018; 2021) to determine the RNA-binding IDRs of CHD4. Our MobiDB analysis predicted four IDRs (IDR 1-4) within
Having defined the epigenetic features of H3K9me3-enriched heterochromatic aberrant CTCF-binding sites (Group 1) and determined the role of CHD4 at these sites, we next sought to elucidate the genomic features of Group 1 sites by examining the distribution of genomic regions via genome annotation. Notably, we found that SINEs (particularly B2 SINEs), which are well-known retrotransposons (McClintock, 1950), were enriched specifically at Group 1 sites (Fig. 6A); this is consistent with previous reports that H3K9me3 suppresses SINEs (Martens et al., 2005; Varshney et al., 2015), which harbor CTCF motifs (Bourque et al., 2008; Schmidt et al., 2012). Since CHD4 depletion caused the loss of nucleosomes/core histones (Figs. 3 and 4, Supplementary Figs. S3 and S4), resulting in the reduction of H3K9me3 levels (Fig. 3F) at Group 1 sites, we next investigated whether this affected the transcription of B2 SINE by performing mRNA-seq, total RNA-seq, and global run-on sequencing (GRO-seq). Surprisingly, CHD4 depletion elevated the signals for all three assays at Group 1 sites (Fig. 6B, Supplementary Fig. S6A), where B2 SINEs are enriched (Fig. 6A). Our GRO-seq analysis further revealed that the number of
To confirm this, we determined the expression levels of various retrotransposons, including long interspersed nuclear elements (LINEs), endogenous retroviruses (ERVs), and SINEs, using RT-qPCR. Consistently, the expression level of B2 SINE, a major mouse SINE, was dramatically up-regulated in Chd4KD cells, whereas the other retrotransposons (e.g., LINE-1 and IAP) were not significantly altered (Fig. 6C). B2 SINE transcripts are generally transcribed by RNA polymerase III (RNAPIII), but some B2 SINEs are located within introns and may be affected by RNA polymerase II (RNAPII). To exclude the possible relevance of RNAPII-affected B2 SINES, we applied the RNAPII inhibitor, α-amanitin, and determined the expression of B2 SINEs in mESCs. Consistent with the above-described results, B2 SINEs were specifically up-regulated by
We next investigated the role of CHD4 depletion-induced de-repressed B2 SINE in mESCs. As up-regulated B2 SINE RNA has been reported to directly bind RNAPII and inhibit transcription (Allen et al., 2004; Espinoza et al., 2004), we examined whether CHD4 depletion-triggered de-repressed B2 SINE RNA could do the same. We first performed RIP and confirmed that de-repressed B2 SINE transcripts have a binding preference for RNAPII in Chd4KD cells (Fig. 6E). We then used genomic regions enrichment of annotations tool (GREAT) analysis (McLean et al., 2010) to identify genes located in the vicinity of de-repressed B2 SINEs. Interestingly, our results revealed that genes important for stem cell maintenance (pluripotent genes) were located near de-repressed B2 SINEs (Fig. 6F) and Group 1 CTCF-binding sites (Supplementary Fig. S6E). To investigate whether de-repressed B2 SINE transcripts hinder the recruitment of RNAPII to the transcription start sites (TSSs) of genes involved in stem cell maintenance, we analyzed our total RNA-seq data and publicly available RNAPII ChIP-exo data (GSE64825, see also Supplementary Table S4). For a detailed analysis, we separated the total RNA-seq data into those for the –/+ strands and confirmed the presence of de-repressed B2 SINEs in Chd4KD cells (Fig. 6G). Notably, Chd4KD cells exhibited a significant reduction of RNAPII recruitment at TSSs when abundant de-repressed B2 SINEs were located nearby (Fig. 6G), whereas no reduction of RNAPII recruitment was observed at TSSs without a nearby de-repressed B2 SINE (Supplementary Fig. S6F). The observed changes in RNAPII recruitment were not derived from any change in the expression levels of the RNAPII subunits (Supplementary Fig. S6G). To determine whether this reduction of RNAPII recruitment at TSSs could dysregulate the expression levels of stem cell maintenance-related genes, we analyzed total RNA-seq and mRNA-seq data. Notably, we observed that genes with abundantly de-repressed B2 SINEs, which exhibited reduced RNAPII recruitment at their TSSs (Fig. 6G), were down-regulated (Supplementary Fig. S6H, red) compared to genes that lacked a nearby de-repressed B2 SINE (Supplementary Fig. S6H, gray). Taken together, our data indicate that
Here, we explored the changes in the 3D chromatin organizations of mESCs upon CHD4 depletion using
Since the genome is packaged into 3D chromatin organizations and most of the biological processes, including transcription, occur in this context, it is important to investigate the 3D organization of the genome. Previous studies characterized the 3D chromatin organization in terms of compartments (Lieberman-Aiden et al., 2009; Simonis et al., 2006), TADs (Dixon et al., 2012; Nora et al., 2012), contact/loop domains (Rao et al., 2014; 2017), and insulated neighborhoods (Dowen et al., 2014; Ji et al., 2016). Recent reports have mainly addressed the importance of CTCF and cohesin in 3D genome topology by observing the disruption of 3D chromatin structures after CRISPR/Cas9 system-mediated deletion/orientation change of CTCF-binding sites (de Wit et al., 2015; Guo et al., 2015; Sanborn et al., 2015), deletion of cohesion (Luppino et al., 2020; Rao et al., 2017; Szabo et al., 2020), and deletion of CTCF (Luppino et al., 2020; Nora et al., 2017; Szabo et al., 2020). These studies mainly focused on the properties and formation of the 3D chromatin organization and the key factors (e.g., CTCF and cohesin) that compose its structures. However, intriguing questions remained, such as: (1) How are TADs established or maintained? (2) How are aberrant CTCF binding and associated Hi-C interactions at TAD interiors prevented? (3) How is CTCF recruitment at the TAD borders regulated? (4) Are chromatin remodelers involved in CTCF recruitment? (5) What is the role of chromatin remodelers in 3D chromatin organizations? We believe that our present work provides clues to these questions by elucidating a CHD4-mediated mechanism that conceals aberrant CTCF-binding sites at the TAD interiors by regulating chromatin accessibility, thereby preventing inappropriate CTCF binding and safeguarding both proper CTCF recruitment and 3D chromatin organization in mESCs.
It is interesting to set our results in the context of evolutionary aspects. A previous report compared Hi-C data from four mammals and identified two types of CTCF sites: (1) conserved CTCF sites, which are conserved across species and enriched at strong TAD borders; and (2) divergent CTCF sites, which lack conservation across species, are newly evolved, and localize at TAD interiors (Vietri Rudan et al., 2015). The authors also demonstrated that the evolutionary dynamics of intra-TAD interactions, which reflect the actions of divergent CTCF sites, could play a critical role in enhancer-promoter interactions within TADs (Vietri Rudan et al., 2015). In support of this, a recent review suggested that 3D chromatin organization may have significantly impacted the evolution of transcriptional regulation by increasing regulatory complexity (the combination of enhancer and promoter contacts) (Acemel et al., 2017). Taken together, these findings indicate that the evolution of gene regulation is strongly correlated with 3D chromatin organization, especially intra-TAD interactions involving divergent CTCF sites. Interestingly, the divergent CTCF sites that primarily reside at TAD interiors have been spread through genomes by retrotransposition of B2 SINE transposable elements, which contain CTCF motifs in their consensus sequences (Bourque et al., 2008; Schmidt et al., 2012). Our present data illustrate that the chromatin remodeler, CHD4, conceals the B2 SINE-originated divergent/inappropriate CTCF-binding sites by regulating chromatin accessibility, thereby preventing three events: (1) aberrant CTCF recruitment; (2) aberrant CTCF binding-associated changes in local TADs (separating and weakened TADs); and (3) overexpression of B2 SINE transcripts in mESCs.
The previous studies on chromatin remodeler mainly focused on examining ATP catalysis-based nucleosome sliding
Among the studies that did not focus specifically on transcribed regions, several reports showed that chromatin remodelers are closely associated with CTCF. For example, RNAi-mediated knockdown of
Based on our findings, we propose a mechanism wherein the ATP-dependent chromatin remodeler, CHD4, maintains the lower-level chromatin structure (including chromatin accessibility and nucleosome positioning/occupancy) and prevents the aberrant recruitment of CTCF, thereby safeguarding appropriate CTCF binding and associated higher-order TAD organization by suppressing aberrant interactions at TAD interiors (Fig. 7). Our results reveal how the primary structure of chromatin could facilitate higher-order 3D chromatin organization in mESCs. Collectively, our findings provide a novel perspective on the functional role of chromatin remodelers in mESCs, thereby adding an extra layer of information in stem cell biology and greatly facilitating our understanding of how chromatin remodelers function in embryonic stem cells.
We thank Taemook Kim and Prof. Keunsoo Kang for assistance with analyzing the NGS data; Kwang-Beom Hyun and Prof. Jaehoon Kim for purification of CHD4 protein using the Bac-to-Bac Baculovirus expression system (Full-length cDNA of mouse
S.H., H.L., and D.L. designed the concepts and experiments. S.H. analyzed most of the NGS data. S.H. and H.L. performed most of the experiments. S.H. performed the Hi-C experiments for –Aux and +Aux cells. A.J.L. performed the Hi-C experiments for Control and Chd4KD cells under the guidance of I.J. S.K.K. performed the GRO-seq experiments under the guidance of T.K.K. S.H. and H.L. wrote the manuscript under the mentorship and technical supervision of G.Y.K. and D.L.
The authors have no potential conflicts of interest to disclose.
Mol. Cells 2021; 44(11): 805-829
Published online November 30, 2021 https://doi.org/10.14348/molcells.2021.0224
Copyright © The Korean Society for Molecular and Cellular Biology.
Sungwook Han1,4 , Hosuk Lee1,2,4
, Andrew J. Lee1
, Seung-Kyoon Kim3
, Inkyung Jung1
, Gou Young Koh2
, Tae-Kyung Kim3
, and Daeyoup Lee1,*
1Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon 34141, Korea, 2Center for Vascular Research, Institute for Basic Sciences, Daejeon 34141, Korea, 3Department of Life Sciences, Pohang University of Science and Technology, Pohang 37673, Korea, 4These authors contributed equally to this work.
Correspondence to:daeyoup@kaist.ac.kr
This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/.
CCCTC-binding factor (CTCF) critically contributes to 3D chromatin organization by determining topologically associated domain (TAD) borders. Although CTCF primarily binds at TAD borders, there also exist putative CTCF-binding sites within TADs, which are spread throughout the genome by retrotransposition. However, the detailed mechanism responsible for masking the putative CTCF-binding sites remains largely elusive. Here, we show that the ATP-dependent chromatin remodeler, chromodomain helicase DNA-binding 4 (CHD4), regulates chromatin accessibility to conceal aberrant CTCF-binding sites embedded in H3K9me3-enriched heterochromatic B2 short interspersed nuclear elements (SINEs) in mouse embryonic stem cells (mESCs). Upon CHD4 depletion, these aberrant CTCF-binding sites become accessible and aberrant CTCF recruitment occurs within TADs, resulting in disorganization of local TADs. RNA-binding intrinsically disordered domains (IDRs) of CHD4 are required to prevent this aberrant CTCF binding, and CHD4 is critical for the repression of B2 SINE transcripts. These results collectively reveal that a CHD4-mediated mechanism ensures appropriate CTCF binding and associated TAD organization in mESCs.
Keywords: 3D chromatin organization, ATP-dependent chromatin remodeler, B2 short interspersed nuclear elements, chromodomain-helicase-DNA binding protein 4, intrinsically disordered domains, topologically associated domains
The mammalian genome is spatially arranged into three-dimensional (3D) chromatin organizations that allow for proper gene regulation; these include chromosome territories (Cremer and Cremer, 2010), compartments (the A and B compartments are enriched for active/open chromatin and inactive/closed chromatin, respectively) (Lieberman-Aiden et al., 2009; Simonis et al., 2006), topologically associated domains (TADs) (Dixon et al., 2012; Nora et al., 2012), contact/loop domains (Rao et al., 2014; 2017), and insulated neighborhoods (Dowen et al., 2014; Ji et al., 2016). Previous studies found that the insulator protein, CCCTC-binding factor (CTCF), coexists with the cohesin complex in chromatin (Wendt et al., 2008) and localizes to the anchors/borders of chromatin loops (Dixon et al., 2012; Nora et al., 2012; Rao et al., 2014; Splinter et al., 2006). Furthermore, removal of the cohesin loader, Nipbl (Schwarzer et al., 2017), loss of cohesin itself (Rao et al., 2017), deletion of CTCF-binding sites (de Wit et al., 2015; Guo et al., 2015; Sanborn et al., 2015), or loss of CTCF itself (Nora et al., 2017) have all been shown to interfere with the maintenance of 3D chromatin organizations, suggesting that these factors all play critical functions. Notably, recent loss-of-function studies using single-cell analysis performed in mammalian cells yielded two interesting observations (Luppino et al., 2020; Szabo et al., 2020). First, the loss of CTCF was found to result in the loss of TAD borders and the subsequent merging of two previously insulated TADs, but did not significantly affect intra-TAD interactions. Second, and in contrast, the loss of the cohesin subunit, RAD21, significantly disrupted intra-TAD interactions but did not significantly affect TAD borders. These findings demonstrated that 3D chromatin organizations are regulated by the combined action of CTCF and cohesin via distinct mechanisms: Cohesin generates intra-TAD contacts, whereas CTCF prevents inter-TAD contacts by determining the TAD borders.
Importantly, in addition to the CTCF-binding sites (or motifs) that exhibit convergent orientation at the border of 3D domains (Rao et al., 2014), there also exist putative CTCF-binding sites at the TAD interiors, which are spread throughout the genome by retrotransposition of short interspersed nuclear element (SINE) retrotransposons (Bourque et al., 2008; Schmidt et al., 2012). From an evolutionary perspective, conserved CTCF-binding sites are enriched at the borders of conserved 3D domains, whereas CTCF-binding sites that diverge between species (divergent CTCF-binding sites) drive local 3D structural changes at the domain interiors (Vietri Rudan et al., 2015). Thus, it is critical to mask the putative CTCF-binding sites that lie within the TADs; otherwise, similar to the divergent CTCF-binding sites, these sites could cause aberrant CTCF recruitment and disrupt the normal 3D chromatin organizations. Although most studies to date have focused on the function of CTCF at the TAD borders, the prevention of inappropriate CTCF binding at TAD interiors is equally essential for the maintenance of 3D chromatin organizations. However, the detailed mechanism responsible for masking these putative CTCF-binding sites remains poorly understood.
To address this issue, we focused on the members of the chromodomain helicase DNA-binding protein (CHD) family, especially the well-known ATP-dependent chromatin remodeler, CHD4. The CHD family is evolutionarily conserved (Hall and Georgel, 2007), and its members contribute to diverse cellular processes by assembling nucleosomes (de Dieuleveult et al., 2016; Gaspar-Maia et al., 2009; Micucci et al., 2015; Skene et al., 2014). Since the majority of CTCF-binding sites are located at nucleosome-free regions (Cuddapah et al., 2009), it has been proposed that chromatin remodelers could govern CTCF recruitment by regulating nucleosome occupancy at certain CTCF-binding sites (Ong and Corces, 2014). Furthermore, the ability of chromatin remodelers to regulate access to DNA through nucleosome positioning may contribute to obscuring the putative CTCF-binding sites within TADs.
Here, we show that CHD4 regulates chromatin accessibility to conceal aberrant CTCF-binding sites within TADs, thereby preventing aberrant CTCF recruitment and securing the 3D chromatin organizations in mESCs. We used various next-generation sequencing (NGS) assays, including
E14Tg2a mESCs were maintained under feeder-free conditions. Briefly, cells were cultured on gelatin-coated cell culture dishes in an mESC culture medium consisting of Glasgow’s minimum essential medium (GMEM) containing 10% knockout serum replacement, 1% non-essential amino acids, 1% sodium pyruvate, 0.1 mM β-mercaptoethanol (all from Gibco, USA), 1% fetal bovine serum, 0.5% antibiotic-antimycotic (both from Hyclone, USA) and 1,000 units/ml LIF (ESG1106; Millipore, USA). mESCs were maintained at 37°C with 5% CO2 in humidified air.
The siRNAs against
Total RNAs were purified from mESCs using the TRIzol reagent (Invitrogen, USA) according to the manufacturer’s protocol. Briefly, mESCs cultured in 6-well plates were harvested and homogenized with 1 ml of TRIzol reagent. Chloroform (200 μl/sample) was added, and the samples were mixed vigorously by hand for 15 s and incubated at 25°C for 2 min. The mixtures were centrifuged at 12,000 rpm for 15 min at 4°C, and 500 μl of each aqueous phase was transferred to a new Eppendorf tube and mixed with the same volume of isopropanol. The mixtures were incubated at 25°C for 10 min to precipitate total RNAs. The samples were centrifuged at 12,000 rpm for 10 min at 4°C, washed with 75% ethanol, and centrifuged again at 10,000 rpm for 5 min at 4°C. The RNA pellets were dried and dissolved in RNase-free water, and 1 μg of DNase-treated total RNA was applied for cDNA synthesis using an Improm-II Reverse transcription system (A3802; Promega, USA) according to the manufacturer’s protocol. For analysis of retrotransposon expression, the cDNA synthesis step was primed with random hexamers.
The generated cDNAs were amplified using a BioFact Real-time PCR kit (BIOFACT, Korea) according to the manufacturer’s manual. The primer sequences used for RT-qPCR are presented in Supplementary Table S2. Briefly, 20-μl reactions containing 1× EvaGreen, 10 mM tetraethylammonium chloride, and 10 pmol of primers were analyzed with a CFX96 system (Bio-Rad, USA) under the following conditions: 95°C for 12 min (initial melting), followed by 40 cycles of 95°C for 20 s (denaturation), 57°C for 30 s (annealing), and 72°C for 30 s (extension). The relative expression levels of Chd family members and retrotransposons were quantified with respect to those of β-actin and the 28S rRNA, respectively.
We employed the auxin-inducible degron (AID) system to the
Cleavage under targets and release using nuclease (CUT&RUN) assays were performed as previously described (Meers et al., 2019; Skene and Henikoff, 2017), with minor modification. Briefly, 4 million mESCs were harvested and washed thrice with 1.5 ml wash buffer (20 mM HEPES, pH 7.5, 150 mM NaCl, 0.5 mM spermidine). Cells were bound to activated concanavalin A-coated magnetic beads (at 25°C for 10 min on a nutator), then permeabilized with antibody buffer (wash buffer containing 0.05% digitonin and 4 mM EDTA). The bead-cell slurry was incubated with 3 μl of the appropriate antibody (see below) in a 150 μl volume at 25°C for 2 h on a nutator. After two washes in 1 ml Dig-wash buffer (wash buffer containing 0.05% digitonin), the beads were resuspended in 150 μl pAG/MNase and incubated at 4°C for 1 h on a nutator. After two washes in 1 ml Dig-wash buffer, the beads were gently vortexed with 100 μl Dig-wash buffer. The tubes were chilled to 0°C for 5 min and ice-cold 2.2 mM CaCl2 was added with gentle vortexing. The tubes were immediately placed on ice and incubated at 4°C for 1 h on a nutator; thereafter, 100 μl 2xSTOP buffer (340 mM NaCl, 20 mM EDTA, 4 mM EGTA, 0.05% digitonin, 0.1 mg/ml RNase A, 50 μg/ml glycogen) was added and the tubes were incubated at 37°C for 30 min on a nutator. The beads were secured using a magnet stand, and the liquid was removed to a fresh tube, combined with 2 μl 10% SDS and 2.5 μl proteinase K (20 mg/ml), and incubated at 50°C for 1 h. DNA was extracted using phenol chloroform as described at https://www.protocols.io/view/cut-amp-run-targeted-in-situ-genome-wide-profiling-zcpf2vn. CUT&RUN libraries were prepared using an Accel-NGS 2S Plus DNA Library Kit (21024; Swift BioSciences, USA), according to the manufacturer’s guidelines. The libraries were then sequenced using an Illumina Novaseq 6000 platform (Illumina, USA). The libraries were generated from two sets of biological replicates.
For Control and Chd4KD cells,
ChIP assays were performed as previously described (Kim et al., 2012), with minor modification. Briefly, mESCs were washed twice with phosphate-buffered saline (PBS) and incubated with 1% formaldehyde for 15 min at 25°C. This crosslinking was quenched with 125 mM glycine at 25°C for 5 min, and the cells were harvested with cold PBS and suspended in SDS lysis buffer (50 mM Tris-HCl, pH 8.0, 1% SDS, 10 mM EDTA). The chromatin was sheared to mono- or dinucleosome sizes using a focused ultrasonicator (S220; Covaris, USA). The sonicates were incubated overnight with the relevant antibodies (see below) and Protein A, G sepharose (17-1279-03 and 17-0618-05; GE Healthcare, USA) at 4°C on a nutator. The immune complexes were washed for 10 min each with the following wash buffers: low-salt wash buffer (20 mM Tris-HCl, pH 8.0, 150 mM NaCl, 0.1% SDS, 1% Triton X-100, 2 mM EDTA), high-salt wash buffer (20 mM Tris-HCl, pH 8.0, 500 mM NaCl, 0.1% SDS, 1% Triton X-100, 2 mM EDTA), and LiCl wash buffer (10 mM Tris, pH 8.0, 250 mM LiCl, 1% NP-40, 1% sodium deoxycholate, 1 mM EDTA). The immune complexes were further washed twice with TE buffer (10 mM Tris-HCl, pH 8.0, 1 mM EDTA). Finally, the immune complexes were eluted with elution buffer (1% SDS, 0.1 M NaHCO3) and reverse-crosslinked overnight at 68°C. The immunoprecipitated DNA was treated with proteinase K and RNase A and recovered by phenol-chloroform-isoamyl alcohol precipitation. ChIP-seq libraries were prepared using a NEXTflex ChIP-seq kit (5143-02; Bioo Scientific, USA) according to the manufacturer’s guidelines. The libraries were then sequenced using an Illumina HiSeq 2500 platform. The libraries were generated from two sets of biological replicates.
Antibodies against CHD4 (ab70469; Abcam [UK] and in-house generated), CTCF (ab37477; Abcam), α-tubulin (2144S; Cell Signaling Technology, USA), V5 (R960-25; Thermo Fisher Scientific), mCherry2 (M11217; Invitrogen), β-actin (SC-47778; Santa Cruz Biotechnology), and FLAG (F7425; Sigma) were used for immunoblotting. Antibodies against CHD4 (ab70469; Abcam and in-house generated) and IgG (12-371; Millipore) were used for immunoprecipitation. Antibodies against CTCF (07-729; Millipore), RAD21 (ab992; Abcam), H3 (ab1791; Abcam), H3K9me3 (ab8898; Abcam), H3K4me1 (in-house generated), H3K4me3 (in-house generated), H3K27ac (ab4729; Abcam), and H3K27me3 (07-449; Millipore) were used for ChIP-seq. Antibodies against CHD4 (ab70469; Abcam and in-house generated), RAD21 (ab992; Abcam), and IgG (12-371; Millipore and in-house generated) were used for CUT&RUN.
E14Tg2a mESCs were transfected with siRNAs against
ATAC-seq libraries were prepared as previously described (Buenrostro et al., 2013; 2015; Choi et al., 2020), with minor modification. Briefly, 50,000 mESCs were harvested, washed with cold PBS, lysed with cold lysis buffer, and immediately centrifuged. The nuclear pellets were resuspended in 25 μl of 2× tagmentation reaction buffer (10 mM Tris, pH 8.0, 5 mM MgCl2, 10% dimethylformamide), 23 μl of nuclease-free water, and 2 μl of Tn5 transposase (in-house generated), and incubated at 37°C for 30 min. The samples were then immediately purified using a QIAquick PCR purification kit (28106; Qiagen, Germany). The libraries were pre-enriched for five cycles using the KAPA HiFi Hotstart ready mix (KK2601; Kapa Biosystems, USA), and the threshold cycle (Ct) was monitored using qPCR to determine the additional enrichment cycles, which were then applied. The final libraries were purified again with a QIAquick PCR purification kit, and sequenced using an Illumina HiSeq 2500 platform. The libraries were generated from two sets of biological replicates.
Cells cultured in 60Ф culture dishes were washed with PBS and incubated with modified PBS (150 mM NaCl, 0.02% Tween-20, 0.02% Triton X-100) at 25°C for 5 min. The cells were then fixed with 1% formaldehyde at 25°C for 10 min, quenched with 125 mM glycine at 25°C for 5 min, washed, and suspended with lysis buffer (18% Ficoll-400, 10 mM KH2PO4, 10 mM K2HPO4, 1 mM MgCl2, 250 nM EGTA). The lysates were centrifuged at 12,000 rpm for 40 min at 4°C, and the pelleted chromatin was resuspended in A buffer (10 mM Tris-Cl, pH 7.4, 150 mM NaCl, 5 mM KCl, 1 mM EDTA). CaCl2 (5 mM) was added to the samples, which were immediately digested with MNase (M0247S; NEB) at 37°C for 60 min. The reaction was inactivated with 1% SDS and 500 mM EDTA, and reverse-crosslinking was performed overnight at 68°C. Proteinase K and RNase were added sequentially to the samples, and DNAs were recovered using phenol-chloroform-isoamyl alcohol extraction. The MNase-digested DNAs were separated on a 1.5% agarose gel, and mono-nucleosomal DNAs were extracted using a QIAEX II gel extraction kit (20021; Qiagen) according to the manufacturer’s manual. The purified mono-nucleosomal DNAs were subjected to sequencing using a TruSeq DNA library prep kit (FC-121-2001; Illumina). The final libraries were sequenced using an Illumina HiSeq 2500 platform. The libraries were generated from two sets of biological replicates.
DNase-treated total RNAs were purified from mESCs and used for total RNA-seq. Total RNA-seq libraries were prepared using a TruSeq stranded RNA kit (RS-122-2301; Illumina) according to the manufacturer’s manual. For mRNA-seq library preparation, mRNAs were isolated from total RNA using a Magnetic mRNA isolation kit (S1550S; NEB), and libraries were prepared using a NEXTflex Rapid directional RNA-seq kit (5138-08; Bioo Scientific). The libraries were sequenced using an Illumina HiSeq 2500 platform. The libraries were generated from two sets of biological replicates.
E14Tg2a mESCs were harvested, washed with cold PBS, and lysed with EBC buffer (50 mM Tris-HCl, pH 8.0, 150 mM NaCl, 0.5% NP-40, 1 mM PMSF). The lysates were incubated for 1 h with the indicated antibodies and Protein A, G sepharose (17-1279-03 and 17-0618-05; GE Healthcare) at 4°C with agitation. The immune complexes were washed three times with EBC buffer (50 mM Tris-HCl, pH 8.0, 150 mM NaCl, 0.5% NP-40, 1 mM PMSF). The lysates were boiled for 5 min with SDS sample buffer, resolved by SDS-PAGE, and subjected to immunoblotting.
RNA immunoprecipitation (RIP) assays were conducted as previously described (Heo and Sung, 2011; Kim et al., 2014), with minor modification. Briefly, cultured mESCs were crosslinked with 1% formaldehyde for 15 min at 25°C and quenched with 125 mM glycine. The cells were harvested, washed with cold PBS, and then lysed with RIPA buffer (50 mM Tris-Cl, pH 8.0, 150 mM NaCl, 0.05% SDS, 1 mM EDTA, 1% NP-40, and 0.05% sodium deoxycholate) containing an RNase inhibitor (M007L; Enzynomics, Korea). The samples were sonicated, treated with DNase I (18068-015; Invitrogen) for 10 min at 37°C, and cleared by centrifugation. The extracts were bound with anti-Chd4 (ab70469; Abcam) for 2 h at 4°C in the presence of RNase inhibitor. The immune complexes were further incubated with Dynabeads Protein A and G (10001D, 10004D; Invitrogen) for 1 h at 4°C, and then washed for 10 min each with the following wash buffers: low-salt wash buffer (20 mM Tris-HCl, pH 8.0, 150 mM NaCl, 0.1% SDS, 1% Triton X-100, 2 mM EDTA), high-salt wash buffer (20 mM Tris-HCl, pH 8.0, 500 mM NaCl, 0.1% SDS, 1% Triton X-100, 2 mM EDTA), and LiCl wash buffer (10 mM Tris, pH 8.0, 250 mM LiCl, 1% NP-40, 1% sodium deoxycholate, 1 mM EDTA). The immune complexes were further washed twice with TE buffer (10 mM Tris-HCl, pH 8.0, 1 mM EDTA), eluted, and reverse-crosslinked with 400 mM NaCl for 2 h at 65°C. RNAs were purified from the eluates using an RNA Clean & ConcentratorTM-5 kit (R1015; Zymo Research, USA) according to the manufacturer’s manual, and then incubated again with DNase I (18068-015; Invitrogen) for 30 min at 37°C. Finally, the DNase I was inactivated and removed using a DNA-freeTM kit (AM1906; Invitrogen) according to the manufacturer’s manual.
The plasmids used for
EMSA was carried out as previously described (Kim et al., 2014), with minor modifications. Briefly, the indicated amount of purified Chd4 protein was incubated with 5 nM of biotinylated RNAs in binding buffer (50 mM Tris-Cl, pH 7.5, 100 mM NaCl, 10 mM β-mercaptoethanol, 5% glycerol) for 40 min at 25°C. Bound protein-RNA complexes were resolved on 5% native polyacrylamide mini gels at 70V and 4°C for 2 h. The complexes were transferred to an Amersham Hybond-N+ nylon membrane (RPN303B; GE Healthcare), UV-crosslinked, and visualized using a Chemiluminescent Nucleic Acid Detection Module kit (89880; Thermo Fisher Scientific) according to the manufacturer’s manual.
Nucleosome reconstitution was performed as previously described (Owen-Hughes et al., 1999). Briefly, two types (lateral and central forms) of a 147-bp DNA fragment were amplified from pGEM-3z/601, and
For the nucleosome-sliding assay, reconstituted nucleosomes (75 ng, 0.36 pmol) were incubated with purified Chd4 (50 ng, 0.23 pmol), varying concentrations of purified RNAs (0 pmol, 0.1 pmol, 0.2 pmol, 0.5 pmol, and 1 pmol), and ATP (0.1 pmol) in sliding buffer (50 mM KCL, 20 mM HEPES, pH 7.9, 2 mM DTT, 0.5 mM PMSF, 0.05% NP40, 10% glycerol, 100 μg/ml BSA, 10 mM MgCl2) for 45 min at 30°C. The reaction was stopped by the addition of 500 ng of plasmid (size > 10 kb) and 0.1 pmol of ATP-γ-S tetralithium salt (10102342001-Roche; Sigma) followed by incubation for 30 min at 30°C. The samples were resolved on 5% native polyacrylamide mini gels at 65 V and 4°C for 2.5 h.
Circularized GRO-seq was performed as previously described (Danko et al., 2013; Lam et al., 2013). Briefly, around 10 million nuclei per sample were purified and used for global run-on and base hydrolysis, which were carried out as previously described (Core et al., 2008). BrU-labeled nascent RNAs were immunoprecipitated twice with anti-BrdU antibody-conjugated agarose (SC-32323ac; Santa Cruz Biotechnology). Between the two immunoprecipitations, the BrU-precipitated RNAs were subjected to polyA tailing with poly(A)-polymerase (P7460L; Enzymatics, USA). The RNAs were subjected to first-strand cDNA synthesis using Superscript III Reverse Transcriptase (18080-044; Invitrogen) and the oNTI223 RT primer (IDT). Excess RT primers were removed with exonuclease I (M0293S; NEB), and the cDNAs were size-selected (120-220 nt) in a 6% polyacrylamide TBE-urea gel. The selected cDNAs were circularized using CircLigase (CL4111K; Epicentre, USA) and re-linearized with ApeI (M0282S; NEB). The single-stranded DNA templates were amplified using Phusion High-Fidelity DNA Polymerase (M0530S; NEB) and Illumina TruSeq small-RNA sample barcoded primers. The generated PCR products were isolated and size-selected (190-290 bp) by electrophoresis on a 6% native polyacrylamide TBE gel. The final libraries were sequenced on an Illumina HiSeq 2500 platform. The libraries were generated from two sets of biological replicates.
For Hi-C analysis, the
For the MNase-seq, ATAC-seq, ChIP-seq, CUT&RUN, and GRO-seq analyses, the raw reads were aligned to the mouse genome (mm10) using Bowtie2 (ver. 2.2.9) with default parameters (Langmead and Salzberg, 2012). For the total RNA-seq and mRNA-seq analyses, the raw reads were aligned to the mouse genome (mm10) using STAR (ver. 2.5.2a) with default parameters (Dobin et al., 2013).
Generally, we used MACS2 (Zhang et al., 2008) to convert the aligned BAM files into bedGraph files, and normalized the data with respect to the total read counts. Then, we used bedGraphToBigWig (Kent et al., 2010) to convert the bedGraph files into bigWig files. The bigWig files were used as input files for bwtool (Pohl and Beato, 2014) (matrix and aggregate option) to quantify the intensity (e.g., using heatmaps or average line plots) of the relevant sequencing data. Further analyses were performed using in-house scripts, MACS2 (Zhang et al., 2008), and HOMER (Heinz et al., 2010). All of our raw data (fastq files) were confirmed to be of good quality using FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/).
For our CUT&RUN and ChIP-seq analyses, we used MACS2 (callpeak option,
For our mRNA-seq and total RNA-seq analyses, we used Cufflinks (Cuffdiff option, fr-firststrand) to assess the expression levels. For our analysis of SINEs, we built a STAR-index exclusively containing the previously identified 1.5 million SINE copies (RepeatMasker in UCSC table browser), and performed STAR and Cufflinks analyses as described above.
For our GRO-seq analysis, we used HOMER (findPeaks option, -style groseq) to examine the number of
Box plots, volcano plots, and other plots were drawn with R (ggplot2) (Wickham, 2009) and heatmaps were drawn with Java TreeView (Saldanha, 2004). The examples of our genome-wide data were visualized using the Integrative Genomics Viewer (IGV) (Robinson et al., 2011).
The publicly released ChIP-seq data were downloaded from NCBI GEO DataSets and the ENCODE Consortium. These data were downloaded as sra or fastq formats, and the sra files were converted to fastq files using the SRA Toolkit (https://trace.ncbi.nlm.nih.gov/Traces/sra/sra.cgi?view=software). Thus, we analyzed the public datasets and our own results using the same methods. Details of the analyzed data are summarized in Supplementary Table S4.
We first determined the expression of
To elucidate the function of CHD4, we first identified CHD4-binding sites by performing CUT&RUN (Skene and Henikoff, 2017) and also by analyzing the public chromatin immunoprecipitation sequencing (ChIP-seq) data against CHD4 (GSE64825, GSE61188, and GSE27844, see also Supplementary Table S4). Then, we analyzed genomic features within the CHD4-binding sites (CUT&RUN and ChIP-seq peaks). Notably, we found that CTCF motifs were highly enriched within CHD4-binding sites relative to CHD1-binding sites and randomly selected sites (Supplementary Fig. S1B, Supplementary Table S5). Interestingly, we also observed that CHD4 localized to the borders of TADs (Fig. 1A, Supplementary Fig. S1C) and the anchors of insulated neighborhoods (Fig. 1B, Supplementary Fig. S1D), where CTCF is known to be localized (Figs. 1A and 1B) and play a boundary function (Dixon et al., 2012; Dowen et al., 2014; Luppino et al., 2020; Nora et al., 2012; Rao et al., 2014; Szabo et al., 2020). Furthermore, 34% of the CTCF ChIP-seq peaks coincided with the CHD4 CUT&RUN peaks (Fig. 1C), suggesting that CHD4 and CTCF resided near one another. To evaluate this possibility, we analyzed the localization of CHD4 near the CTCF ChIP-seq peaks. As expected, we found that CHD4 resided near CTCF (Figs. 1D and 1E). Intriguingly, although the CHD4-binding sites were enriched for CTCF motifs (Supplementary Fig. S1B, Supplementary Table S5), we observed that CHD4 was also present at locations lacking CTCF (Figs. 1C and 1E), suggesting that CHD4 may conceal CTCF motifs (Fig. 1F). We also observed that CTCF localized to exposed DNA sequences surrounded by well-positioned nucleosomes (Figs. 1D and 1E), consistent with a previous report (Cuddapah et al., 2009). The presence of CTCF at nucleosome-free regions flanking nucleosomes and CHD4 (Fig. 1D) may suggest that CHD4 could regulate nucleosome occupancy near CTCF-binding sites, and thereby control interactions between CTCF and DNA sequences harboring CTCF motifs. Taken together, these results strongly indicate that CHD4 and CTCF are closely linked in mESCs. Thus, we hypothesized that CHD4 might control CTCF-mediated 3D chromatin organization by regulating chromatin accessibility near CTCF-binding sites (Fig. 1F).
To test our hypothesis, we first designed and applied siRNAs against
To investigate whether CHD4 depletion disturbed the 3D chromatin organization, we performed
To investigate the role of CHD4 in local 3D chromatin organization, we performed ATA upon CHD4 depletion at four TADs obtained from each cell (Control, Chd4KD, –Aux, and +Aux cells). Notably, we detected genome-wide decreases of Hi-C interactions within the TADs of Control and –Aux cells upon CHD4 depletion (Figs. 2A and 2B, arrow). In contrast, Hi-C interactions within the TADs of Chd4KD and +Aux cells were largely unaltered by CHD4 loss (Figs. 2A and 2B). These results suggest that CHD4 maintains the local interactions within the original TADs of wild-type mESCs.
To address this further, we identified four types of differential TADs seen upon CHD4 depletion: We compared the changes in Hi-C interactions to identify weakened or strengthened TADs, and compared the changes in TAD positions to identify separating and merging TADs (Figs. 2C-2G). Weakened TADs were defined as original TADs of wild-type mESCs (Control or –Aux cells) whose Hi-C interactions were decreased upon CHD4 loss. Strengthened TADs were defined as TADs of CHD4-depleted cells (Chd4KD or +Aux cells) whose Hi-C interactions were increased upon CHD4 loss. Separating TADs were defined as original TADs of wild-type mESCs (Control or –Aux cells) that were split into two or more smaller TADs upon CHD4 loss. Finally, merging TADs were defined as TADs of CHD4-depleted cells (Chd4KD or +Aux cells) that contained two or more original TADs of wild-type mESCs (Control or –Aux cells). The ATA results indicated that Hi-C interactions showed genome-wide decreases at weakened TADs (Fig. 2D) and separating TADs (Fig. 2E), but genome-wide increases at the strengthened TADs (Fig. 2F) and merging TADs (Fig. 2G) upon CHD4 loss. The decreasing TADs (weakened and separating TADs) shared more TADs than the increasing TADs (strengthened and merging TADs) (Supplementary Fig. S2A), indicating that weakened and separating TADs were more closely associated. Interestingly, comparing the numbers of the four types of differential TAD seen upon CHD4 loss, revealed that decreasing TADs occurred more frequently than increasing TADs (Fig. 2H). This was consistent with our above-described observation that Hi-C interactions showed genome-wide decreases within the original TADs of wild-type mESCs (Figs. 2A and 2B, arrow). Thus, we focused on decreasing TADs (weakened and separating TADs).
To elucidate how CHD4 maintains local interactions within TADs, we performed CTCF ChIP-seq upon CHD4 loss. By comparing the CTCF intensity, we defined CTCF signals that were gained or lost upon CHD4 depletion (Fig. 2I). Next, we divided the decreasing TADs (weakened and separating TADs) into borders and interiors (Fig. 2J, top right). We observed that both weakened and separating TADs exhibited similar distribution patterns of gained/lost CTCF at the borders/interiors of TADs (Fig. 2J), suggesting that these two types of decreasing TAD may be fundamentally the same and/or formed similarly upon CHD4 loss. We did not detect any significant difference in the distribution of gained and lost CTCF at TAD borders, whereas gained CTCF was significantly enriched at TAD interiors compared to lost CTCF (Fig. 2J). These results indicate that gained CTCF may be responsible for decreasing interactions within weakened and separating TADs upon CHD4 loss.
To further support this notion, we comprehensively analyzed Hi-C interactions, TAD positions, insulations scores, and CTCF ChIP-seq at specific regions of the genome upon CHD4 loss. Consistent with our genome-wide ATA results (Figs. 2D and 2E), we found a notable decrease in Hi-C interactions within the weakened and separating TADs (Fig. 2K, Supplementary Figs. S2B and S2C, black arrow). Importantly, we observed that the presence of a gained CTCF (marked with *) greatly reduced the local insulation, thereby generating a novel TAD border (separating TAD, Fig. 2K, Supplementary Fig. S2B) and/or markedly decreasing the Hi-C interactions within TADs (weakened TAD, Supplementary Figs. S2B and S2C). In general, we observed the same types of decreasing TAD in the two cell models CHD4 loss (Chd4KD and +Aux cells); at some genomic loci, however, we observed a weakened TAD in Chd4KD cells but a separating TAD in +Aux cells (Supplementary Fig. S2B). This observation supports the idea that these two types of decreasing TAD are closely associated, and that the gained CTCF causes both types of decreasing TAD upon CHD4 depletion. When considering the splitting feature of separating TADs (Fig. 2C), one can speculate the reduced TAD sizes and an elevated number of chromatin loops (peaks) coupled with gained CTCF upon CHD4 loss. As expected, we observed reduced TAD sizes (Supplementary Fig. S2D) and an elevated number of total chromatin loops (peaks) and cell-type-specific peaks (Supplementary Figs. S2E and S2F) upon CHD4 depletion (Supplementary Figs. S2G and S2H). Importantly, the insulation scores at the gained CTCF sites were globally reduced upon CHD4 depletion (Fig. 2L), indicating that the CHD4 depletion-triggered gain of CTCF is directly responsible for generating TAD borders in a genome-wide manner. Furthermore, our APA results showed that Hi-C interactions were increased at the gained CTCF-associated peaks upon CHD4 loss (Supplementary Fig. S2I), supporting the direct relationship between gained CTCF and changes in Hi-C interactions. Collectively, our results strongly imply that the decreasing TADs (weakened and separating TADs) are directly associated with gained CTCF upon CHD4 loss. Therefore, these findings suggest that CHD4 regulates the local 3D chromatin organization by preventing the aberrant recruitment of CTCF to TAD interiors.
To investigate how CHD4 prevents aberrant CTCF binding, we analyzed the genome-wide changes in CTCF upon CHD4 depletion at CHD4 CUT&RUN peaks (or CHD4-binding sites) (Figs. 3A and 3B, Supplementary Figs. S3A and S3B). We classified the CHD4 CUT&RUN peaks into those that coincided with CTCF ChIP-seq peaks of wild-type mESCs (Control or –Aux cells) and those that did not coincide with these peaks. We did not detect any major change in CTCF upon CHD4 depletion at CHD4 peaks that coincided with CTCF peaks (Figs. 3A and 3B, Supplementary Figs. S3A and S3B), but observed aberrant gain of CTCF upon CHD4 depletion at CHD4 peaks that did not coincide with CTCF peaks (Figs. 3A and 3B, Supplementary Figs. S3A and S3B, arrow). Since CHD4 CUT&RUN peaks are highly enriched for CTCF motifs (Supplementary Fig. S1B, Supplementary Table S5), these results indicate that CHD4 conceals putative CTCF motifs (Fig. 1F, left). The observed changes in CTCF binding were not derived from any change in
To elucidate how CHD4 conceals putative CTCF motifs, we comprehensively analyzed CTCF ChIP-seq data obtained from Control and Chd4KD cells. We first determined the CTCF-binding sites (or ChIP-seq peaks) in Control and Chd4KD cells. We then merged the CTCF-binding sites (n = 39,800) and sorted them in descending order of their relative CTCF intensity (Chd4KD/Control) (Figs. 3C and 3F, Supplementary Figs. 3E, S3H-S3J). According to this relative CTCF intensity, we defined two groups of CTCF-binding sites: Group 1 (n = 4,736, 11.9%) represents CTCF-binding sites at which more CTCF was recruited in Chd4KD cells compared to Control cells, while Group 2 (n = 35,064, 88.1%) represents the remainder of the CTCF-binding sites, at which CTCF binding was unchanged or reduced in Chd4KD cells. Consistent with our above-described results (Figs. 2I, 2K, 3A, and 3B, Supplementary Figs. S2B, S2C, S2E, S3A, and S3B), we observed that CHD4 depletion-triggered gain and loss of CTCF binding occurred at the Group 1 and Group 2 CTCF-binding sites, respectively (Fig. 3C). Furthermore, we found that RAD21 was aberrantly gained along with CTCF at the Group 1 sites upon CHD4 depletion (Supplementary Fig. S3E, left). This supports our observation that the CTCF that is aberrantly gained upon CHD4 depletion is directly involved in forming new chromatin loops (Supplementary Fig. S2I), and thereby disrupting local 3D chromatin organizations.
Previously, we observed that CTCF binds to exposed DNA sequences surrounded by well-positioned nucleosomes (Fig. 1D) (Cuddapah et al., 2009), and herein found that CHD4 also localizes near these sites (Fig. 1D). In addition, we found that CTCF was aberrantly gained upon CHD4 depletion at the CHD4 CUT&RUN peaks (Figs. 3A and 3B, Supplementary Figs. S3A and S3B), which harbor CTCF motifs. Considering these results, we hypothesized that CHD4 conceals putative CTCF motifs by regulating chromatin accessibility at specific DNA sequences with embedded CTCF motifs (Fig. 1F, left). To test this hypothesis, we analyzed MNase-seq (chromatin digestion with micrococcal nuclease combined with sequencing), DNase-seq (DNase I hypersensitive site sequencing), and ATAC-seq (assay for transposase-accessible chromatin using sequencing) data and aligned them with the same order as previously defined relative CTCF intensity (Group 1 and 2 sites). In Control cells, CTCF mainly localized to Group 2 sites (Fig. 3C), which were characterized by an accessible chromatin region with depleted nucleosomes, enriched DNase I hypersensitive sites, and enriched ATAC-seq signals (Fig. 3C). In accordance with this, Control cells exhibited far less binding of CTCF at Group 1 sites (Fig. 3C), which were characterized by an inaccessible chromatin region with enriched nucleosomes, depleted DNase I hypersensitive sites, and depleted ATAC-seq signals (Fig. 3C). Importantly, we detected a significant decrease in nucleosomes and a significant increase in ATAC-seq signals upon CHD4 depletion at Group 1 sites (Figs. 3C-3E), indicating that CHD4 depletion caused the Group 1 sites to change from inaccessible chromatin regions to accessible chromatin regions. In contrast, we did not detect any significant change in chromatin accessibility at Group 2 sites (Figs. 3C and 3D).
To examine why nucleosomes were depleted specifically at Group 1 sites, we analyzed the features of these DNA sequences. Notably, we found that nucleosome-repelling sequences (AA-AAAA) and DNA bases (AT) (Kaplan et al., 2009; Segal and Widom, 2009; Tillo and Hughes, 2009; Valouev et al., 2011) were more abundant within Group 1 sites compared to Group 2 sites (Fig. 3C). Moreover, CTCF motifs were also present within Group 1 sites (Fig. 3C), even though these sites were not typically targeted by CTCF in Control cells (Fig. 3C). Consistently, when we performed the same analysis using CTCF ChIP-seq peaks of –Aux and +Aux cells, we observed similar changes in CTCF, RAD21, and ATAC-seq signals upon CHD4 depletion at Group 1* sites (Supplementary Figs. S3F and S3G). The Group 1* sites also exhibited patterns similar to those of Group 1 sites in terms of depleted DNase I hypersensitive sites, enriched nucleosome-repelling sequences, and enriched CTCF motifs (Fig. 3C). Taken together, our results suggest that CHD4 maintains the inaccessible/closed chromatin state and conceals CTCF motifs at Group 1 sites by assembling nucleosomes, thereby preventing aberrant CTCF recruitment. Upon CHD4 depletion, however, nucleosomes are no longer assembled by CHD4 and instead tend to be removed from Group 1 sites due to the high-level enrichment of nucleosome-repelling sequences, resulting in the open/accessible chromatin states, exposure of CTCF motifs, and the mislocalization of CTCF. Thus, our data support the idea that the CHD4-mediated regulation of chromatin accessibility controls the appropriate recruitment of CTCF in mESCs.
We next determined the epigenetic features of the Group 1 sites using ChIP-seq data generated against various factors, including histone modifications. Consistent with our observation of nucleosome enrichment (Fig. 3C), we observed that histone H3.1/2 was enriched at Group 1 sites in wild-type mESCs (Supplementary Fig. S3H). Remarkably, among the various histone modifications, H3K9me3 (Fig. 3F, Supplementary Fig. S3H) and the H3K9 methyltransferases, SUV39h1 and SUV39h2, but not SETDB1 (Supplementary Fig. S3H), were the only marks enriched at Group 1 sites in wild-type mESCs. As expected, given that the chromatin structure changed from the closed (nucleosome-enriched) to open (nucleosome-depleted) chromatin state at Group 1 sites upon CHD4 depletion, H3K9me3 was also diminished at these regions in Chd4KD cells (Fig. 3F); this reflected the absence of core histones, which are the substrates for H3K9 methyltransferases. This reduction was not related to any change in the expression of genes encoding H3K9 methyltransferases (Supplementary Table S7). Whereas CHD4 (Supplementary Fig. S3E) and ChAHP complexes (Supplementary Fig. S3I) were abundant at Group 1 sites, most of the analyzed chromatin remodelers (Supplementary Fig. S3E), mediators, histone variants, histone modifications (Supplementary Fig. S3H), nucleosome remodeling deacetylase (NuRD) complexes, architectural proteins, and transcription factors (Supplementary Fig. S3J) were sparse at the Group 1 sites. Together, these results indicate that CHD4 assembles core histones to conceal putative CTCF motifs at heterochromatic regions.
Lastly, we analyzed the positions of the Group 1 and Group 2 CTCF-binding sites with respect to the TADs. We observed that the Group 2 sites were mostly localized at the TAD borders, whereas the Group 1 sites (equivalent to gained CTCF in Fig. 2I) were not (Fig. 3G). Instead, the latter sites were mainly located at the interior regions of TADs in Control cells (84.8%) and –Aux cells (86.7%) (Fig. 3H), which is consistent with our earlier observations (Figs. 2J and 2K, Supplementary Figs. S2B and S2C). In light of the reduced insulation scores (Fig. 2L) and APA results (Supplementary Fig. S2I), these data collectively imply that CHD4 depletion-triggered gained CTCF (Group 1 sites) directly contributes to disrupting local TAD organizations by forming a TAD borders and/or chromatin loops at TAD interiors.
Our results indicated that CHD4 depletion increased both chromatin accessibility (decreased nucleosome levels and increased ATAC-seq signals) and CTCF occupancy at Group 1 sites (Fig. 3, Supplementary Fig. S3). Since CHD4 is a chromatin remodeler, we speculated that CHD4 depletion would first alter chromatin accessibility, leading to the exposure of CTCF motifs, whereupon CTCF would be newly recruited to the Group 1 sites, resulting in aberrant CTCF binding (Fig. 3, Supplementary Fig. S3, see also Fig. 4A). To confirm this, we carried out two types of temporal experiments to determine the order of events (Figs. 4A and 4H). We first performed temporal depletion of CHD4 by varying the RNAi (siRNA against
To further validate our findings, we performed temporal restoration of CHD4 levels (Fig. 4H). We first confirmed that CHD4 was completely degraded after 6 h of auxin treatment in CHD4-mAID cells (Supplementary Fig. S4B). Then, we removed the auxin by changing the cell culture medium. We found that CHD4 protein was gradually restored during 12 h of auxin withdrawal (Figs. 4I and 4J). To determine whether core histones assembly precedes CTCF eviction (or vice versa) at Group 1 sites upon CHD4 restoration, we varied the auxin withdrawal time (Fig. 4I) and performed ChIP-seq against CTCF and histone H3. Consistent with our earlier results (Figs. 3 and 4D, Supplementary Fig. S3), we found that CHD4 depletion, which is equivalent to 6 h of auxin treatment without auxin withdrawal (“+Aux (6 h), withdraw 0 h”), decreased the H3 level and increased the CTCF level at Group 1 sites (Supplementary Fig. S4C). Upon temporal restoration of CHD4, the CTCF level gradually decreased (gained CTCF) and the H3 level gradually increased (reflecting increased chromatin accessibility) at Group 1 sites (Supplementary Fig. S4C). When we performed the same analysis using the CTCF ChIP-seq peaks of –Aux and +Aux cells (“+Aux (6 h), withdraw 0 h”), we observed the same rescue patterns for CTCF and H3 (i.e., a gradual decrease and increase in the levels of CTCF and H3, respectively) upon temporal restoration of CHD4 at Group 1* sites (Fig. 4K). To quantitatively assess these changes, we calculated the average intensities of CTCF and H3 at Group 1* sites for each time point (Figs. 4L and 4M). Importantly, we found that the H3 level increased between 0.5-3 h of auxin withdrawal (Fig. 4M, red arrow) and the CTCF level thereafter decreased between 3-12 h of auxin withdrawal (Fig. 4L, red arrow). These results were further confirmed when we examined individual loci of Group 1* sites (Fig. 4N, Supplementary Fig. S4D). Thus, CHD4 restoration first rescues chromatin accessibility (open to closed chromatin state), then rescues CTCF binding. Collectively, these results enabled us to determine the order of events and conclude that CHD4 directly regulates chromatin accessibility at Group 1 sites to conceal putative CTCF motifs and thereby prevent aberrant CTCF binding.
We next questioned why CHD4 depletion majorly affects chromatin accessibility at Group 1 sites but has only modest effects at Group 2 sites, even though it binds to both types of site (Supplementary Fig. S3E). The most plausible explanation would be that there is a between-site-type difference in the presence of various factors. Indeed, such differences were seen among chromatin remodelers (Supplementary Fig. S3E), mediators, histone variants, histone modifications (Supplementary Fig. S3H), NuRD complexes, architectural proteins, and transcription factors (Supplementary Fig. S3J). Such factors, particularly the chromatin remodelers (CHD1 and brahma-related gene-1 [BRG1]), may compensate for the loss of CHD4 at Group 2 sites. At Group 1 sites, in contrast, these factors are sparse, while CHD4, ChAHP complex, and SUV39H1/2 are abundant. Together, these differences may explain why CHD4 depletion primarily affects the chromatin states of putative CTCF motifs at Group 1 sites.
Alternatively, differences in the chromatin state between Group 1 and 2 sites may explain the differential effects of CHD4 depletion. As described above, Group 1 sites represent heterochromatic regions, while Group 2 sites represent relatively euchromatic regions (Figs. 3C and 3F, Supplementary Fig. S3H) where RNAs are abundant. Since CHD4 is an RNA-binding protein (He et al., 2016; Hendrickson et al., 2016), we hypothesized that RNAs might inhibit the catalytic activity of CHD4 (given that CHD4 functions majorly at RNA-depleted heterochromatic Group 1 sites in wild-type mESCs), which could explain why CHD4 depletion primarily affects Group 1 sites. We first confirmed that CHD4 antibodies could be used for immunoprecipitation (Supplementary Fig. S5A), and then performed RIP. We found that CHD4 bound the
To investigate the importance of the RNA-binding ability of CHD4 in the regulation of CTCF binding, we first sought to identify the specific RNA-binding domain(s) of CHD4. Many RNA-binding proteins contain IDRs that can bind to RNAs (Lin et al., 2016; Molliex et al., 2015). Here, we used MobiDB (Piovesan et al., 2018; 2021) to determine the RNA-binding IDRs of CHD4. Our MobiDB analysis predicted four IDRs (IDR 1-4) within
Having defined the epigenetic features of H3K9me3-enriched heterochromatic aberrant CTCF-binding sites (Group 1) and determined the role of CHD4 at these sites, we next sought to elucidate the genomic features of Group 1 sites by examining the distribution of genomic regions via genome annotation. Notably, we found that SINEs (particularly B2 SINEs), which are well-known retrotransposons (McClintock, 1950), were enriched specifically at Group 1 sites (Fig. 6A); this is consistent with previous reports that H3K9me3 suppresses SINEs (Martens et al., 2005; Varshney et al., 2015), which harbor CTCF motifs (Bourque et al., 2008; Schmidt et al., 2012). Since CHD4 depletion caused the loss of nucleosomes/core histones (Figs. 3 and 4, Supplementary Figs. S3 and S4), resulting in the reduction of H3K9me3 levels (Fig. 3F) at Group 1 sites, we next investigated whether this affected the transcription of B2 SINE by performing mRNA-seq, total RNA-seq, and global run-on sequencing (GRO-seq). Surprisingly, CHD4 depletion elevated the signals for all three assays at Group 1 sites (Fig. 6B, Supplementary Fig. S6A), where B2 SINEs are enriched (Fig. 6A). Our GRO-seq analysis further revealed that the number of
To confirm this, we determined the expression levels of various retrotransposons, including long interspersed nuclear elements (LINEs), endogenous retroviruses (ERVs), and SINEs, using RT-qPCR. Consistently, the expression level of B2 SINE, a major mouse SINE, was dramatically up-regulated in Chd4KD cells, whereas the other retrotransposons (e.g., LINE-1 and IAP) were not significantly altered (Fig. 6C). B2 SINE transcripts are generally transcribed by RNA polymerase III (RNAPIII), but some B2 SINEs are located within introns and may be affected by RNA polymerase II (RNAPII). To exclude the possible relevance of RNAPII-affected B2 SINES, we applied the RNAPII inhibitor, α-amanitin, and determined the expression of B2 SINEs in mESCs. Consistent with the above-described results, B2 SINEs were specifically up-regulated by
We next investigated the role of CHD4 depletion-induced de-repressed B2 SINE in mESCs. As up-regulated B2 SINE RNA has been reported to directly bind RNAPII and inhibit transcription (Allen et al., 2004; Espinoza et al., 2004), we examined whether CHD4 depletion-triggered de-repressed B2 SINE RNA could do the same. We first performed RIP and confirmed that de-repressed B2 SINE transcripts have a binding preference for RNAPII in Chd4KD cells (Fig. 6E). We then used genomic regions enrichment of annotations tool (GREAT) analysis (McLean et al., 2010) to identify genes located in the vicinity of de-repressed B2 SINEs. Interestingly, our results revealed that genes important for stem cell maintenance (pluripotent genes) were located near de-repressed B2 SINEs (Fig. 6F) and Group 1 CTCF-binding sites (Supplementary Fig. S6E). To investigate whether de-repressed B2 SINE transcripts hinder the recruitment of RNAPII to the transcription start sites (TSSs) of genes involved in stem cell maintenance, we analyzed our total RNA-seq data and publicly available RNAPII ChIP-exo data (GSE64825, see also Supplementary Table S4). For a detailed analysis, we separated the total RNA-seq data into those for the –/+ strands and confirmed the presence of de-repressed B2 SINEs in Chd4KD cells (Fig. 6G). Notably, Chd4KD cells exhibited a significant reduction of RNAPII recruitment at TSSs when abundant de-repressed B2 SINEs were located nearby (Fig. 6G), whereas no reduction of RNAPII recruitment was observed at TSSs without a nearby de-repressed B2 SINE (Supplementary Fig. S6F). The observed changes in RNAPII recruitment were not derived from any change in the expression levels of the RNAPII subunits (Supplementary Fig. S6G). To determine whether this reduction of RNAPII recruitment at TSSs could dysregulate the expression levels of stem cell maintenance-related genes, we analyzed total RNA-seq and mRNA-seq data. Notably, we observed that genes with abundantly de-repressed B2 SINEs, which exhibited reduced RNAPII recruitment at their TSSs (Fig. 6G), were down-regulated (Supplementary Fig. S6H, red) compared to genes that lacked a nearby de-repressed B2 SINE (Supplementary Fig. S6H, gray). Taken together, our data indicate that
Here, we explored the changes in the 3D chromatin organizations of mESCs upon CHD4 depletion using
Since the genome is packaged into 3D chromatin organizations and most of the biological processes, including transcription, occur in this context, it is important to investigate the 3D organization of the genome. Previous studies characterized the 3D chromatin organization in terms of compartments (Lieberman-Aiden et al., 2009; Simonis et al., 2006), TADs (Dixon et al., 2012; Nora et al., 2012), contact/loop domains (Rao et al., 2014; 2017), and insulated neighborhoods (Dowen et al., 2014; Ji et al., 2016). Recent reports have mainly addressed the importance of CTCF and cohesin in 3D genome topology by observing the disruption of 3D chromatin structures after CRISPR/Cas9 system-mediated deletion/orientation change of CTCF-binding sites (de Wit et al., 2015; Guo et al., 2015; Sanborn et al., 2015), deletion of cohesion (Luppino et al., 2020; Rao et al., 2017; Szabo et al., 2020), and deletion of CTCF (Luppino et al., 2020; Nora et al., 2017; Szabo et al., 2020). These studies mainly focused on the properties and formation of the 3D chromatin organization and the key factors (e.g., CTCF and cohesin) that compose its structures. However, intriguing questions remained, such as: (1) How are TADs established or maintained? (2) How are aberrant CTCF binding and associated Hi-C interactions at TAD interiors prevented? (3) How is CTCF recruitment at the TAD borders regulated? (4) Are chromatin remodelers involved in CTCF recruitment? (5) What is the role of chromatin remodelers in 3D chromatin organizations? We believe that our present work provides clues to these questions by elucidating a CHD4-mediated mechanism that conceals aberrant CTCF-binding sites at the TAD interiors by regulating chromatin accessibility, thereby preventing inappropriate CTCF binding and safeguarding both proper CTCF recruitment and 3D chromatin organization in mESCs.
It is interesting to set our results in the context of evolutionary aspects. A previous report compared Hi-C data from four mammals and identified two types of CTCF sites: (1) conserved CTCF sites, which are conserved across species and enriched at strong TAD borders; and (2) divergent CTCF sites, which lack conservation across species, are newly evolved, and localize at TAD interiors (Vietri Rudan et al., 2015). The authors also demonstrated that the evolutionary dynamics of intra-TAD interactions, which reflect the actions of divergent CTCF sites, could play a critical role in enhancer-promoter interactions within TADs (Vietri Rudan et al., 2015). In support of this, a recent review suggested that 3D chromatin organization may have significantly impacted the evolution of transcriptional regulation by increasing regulatory complexity (the combination of enhancer and promoter contacts) (Acemel et al., 2017). Taken together, these findings indicate that the evolution of gene regulation is strongly correlated with 3D chromatin organization, especially intra-TAD interactions involving divergent CTCF sites. Interestingly, the divergent CTCF sites that primarily reside at TAD interiors have been spread through genomes by retrotransposition of B2 SINE transposable elements, which contain CTCF motifs in their consensus sequences (Bourque et al., 2008; Schmidt et al., 2012). Our present data illustrate that the chromatin remodeler, CHD4, conceals the B2 SINE-originated divergent/inappropriate CTCF-binding sites by regulating chromatin accessibility, thereby preventing three events: (1) aberrant CTCF recruitment; (2) aberrant CTCF binding-associated changes in local TADs (separating and weakened TADs); and (3) overexpression of B2 SINE transcripts in mESCs.
The previous studies on chromatin remodeler mainly focused on examining ATP catalysis-based nucleosome sliding
Among the studies that did not focus specifically on transcribed regions, several reports showed that chromatin remodelers are closely associated with CTCF. For example, RNAi-mediated knockdown of
Based on our findings, we propose a mechanism wherein the ATP-dependent chromatin remodeler, CHD4, maintains the lower-level chromatin structure (including chromatin accessibility and nucleosome positioning/occupancy) and prevents the aberrant recruitment of CTCF, thereby safeguarding appropriate CTCF binding and associated higher-order TAD organization by suppressing aberrant interactions at TAD interiors (Fig. 7). Our results reveal how the primary structure of chromatin could facilitate higher-order 3D chromatin organization in mESCs. Collectively, our findings provide a novel perspective on the functional role of chromatin remodelers in mESCs, thereby adding an extra layer of information in stem cell biology and greatly facilitating our understanding of how chromatin remodelers function in embryonic stem cells.
We thank Taemook Kim and Prof. Keunsoo Kang for assistance with analyzing the NGS data; Kwang-Beom Hyun and Prof. Jaehoon Kim for purification of CHD4 protein using the Bac-to-Bac Baculovirus expression system (Full-length cDNA of mouse
S.H., H.L., and D.L. designed the concepts and experiments. S.H. analyzed most of the NGS data. S.H. and H.L. performed most of the experiments. S.H. performed the Hi-C experiments for –Aux and +Aux cells. A.J.L. performed the Hi-C experiments for Control and Chd4KD cells under the guidance of I.J. S.K.K. performed the GRO-seq experiments under the guidance of T.K.K. S.H. and H.L. wrote the manuscript under the mentorship and technical supervision of G.Y.K. and D.L.
The authors have no potential conflicts of interest to disclose.