Mol. Cells 2022; 45(3): 158-167
Published online March 2, 2022
https://doi.org/10.14348/molcells.2022.2027
© The Korean Society for Molecular and Cellular Biology
Correspondence to : cshwang@postech.ac.kr(C.-S.H.),clee270@kist.re.kr(C.L.)
This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/.
Ubiquitin (Ub) is post-translationally modified by Ub itself or Ub-like proteins, phosphorylation, and acetylation, among others, which elicits a variety of Ub topologies and cellular functions. However, N-terminal (Nt) modifications of Ub remain unknown, except the linear head-to-tail ubiquitylation via Nt-Met. Here, using the yeast Saccharomyces cerevisiae and an Nt-arginylated Ub-specific antibody, we found that the detectable level of Ub undergoes Nt-Met excision, Nt-deamination, and Nt-arginylation. The resulting Nt-arginylated Ub and its conjugated proteins are upregulated in the stationary-growth phase or by oxidative stress. We further proved the existence of Nt-arginylated Ub in vivo and identified Nt-arginylated Ub-protein conjugates using stable isotope labeling by amino acids in cell culture (SILAC)-based tandem mass spectrometry. In silico structural modeling of Nt-arginylated Ub predicted that Nt-Arg flexibly protrudes from the surface of the Ub, thereby most likely providing a docking site for the factors that recognize it. Collectively, these results reveal unprecedented Nt-arginylated Ub and the pathway by which it is produced, which greatly expands the known complexity of the Ub code.
Keywords arginylation, deamination, methionine excision, N-degron, proteolysis, ubiquitin code
The covalent attachment of ubiquitin (Ub) to target proteins recruits the cascade reactions of the E1 activating enzymes, E2 conjugation enzymes, and E3 ligases. Ub can also be post-translationally tagged by Ub itself or Ub-like proteins, phosphorylation, acetylation, ribosylation, and deamination (Mattiroli and Penengo, 2021; Swatek and Komander, 2016) (for details, see the references therein) (Fig. 1A). These post-translational modifications induce distinct topologies of the Ub molecule, leading to a vast range of cellular outcomes (cell division, death, development, etc.) via the creation of distinct words for the Ub code (Dittmar and Selbach, 2017; Komander and Rape, 2012; Kwon and Ciechanover, 2017). The identification of new types of Ub modification can reveal unrecognized Ub codes, which deepens our understanding of a vast range of Ub-associated biological processes. However, the post-translational modifications of Ub
Virtually all newly made polypeptides from ribosomes contain methionine (Met) due to the design of the AUG initiation codon. The initiator Met is frequently excised by Met-aminopeptidases (MetAPs), in cases where the residue at position 2 is Gly, Ala, Ser, Thr, Pro, or Val. Intriguingly, Nt-Met excision (NME) is an evolutionarily conserved and essential process that is applied to two-thirds of proteins (Giglione et al., 2004). NME also operates even in a set of proteins with a residue larger than Val at position 2, although the mechanism underlying this is still unclear (Chen and Kashina, 2021; Redman and Rubenstein, 1981; Sadis et al., 1995). Specifically in the yeast
The 76-residue Ub starts with the Met-Gln (MQ) N-terminus. The MQ-starting Ub (MQ-Ub3-76) is hereafter denoted simply as MQUb. Thus, we hypothesized that MQUb might be processed by these NME-provoked cascade reactions of Nt-deamination and Nt-arginylation to yield Nt-arginylated RE-Ub3-76 (denoted here as REUb) in yeast and mammalian cells (Figs. 1A and 2A). Indeed, we here identify the previously unknown REUb in yeast and consider the ramifications of this discovery, which greatly expands the known complexity and biological significance of the Ub code.
The yeast strains, plasmids, and oligomers used in this study are listed in Table 1. To construct pCH6913, pCH6914, pCH6915, or pCH6916 expressing MQUbAA-GST, QUbAA-GST, EUbAA-GST, or REUbAA-GST in the pHUE vector (Catanzariti et al., 2004), respectively, Ub variant-encoding open reading frames (ORFs) were polymerase chain reaction (PCR)-amplified from pCH5097 as a template using a set of primer pairs: OCH7003/OCH8110, OCH8089/OCH8110, OCH8088/OCH8110, or OCH6968/OCH8110. The resulting PCR products were digested with
Yeast cells were cultured in YPD (1% yeast extract, 2% peptone, 2% glucose) and synthetic complete (0.67% yeast nitrogen base with ammonium sulfate, 2% glucose, specific compounds essential for the growth of an auxotrophic strain) media. The construction of yeast strains and DNA transformation were performed following standard techniques (Gietz and Schiestl, 2007; Janke et al., 2004; Sherman, 2002).
pCH6913 (expressing 6HisUb-MQUbAA-GST), pCH6914 (expressing 6HisUb-QUbAA-GST), pCH6915 (expressing 6HisUb-EUbAA-GST), pCH6916 (expressing 6HisUb-REUbAA-GST), pCH7047 (expressing 6HisUb-MQIFVKTLTGK-GST), pCH7048 (expressing 6HisUb-QIFVKTLTGK-GST), pCH7049 (expressing 6HisUb-EIFVKTLTGK-GST), and pCH7050 (expressing 6HisUb-REIFVKTLTGK-GST) were transformed into BL21 (DE3)
Cell pellets were thawed and resuspended in lysis buffer (50 mM HEPES, 150 mM NaCl, 10% glycerol, 1 mM EDTA, 1 mM phenylmethylsulfonyl fluoride [PMSF, Cat. No. 10837091001; Sigma-Aldrich, USA], 1 mM DTT, 0.1% Triton X-100) with lysozyme (final concentration of 1 mg/ml). Resuspended cells were incubated on ice for 20 min and disrupted by sonication (VCX-130; SONICS, USA), five times for 30 s each, at 1 min intervals with 40% amplitude. Cell lysates were centrifuged at 25,000 ×
The synthetic peptide REIFVKTLTGKC was used for the production of anti-REUb as the antigen. Rabbit polyclonal antisera to the peptide were raised by AbClon (Korea). Antibodies specific for REIFVKTLTGKC were “negatively” selected from antiserum by sequentially incubating 5 ml of clarified antiserum with 1 ml of Glutathione Sepharose beads that had been pre-incubated individually with 2 mg of the purified MQIFVKTLTGK-GST, QIFVKTLTGK-GST, and EIFVKTLTGK-GST. The flow-through of antisera was further “positively” selected by overnight incubation with REIFVKTLTGK-GST-conjugated Sulfo-Link Coupling Resin (Cat. No. 20401; Thermo Fisher Scientific, USA) at 4°C. Bound antibodies were eluted by adding 9 ml of 0.1 M glycine HCl (pH 3.5). The eluted antibodies were further incubated with Affi-Gel 10/15 (Cat. No. 1536098; Bio-Rad, USA) that were pre-conjugated with 30 mg of proteins in the extracts of CHY2014 (
The indicated yeast cells were cultured in 100 ml of YPD medium to
The CBB-stained protein bands of interest were excised, destained in 50 mM ammonium bicarbonate with 50% acetonitrile (ACN), and dehydrated with 100% ACN, followed by drying in a vacuum evaporator. The destained proteins were proteolyzed with trypsin by the in-gel digestion method (Kim et al., 2013). The tryptic peptides were reconstituted in 7 µl of 0.1% formic acid, and a 5 µl aliquot was injected into a reverse-phase EASY-Spray PepMap RSLC C18 LC column (0.075 mm inner diameter × 500 mm length) on the Eksigent NanoLC Ultra system with an integrated column heater at 40°C; the column was pre-equilibrated with 96% buffer A (0.1% formic acid in water) and 4% buffer B (0.1% formic acid in ACN). The peptides were eluted with a 4%-35% gradient of buffer B over 150 min and a 32%-80% gradient of buffer B over 40 min. The total ion acquisition run time was set to 190 min at a flow rate of 250 nl/min. A Q-Exactive mass spectrometer (Thermo Fisher Scientific) was operated in data-dependent acquisition mode for the entire analysis. Full scans (m/z 300-1600) were acquired at a resolution of 70,000 using an automatic gain control (AGC) target value of 1e6 and a maximum ion injection time of 30 ms. Tandem mass spectra were generated for up to 12 precursors by high-energy collision dissociation using a normalized collision energy of 35%. The dynamic exclusion was set to 60 s. Fragment ions were detected in normal scan mode using an AGC target value of 5e4 and a maximum ion injection time of 120 ms. Source ionization parameters were as follows: spray voltage, 1.9 kV; capillary temperature, 275°C; and S-Lens RF Level, 50.
The RAW files from Q-Exactive were directly conveyed to the proteomic data analysis program suite, Proteome Discoverer v2.2 (Thermo Fisher Scientific). We employed the adjusted exemplar workflow for the SILAC experiment and the basic Sequest HT search module. Parameters for the Sequest search module were as follows: enzyme, specifically trypsin; protein database, UniProt reference proteome database of yeast (UP000002311, released in 04/2019) plus modified cRAP contaminant database with accessions in the dust/contact category (https://www.thegpm.org/crap/); fragment mass tolerance, 0.05 Da; precursor mass tolerance, 20 ppm; dynamic modifications, acetylation of protein-N-term (+42.010565 Da), oxidation of methionine (+15.9949 Da), label: 13C615N2 at lysine (+8.014199 Da), label: 13C615N4 at arginine (+10.008269 Da), Gly–Gly at Lys (Kε-GG) or protein-N-term (+114.042927 Da); and static modification, carbamidomethylation of cysteine (+57.021464 Da). Next, the identified spectra were validated using Percolator module, which was set to use a concatenated target/decoy strategy with a target cut-off q-value of 0.01 at PSM (peptide spectrum match) level. To integrate chromatographic features of identified SILAC pairs, we used Minora Feature Detector module with maximal trace retention time window of 5 min. Peptide abundance from the feature detector module was normalized with total sum of the abundance values over all identified peptides. Each result of the replicates was treated separately.
The generated mass spectrum files were directly matched to the peptide sequence in the customized REUb protein fasta file using the MS-GF+ search algorithm (v20190418) (Kim and Pevzner, 2014). The REUb sequence is a virtual one that replaces the two N-terminal residues of Ub MQ with RE. The search engine settings were as follows: trypsin enzyme specificity with 1 as the number of tolerable termini; 20 ppm for MS1 tolerance; variable modifications: oxidation of methionine (+15.9949 Da), label: 13C615N2 at lysine (+8.014199 Da), label: 13C615N4 at arginine (+10.008269 Da); and fixed modification: carbamidomethylation of cysteine (+57.021464 Da). All spectra matched to the Nt-peptide of the REUb were manually inspected. For annotating peptide spectra, we used a freely accessible version of the Interactive Peptide Spectrum Annotator (IPSA) (Brademan et al., 2019).
The amino acid sequence of
To detect the hypothetical REUb (Fig. 2A), we produced rabbit anti-REUb polyclonal antibody using a synthetic peptide, REIFVKTLTGKC, the 11-residue Nt-sequences of REUb and Cys for the KLH (keyhole limpet hemocyanin) conjugation. Dot immunoblotting analysis with the affinity-purified anti-REUb revealed the specific binding of the antibody to the REIFVKTLTGK-GST (glutathione
Moreover, the anti-REUb antibody specifically recognized REUbAA-GST (UbAA, a deubiquitylation-protective Ub variant with the mutation of C-terminal GG to AA), but not MQUbAA-GST, QUbAA-GST, or EUbAA-GST (Fig. 2C). In agreement with these dot blotting results, immunoblotting with anti-REUb further confirmed the specific recognition of purified REUb but not of MQUb; the REUb and MQUb were produced by the
We next attempted to determine the levels of endogenous REUb and REUb-linked protein species by immunoblotting with the purified anti-REUb. Notably, however, REUb and REUb–protein conjugates in wild-type
Since MQUb is a putative substrate of NatB Nt-acetylase that targets MQ, MN, ME, and MD at N-termini (Nguyen et al., 2018; Ree et al., 2018), we conjectured that Nt-acetylation might inhibit the NME-triggered Nt-modifications of MQUb. To rule out this possibility, we performed anti-REUb-based immunoblotting in the extracts from
To detect the REUb–protein adducts more effectively and sensitively, we performed anti-REUb-based immunoprecipitation, followed by immunoblotting with a highly Ub-specific monoclonal anti-Ub (P4D1) antibody, in extracts from
Ubr1 is as a key E3 Ub ligase of the Arg/N-degron pathway and directly detects the Nt-Arg of Nt-arginylated proteins via its UBR box (Choi et al., 2010). Interestingly, the levels of REUb and REUb-protein adducts were discernably decreased in
We also observed that REUb–protein adducts were strongly increased in wild-type cells in the presence of 2 mM H2O2, an oxidative stressor (Fig. 2I; lane 7). In agreement with results with the stationary-phase yeast, however, the REUb-conjugated proteins were hardly detected in
Given that oxidative stress increases the levels of REUb–protein conjugates, we next sought to identify the REUb-attached proteins by employing SILAC in the H2O2-treated wild-type cells and
Out of 541 distinct proteins detected by LC-MS/MS, 83 proteins were identified as direct or indirect REUb-binding candidates in the H2O2-treated wild-type cells (Fig. 3B). STRING enrichment analysis (for protein–protein interaction networks) indicated the functional associations of these putative REUb-interacting proteins with a set of cellular processes including gene transcription, vesicle trafficking, DNA replication, RNA processing, and mitochondrial regulation (Fig. 3C).
The tryptic digestion of Ub-conjugated proteins produces a K-ε-GG (KGG) Ub isopeptide, which allows mapping of the ubiquitylation sites of substrates by LC-MS/MS (Kirkpatrick et al., 2005). Using this Ub-remnant KGG signature-based approach, in the H2O2-treated yeast cells we identified the (putative) REUb-conjugated proteins and determined their ubiquitylation sites: Ub (43LIFAGKGGQLEDGR54); Fdc1 (a ferulic acid decarboxylase, 126THILSEEK133GG); Ino80 (a chromatin remodeling ATPase, 939NVQSELGDKGGIEIDVLCDLTQR959); Rpb3 (an RNA polymerase subunit, 155LTCVAKGGK161); Uso1 (a vesicle transport protein, 912ITEIKGGAINENLEEMK926); Sla1 (a cytoskeleton-binding protein, 358GIVQYDFMAESQDELTIK375GG); Ufd4 (a Ub-fusion decay E3 ligase, 549AINDQLIK556GG); Rad57 (a DNA repair protein, 405SFKGGASTIIQR414); Hda1 (a histone deacetylase, 561SKGGLNDELR568); and Rpl36a (a ribosome 60S subunit, 85AKGGVEEMNNIIAASR98) (Fig. 3D). In this setting, however, it should be noted that REUb might be attached not only directly to the identified proteins but also indirectly to them by way of the Ub moiety of the already Ub-tagged proteins.
Strikingly, the exhaust manual survey of the tandem MS/MS database revealed the REUb-derived REIFVK Nt-peptide, verifying the actual existence of REUb
To gain insight into the mechanistic role of REUb, we constructed its neural network-based 3D structure (for details, see Materials and Methods section). The overall surface structure of REUb was almost the same as that of MQUb, except its Nt-region (Figs. 4A and 4B). Specifically, the top and side views of the predicted models revealed that seven ubiquitylatable Lys residues and C-terminal Gly of REUb are located at almost the same surface positions as those of MQUb. Tellingly, however, Nt-Arg of REUb is stretched outward from the globular Ub protein, whereas MQUb retains its Nt-Met buried inward (Figs. 4A and 4B).
Given these findings, using the three-dimensional structure viewer programs we further estimated the H-bond acceptor–donor lengths of Nt-Met-Gln, Nt-Gln, Nt-Glu, or Nt-Arg-Glu of MQUb, QUb, EUb, or REUb with their surrounding residues, respectively (Figs. 4C-4F) (see Materials and Methods section for details). In MQUb, Nt-Met forms two H bonds with Val17 (each 2.69 Å in length), and Gln at position 2 forms two H bonds with Glu16 (2.54 Å) and with Glu64 (2.56 Å) (Fig. 4C). In contrast, both Nt-Gln of QUb and Nt-Glu of EUb form only one H bond with Glu64, resulting in H-bond acceptor–donor lengths of 2.77 Å and 2.67 Å, respectively (Figs. 4D and 4E). In the case of REUb, however, Nt-Arg forms two H bonds with Val17 (3.44 Å and 2.63 Å) and Glu at position 2 with Glu64 (2.66 Å). Notably, the H-bond acceptor–donor length between the amino group of Nt-Arg and the oxygen atom of Val17 in REUb was increased by 3.44 Å, relative to that in MQUb with 2.69 Å (Fig. 4F). These results suggest that Nt-Arg-Glu of REUb interacts more weakly with its surrounding residues Glu16, Val17, and Glu64 than Nt-Met-Gln of MQUb. Consequently, the Nt-Arg flexibly protruding from the Ub globule would increase the accessibility of REUb to the factors that recognize it.
In this study, we produced for the first time anti-REUb-specific antibody and detected the previously unrecognized REUb or REUb-conjugated proteins in
Using anti-REUb-based immunoprecipitation–tandem mass analysis, this study provides direct evidence of the existence of REUb
We found that REUb can be produced
The actual X-ray crystal structure of MQUb (PDB:4XOL, resolution 2.91 Å) reveals that the B factors of MQUb, Met-1, and Gln-2 are ~88.1 Å2, ~78.1 Å2, and ~90.5 Å2, respectively, reflecting the rigid property of Nt-Met-Gln region in MQUb. Accordingly, the relatively flexible exposure of the Nt-Arg moiety of REUb to its surface in comparison to Nt-Met of MQUb (Figs. 4A-4C and 4F and 4F) may provide a binding site for the factors that recognize it, such as Ubr1 that targets basic destabilizing Nt-residues including Nt-Arg for the Arg/N-degron pathway (Nguyen et al., 2019; Varshavsky, 2019).
Ubr1 contains at least three substrate-binding sites for basic (type-1) Nt-residues, bulky hydrophobic (type-2) Nt-residues, and internal degrons (Nguyen et al., 2019; Varshavsky, 2019). REUb is highly upregulated by stationary-growth phase and oxidative stress that most likely induces Ubr1 activation for protein quality control (Szoradi et al., 2018). If so, is Ubr1 mechanistically or functionally linked to the metabolic regulation of REUb under stress conditions and vice versa? As one of parsimonious functions, Ubr1 might mediate the degradation of the apparently faulty REUb or its derivatives which can perturb the operation of normal Ub molecule. However, such possibility seems to be low, because REUb and REUb–protein adducts were rather downregulated in the absence of Ubr1 (Fig. 2H). Conversely, the Nt-Arg-bearing REUb might occupy the type-1 substrate-binding site of Ubr1, thereby modulating the degradation of a wide range of its substrates, similarly to the hydrophilin protein Roq1 (Szoradi et al., 2018). The precise roles of Ubr1 in REUb regulation and the mechanism behind them remain to be examined.
While no specific enzymes for MQUb NME have been identified thus far, Met-aminopeptidases (Map1 or Map2) in yeast most likely remove Nt-Met of MQUb, as in the case of other MQ- or MN-starting proteins (Nguyen et al., 2019). However, the possibility that other dedicated proteases might mediate the NME of MQUb under specific conditions including oxidative stress cannot be ruled out.
The identification of REUb in this study would enormously increase the repertoire and complexity of the Ub code. In addition, REUb would be present and actively operate in multicellular organisms as well because NME, Nt-deamination, and Nt-arginylation are conserved in nearly all eukaryotes (Sriram et al., 2011; Varshavsky, 2019).
We are grateful to the current and former members of the Hwang and Lee laboratories for their assistance and advice. We also thank Edanz (www.edanz.com/ac) for editing a draft of this manuscript. This work was supported by grants from the Korean Government (MSIP) NRF-2020R1A3B2078127 and NRF2017R1A5A1015366 (C.-S.H.) and NRF-2020 R1A2C2003685 (C.L.), and the BK21 Plus program (C.-S.H.).
C.-S.H., C.L., K.T.N., and S.J. designed the research. K.T.N., S.J., S.-Y.K., and C.-S.L. performed the research and all coauthors analyzed the data. C.-S.H., C.L., K.T.N., S.J., and S.-Y.K. wrote the paper.
The authors have no potential conflicts of interest to disclose.
Yeast strains, plasmids, and primers used in this study
Material | Description or sequence | Source |
---|---|---|
Yeasts | ||
JD53 | MATα, his3-Δ200, leu2-3, 112 lys2-801, trp1-Δ63, ura3-52 | Dohmen et al., 1995 |
CHY367 | naa20::NatMX4 in JD53 | Lab collection |
CHY2009 | ubr1∆::KanMX6, naa20∆::NatMX4 in JD53 | Lab collection |
CHY2014 | ate1∆::HphNT2 in JD53 | Lab collection |
CHY2015 | ate1∆::HphNT2, naa20∆::NatMX4 in JD53 | Lab collection |
CHY2017 | nta1∆::HphNT2, naa20∆::NatMX4 in JD53 | Lab collection |
CHY3188 | arg4∆::KnaMX6 in JD53 | Lab collection |
CHY6056 | arg4∆::KanMX6, ate1∆::HphNT2 in JD53 | This study |
Plasmids | ||
pCH20 | pHUE | Catanzariti et al., 2004 |
pCH33 | pGEX4T-3 | Lab collection |
pCH5097 | haUb in pRK5 | Lim et al., 2005 |
pCH6913 | MQUbAA-GST in pHUE | This study |
pCH6914 | QUbAA -GST in pHUE | This study |
pCH6915 | EUbAA-GST in pHUE | This study |
pCH6916 | REUbAA-GST in pHUE | This study |
pCH7047 | MQIFVKTLTGK-GST in pHUE | This study |
pCH7048 | QIFVKTLTGK-GST in pHUE | This study |
pCH7049 | EIFVKTLTGK-GST in pHUE | This study |
pCH7050 | REIFVKTLTGK-GST in pHUE | This study |
Oligomers | ||
OCH6512 | 5′-GGTAAGCTTTCAACGCGGAACCAGATCCGATTT-3′ | |
OCH6968 | 5′-GATCCCGCGGTGGAAGAGAGATCTTCGTGAAGACCCTG-3′ | |
OCH6991 | 5′-GCTGAATTCATGTCCCCTATACTAGGTTATTGGAAAA-3′ | |
OCH7003 | 5′-GATCCCGCGGTGGAATGCAGATCTTCGTGAAGACCCTG-3′ | |
OCH8085 | 5′-GATCCCGCGGTGGAATGCAGATCTTCGTGAAGACTCTGACTGGTAAGACCGGTGCTGGAGCAGGTGCTATGTCCCCT-3′ | |
OCH8088 | 5′-GATCCCGCGGTGGAGAGATCTTCGTGAAGACCCTG-3′ | |
OCH8110 | 5′-GCTGAATTCAGCAGCTCTGAGACGGAGGAC-3′ |
Mol. Cells 2022; 45(3): 158-167
Published online March 31, 2022 https://doi.org/10.14348/molcells.2022.2027
Copyright © The Korean Society for Molecular and Cellular Biology.
Kha The Nguyen1,4 , Shinyeong Ju2,4
, Sang-Yoon Kim1
, Chang-Seok Lee1
, Cheolju Lee2,3,*
, and Cheol-Sang Hwang1,*
1Department of Life Sciences, Pohang University of Science and Technology, Pohang 37673, Korea, 2Center for Theragnosis, Korea Institute of Science and Technology, Seoul 02792, Korea, 3Division of Bio-Medical Science & Technology, KIST School, Korea University of Science and Technology, Seoul 02792, Korea, 4These authors contributed equally to this work.
Correspondence to:cshwang@postech.ac.kr(C.-S.H.),clee270@kist.re.kr(C.L.)
This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/.
Ubiquitin (Ub) is post-translationally modified by Ub itself or Ub-like proteins, phosphorylation, and acetylation, among others, which elicits a variety of Ub topologies and cellular functions. However, N-terminal (Nt) modifications of Ub remain unknown, except the linear head-to-tail ubiquitylation via Nt-Met. Here, using the yeast Saccharomyces cerevisiae and an Nt-arginylated Ub-specific antibody, we found that the detectable level of Ub undergoes Nt-Met excision, Nt-deamination, and Nt-arginylation. The resulting Nt-arginylated Ub and its conjugated proteins are upregulated in the stationary-growth phase or by oxidative stress. We further proved the existence of Nt-arginylated Ub in vivo and identified Nt-arginylated Ub-protein conjugates using stable isotope labeling by amino acids in cell culture (SILAC)-based tandem mass spectrometry. In silico structural modeling of Nt-arginylated Ub predicted that Nt-Arg flexibly protrudes from the surface of the Ub, thereby most likely providing a docking site for the factors that recognize it. Collectively, these results reveal unprecedented Nt-arginylated Ub and the pathway by which it is produced, which greatly expands the known complexity of the Ub code.
Keywords: arginylation, deamination, methionine excision, N-degron, proteolysis, ubiquitin code
The covalent attachment of ubiquitin (Ub) to target proteins recruits the cascade reactions of the E1 activating enzymes, E2 conjugation enzymes, and E3 ligases. Ub can also be post-translationally tagged by Ub itself or Ub-like proteins, phosphorylation, acetylation, ribosylation, and deamination (Mattiroli and Penengo, 2021; Swatek and Komander, 2016) (for details, see the references therein) (Fig. 1A). These post-translational modifications induce distinct topologies of the Ub molecule, leading to a vast range of cellular outcomes (cell division, death, development, etc.) via the creation of distinct words for the Ub code (Dittmar and Selbach, 2017; Komander and Rape, 2012; Kwon and Ciechanover, 2017). The identification of new types of Ub modification can reveal unrecognized Ub codes, which deepens our understanding of a vast range of Ub-associated biological processes. However, the post-translational modifications of Ub
Virtually all newly made polypeptides from ribosomes contain methionine (Met) due to the design of the AUG initiation codon. The initiator Met is frequently excised by Met-aminopeptidases (MetAPs), in cases where the residue at position 2 is Gly, Ala, Ser, Thr, Pro, or Val. Intriguingly, Nt-Met excision (NME) is an evolutionarily conserved and essential process that is applied to two-thirds of proteins (Giglione et al., 2004). NME also operates even in a set of proteins with a residue larger than Val at position 2, although the mechanism underlying this is still unclear (Chen and Kashina, 2021; Redman and Rubenstein, 1981; Sadis et al., 1995). Specifically in the yeast
The 76-residue Ub starts with the Met-Gln (MQ) N-terminus. The MQ-starting Ub (MQ-Ub3-76) is hereafter denoted simply as MQUb. Thus, we hypothesized that MQUb might be processed by these NME-provoked cascade reactions of Nt-deamination and Nt-arginylation to yield Nt-arginylated RE-Ub3-76 (denoted here as REUb) in yeast and mammalian cells (Figs. 1A and 2A). Indeed, we here identify the previously unknown REUb in yeast and consider the ramifications of this discovery, which greatly expands the known complexity and biological significance of the Ub code.
The yeast strains, plasmids, and oligomers used in this study are listed in Table 1. To construct pCH6913, pCH6914, pCH6915, or pCH6916 expressing MQUbAA-GST, QUbAA-GST, EUbAA-GST, or REUbAA-GST in the pHUE vector (Catanzariti et al., 2004), respectively, Ub variant-encoding open reading frames (ORFs) were polymerase chain reaction (PCR)-amplified from pCH5097 as a template using a set of primer pairs: OCH7003/OCH8110, OCH8089/OCH8110, OCH8088/OCH8110, or OCH6968/OCH8110. The resulting PCR products were digested with
Yeast cells were cultured in YPD (1% yeast extract, 2% peptone, 2% glucose) and synthetic complete (0.67% yeast nitrogen base with ammonium sulfate, 2% glucose, specific compounds essential for the growth of an auxotrophic strain) media. The construction of yeast strains and DNA transformation were performed following standard techniques (Gietz and Schiestl, 2007; Janke et al., 2004; Sherman, 2002).
pCH6913 (expressing 6HisUb-MQUbAA-GST), pCH6914 (expressing 6HisUb-QUbAA-GST), pCH6915 (expressing 6HisUb-EUbAA-GST), pCH6916 (expressing 6HisUb-REUbAA-GST), pCH7047 (expressing 6HisUb-MQIFVKTLTGK-GST), pCH7048 (expressing 6HisUb-QIFVKTLTGK-GST), pCH7049 (expressing 6HisUb-EIFVKTLTGK-GST), and pCH7050 (expressing 6HisUb-REIFVKTLTGK-GST) were transformed into BL21 (DE3)
Cell pellets were thawed and resuspended in lysis buffer (50 mM HEPES, 150 mM NaCl, 10% glycerol, 1 mM EDTA, 1 mM phenylmethylsulfonyl fluoride [PMSF, Cat. No. 10837091001; Sigma-Aldrich, USA], 1 mM DTT, 0.1% Triton X-100) with lysozyme (final concentration of 1 mg/ml). Resuspended cells were incubated on ice for 20 min and disrupted by sonication (VCX-130; SONICS, USA), five times for 30 s each, at 1 min intervals with 40% amplitude. Cell lysates were centrifuged at 25,000 ×
The synthetic peptide REIFVKTLTGKC was used for the production of anti-REUb as the antigen. Rabbit polyclonal antisera to the peptide were raised by AbClon (Korea). Antibodies specific for REIFVKTLTGKC were “negatively” selected from antiserum by sequentially incubating 5 ml of clarified antiserum with 1 ml of Glutathione Sepharose beads that had been pre-incubated individually with 2 mg of the purified MQIFVKTLTGK-GST, QIFVKTLTGK-GST, and EIFVKTLTGK-GST. The flow-through of antisera was further “positively” selected by overnight incubation with REIFVKTLTGK-GST-conjugated Sulfo-Link Coupling Resin (Cat. No. 20401; Thermo Fisher Scientific, USA) at 4°C. Bound antibodies were eluted by adding 9 ml of 0.1 M glycine HCl (pH 3.5). The eluted antibodies were further incubated with Affi-Gel 10/15 (Cat. No. 1536098; Bio-Rad, USA) that were pre-conjugated with 30 mg of proteins in the extracts of CHY2014 (
The indicated yeast cells were cultured in 100 ml of YPD medium to
The CBB-stained protein bands of interest were excised, destained in 50 mM ammonium bicarbonate with 50% acetonitrile (ACN), and dehydrated with 100% ACN, followed by drying in a vacuum evaporator. The destained proteins were proteolyzed with trypsin by the in-gel digestion method (Kim et al., 2013). The tryptic peptides were reconstituted in 7 µl of 0.1% formic acid, and a 5 µl aliquot was injected into a reverse-phase EASY-Spray PepMap RSLC C18 LC column (0.075 mm inner diameter × 500 mm length) on the Eksigent NanoLC Ultra system with an integrated column heater at 40°C; the column was pre-equilibrated with 96% buffer A (0.1% formic acid in water) and 4% buffer B (0.1% formic acid in ACN). The peptides were eluted with a 4%-35% gradient of buffer B over 150 min and a 32%-80% gradient of buffer B over 40 min. The total ion acquisition run time was set to 190 min at a flow rate of 250 nl/min. A Q-Exactive mass spectrometer (Thermo Fisher Scientific) was operated in data-dependent acquisition mode for the entire analysis. Full scans (m/z 300-1600) were acquired at a resolution of 70,000 using an automatic gain control (AGC) target value of 1e6 and a maximum ion injection time of 30 ms. Tandem mass spectra were generated for up to 12 precursors by high-energy collision dissociation using a normalized collision energy of 35%. The dynamic exclusion was set to 60 s. Fragment ions were detected in normal scan mode using an AGC target value of 5e4 and a maximum ion injection time of 120 ms. Source ionization parameters were as follows: spray voltage, 1.9 kV; capillary temperature, 275°C; and S-Lens RF Level, 50.
The RAW files from Q-Exactive were directly conveyed to the proteomic data analysis program suite, Proteome Discoverer v2.2 (Thermo Fisher Scientific). We employed the adjusted exemplar workflow for the SILAC experiment and the basic Sequest HT search module. Parameters for the Sequest search module were as follows: enzyme, specifically trypsin; protein database, UniProt reference proteome database of yeast (UP000002311, released in 04/2019) plus modified cRAP contaminant database with accessions in the dust/contact category (https://www.thegpm.org/crap/); fragment mass tolerance, 0.05 Da; precursor mass tolerance, 20 ppm; dynamic modifications, acetylation of protein-N-term (+42.010565 Da), oxidation of methionine (+15.9949 Da), label: 13C615N2 at lysine (+8.014199 Da), label: 13C615N4 at arginine (+10.008269 Da), Gly–Gly at Lys (Kε-GG) or protein-N-term (+114.042927 Da); and static modification, carbamidomethylation of cysteine (+57.021464 Da). Next, the identified spectra were validated using Percolator module, which was set to use a concatenated target/decoy strategy with a target cut-off q-value of 0.01 at PSM (peptide spectrum match) level. To integrate chromatographic features of identified SILAC pairs, we used Minora Feature Detector module with maximal trace retention time window of 5 min. Peptide abundance from the feature detector module was normalized with total sum of the abundance values over all identified peptides. Each result of the replicates was treated separately.
The generated mass spectrum files were directly matched to the peptide sequence in the customized REUb protein fasta file using the MS-GF+ search algorithm (v20190418) (Kim and Pevzner, 2014). The REUb sequence is a virtual one that replaces the two N-terminal residues of Ub MQ with RE. The search engine settings were as follows: trypsin enzyme specificity with 1 as the number of tolerable termini; 20 ppm for MS1 tolerance; variable modifications: oxidation of methionine (+15.9949 Da), label: 13C615N2 at lysine (+8.014199 Da), label: 13C615N4 at arginine (+10.008269 Da); and fixed modification: carbamidomethylation of cysteine (+57.021464 Da). All spectra matched to the Nt-peptide of the REUb were manually inspected. For annotating peptide spectra, we used a freely accessible version of the Interactive Peptide Spectrum Annotator (IPSA) (Brademan et al., 2019).
The amino acid sequence of
To detect the hypothetical REUb (Fig. 2A), we produced rabbit anti-REUb polyclonal antibody using a synthetic peptide, REIFVKTLTGKC, the 11-residue Nt-sequences of REUb and Cys for the KLH (keyhole limpet hemocyanin) conjugation. Dot immunoblotting analysis with the affinity-purified anti-REUb revealed the specific binding of the antibody to the REIFVKTLTGK-GST (glutathione
Moreover, the anti-REUb antibody specifically recognized REUbAA-GST (UbAA, a deubiquitylation-protective Ub variant with the mutation of C-terminal GG to AA), but not MQUbAA-GST, QUbAA-GST, or EUbAA-GST (Fig. 2C). In agreement with these dot blotting results, immunoblotting with anti-REUb further confirmed the specific recognition of purified REUb but not of MQUb; the REUb and MQUb were produced by the
We next attempted to determine the levels of endogenous REUb and REUb-linked protein species by immunoblotting with the purified anti-REUb. Notably, however, REUb and REUb–protein conjugates in wild-type
Since MQUb is a putative substrate of NatB Nt-acetylase that targets MQ, MN, ME, and MD at N-termini (Nguyen et al., 2018; Ree et al., 2018), we conjectured that Nt-acetylation might inhibit the NME-triggered Nt-modifications of MQUb. To rule out this possibility, we performed anti-REUb-based immunoblotting in the extracts from
To detect the REUb–protein adducts more effectively and sensitively, we performed anti-REUb-based immunoprecipitation, followed by immunoblotting with a highly Ub-specific monoclonal anti-Ub (P4D1) antibody, in extracts from
Ubr1 is as a key E3 Ub ligase of the Arg/N-degron pathway and directly detects the Nt-Arg of Nt-arginylated proteins via its UBR box (Choi et al., 2010). Interestingly, the levels of REUb and REUb-protein adducts were discernably decreased in
We also observed that REUb–protein adducts were strongly increased in wild-type cells in the presence of 2 mM H2O2, an oxidative stressor (Fig. 2I; lane 7). In agreement with results with the stationary-phase yeast, however, the REUb-conjugated proteins were hardly detected in
Given that oxidative stress increases the levels of REUb–protein conjugates, we next sought to identify the REUb-attached proteins by employing SILAC in the H2O2-treated wild-type cells and
Out of 541 distinct proteins detected by LC-MS/MS, 83 proteins were identified as direct or indirect REUb-binding candidates in the H2O2-treated wild-type cells (Fig. 3B). STRING enrichment analysis (for protein–protein interaction networks) indicated the functional associations of these putative REUb-interacting proteins with a set of cellular processes including gene transcription, vesicle trafficking, DNA replication, RNA processing, and mitochondrial regulation (Fig. 3C).
The tryptic digestion of Ub-conjugated proteins produces a K-ε-GG (KGG) Ub isopeptide, which allows mapping of the ubiquitylation sites of substrates by LC-MS/MS (Kirkpatrick et al., 2005). Using this Ub-remnant KGG signature-based approach, in the H2O2-treated yeast cells we identified the (putative) REUb-conjugated proteins and determined their ubiquitylation sites: Ub (43LIFAGKGGQLEDGR54); Fdc1 (a ferulic acid decarboxylase, 126THILSEEK133GG); Ino80 (a chromatin remodeling ATPase, 939NVQSELGDKGGIEIDVLCDLTQR959); Rpb3 (an RNA polymerase subunit, 155LTCVAKGGK161); Uso1 (a vesicle transport protein, 912ITEIKGGAINENLEEMK926); Sla1 (a cytoskeleton-binding protein, 358GIVQYDFMAESQDELTIK375GG); Ufd4 (a Ub-fusion decay E3 ligase, 549AINDQLIK556GG); Rad57 (a DNA repair protein, 405SFKGGASTIIQR414); Hda1 (a histone deacetylase, 561SKGGLNDELR568); and Rpl36a (a ribosome 60S subunit, 85AKGGVEEMNNIIAASR98) (Fig. 3D). In this setting, however, it should be noted that REUb might be attached not only directly to the identified proteins but also indirectly to them by way of the Ub moiety of the already Ub-tagged proteins.
Strikingly, the exhaust manual survey of the tandem MS/MS database revealed the REUb-derived REIFVK Nt-peptide, verifying the actual existence of REUb
To gain insight into the mechanistic role of REUb, we constructed its neural network-based 3D structure (for details, see Materials and Methods section). The overall surface structure of REUb was almost the same as that of MQUb, except its Nt-region (Figs. 4A and 4B). Specifically, the top and side views of the predicted models revealed that seven ubiquitylatable Lys residues and C-terminal Gly of REUb are located at almost the same surface positions as those of MQUb. Tellingly, however, Nt-Arg of REUb is stretched outward from the globular Ub protein, whereas MQUb retains its Nt-Met buried inward (Figs. 4A and 4B).
Given these findings, using the three-dimensional structure viewer programs we further estimated the H-bond acceptor–donor lengths of Nt-Met-Gln, Nt-Gln, Nt-Glu, or Nt-Arg-Glu of MQUb, QUb, EUb, or REUb with their surrounding residues, respectively (Figs. 4C-4F) (see Materials and Methods section for details). In MQUb, Nt-Met forms two H bonds with Val17 (each 2.69 Å in length), and Gln at position 2 forms two H bonds with Glu16 (2.54 Å) and with Glu64 (2.56 Å) (Fig. 4C). In contrast, both Nt-Gln of QUb and Nt-Glu of EUb form only one H bond with Glu64, resulting in H-bond acceptor–donor lengths of 2.77 Å and 2.67 Å, respectively (Figs. 4D and 4E). In the case of REUb, however, Nt-Arg forms two H bonds with Val17 (3.44 Å and 2.63 Å) and Glu at position 2 with Glu64 (2.66 Å). Notably, the H-bond acceptor–donor length between the amino group of Nt-Arg and the oxygen atom of Val17 in REUb was increased by 3.44 Å, relative to that in MQUb with 2.69 Å (Fig. 4F). These results suggest that Nt-Arg-Glu of REUb interacts more weakly with its surrounding residues Glu16, Val17, and Glu64 than Nt-Met-Gln of MQUb. Consequently, the Nt-Arg flexibly protruding from the Ub globule would increase the accessibility of REUb to the factors that recognize it.
In this study, we produced for the first time anti-REUb-specific antibody and detected the previously unrecognized REUb or REUb-conjugated proteins in
Using anti-REUb-based immunoprecipitation–tandem mass analysis, this study provides direct evidence of the existence of REUb
We found that REUb can be produced
The actual X-ray crystal structure of MQUb (PDB:4XOL, resolution 2.91 Å) reveals that the B factors of MQUb, Met-1, and Gln-2 are ~88.1 Å2, ~78.1 Å2, and ~90.5 Å2, respectively, reflecting the rigid property of Nt-Met-Gln region in MQUb. Accordingly, the relatively flexible exposure of the Nt-Arg moiety of REUb to its surface in comparison to Nt-Met of MQUb (Figs. 4A-4C and 4F and 4F) may provide a binding site for the factors that recognize it, such as Ubr1 that targets basic destabilizing Nt-residues including Nt-Arg for the Arg/N-degron pathway (Nguyen et al., 2019; Varshavsky, 2019).
Ubr1 contains at least three substrate-binding sites for basic (type-1) Nt-residues, bulky hydrophobic (type-2) Nt-residues, and internal degrons (Nguyen et al., 2019; Varshavsky, 2019). REUb is highly upregulated by stationary-growth phase and oxidative stress that most likely induces Ubr1 activation for protein quality control (Szoradi et al., 2018). If so, is Ubr1 mechanistically or functionally linked to the metabolic regulation of REUb under stress conditions and vice versa? As one of parsimonious functions, Ubr1 might mediate the degradation of the apparently faulty REUb or its derivatives which can perturb the operation of normal Ub molecule. However, such possibility seems to be low, because REUb and REUb–protein adducts were rather downregulated in the absence of Ubr1 (Fig. 2H). Conversely, the Nt-Arg-bearing REUb might occupy the type-1 substrate-binding site of Ubr1, thereby modulating the degradation of a wide range of its substrates, similarly to the hydrophilin protein Roq1 (Szoradi et al., 2018). The precise roles of Ubr1 in REUb regulation and the mechanism behind them remain to be examined.
While no specific enzymes for MQUb NME have been identified thus far, Met-aminopeptidases (Map1 or Map2) in yeast most likely remove Nt-Met of MQUb, as in the case of other MQ- or MN-starting proteins (Nguyen et al., 2019). However, the possibility that other dedicated proteases might mediate the NME of MQUb under specific conditions including oxidative stress cannot be ruled out.
The identification of REUb in this study would enormously increase the repertoire and complexity of the Ub code. In addition, REUb would be present and actively operate in multicellular organisms as well because NME, Nt-deamination, and Nt-arginylation are conserved in nearly all eukaryotes (Sriram et al., 2011; Varshavsky, 2019).
We are grateful to the current and former members of the Hwang and Lee laboratories for their assistance and advice. We also thank Edanz (www.edanz.com/ac) for editing a draft of this manuscript. This work was supported by grants from the Korean Government (MSIP) NRF-2020R1A3B2078127 and NRF2017R1A5A1015366 (C.-S.H.) and NRF-2020 R1A2C2003685 (C.L.), and the BK21 Plus program (C.-S.H.).
C.-S.H., C.L., K.T.N., and S.J. designed the research. K.T.N., S.J., S.-Y.K., and C.-S.L. performed the research and all coauthors analyzed the data. C.-S.H., C.L., K.T.N., S.J., and S.-Y.K. wrote the paper.
The authors have no potential conflicts of interest to disclose.
. Yeast strains, plasmids, and primers used in this study.
Material | Description or sequence | Source |
---|---|---|
Yeasts | ||
JD53 | MATα, his3-Δ200, leu2-3, 112 lys2-801, trp1-Δ63, ura3-52 | Dohmen et al., 1995 |
CHY367 | naa20::NatMX4 in JD53 | Lab collection |
CHY2009 | ubr1∆::KanMX6, naa20∆::NatMX4 in JD53 | Lab collection |
CHY2014 | ate1∆::HphNT2 in JD53 | Lab collection |
CHY2015 | ate1∆::HphNT2, naa20∆::NatMX4 in JD53 | Lab collection |
CHY2017 | nta1∆::HphNT2, naa20∆::NatMX4 in JD53 | Lab collection |
CHY3188 | arg4∆::KnaMX6 in JD53 | Lab collection |
CHY6056 | arg4∆::KanMX6, ate1∆::HphNT2 in JD53 | This study |
Plasmids | ||
pCH20 | pHUE | Catanzariti et al., 2004 |
pCH33 | pGEX4T-3 | Lab collection |
pCH5097 | haUb in pRK5 | Lim et al., 2005 |
pCH6913 | MQUbAA-GST in pHUE | This study |
pCH6914 | QUbAA -GST in pHUE | This study |
pCH6915 | EUbAA-GST in pHUE | This study |
pCH6916 | REUbAA-GST in pHUE | This study |
pCH7047 | MQIFVKTLTGK-GST in pHUE | This study |
pCH7048 | QIFVKTLTGK-GST in pHUE | This study |
pCH7049 | EIFVKTLTGK-GST in pHUE | This study |
pCH7050 | REIFVKTLTGK-GST in pHUE | This study |
Oligomers | ||
OCH6512 | 5′-GGTAAGCTTTCAACGCGGAACCAGATCCGATTT-3′ | |
OCH6968 | 5′-GATCCCGCGGTGGAAGAGAGATCTTCGTGAAGACCCTG-3′ | |
OCH6991 | 5′-GCTGAATTCATGTCCCCTATACTAGGTTATTGGAAAA-3′ | |
OCH7003 | 5′-GATCCCGCGGTGGAATGCAGATCTTCGTGAAGACCCTG-3′ | |
OCH8085 | 5′-GATCCCGCGGTGGAATGCAGATCTTCGTGAAGACTCTGACTGGTAAGACCGGTGCTGGAGCAGGTGCTATGTCCCCT-3′ | |
OCH8088 | 5′-GATCCCGCGGTGGAGAGATCTTCGTGAAGACCCTG-3′ | |
OCH8110 | 5′-GCTGAATTCAGCAGCTCTGAGACGGAGGAC-3′ |
Chang Hoon Ji, and Yong Tae Kwon
Mol. Cells 2017; 40(7): 441-449 https://doi.org/10.14348/molcells.2017.0115Shigeomi Shimizu
Mol. Cells 2018; 41(1): 50-54 https://doi.org/10.14348/molcells.2018.2215Seok Keun Cho, Moon Young Ryu, Pratik Shah, Christian Peter Poulsen, and Seong Wook Yang
Mol. Cells 2016; 39(8): 581-586 https://doi.org/10.14348/molcells.2016.0085