Serine and arginine-rich (SR) proteins are RNA-binding proteins (RBPs) known as constitutive and alternative splicing regulators. As splicing is linked to transcriptional and post-transcriptional steps, SR proteins are implicated in the regulation of multiple aspects of the gene expression program. Recent global analyses of SR-RNA interaction maps have advanced our understanding of SR-regulated gene expression. Diverse SR proteins play partially overlapping but distinct roles in transcription-coupled splicing and mRNA processing in the nucleus. In addition, shuttling SR proteins act as adaptors for mRNA export and as regulators for translation in the cytoplasm. This mini-review will summarize the roles of SR proteins as RNA binders, regulators, and connectors from transcription in the nucleus to translation in the cytoplasm.
Gene expression regulation is highly connected process. Transcriptional regulation is interdependent on post-transcriptional processes both in the nucleus and cytoplasm (Maniatis and Reed, 2002). Regulatory proteins, such as RNA-binding proteins (RBPs), play important roles throughout the gene expression program, from the transcription to translation (Glisovic et al., 2008). One such important family of regulatory RBPs is the Serine–Arginine (SR) family of proteins. In this mini-review, the essential functions of SR proteins in splicing regulation, with a focus on their roles as RNA binders in the ribonucleoprotein complex (RNP) will be summarized. I will also discuss their emerging regulatory roles in mediating and connecting post-transcriptional processes from the nucleus to the cytoplasm. More extensive reviews on SR proteins are suggested for a comprehensive understanding of these multifunctional regulators of RNA metabolism (Anko, 2014; Howard and Sanford, 2015).
SR proteins are RBPs mainly functioning in RNA splicing (Shepard and Hertel, 2009). The first SR proteins identified were SRSF1 (previously called SF2/ASF) and SRSF2 (previously called SC35) (Manley and Krainer, 2010). SR proteins are characterized by the presence of a C-terminal domain enriched with the Arginine (R) and Serine (S) amino acid sequences (RS domain) and an N-terminal RNA recognition domain (RRM domain) (Fig. 1). Additional SR proteins, RNA-binding SR-related proteins and other RS domain containing proteins have been identified (Long and Caceres, 2009). In general, RRM domains recognize RNA, whereas RS domains participate in diverse protein-protein and protein-RNA interactions. Most SR proteins are located exclusively in the nucleus, but some SR proteins (mainly SRSF1, SRSF3, and SRSF7) can shuttle between the nucleus and the cytoplasm. Having many such SR proteins with differential expression and regulatory patterns indicates possible non-redundant and distinct roles in pre-mRNA splicing (Zahler et al., 1993) as well as in other steps of gene expression, as will be discussed below.
Pre-mRNA splicing is an essential process, which occurs in the nucleus of eukaryotic cells. Splicing removes intervening introns from primary transcripts and joins exons to create mature mRNA by the spliceosome, a dynamic RNA-protein enzyme complex (Papasaikas and Valcarcel, 2016; Wahl et al., 2009). In addition to constitutive splicing, regulated splicing occurs to generate a large number of mRNA isoforms from given pre-mRNA by alternative splicing. In fact, large-scale transcriptome analysis revealed that up to 90% of human genes undergo alternative splicing (Pan et al., 2008). Moreover, splicing regulation is dynamically integrated to gene regulatory pathways (Braunschweig et al., 2013). Thus, alternative splicing greatly expands transcriptome as well as proteome diversities from a limited number of genes in a genome (Maniatis and Tasik, 2002; Weatheritt et al., 2016).
Splicing regulatory RBPs are thought to recognize distinct RNA sequences (splicing regulatory elements, SREs) and regulate splicing by “splicing code” (Fu, 2004; Wang and Burge, 2008). SREs include exonic splicing enhancers (ESE), exonic splicing suppressors (ESS), intronic splicing enhancers (ISE), and intronic splicing suppressors (ISS). In general, SR proteins bind ESE and enhance the splicing by recruiting the spliceosome. On the contrary, heterogeneous nuclear ribonucleoprotein (hnRNP) family proteins can antagonize the positive effect of SR proteins by binding to ESS or ISS and repressing splicing (Geuens et al., 2016).
SR proteins are important alternative splicing regulators. In contrast to the robust splicing enhancing effect in constitutive splicing, SR-mediated alternative splicing regulation is more complex and subtle. Alternative exons generally hold shorter length and weaker 5′ spliced sites (5′ss), so SR proteins induce the inclusion of the alternative exon through increasing the recognition of weak splice sites by splicing machinery. Since the regulatory roles of RBPs in alternative splicing are position- and context-dependent (Fu and Ares, 2014), locations of SR-RNA interaction influence splicing outcome. For example, exon-bound SR proteins act as enhancers, but intron-bound SR proteins may function as suppressors (Shen and Mattox, 2012). So the location of SR-RNA interactions affect spliceosome assembly and splice site selection (Erkelenz et al., 2013). In addition, SR proteins can act as activators or repressors in a context-dependent manner with other RBPs (Fu and Ares, 2014; Han et al., 2011a).
Since individual SR proteins are not functionally equivalent (Zahler et al., 1993), SR proteins are believed to bind RNA with unique or preferential specificity. Various analytical techniques for RNA–protein interaction have been utilized to determine the SR-binding RNA sequences. As an approach to identify RNA-binding sequences of SR proteins,
SELEX approaches have been applied to determine specific SR-binding RNA sequences (Long and Caceres, 2009). SRSF1- and SRSF7-binding consensus RNA sequences are largely purine-rich elements with different RNA sequences, whereas pyrimidine-rich RNA sequence elements have been found for SRSF3. Selected RNA aptamers are useful for biochemical determination of protein-interacting RNA motifs and for inferring cellular target RNAs (Bunka and Stockley, 2006; Kim et al., 2012). Nonetheless, repeated selection procedures tend to amplify RNA aptamers with unique
Functional SELEX approaches have been developed to identify the splicing regulatory elements (SREs) in the cells (Fig. 2B). Randomized sequences are introduced in specific splicing reporters at the positions within or near an alternative exon and their splicing outcome are selected to identify ESEs in
Functional SELEX identified purine-rich ESEs, which resemble the
Cellular RNP is formed by a complex interaction involving numerous RNAs and proteins, albeit with relatively low affinity (Jankowsky and Harris, 2015). Moreover, direct interaction between RBP and RNA is accompanied by indirect protein-protein interactions in RNP. Therefore, it is challenging to identify direct cellular RNA targets of RBPs and to map RBP binding RNA motifs. To identify cellular RBP-RNA interaction maps at a global scale, CLIP (cross-linking and immunoprecipitation) analysis has been developed. In the CLIP method, ultraviolet (UV) radiation is used to cross-link direct RBP-RNA interactions within a short distance (~1 Å) in the cells (Ule et al., 2005) (Fig. 2C). To gain insight into RBP-RNA interactions with precision, high throughput genomic techniques were combined with CLIP (CLIP-Seq or HITS-CLIP) (Konig et al., 2012). A couple of refinements were also made to the CLIP method to increase cross-linking efficiency (PAR-CLIP, Photoactivatable Ribonucle-oside-Enhanced CLIP) or the binding site mapping precision (iCLIP, individual-nucleotide resolution CLIP).
CLIP methods have been applied to many SR proteins to identify SR-RNA interactions. In the case of SRSF1, CLIP-Seq revealed a functionally diverse landscape of RNA targets and identified purine-rich consensus motifs (Sanford et al., 2009; Wang et al., 2011). In contrast, CLIP analyses of SRSF3 and SRSF4 revealed their interactions to non-overlapping target genes, and identified distinct
The SR-RNA interactome in the cell is likely to affect the SR-regulated transcriptome. Splicing-sensitive detection methods have been developed using known alternative splice junctions (Blencowe, 2006) or combined with deep sequencing and bioinformatics tools (Katz et al., 2010). RNA-Seq analysis has also been utilized to identify SR-regulated transcriptome and splicing outcome changes on a large scale. These analyses have been employed for SRSF1 (Anczukow et al., 2015), SRSF1/SRSF2 (Pandit et al., 2013) and SRSF3 (Ajiro et al., 2016). CLIP-analyzed SR-RNA interaction maps were compared with global SR-regulated splicing outcome. As expected, the splicing outcome of target RNA was shown to be dependent on multiple SR bindings onto target RNAs. In fact, cooperation and competition between the SRSF1 and SRSF2 proteins regulates alternative splicing events, which are related to synergistic and compensatory interactions to target RNA (Pandit et al., 2013). More extensive studies are required to understand the global splicing outcome from the collective contribution of many SR proteins.
Recent studies indicate that mRNA processing events are functionally coupled to transcription (Bentley, 2014). Because splicing is a pivotal regulatory step in gene expression (Kornblihtt et al., 2013), splicing regulatory proteins could couple the transcription step to post-transcriptional steps. SR proteins were initially discovered as proteins associated with transcriptionally active chromatin in amphibian germinal vesicle and
SR proteins directly or indirectly interact with the C-terminal domain of RNA polymerase II (Pol II-CTD) (Das et al., 2006; 2007; de la Mata and Kornblihtt, 2006; Sapra et al., 2009). In general, Pol II-CTD is the assembly site of various RNA processing factors on the transcription complex (Hsin and Manley, 2012; Munoz et al., 2010). However, SR proteins have been shown to associate with nascent RNA transcripts, rather than to be stably preassembled with Pol II-CTD (Sapra et al., 2009). Thus, co-transcriptional recruitment of SR proteins requires ongoing pre-mRNA synthesis, thereby facilitating the spliceosome assembly into pre-mRNA (Listerman et al., 2006; Sapra et al., 2009). Additionally, certain SR proteins (SRSF1 and SRSF3) bind to H3 tail and dynamically associate with chromatin (Loomis et al., 2009). Since some histone modifications regulate alternative splicing (Luco et al., 2010), the splicing regulatory function of SR proteins can also be regulated by histone modification and nucleosome occupancy (Luco et al., 2011).
It should be pointed out that nuclear SR proteins are also located in small nuclear bodies, called speckles (Fig. 3A). Speckles are enriched with many proteins required for the assembly and storage of splicing machinery, of which SR proteins are prominent components (Shepard and Hertel, 2009). Since co-transcriptional and post-transcriptional RNPs are found in speckles, how splicing is dynamically regulated in speckles remains to be understood (Han et al., 2011b). Also, relevant coupling factors for transcription and splicing need to be identified to elucidate the SR-mediated splicing mechanism in the nucleus.
Once transcription is initiated at the transcription start site (TSS), Pol II pauses at the site just downstream of TSS and requires elongation factors to allow it to proceed. Switching of the RNA Pol II complex from the initiation to the elongation complexes is important for functional transcription, which is mediated by P-TEFb kinase phosphorylating Ser2 position in CTD (Fig. 3A) (Jonkers and Lis, 2015). As assumed, most of the mRNA processing complexes are assembled during the elongation step of transcription (Perales and Bentley, 2009) So chromatin-associated and pol II-interacting mRNA processing proteins are likely to function in regulating transcription elongation (Allemand et al., 2008).
A direct role for SR proteins in transcriptional regulation has been shown for SRSF2. In contrast to shuttling SR proteins (such as SRSF1, SRSF3, and SRSF7), SRSF2 is a non-shuttling protein located in the nucleus. Interestingly, SRSF2 associates with DNA only, but not with cytoplasmic mRNA, suggesting a role restricted to the nucleus (Sapra et al., 2009). Recently, SRSF2 has been shown to mediate the release of paused Pol II by switching p-TEFb from inhibitory 7SK RNP, which in turn activates transcriptional elongation in collaboration with promoter-associated nascent RNA with ESE (Ji et al., 2013). Such a transcription regulatory point would link the recruitment of the splicing machinery to the transcription complex, ensuring the proper assembly of transcriptional and co-transcriptional machineries (Jonkers and Lis, 2015).
SR proteins also play a role in many nuclear RNA processes, since nuclear mRNPs are dynamically assembled and function in transcription, splicing, export and nuclear surveillance (Muller-McNicoll and Neugebauer, 2013). In fact, some SR proteins have been reported to be involved in 3′ end processing (Lou et al., 1998), mRNA packaging (Singh et al., 2012) and mRNA export (Huang and Steitz, 2005) as will be discussed later.
RNA modification is emerging as an important regulatory marks for RNA stability and translation in the cytoplasm (Zhao et al., 2017). In addition, RNA modification on N6-methyadenosine (m6A) was recently reported to be involved in alternative splicing in the nucleus. Reader protein for m6A (nuclear reader YTHDC1) recruits or blocks SR proteins, SRSF3 or SRSF10, respectively. Therefore, RNA modification modulates SR protein access to the binding regions of target mRNA (Xiao et al., 2016). Considering the emerging importance of RNA modification during various steps of RNA metabolism, it will be interesting to decipher the mechanistic details of how m6A can regulate SR protein functions.
As was initially identified with a monoclonal antibody detecting phosphorylated SR proteins (mAb104) (Roth et al., 1990), most of the nuclear SR proteins are phosphorylated. Since SR proteins contain a RS domain with repeated Arg/Ser sequences, they are subjected to phosphorylation by SR specific kinases. The phosphorylation status of SR proteins is related to their functions in spliceosome assembly and their localization inside the nucleus as well as in the cytoplasm (Zhou and Fu, 2013).
Two families of kinases are major regulators of SR phosphorylation: SR protein-specific kinases (SRPKs) (Gui et al., 1994) and Cdc-2 like kinases (CLKs) (Colwill et al., 1996b). The subcellular localizations and substrate specificities of these two kinase families are distinct. SRPKs are detected both in the cytoplasm and in the nucleus, whereas CLKs are constitutively located in the nucleus and co-localize with SR proteins in nuclear speckles (Colwill et al., 1996a; 1996b). SRPKs are retained in the cytoplasm by molecular chaperons; upon activation by EGF growth factor, they can be translocated to the nucleus and cause changes in the alternative splicing of many genes (Zhong et al., 2009; Zhou et al., 2012). In contrast, nuclear CLKs are activated by osmotic and heat-shock stresses, so SR proteins are re-phosphorylated by CLKs during the recovery phase of stress (Ninomiya et al., 2011). The two SR kinase systems appear to act symbiotically for proper phosphorylation of SR proteins and splicing regulation (Aubol et al., 2016). Thus, the roles of the dual SR kinases are interrelated and contribute in a coordinated fashion toward protein phosphorylation and localization in response to different stimuli (Corkery et al., 2015; Ghosh and Adams, 2011). Additionally, de-phosphorylation and re-phosphorylation of SR proteins seem to be important for cytoplasmic functions (Huang et al., 2004; Sanford et al., 2005).
Since gene expression in cells is interconnected from the nucleus and to the cytoplasm (Moore and Proudfoot, 2009), nuclear SR proteins could be a connector for cytoplasmic events. In fact, most SR proteins dynamically localize to speckles and chromatin in the nucleus; but some SR proteins (shown as shuttling SR proteins in Fig. 1) can export out to the cytoplasm and shuttle continuously between the nucleus and the cytoplasm (Caceres et al., 1998). In general, export adaptors (such as TREX complex) link transcription to export of mRNA which is mediated by the export receptor, Nuclear export factor 1 (NXF1/TAP) (Wickramasinghe and Laskey, 2015). In addition, two shuttling SR proteins (SRSF3 and SRSF7) are export adaptors involved in specific mRNA export (Fig. 3B) (Huang, 2001; Huang et al., 2003). Structural analysis revealed that they interact with TAP export receptor via Arginine-rich peptide adjacent to RRM of SR proteins (Hargous et al., 2006).
Recent CLIP analyses of SR proteins (SRSF1 to SRSF7) and the NXF1 export receptor revealed co-binding of SR proteins to the export receptor. Among many SR proteins, SRSF3 emerges as the most potent adaptor for the NXF1 adaptor (Muller-McNicoll et al., 2016). As discussed above, the phosphorylation status of SR proteins is linked to the locations and functions of SR proteins. Dephosphorylated SR proteins act as export adaptors for specific mRNA (Huang et al., 2004; Sanford et al., 2005). Considering the critical role of mRNA export during gene expression regulation, the mechanisms underlying the role of SR proteins as molecular connectors from nuclear mRNA processing to cytoplasmic translation need to be elucidated.
Instead of acting as passengers in the mRNP journey from nucleus to cytoplasm, shuttling SR proteins actively engage in mRNA decay and translation thereby determining the ultimate fate of the bound mRNAs (Fig. 3C) (Huang and Steitz, 2005). Since spliced mRNP is assembled by EJC along with SR proteins (Singh et al., 2012), mRNA decay process enhanced by EJC assembly on mRNA, such as non-sense mediated decay (NMD), can be regulated by SR proteins (Popp and Maquat, 2014). In fact, SRSF1 have shown to enhance NMD of the premature termination codon (PTC) containing model
Other shuttling SR proteins, such as SRSF3 and SRSF7, can also function in the translation process. SRSF3 has been shown to regulate Internal Ribosomal Entry Site (IRES)-mediated translation initiation (Bedard et al., 2007), whereas SRSF7 plays a role in translation of un-spliced viral RNA containing Constitutive Transport Element (CTE) (Swartz et al., 2007). In the case of
Gene expression factory is formed by integration of transcription and RNA-processing machineries, which is mediated by mRNPs (Maniatis and Reed, 2002). Since mRNPs connect nuclear transcription and pre-mRNA processing to downstream cytoplasmic events (Moore and Proudfoot, 2009), SR proteins could be important parts of the gene expression machinery by forming various forms of mRNPs. Future research should be directed to identify diverse forms of SR-mRNPs in the nucleus as well as in the cytoplasm to understand multifunctionality of SR proteins.
Splicing regulation is performed by specific RBPs via their interactions with cis-acting regulatory elements on primary transcripts. Thus, SR-regulated splicing is likely to be modulated by many other RBPs. Recent proteomic analyses have indicated a large number of proteins that may be directly or indirectly linked to mRNA isoform production (Castello et al., 2016; Ray et al., 2013). Therefore, it will be important to identify known and novel RBPs relevant to SR-regulated splicing.
Master splicing regulators play important roles in generating the transcriptome (Jangi and Sharp, 2014); thus, SR proteins are involved in qualitative (splicing) and quantitative (transcription) regulations of gene expression program in the cells. Since alternative splicing contributes to cellular physiology in various environments (Kalsotra and Cooper, 2011), it will be important to identify signaling pathways and critical signaling molecules relevant to SR protein regulation. Especially, the extracellular signals or environmental cues required for the regulation of SR protein expression should be investigated. It will be interesting to dissect the signaling pathways involved in SR protein modifications, including phosphorylation.