Introduction
The
Target Of Rapamycin (TOR) was first identified in
Saccharomyces cerevisiae by Hall and colleagues through a genetic screen for mutants resistant to the antifungal activity of the drug Rapamycin (
Heitman et al., 1991). The Hall group went on to show that there are two yeast TOR genes, both of which encode kinases (
Kunz et al., 1993;
Helliwell et al., 1994), and the mammalian homolog mTOR was discovered and published by four groups independently thereafter (
Brown et al., 1994;
Chiu et al., 1994;
Sabatini et al., 1994;
Sabers et al., 1995). Since these seminal studies, (m)TORC1 has emerged as the key driver of cell biogenesis throughout the eukaryotes: (m)TORC1 activity has been shown to be stimulated by growth factor signaling and nutrients such as glucose and amino acids, whereas under conditions of stress, such as starvation, energy or oxygen deficiency, or DNA damage, (m)TORC1 activity is robustly supressed (
Laplante and Sabatini, 2012).
The core components of (m)TORC1 are (m)TOR, its catalytic core,
regulatory-associated protein of mTOR (RAPTOR; Kog1 in yeast) (
Kim et al., 2002;
Loewith et al., 2002), and
mammalian lethal with Sec13 8 (mLST8; Lst8 in yeast) (
Roberg et al., 1997;
Chen and Kaiser, 2003;
Kim et al., 2003).
Proline-rich Akt substrate of 40 kDa (PRAS40) and
DEP domain-containing partner of TOR (DEPTOR) are additional components which complete the roster, although these proteins are inhibitors of (m)TOR kinase activity (
Vander Haar et al., 2007;
Oshiro et al., 2007;
Sancak et al., 2007;
Peterson et al., 2009). When activated, (m)TORC1 phosphorylates substrates with roles in stimulating ribosome biogenesis, protein and lipid synthesis, and inhibition of autophagy (
Laplante and Sabatini, 2012;
Kennedy and Lamming, 2016). Many of these proteins feature a
TOR signaling (TOS) motif, characterized by alternating hydrophobic and acidic residues, which is specifically recruited by RAPTOR (
Schalm and Blenis, 2002;
Nojima et al., 2003;
Schalm et al., 2003).
(m)TORC1 activity is understood to be principally controlled by the nucleotide state of three small GTPases. Full mTORC1 activation requires its recruitment to the lysosome by heterodimeric Rag GTPases (
Sancak et al., 2008), whereupon GTP-bound Rheb GTPases allosterically activate the kinase (
Long et al., 2005;
Sato et al., 2009) (Fig. 1).
Signals are relayed to mTORC1 through these small GTPases, whose nucleotide binding states are regulated by a series of important protein complexes. The
TSC complex (TSCC) acts as a
GTPase activating protein (GAP) for Rheb, employing a GAP domain within the TSC2 subunit (
Castro et al., 2003;
Garami et al., 2003;
Inoki et al., 2003;
Saucedo et al., 2003;
Stocker et al., 2003;
Tee et al., 2003;
Zhang et al., 2003). Upstream regulation of the Rag GTPases is more baroque, featuring at least four large protein complexes including: GATOR1, a GAP for RagA/RagB/Gtr1 (
Neklesa and Davis, 2009;
Bar-Peled et al., 2013;
Panchaud et al., 2013a); FLCN-FNIP (Lst4-Lst7 in yeast), a GAP for RagC/RagD/Gtr2 (
Péli-Gulli et al., 2015;
Petit et al., 2013;
Tsun et al., 2013); GATOR2, an inhibitor of GATOR1 (
Bar-Peled et al., 2013;
Panchaud et al., 2013a,
2013b); and, KICSTOR, which is thought to regulate the GATOR complexes (
Peng et al., 2017;
Wolfson et al., 2017). The LAMTOR/Ragulator complex is thought to be a
guanine nucleotide exchange factor (GEF) for the RagA/B GTPases in mammals (
Bar-Peled et al., 2012), however GEFs for Rheb and RagC/D are yet to be identified. The current consensus understanding within the field is that mTORC1 is only active when both the Rag and Rheb GTPase activation pathways are fully activated, neither being sufficient in isolation.
Recent technological advances have enabled structural and mechanistic characterization of many technically challenging complexes such as mTORC1, (m)TORC2, Ragulator-Rag, and most recently GATOR1 and GATOR1-Rag, shedding increasing light on this key signaling pathway. However, many of the most important upstream components, including the TSCC, GATOR2 and KICSTOR complexes, remain either only partially or completely uncharacterised at a molecular level. Here we provide a survey of the current understanding within the field, first covering the nutrient sensing pathway leading to the Rag GTPases and (m)TORC1 recruitment, and then the plethora of signals communicated through Rheb to (m)TORC1 activation. Finally, we detail the remaining gaps that must be addressed before the complete picture can emerge.
Diverse, independent and ambivalently conserved cytoplasmic sensor proteins detect and signal nutrient insufficiency through the Rag pathway
While it has long been known that (m)TOR responds to nutrient availability (
Blommaart et al., 1995;
Hara et al., 1998), the direct inputs to (m)TORC1 signaling remained unclear. However, several human proteins are now recognized as key metabolite sensors for the mTORC1 pathway. These include SESN1 and SESN2 (
Chantranupong et al., 2014;
Parmigiani et al., 2014;
Peng et al., 2014), CASTOR1 (
Chantranupong et al., 2016), and SAMTOR (
Gu et al., 2017), and may also include SLC38A9 (
Jung et al., 2015;
Rebsamen et al., 2015;
Wang et al., 2015) and the vacuolar (v)-ATPase (
Zoncu et al., 2011) which have been shown to affect mTORC1 activity in response to amino acid availability. Homologs of only the SESNs have been detected by sequence analysis in other higher eukaryotes such as
C. elegans and
D. melanogaster, and the known sensor proteins in general are only present in few fungal species, potentially representing species-specific nutritional dependencies (
Wolfson and Sabatini, 2017). A great deal of work needs to be done at the periphery of the (m)TORC1 pathway to discover exactly where and how the first steps in nutrient sensing take place, as this is one of its least well-understood aspects. Recent structural and functional studies of the known elements of the pathway have successfully yielded mechanistic insights into how some of these proteins link metabolite detection to mTORC1 signaling.
Sestrins
The Sestrin/SESN proteins were first identified several years before being linked to mTORC1 signaling (
Velasco-Miguel et al., 1999;
Budanov et al., 2002;
Peeters et al., 2003), in a study demonstrating that expression of SESN1 and SESN2 strongly inhibited phosphorylation of the well-characterized mTORC1 substrates
ribosomal protein S6 kinase 1 (S6K1) and
eukaryotic initiation factor 4E binding protein 1 (4EBP1) (
Budanov and Karin, 2008). SESN2 interacts with GATOR2 (
Chantranupong et al., 2014;
Parmigiani et al., 2014;
Kim J S et al., 2015), with the interaction conditionally strengthened by both amino acid deprivation (
Chantranupong et al., 2014) and DNA damage-induced stress (
Parmigiani et al., 2014). Ectopic SESN2 expression prevents mTORC1 localization to the lysosome in the presence of amino acids (
Parmigiani et al., 2014;
Peng et al., 2014), while reducing GATOR1-GATOR2, and increasing GATOR1-RagB, association (
Kim J S et al., 2015), hence acting as a negative regulator of mTORC1 signaling. Experimentation with individual amino acids led to the discovery that the interaction between SESN2 and GATOR2 is leucine-dependent (Wolfson et al., 2016), and intrinsically coupled to its interaction with GATOR2, and consequently to mTORC1 regulation, as the expression of a GATOR2 binding-deficient mutant, SESN2
S190W, does not restore leucine sensitivity to SESN1-3 deficient cells. It has alternatively been suggested to act as a guanine nucleotide dissociation inhibitor (GDI) for RagA (
Peng et al., 2014) however subsequent structural studies have not supported this hypothesis. It may be worthwhile considering, in the face of newer observations, that Peng and colleagues did not consider the regulation of SESN2 by amino acids, and therefore SESN2 GDI activity may belong to the
apo-protein; however, in this instance their observation between a direct SESN2-RagA/B interaction has not been recapitulated by other groups, and therefore RagA/B-GDI activity remains unproven.
Crystal structures of SESN2 (
Kim H et al., 2015;
Saxton et al., 2016b) proved particularly revealing. SESN2 shares a common fold with the
carboxymucolactone decarboxylase (CMD) protein family, possessing globular CMD-like domains at either terminus, separated by a partially disordered linker (Fig. 2A). CMD family members are commonly homodimers, and therefore SESN2 represents an intramolecular “dimer” of two CMD-like domains, with the N- and C-terminal domains (NTD and CTD, respectively) juxtaposed around a twofold pseudosymmetry axis (Fig. 2B). SESN2 coordinates a single leucine molecule in a pocket in the CTD (Fig. 2C), adjoined by part of the linker, which packs against the side of the pocket. Saxton and colleagues identify an inter-helix loop in the CTD, containing three important threonine residues (T374, T377 and T386), which closes over the binding pocket and is responsible for providing specificity for leucine binding. This closure is thought to relay a conformational change from the binding pocket to the putative GATOR2 interaction surface, of which two residues, D406 and D407, were identified as necessary for GATOR2 recruitment (
Kim H et al., 2015;
Saxton et al., 2016b). Further mechanistic detail of this interaction remains elusive however, given the lack of any substantial structural or mechanistic understanding of the action of GATOR2.
Taken together, the functional and structural data suggest that SESNs act as negative regulators of mTORC1 signaling by inhibiting the function of GATOR2 in the
apo state. SESNs have also been suggested to constitute a switch between biogenesis and autophagy given the observation that ULK1, a kinase controlling the induction of autophagy, is able to stimulate phosphorylation of SESN2 either directly or indirectly (
Kimball et al., 2016). It seems likely that several stress signals are integrated through the SESNs, and thus they may represent an important, multifaceted sensor upstream of not only mTORC1, but related signaling circuits as well.
CASTORs
The CASTOR proteins (originally
GATSL2 and
3) were identified as through their activity as binding partners of the GATOR2 components WDR24, WDR59 and MIOS (
Huttlin et al., 2015), although only subsequent biochemical verification of these interactions linked them to the mTORC1 pathway, with GATSL3 renamed CASTOR1, for
cellular arginine sensor for mTORC1, and GATSL2 CASTOR2 (
Chantranupong et al., 2016). CASTOR1 and 2 homo- and heterodimerize with one another, with the CASTOR1 homodimer interacting with GATOR2 most strongly. CASTORs contain tandem
aspartate kinase, chorismate mutase and TyrA (ACT) domains (Fig. 2A), which have an evolutionarily conserved function in binding small molecules. In this case the CASTOR1-GATOR2 interaction is arginine-dependent, with arginine supressing the association of CASTOR1 with GATOR2 to sense arginine upstream of mTORC1. CASTOR2 binding to GATOR2, on the other hand, was amino acid-insensitive; thus, CASTOR2 may serve to retain responsivity of mTORC1 to nutrient signals through CASTOR1 by damping the initial activation of the kinase, affecting the properties of signal feedback as opposed to direct amino acid detection.
The crystal structure of the arginine-bound CASTOR1 homodimer (
Gai et al., 2016b;
Saxton et al., 2016a;
Xia et al., 2016) revealed that each monomer contains four ACT domains (Fig. 2D), and arginine binds to each monomer in a deep cleft at the intramolecular ACT2/4 interface (Fig. 2E). Mutation of critical residues D304, which coordinates the arginine guanidinium group, and S111, which binds the free amine, resulted in complete loss of arginine binding, as well as constitutive GATOR2 binding and strong mTORC1 inhibition even in the presence of arginine. Residues Y118, Q119 and D121 were identified as necessary for the GATOR2 interaction (
Saxton et al., 2016a;
Gai et al., 2016b) and further noted to form patches on the same side of the CASTOR1 homodimer, which are distinct surfaces to the arginine binding pocket (
Gai et al., 2016b). A loop comprising residues 270-280 sits over the arginine binding site and ‘locks’ the bound amino acid in place, however is disordered in the
apo-state (Guo and Deng; PDB ID: 5GT8) (Fig. 2F). This observation supports the hypothesis that this key loop mediates a conformational switch between the
apo- and occupied states, allosterically modulating the GATOR2 binding site.
It is noticeable that the proposed mechanism is reminiscent of that implied for the SESNs, although the means by which this small conformational change affects the GATOR1-GATOR2 interaction remains unknown. Intriguingly, the binding of the CASTOR1 homodimer to GATOR2 is not competitive with SESN2: both proteins can bind the GATOR2 complex simultaneously. WDR24 and SEH1L are known to form a minimal complex sufficient for interaction with SESN2 (
Parmigiani et al., 2014), and these GATOR2 components, with the addition of MIOS, were also identified as sufficient to bind CASTOR1 (
Chantranupong et al., 2016;
Gai et al., 2016b). It will remain for future structural and mechanistic studies of the GATOR2 complex and interacting proteins to unpick this knot, however the available evidence would imply that signal integration appears likely to occur at the GATOR2 binding step.
SAMTOR
SAMTOR, originally C7orf60, has been recently defined an
S-adenosylmethionine sensor upstream of mTOR in mammals (
Gu et al., 2017). Sequence analysis of SAMTOR suggests the presence of a class I Rossman fold, typically found in methyltransferases (Fig. 2A), which has been corroborated by in vitro binding assays establishing micromolar affinity binding coefficients for both S-adenosylmethionine (SAM) and S-adenosylhomocysteine (SAH). SAM binding appears to constitute a physiologic signal of methionine sufficiency, and overexpression of even wild type SAMTOR is sufficient to inhibit mTORC1 signaling, implying that the
apo-form represents the negative regulator, as for the SESN and CASTOR proteins. SAMTOR is thought to interact with the GATOR1 and KICSTOR complexes simultaneously (
Gu et al., 2017;
Wolfson et al., 2017), possibly through a tripartite interface given that all three proteins are required for any interaction. Furthermore, SAM/SAH binding reduces the association of SAMTOR with GATOR1-KICSTOR, analogously to the way in which leucine and arginine disrupt the interaction with GATOR2 of SESN2 and CASTOR1 respectively. Further development of the exact mechanisms by which SAMTOR achieves its function must await structural studies of the protein and its complexes.
Further nutrient sensors – an established paradigm
The presence of sensors specific for leucine and arginine in mammalian cells likely reflect their metabolic importance. For example, leucine is one of two amino acids which are completely ketogenic, and is known to potently activate muscle protein synthesis (
Grabacka et al., 2016;
Norton and Layman, 2006); additionally, arginine is the precursor of nitric oxide, ornithine and urea, all key molecules at the intersections of several critically important metabolic pathways (
Morris, 2006). It appears highly likely that further nutrient sensors feeding into the (m)TORC1 signaling pathway await discovery given the great diversity in nutrient signals to be taken account of and the lack of orthologs of even the most well-established sensors (
Wolfson and Sabatini, 2017). Although it is likely evolutionarily redundant to encode sensors for isoleucine and valine, the other branched-chain amino acids, it is tempting to speculate that specific sensors may have evolved for more metabolically unique amino acids such as histidine, and other key nutrients. Those discovered to date are diverse in structure, but follow an established paradigm by interacting with further proteins within the pathway in the case of insufficiency rather than sufficiency of the nutrient in question. This has the evolutionary benefit of allowing a small number of conserved complexes to trivially integrate a much larger number of diverse repressive inputs, enabling sensitivity while retaining responsivity in downstream signaling.
Nutrient signaling inhibits (m)TORC1 recruitment to the lysosome by controlling the nucleotide state of the RagA/B GTPases
As our understanding of mTORC1 signaling in response to specific stimuli increases, inputs such as the levels of individual metabolites to the integrator complexes are being recognized as increasingly important (
Wolfson and Sabatini, 2017). These complexes, comprising GATOR1 and GATOR2 (
Dokudovskaya et al., 2011;
Bar-Peled et al., 2013;
Panchaud et al., 2013a), and KICSTOR (
Peng et al., 2017;
Wolfson et al., 2017) constitute a network of integrators which filter the available chemical inputs to produce discrete on-or-off signals to the (m)TORC1 machinery. Though these complexes remain incompletely structurally characterized, a developing body of work is enabling us to understand how signals are integrated and communicated through Rag A/B.
The GATOR complexes
GATOR complexes 1 (GATOR1) and GATOR2, for
GAP Activity Toward Rags, are key evolutionarily conserved signal integration and transduction complexes that control the nucleotide state of the RagA/B GTPases. They were first identified and characterized in
S. cerevisiae, as
Seh1-associated complex (SEAC)
inhibiting TORC1 (SEACIT) and
SEAC activating TORC1 (SEACAT) respectively (
Dokudovskaya et al., 2011). GATOR1 is a negative regulator of (m)TORC1 signaling, whereas GATOR2 inhibits GATOR1 in turn, thereby acting as a positive regulator. SEACIT/GATOR1 consists of three proteins, Sea1/Iml1, Npr2 and Npr3; while SEACAT/GATOR2 comprises five, Seh1, Sec13, Sea2/Rtc1, Sea3/Mtc5 and Sea4. Whereas Sea1-Sea4 were previously uncharacterised, Npr2 and Npr3 were previously annotated with roles in autophagy, and shown to form an evolutionarily conserved heterodimer (
Neklesa and Davis, 2009;
Wu and Tu, 2011). Neklesa and colleagues also provided the first lines of evidence that the human homologs,
Npr2-like (NPRL2) and NPRL3, act as repressors of mTORC1. Subsequently, orthologous mammalian complexes were identified and categorised, with GATOR1 containing Iml1 homolog DEPDC5, NPRL2 and NPRL3; and GATOR2, consisting of SEC13, SEH1L, Sea2 homolog WDR24, Sea3 homolog WDR59, and Sea4 homolog MIOS (
Bar-Peled et al., 2013). Both the human and
S. cerevisiae GATOR1 complexes possess evolutionarily conserved GAP activity toward the Gtr1/RagA/B GTPase, while GATOR2 inhibits the activity of GATOR1 through an as-yet-undetermined mechanism (
Bar-Peled et al., 2013;
Panchaud et al., 2013a).
Bioinformatic analysis of the
S. cerevisiae proteins revealed predicted architectures commonly found in vesicle coatomer and tethering proteins (
Lee and Goldberg, 2010;
Faini et al., 2013;
Balderhaar and Ungermann, 2013), and these architectures are largely conserved in the mammalian homologs, especially in GATOR2 components (
Dokudovskaya et al., 2011) (Fig. S1A). Sec13/SEC13 and Seh1/SEH1L are both composed of six-bladed
b-propellers (Fig. S1B and 1C, respectively) that commonly serve as sites of protein–protein interactions (
Xu and Min, 2011), and are subunits directly shared with coatomers and nuclear membrane complexes. The structure of
S. cerevisiae Sec13 was first solved at high resolution as part of a Sec13-Sec31 COPII subcomplex (
Fath et al., 2007), before both yeast and human forms were later resolved as part of numerous nuclear pore complex (NPC)-related (
Hsia et al., 2007;
Brohawn and Schwartz, 2009;
Nagy et al., 2009;
Stuwe et al., 2015;
Kelley et al., 2015) and COPII-related (
Whittle and Schwartz, 2010;
Bharucha et al., 2013; Zanetti et al., 2013) subcomplexes. Seh1, like Sec13, has been consistently resolved at high resolution as part of NPC-related subcomplexes (
Brohawn et al., 2008;
Debler et al., 2008;
Stuwe et al., 2015). WDR24, WDR59 and MIOS are notably also predicted to contain
b-propeller domains at their N-termini, separated from C-terminal RING domains, commonly associated with E3 ubiquitin ligases (
Metzger et al., 2012). WDR59 is predicted to contain a
RING finger and WD domain-containing proteins and DEAD-like helicases (RWD) domain following the
b-propeller, which are a common architectural motif observed in kinetochore proteins (
Schmitzberger and Harrison, 2012). A structural modeling approach was used to derive an architecture for the GATOR complex based on bioinformatic predictions and interactivity assays (
Algret et al., 2014), however the lack of homologous structures for many of the GATOR components render this model highly speculative. Nonetheless in the proposed model the
b-propellers of Seh1, Sec13, Sea2 and Sea4 form a cluster, similar to that seen in COPI and COPII vesicle coat complexes (
Fath et al., 2007;
Lee and Goldberg, 2010). This exchange with the membrane trafficking machinery raises the possibility that mTORC1 is also regulated by signals originating in nucleocytoplasmic transport and vesicle transport processes through the Rag GTPase axis (
Panchaud et al., 2013b).
While GATOR2 remains structurally uncharacterised, the structure of the mammalian GATOR1 complex was recently resolved by cryo-EM (
Shen et al., 2018), revealing the architecture of each GATOR1 component (Fig. 3A) and the structure of a stable complex with the Rag GTPases (Fig. 3B). NPRL2 and NPRL3 form a pseudosymmetric heterodimer with their N-terminal longin domains and CTDs, interacting together with the DEPDC5
Structural Axis for Binding Arrangement (SABA) domain. The DEPDC5
Steric Hindrance for Enhancement of Nucleotidase activity (SHEN) domain interacts with GTP analog-bound RagA proximal to the nucleotide binding pocket (Fig. 3C), however the interaction captured in the structure does not appear to be responsible for stimulation of GTP hydrolysis. Kinetic assays for GTP hydrolysis suggest that a weaker interaction mediates true GAP activity, and this is thought to be performed by NPRL2-NPRL3 (
Shen et al., 2018,
2017). GATOR1 is thus presumed to interact with the Rag heterodimer in two ways: the ‘inhibitory’ conformation, captured in the resolved cryo-EM structure, which binds to GTP-bound RagA with higher relative affinity but insufficient to activate GTP hydrolysis, and a lower-affinity interaction dependent on the NPRL2-NPRL3 heterodimer that stimulates GTPase activity. NPRL2 and NPRL3 form a heterodimer in part using their N-terminal longin domains (
Zhang D et al., 2012;
Shen et al., 2018). It is intriguing that a recent structure of the
Chaetomium thermophilum Mon1-Ccz1-Ypt7 complex highlighted that the Mon1-Ccz1 GEF contacts its cognate GTPase Ypt7 using one face of a conserved longin domain heterodimer (
Kiontke et al., 2017). The conservation of this domain architecture in GTPase-regulator interactions suggests that the longin domains of NPRL2-NPRL3 may well bind the Rag GTPases in a similar way. Studies of the NPRL2-NPRL3-Rag heterodimer interaction are required to provide insights into the catalytic GAP mechanism employed by GATOR1, and hence enable us to understand exactly how these GATOR1-Rag interactions differ to regulate the GTPases. The physiologic relevance of the inhibitory binding mode is proposed to be to dampen mTORC1 activation in response to sustained nutrient sufficiency, although such bimodal activity has not previously been observed between a GTPase and its GAP. It is important to note that NPRL2 and NPRL3 are believed to constitute the GATOR2 binding site of GATOR1 (
Shen et al., 2018). Since NPRL2 and NPRL3 are also proposed to constitute the weakly binding GAP for the Rag GTPases, it is plausible that GATOR2 inhibits GATOR1 GAP activity by sterically hindering binding of NPRL2-NPRL3 to RagA/B.
Mutations in the mammalian GATOR1 genes have been recently linked to focal epilepsies and familial cortical dysplasia, previously not considered to have an underlying genetic basis, supporting a multifaceted role for these proteins in multiple cellular processes (
Baldassari et al., 2016;
Ricos et al., 2016). Detailed characterization of the interactions between mammalian GATOR2 and GATOR1 will likely provide some insights into the associated pathophysiologies, since unlike in
S. cerevisiae, the mammalian complexes do not appear to form a stable octamer, instead performing their functions as independent complexes dynamically associating in mammalian cells (
Bar-Peled et al., 2013;
Panchaud et al., 2013b).
KICSTOR
The KICSTOR complex is also a key component of the RagA/B control pathway, responsible for the localization of the Rag-GAP GATOR1 to its GTPase substrates. It consists of the proteins KPTN, ITFG2, C12orf66, and SZT2, whose initial letters give the complex its name, which were identified as DEPDC5 interactors and form a distinct complex (
Wolfson et al., 2017), while independently SZT2 was found as part of the SESN2 interactome (
Peng et al., 2017). The integrity of KICSTOR depends on the largest subunit, SZT2; this component is also responsible for the interaction of KICSTOR with GATOR1, as well as GATOR2, although the latter interaction is indirect (
Wolfson et al., 2017). Knockout studies targeting SZT2 impair the localization of GATOR1 to lysosomes, but not that of GATOR2 or the Rag GTPases. The SZT2 component appears to be necessary for coordinated GATOR1 and GATOR2 binding, and is necessary for GATOR1-dependent inactivation of mTORC1 on the lysosome, as its loss results in constitutive mTORC1 activity and lysosomal localization (
Wolfson et al., 2017;
Peng et al., 2017).
SZT2 is the largest subunit of KICSTOR, at 3432 residues long, however it exhibits very little homology with any other known proteins or domains (Fig. S2A). Bioinformatic predictions indicate a possible
von Willebrand factor (vWA)-like domain toward the N terminus, which is commonly associated with cell-cell adhesion (
Whittaker and Hynes, 2002). ITFG2 and KPTN, both much smaller proteins, are both predicted to have almost completely
b-stranded architecture, also featuring putative adhesion motifs: ITFG2 with two atypical FG-GAP motifs frequently found in
a-integrins (Springer, 1997); and, KPTN with a VCBS motif found in some fungal and bacterial adhesion proteins (
Wenter et al., 2010). The FG-GAP repeats as found in integrin
a-subunits form a
b-propeller (
Xiong et al., 2001), consistent with the prediction for the structure of ITFG2. C12orf66, on the other hand, was predicted to be almost completely
a-helical, with a putative coiled-coiled region at its N terminus followed by a domain of unknown function (DUF) with predominantly
a-helical architecture. The crystal structure of the C-terminal DUF from the
Mus musculus C12orf66 homolog (PDB ID: 2GNX) (Fig. S2B) revealed a helical bundle architecture for this domain, similar to that of the classical longin domain. The structure is reminiscent of SNARE proteins, which form helical bundles at membrane contact/fusion sites, and notably form a subfamily of longin domain-containing proteins (
De Franceschi et al., 2014;
Daste et al., 2015), but no further structural information is currently available.
Mutations in SZT2 has been linked to epilepsy (
Frankel et al., 2009;
Basel-Vanagaite et al., 2013), and genetic aberrations in KPTN and C12orf66 have been linked to brain malformations (
Baple et al., 2014;
Mc Cormack et al., 2015;
Pajusalu et al., 2015). Consistent with the notion that these abnormalities are associated with mTORC1 hyperactivation and GATOR1 inactivation (
Baldassari et al., 2016;
Baulac, 2016), KICSTOR clearly plays a role in the GATOR1-dependent repression of mTORC1 signaling. SZT2 contains several key regions that enable interaction with GATOR1 and GATOR2 respectively (
Peng et al., 2017), and more recent immunoprecipitation data suggests that a SZT2-DEPDC5 interaction occurs in the absence of other GATOR components (
Shen et al., 2018). It has previously been noted that the GATOR proteins exhibit architectural similarity to vesicle coat proteins as well as the related vesicle tethering complexes of the CORVET, HOPS and CATCHR families (
van der Kant et al., 2015;
Balderhaar and Ungermann, 2013). Structural studies of the GATOR1-GATOR2-KICSTOR complex will be required to understand how KICSTOR organizes the GATOR complexes on the lysosome. Further biochemical and genetic studies on individual KICSTOR components themselves are urgently needed; they might reveal as-yet-unknown functions in the growth pathway, or it may turn out that binding to KICSTOR is simply a convenient means of recruiting GATOR1 to its cognate compartment, with the complex performing another lysosomal-specific task.
Recruitment of the (m)TORC1 assembly to the lysosome or vacuole is accomplished in the absence of inhibitory nutrient signaling, and through incompletely understood control of the RagC/D GTPase
In the activated state of the cellular growth pathway, (m)TORC1 must be recruited to its cognate compartment, co-localizing it with its activator Rheb. This role is performed by the Rag GTPases, which are activated by nucleotide exchange and hydrolysis, in the case of Rag A/B and C/D respectively, and their own recruitment to the lysosome or to the vacuole in yeast. While FLCN-FNIP (
Petit et al., 2013;
Tsun et al., 2013;
Péli-Gulli et al., 2015) and the Ragulator complex are known to be the key regulators promoting this step, the underlying signals for activation of the Rag GTPases remain obscure in comparison to those inhibiting them.
FLCN-FNIP
Just as the GATOR complexes control RagA/B, Folliculin (FLCN) has been identified as the GAP controlling the nucleotide state of RagC/D. Unlike GATOR1 however, this renders it is a positive regulator of (m)TORC1 signaling, given the mismatched nucleotide requirements of the Rag heterodimer. It was first discovered as a novel truncated gene from a cohort of patients with Birt-Hogg-Dubé (BHD) syndrome (
Nickerson et al., 2002), associated with multiple pathophysiologies such as renal cancer and impaired skin development (
Schmidt and Linehan, 2018). Subsequent work established that FLCN forms functional complexes with
folliculin-interacting proteins 1 (FNIP1) and FNIP2, and furthermore that FNIP1 is a substrate, and binding partner, of AMPK, with phosphorylation resulting in downregulated mTOR signaling (
Baba et al., 2006;
Hasumi et al., 2008). The link between FLCN activity and mTORC1 signaling led to the discovery that FLCN and both FNIP1 and 2 are direct interacting partners of the Rag GTPases in mammalian cells (
Petit et al., 2013;
Tsun et al., 2013) and in yeast (
Péli-Gulli et al., 2015). An obligate FLCN-FNIP complex is necessary for the Rag interaction, since RagA and RagC coimmunoprecipitate with FLCN only when FNIP2 is coexpressed, and this interaction is strengthened during nutrient starvation and resulted in inhibition of (m)TORC1 activity (
Tsun et al., 2013;
Péli-Gulli et al., 2015). Endogenous FLCN-FNIP1/2 and Lst4-Lst7 complexes exhibit lysosomal or vacuolar localization during nutrient deficiency, respectively (
Petit et al., 2013;
Tsun et al., 2013;
Péli-Gulli et al., 2015), and once there FLCN-FNIP2 acts as the GAP for RagC/D.
FLCN and FNIP both contain conserved N-terminal longin and C-terminal
Differentially Expressed in Normal cells and Neoplasia (DENN) domains (Fig. S3A), which are highly represented in Rab GEFs (
Zhang D et al., 2012). The C-terminal DENN domain forms part of a larger so-called DENN module, which also comprises an N-terminal longin domain found in FLCN and the FNIPs, as well as the
S. cerevisiae homologs. The crystal structure of the human FLCN C-terminal DENN domain (Fig. S3B) revealed the molecular structure of the core, or c-DENN, and downstream (d-)DENN subdomains (
Nookala et al., 2012). Notably, FNIP1 and FNIP2 display a more divergent relationship to the prototypical DENN module architecture, encoding large, predicted unstructured, insertions within both the N-terminal longin and C-terminal c-DENN and d-DENN domains. To work around this, the structure of the
Kluyveromyces lactis Lst4 N-terminal longin domain was solved (
Pacitto et al., 2015), as this domain does not feature any disordered regions compared to the mammalian FNIPs. The structure revealed a classic longin domain architecture, which is the first subdomain of the complete DENN module (Fig. S3C), and which is also observed in the GATOR1 components NPRL2 and NPRL3. Unfortunately structures of the complete complex are as yet unavailable, while the details of its interaction with the Rag GTPases are keenly awaited.
Which signals converge on FLCN-FNIP to control its activity remains unclear. Although there is data implying that FLCN-FNIP is a GAP for RagC/D (
Tsun et al., 2013) and that FLCN interacts stably with RagA (
Petit et al., 2013), the localization of the FLCN-FNIP complexes, and their binding partners, remains incompletely understood. How FLCN-FNIP complexes are recruited to the lysosome or vacuole surface under nutrient-deficient conditions, and what events occur upon nutrient supplementation which cause their dispersion from these organellar membranes, remain open questions. A more recent study on
S. cerevisiae Lst4 and Lst7 characterized TORC1 phosphorylation sites within Lst4 – within an intra-DENN loop insertion – which is involved in the release of the Lst4-Lst7 complex from the vacuolar membrane in response to nutrient repletion (
Péli-Gulli et al., 2017). Given this insight, a similar feedback mechanism may occur in mammalian cells, where posttranslational modifications impinging on the intra-DENN regions of FNIP could cause dispersal from the lysosome under conditions of nutrient sufficiency. The known interaction with AMPK might well also imply a close connection to cellular energy state, and therefore the integration of metabolic signals directly. It is intriguing to note that a growing body of work also supports the intrinsic role of FLCN and its associated interacting proteins in regulating lysosome positioning in response to cellular metabolism (
Starling et al., 2016;
Dodding, 2017). These studies suggest that the association of FLCN-FNIP complexes with the lysosome during nutrient starvation promotes the formation of contacts between the lysosome and peri-nuclear membranes in a manner dependent on Rab34 and
Rab-interacting lysosomal protein (RILP). Clearly multiple signals converge on FLCN-FNIP, and it is likely that characterization of the dynamic intra-DENN regions will highlight upstream regulators. Structural studies of the entire FLCN-FNIP-Rag assembly will be required to provide a definitive mechanism of Rag regulation by the FLCN-FNIP heterodimer, and may highlight how FLCN-FNIP cooperates with related pathways in the coordination between cellular metabolism and lysosome positioning (
Dodding, 2017).
The Rag GTPase-Ragulator assembly
The Rag GTPases and their associated complex, ‘Ragulator’, are the essential platform for (m)TORC1 recruitment to the lysosome. The RagA and RagB genes were first identified as a novel subfamily of
Ras-related GTPases of unknown biological function (
Schürmann et al., 1995). Sequence alignment then identified RagA/B as the mammalian homologs of yeast Gtr1 (
Bun-Ya et al., 1992). Yeast two-hybrid experiments isolated two more candidate Rags, which were homologous to each other as well as yeast to Gtr2, and bound RagA/B in vivo using C-terminal roadblock domains unique to the Rag subfamily (Fig. 4A) similarly to the binding of Gtr1 to Gtr2 (
Nakashima et al., 1999;
Sekiguchi et al., 2001). The discovery of these proteins, RagC and RagD, completed the Rag family.
In yeast, the Gtr1 and Gtr2 GTPases were shown to associate with Ego1, Ego3 and more recently Ego2 (
Dubouloz et al., 2005;
Gao and Kaiser, 2006;
Powis et al., 2015) in an analogous complex to Ragulator named the EGO complex (EGOC). However, given the low sequence conservation between the Ego proteins and those from humans, the complex was not recognized as a Ragulator equivalent until structural data became available. The Rag GTPases were only later connected to the (m)TORC1 pathway (
Kim et al., 2008;
Sancak et al., 2008) when
Drosophila melanogaster homologs of RagA/B/Gtr1 and RagC/D/Gtr2 were identified through an RNA interference screen targeting GTPases whose knockdown prevented amino acid-dependent phosphorylation of S6K, a well-known (m)TORC1 target (
Burnett et al., 1998;
Hara et al., 1998). RagA and RagC regulate mTORC1 in an amino acid-sensitive manner; RagA/B, when GTP-bound, and RagC/D, when GDP-bound, recruit RAPTOR, a core (m)TORC1 component (
Sancak et al., 2008), thereby recruiting the kinase complex to late endosomes and lysosomes. None of the Rag proteins contain any lipid modification motifs that would result in their lysosomal localization, and therefore five proteins, MP1, p14, p18, HBXIP and C7orf59, which form a lysosomally-localized complex and which interact with the Rag GTPases, were found to be required to recruit them to their cognate compartment (
Bar-Peled et al., 2012;
Sancak et al., 2010). The protein p18 was shown to be the key determinant of lysosome localization by virtue of lipid modifications within its N-terminal region (
Bar-Peled et al., 2012;
Nada et al., 2009). Though MP1 and p14 were previously annotated with other functions (
Teis et al., 2002), to reflect the regulation of the Rags and mTORC1 by these proteins, the pentameric complex was termed ‘Ragulator’ and the gene names were renamed LAMTOR1 (p18), LAMTOR2 (p14), LAMTOR3 (MP1), LAMTOR4 (HBXIP) and LAMTOR5 (C7orf59) for
Late Endosomal/Lysosomal Adaptor, MAPK and mTOR activator/regulator (
Bar-Peled et al., 2012).
In addition to being a scaffold, Ragulator acts as the GEF for RagA/B, accelerating the release of both GDP and GTP from this subunit of the Rag heterodimer (
Bar-Peled et al., 2012). A GEF for RagC/D has yet to be identified, representing a key gap in our understanding of these proteins. Presence of amino acids decreases the amount of Ragulator coimmunoprecipitating with RagB; furthermore, this interaction is strengthened when the Rags are in a nucleotide-free state, as the addition of GTP to an in vitro binding assay weakened the interaction (
Bar-Peled et al., 2012). The interaction between Ragulator and the Rag heterodimers has been suggested to be dependent on the v-ATPase (
Zoncu et al., 2011), which is thought to transmit intra-lysosomal amino acid signals to the Rags by affecting Ragulator GEF activity. Intriguingly, several other lysosomal membrane proteins have been considered to regulate the Rag GTPase pathway through Ragulator, including putative arginine sensor SLC38A9 (
Jung et al., 2015;
Rebsamen et al., 2015;
Wang et al., 2015), the
BLOC-1 related complex (BORC) (
Pu et al., 2015),
Niemann-Pick C1 (NPC1) (
Castellano et al., 2017), and potentially TMEM55B (
Hashimoto et al., 2018) and TMEM127 (
Deng et al., 2018). Studies of SLC38A9 suggest that presence of amino acids, arginine in particular (
Wang et al., 2015) are transmitted through SLC38A9-v-ATPase to the Rag GTPases (
Jung et al., 2015;
Rebsamen et al., 2015;
Wang et al., 2015). It may be the case that stimulation of v-ATPase activity regulates Ragulator-Rag in response to fusion of lysosomes with incoming nutrient-rich endosomes, thus enabling signaling to mTORC1 from within the lysosome. Additionally, NPC1 has been implicated in cholesterol-dependent signaling to mTORC1 through SLC38A9 (
Castellano et al., 2017) and TMEM55B and TMEM127 appear to support the stability of the Ragulator-v-ATPase assembly (
Hashimoto et al., 2018;
Deng et al., 2018); thus, there are likely to exist more peripheral membrane sensors which can regulate mTORC1 through the Rag GTPases. BORC has been implicated in regulating lysosome positioning in concert with amino acid signaling through Ragulator-Rag (
Filipek et al., 2017;
Pu et al., 2017) and also lysosome-autophagosome fusion during autophagy (
Jia et al., 2017), thus representing another link to lysosome positioning and function alongside the Rab-RILP connection to FLCN-FNIP (
Starling et al., 2016). A structural dissection of Ragulator interactions with its associated machinery on the lysosome will certainly shed light on the mechanisms by which the complex communicates to the Rag GTPases from multiple peripherally associated proteins.
The structures of LAMTOR2, LAMTOR3 and LAMTOR5 were the first to be characterized prior to the discovery of the Ragulator complex (
Kurzbauer et al., 2004;
Cui et al., 2008;
Lunin et al., 2004;
Qian et al., 2005;
Garcia-Saez et al., 2011), revealing roadblock folds for the individual LAMTOR3 and LAMTOR5 monomers, and the formation of MP1/p14 heterodimers by edge-to-edge juxtaposition of the roadblock domains. The structure of
S. cerevisiae Ego3 provided evidence that the EGOC and Ragulator components are structurally conserved, despite substantial sequence divergence (
Kogan et al., 2010;
Zhang et al., 2012), exhibiting a dimeric architecture very similar to that of LAMTOR2-LAMTOR3. The crystal structure of Gtr1-Gtr2 confirmed that these GTPases heterodimerize using their roadblock domains analogously (
Gong et al., 2011). Given the notion that roadblock domains, like the structurally similar longin domains, are recurring architectures in GTPase-interacting proteins (
Levine et al., 2013) the Ragulator-Rag complex serves as an extended interaction hub for the nutrient-sensing machinery, including the GAPs GATOR1 and FLCN-FNIP, peripheral lysosomal proteins such as v-ATPase and SLC38A9, and (m)TORC1. Indeed, crystal structures of the yeast EGOC and mammalian Ragulator-Rag complex support this view (
Powis et al., 2015;
Mu et al., 2017;
Su et al., 2017;
Yonehara et al., 2017;
Zhang et al., 2017;
de Araujo et al., 2017). The pentameric Ragulator complex exhibits an architecture in which heterodimers of LAMTOR2-LAMTOR3 and LAMTOR4-LAMTOR5 are encircled by LAMTOR1, which adopts an extended conformation with short regions of partial
a-helicity making contacts with the other Ragulator members. The yeast EGOC structure, containing an Ego2-Ego3 heterodimer, and the LAMTOR1 homolog Ego1, demonstrated a remarkably similar architecture, with Ego1 overlapping considerably with LAMTOR1, and Ego2 and Ego3 overlapping with LAMTOR5 and LAMTOR2 respectively. The heptameric Ragulator-Rag complex structure showed Rag C-terminal roadblock domains binding to the LAMTOR2-LAMTOR3 heterodimer (
de Araujo et al., 2017;
Su et al., 2017;
Yonehara et al., 2017) (Fig. 4B), thus building a tower of three roadblock domain-mediated heterodimers tied together by the extended LAMTOR1 subunit, and presenting the GTPase domains for interaction with RAPTOR. It is important to note that when the Rag heterodimer binds Ragulator, the N-terminal portion of LAMTOR1 encompassing residues 47-64 is resolved, forming an
a-helix which packs against the RagC roadblock domain, whereas in the Ragulator complex alone this region of LAMTOR1 was disordered (
de Araujo et al., 2017;
Yonehara et al., 2017). Thus, LAMTOR1 forms a flexible tether for the remainder of the Ragulator complex and the Rag GTPases on the lysosome surface.
The GATOR1-Rag structure has provided the first glimpse of the Rag GTPase domains within the heterodimer (
Shen et al., 2018) (Fig. 4C). Although the resolution of the cryo-EM structure was insufficient to accurately identify the RagC-bound ligand, it is thought to be GDP as RagC contains a S75N mutation which induces preferential GDP binding (
Shen et al., 2018). A notable difference between both GTPase domains is the conformation of switch I, which in RagC appears to have extended and broken the central
b-sheet as seen in RagA. This dramatic structural rearrangement is reminiscent of the Gtr1
GTP-Gtr2
GDP structure, in which GDP-bound Gtr2 displays an extremely similar switch I conformation (
Jeong et al., 2012). In comparison to a Gtr1
GMPPNP-Gtr2
GMPPNP structure (
Gong et al., 2011), the Gtr2
GDP switch I region extends into an
a-helix away from the nucleotide binding pocket with a concomitant global rotation (Fig. 4D) establishing a small novel interface between the GTPase domains whereby Gtr1 residues directly contact the Gtr2 nucleotide binding pocket and bound nucleotide itself (Fig. 4E). It is of particular interest to note that such an observation may support the recent notion that members within the Rag GTPase heterodimer exhibit negative cooperativity toward one another (
Shen et al., 2017). Kinetic analyses of nucleotide binding suggest that the binding of a GTP molecule to one Rag establishes a dominant, conformation-driven, effect on the nucleotide binding pocket of the partner Rag, reducing affinity for GTP and stimulating GTP hydrolysis in the partner active site such that the heterodimer exists stably in a singly-GTP-bound state. Perhaps there exists a novel interface between the RagA-RagC GTPase domains, not captured in the recent cryo-EM structure, which may facilitate such cooperative behavior.
Overall, the Ragulator-Rag structures reveal a long molecular surface comprising three pairs of roadblock domain heterodimers, over which other protein complexes like the GAPs GATOR1 and FLCN-FNIP, as well as mTORC1, can bind. These structures, however, do not shed light on the mechanism by which Ragulator acts as a GEF for the Rags, although some insights from the study by Su and colleagues may reveal an unusual GEF mechanism (
Su et al., 2017). These authors used constitutively active or inactive Rag heterodimers and used hydrogen-deuterium exchange-mass spectrometry to probe the dynamics of the RagA and RagC P loop residues, finding that Ragulator appears to modulate the dynamics of the RagA P loop specifically, as RagC did not exhibit spectral differences in the presence or absence of Ragulator. A hypothesis for this unusual GEF activity has been proposed in which GDP/GTP binding and conformational changes of the switch I region cause corresponding changes in the surrounding
b-sheet, where the GTPase domain contacts the roadblock domain; the extended structure of LAMTOR1 may imply that its dynamic association with the other Ragulator components and Rag GTPases could regulate GTPase activity based on the interactions between individual roadblock domain heterodimers, as well as the interactions between Ragulator and lysosomal components including v-ATPase (
Cherfils, 2017). Finally, the dynamic changes in the GTPase domain during nucleotide cycling must be implicitly linked to RAPTOR binding, however a molecular understanding of this interaction has not yet been detailed. It is clear that static structural determination methods will not be sufficient to understand the regulation of the Rag GTPases in atomic detail.
Extracellular stimuli and survival signals control cell growth through GTPase activation and suppression of Rheb, which acts as the activator of (m)TORC1 catalysis
Whereas recruitment of (m)TORC1 to the lysosome is controlled in response predominantly to signals corresponding to the intracellular nutrient and energy state, its activation once there is controlled by Rheb
GTP in response to a wide variety of external and survival signals feeding into a single point of control. Several studies in the past few years have greatly improved our understanding of the structure of mTORC1 as well as how the Rag and Rheb GTPases recruit and activate the kinase, respectively (
Kim et al., 2008;
Long et al., 2005;
Sancak et al., 2008;
Sato et al., 2009). Together, these GTPases constitute a coincidence detector which cooperate synergistically to robustly induce mTORC1 activity and fail to do so when either is only singly activated under physiologic conditions (
Durán and Hall, 2012;
Groenewoud and Zwartkruis, 2013).
The TSC complex
The integration of the majority of external and survival signals feeding into the (m)TORC1 pathway occurs through the TSC complex (TSCC), composed of
tuberous sclerosis complex protein 1 (TSC1) and TSC2, also known as hamartin and tuberin, and TBC1D7, which is the key signal integrator and negative regulator upstream of Rheb. The genomic loci encoding TSC1 and TSC2 were first mapped upon identification of their linkage to the autosomal dominant disease
Tuberous Sclerosis Complex, after which these genes were named (
Fryer et al., 1987;
Kandt et al., 1992). TSC2 contains a domain with homology to Rap1-GAP (
van Slegtenhorst et al., 1997), and forms a complex in vivo with TSC1, mediated by predicted coiled-coil domains at the TSC1 C terminus and the TSC2 N terminus (
van Slegtenhorst et al., 1998;
Nellist et al., 1999) (Fig. 5A). The functions of the TSC1-TSC2 complex were largely mysterious until studies investigating the
Drosophila homologs provided a link between TSC1-TSC2 and cell growth, specifically downstream of the insulin receptor (
Gao and Pan, 2001;
Potter et al., 2001;
Tapon et al., 2001). This identification of this regulatory mechanism, shown using the human proteins, unequivocally identified TSC1-TSC2 as inhibitors of the mTORC1 signaling pathway (
Inoki et al., 2002;
Tee et al., 2002), specifically by virtue of TSC2 GAP activity toward the small GTPase Rheb (
Castro et al., 2003;
Garami et al., 2003;
Inoki et al., 2003;
Saucedo et al., 2003;
Stocker et al., 2003;
Tee et al., 2003;Zhang et al., 2003). A third subunit of TSCC, TBC1D7, was later discovered and shown to be a constitutive member of the complex (
Dibble et al., 2012). Unlike other TBC proteins, which are generally known to act as GAPs for the Rab family of GTPases (
Fukuda, 2011), TBC1D7 does not appear to have GAP activity relevant to the Rheb signaling axis upstream of mTORC1, instead likely increasing the sensitivity and responsivity of the entire complex to environmental signals. Chromatographic analyses suggest that TSCCs also contains multiple copies of TSC1 and TSC2, and it is postulated that including multiple copies of each constituent protein may increase the sensitivity of the complex to allosteric regulation (
Hoogeveen-Westerveld et al., 2012). As more recent structural studies appear to suggest, the presence of TBC1D7 may support higher-order oligomerisation of TSC1 and TSC2.
The large, complex nature of the TSC1 and TSC2 proteins has largely impeded structure determination, especially in contrast to the smaller TBC1D7 subunit (Fig. 5A). Insights to TSC1 and TSC2 structure first came from a characterization of the N-terminal domain of
S. pombe TSC1 (
Sun et al., 2013), dubbed the core domain (Fig. 5B). This region of
S. pombe TSC1 is homologous to residues 1-266 of human TSC1, notably lacking a region homologous to the first ~90 residues of the yeast protein. These authors found that the core domain is exclusively
a-helical, as predicted by bioinformatic analyses, and forms a solenoid built of HEAT repeats. It is worth noting that Sun and colleagues obtained crystals with one of two different space groups, and that in both cases the asymmetric unit contained a higher-order TSC1 oligomer. In one crystal form, ten monomers form two juxtaposed pentameric rings, ultimately forming a ‘face-to-face’ decamer (Fig. 5B). The driver for such an assembly appears to be an overhanging helix from each core domain, which makes contacts with neighboring domains. Though such an arrangement may be a consequence of crystal packing, and thus its physiologic relevance is questionable, the structure may indicate a role for the TSC1 N terminus in homooligomerisation. The core domain is considered essential for the stability of TSC1 (
Hoogeveen-Westerveld et al., 2010;
Mozaffari et al., 2009); if a function of this region was to enable higher-order oligomer formation, such an assembly may protect TSCC subunits from degradation. More recently, a crystal structure was solved of a stable N-terminal TSC2 fragment from
C. thermophilum (
Zech et al., 2016) (Fig. 5C). This TSC2 N-terminal fragment, like the
S. pombe TSC1 core domain, forms an
a-solenoid constituting nine HEAT repeats. This fragment of TSC2 was also shown to be sufficient to pull down the full-length, and C-terminal half of, TSC1, and these interacting regions are conserved in the human proteins (
Zech et al., 2016).
Contributing to a greater understanding of TSCC architecture, crystal structures have been solved of the smaller subunit TBC1D7 complexed with coiled-coil fragments of TSC1 (
Gai et al., 2016a;
Qin et al., 2016), alongside a TBC1D7
apo structure deposited in the PDB by Guan et al. (PDB ID: 3QWL). The
apo structure (Fig. 5D) reveals an architecture highly conserved with other TBC domain proteins (
Fischer et al., 2016;
Fukuda, 2011;
Park et al., 2011), with little conformational difference in comparison to the TSC1-complexed structures. Both TBC1D7-TSC1 crystal structures highlight a conserved interaction interface, with a long helix from TBC1D7 packing against the TSC1 coiled-coil to form a three-helix bundle. Both groups crystallized the same TSC1 region – residues 939-992 – with near full-length TBC1D7, either residues 19-293 (
Qin et al., 2016) or 21-293 (
Gai et al., 2016a). In the complex structure solved by Gai and colleagues, two TBC1D7 molecules and two TSC1 molecules form a dimer of dimers, with the full length of the TSC1 subunit visible, forming a parallel coiled-coil where each helix kinks at G973 and folds back onto each TBC1D7 subunit (Fig. 5E). In contrast, the complex solved by Qin et al. shows two TBC1D7 and four TSC1 molecules per asymmetric unit, whereby each TBC1D7 complexed with two TSC1 molecules, forming a triple helical bundle (Fig. 5F). In this structure, the last ~15 residues of each TSC1 molecule – which formed the kinked helix in the crystal structure by Gai et al. – were not visible. In both cases, one TBC1D7 molecule was able to interact simultaneously with two TSC1 coiled-coil fragments, and alignment of these domains highlights structural similarity (Fig. 5G). As supported by analytical ultracentrifugation data from both studies (
Gai et al., 2016a;Qin et al., 2016), the exact stoichiometry of the TBC1D7-TSC1 coiled-coil interaction appears to be dependent on the ratio between the two molecules, and this may underlie physiologic activity of TBC1D7 in regulation TSCC oligomerisation.
Whereas it appears that specific metabolites signal to mTORC1 through the Rag GTPase axis, stress conditions such as hypoxia, DNA damage stress, ER stress, and growth factor and survival signaling, are filtered through to the Rheb GTPase through TSCC by post-translational modification (
Kwiatkowski, 2003;
Huang and Manning, 2008;
Tomasoni and Mondino, 2011;
Demetriades et al., 2016). TSCC is phosphorylated by many kinases, each of which contributes either activating or inhibiting modifications under various environmental conditions, including S6K1,
cyclin-dependent kinase 1 (CDK1), AMPK and Akt, which is itself activated by mTORC2 (
Huang and Manning, 2009). Phosphorylation has been shown to cause dispersion of TSC2 from membrane-localized TSC1 punctae to the cytosol (
Cai et al., 2006), in a manner dependent on interaction with 14-3-3 proteins. Notably, TSC2 has already been shown to interact with 14-3-3 proteins when phosphorylated, and that this interaction disrupts the function of the TSC1-TSC2 complex (
Nellist et al., 2003;
Shumway et al., 2003). Overall, it appears that the localization of the TSC2 subunit is the main determinant of TSCC-mediated Rheb inhibition (
Demetriades et al., 2016). Another example of this localization dependency includes hypoxia-dependent regulation through REDD1 (
Brugarolas et al., 2004), which disrupts cytosolic 14-3-3-bound TSC2 and enables subsequent TSC2 localization to the lysosome where it can inhibit Rheb (
DeYoung et al., 2008). Future structural work capturing the TSCC assembly and TSC2-Rheb structure will provide molecular insights into the activity of this crucially important but as-yet incompletely understood mTORC1 regulator, and provide a rational basis for understanding the pathogenicity of mutations in human TSC1 and TSC2, associated with diseases such as TSC and sporadic cases of lymphangioleiomyomatosis (
Kwiatkowski, 2003;
Rosset et al., 2017).
Rheb GTPase
In almost all eukaryotes, Rheb
GTP is an essential coactivator of (m)TORC1, required for catalytic activity. It was first identified during mRNA screens of rat hippocampal tissue for genes induced after electroconvulsive stimulation (
Yamagata et al., 1994). The protein identified was noted to be a Ras GTPase homolog, and was named
Ras Homolog Enriched in Brain, although it has since been discerned that Rheb is ubiquitous in all tissues across the body (
Saito et al., 2005). It was subsequently confirmed that Rheb is farnesylated at a C-terminal CAAX motif (
Clark et al., 1997) (Fig. 6A). The subcellular localization of Rheb has been debated, with many sources arguing for a distribution between endomembrane compartments including the ER, Golgi, mitochondria, and late endosomes and lysosomes (
Takahashi et al., 2005;
Buerger et al., 2006;
Hanker et al., 2010;
Parmar and Tamanoi, 2010). Despite a lack of clear experimental evidence pointing to a defined localization pattern, Rheb is known to function on endomembranes in the mTORC1 pathway as a potent activator of the kinase (
Long et al., 2005;
Sato et al., 2009) and is the physiologic target of TSC2 (
Castro et al., 2003;
Garami et al., 2003;
Inoki et al., 2003;
Saucedo et al., 2003;
Stocker et al., 2003;
Tee et al., 2003;
Zhang et al., 2003).
Kinetic studies of Rheb revealed that the protein is unique compared to other Ras superfamily GTPases in that it exhibits very low basal GTPase activity, probably due to lacking a conserved catalytic glycine residue (G12 in Ras), which is replaced by an arginine. Crystal structures of Rheb complexed with GTP (Fig. 6B) and GDP revealed that the protein undergoes very small, select conformation changes involving switch I (residues 33-41) and switch II (residues 63-79) (
Yu et al., 2005). Notably, these structures reveal that the glycine-to-arginine mutation thought to result in low GTPase activity cannot alone be used to explain this difference in activity. Several differences from the canonical GTPase motifs account for these unusual kinetics. Q64, which is required for polarization/activation of the water molecule that initiates nucleophilic attack, is displaced by>4 Å from the counterpart Ras
GTP structure, due to an unusual conformation of switch II, and furthermore its sidechain is buried in a hydrophobic pocket. Y35 also prevents GTP dissociation by closing over the nucleotide binding pocket (Fig. 6C), coordinating the phosphates more tightly than in Ras and Rap (
Mazhab-Jafari et al., 2012).
Rheb has been observed to maintain a high proportion within the GTP state in within cells (
Im et al., 2002). The molecular features which enable this behavior thus prepare for de-inhibition by TSC2 to lead to rapid activation of mTORC1, thereby consolidating the function of Rheb as a potent activator of mTORC1. It is known that the TSC2 GAP domain utilizes an “asparagine-thumb” mechanism for the activation of Rheb GTPase activity, similar to that of the Rap GTPase and its GAP (
Marshall et al., 2009;
Scrima et al., 2008), and a structure of the TSC2 GAP-Rheb complex is sorely needed to enable us to understand more about how this unique GTPase-GAP pair functions to control the activation of mTORC1. The GEF for Rheb, if one exists, remains one of the most significant gaps in our knowledge, but it is notable that Rheb is found dispersed throughout the endomembrane compartments (
Parmar and Tamanoi, 2010), and it has been postulated that Rheb activation occurs at the Golgi, whereupon it then shuttles during maturation to late endosomes and lysosomes where it can activate mTORC1 (
Groenewoud and Zwartkruis, 2013). This would also rationalise the slow GTP hydrolysis and release, as it would be required for Rheb to survive translocation to lysosomes in GTP-state. More recent evidence suggests that
microspherule protein 1 (MCRS1) retains Rheb on the lysosome surface in the presence of amino acids, thus coordinating the localization of functional Rheb with the Rag signaling axis (
Fawal et al., 2015). Given the pleiotropic effects observed upon Rheb knockout in multiple cell lines and mouse models (
Heard et al., 2014), more work is required to pin down the exact pathways by which Rheb is activated and localized in proximity to mTORC1.
Architecture and activation of mTORC1
The structure of the mTORC1 core, consisting of an mTOR mLST8 “FATKIN” unit, was solved in 2013 by Yang and colleagues, providing the first structural data on the kinase and enabling a molecular-level understanding of kinase inhibition by rapamycin (
Yang et al., 2013). Being one of the mammalian
PI3K-related protein kinases (PIKKs), mTOR was known by sequence homology to contain a
FRAP, ATM and TRAPP (FAT) domain, a catalytic kinase domain and a C-terminal ‘FATC’ domain found only in PIKKs (
Bosotti et al., 2000). The crystal structure revealed that the kinase domain exhibited a classical bilobed structure, just as in the PI3K family, and the PIKK-specific FAT domain encircles the kinase domain, raising the possibility that interaction of regulatory proteins with this region will allosterically impact the kinase domain structure as well. Notably, however, the kinase N-lobe features a large insertion corresponding to the
FKBP12-rapamycin binding (FRB) domain (
Chen et al., 1995;
Chiu et al., 1994;
Vilella-Bach et al., 1999) specific to mTORC1 among the PIKKs. More recently, cryo-EM structures have revealed the architecture of the mTORC1 complex at higher resolution (
Aylett et al., 2016;
Baretić et al., 2016;
Yang et al., 2017,
2016). These structures revealed that mTORC1 is indeed dimeric, with each monomer in the complex interacting with its partner using the N-terminal HEAT repeat domains, which were truncated in the earlier crystal structure (
Yang et al., 2013), and with Raptor binding the juncture between the kinase domains (
Imseng et al., 2018).
Cross-linking of mTORC1 in the presence of Rheb
GTPgS and 4EBP1, another well-known mTORC1 substrate (
Burnett et al., 1998), yielded the cryo-EM structure of mTORC1 bound to Rheb (
Yang et al., 2017). The structure of mTORC1 corresponds well to the previously determined structures (
Aylett et al., 2016;
Baretić et al., 2016;
Yang et al., 2016), enabling the identification of the key mTORC1 domains, and verifying the tripartite dimerization interface across Raptor and the M- and N-HEAT regions. Density corresponding to Rheb was observed at the N-terminal portions of the N-HEAT, M-HEAT and FAT domains, thereby resulting in a four-way interface where Rheb switch I residues bound the M-HEAT solenoid and FAT domain, and switch II formed contacts with all three mTORC1 domains. A mechanistic understanding for the allosteric activation of mTORC1 is now available: Rheb binds to mTOR with its switch I and II regions, which have been previously observed crystallographically to undergo nucleotide-dependent conformational changes (
Yu et al., 2005). Rheb binding causes a rotation of the mTORC1 N-HEAT solenoid toward the M-HEAT and FAT domains, where N-HEAT-FAT interactions are formed (Fig. 6D). These in turn reorganise the N-lobe of the kinase domain to a catalytically-primed conformation: the catalytic and Mg
2+ binding residues are brought into closer proximity with the terminal phosphate of ATP (Fig. 6D, insets). The binding of Rheb to mTORC1 exhibits enforced cooperativity, since a singly-bound Rheb would result in a highly unfavorable dimerization interface; mTORC1 is therefore robustly activated by the binding of two Rheb GTPases per kinase complex, which causes a global rearrangement culminating in the priming of the active site for catalysis. This final step represents the culmination of the combined input of the entire pathway into a final binary “on/off” decision: to grow.
Concluding remarks
The control of the (m)TORC1 growth on/off switch at the juncture of so many key signaling pathways (Fig. 7) is a highly complex system evolved to reduce the cacophony of information available to the cell to the answer to a simple question: “Is it a good time to grow”?. A great deal of recent progress has been made on unpicking the mechanisms of this “Heath Robinson” contraption, revealing how key features of the process function at a molecular level, as in the case of the recent explosion of structural data for Rag/Ragulator and (m)TORC1, and how input reduction has been achieved, for the GATOR2-interacting nutrient sensors for example, and in the unexpectedly complete isolation of the Rag and Rheb pathways from one another. There have also been many completely unexpected observations, which have thrown up new questions remaining to be understood in time, such as the dual Rag binding sites of GATOR1, hinting at new mechanisms by which responses are tuned, and the proliferation of longin domains, suggesting evolutionary expansion from a simpler precursor pathway.
The molecular mechanisms of (m)TOR regulation are now tantalisingly close to being understood. Progress remains quite unevenly distributed, however; the recruitment and activation complexes themselves are now becoming relatively well described, only the mechanism of recruitment of (m)TORC1 by Rag/Ragulator and the substrate bound structures of (m)TORC1 eluding our grasp, and discussion moving toward low-level subjects such as kinetics and active site mechanisms, whereas the lack of structural detail on the less proximal parts of the pathway, the TSCC, FLCN-FNIP and GATOR2 complexes, hampers our understanding still. There is great cause for hope, however, given the pace of recent progress, that this highly important and medically relevant pathway will emerge from the scattered pieces to yield a clear picture over the next few years of scientific endeavor.
Higher Education Press and Springer-Verlag GmbH Germany, part of Springer Nature