Mol. Cells 2019; 42(5): 386-396
Published online May 31, 2019
https://doi.org/10.14348/molcells.2019.0078
© The Korean Society for Molecular and Cellular Biology
Correspondence to : *hspark@kaist.ac.kr
This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/.
Labeling of a protein with a specific dye or tag at defined positions is a critical step in tracing the subtle behavior of the protein and assessing its cellular function. Over the last decade, many strategies have been developed to achieve selective labeling of proteins in living cells. In particular, the site-specific unnatural amino acid (UAA) incorporation technique has gained increasing attention since it enables attachment of various organic probes to a specific position of a protein in a more precise way. In this review, we describe how the UAA incorporation technique has expanded our ability to achieve site-specific labeling and visualization of target proteins for functional analyses in live cells.
Keywords bioorthogonal, click chemistry, genetic code expansion, site-specific labeling, unnatural amino acid
Visualization of a protein of interest in living cells is becoming increasingly important, not only to precisely address cellular functions of the individual target protein, but also to understand the complicated biological system as a whole (Crivat and Taraska, 2012; Kim and Heo, 2018; Liu et al., 2015; Stephens and Allan, 2003). In particular, recent rapid advances in live-cell imaging techniques, including nanoscopy and super-resolution microscopy (SRM), have augmented our ability to address the cellular functions of biomolecules at the nanoscale level (10–30 nm) (Sengupta et al., 2012; Toomre and Bewersdorf, 2010). Numerous dyes and tag-labeling approaches have been developed for efficiently visualizing cellular target proteins (Crivat and Taraska, 2012). Fusion of fluorescent protein tags has been used routinely owing to its ease and simplicity (Shaner et al., 2007). In addition, diverse self-labeling enzymes, such as Halo-tag (Los et al., 2008), SNAP-tag (Keppler et al., 2003) and CLIP-tag (Gautier et al., 2008), have also been explored for labeling target proteins using chemical fluorescent probes, which usually exhibit better photophysical properties than fluorescent proteins (van de Linde et al., 2012). However, such protein-based tagging methods suffer from intrinsic limitations. Fused proteins or enzymes can only be added to N- or C-termini of the target protein (Crivat and Taraska, 2012; Tsien, 1998). More critically, they can perturb the cellular function of the target protein, such as localization, activity or protein-protein interaction, owing to their relatively large size (> 20 kDa) and tendency to form multimeric structures (Prescher and Bertozzi, 2005; Vreja et al., 2015).
Attempts to reduce the size of the tagging unit have given rise to the development of various peptide-based approaches. For example, phage display of peptide libraries has led to isolation of the 12-residue peptide tags, S6 and A1, which enable orthogonal protein labeling catalyzed by the phosphopantetheinyl transferases, Sfp and AcpS, respectively (Zhou et al., 2007). A tetracysteine-based labeling scheme uses an even smaller peptide, a 6-amino acid motif (CCXXCC), that selectively binds to fluorescein arsenical hairpin (FlAsH) (Adams et al., 2002). Such peptide-tag labeling methods, despite certain favorable characteristics, still suffer from critical problems, including high background and cellular toxicity of the organic fluorophores. Strictly speaking, peptide-based methods are not site-specific, since they do not allow residue-specific protein labeling (Crivat and Taraska, 2012; Fernandez and Freed, 2017; Lotze et al., 2016). On the other hand, the genetic incorporation of unnatural amino acid (UAA) approach using an engineered orthogonal aminoacyl-tRNA synthetase (AARS) and tRNA pair can ideally lead to site-specific protein labeling with minimal perturbations, since this method allows installation of special chemical groups into the desired residue of the target protein (Chatterjee et al., 2013; Lang et al., 2015; Prescher and Bertozzi, 2005). Details of UAA-enabled approaches and diverse labeling schemes have been summarized by some excellent reviews (Lang and Chin, 2014b; Liu et al., 2007; Oliveira et al., 2017; Sletten and Bertozzi, 2009; Young and Schultz, 2018). In this review, we briefly discuss recent progress in UAA-based site-specific protein labeling methods and their promising applications. UAAs described in this report are listed in Figure 1 and Table 1.
In addition to the standard 20 amino acids, biological systems have evolved to genetically encode two additional amino acids, selenocysteine (Sec) and pyrrolysine (Pyl), and incorporate them into proteins (Ambrogelly et al., 2007). These two amino acids are thus sometimes referred to as the 21st and 22nd amino acids. In the case of Sec, serine is first attached to selenocysteinyl tRNA by SerRS (Leinfelder et al., 1988), and then converted into Sec by the sequential action of phosphoseryl-tRNA kinase (PSTK) (Carlson et al., 2004) and Sep (O-phosphoserine) tRNA:Sec tRNA synthase (SepSecS) (Yuan et al., 2006). Sec is then co-translationally inserted into proteins at a TGA stop codon with the help of special elongation factor (SelB) and a stem-loop Sec insertion sequence element (SECIS) (Forchhammer et al., 1989). Pyl is directly attached to pyrrolysyl tRNA (tRNApyl) by pyrrolysyl-tRNA synthetase (PylRS) and incorporated into proteins at a TAG stop codon (Blight et al., 2004; Polycarpo et al., 2004). Inspired by this natural expansion of the genetic code, researchers have developed a site-specific UAA incorporation technique (also called genetic code expansion) for reprogramming the universal genetic code by redesigning and evolving pairs of AARS and their cognate tRNAs (Mukai et al., 2017). The discovery of the PylRS/tRNApyl pair in methanogenic archaea, in particular, has helped rapidly advance this field (Wan et al., 2014). PylRS/tRNApyl is a naturally occurring orthogonal pair that functions efficiently in diverse living organisms from bacteria (Blight et al., 2004) and yeast (Hancock et al., 2010) to mice (Han et al., 2017). PylRS has a flexible amino acid binding pocket that is easily modified to accommodate a wide range of UAAs with diverse chemical structures. In addition, PylRS does not recognize the anticodon of tRNApyl, thus enabling suppression of three different stop codons, TAG, TGA and TAA, without sacrificing PylRS/tRNApyl interactions (Suzuki et al., 2017; Wan et al., 2014).
The major strength of the UAA incorporation technique is its ability to allow direct addition of a designed UAA into a specific position in a protein of interest. Researchers have explored this site-specific protein-labeling advantage by undertaking efforts to directly incorporate UAAs bearing various fluorescent chemical groups. The fluorescent UAA, 3-(6-acetylnaphthalen-2-ylamino)-2-aminopropanoic acid (Anap) (Chatterjee et al., 2013; Lee et al., 2009), was designed based on 6-propionyl-2-(N,N-dimethyl)aminonaphthalene (Prodan), a fluorophore that has been widely used in cell imaging (Weber and Farris, 1979). For genetic incorporation of Anap in mammalian cells, the Anap-specific AARS, AnapRS, was developed from
Ideally, direct genetic incorporation of fluorescent UAAs would minimize possible functional disruptions caused by fluorescence tagging. This method can also minimize the distance between the fluorophore and the protein of interest, which is essential for super-resolution imaging (Lang and Chin, 2014b). However, on a discouraging note, there are some concerns regarding the direct incorporation of fluorescent UAAs that impede their widespread use in cell biology (Lang and Chin, 2014b). Developing a specialized binding pocket in an orthogonal AARS for the unusually bulky chemical structures of fluorescent UAAs is quite difficult (Bryson et al., 2017). In addition, the incorporation efficiency of intrinsically fluorescent UAAs is considerably lower than that of UAAs designed for attachment of a fluorescent group, listed below (Summerer et al., 2006). Furthermore, most genetically incorporated fluorescent UAAs suffer from low quantum yield (Summerer et al., 2006). For example, CouAA has a quantum yield of 0.63 (Wang et al., 2006), which is comparable to that of enhanced green fluorescent protein (EGFP) (Tsien, 1998), whereas the quantum yield of Anap, at 0.48 (Chatterjee et al., 2013), is even lower. Such unfavorable characteristics of direct fluorescent UAA incorporation systems have limited their use in biological systems and live-cell visualization.
An alternative to directly incorporating fluorescent UAAs that has been widely attempted is bio-conjugation–mediated fluorescent tagging. For this, UAAs that enable bioorthogonal reactions need to be incorporated into proteins and then conjugated with specially designed dyes to achieve site-specific fluorescence labeling of the protein of interest (Lang and Chin, 2014a; Liu and Schultz, 2010; McKay and Finn, 2014). Numerous bioorthogonal schemes for conjugating chemical probes or tags to target biomolecules have been developed. The most popular schemes that are applicable to selective fluorescence labeling under physiological conditions without interfering with biological systems include copper-catalyzed azide alkyne cycloaddition (CuAAC) (Kolb et al., 2001; Tornoe et al., 2002), photoclick cycloaddition (Wang et al., 2007), strain-promoted azide alkyne cycloaddition (SPAAC) (Agard et al., 2004), and inverse electron-demand Diels–Alder cycloadditions (IEDDA) (Boger, 1986; Thalhammer et al., 1990) (Fig. 2).
CuAAC is a cycloaddition reaction between azide and alkyne groups catalyzed by copper. Because of its rapid (~3 M−1 S−1) and highly specific reaction, CuAAC has been applied in numerous contexts (McKay and Finn, 2014), from organic synthesis to biomolecule conjugation—even in diagnostic and therapeutic settings (He et al., 2016). However, despite the speed of the reaction and stability of the conjugation products, cytotoxicity caused by the Cu(l) metal catalyst has remained a lingering concern and has limited the widespread use of CuAAC in certain applications, including live-cell imaging (Brewer, 2010; Chen and Wu, 2016). The development of the photoclick cycloaddition scheme, which does not require copper, circumvents this toxicity issue and thus enables the expanded use of CuAAC in living organisms (Yu et al., 2012). One example of such a reaction is photoinducible dipolar cycloaddition, which induces conjugation between an alkene and a nitrile imine, activated from tetrazole (Herner and Lin, 2016; Song et al., 2008). Photoclick chemistry offers some additional advantages; the reaction is relatively fast (up to 50 M−1 S−1) and does not require toxic metals or ligands. Notably, the photo-induced reaction can be controlled in a spatiotemporal manner (Li et al., 2013). The highly reactive tetrazole group is readily activated by a low-power UV lamp, LED or laser beam, all of which are less harmful to living cells than Cu(I) (Ramil and Lin, 2014).
SPAAC induces bioorthogonal conjugation in the absence of metal catalysts or irradiation by introducing ring strain into an alkyne group (Agard et al., 2004). Many early generation strained alkynes, such as cyclooctyne, exhibited a slow conjugation reaction (~2 × 10−3 M−1 S−1)—clearly an undesirable attribute. Subsequent studies have improved the stability of the alkyne and reaction rate by modifying the cycloalkyne chemical structure (Chen and Wu, 2016; Dommerholt et al., 2016). Because of these improvements and inherently low cytotoxicity, SPAAC has come to be broadly used in live mammalian cells (Baskin et al., 2007; Mbua et al., 2011), and even in animals (Baskin et al., 2007; Laughlin et al., 2008). IEDDA, which enables cycloaddition between a dienophile and a diene, such as 1,2,4,5-tetrazine (Boger, 1986; Thalhammer et al., 1990), is a more recently developed bioorthogonal conjugation strategy (Blackman et al., 2008). Because of its extremely fast (up to 106 M−1 S−1) and selective reaction, IEDDA has become a prominent reaction scheme for bioorthogonal conjugation that enables chemical modifications and fluorescent dye labeling of proteins of interest in live mammalian cells (Lang and Chin, 2014a; Lang et al., 2012a; Nikic et al., 2015).
Since its first application in a biological system in 2002 (Kolb et al., 2001; Tornoe et al., 2002), the CuAAC approach has been applied to a variety of research areas, including drug development and diagnostics (He et al., 2016). However, a practical impediment to its application in living systems is the induction of cytotoxic reactive oxygen species (ROS) by Cu(I) oxidation (Brewer, 2010). A variety of water-soluble ligands have been designed to reduce the amount of Cu(I) metal catalyst needed to achieve efficient conjugation with minimal cytotoxicity. These include tris-(benzyltriazolylmethyl)amine (TBTA) (Chan et al., 2004), tris-(3-hydroxypropyltriazolylmethyl)-amine (THPTA) (Hong et al., 2009), bis(L-histidine) (Kennedy et al., 2011), and 2-[4-{(bis[(1-tert-butyl-1H-1,2,3-triazol-4-yl)methyl]amino)methyl}-1H-1,2,3-triazol-1-yl]acetic acid (BTTAA) (Besanceney-Webler et al., 2011).
To date, several UAA incorporation systems utilizing CuAAC have been successfully applied to label target proteins. For example, the epidermal growth factor receptor (EGFR), a cell surface receptor tyrosine kinase, was visualized in live mammalian cells by site-specifically incorporating the lysine-based UAA,
The photoactivated 1,3-dipolar cycloaddition reaction between diphenyltetrazole and methyl crotonate in benzene, first reported in 1967 (Huisgen et al., 1967), has been further expanded to cover nitrile oxide, azomethine ylide, and azide cycloaddition in aqueous media (Molteni et al., 2000). 1,3-Dipolar cycloaddition between alkene dipolarophiles and nitrile imine dipoles has also been developed (Molteni et al., 2000). Moreover, an efficient photo-inducible bioorthogonal reaction based on photoclick chemistry that enables tetrazole-alkene 1,3-dipolar cycloaddition was recently reported in living cells (Yu et al., 2012).
The lysine-based UAAs,
SPAAC is a simple and spontaneous conjugation between azide and cycloalkyne groups that generates a triazole product without the requirement for a catalyst (Agard et al., 2004). The early stage [3 + 2] azide-alkyne cycloaddition using strained cyclooctyne was considerably slower than CuAAC, but various subsequently developed strained alkyne chemical structures have increased the reaction rate (Chen and Wu, 2016). Thanks to such efforts, SPAAC has been effectively applied to visualize diverse biomolecules, including nucleotides (Singh and Heaney, 2011) and lipids (Neef and Schultz, 2009), in various living systems (Alamudi et al., 2016). Tetra-acetylated N-azidoacetyl-D-mannosamine (Ac-4ManNAz), an azide-functionalized sugar, can be metabolically incorporated into the cell surface membrane through the sialic acid biosynthetic pathway (Baskin et al., 2007; Kayser et al., 1992; Saxon and Bertozzi, 2000). The resulting azide-containing glycans can be visualized by labeling with cyclooctyne fluorophores such as difluorinated cyclooctyne (DIFO) (Baskin et al., 2007), dibenzocyclooctyne (DIBO) (Ning et al., 2008), azadibenzylcyclooctyne (DIBAC) (Debets et al., 2010) and bicyclo[6.1.0]nonyne (BCN) fluorescent dyes, allowing three-dimensional visualization of living melanoma cells (Dommerholt et al., 2010).
The SPAAC scheme has been further expanded to protein molecules in living organisms (Plass et al., 2011).
IEDDA is a [4 + 2] cycloaddition reaction between 1,2,4,5-tetrazine and dienophiles that is highly specific and extremely fast under physiological conditions (Thalhammer et al., 1990). In addition, it does not require any catalyst or reagent for the conjugation reaction. Because of such favorable characteristics, IEDDA has become one of the most popular bioorthogonal schemes for biological applications. Thus far, a wide range of dienophiles has been designed for IEDDA reactions, and numerous UAAs bearing such dienophile chemical groups have been developed for IEDDA-based bioorthogonal conjugation (Oliveira et al., 2017; Plass et al., 2012; Prokhorov and Kozhevnikov, 2012).
Bicyclo[6.1.0]non-4-yn-9-ylmethanol (BCN) is considered an attractive dienophile owing to its excellent reactivity with tetrazine. The BCN-bearing UAA, bicyclo[6.1.0]non-4-yn-9-ylmethanol lysine (BCNK), was first designed in 2012 and subsequently genetically incorporated using a mutant form of MbPylRS containing three amino acid substitution (Y271M, L274G, and C313A), generated by directed evolution. This triply mutated MbPylRS exhibited good protein production yield (6–12 mg/L) (Lang et al., 2012b). The double mutant, MmPylRS-AF, was also found to incorporate isoforms of BCNK (endo-BCNK and exo-BCNK) with high yield (Borrmann et al., 2012; Nikic et al., 2015; Peng and Hang, 2016). Exo-BCNK showed higher incorporation efficiency and faster labeling than its isomer when conjugated with monosubstituted tetrazine (H-Tetrazine) fluorophores, which are generally superior to methyl-substituted tetrazine (Me-Tetrazine) moieties (Peng and Hang, 2016). The BCNK incorporation and tetrazine-based fluorophore conjugation scheme has been successfully applied to label diverse cellular proteins, including cell surface and cytoskeletal proteins.
Trans-cyclooctene (TCO) is a highly reactive dienophile that exhibits about a 10-times higher reaction rate compared with BCN owing to its strained chemical structure (Lang et al., 2012b). TCO-bearing lysines (TCOKs) were designed and found to efficiently react with tetrazines. MmPylRS-AF and MbPylRS carrying three mutations (Y271A, L274M, and C313A) were shown to genetically incorporate 2′-TCOK and 4′-TCOK (Nikic et al., 2014; Peng and Hang, 2016). In addition to engineering PylRS, tRNApyl was also evolved to increase the incorporation of TCOKs and other related bulky UAAs that previously exhibited low incorporation efficiency (Serfling et al., 2018). The engineered tRNA, tRNAM15, carrying a canonical hinge between the D- and T-loop that is highly conserved in human tRNAs, was found to improve UAA incorporation several fold compared to the original tRNApyl. Combining MbPylRS-AF and the engineered tRNA enabled the extracellular target protein, CRF1R (corticotropin releasing factor type 1 receptor), carrying 2′-TCOK, to be efficiently produced and labeled with cy3-tetrazine dye for
The efficiency of genetic incorporation and labeling of TCOKs, like BCNKs, is highly dependent on the chemical structure (Peng and Hang, 2016; Uttamapinant et al., 2015). 2′-TCOK showed higher incorporation by MmPylRS-AF and better labeling with tetrazines than 4′-TCOK since it is less reactive towards cysteine residues, which are exposed in many proteins in mammalian cells and might otherwise cause interference (Nikic et al., 2014). Moreover, a side-by-side comparison of axial and equatorial isomers of 2′- and 4′-TCOKs showed better labeling efficiency of 2′-aTCOK (axial) and 4′-eTCOK (equatorial) with tetrazines compared with their diastereomers, probably owing to greater chemical stability or a more rapid reaction rate (Peng and Hang, 2016). This TCOK incorporation and tetrazine-based labeling system has been applied to a wide range of cellular targets, including EGFR (Lang et al., 2012b) and actin (Peng and Hang, 2016). In particular, this scheme was found to be highly useful for visualizing small transmembrane proteins like interferon-inducible transmembrane protein 3 (IFITM3), which could not easily be tracked using the classic fluorescent protein fusion method (Peng and Hang, 2016).
Background fluorescence resulting from insufficient washing of unreacted probes, which is often caused by the hydrophobicity of UAAs, can be troublesome, especially in visualizing intracellular proteins (Kozma et al., 2016). To further improve labeling efficiency of TCOK toward cytosolic proteins, researchers synthesized the more hydrophilic TCOK derivative, dioxo-TCOK (DOTCOK). This UAA was genetically incorporated into proteins by MmPylRS-AF with good yield. Excess DOTCOK is easily removed using considerably shorter washing periods (several minutes) compared with BCNK and TCOK (6 h or longer), which is clearly advantageous for visualizing cytosolic proteins (Kozma et al., 2016).
Other useful UAAs that support IEDDA include cyclobutene-containing lysine (CbK) (Liu et al., 2017), norbornene-2-yloxycarbonyl-lysine (NBOK) (Lang et al., 2012a), and strained cyclooctyne-lysine (SCOK) (Nikic et al., 2015; Plass et al., 2012). CbK is small and chemically stable; it also has a higher conjugation rate with tetrazine than most cycloprene, small alkene or norbornene derivatives, but a lower rate than TCO. CbK was genetically incorporated into OmpX (outer membrane protein X) of
In addition to lysine analogues, dienophile-carrying tyrosine derivatives have also been utilized. 5-Norbornen-2-ol (NOR), cyclooct-2-ynol (COY), and two isomers of (E)-2-(cyclooct-4-en-1-yloxyl)-ethanol tyrosines (DS1 and DS2) were designed and synthesized. These UAAs were genetically incorporated into target proteins in the outer membrane of
Beyond its straightforward use for protein labeling and visualization, the UAA-based protein-labeling scheme is beginning to find extended applications in diverse research areas. One of the major drawbacks of the conventional fluorescent protein fusion method is the difficulty in probing component proteins that form higher-order complexes, such as the virus capsid and virus-like particles, owing to their substantial size (Costantini and Snapp, 2015). Such difficult target proteins can be accessed and analyzed at the single-molecule level using UAA incorporation and a bioorthogonal probe tagging system (Das et al., 2018). Influenza hemagglutinin (HA) is a canonical type I viral envelope glycoprotein that is responsible for fusion of the viral envelope with the endosome membrane after entry into host cells. Acidification of the late endosome was thought to trigger a conformational change in HA, although this had not been directly observed. Recombinant influenza HA carrying two 2′-TCOKs (at positions 17 and 127) was produced using MmPylRS-AF and then labeled with Tet-Cy3 and (Me-)Tet-Cy5 via the IEDDA reaction. This dual-labeling scheme made it possible to study the conformational dynamics of single influenza HA molecules triggered by acidification using single-molecule FRET (smFRET) analysis and total internal reflection fluorescence (TIRF) microscopy (Das et al., 2018) (Fig. 3A). UAA-based bioorthogonal chemistry can also be applied to analyze differential gene and protein expression. Wild-type methionyl-tRNA synthetase (MetRS) and mutant MetRSLtoG are known to incorporate diverse clickable methionine analogs, including azidohomoalanine (AHA), homopropargylglycine (HPG), and azidonorleucine (ANL), which can be efficiently used for bioorthogonal noncanonical amino acid tagging (BONCAT) technique (Dieterich et al., 2006; Erdmann et al., 2015). Proteins carrying Met analogs containing azide or alkyne can be selectively labeled with fluorescence tags for visualization or with affinity tags for enrichment by CuAAC (Fig. 3B). This method has been used to visualize newly synthesized proteins in diverse organisms, including mouse (Calve et al., 2016) and Drosophila (Erdmann et al., 2015), in specific contexts, such as viral infection and stress conditions (Su Hui Teo et al., 2016). This approach also allows spatiotemporal analysis of proteomics after drug treatment or electrophysiological stimulation (Tom Dieck et al., 2012). The clear benefit of this technique compared with conventional isotope-based methods is that it confers a higher degree of temporal resolution without the need for isotopes (Stone et al., 2017).
The UAA incorporation technique can be further extended to
We apologize to all scientists in the field whose important work could not be cited in this report due to space limitations. This work was supported by grants from the National Research Foundation of Korea (2014M3A6A4075060 and 2017R1A2B3011543) and from Samsung Science & Technology Foundation (SSTF-BA1702-09).
The authors declare no conflict of interest.
Unnatural amino acids and bioorthogonal conjugation schemes
UAA | Orthogonal AARS | Mutations | Reaction | References |
---|---|---|---|---|
Fluorescent unnatural amino acids | ||||
1, Anap | EcLeuRS | L38F, M40G, L41P, Y499V, Y500L, Y527A, H537E, L538S, F541C, A560V | Chatterjee et al., 2013; Lee et al., 2009 | |
2, CouAA | MjTyrRS | Y32E, L65H, A67G, H70G, F108Y, Q109H, D158G, L162G | Charbon et al., 2011a; 2011b; Wang et al., 2006 | |
3, Dansylalanine | EcLeuRS | M40A, L41N, T252A, S497C, Y499I, Y527G, H537T | Summerer et al., 2006 | |
Lysine derivatives | ||||
4, PABK | MmPylRS | CuAAC | Chen and Wu, 2016 | |
5, PrK | MmPylRS | CuAAC | Greiss and Chin, 2011; Milles et al., 2012; Swiderska et al., 2017; Vreja et al., 2015; Zhang et al., 2018 | |
6, CpK | MbPylRS | Photoclick | Ramil and Lin, 2014; Yu et al., 2012 | |
7, AcrK | MbPylRS | D76G, L266M, L270I, Y271F, L274A, C313F | Photoclick | Li et al., 2013 |
8, CoK | MmPylRS | Y306A, Y384F | SPAAC | Alamudi et al., 2016; Plass et al., 2011 |
9, BCNK | MbPylRS | Y271M, L274G, C313A | IEDDA | Borrmann et al., 2012; Lang et al., 2012b; Nikic et al., 2015; Peng and Hang, 2016 |
10, 2′-TCOK | MmPylRS, MbPylRS | Y306A, Y384F | IEDDA | Lang et al., 2012b; Nikic et al., 2014; Peng and Hang, 2016 |
11, 4′-TCOK | MmPylRS, MbPylRS | Y306A, Y384F | IEDDA | Lang et al., 2012b; Nikic et al., 2014; Peng and Hang, 2016 |
12, DOTCOK | MmPylRS | Y306A, Y384F | IEDDA | Kozma et al., 2016 |
13, CbK | MbPylRS | L274M, C313A | IEDDA | Liu et al., 2017 |
14, NBOK | MbPylRS | IEDDA | Lang et al., 2012a | |
15, SCOK | MmPylRS | Y306A, Y384F | IEDDA | Nikic et al., 2015; Plass et al., 2012 |
22, NEAK | MmPylRS | SPAAC | Wu et al., 2016 | |
Tyrosine derivatives | ||||
16, NOR | MmPylRS | Y306A, N346A, C348A, Y384F | IEDDA | Kurra et al., 2014 |
17, COY | MmPylRS | Y306A, N346A, C348A, Y384F | IEDDA | Kurra et al., 2014 |
18, DS1/2 | MmPylRS | Y306A, N346A, C348A, Y384F | IEDDA | Kurra et al., 2014 |
Methionine analogs | ||||
19, AHA | MetRS | Calve et al., 2016; Dieterich et al., 2006; Stone et al., 2017; Tom Dieck et al., 2012 | ||
20, HPG | MetRS | Calve et al., 2016; Stone et al., 2017; Su Hui Teo et al., 2016; Tom Dieck et al., 2012 | ||
21, ANL | MetRS | L13G | Erdmann et al., 2015; Stone et al., 2017 |
Mol. Cells 2019; 42(5): 386-396
Published online May 31, 2019 https://doi.org/10.14348/molcells.2019.0078
Copyright © The Korean Society for Molecular and Cellular Biology.
Kyung Jin Lee1,2,3, Deokhee Kang1,3, and Hee-Sung Park1,*
1Department of Chemistry, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Korea
Correspondence to:*hspark@kaist.ac.kr
This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/3.0/.
Labeling of a protein with a specific dye or tag at defined positions is a critical step in tracing the subtle behavior of the protein and assessing its cellular function. Over the last decade, many strategies have been developed to achieve selective labeling of proteins in living cells. In particular, the site-specific unnatural amino acid (UAA) incorporation technique has gained increasing attention since it enables attachment of various organic probes to a specific position of a protein in a more precise way. In this review, we describe how the UAA incorporation technique has expanded our ability to achieve site-specific labeling and visualization of target proteins for functional analyses in live cells.
Keywords: bioorthogonal, click chemistry, genetic code expansion, site-specific labeling, unnatural amino acid
Visualization of a protein of interest in living cells is becoming increasingly important, not only to precisely address cellular functions of the individual target protein, but also to understand the complicated biological system as a whole (Crivat and Taraska, 2012; Kim and Heo, 2018; Liu et al., 2015; Stephens and Allan, 2003). In particular, recent rapid advances in live-cell imaging techniques, including nanoscopy and super-resolution microscopy (SRM), have augmented our ability to address the cellular functions of biomolecules at the nanoscale level (10–30 nm) (Sengupta et al., 2012; Toomre and Bewersdorf, 2010). Numerous dyes and tag-labeling approaches have been developed for efficiently visualizing cellular target proteins (Crivat and Taraska, 2012). Fusion of fluorescent protein tags has been used routinely owing to its ease and simplicity (Shaner et al., 2007). In addition, diverse self-labeling enzymes, such as Halo-tag (Los et al., 2008), SNAP-tag (Keppler et al., 2003) and CLIP-tag (Gautier et al., 2008), have also been explored for labeling target proteins using chemical fluorescent probes, which usually exhibit better photophysical properties than fluorescent proteins (van de Linde et al., 2012). However, such protein-based tagging methods suffer from intrinsic limitations. Fused proteins or enzymes can only be added to N- or C-termini of the target protein (Crivat and Taraska, 2012; Tsien, 1998). More critically, they can perturb the cellular function of the target protein, such as localization, activity or protein-protein interaction, owing to their relatively large size (> 20 kDa) and tendency to form multimeric structures (Prescher and Bertozzi, 2005; Vreja et al., 2015).
Attempts to reduce the size of the tagging unit have given rise to the development of various peptide-based approaches. For example, phage display of peptide libraries has led to isolation of the 12-residue peptide tags, S6 and A1, which enable orthogonal protein labeling catalyzed by the phosphopantetheinyl transferases, Sfp and AcpS, respectively (Zhou et al., 2007). A tetracysteine-based labeling scheme uses an even smaller peptide, a 6-amino acid motif (CCXXCC), that selectively binds to fluorescein arsenical hairpin (FlAsH) (Adams et al., 2002). Such peptide-tag labeling methods, despite certain favorable characteristics, still suffer from critical problems, including high background and cellular toxicity of the organic fluorophores. Strictly speaking, peptide-based methods are not site-specific, since they do not allow residue-specific protein labeling (Crivat and Taraska, 2012; Fernandez and Freed, 2017; Lotze et al., 2016). On the other hand, the genetic incorporation of unnatural amino acid (UAA) approach using an engineered orthogonal aminoacyl-tRNA synthetase (AARS) and tRNA pair can ideally lead to site-specific protein labeling with minimal perturbations, since this method allows installation of special chemical groups into the desired residue of the target protein (Chatterjee et al., 2013; Lang et al., 2015; Prescher and Bertozzi, 2005). Details of UAA-enabled approaches and diverse labeling schemes have been summarized by some excellent reviews (Lang and Chin, 2014b; Liu et al., 2007; Oliveira et al., 2017; Sletten and Bertozzi, 2009; Young and Schultz, 2018). In this review, we briefly discuss recent progress in UAA-based site-specific protein labeling methods and their promising applications. UAAs described in this report are listed in Figure 1 and Table 1.
In addition to the standard 20 amino acids, biological systems have evolved to genetically encode two additional amino acids, selenocysteine (Sec) and pyrrolysine (Pyl), and incorporate them into proteins (Ambrogelly et al., 2007). These two amino acids are thus sometimes referred to as the 21st and 22nd amino acids. In the case of Sec, serine is first attached to selenocysteinyl tRNA by SerRS (Leinfelder et al., 1988), and then converted into Sec by the sequential action of phosphoseryl-tRNA kinase (PSTK) (Carlson et al., 2004) and Sep (O-phosphoserine) tRNA:Sec tRNA synthase (SepSecS) (Yuan et al., 2006). Sec is then co-translationally inserted into proteins at a TGA stop codon with the help of special elongation factor (SelB) and a stem-loop Sec insertion sequence element (SECIS) (Forchhammer et al., 1989). Pyl is directly attached to pyrrolysyl tRNA (tRNApyl) by pyrrolysyl-tRNA synthetase (PylRS) and incorporated into proteins at a TAG stop codon (Blight et al., 2004; Polycarpo et al., 2004). Inspired by this natural expansion of the genetic code, researchers have developed a site-specific UAA incorporation technique (also called genetic code expansion) for reprogramming the universal genetic code by redesigning and evolving pairs of AARS and their cognate tRNAs (Mukai et al., 2017). The discovery of the PylRS/tRNApyl pair in methanogenic archaea, in particular, has helped rapidly advance this field (Wan et al., 2014). PylRS/tRNApyl is a naturally occurring orthogonal pair that functions efficiently in diverse living organisms from bacteria (Blight et al., 2004) and yeast (Hancock et al., 2010) to mice (Han et al., 2017). PylRS has a flexible amino acid binding pocket that is easily modified to accommodate a wide range of UAAs with diverse chemical structures. In addition, PylRS does not recognize the anticodon of tRNApyl, thus enabling suppression of three different stop codons, TAG, TGA and TAA, without sacrificing PylRS/tRNApyl interactions (Suzuki et al., 2017; Wan et al., 2014).
The major strength of the UAA incorporation technique is its ability to allow direct addition of a designed UAA into a specific position in a protein of interest. Researchers have explored this site-specific protein-labeling advantage by undertaking efforts to directly incorporate UAAs bearing various fluorescent chemical groups. The fluorescent UAA, 3-(6-acetylnaphthalen-2-ylamino)-2-aminopropanoic acid (Anap) (Chatterjee et al., 2013; Lee et al., 2009), was designed based on 6-propionyl-2-(N,N-dimethyl)aminonaphthalene (Prodan), a fluorophore that has been widely used in cell imaging (Weber and Farris, 1979). For genetic incorporation of Anap in mammalian cells, the Anap-specific AARS, AnapRS, was developed from
Ideally, direct genetic incorporation of fluorescent UAAs would minimize possible functional disruptions caused by fluorescence tagging. This method can also minimize the distance between the fluorophore and the protein of interest, which is essential for super-resolution imaging (Lang and Chin, 2014b). However, on a discouraging note, there are some concerns regarding the direct incorporation of fluorescent UAAs that impede their widespread use in cell biology (Lang and Chin, 2014b). Developing a specialized binding pocket in an orthogonal AARS for the unusually bulky chemical structures of fluorescent UAAs is quite difficult (Bryson et al., 2017). In addition, the incorporation efficiency of intrinsically fluorescent UAAs is considerably lower than that of UAAs designed for attachment of a fluorescent group, listed below (Summerer et al., 2006). Furthermore, most genetically incorporated fluorescent UAAs suffer from low quantum yield (Summerer et al., 2006). For example, CouAA has a quantum yield of 0.63 (Wang et al., 2006), which is comparable to that of enhanced green fluorescent protein (EGFP) (Tsien, 1998), whereas the quantum yield of Anap, at 0.48 (Chatterjee et al., 2013), is even lower. Such unfavorable characteristics of direct fluorescent UAA incorporation systems have limited their use in biological systems and live-cell visualization.
An alternative to directly incorporating fluorescent UAAs that has been widely attempted is bio-conjugation–mediated fluorescent tagging. For this, UAAs that enable bioorthogonal reactions need to be incorporated into proteins and then conjugated with specially designed dyes to achieve site-specific fluorescence labeling of the protein of interest (Lang and Chin, 2014a; Liu and Schultz, 2010; McKay and Finn, 2014). Numerous bioorthogonal schemes for conjugating chemical probes or tags to target biomolecules have been developed. The most popular schemes that are applicable to selective fluorescence labeling under physiological conditions without interfering with biological systems include copper-catalyzed azide alkyne cycloaddition (CuAAC) (Kolb et al., 2001; Tornoe et al., 2002), photoclick cycloaddition (Wang et al., 2007), strain-promoted azide alkyne cycloaddition (SPAAC) (Agard et al., 2004), and inverse electron-demand Diels–Alder cycloadditions (IEDDA) (Boger, 1986; Thalhammer et al., 1990) (Fig. 2).
CuAAC is a cycloaddition reaction between azide and alkyne groups catalyzed by copper. Because of its rapid (~3 M−1 S−1) and highly specific reaction, CuAAC has been applied in numerous contexts (McKay and Finn, 2014), from organic synthesis to biomolecule conjugation—even in diagnostic and therapeutic settings (He et al., 2016). However, despite the speed of the reaction and stability of the conjugation products, cytotoxicity caused by the Cu(l) metal catalyst has remained a lingering concern and has limited the widespread use of CuAAC in certain applications, including live-cell imaging (Brewer, 2010; Chen and Wu, 2016). The development of the photoclick cycloaddition scheme, which does not require copper, circumvents this toxicity issue and thus enables the expanded use of CuAAC in living organisms (Yu et al., 2012). One example of such a reaction is photoinducible dipolar cycloaddition, which induces conjugation between an alkene and a nitrile imine, activated from tetrazole (Herner and Lin, 2016; Song et al., 2008). Photoclick chemistry offers some additional advantages; the reaction is relatively fast (up to 50 M−1 S−1) and does not require toxic metals or ligands. Notably, the photo-induced reaction can be controlled in a spatiotemporal manner (Li et al., 2013). The highly reactive tetrazole group is readily activated by a low-power UV lamp, LED or laser beam, all of which are less harmful to living cells than Cu(I) (Ramil and Lin, 2014).
SPAAC induces bioorthogonal conjugation in the absence of metal catalysts or irradiation by introducing ring strain into an alkyne group (Agard et al., 2004). Many early generation strained alkynes, such as cyclooctyne, exhibited a slow conjugation reaction (~2 × 10−3 M−1 S−1)—clearly an undesirable attribute. Subsequent studies have improved the stability of the alkyne and reaction rate by modifying the cycloalkyne chemical structure (Chen and Wu, 2016; Dommerholt et al., 2016). Because of these improvements and inherently low cytotoxicity, SPAAC has come to be broadly used in live mammalian cells (Baskin et al., 2007; Mbua et al., 2011), and even in animals (Baskin et al., 2007; Laughlin et al., 2008). IEDDA, which enables cycloaddition between a dienophile and a diene, such as 1,2,4,5-tetrazine (Boger, 1986; Thalhammer et al., 1990), is a more recently developed bioorthogonal conjugation strategy (Blackman et al., 2008). Because of its extremely fast (up to 106 M−1 S−1) and selective reaction, IEDDA has become a prominent reaction scheme for bioorthogonal conjugation that enables chemical modifications and fluorescent dye labeling of proteins of interest in live mammalian cells (Lang and Chin, 2014a; Lang et al., 2012a; Nikic et al., 2015).
Since its first application in a biological system in 2002 (Kolb et al., 2001; Tornoe et al., 2002), the CuAAC approach has been applied to a variety of research areas, including drug development and diagnostics (He et al., 2016). However, a practical impediment to its application in living systems is the induction of cytotoxic reactive oxygen species (ROS) by Cu(I) oxidation (Brewer, 2010). A variety of water-soluble ligands have been designed to reduce the amount of Cu(I) metal catalyst needed to achieve efficient conjugation with minimal cytotoxicity. These include tris-(benzyltriazolylmethyl)amine (TBTA) (Chan et al., 2004), tris-(3-hydroxypropyltriazolylmethyl)-amine (THPTA) (Hong et al., 2009), bis(L-histidine) (Kennedy et al., 2011), and 2-[4-{(bis[(1-tert-butyl-1H-1,2,3-triazol-4-yl)methyl]amino)methyl}-1H-1,2,3-triazol-1-yl]acetic acid (BTTAA) (Besanceney-Webler et al., 2011).
To date, several UAA incorporation systems utilizing CuAAC have been successfully applied to label target proteins. For example, the epidermal growth factor receptor (EGFR), a cell surface receptor tyrosine kinase, was visualized in live mammalian cells by site-specifically incorporating the lysine-based UAA,
The photoactivated 1,3-dipolar cycloaddition reaction between diphenyltetrazole and methyl crotonate in benzene, first reported in 1967 (Huisgen et al., 1967), has been further expanded to cover nitrile oxide, azomethine ylide, and azide cycloaddition in aqueous media (Molteni et al., 2000). 1,3-Dipolar cycloaddition between alkene dipolarophiles and nitrile imine dipoles has also been developed (Molteni et al., 2000). Moreover, an efficient photo-inducible bioorthogonal reaction based on photoclick chemistry that enables tetrazole-alkene 1,3-dipolar cycloaddition was recently reported in living cells (Yu et al., 2012).
The lysine-based UAAs,
SPAAC is a simple and spontaneous conjugation between azide and cycloalkyne groups that generates a triazole product without the requirement for a catalyst (Agard et al., 2004). The early stage [3 + 2] azide-alkyne cycloaddition using strained cyclooctyne was considerably slower than CuAAC, but various subsequently developed strained alkyne chemical structures have increased the reaction rate (Chen and Wu, 2016). Thanks to such efforts, SPAAC has been effectively applied to visualize diverse biomolecules, including nucleotides (Singh and Heaney, 2011) and lipids (Neef and Schultz, 2009), in various living systems (Alamudi et al., 2016). Tetra-acetylated N-azidoacetyl-D-mannosamine (Ac-4ManNAz), an azide-functionalized sugar, can be metabolically incorporated into the cell surface membrane through the sialic acid biosynthetic pathway (Baskin et al., 2007; Kayser et al., 1992; Saxon and Bertozzi, 2000). The resulting azide-containing glycans can be visualized by labeling with cyclooctyne fluorophores such as difluorinated cyclooctyne (DIFO) (Baskin et al., 2007), dibenzocyclooctyne (DIBO) (Ning et al., 2008), azadibenzylcyclooctyne (DIBAC) (Debets et al., 2010) and bicyclo[6.1.0]nonyne (BCN) fluorescent dyes, allowing three-dimensional visualization of living melanoma cells (Dommerholt et al., 2010).
The SPAAC scheme has been further expanded to protein molecules in living organisms (Plass et al., 2011).
IEDDA is a [4 + 2] cycloaddition reaction between 1,2,4,5-tetrazine and dienophiles that is highly specific and extremely fast under physiological conditions (Thalhammer et al., 1990). In addition, it does not require any catalyst or reagent for the conjugation reaction. Because of such favorable characteristics, IEDDA has become one of the most popular bioorthogonal schemes for biological applications. Thus far, a wide range of dienophiles has been designed for IEDDA reactions, and numerous UAAs bearing such dienophile chemical groups have been developed for IEDDA-based bioorthogonal conjugation (Oliveira et al., 2017; Plass et al., 2012; Prokhorov and Kozhevnikov, 2012).
Bicyclo[6.1.0]non-4-yn-9-ylmethanol (BCN) is considered an attractive dienophile owing to its excellent reactivity with tetrazine. The BCN-bearing UAA, bicyclo[6.1.0]non-4-yn-9-ylmethanol lysine (BCNK), was first designed in 2012 and subsequently genetically incorporated using a mutant form of MbPylRS containing three amino acid substitution (Y271M, L274G, and C313A), generated by directed evolution. This triply mutated MbPylRS exhibited good protein production yield (6–12 mg/L) (Lang et al., 2012b). The double mutant, MmPylRS-AF, was also found to incorporate isoforms of BCNK (endo-BCNK and exo-BCNK) with high yield (Borrmann et al., 2012; Nikic et al., 2015; Peng and Hang, 2016). Exo-BCNK showed higher incorporation efficiency and faster labeling than its isomer when conjugated with monosubstituted tetrazine (H-Tetrazine) fluorophores, which are generally superior to methyl-substituted tetrazine (Me-Tetrazine) moieties (Peng and Hang, 2016). The BCNK incorporation and tetrazine-based fluorophore conjugation scheme has been successfully applied to label diverse cellular proteins, including cell surface and cytoskeletal proteins.
Trans-cyclooctene (TCO) is a highly reactive dienophile that exhibits about a 10-times higher reaction rate compared with BCN owing to its strained chemical structure (Lang et al., 2012b). TCO-bearing lysines (TCOKs) were designed and found to efficiently react with tetrazines. MmPylRS-AF and MbPylRS carrying three mutations (Y271A, L274M, and C313A) were shown to genetically incorporate 2′-TCOK and 4′-TCOK (Nikic et al., 2014; Peng and Hang, 2016). In addition to engineering PylRS, tRNApyl was also evolved to increase the incorporation of TCOKs and other related bulky UAAs that previously exhibited low incorporation efficiency (Serfling et al., 2018). The engineered tRNA, tRNAM15, carrying a canonical hinge between the D- and T-loop that is highly conserved in human tRNAs, was found to improve UAA incorporation several fold compared to the original tRNApyl. Combining MbPylRS-AF and the engineered tRNA enabled the extracellular target protein, CRF1R (corticotropin releasing factor type 1 receptor), carrying 2′-TCOK, to be efficiently produced and labeled with cy3-tetrazine dye for
The efficiency of genetic incorporation and labeling of TCOKs, like BCNKs, is highly dependent on the chemical structure (Peng and Hang, 2016; Uttamapinant et al., 2015). 2′-TCOK showed higher incorporation by MmPylRS-AF and better labeling with tetrazines than 4′-TCOK since it is less reactive towards cysteine residues, which are exposed in many proteins in mammalian cells and might otherwise cause interference (Nikic et al., 2014). Moreover, a side-by-side comparison of axial and equatorial isomers of 2′- and 4′-TCOKs showed better labeling efficiency of 2′-aTCOK (axial) and 4′-eTCOK (equatorial) with tetrazines compared with their diastereomers, probably owing to greater chemical stability or a more rapid reaction rate (Peng and Hang, 2016). This TCOK incorporation and tetrazine-based labeling system has been applied to a wide range of cellular targets, including EGFR (Lang et al., 2012b) and actin (Peng and Hang, 2016). In particular, this scheme was found to be highly useful for visualizing small transmembrane proteins like interferon-inducible transmembrane protein 3 (IFITM3), which could not easily be tracked using the classic fluorescent protein fusion method (Peng and Hang, 2016).
Background fluorescence resulting from insufficient washing of unreacted probes, which is often caused by the hydrophobicity of UAAs, can be troublesome, especially in visualizing intracellular proteins (Kozma et al., 2016). To further improve labeling efficiency of TCOK toward cytosolic proteins, researchers synthesized the more hydrophilic TCOK derivative, dioxo-TCOK (DOTCOK). This UAA was genetically incorporated into proteins by MmPylRS-AF with good yield. Excess DOTCOK is easily removed using considerably shorter washing periods (several minutes) compared with BCNK and TCOK (6 h or longer), which is clearly advantageous for visualizing cytosolic proteins (Kozma et al., 2016).
Other useful UAAs that support IEDDA include cyclobutene-containing lysine (CbK) (Liu et al., 2017), norbornene-2-yloxycarbonyl-lysine (NBOK) (Lang et al., 2012a), and strained cyclooctyne-lysine (SCOK) (Nikic et al., 2015; Plass et al., 2012). CbK is small and chemically stable; it also has a higher conjugation rate with tetrazine than most cycloprene, small alkene or norbornene derivatives, but a lower rate than TCO. CbK was genetically incorporated into OmpX (outer membrane protein X) of
In addition to lysine analogues, dienophile-carrying tyrosine derivatives have also been utilized. 5-Norbornen-2-ol (NOR), cyclooct-2-ynol (COY), and two isomers of (E)-2-(cyclooct-4-en-1-yloxyl)-ethanol tyrosines (DS1 and DS2) were designed and synthesized. These UAAs were genetically incorporated into target proteins in the outer membrane of
Beyond its straightforward use for protein labeling and visualization, the UAA-based protein-labeling scheme is beginning to find extended applications in diverse research areas. One of the major drawbacks of the conventional fluorescent protein fusion method is the difficulty in probing component proteins that form higher-order complexes, such as the virus capsid and virus-like particles, owing to their substantial size (Costantini and Snapp, 2015). Such difficult target proteins can be accessed and analyzed at the single-molecule level using UAA incorporation and a bioorthogonal probe tagging system (Das et al., 2018). Influenza hemagglutinin (HA) is a canonical type I viral envelope glycoprotein that is responsible for fusion of the viral envelope with the endosome membrane after entry into host cells. Acidification of the late endosome was thought to trigger a conformational change in HA, although this had not been directly observed. Recombinant influenza HA carrying two 2′-TCOKs (at positions 17 and 127) was produced using MmPylRS-AF and then labeled with Tet-Cy3 and (Me-)Tet-Cy5 via the IEDDA reaction. This dual-labeling scheme made it possible to study the conformational dynamics of single influenza HA molecules triggered by acidification using single-molecule FRET (smFRET) analysis and total internal reflection fluorescence (TIRF) microscopy (Das et al., 2018) (Fig. 3A). UAA-based bioorthogonal chemistry can also be applied to analyze differential gene and protein expression. Wild-type methionyl-tRNA synthetase (MetRS) and mutant MetRSLtoG are known to incorporate diverse clickable methionine analogs, including azidohomoalanine (AHA), homopropargylglycine (HPG), and azidonorleucine (ANL), which can be efficiently used for bioorthogonal noncanonical amino acid tagging (BONCAT) technique (Dieterich et al., 2006; Erdmann et al., 2015). Proteins carrying Met analogs containing azide or alkyne can be selectively labeled with fluorescence tags for visualization or with affinity tags for enrichment by CuAAC (Fig. 3B). This method has been used to visualize newly synthesized proteins in diverse organisms, including mouse (Calve et al., 2016) and Drosophila (Erdmann et al., 2015), in specific contexts, such as viral infection and stress conditions (Su Hui Teo et al., 2016). This approach also allows spatiotemporal analysis of proteomics after drug treatment or electrophysiological stimulation (Tom Dieck et al., 2012). The clear benefit of this technique compared with conventional isotope-based methods is that it confers a higher degree of temporal resolution without the need for isotopes (Stone et al., 2017).
The UAA incorporation technique can be further extended to
We apologize to all scientists in the field whose important work could not be cited in this report due to space limitations. This work was supported by grants from the National Research Foundation of Korea (2014M3A6A4075060 and 2017R1A2B3011543) and from Samsung Science & Technology Foundation (SSTF-BA1702-09).
The authors declare no conflict of interest.
. Unnatural amino acids and bioorthogonal conjugation schemes.
UAA | Orthogonal AARS | Mutations | Reaction | References |
---|---|---|---|---|
Fluorescent unnatural amino acids | ||||
1, Anap | EcLeuRS | L38F, M40G, L41P, Y499V, Y500L, Y527A, H537E, L538S, F541C, A560V | Chatterjee et al., 2013; Lee et al., 2009 | |
2, CouAA | MjTyrRS | Y32E, L65H, A67G, H70G, F108Y, Q109H, D158G, L162G | Charbon et al., 2011a; 2011b; Wang et al., 2006 | |
3, Dansylalanine | EcLeuRS | M40A, L41N, T252A, S497C, Y499I, Y527G, H537T | Summerer et al., 2006 | |
Lysine derivatives | ||||
4, PABK | MmPylRS | CuAAC | Chen and Wu, 2016 | |
5, PrK | MmPylRS | CuAAC | Greiss and Chin, 2011; Milles et al., 2012; Swiderska et al., 2017; Vreja et al., 2015; Zhang et al., 2018 | |
6, CpK | MbPylRS | Photoclick | Ramil and Lin, 2014; Yu et al., 2012 | |
7, AcrK | MbPylRS | D76G, L266M, L270I, Y271F, L274A, C313F | Photoclick | Li et al., 2013 |
8, CoK | MmPylRS | Y306A, Y384F | SPAAC | Alamudi et al., 2016; Plass et al., 2011 |
9, BCNK | MbPylRS | Y271M, L274G, C313A | IEDDA | Borrmann et al., 2012; Lang et al., 2012b; Nikic et al., 2015; Peng and Hang, 2016 |
10, 2′-TCOK | MmPylRS, MbPylRS | Y306A, Y384F | IEDDA | Lang et al., 2012b; Nikic et al., 2014; Peng and Hang, 2016 |
11, 4′-TCOK | MmPylRS, MbPylRS | Y306A, Y384F | IEDDA | Lang et al., 2012b; Nikic et al., 2014; Peng and Hang, 2016 |
12, DOTCOK | MmPylRS | Y306A, Y384F | IEDDA | Kozma et al., 2016 |
13, CbK | MbPylRS | L274M, C313A | IEDDA | Liu et al., 2017 |
14, NBOK | MbPylRS | IEDDA | Lang et al., 2012a | |
15, SCOK | MmPylRS | Y306A, Y384F | IEDDA | Nikic et al., 2015; Plass et al., 2012 |
22, NEAK | MmPylRS | SPAAC | Wu et al., 2016 | |
Tyrosine derivatives | ||||
16, NOR | MmPylRS | Y306A, N346A, C348A, Y384F | IEDDA | Kurra et al., 2014 |
17, COY | MmPylRS | Y306A, N346A, C348A, Y384F | IEDDA | Kurra et al., 2014 |
18, DS1/2 | MmPylRS | Y306A, N346A, C348A, Y384F | IEDDA | Kurra et al., 2014 |
Methionine analogs | ||||
19, AHA | MetRS | Calve et al., 2016; Dieterich et al., 2006; Stone et al., 2017; Tom Dieck et al., 2012 | ||
20, HPG | MetRS | Calve et al., 2016; Stone et al., 2017; Su Hui Teo et al., 2016; Tom Dieck et al., 2012 | ||
21, ANL | MetRS | L13G | Erdmann et al., 2015; Stone et al., 2017 |