Effect of Noncanonical Amino Acids on Protein–Carbohydrate Interactions: Structure, Dynamics, and Carbohydrate Affinity of a Lectin Engineered with Fluorinated Tryptophan Analogs

Protein–carbohydrate interactions play crucial roles in biology. Understanding and modifying these interactions is of major interest for fighting many diseases. We took a synthetic biology approach and incorporated noncanonical amino acids into a bacterial lectin to modulate its interactions with carbohydrates. We focused on tryptophan, which is prevalent in carbohydrate binding sites. The exchange of the tryptophan residues with analogs fluorinated at different positions resulted in three distinctly fluorinated variants of the lectin from Ralstonia solanacearum. We observed differences in stability and affinity toward fucosylated glycans and rationalized them by X-ray and modeling studies. While fluorination decreased the aromaticity of the indole ring and, therefore, the strength of carbohydrate–aromatic interactions, additional weak hydrogen bonds were formed between fluorine and the ligand hydroxyl groups. Our approach opens new possibilities to engineer carbohydrate receptors.

G lycans and glycoconjugates are ubiquitously found on all living cells. Protein receptors, such as lectins, bind to such glycans and decipher their structural code, which results in biological and physiological actions. 1 Modifying and engineering lectins to interfere with glycan interactions is in its infancy. 2 Novel synthetic biology tools are now available to explore glycobiology. Noncanonical amino acids (ncAAs) find application in diverse scientific fields, such as protein engineering, 3,4 proteomics, 5,6 and material sciences 7,8 (for recent reviews, see refs 9−12), which is attributed to their chemical or structural versatility. However, only little emphasis has been put on the use of ncAAs to improve or alter receptor−ligand interactions. Fluorinated amino acids were studied for their effects on protein−protein interactions (reviewed in ref 13). For example, incorporation of 5-fluorotryptophan into the cyclic peptide inhibitor compstatin resulted in a 4-fold increased affinity toward complement factor 3. 14 The effect of protein fluorination on carbohydrate binding has not been much investigated. Luck and Falke 15 explored structural changes in the D-galactose chemosensory receptor induced by sugar binding. They observed a 2.2− 3-fold reduced affinity for the sugar with a variant containing 5fluorotryptophan in its ligand binding site compared to the unlabeled receptor. Similarly, fluorination of phenylalanine and tyrosine of the small antibiotic peptide AcAMP2 lowered its affinity for chitin trisaccharide. 16 Ralstonia solanacearum lectin (RSL) is a carbohydrate binding protein with unusually strong affinity for fucose and specificity for fucosylated oligosaccharides from plants and animals. 17 RSL is a homotrimer that forms a six-bladed β-propeller fold. It presents six carbohydrate binding sites, which are either intra-or intermonomeric (Figure 1a), and nonetheless are structurally very similar. 17 Each RSL monomer contains seven Trp residues, six of which are directly involved in carbohydrate binding (Supporting Information, Table S1). The analysis of the RSL structure combined with modeling revealed the importance of three Trp residues present in the binding site. One makes hydrophobic contact with the sugar methyl group; another is involved in hydrogen bonding (Figure 1b). The third is responsible for a stacking interaction with the fucose residue and defines the specificity for this sugar. 18,19 Such carbohydrate− aromatic interactions are frequent 20,21 with a 9-fold prevalence of Trp in the carbohydrate binding site. 21 Recently, Trp was also demonstrated to have a crucial role in inducing a large conformational change of a fucosylated trisaccharide in the RSL binding site. 22 RSL appears as an excellent model protein to study the effect of tryptophan fluorination on carbohydrate recognition. Here, we incorporated four fluorinated Trp analogs (Figure 1c) into RSL to examine the effect on the protein stability and affinity toward mono-and oligosaccharides. We report the first crystal structures of fluorinated lectins and elucidate the molecular details of their interaction with ligands. After appropriate parametrization, molecular modeling gave access to the dynamics of the modified proteins and demonstrated the predictive power of modeling ncAA incorporation.

■ RESULTS AND DISCUSSION
Lectin Production and Physicochemical Characterization. We employed the supplementation of a tryptophan auxotrophic Escherichia coli strain with four tryptophan analogs (FWs, Figure 1c) for their residue-specific incorporation 23 into RSL. Since the tryptophan analogs are not commercially available, we fed our cells with indole analogs to convert these to tryptophan analogs 24−27 in situ by the host tryptophan synthase (reviewed by Phillips 28 ). The procedure resulted in titers of purified synthetic variants in the range of 35−60 mg lyophilized protein per liter of cell culture, compared to 100 mg L −1 cell culture for the unmodified parent protein. Interestingly, the incorporation of 6FW drove RSL into insolubility, while all other fluorinated RSL variants remained soluble ( Figure S1). Efficient labeling of RSL with FWs was confirmed by mass spectrometry ( Figure S2). The majority of the proteins (85% of RSL[4FW], 86% of RSL[5FW] and 84% of RSL[7FW]) showed quantitative replacement of Trp by the analogs, and only in a minor fraction (14−16%) the exchange was incomplete (Table  S2). By assuming a random distribution of the unlabeled residues over the seven positions, one would thus expect a labeling efficiency of 97% (RSL[7FW]) to 98% (RSL[4FW] and RSL[5FW]) per site. To test the individual sites for labeling with the fluorinated Trp analogs, we subjected the fluorinated variants and the wild-type RSL proteins to different enzymatic digests to ensure nearly complete sequence coverage. Nano-LCtandem mass spectrometry analysis showed that four Trp residues and their respective fluoro-analogs could be relatively quantified (W10, W31, W53, W74) because they were covered in well ionizing peptides with a single Trp in their sequence ( Figure  S3). Only minimal residual unlabeled Trp was found in the labeled protein analogs at these four positions (<3% of wildtype), which matches the proportion expected by a random distribution.
We analyzed the physico-and biochemical properties of the synthetic RSL variants. Several reports in the literature indicate that the incorporation of fluorinated ncAAs results in increased protein stability. 29−31 We compared the thermostability of the variants to the parent protein using differential scanning fluorimetry (DSF) and differential scanning calorimetry (DSC). Indeed, in the presence of D-mannose, the 5FWcontaining variant showed a slightly increased melting temperature in comparison to the parent protein (T m = 90°C vs T m = 88°C ; Figure S4a). In contrast, the incorporation of 4-and 7FW clearly decreased the melting temperatures to 79 and 72°C, respectively. The same effects were observed by DSC measurements in the presence of the high-affinity ligand methyl α-Lfucopyranoside (αMeFuc) 17 ( Figure S4b). Since the interaction of the RSL variants with the high-affinity ligand exceeded the range of DSF, we used DSC for the analysis. The results exemplify the remarkably pronounced effect of the position of the fluorine atom in the indole ring on protein stability. As shown in Figure 1, Trp residues are involved in interblade contacts, and

ACS Chemical Biology
Articles alteration of their structure may indeed affect the folding (6FW) or modify the protein's stability.
Affinity for Different Ligands. Given the prominent position of the fluorine in the ligand binding site of the synthetic RSL variants, we hypothesized that the variants could show subtle affinity differences for different sugar ligands. Therefore, we analyzed the affinity toward αMeFuc and two tetrasaccharides, blood group H type 2 antigen (HType2, Fucα1−2Galβ1− 4GlcNAcβ1−3Gal) and lewis X (Le X , Galβ1−4[Fucα1−3]-GlcNAcβ1−3Gal) by isothermal titration calorimetry (ITC, Figure S5, Table S3). A dissociation constant, K d , of 1.21 ± 0.04 μM was measured for RSL interacting with αMeFuc, which is in agreement with previous reports. 17 RSL Figure 2). Le X was therefore selected for the structural study to determine the molecular basis of the thermodynamic behavior.
Structural Analysis. The three fluoro-RSL proteins were cocrystallized with the Le X tetrasaccharide using the vapor diffusion method and yielded diamond-shaped crystals diffracting to high resolution (1.15 to 1.35 Å). All structures belonged to the F23 space group with two independent monomers in the asymmetric unit (Table S4,  We clearly observed the electron density of the carbohydrate ligand in all structures ( Figure S6−S8). Similar to previous observations in the RSL/Le X complex, 22 the Le X adopted open conformations (Table S5) that were very different from those observed in solution. 32 The conformational behavior of Le X in RSL was comprehensively described in previous work, 32 and we focus here on the contacts between the fluorinated Trp residues and the oligosaccharide ligand. For the sake of clarity, the description of structures and models below will be centered on the intermonomeric site that does not present significant variations as the intramonomeric site does. Due to the orientation of the indole ring of the Trp residues (Figure 4a

ACS Chemical Biology
Articles 3.1 Å, respectively. The fluorine of Trp31 was in close proximity to O2 of galactose and to O4 of GlcNAc (3.0 and 3.2 Å). Such distances are indicative of favorable contacts, and indeed, fluorine atoms are considered as weak acceptors of hydrogen bonds. 33,34 Fluorination is expected to induce a large modification of the π electron density on the indole ring, 14 so the effect on the affinity may be more complex than the observed weak hydrogen bonds.
Modeling of FW−Ligand Interactions. We performed molecular modeling to dissect the interaction between residues Trp31 and Trp36 that are involved in stacking with fucose and in hydrogen bonding, respectively. Parameterization was necessary to include F-Trp in the model. Atomic charges (Table S6) and other parameters were obtained by quantum chemical calculations and were included in Amber Tools to prepare the topology files of the different complexes. The monosaccharide αMeFuc was selected as the ligand to avoid the superimposition of the interaction analysis with conformational effects. Each RSL variant complexed with αMeFuc was submitted to a molecular dynamics simulation of 500 ns that was used for calculating the interaction energy and its decomposition per residue. 35 All modeled complexes were very stable, and the average structures of the binding sites were similar to the X-ray structures ( Figure S9). The calculated energies of interaction (Table 1) predicted that the parent protein and RSL[7FW] would have a stronger affinity for αMeFuc than the other variants. This is partially in agreement with experimental ITC data, where RSL[4FW] showed the lowest affinity for αMeFuc ( Figure S5).
A detailed analysis of the influence of the fluorination on the two Trp residues closer to fucose, that is, Trp36 and Trp31, demonstrated opposite effects: The binding energy of Trp31 that stacks on fucose is weakened by fluorination. The alteration of the indole charge distribution by fluorine deactivates the aromaticity as previously described for a fluorinated derivative of phenylalanine. 16 In contrast, the energy of interaction of Trp36 that establishes an NH···O hydrogen bond with the O3 of fucose is not decreased in the variants compared to the parent RSL (Table 1). In the case of RSL[7FW], the energy of interaction is even increased, which is in agreement with the formation of a fluorine-based hydrogen bond. A closer analysis of the molecular dynamics trajectories indicates that this variant, which shows direct contact between the fluorine atom and the ligand, presents a very unique behavior of fucose in the binding site ( Figure 5). While the average structure is similar to the one observed by X-ray, rapid fluctuation in the orientation of fucose creates a sporadic strong hydrogen bond between fluorine and HO4 (distance F···H shorter than 2 Å), which competes with the hydrogen bond between HO4 and Glu73.OE1 that is observed in all crystal structures. The rapid change of orientation of fucose is also correlated with a partial loss of stacking with Trp31. The average contacts of fucose in the binding site are not significantly affected (Table 1), but the stability of the complex is lower as indicated by larger deviation in hydrogen bond distances in the RSL[7FW] variant.
Differences in Specificity. Since the Trp residues can interact with other oligosaccharides in addition to the fucose residue, we assessed the specificity of the synthetic lectin variants

ACS Chemical Biology
Articles for >300 carbohydrates via a previously described 36 glycan microarray. RSL and all variants demonstrated rather similar binding patterns ( Figure S10). They bound to all fucosylated oligosaccharides, as described previously for RSL 17 and the structurally related BambL. 37 Nevertheless, a more thorough comparison of the binding pattern to a subset of fucosylated oligosaccharides, that is, the histo-blood group epitopes, revealed some variations as a function of the position of the fluorine atom ( Figure S11). While RSL bound more strongly to oligosaccharides of blood group A and B than to blood group O(H) and Lewis, this tendency was reversed for the fluorinated variants. Particularly RSL[7W] showed weaker binding to blood group B trisaccharide than to other glycans (Figure 6a, B-tri).
The relative change in specificity can be rationalized by manually docking the blood group B trisaccharide into the RSL binding site. We used the structure of the blood group B trisaccharide, which had been docked into the related BambL, 37 and superimposed it on the glycan binding site of RSL[7FW]. Evidently, the galactose residue of blood group B trisaccharide comes in close contact to the fluorine atoms of both Trp53 and Trp31 resulting in local steric conflicts (Figure 6b).

■ CONCLUSIONS
We present here the first structural characterization of lectins containing fluorinated tryptophan residues and their interaction with carbohydrate ligands. The global fluorination of the Trp residues moderately affected the glycan affinity, and it changed the specificity for blood group glycans. The structural analysis and the free energy decomposition by molecular dynamics calculations pointed out that fluorination may induce different effects. The stacking effect is partially lost, due to the deactivation of the aromaticity. Based on our experiments and calculations, the hydrogen bond property of the NH group of the indole ring does not seem to be affected by the presence of the neighboring

ACS Chemical Biology
Articles fluorine. Of interest for the future design of synthetic proteins with noncanonical amino acids is the possibility to create novel contacts, based on the weak hydrogen bond acceptor character of fluorine and the strong capacity of sugars to be involved in such contacts. Either the introduction of additional hydrogen bonds via the fluorine or the weakening of the stacking interaction with the ligand might be exploited. Here, we used a lectin in which several Trp residues participate in glycan binding and their global exchange for fluorinated derivatives leads to effects that interfere with each other. The site-specific incorporation of Trp analogs could be beneficial for a more surgical lectin−carbohydrate interaction analysis. Yet at the moment, this approach is limited to a small set of Trp analogs excluding fluorinated derivatives. 38 The approach described here could be used in the future for incorporation of fluorinated Trp analogs into lectins or other carbohydrate-binding proteins that involve a single Trp residue in the glycan binding. This could be a promising compromise until the advent of more versatile site-specific incorporation systems for these analogs. Precise insertion of fluorination in engineered proteins and the exploration of other hydrogen bond accepting groups represent a promising strategy for the conception of novel receptors for glycans. ) dcm+ Tet r gal λ (DE3) endA Hte ΔtrpC. The construction of this strain will be described elsewhere.
The amino acid sequence of RSL (PDB 2BS5) was back-translated into a DNA sequence codon optimized for the expression in E. coli and ordered as a gBlock (Integrated DNA Technologies, Coralville, IA). The rsl gene was amplified using primers pBP654 and pBP643 and cloned into the EcoRI/HindIII (Thermo Fisher Scientific, Waltham, MA) digested pQE80L vector (Qiagen, Hilden, Germany) via Gibson assembly 39 resulting in plasmid pQE80L-RSL. The DNA, primer, and amino acid sequences can be found in Table S1.
The tryptophan auxotrophic E. coli BWEC47 harboring the pQE80L-RSL plasmid was grown in M9 medium ( ) supplemented with 1% (w/v) casamino acids (BD Biosciences, San Jose, CA), 47 μM L-tryptophan, and 100 μg mL −1 ampicillin at 37°C and 120 rpm to allow for growth until tryptophan depletion occurred at a cell density of D 600 ≈ 3. Subsequently, the cells were incubated for an additional hour to ensure complete depletion of the canonical amino acid, before indole (Sigma-Aldrich, St. Louis, MO) or an indole-analog (4-fluoroindole, Tokyo Chemical Industry Europe, Zwijndrecht, Belgium; 5-, 6-, and 7-fluoroindole, Molekula, Newcastle Upon Tyne, U.K.) was added to a final concentration of 1 mM and protein expression was induced by the addition of 0.5 mM isopropyl β-Dthiogalactopyranoside (IPTG, Sigma-Aldrich). After expression for 18 h at 30°C and 120 rpm, the cells were harvested by centrifugation (20 min at 8000g and 4°C). The pellet was resuspended in lysis buffer (20 mM Tris/Cl, 1 M NaCl, pH 7.4), and the cells were disintegrated by sonication. Cell-free extract was prepared by centrifugation (50 min at 20 000g and 4°C) and loaded onto a mannose-agarose column (Mannose Separapore 4B, BioWorld, Dublin, OH). Unbound protein was removed by washing with washing buffer (20 mM Tris/Cl, 100 mM NaCl, pH 7.4), and RSL or the synthetic variants were eluted with washing buffer containing 100 mM D(+)-mannose (Carl Roth GmbH, Karlsruhe, Germany). The proteins were dialyzed against ddH 2 O for 5 days at 4°C, changing water twice a day, lyophilized, and stored at 4°C until use.
Intact Protein LC-MS Analysis of RSL Species. Protein solutions were diluted to 10 ng μL −1 with water containing 5% ACN and 0.1% FA. Possible protein variations were separated by nano-HPLC (Dionex Ultimate 3000) equipped with a Pepswift precolumn (monolithic, 5 × 0.2 mm 2 ) and a ProSwift RP-4H column (monolithic, 100 μm × 25 cm) (all Thermo Fisher Scientific, Vienna, Austria). One microliter of protein sample was injected and concentrated on the enrichment column for 2 min at a flow rate of 5 μL min −1 with 0.1% formic acid as isocratic solvent. Separation was carried out on the nanocolumn at a flow rate of 1 μL min −1 at 37°C using the following gradient: solvent A is 0.1% formic acid in water and solvent B acetonitrile containing 0.1% formic acid: 0−2 min, 5% B; 2−17 min, 5−60% B; 17−20 min, 60% B; 20−20.1 min, 60−5% B; 20.1−29 min, 5% B. The maXis II ETD mass spectrometer (Bruker, Bremen, Germany) was operated with the captive spray source in positive mode with following settings: mass range, 300−3000 m/z; 1 Hz; source voltage, 1.6 kV; dry gas flow, 3 L min −1 ; 180°C. The protein mass spectra were deconvoluted by the data analysis software, using the MaxEnt2 algorithm. The following main parameters were applied: charge carrier, H + ; m/z range, min. 800 to max. 2000; minimum instrument resolving power was set to 50 000. For peak detection SNAP algorithm with the following parameters was used: quality factor threshold 0.9, S/N threshold 2, and maximum charge state of 12. Glycan Microarray Analysis. The lectins were biotinylated using the EZ-Link Sulfo-NHS-LC-Biotinylation Kit (Thermo Fisher), according to the manufacturers recommendations. The glycan microarrays contained 317 carbohydrate ligands and were printed as described previously. 36 The microarray slides were blocked with blocking buffer (50 mM ethanolamide in 50 mM borate buffer, pH 8.0) for one hour and subsequently thoroughly rinsed with ddH 2 O. The biotinylated protein samples were diluted in PLI-P buffer (0.5 M NaCl, 3 mM KCl, 1.5 mM KH 2 PO 4 , 6.5 mM Na 2 HPO 4 , 1% (w/v) BSA, 1% (v/v) Triton-X-100, pH 7.4) to a protein concentration of 1 μg mL −1 and incubated on the microarray slides in a humidified chamber at RT with gentle agitation for one hour. After incubation, the protein sample was removed and the slides were washed with PBS buffer (10 mM Na 2 HPO 4 , 10 mM NaH 2 PO 4 , 138 mM NaCl, 2.7 mM KCl, pH 7.4). The microarray slides were then incubated with cyanine-3 labeled streptavidin (Thermo Fisher) diluted 1:500 in PLI-P buffer for 1 h, as described above. Subsequently, the slides were washed with PBS buffer, rinsed with ddH 2 O, and stored in the dark until scanning. Fluorescent measurements were performed using the ScanArray 4000 Microarray Analysis System (PerkinElmer, Waltham, MA). Fluorescence intensities were quantified using the ScanArray Express Microarray Analysis System 4.0 (PerkinElmer) and the data was further analyzed with Microsoft Excel (Microsoft Corporation, Redmond, WA).
Isothermal Titration Calorimetry (ITC). ITC experiments were performed on a MicroCal iTC200 microcalorimeter (Malvern Instruments Ltd., Worcestershire, UK). The experiments were carried out at 25°C. αMeFuc (Tokyo Chemical Industry Europe), HType2, and Le X tetrasacharides (ELICITYL, Crolles, France), as well as the lyophilized proteins, were dissolved in the same buffer (20 mM Tris/Cl, 100 mM NaCl, pH 7.5). Protein concentrations in the microcalorimeter cell (204 μL) varied from 0.0476 to 0.0554 mM and between 0.242 and 0.266 mM for measurements with αMeFuc/HType2 and Le X , respectively. Twenty injections with 1 μL of sugar solutions (1 or 1.5 mM of αMeFuc and HType2 or 5 mM of Le X ) were performed at intervals of 120 s with stirring at 750 rpm. The obtained data was integrated and fitted to a theoretical titration curve with Origin software (OriginLab Corporation, Northampton, MA) applying the one-set-of-sites model. The protein concentration was determined using NanoDrop 2000 spectrophotometer (Thermo Fisher Scientific). An extinction coefficient of 43.68 × 10 4 M −1 cm −1 was used. For the variants, the concentration was adjusted in the software to obtain the binding site stoichiometry of n = 2, that is, one inter-and intramonomeric binding site per monomer as confirmed by the crystal structures.
Crystallization and Structure Determination. The three fluorinated RSL samples were dissolved in 20 mM Hepes 7.5 and 150 mM NaCl to 8 mg mL −1 and cocrystallized with 1 mM Le X tetrasaccharide (Ellicityl, Crolles, France). Crystallization screening was performed using the hanging drop vapor diffusion at 19°C. Diamond shaped crystals were obtained in a few days from solution 1−8 and 1−9 of the Morpheus screen or 1−30 from the Midas screen (Molecular Dimension Ltd., Newmarket, UK). Single crystals were directly mounted in a cryoloop and flash frozen in liquid nitrogen. RSL[7FW] diffraction data were collected at 100 K at the European Synchrotron Radiation Facility (Grenoble, France) on BM30A-FIP using an ADSC Q315r detector, while those for RSL[4FW] and RSL[5FW] were collected at Synchrotron Soleil (Saint Aubin, France) on Proxima 1 using a Pilatus 6 M detector. The data were processed using XDS, 40 and their quality statistics are summarized in Table S4. All further computing was performed using the CCP4 suite unless otherwise stated. 41 The RSL[7FW] and RSL[5FW] structures were solved by molecular replacement (MR) using PHASER and the coordinates of chain A from PDB-ID 2BT9 to search for two copies in the asymmetric unit. 42 After initial rebuilding with ARP/WARP, 43 the model was refined with restrained maximum likelihood refinement using REFMAC 5.8, 44 iterated with manual rebuilding in Coot. 45 The RSL[4FW] structure was also solved by MR using PHASER, but the coordinates of the protein chain A of RSL[5FW] were used as the search model. No rebuilding with ARP/WARP was required. Anisotropic refinement was introduced at the end of the refinement of all structures. Five percent of the observations were set aside for cross-validation analysis, and hydrogen atoms were added in their riding positions and used for geometry and structure-factor calculations. Incorporation of the ligand was performed after inspection of 2F o − DF c weighted maps. The 7FW ligand library was constructed with Acedrg. 46 Water molecules, introduced first with ARP/WARP and then automatically using Coot, were inspected manually. The model was validated with the wwPDB Validation server (http://wwpdb-validation.wwpdb.org), and the glycans were checked with Privateer. 47  We clearly observed the electron density of the carbohydrate ligand in both binding sites of all structures with the exception of chain A of RSL[4FW], where a glycerol molecule was present in the intramonomeric site. The electron density in the intermonomeric site was of higher quality than in the intramonomeric site in all structures, and the tetrasaccharide was distinctly defined. In contrast, some parts were conformationally disordered in the intramonomeric sites.
Molecular Modeling. The crystal structure of wild-type RSL complexed with αMeFuc was used as a starting reference to build all models (PDB ID 2BT9). 17 The fluorinated tryptophans were parametrized for compatibility with the AMBER-ff14SB force field, 48 using available standard parameters for fluorine for bonding and van der Waals parameters. Fluorination is expected to induce a large modification of the π electron density on the indole ring. 14 Therefore we performed quantum chemistry calculations in order to obtain representative atomic charges for each fluorinated tryptophan. RESP charges 49 were calculated following the recommended setup of the PyRED server 50 (http://upjv.q4md-forcefieldtools.org/REDServer-Development/). The atomic charges and modified parameters were included in Amber Tools to prepare the topology files of the different systems. Molecular dynamics simulations were carried out using the pmemd module of the AMBER12 code (http://ambermd.org). The AMBER-ff14SB 48 and GLYCAM06-j 51 force fields were used for protein and saccharides, respectively, together with the TIP3 water model. 52 The protonation states of titratable residues were assigned using the H+ + web server (http://biophysics.cs.vt.edu/H++). Standard protonation states were observed for each RSL variant. Short-range van der Waals interactions were switched to zero at a cutoff distance of 8 Å. The longrange electrostatic interactions were computed by means of the particle mesh Ewald (PME) method 53 using a real-space cutoff of 8 Å. The SHAKE algorithm was applied to constrain bonds involving hydrogen atoms, and thus an integration time step of 2 fs could be used. The temperature was maintained at 300 K using the Langevin thermostat with a collision frequency of 2 ps −1 . A length of 500 ns of NpT simulation was performed for each RSL variant. The MD trajectories were postprocessed to compute free energies of binding using the MMPBSA.py program. 35 The intramonomeric binding site of chain A complexed with α-L-Me-fucoside was used to make the calculations. The free energy of binding was computed for each RSL variant using the Poisson−Boltzmann surface area (PBSA) framework combined with normal modes analysis. Due to computational cost of the Poisson− Boltzmann method, the energy decomposition per residue was performed from generalized Born surface area (GBSA) calculations. 35 ■ ASSOCIATED CONTENT

* S Supporting Information
The Supporting Information is available free of charge on the ACS Publications website at DOI: 10.1021/acschembio.8b00377.
Sequences of RSL and primers used, quantification of FW incorporation into RSL, affinity constants and thermody-

ACS Chemical Biology
Articles namics parameters, data collection and refinement statistics for fluorinated RSL co-crystallized with Le X tetrasaccharide, conformation of the Le X tetrasaccharide in each binding site, atomic charges used for Amber parameterization of the fluorinated tryptophan residues, expression and purification of fluorotryptophan-containing RSL variants, mass analysis of the fluorinated RSL variants, quantification of the incorporation of FW at individual positions, melting temperatures of RSL and fluorinated variants, ITC data of the interaction of RSL and fluorinated variants with αMeFuc and the two tetrasaccharides, electron density in the binding sites of the variant lectins complexed with the tetrasaccharide Le X , superimposition of X-ray crystal structure and averaged one from MD calculations in the intermonomeric binding site of fucose in RSL[7FW], image analysis of glycan microarrays with RSL and the FW-containing variants, and interaction of the RSL and variants with the eight selected oligosaccharides (PDF)