April 10.1038/s41586-020-2012-7 An engineered stable mini-protein to plug SARS-Cov-2 Spikes Maria Romano± 0 Alessia Ruggiero± 0 Flavia Squeglia± 0 Rita Berisio rita.berisio@cnr.it 0 Institute of Biostructures and Bioimaging , IBB, CNR, Napoli , Italy 2020 29 2020 270 273

±These authors equally contributed to this work Abbreviations: COVID-19 - Coronavirus disease 19; PDB - Protein Databank; RBD - Receptor Binding Domain; RBM - Receptor Binding Motif; ACE2 - angiotensin-converting enzyme 2; MST - Microscale thermophoresis.

SARS-CoV-2 COVID-19 viral entry Spike protein protein structure infectious disease
-

The novel betacoronavirus SARS-CoV-2 is the etiological agent of the current pandemic COVID-19. Like other coronaviruses, this novel virus relies on the surface Spike glycoprotein to access the host cells, mainly through the interaction of its Receptor Binding Domain (RBD) with the human angiotensin-converting enzyme 2 (ACE2). Therefore, molecular entities able to interfere with binding of the SARS-CoV-2 Spike protein to ACE2 have a great potential to inhibit viral entry. Starting from the available structural data on the interaction between SARS-CoV-2 Spike protein and the host ACE2 receptor, we here engineered a mini-protein with the aim of creating a soluble and stable Spike interactor. This mini-protein, which was recombinantly produced in high yields, possesses a stable α helical conformation and is able to interact with the RBD of glycosylated Spike protein from SARSCoV-2 with nanomolar affinity, as measured by microscale thermophoresis. By plugging the Spike protein, our mini-protein constitutes a valid tool for the development of treatments against different types of coronavirus.

INTRODUCTION

The novel coronavirus SARS-CoV-2 has spread widely and rapidly since it was first identified in Wuhan, China in December 2019 [1-3]. Its associated disease, COVID-19, causes severe respiratory difficulties, with aged patients at higher risk of mortality [1]. At the time of writing, over 3.000.000 confirmed cases and over 200.000 deaths have been registered worldwide. Given the dramatic public health emergency, there is a strong and urgent need for new antiviral agents to block human-to-human transmission and to treat infected patients.

Like other coronaviruses, SARS-CoV-2 makes use of a densely glycosylated Spike (S) protein to gain access into host cells [4,5]. The S protein forms homotrimers protruding from the viral envelope and binds with high affinity to the host receptor ACE2 (angiotensin-converting enzyme 2), mainly expressed by epithelial cells of the respiratory tract [6]. Recently, another human receptor, CD147, has been identified as a possible route of viral entrance, again mediated by the S protein [7]. CD147 is also known as Basigin or EMMPRIN, and is a transmembrane glycoprotein that belongs to the immunoglobulin superfamily involved in

many processes, including tumor development, plasmodium invasion and virus infection [7]. After attachment, the human transmembrane protease serine 2 (TMPRSS2) cleaves and activates the S protein, thus allowing SARS-CoV-2 to enter the host cells [8]. Compared to SARS-CoV, an additional protease, possibly furin, is likely involved in priming of the SARS‐Cov‐2, since the S protein of SARS‐CoV‐2 contains four redundant furin cut Pro-ArgArg-Ala motifs, which are absent in SARS‐CoV [6]. The S protein contains two subunits, S1 and S2. Of these, S1 comprises the receptor binding domain (RBD), which is responsible for recognizing and binding the cell surface receptor ACE2 [5]. Being essential for infection, the S protein is a promising target for antibodies and vaccines [9]. Indeed, preventing attachment of the S protein to either the ACE2 or the CD147 receptors would hamper infection at the early viral entry step. Based on the structural information available on the complex between the S protein and the ACE2 receptor [10], we engineered a miniaturised mimic of ACE2, here named Spikeplug. This mini-protein, which we produced in high yields, is highly soluble, conformationally stable and displays nanomolar binding affinity with the S protein of SARS-CoV-2. Given its properties, Spikeplug is a promising molecule for the development of treatments based on inhibition of viral entry for different types of coronavirus.

RESULTS Spike interactor design and synthesis

We used all available structural information on the interactions between S protein and the human ACE2 receptor to design Spike interactors. An analysis of the structure of SARS-CoV-2 complex with ACE2, reveals that most hydrogen bonds between protein S and ACE2 involve the helix H1 (residues 20-52) and the C-terminal part of helix H2 (residues 56-82) of ACE2 [10]. Outside this region, an important interaction between the S protein and ACE2 is mediated by Lys353 of ACE2. Indeed, the presence of histidine in the place of lysine at position 353 of rat ACE2 disfavours viral binding [11].

Starting from the region H1-H2 of ACE2 as a basic scaffold for the design of a Spike interactor, we included a number of mutations to increase its stability and solubility and with the aim to compensate for the missing interactions mediated by residues outside helices H1 and H2. An important issue with peptide or small protein design is their conformational stability, since peptides are usually not folded in solution. Therefore, to stabilise this scaffold, we included an extra helix, H3 (residues 91-101) which naturally caps helices H1 and H2 in the ACE2 structure, through a cluster of hydrophobic interactions involving Val92, Leu97 and Leu100 of H3, Phe28, Leu29, Phe32 of H1 and Ala80 of H2, therefore holding together the two helices H1 and H2 (Figure 1). In addition, we noticed that Glu37 is important to stabilise the structure of the ACE2 receptor by forming a salt bridge interaction with Arg393 and a hydrogen bond with the backbone N of Lys353 (Figure 2A). However, this residue forms a weak hydrogen bond with Tyr505, which is considered outside the Receptor Binding Motif (RBM) of the RBD domain (Figure 2A), in the ACE2 complex with the RBD of the S protein from SARS-CoV-2 [10], while it is not engaged in any contact in the complex of ACE2 with RBD of the S protein of SARS-CoV (74% homologous to SARS-CoV-2 and 83% on the RBD domain) [11]. Therefore, we mutated Glu37 to an arginine residue and energy minimised using GROMACS [12]. As shown in Figure 2A, Arg37 side chain can form hydrogen bonds with the main chain of Gly496 of the S protein (Figure 2A). As such, the conformation of Arg37, also stabilised by a salt bridge with Asp38, mimics the interactions with the S protein mediated by Lys353 of ACE2 (Figure 2A) [10,11]. We further engineered this variant by including two mutations, Leu 91 to Gln and Leu 95 to Gln, to improve protein solubility by replacing hydrophobic with hydrophilic residues, and hamper aggregation in solution. In ACE2, these two residues are located on the opposite side of the S-binding region and form hydrophobic interactions with the core of ACE2 to stabilise it (Figure 2B,C). Their mutation to glutamine residues was also motivated by the characteristics of glutamine to further stabilize α helix structures. This recombinant mini-protein, here named Spikeplug (Table 1, Figure 2C), was successfully over-expressed in E.coli, resulting in a high yield, of 70 mg of pure protein from one litre of bacterial culture. 2 Spike protein (blue/cyan) and the human ACE2 receptor (grey/green). The receptor binding motif (RBM) is drawn in cyan. The green portion of the ACE2 domain including helices H1, H2 and H3 is drawn in green.

Structural stability and binding studies

Using far-UV CD spectroscopy, we observed that the spectrum of Spikeplug is typical of a wellstructured fold with a high α-helical content (Figure 3A), with typical minima at 208 and 222 nm (Figure 3). Spectra also showed that folding is fully reversible, with the CD spectrum after refolding fully superimposable to that recorded at 4°C (Figure 3A). To investigate the heat-induced changes in the protein secondary structure, thermal unfolding curves were recorded by following the CD signal at 208 nm as a function of temperature, thus providing a melting temperature Tm of 37°C (Figure 3B). Microscale thermophoresis (MST) analysis was performed to measure the binding affinity of Spikeplug to the glycosylated S protein of SARS-CoV-2. MST measurements showed nanomolar binding affinity of Spikeplug to the Spike RBD domain, with a KD of 40 nM (Figure 4).

DISCUSSION

Blocking the very first step of SARS-CoV-2 entry into host cells by hampering the interaction of the S protein with the cellular receptor, represents a highly promising therapeutic strategy against COVID-19. We focused on generating a Spike interactor that resembles the human ACE2 receptor as we argued that such type of molecule would act against all known coronaviruses using this receptor, such as SARS-CoV and HCoV-NL63, as well as other coronavirues possibily emerging in the future. Several structures of the complex between the S protein and the ACE2 receptor have been recently published. Based on this structural information, we designed a mini-protein which embeds all important Spike-interacting residues of the ACE2 receptor. This miniaturised mimic of ACE2, was properly engineered to be stable in solution and to enhance its affinity with the S protein. Indeed, an important issue with peptide- or protein-based drugs is its conformational stability. This miniprotein, Spikeplug, presents a highly stable α helical conformation in solution and a nanomolar binding affinity to the RBD domain of glycosylated S protein from SARS-CoV-2. The dissociation constant KD determined here using MST, 40 nM, is in the same range of the affinity measured between the S protein and the ACE2 receptor of 14.7 nM [5]. We are currently designing additional variants to further improve binding affinity to the S protein.

Our basic idea is that mimics of the host receptor have a great advantage over other drugs for several reasons. First, if a novel coronavirus will emerge, able to infect humans through the ACE2 receptor, then its S proteins will be recognised and plugged by our receptor mimics (Figure 5), blocking viral entry. Importantly, Spikeplug is large enough to cover most important interactions with the S protein, but small enough to bind simultaneously to the three chains of the S protein, thus increasing its potential to hamper interactions with the ACE2 receptor (Figure 5). Another point we considered in our research was drug resistance, as pathogens can easily mutate to escape therapeutic treatments. However, if the virus mutates to escape binding to Spikeplug, then it will also likely drop its affinity for the natural ACE2 receptor, resulting in suicide action. The validity of such an approach has been demonstrated by the development of the antiretroviral drug Enfuvirtide, which blocks the action of the HIV-1 gp41 fusion protein, thus preventing viral entry. As for all antiviral drugs, also Enfuvirtide can select for resistance mutations. However, it has been shown that resistance to the drug comes at a serious fitness cost for the virus since it involves structural alterations in an essential component of the virus-host cell fusion complex [13]. Hence, targeting the receptor binding domain of SARS-CoV2 might also result in a high genetic barrier towards resistance. Last, we expect Spikeplug to be well tolerated because of its almost complete identity with the correspondent domains of the endogenous human protein.

In conclusion, we believe that Spikeplug is a first-in class promising lead candidate for the development of molecular targeted therapies against SARS-CoV-2 and other coronaviruses.

EXPERIMENTAL METHODS Molecular design

Molecular modelling was initially performed using the cryo EM structure of the SARS-CoV-2 S protein (PDB code 6vsb) [5] and the crystal structure of the complex between ACE2 and the RBD domain of SARS-CoV S protein (PDB code 2ajf) [11]. To complete missing regions in the cryo EM structure, we computed the homology model of SARS-CoV-2 S protein using MODELLER [14], and the structure 2ajf as a template. While this work was in progress, the crystal structure of the complex between ACE2 and the RBD domain of the S protein from SARS-CoV-2 (PDB code 6m0j) was released [10]. In this structure, most of the interactions between the S protein and ACE2 are conserved. Given the high sequence identity between RBD of SARS-CoV and SARS-CoV-2 (84%), we relied on both structures to take also possible crystal packing bias into account. Mutations in the basic scaffold were generated using the software Coot [15]. Models were energy minimised using the GROMACS package [12]. Figures were generated with Pymol [16].

Protein expression and purification

The gene encoding Spikeplug was synthesised and codon-optimised for E. coli expression by GeneArt® Gene Synthesis company (Invitrogen) and subsequently sub-cloned into pETM-13 vector (EMBL, Germany) between the NcoI and XhoI restriction sites. This plasmid endows the placement of a histidine-tag at the C-terminal of the mini-protein. The expression level was optimised after carrying out small scale expression screening using different E. coli available strains. From the screenings, the recombinant protein was successfully over-expressed in E.coli BL21(DE3) cells, resulting in a yield of 70 mg of pure protein from one litre of bacterial culture. Briefly, an overnight starting culture of 10 mL was prepared for growth in 1L of Luria Bertani (LB) medium containing 50µg L−1 kanamycin, which was then induced with 0.8 mM of IPTG at 16°C for approximately 16 h. The protein was purified by sonicating bacterial cells resuspended in binding buffer (300mM NaCl, 50mM Tris-HCl, 10 mM imidazole, 2.5% (v/v) glycerol, pH 7.8) containing a protease-inhibitor cocktail (Roche Diagnostics). The lysate was cleared by centrifugation at 18000 rpm at 4°C and the supernatant was loaded onto 5 mL Ni–NTA resin (Qiagen, Milan, Italy) equilibrated with binding buffer. After washing with ten volumes of binding buffer, the protein was eluted by adding 300 mM imidazole to the binding buffer. The fractions containing the eluted protein were pooled, concentrated and then loaded onto a Superdex 75 HR 10/30 gel-filtration column (GE Healthcare) equilibrated with 150mM NaCl, 50mM Tris-HCl, 2.5% (v/v) glycerol pH 7.8 for a further purification step. The sample eluted in a single peak and was homogeneous, as judged by 18% SDS–PAGE analysis. The protein was concentrated using a centrifugal filter (Merck Millipore) and the concentration was determined by UV absorbance using the corresponding ɛ values (M-1 cm-1).

CD spectroscopy

CD measurements were carried out using a JASCO J-815 CD spectropolarimeter equipped with a Peltier temperature controller (Model PTC-423S) at 4°C in a 0.1 cm optical path length cell in the 200–260 nm wavelength range. Protein concentration was 0.2 mg/mL in 20 mM sodium phosphate buffer (pH 7.4). The data was recorded with a scanning speed of 20 nm/min and a band width of 1 nm. All spectra were averaged from three scans and baseline-corrected using a blank consisting of the protein buffer. The molar ellipticity per mean residue, [Ѳ] in deg· cm2· dmol−1, was calculated from the following equation: [Ѳ] = [Ѳ]obs × mrw × (10 × l × C)−1 where [Ѳ]obs is the ellipticity measured in degrees, mrw is the mean residue molecular mass (117.9 Da), l is the optical path length of the cell in cm, and C is the protein concentration in g/L. During the melting experiments, CD spectra were collected with increasing temperature from 4°C to 80 °C with an average rate of 1 °C/min. Thermal denaturation was investigated by recording the CD signal at 208 nm.

Microscale thermophoresis (MST) binding studies

The thermophoretic measurements were performed using Monolith NT.115 device with red detection channel (NanoTemper Technologies, Munich, Germany). Recombinant Spikeplug was produced in our laboratory, while SARS-CoV-2 Spike RBD was purchased from Sino Biological Inc. For MST recording, the SARS-CoV-2 Spike RBD was labelled with the fluorescent dye NT647 using the protocol suggested from the NanoTemper. Thermophoretic experiments were conducted using Monolith NT.115 (NanoTemper Technologies, Munich, Germany). Briefly, 90 µ l of a 10 µ M solution of SARS-CoV-2 Spike protein (RBD) in labelling buffer (130 mM NaHCO3, 50 mM NaCl, pH 8.2) was mixed with 10 µl of 300 µM

NT647-N-hydroxysuccinimide fluorophore (NanoTemper Technologies) and incubated in the dark for 30 min at room temperature. Spike RBD concentration after labelling (NT647-Spike RBD) was measured using a UV-Vis spectrophotometer and the labelling efficiency was determined to be about 25%.

The MST experiment was performed in a buffer containing 200 mM NaCl, 50mM Tris-HCl, 0.05% (v/v) Tween-20, 2.5% (v/v) glycerol, pH 7.8. 10 µ L of NT647-Spike RBD 50 nM was mixed with 10 µ L of 16 serial dilution of Spikeplug. The final concentration of NT647-Spike RBD was 25 nM in all samples, whereas Spikeplug concentration ranged from 0.15 nM to 5 μM. Samples were then loaded into sixteen premium-coated capillaries (NanoTemper Technologies) and fluorescence was recorded for 20 s using 100% laser power and 40% MST power. The temperature of the instrument was set to 25°C for all measurements. After recording the MST time traces, data were analysed. KD value was calculated from ligand concentration-dependent changes in the normalised fluorescence (Fnorm) of NT647-Spike RBD after 14 s of thermophoresis. The assay was performed in duplicate and the values reported were generated through the usage of MO Affinity Analysis software (NanoTemper Technologies).

ACKNOWLEDGEMENTS

this manuscript.

We thank Prof. Giovanni Maga, Institute of Molecular Genetics IGM-CNR, for careful reading of

Sequence

19stieeqaktfldkfnheaRdlfyqsslaswnyntniteenvqnmnnagdkwsaflke

Mutation Rationale

E37R L91Q, L95Q

To affinity protein enhance

binding with the

S To increase solubility, hamper aggregation and improve propensity α helical

Wu, J.T.; Leung, K.; Leung, G.M. Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study. Lancet 2020, 395, 689-697, doi:10.1016/S0140-6736(20)30260-9.

Lu, R.; Zhao, X.; Li, J.; Niu, P.; Yang, B.; Wu, H.; Wang, W.; Song, H.; Huang, B.; Zhu, N., et al. Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding. Lancet 2020, 395, 565-574, doi:10.1016/S01406736(20)30251-8.

Luan, J.; Lu, Y.; Jin, X.; Zhang, L. Spike protein recognition of mammalian ACE2 predicts the host range and an optimized ACE2 for SARS-CoV-2 infection. Biochem Biophys Res Wrapp, D.; Wang, N.; Corbett, K.S.; Goldsmith, J.A.; Hsieh, C.L.; Abiona, O.; Graham, B.S.; McLellan, J.S. Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation. Science 2020, 367, 1260-1263, doi:10.1126/science.abb2507.

Lukassen, S.; Chua, R.L.; Trefzer, T.; Kahn, N.C.; Schneider, M.A.; Muley, T.; Winter, H.; Meister, M.; Veith, C.; Boots, A.W., et al. SARS-CoV-2 receptor ACE2 and TMPRSS2 are primarily expressed in bronchial transient secretory cells.

EMBO J

2020, 10.15252/embj.20105114, e105114, doi:10.15252/embj.20105114.

Wang, K.; Chen, W.; Zhou, Y.-S.; Lian, J.-Q.; Zhang, Z.; Du, P.; Gong, L.; Zhang, Y.; Cui, H.-Y.; Geng, J.-J., et al. SARS-CoV-2 invades host cells via a novel route: CD147-spike protein. BioRxiv 2020. et al. Structure of the SARS-CoV-2 spike receptor-binding domain bound to the ACE2

Li, F.; Li, W.; Farzan, M.; Harrison, S.C. Structure of SARS coronavirus spike receptorbinding domain complexed with

Science

2005, 309, 1864-1868,

Van Der Spoel, D.; Lindahl, E.; Hess, B.; Groenhof, G.; Mark, A.E.; Berendsen, H.J. GROMACS: fast, flexible, and free. J Comput Chem 2005, 26, 1701-1718, 13.

Reeves, J.D.; Lee, F.H.; Miamidian, J.L.; Jabara, C.B.; Juntilla, M.M.; Doms, R.W.

Hoffmann, M.; Kleine-Weber, H.; Schroeder, S.; Krüger, N.; Herrler, T.; Erichsen, S.; Schiergens, T.S.; Herrler, G.; Wu, N.H.; Nitsche, A., et al. SARS-CoV-2 Cell Entry Depends on ACE2 and TMPRSS2 and Is Blocked by a Clinically Proven Protease Inhibitor. Cell 2020, 10.1016/j.cell.2020.02.052, doi:10.1016/j.cell.2020.02.052.

Walls, A.C.; Park, Y.J.; Tortorici, M.A.; Wall, A.; McGuire, A.T.; Veesler, D. Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein. Cell 2020, 181, 281-292

Enfuvirtide resistance mutations: impact on human immunodeficiency virus envelope function, entry inhibitor sensitivity, and virus neutralization. J Virol 2005, 79, 4991-4999,

Webb, B.; Sali, A. Protein Structure Modeling with MODELLER. Methods Mol Biol 2017,

Zhou , P. ; Yang , X.L. ; Wang , X.G. ; Hu , B. ; Zhang, L. ; Zhang, W.; Si, H.R. ; Zhu, Y. ; Li , B. ; Huang , C.L. , et al. A pneumonia outbreak associated with a new coronavirus of probable bat Lan , J.; Ge , J. ; Yu , J. ; Shan, S. ; Zhou , H. ; Fan, S. ; Zhang, Q. ; Shi , X. ; Wang , Q. ; Zhang, L., 15 .