Extremely Amyloidogenic Single-Chain Analogues of Insulin’s H-Fragment: Structural Adaptability of an Amyloid Stretch

Relatively short amino acid sequences often play a pivotal role in triggering protein aggregation leading to the formation of amyloid fibrils. In the case of insulin, various regions of A- and B-chains have been implicated as the most relevant to the protein’s amyloidogenicity. Here, we focus on the highly amyloidogenic H-fragment of insulin comprising the disulfide-bonded N-terminal parts of both chains. Analysis of the aggregation behavior of single-chain peptide derivatives of the H-fragment suggests that the A-chain’s part initiates the aggregation process while the disulfide-tethered B-chain reluctantly adapts to amyloid structure. Merging of both A- and B-parts into single-chain continuous peptides (A–B and B–A) results in extreme amyloidogenicity exceeding that of the double-chain H-fragment as reflected by almost instantaneous de novo fibrillization. Amyloid fibrils of A–B and B–A present distinct morphological and infrared traits and do not cross-seed insulin. Our study suggests that the N-terminal part of insulin’s A-chain containing the intact Cys6–Cys11 intrachain disulfide bond may constitute insulin’s major amyloid stretch which, through its bent conformation, enforces a parallel in-register alignment of β-strands. Comparison of the self-association behavior of H, A–B, and B–A peptides suggests that A-chain’s N-terminal amyloid stretch is very versatile and adaptive to various structural contexts.


■ INTRODUCTION
Nowadays, formation of amyloid fibrils is recognized as a generic process accessible to various proteins and peptides. 1,2 Due to association with a number of degenerative maladies (e.g., Alzheimer's disease, type 2 diabetes mellitus 3−5 ), as well as benign biological functions essential for numerous organisms, 6−8 research on amyloidogenesis has expanded considerably in recent years. Local inter-and intramolecular interactions driving self-assembly of amyloid fibrils are identical with those responsible for folding of the native state. However, formation of thermodynamically stable fibrillar structures imposes specific requirements on amino acid sequence, backbone topology, and conformation of aggregating proteins. 9−11 For example, β-sheet-breaking proline residues, uncompensated Coulombic repulsion between ionized side chains, and a main-chain topology restricted by numerous disulfide bonds are all intuitively expected to decrease the tendency to form fibrils. In accordance with this notion, early studies revealed that, for example, proline substitutions in rodent islet amyloid polypeptide (IAPP) prevent its in vivo aggregation and therefore spontaneous-onset type 2 diabetes mellitus. 12 On the other hand, it has been shown that protein amyloidogenicity may, in fact, adapt well to the presence of proline residues at some sites in the sequences 13 while marginally uncompensated electric charges may even enhance aggregation; 14 there is also evidence that disulfide bonds (both intra-and interchain) may accelerate formation of fibrils in specific cases. 15−17 Hence the actual impact of even such elementary factors on a protein's propensity to form fibrils depends on the structural and thermodynamic context. This, in turn, foreshadows the difficulty of accurately predicting amyloidogenic propensities of various amino acid sequences. It has been elegantly demonstrated by Minor and Kim that whether a particular main-chain segment within a globular protein adopts β-sheet conformation depends strongly on the tertiary context, as opposed to being determined solely by the intrinsic secondary structure preference. 18,19 From this it follows that an actual amyloidogenic propensity of a given amino acid sequence depends also on the quaternary context; i.e., its accurate estimation would require a priori knowledge of the 3D structure of the fibril. Despite these fundamental challenges, computational algorithms aimed at predicting amyloidogenic tendencies of amino acid sequences (and even of whole folded proteins, e.g., refs 11, 20−24; for a concise review, see ref 10) have been developed with some success. In part, the underlying rationale is that the abundance of certain hydrophobic amino acid residues, especially when arranged in particular sequence patterns, increases amyloidogenicity. 25 −29 In a permissive conformational context, a relatively short amyloidogenic segment (so-called amyloid stretch) of the main chain may drive the whole protein to form fibrils. 29,30 A distinction should be made between the latter term and a core region which primarily corresponds to a portion of amino acid sequence involved in formation of socalled steric zippers within amyloid fibrils. Mechanisms through which amyloid stretch segments induce their less aggregationprone surroundings to convert into fibril remain elusive.
Previously, a highly amyloidogenic double-chain fragment released upon partial digestion of bovine insulin (BI) with pepsin was identified. 31 This so-called H-fragment consists of N-terminal segments of insulin's A-and B-chains linked by a Cys7A−Cys7B disulfide bond; the A-chain part is bent due to another Cys6A−Cys11A disulfide bond ( Figure 1A). Singly dispersed H-fragment monomers are disordered in aqueous solutions but undergo rapid aggregation into amyloid-like fibrils with the infrared spectral features typical of parallel βsheet structure. 31,32 According to the follow-up study the highly amyloidogenic character of H-fragment arises from the Cys6A−Cys11A disulfide-restricted A-chain part while the B chain part itself is refractory to aggregation. 17 Our initial interest that motivated this work was focused on the versatility of A-chain's N-terminal segment with respect to the B-chain partner. Specifically, we were asking whether different structural arrangements of A-and B-parts (e.g., merging both parts into a single polypeptide chain) would preserve the highly amyloidogenic character of the H-fragment. and B-7A could be solubilized according to the previously described protocol 17 consisting of dispersion of solid peptide samples in 8 M guanidine hydrochloride (GdnHCl) solution, pH 9.0, crude A−B and B−A proved to be resistant to this approach. We have established an alternative solubilization protocol consisting of the following: (i) initial sonication-assisted dispersion of crude peptide in dimethyl sulfoxide (DMSO) at ∼5.6 mg/mL concentration, followed by (ii) 24-h-long (or 18-h-long in the case of the data presented in Figure 3) agitation at 37°C, followed by (iii) addition of 1/9 vol of H 2 O acidified to pH 1.64 and continued vigorous agitation at 37°C for 24 h. Such a 24 h + 24 h incubation protocol turned out to be extremely effective in solubilizing otherwise insoluble crude A−B, B−A, and H peptides. This holds true even if the incubation period is shortened, as is the case of the preliminary experiment summarized in Figure 3 (18 h + 2 h) in which acidified D 2 O (pD 1.64; pH-meter readout unadjusted for isotopic effects) was used instead of H 2 O in order to enable monitoring of the conformational state of peptide in solution with infrared spectroscopy. Thus, prepared concentrated peptide solutions in 90% DMSO/water system were swiftly diluted 10 times with acidified water in order to trigger spontaneous aggregation of disordered monomers into amyloid fibrils ( Figure 4). Unless otherwise specified, the typical composition of liquid samples for  B−B sample contained also the addition of TCEP at 6.8 mg/mL concentration which, by rapid reduction of the disulfide bond, released B-SH peptide in situ; 17 B-7A peptide was dissolved at 1 mg/ mL concentration in 9 vol/vol % DMSO in 50 mM NaCl, 20 μM ThT, pH 1.9. Measurements were carried out at 37°C. Insulin seeding experiments were conducted on freshly prepared 0.12 wt/vol % BI solutions in 100 mM NaCl, 20 μM ThT, pH 1.9. Thus, prepared liquid samples of native BI were mixed with preformed and sonicated amyloid fibrils at 1:20 amyloid:native BI mass ratio. Details of the sonication procedure and equipment used were described earlier. 33 Fibrillization Kinetics. ThT-fluorescence measurements (λ ex 440 nm/λ em 485 nm) of peptide aggregation kinetics were conducted on a CLARIOstar plate reader from BMG LABTECH (Offenburg, Germany) using 96-well black microplates. Typically, each well was filled with a 150 μL volume of peptide solution containing ThT. Measurements were carried out at 37°C with or without moderate agitation for 48 h, or for periods of time specified in the figures. Each kinetic trace was calculated as an average of three independently collected trajectories (the error bars correspond to the standard deviations). Once kinetic measurements were completed, aggregates were collected from wells and washed with portions of 50 mM NaCl, pH 1.9, solution in order to remove excess DMSO/denaturants prior to circular dichroism (CD), atomic force microscopy (AFM), and FT-IR (Fourier transform infrared) spectroscopic measurements.
AFM. A small portion of aggregate suspension collected from a plate well was initially washed several times with acidified 50 mM NaCl aqueous solution in order to remove excess DMSO. Subsequently, the suspension of fibrils was diluted with salt-free acidified water (pH 1.9) until the peptide concentration reached 0.1 mg/mL. A small droplet (10 μL) of fibril suspension was swiftly deposited onto freshly cleaved mica and left to dry overnight. AFM tapping-mode measurements were carried out with a Nanoscope III atomic force microscope from Veeco Instruments (Plainview, NY, USA) and TAP300-Al sensors (resonance frequency 300 kHz) from BudgetSensors (Sofia, Bulgaria).
ATR (Attenuated Total Reflectance) FT-IR and Transmission FT-IR Measurements. Centrifuged samples of aggregates collected from the plate reader were washed several times with portions of 50 mM NaCl in H 2 O, pH 1.9, in order to remove DMSO. Subsequently, aqueous suspensions of fibrils were deposited and allowed to dry up on the diamond surface of a single-reflection diamond ATR accessory of a Nicolet iS50 FT-IR spectrometer from Thermo Fisher Scientific (Waltham, MA, USA) equipped with a DTGS detector. Typically, for a single ATR FT-IR spectrum, 32 interferograms of 2 cm −1 resolution were coadded. Due to ambiguity in determining the real values of the refractive indexes of amyloid aggregates, uncorrected ATR FT-IR data  ) followed by addition of 1/9 vol of D 2 O, pD 1.64, and 2 h incubation at 37°C prior to spectral measurements. Langmuir pubs.acs.org/Langmuir Article is shown. Spectral data processing was limited to subtracting the water vapor spectrum and adjusting the two-point baseline with the use of GRAMS software (Thermo Fisher Scientific). Transmission FT-IR spectra shown in Figure 3B were collected on the same Nicolet iS50 FT-IR spectrometer except that a CaF 2 transmission cell with a 0.025 mm Teflon spacer was used instead of the ATR accessory. During measurements the cell's temperature was maintained at 37°C with a dedicated Peltier accessory while the sample chamber was continuously purged with dry CO 2 -depleted air. Typically, for a single spectrum, 16 interferograms of 2 cm −1 resolution were coadded. All spectra were corrected by subtracting the proper amount of buffer (DMSO−D 2 O solution) and water vapor spectra prior to being baseline-adjusted (flat two-point baseline). As neither DMSO nor D 2 O absorbs appreciably in the amide I band's infrared region, the subtraction of the solvent spectra multiplied by a factor of 1 gave typically acceptable results. The subtraction factor for the water vapor spectrum was individually selected in order to minimize the rotational−vibrational bands of water vapor overlapping the amide I band region. Data processing was performed with GRAMS software (Thermo Fisher Scientific).
CD Measurements. Aqueous suspensions of aggregates collected from the plate at the end of kinetic experiments were washed several times with 50 mM NaCl, pH 1.9, solution in order to remove residual DMSO, which strongly absorbs UV light. Thus, treated aggregates were suspended in acidified (pH 1.9) water to the final concentration of 0.04 mg/mL and were swiftly transferred to a quartz cuvette with a 10-mm-long optical pathway. Far-UV CD spectra corrected for the buffer signal were acquired at room temperature by the accumulation of five independent spectra (at 200 nm/s scanning rate) on a J-815 S spectropolarimeter from Jasco Corp. (Tokyo, Japan).

■ RESULTS AND DISCUSSION
De Novo Aggregation. We have selected a pair of synthetic single-chain analogues of insulin's H-fragment in which either the N-terminal section of B-chain was tethered to the C-terminal section of A-chain ("A−B" peptide) or vice versa ("B−A" peptide; see Figure 1A) to study the impact of alternative sequence arrangements on the fragment's amyloidogenicity. Residues Cys7 in both chains were replaced with alanine, while the intrachain disulfide bond within the A-part which is essential to H-fragment's amyloidogenicity was retained. Rather unsurprisingly, when removed from an insulin monomer, the H-fragment remains in an unfolded conformation which has been confirmed earlier. 31,32 In the absence of a folded globular state, sliding window methods such as TANGO 34−36 or Waltz 21 may be used for to predict amyloidogenic tendencies of amino acid sequences. 10 We used both these tools to assess the amyloidogenicity of separated and merged (as A−B and B−A peptides) N-terminal sections of insulin at low pH, i.e., under conditions favoring aggregation of both insulin and H-fragment ( Figure 1B). As an inherent limitation of these methods, the impact of the intrachain disulfide bond is not taken into account. The TANGO and Waltz scores calculated for separate A-and Bparts and B−A peptide are negligible, whereas for A−B peptide heightened scores are observed for the middle section encompassing C-terminal residues of A and N-terminal residues of B-part (TANGO) and for the whole sequence The two algorithms are suited specifically for prediction of strictly amyloidogenic (Waltz) or dehydration/aggregationprone (TANGO) regions; both tools are consistent in suggesting that swapping the sequential arrangement of Aand B-parts would have a dramatic impact on the amyloidogenic properties of the single-chain peptides. We have also employed FoldAmyloid 37 as a complementary bioinformatic tool dedicated to detection of amyloid-forming sequences. The results shown in Figure S1 are in line with the TANGO scores. Low-pH fibrillization kinetics of individual synthetic peptides corresponding to H-fragment's A-part (ACC) and B-part (B-7A, and B-SH depending on the substitution state of residue 7, see Figure 2A) were compared with the use of a ThT-fluorescence assay. The de novo aggregation of ACC peptide followed a steep upward trajectory preceded by a very short lag phase. The process was as fast as in the case of intact H-fragment at the same concentration ( Figure 2B), which contrasted sharply with the behavior of B peptide while contradicting the predictions based on TANGO and Waltz ( Figure 1B).
While commercial freeze-dried samples of H-fragment and ACC peptide could be solubilized following the previously established GdnHCl-based protocol, 17 stock A−B and B−A peptides were strongly agglomerated and resistant to this approach. Interestingly, the ATR FT-IR spectra of crude commercial A−B, B−A, and H-fragment exhibit the main spectral component of the amide I band in the range 1623− 1631 cm −1 indicative of the β-sheet conformation ( Figure 3A) and often observed in the infrared spectra of amyloid fibrils. 38 Furthermore, the absence of a minor exciton-split ([v(0,π)]) band above 1680 cm −1 suggests a parallel alignment of βstrands within the aggregate, and this again is in accordance with previous numerous studies on amyloid fibrils (e.g., refs 38 and 39). These observations indicate that all three peptides formed amyloid-like aggregates already during the final stages of synthesis and purification. The splitting of the main amide I component observed for B−A sample could be interpreted as the presence of two populations of β-sheet varying in the strength of hydrogen bonds (the lower the wavenumber the stronger the bond). The minor bands at 1660 and 1731 cm −1 could be assigned to turns and protonated carboxyl groups, respectively. 31,38,39 Moreover, an increased ThT fluorescence was detected when the fluorophore was added to aqueous suspensions of crude peptide powders (data not shown). Thus, it became essential to establish an effective protocol of A−B/ B−A peptide solubilization which would allow us to observe bona f ide amyloidogenic de novo reassociation of peptide monomers under controlled conditions. This was achieved through the application of the two-step procedure described under Materials and Methods consisting chiefly of prolonged incubation of raw peptide material in neat DMSO and in acidified DMSO−water mixtures. All three peptides dissolved in DMSO within a few hours. Solvent-subtracted transmission FT-IR spectra shown in Figure 3B indicate that the DMSOdissolved peptides undergo a β → disorder conformational transition: the broad amide I band at ca. 1665 cm −1 is characteristic of various DMSO-denatured proteins 40 including amyloid fibrils. 41,42 The high frequency shift of the band arises from free amide carbonyl groups which are outcompeted from hydrogen bonding to the main chain's N−H groups by the stronger hydrogen-bond acceptor (DMSO). The minor sharp component at 1692 cm −1 is likely to originate from residual βturns; a similar observation was made by Shanmugam and Polavarapu for Aβ 25−35 peptide dissolved in DMSO. 43 The application of DMSO as a solvent for crude peptides was advantageous due to its effectiveness in disrupting aggregated β-sheets (a necessary condition to study fibrillization of peptide under strictly de novo conditions), compatibility with in situ FT-IR measurements in the amide I band region, and, finally, the fact that while concentrated DMSO dissolves amyloid fibrils, diluted solvent does not have the same effect. 41,42 Hence, the reassociation process could possibly be initiated by a rapid mixing of a portion of peptide solution in concentrated DMSO with an excess of aqueous solution. Indeed, this was confirmed in follow-up experiments: kinetic trajectories of de novo fibrillization of A−B, B−A, and H peptides triggered by dilution of DMSO-dissolved peptide samples with acidified aqueous solution of NaCl are shown in Figure 4A.
The explosive rates of aggregation of A−B and B−A peptides are striking: no lag phases are detectable as the selfassociation begins already within the dead time of mixing (approximately 1 min) This contrasts with the case of Hfragment, whose aggregation is noticeably decelerated in terms of both the lag phase (approximately 80 min) and the less steep elongation phase. Importantly, unlike crude A−B and B− A powders, solid samples of H-fragment may be solubilized without DMSO 17, 32 and under such conditions the lag phase is negligible ( Figure 4A). Hence the observed deceleration of Hfragment's fibrillization should be attributed to the influence of residual DMSO. At pH 1.9 monomers of A−B and B−A are expected to bear the same positive electric charge (+2) whereas the net charge of H-fragment monomers could be larger (+3) due to additional N-termini. Thus, the Debye screening of repulsive electrostatic interactions by dissolved ions (NaCl) could impact differently aggregation of the A−B/ B−A pair and H-fragment. To verify this possibility, we have carried out reassociation of the three peptides also in the absence of NaCl ( Figure 4B). At the low ionic strength conditions, fibrillization of A−B and B−A remained very fast although the slope of the elongation phase was less dramatic; surprisingly no self-association of H-fragment was observed during the 24-h-long experiment. At the end of the kinetic experiments conducted in the presence of NaCl, samples of aggregated peptides were collected and, after removal of DMSO, were subjected to CD and infrared spectra measurements (Figure 4, parts C and 4D, respectively). The far-UV CD spectra feature single broad minima approximately at 220 nm (A−B) and ∼224 nm (B−A, H), indicative of aggregated β-sheets (the shift from the canonical 216 nm minimum typically observed for globular β-proteins is often observed for amyloid fibrils 32,44 ). We note that the spectrum of B−A aggregate is flattened, which has been attributed to stronger light scattering on this aggregate. In unison with the CD data, the frequency range (1623−1632 cm −1 ) of the infrared amide I band spectra shown in Figure 4D points to the β-sheet conformation as the dominating secondary component of aggregates. Furthermore, the absence of a high frequency (above 1680 cm −1 ) exciton-split band suggests a parallel arrangement of β-strands and this is in agreement with previous studies on insulin fibrils. 38,39,42 Pairwise comparison of infrared spectra collected for peptides before dissolution in DMSO ( Figure 3A), and after amyloidogenic reassociation ( Figure 4D), reveals a high degree of similarityalso in Langmuir pubs.acs.org/Langmuir Article reproducing fine spectral details such splitting of the amide I band observed for exclusively for B−A aggregate. This also confirms that all three peptides were already in an amyloid-like state at the stage of freeze-drying that followed after the synthesis and purification. However, in each case, the reassociation appears to lead to a higher structural homogeneity as compared with crude peptide samples which is reflected by the narrowing of amide I band's bandwidths (inset in Figure 4D). The sample conditions selected in this study include lowered pH, as is the case of many earlier studies on insulin aggregation (e.g., refs 33 and 39). The initial data on the H-fragment were also collected at low pH since the peptide was released in situ during digestion of insulin with pepsin active under such conditions. Hence the herein selected pH conditions facilitate comparisons with the previously collected data. However, it must be stressed that all H, A−B, and B−A peptides maintain the highly amyloidogenic properties at closeto-neutral pH (see Figure S2). Morphologies of aggregates collected at the end of the kinetic experiment are shown in Figure 5. Representative amplitude AFM images presented in the top row confirm the fibrillar nature of these assemblies. Clear differences in appearance of all three aggregates are immediately noticeable: H-fragment amyloid tends to form a straight singly dispersed specimen with a limited tendency to associate laterally into higher order structures. This contrasts with the strongly agglomerated short and straight A−B aggregates. On the other hand, B−A fibrils are long and slightly bent. On the basis of the corresponding height AFM data, we were able to estimate the thickness and periodicity of representative individual fibrils. To this end, cross sections of the selected specimen were analyzed in the directions perpendicular and parallel to the fibril axis. Individual fibrils built of self-associated H-fragment monomers are thinnest with a diameter of approximately 1.5 nm, whereas B−A fibrils are slightly thicker (1.8−2.2 nm).
Accurate height measurements of individual A−B fibrils were hampered by the pronounced self-agglomeration. Among thicker superstructures (>3 nm in diameter) there were thinner (∼1.8 nm) forms. Amyloid structures often differ in the periodic twisting pitch, which reflects various ways individual protofilaments are organized into higher order fibrils. The pitch may be easily estimated through a height AFM Langmuir pubs.acs.org/Langmuir Article measurement along the fibrils' longer axis. 45 The periodic gaps between height maxima are typically 16/20 nm measured alongside H-fragment fibrils and 12/16 nm for A−B and B−A fibrils (for the sake of clarity only selected cross sections are shown in Figure 5). All in all, the AFM data indicates that all three peptides form amyloid fibrils with distinct morphological features.
Coaggregation and Cross-Seeding. The data presented so far shows unambiguously that the reorganization of the Hfragment's primary structure into A−B and B−A single-chain analogues results in even more extreme amyloidogenicity, implying a high degree of versatility of the amyloid stretch segment encompassing the disulfide-bonded N-terminal fragment of insulin's A-chain. From the distinct fingerprint features visible in the conformation-sensitive amide I band region of A−B and B−A aggregates, and the noticeable morphological differences, it is clear that the two peptides follow different selfassembly pathways. As the interactions between A-chain's stretch segments appear to be primarily responsible for driving the self-assembly toward parallel β-sheet architectures, we became intrigued as to whether some degree of binding promiscuity between such segments built into A−B and B−A monomers could be observed. A cross-seeding experiment (e.g., seeding of dissolved A−B monomers with preformed B− A amyloid fibrils) is unfeasible due to the explosive kinetics of spontaneous aggregation of either peptide (at a reasonable monomer:seed ratio the aggregation through de novo pathway would outcompete or significantly overlap the seed-dependent pathway). Instead, we have chosen to study coaggregation of A−B and B−A induced by a rapid dilution of mixed DMSO solutions of both peptides with an excess of acidified solution of NaCl in water. The rationale of this approach is that any specific binding interactions between two coaggregating amyloidogenic substrates are expected to impact the kinetic trajectory of the aggregation and possibly result in the formation of new structural variants of aggregates. 46−48 However, the data presented in Figure 6 suggests that no actual coaggregation takes place in mixed A−B and B−A solutions: the kinetic trajectories of fibrillization share the same steep appearance irrespective of the mixing stoichiometry.
In unison with this observation, the infrared spectra of fibrils collected afterward lack any evidence that aggregation in mixed A−B/B−A solutions would lead to the emergence of new conformational variants of fibrils. For the series of spectra shown in Figure 6B, the contour of the amide I band changes gradually with the increasing A−B:B−A ratio, and the spectra obtained at 1:9 and 9:1 ratios are very similar to those corresponding to fibrils formed by neat peptides. Hence there is also no evidence of molecular imprinting or conformational memory effect. The fact that A−B and B−A do not coaggregate is consistent with a rather limited conformational space that is accessible to amyloidogenic self-assembly of either peptide. Hypothetically, an in-register parallel β-sheet structure could accommodate the peculiar bent shape of the main chain in the region restricted by the disulfide bridge (Cys6−Cys11 in A−B, Cys17−Cys22 in B−A) allowing for saturation of interstrand hydrogen bonds. Arguably, enforcing such an inregister arrangement at an A−B/B−A interface would leave the B-chain segment frayed and therefore thermodynamically frustrated. A shared amyloidogenic segment is an insufficient condition for effective cross-seeding between two peptides, although it is often the case that a peptide fragment of a larger amyloidogenic precursor can affect (accelerate, or disfavor) aggregation of the latter one (e.g., refs 49 and 50). Various scenarios of cross-seeding between full-length Aβ 1−40 and its Aβ 11−40 fragment have been studied by Hao et al. 51 Devlin et al. have shown that the lag phase of insulin aggregation is shortened in the presence of fibrils preformed from separated A-and B-chains of the protein. 52 A shortened lag phase may imply that amyloid growth is catalyzed through secondary nucleation rather than the direct elongation pathway (a scenario far less demanding in terms of sequence compatibility between the dissolved protein and the seed). We have examined catalytic effects of preformed A−B and B−A amyloid seeds on insulin aggregation (Figure 7).  Only controlled homologous seeding with preformed insulin amyloid results in the complete disappearance of the lag phase. Fibrils of A−B and H-fragment shorten the lag phase (implying possible secondary nucleation of insulin amyloid on side surfaces of seeds), whereas aggregation trajectories in the presence of B−A seeds are very similar to those observed for the nonseeded process. Infrared spectra of daughter fibrils are very similar to those of the de novo formed fibrils. Despite a minor red shift of the amide I band detectable for samples grown in the presence of all nonhomologous seeds, there is no evidence of conformational imprinting (the corresponding far-UV CD spectra are shown in Figure S3). The fact that fibrils of A−B (and of H-fragment) accelerate insulin aggregation, presumably via a secondary nucleation pathway, to a degree that B−A amyloid is incapable of may not necessarily implicate any well-defined molecular recognition between folded insulin and such seeds. For example, solvent-exposed hydrophobic patches alongside A−B fibrils could act a platform for unfolding of insulin monomers and formation of amyloid nuclei. It is unclear at this stage how significant in this context is the N-terminal placement of the GIVEQ sequence in both A−B and H-fragment (as opposed to B−A).
The concept of amyloid stretch is an attractive paradigm explaining how strong selective binding interactions involving relatively small portions of primary structure may drive aggregation of a much larger parent protein which itself may be far less prone to aggregation. In the case of insulin, various regions of its covalent structure have been implicated as the key amyloidogenic sites. Due to the presence of three disulfide bonds, the protein has a complex topology which itself is likely to contribute to and modify the observed amyloidogenicity. However, many of the studies dissecting insulin's amyloidogenic regions focused on short fragments lacking disulfide bonds (e.g., free A-and B-chains of insulin had either all cysteine residues oxidized to cysteic acid 52 or in the fully reduced form 53 ). Ivanova et al. screened amyloidogenic regions of human insulin by analyzing fibrillization of short mostly hexameric fragments, all of which lacked cysteine residues. 54 The extensive studies carried out by the same group implicated the segments LYQLEN of A-chain (residues 13−18) and LVEALYL of B-chain (residues 11−17) 49,55 as possibly involved in the formation of steric zippers within insulin amyloid fibrils. The earlier serendipitous discovery of the Hfragment 31 which preserves two of insulin's three disulfide bonds pointed to the N-terminal fragment of A-chain as a candidate for an amyloid stretch as long as the intrachain Cys6−Cys11 bridge remains intact. 17 The results presented here depict the disulfide-bonded N-terminal fragment of insulin's A-chain as a powerful and versatile amyloid stretch capable of forcing the aggregation-resistant B-chain's Nterminal fragment to become a part of fibrillar β-architecture. Our findings resonate with the earlier work by Yang et al. 56 pointing to a possible role of fluctuations in the N-terminal part of A-chain in insulin's amyloidogenesis. The authors hypothesized that a transient detachment of this segment could expose "a hydrophobic surface formed by native-like packing of the remaining α-helices". 56 In light of our work, this segment itself may constitute a hydrophobic patch responsible for selfassociation of insulin at low pH. The single-chain rearrangements employed in this study appear to only strengthen the amyloidogenic tendencies of H-fragment: parallel β-sheet structure is preserved as the main conformational component of fibrils. It is interesting to note that the very powerful tendency to aggregate both A−B and B−A peptides eludes the TANGO and Waltz algorithms ( Figure 1B). On the other hand, in their extensive screening study of various amyloidogenic regions in insulin, Surin et al. 57 noted an increased amyloidogenic potential within the A2−A12 region according to the data obtained by application of PASTA 2.0 58 and ZipperDB algorithms. 29,59 The distinct infrared features within the amide I band, various morphologies observed in AFM images, and no coaggregation of A−B and B−A strongly suggest that the self-assembly pathways of both peptides do not overlap. In light of these results, it seems likely that intermolecular interactions between N-terminal parts of A-chain contribute significantly to the thermodynamics of insulin fibrillization. This is significant in the context of desirable modulation of this process. The pronounced versatility of this newly identified amyloid stretch region could be employed as a tool for orderly organization of various chemical groups. More research is needed to illuminate these prospects.  The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.langmuir.0c01747.
Predictions of amyloidogenic tendencies of A−B and B− A based on FoldAmyloid; aggregation of H, A−B, and B−A at a close-to-neutral pH, kinetic data and IR spectra of fibrils; CD spectra of seeded amyloid fibrils (PDF)