In Search of a Universal Method: A Comparative Survey of Bottom-Up Proteomics Sample Preparation Methods

Robust, efficient, and reproducible protein extraction and sample processing is a key step for bottom-up proteomics analyses. While many sample preparation protocols for mass spectrometry have been described, selecting an appropriate method remains challenging since some protein classes may require specialized solubilization, precipitation, and digestion procedures. Here, we present a comprehensive comparison of the 16 most widely used sample preparation methods, covering in-solution digests, device-based methods, and commercially available kits. We find a remarkably good performance of the majority of the protocols with high reproducibility, little method dependency, and low levels of artifact formation. However, we revealed method-dependent differences in the recovery of specific protein features, which we summarized in a descriptive guide matrix. Our work thereby provides a solid basis for the selection of MS sample preparation strategies for a given proteomics project.


■ INTRODUCTION
State-of-the-art mass-spectrometry-based proteomics workflows are sophisticated multistep processes, combining different methodologies and instrumentation. Clearly, the data quality of an experiment depends on the characteristics and limitations of each step, with errors or biases propagating from the first step throughout the whole experiment. For this reason, the sample preparation protocol is a key determinant in defining what proportion of the proteome is available for the ensuing analysis. Moreover, the robustness and reproducibility of this step will define the degree of data variation and potential systematic bias. Ideally, the universal sample preparation protocol would efficiently and robustly isolate all proteins of any given sample to near completeness. In reality, such comprehensive isolation is very challenging as proteins constitute a heterogeneous group of macromolecules in terms of physicochemical properties and subcellular localization. In addition, other sample characteristics, such as rigid cell walls and tissues that are difficult to lyse or interfering cellular components (e.g., nucleic acids, metabolites, etc.), can greatly affect isolation efficacy and analysis and need to be addressed.
To solve these problems, different sample preparation methods have been developed that can be divided into insolution digestion methods and methods using additional devices such as filters or beads for protein immobilization or purification, or both. Classical in-solution digestion (ISD) protocols essentially differ in the choice of buffer systems, which are either based on chaotropic denaturants, such as urea or guanidine hydrochloride (GnHCl), or surfactants, such as ionic detergent sodium-dodecyl-sulfate (SDS) or bile salt sodium deoxycholate (SDC), as they effectively solubilize and denature proteins. 1−3 Recently, a novel ISD strategy, Sample Preparation by Easy Extraction and Digestion (SPEED), has been published 4 that uses neither detergents nor chaotropic agents for protein extraction but solely relies on dissolving proteins in trifluoroacetic acid (TFA). Further, ISD protocols often require protein precipitation using either acetone, 5−7 alcohols such as ethanol, 8 or chloroform/methanol 9 to avoid carry-over of nonprotein components that might interfere with downstream processing or analysis.
Device-based approaches (hereafter referred to as "cleanup methods") aim to remove interfering substances before digestion in "reactors" or on beads. For example, Filter-Aided Sample Preparation (FASP), 10 utilizing molecular weight cutoff (MWCO) membranes, and suspension trapping (S-Trap), 11 applying three-dimensional porous quartz filter materials, capture proteins on filters enabling detergent removal, protein digestion, and peptide recovery. Single-pot, solid-phase-enhanced sample preparation (SP3) 12 (and also SP4), 13 uses on-bead-based purification and digestion of proteins in a single tube, exploiting the property of denatured proteins to be nonspecifically immobilized on microparticles by protein aggregation. 14 Finally, the original in-StageTip (iST) method utilizes C18 discs prepared in pipette tips or cartridges to trap proteins for digestion and subsequently to desalt the peptides. 15 Based on these and similar methodical concepts, commercially available MS sample preparation kits in different formats have been developed for iST (PreOmics), S-Trap (ProtiFi), and in-solution digests coupled to peptide cleanup columns (EasyPep, Thermo Scientific).
Overall, this almost overwhelming number of protocols and variants with their apparent advantages and disadvantages make the selection of a suitable method for a given project difficult. Although previous studies compared selected sets of protocols, often focusing on particular aspects or on presenting a new method, a comparison including the most commonly used in-solution, device-based, and commercial methods had yet to be conducted. 1,3,4,14,16−20 It is also debatable whether there is a truly universal method that exhibits no or negligible extraction bias, as has been proposed for some protocols, 4,19 and that is applicable to all types of samples. Proving universality is an almost futile task, as it would require the comparison of a set of methods for a virtually endless list of cell types, tissues, body fluids, and organisms. However, Glatter et al. 1 and Doellinger et al. 4 convincingly demonstrated for a selection of ISD protocols and device-based protocols that there are organism-and buffer-specific differences in extraction efficiency when comparing samples of different bacterial and human cell lines. From these studies, it can be expected that such differences will further increase when comparing even more diverse sets of sample types, e.g., mammalian tissues, plants, or fungi. In contrast, investigating differences in proteome composition for a given set of methods in a defined sample type is more feasible and allows to answer whether and how protocols differ in their extraction properties for the given sample type. In combination with more practical considerations, like processing time, ease of use, and consumable costs, this could help in making a more informed decision for a particular sample preparation strategy and serve as a blueprint for similar studies in other sample types.
Here, we prepared proteomes from HeLa cells applying classical ISD protocols based on urea-, GnHCl-, and SDCbased buffer systems as well as SPEED, 4 FASP, 10 S-Trap 11,21 (ProtiFi), and SP3 12 protocols and two commercial kits: iST 15 (PreOmics) and EasyPep (Thermo Scientific). We therefore present a comprehensive quantitative and qualitative comparison of 16 of the most widely used MS sample preparation methods. Our experimental design maximizes reproducibility and comparability and allows for unbiased statistical analyses to extract differences between the methods. The individual methods show a similar proteome extraction efficacy and coverage based on identified proteins and peptides. Methodinduced peptide artifacts seem to be negligible. However, an exploratory analysis based on k-means clustering revealed qualitative differences in extracted proteomes, which we mapped to features derived from protein databases. The results were summarized into a descriptive guide matrix that highlights specific enrichment of protein features such as structure, abundance, and localization for individual methods. Consequently, our study provides a solid comparison of the currently most widely used sample preparation protocols in proteomics and can be used as an aid in selecting MS sample preparation strategies.

In-Solution Protocols
In-Solution Digests. HeLa cells (2.0E6 cells) were dissolved in 100 μL of denaturation buffer, 0.1 M Tris−HCl, pH 8.6, containing either 8 M urea (U), 6 M guanidine HCl (GnHCl), or 1% sodium deoxycholate (SDC), incubated for 10 min at room temperature (U) or at 60°C (GnHCl, SDC) in a ThermoMixer (Eppendorf), and subsequently disrupted by 2 × 20″ high-intensity sonication cycles at 4°C in a BioRuptor (Diagenode). Protein concentration was determined using the Micro BCA protein assay kit according to the manufacturer's instructions (Thermo Scientific). Each sample was split into two aliquots of 100 μg protein and one additional aliquot of 50 μg. Protein fractions of the two 100 μg aliquots were precipitated using acetone or chloroform− methanol, respectively. Only samples containing GnHCl were precipitated with ethanol instead of acetone since GnHCl is not soluble in the latter. Protein pellets were dissolved in their respective denaturation buffer, and protein concentration was determined as described above. Soluble proteins were reduced using 10 mM dithiothreitol (DTT) for 1 h at 37°C (U) or 60°C (SDC, GnHCl) and alkylated for 30 min using 20 mM iodoacetamide (IAA) in the dark. Chaotropic lysis buffers were then diluted to a final concentration of 1 M (urea) and 0.5 M (GnHCl). Proteins were digested overnight at 37°C using trypsin (Trypsin Gold, Promega) in a 1:30 (w/w) enzyme-toprotein ratio. Digests were stopped by adding 10% trifluoroacetic acid (TFA) to a final concentration of 1%. SDC precipitates were removed by centrifugation (14,000g, 1 min, room temperature (RT)). About 10 μg of resulting peptide samples was desalted on C18 StageTips (triple-plugs) 22 and eluted with 80% acetonitrile (ACN) and 0.1% trifluoroacetic acid (TFA). After removal of elution buffer by vacuum centrifugation, samples were resuspended in 0.1% TFA, 2% ACN.
Sample Preparation by Easy Extraction and Digestion (SPEED). 4 A total of 2.0E6 HeLa cells were resuspended in trifluoroacetic acid (TFA) (Merck) in a sample-to-TFA ratio of 1:4 (v/v), incubated at room temperature for 5 min, and neutralized with 2 M Tris base using 8× volume of TFA used for lysis. Reduction and alkylation of aliquots of 50 μg of protein were achieved by incubation in 10 mM tris(2carboxyethyl)phosphine (TCEP) and 40 mM 2-chloroacetamide (CAA) at 95°C for 5 min. Samples were diluted with ddH 2 O 1:5, and proteins were digested for 20 h at 37°C using trypsin (Trypsin Gold, Promega) at an enzyme/protein ratio of 1:50, as suggested in the original protocol. The digestion was stopped using 2% TFA (final concentration), and peptides were desalted on C18 StageTips and eluted with 80% acetonitrile (ACN) and 0.1% trifluoroacetic acid (TFA). Dried samples were resuspended in 0.1% TFA, 2% ACN.

Device-Based or Cleanup Protocols
Filter-Aided Sample Preparation (FASP). 10 A total of 2.0E6 HeLa cells were resuspended in SDT lysis buffer (4% SDS, 100 mM Tris−HCl, 100 mM DTT, pH 7.6) in a 1:10 (v/ v) sample/buffer ratio, incubated at 95°C for 5 min, and sonicated at 4°C for two cycles of 20 s at a high-intensity level using a BioRuptor (Diagenode). Samples were clarified by centrifugation at 16,000g for 15 min, at 24°C. Aliquots of 50 μg of protein were diluted in urea buffer UA (8 M urea, 0.1 M Tris−HCl, pH 8.5) to a final concentration of 0.5% SDS. Protein extracts were further processed in Microcon 30 kDa Centrifugal Filter Units (Merck) in a tempered centrifuge at 24°C . Samples were added to the filter unit, washed with UA buffer, centrifuged for 15 min at 14,000g, and incubated with 50 mM IAA for 20 min at room temperature (in the dark). SDS was exchanged by four consecutive washes with UA buffer (centrifugation: 15 min at 14,000g) and a single wash with 50 mM ammonium bicarbonate (ABC) followed by centrifugation for 5−10 min at 14,000g. Proteins were digested using trypsin (Trypsin Gold, Promega) in a 1:50 protein-to-enzyme ratio and incubated for 18 h at 37°C on a thermoshaker at 600 rpm. The resulting peptides were recovered by centrifugation at 14,000g for 5 min, followed by elution with 50 μL of 50 mM ABC and repeated centrifugation. Combined eluates were acidified using TFA at a final concentration of 1%.
In-StageTip Sample Preparation (iST). HeLa cell extracts were prepared using the iST 96x sample kit according to the manufacturer's instructions (PreOmics). In short, 2.0E6 cells were lysed by resuspension in a lysis buffer solution at a target protein concentration of 1 mg/mL and heated to 95°C for 10 min shaking (1000 rpm) followed by two cycles of 20 s of sonication in a BioRuptor (Diagenode). Aliquots containing 50 μg of protein were transferred into a cartridge and cooled. The digestion solution was added, and proteins were digested for 3 h at 37°C. Digestion was stopped by adding the "Stop" solution, and peptide purification was achieved by centrifugation for 3 min at 2250g, followed by three rounds of washing and elution into the collection plate using the provided solutions. Peptides were transferred to PCR tubes, dried in a vacuum centrifuge, and resuspended in 0.1% TFA, 2% ACN for MS analysis.
EasyPep. HeLa cell extracts were prepared using the EasyPep Mini MS Sample Prep Kit (Thermo Fisher Scientific) according to the manufacturer's instructions. Briefly, 2.0E6 cells were lysed with lysis buffer aiming for a protein concentration of 1 mg/mL, and aliquots containing 50 μg of protein were treated with Universal nuclease by ten cycles of pipetting up and down until the viscosity was reduced. Reduction and alkylation were achieved by addition of the respective solutions and incubation of samples at 95°C for 10 min. Once samples were cooled down, the trypsin/Lys-C protease mixture was added and samples were digested for 3 h at 37°C. Tryptic digestion was stopped using the "Digestion Stop Solution". Peptide Cleanup columns were cleared from the liquid by centrifugation and placed onto 2 mL microcentrifuge tubes. Sample mixtures were transferred into dry Peptide Cleanup columns. Two rounds of consecutive centrifugation and washing steps were performed. The columns were transferred to 2 mL microcentrifuge tubes, and peptides were eluted by addition of the elution solution and centrifugation at 1500g for 2 min. Samples were dried using a vacuum centrifuge and resuspended in 0.1% TFA, 2% ACN for MS analysis.
Suspension Trapping (S-Trap). 11 A total of 2.0E6 HeLa cells were resuspended in lysis buffer LB (10% SDS (w/v), 0.1 M Tris−H 3 PO 4 , pH 7.55). Cells were disrupted by sonication (two cycles of 20 s at 4°C) in a BioRuptor (Diagenode), and extracts were cleared by centrifugation at 15,000g for 1 min at 4°C. Aliquots of 50 μg of protein were reduced by incubation with 20 mM (final concentration) DTT at 95°C for 10 min and subsequently alkylated by addition of 40 mM (final concentration) IAA and incubation for 30 min in the dark at room temperature. Samples were acidified with 1.2% (final concentration) phosphoric acid, mixed with a 6 × volume of S-Trap binding buffer (90% MeOH in 0.1 M Tris−H 3 PO 4 , pH 7.1), and loaded onto S-Trap columns that were placed in low binding tubes (Axygen). The solvent was removed by centrifugation (4000g), and proteins were washed three times with 150 μL of S-Trap binding buffer, subsequently digested by addition of digestion buffer (500 mM ABC) containing 1:25 (w/w) trypsin (Trypsin Gold, Promega), and incubated at 37°C for 3 h. Peptides were eluted in three consecutive steps by addition of 40 μL of 50 mM ABC, 40 μL of 0.2% FA, and 35 μL of 50% ACN, 0.2% FA followed by centrifugation at 4000g, respectively. Eluates were pooled and concentrated in a SpeedVac (Thermo Fisher Scientific). Peptides were resolved in 0.1% TFA, 2% ACN. Aliquots of 10 μg of peptides were desalted on C18 StageTips (tripleplugs), 22 dried in a SpeedVac, and resuspended in 0.1% TFA, 2% ACN.
Single-Pot, Solid-Phase-Enhanced Sample Preparation (SP3). 12 HeLa cells (2.0E6) were resolved in reconstitution buffer (RB) 12 or 1% SDC to a final protein concentration of 1 mg/mL and subsequently lysed, reduced (DTT 5 mM contained in RB), and alkylated using IAA (25 mM final concentration). For protein cleanup and digestion, samples of 50 μg of protein were first mixed with SP3 beads in a 10:1 (w/w) beads-to-protein ratio. The mixture was then homogenized by adding 1 × volume of 100% EtOH and incubated for 5 min at 24°C shaking at 1000 rpm to induce protein binding to the beads. Proteins bound to beads were washed 4 x with 80% EtOH on a magnetic rack. On-bead digestion was achieved using trypsin (Trypsin Gold, Promega), added in a 1:30 (w/w) enzyme-to-protein ratio, and 20 h incubation at 37°C in a thermal shaker (1000 rpm). After digestion, beads were pelleted by centrifugation (20.000g, 1 min, 24°C) and supernatants containing peptides were transferred.

Experimental Design and Quality Control
To enable statistical analysis, we prepared three replicates of equal peptide concentration of each sample preparation method and applied several quality control steps that are summarized in detail below.
Type of Replicates. Starting from a commonly cultured pool of HeLa cells, three independent replicates were prepared for each sample preparation method. These replicates were defined as technical replicates.
Determination of Protein Concentration (of ISD Samples). The protein concentration after cell lysis and after protein precipitation was determined using the Micro BCA Protein assay kit (Thermo Scientific) according to the manufacturer's guidelines.
Determination of Peptide Concentration. An estimate of 250 ng of peptide per sample was mixed in 0.1% TFA, 2% ACN. Peptide concentrations were determined and adjusted according to UV chromatograms obtained at 214 nm on an UltiMate 3000 RSLC nano-HPLC System (Thermo Scientific), equipped with a monolithic column (PepSwift Monolithic RSLC, Thermo Scientific). To adjust the peptide concentration for MS measurements, peaks were integrated using chromatography software Chromeleon (Thermo Scientific) and peak areas were compared to in-house peptide standards of known concentrations.
Equal Loading of Samples. All samples were adjusted to an estimated concentration of 100 ng/μL. The indexed retention time standard (iRT, Biognosys) was added to all samples to a final concentration of 0.1 injection equivalents (IE)/μL, allowing continuous monitoring of LC−MS/MS performance. Five microliters of each sample corresponding to 500 ng of peptide with 0.5 IE were subjected to MS analysis. Equal loading of samples was confirmed by checking total summed peptide intensities.
Organization of Batches. Samples were organized into six batches. Batches 1−3 covered ISD protocols (including SPEED), with one replicate of each method per batch. Batches 4−6 were equally organized but with cleanup methods. Samples were measured in a randomized order, and all measurements were separated by wash runs. Before and after each batch, 25 ng of HeLa standard (Pierce) was injected to control system performance. Batches 1−3 and batches 4−6 were run in a single sequence.
Postacquisition QC. The quality of LC−MS runs was continuously monitored by checking the iRT signals in Skyline v20.1. 23 The number of missed cleavages and other metrics of quality control were determined using PTXQC. 24 Bridging of Batches. To account for changes in machine performance between batch sequences 1−3 and 4−6, three replicates of each group of batches (SDC-A and EasyPep, respectively) were remeasured in a single sequence of MS measurements. The differences in the number of IDs between these groups were ∼1%; nevertheless, the number of IDs of all original sample measurements was readjusted by the relative median change factor of the bridge samples. In short, the relative_median change_factor between the two groups was determined as [median ("SDC-A"_bridgesamples) − median ("EasyPep"_bridgesamples)]/median ("EasyPep"_bridgesamples). The corrected SDC-A median was calculated as [group_median ("EasyPep"_samples) + group_median ("EasyPep"_samples) * relative_median change_factor]. ISD groups were adjusted to the corrected SDC-A group median by their relative change to the original SDC-A group median.

MS Methods
LC−MS/MS analysis was performed on an UltiMate 3000 RSLC nano-HPLC System (Thermo Scientific), containing both a trapping column for peptide concentration (PepMap C18, 5 × 0.3 mm 2 , 5 μm particle size) and an analytical column (PepMap C18, 500 × 0.075 mm 2 , 2 μm particle size (Thermo Scientific)), coupled to a Q Exactive HF-X Orbitrap (with HCD, higher-energy collisional dissociation mode) mass spectrometer via a Proxeon nanospray flex ion source (all from Thermo Scientific). For peptide chromatography, the concen-tration of organic solvent (acetonitrile) was increased linearly over 2 h from 1.6 to 28% in 0.1% formic acid at a flow rate of 230 nL/min. For acquisition of MS2 spectra, the instrument was operated in data-dependent mode with dynamic exclusion enabled. The scan sequence began with an Orbitrap MS1 spectrum with the following parameters: resolution, 120,000; scan range, 375−1500m/z; automatic gain control (AGC) target, 3 × 10 6 ; and maximum injection time (IT), 60 ms. The top 20 precursors were selected for MS2 analysis (HCD) with the following parameters: resolution, 15,000; AGC, 1 × 10 5 ; maximum, IT 54 ms; isolation window, 1.2 m/z; scan range, 200−2000m/z; and normalized collision energy (NCE), 28. The minimum AGC target was set at 1 × 10 4 , which corresponds to a 1.9 × 10 5 intensity threshold. Peptide match was set to "preferred". In addition, unassigned, singly, and >6+ charged species and isotopes were excluded from MS2 analysis, and dynamic exclusion was set to 40 s.

MaxQuant Settings
Raw MS data was analyzed using MaxQuant 25 software version 1.6.14.0. MS2 spectra were searched against the canonical H o m o s a p i e n s ( h u m a n ) U n i P r o t d a t a b a s e (UP000005640_9606.fasta, release 2020_01, www.uniprot. org) containing 20607 entries, concatenated with the sequences of 397 common laboratory contaminants (extended MaxQuant contaminants database) and the iRT. Enzyme specificity was set to "Trypsin/P", the minimal peptide length was set to 7, and the maximum number of missed cleavages was set to 2. A maximum of five modifications per peptide were allowed. Carbamidomethylation of cysteine was searched as a fixed modification. "Acetyl (Protein N-term)" and "Oxidation (M)" were set as variable modifications. "Match between runs" and LFQ were activated. Results were filtered at a false discovery rate of 1% at the protein and peptide spectrum match level.

FragPipe Analysis
Screening for protein modifications in an unbiased manner was performed using the open search option of MSFragger 3.3 in FragPipe (v16.0). 26 All raw files were converted to the mzML format using MSConvert 27 with peak picking activated. mzML files were assigned according to sample preparation methods and replicates in the Experiments/Group tab. Default open search parameters were used, with trypsin specificity, −150 to +500 Da precursor mass window, oxidation of methionine, and protein N-terminal acetylation as variable modifications and carbamidomethylation of cysteine as fixed modification. PTM-Shepherd was activated at default settings. The observed mass shifts were obtained from the "global.modsummary.tsv" and "global.profile.tsv" tables in the FragPipe output, inspected, and filtered for abundant and relevant modifications.

Computational Methods
Computational analyses were performed using in-house Rscripts (ref 28 and Supplemental Material_Scripts). The data was processed as follows: Proteins only identified by a modified peptide, contaminant proteins as well as protein groups with less than two razor and unique peptides were removed, and LFQ intensities were log 2 -transformed. Only IDs identified by MS/MS were considered. The data was filtered based on valid values in LFQ intensities with a cutoff of three valid values in at least one group. The remaining missing values were imputed by a constant equal to the minimal log 2 LFQ intensity across all samples (rounded down to the next integer), in this case, 21. For principal component analysis (PCA) analysis, the prcomp() function from the package stats (preinstalled in R) was used.
k-Means Clustering. k-Means clustering was performed using the function kmeans() from the preinstalled R package stats. All of the above-described functions are embedded in the in-house script termed Cassiopeia. 28 Briefly, Cassiopeia is an in-house built LaTeX script that runs on R-code and is used for the generation of quality control outputs and statistical outputs and for visualization of information for a given "proteinG-roups.txt" file as produced by the quantitative proteomics software package MaxQuant. 25 Mapping of Protein Features. To map protein features, such as protein abundance level, protein structure, localization in cellular compartments, etc., to the clusters, the results of the k-means cluster analysis have been merged with entries of protein databases using an in-house Python script (Supplemental Material_Scripts). The following databases have been used: Human Protein Atlas (proteinatlas.org); 29 PhosphoSite-Plus; 30 PSIPRED; 31 D2P2; 32 Pdbtm; 33,34 Reactome.org; 35 and a database covering the protein expression level, 36 information  on complexes, 37 and aggregator feature. 38 Statistical significance for a protein characteristic's enrichment in a cluster was inferred via one-sided Fisher's exact test using the fisher.test() function from the R package stats. Enrichment factors were calculated as the ratio of observed number/expected number, where the expected number was calculated as the cluster size of cluster k multiplied by the relative frequency of the characteristic n throughout the whole experiment (i.e., enriched compared to the global relative frequency).
Linear Regression Modeling. Within ISD samples, the total number of observed features (proteins, peptides, and peptides with 0 missed cleavages) were analyzed by means of a simple linear model applying the following model formula: IDs ∼ batch + precipitation + buffer. F/ANOVA tests were applied to test for the significance of the individual variables (batch, precipitation method, and buffer). Linear model predictions were visualized as partial residual plots using the function effect_plot() from the R package jtools.
Venn Diagrams and UpSetR plots. The MaxQuant ProteinGroups.txt output table was cleared from contaminants, reverse hits, IDs only by site, and iRT (internal retention time standards) hits. Only protein IDs (protein groups) identified by MS/MS in at least two out of three replicates were considered. Area proportional Venn diagrams were created using web application DeepVenn. 39 Overlaps of protein IDs (%) were quantified in Python (version 3.9) with pandas data analysis toolkit. Intersecting sets of protein IDs in all methods were further visualized in an UpSet plot, which was generated using a scalable matrix-based visualization script employed by the open-source R package UpSetR. 40

Cost-Effort Calculations
The financial expenditure for a method was determined by the cost per sample either according to the manufacturer (e.g., 96 samples for iST 96x kit) or calculated by the reagent cost per sample. For SP3, the cost was determined by the usage of magnetic beads solution per sample. The cost of one-time investments such as magnetic racks needed for SP3 protocols is not included in our table. The hands-on times refer to the processing time of 6 (12 samples for SP3 and SP3-SDC) samples in parallel without digestion, and without additional C18 cleanup or vacuum centrifugation times.
The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE 41 partner repository with the data set identifier PXD030406 and 10.6019/PXD030406.

Experimental Design and Quality Control
To provide a comparative analysis of MS sample preparation methods, we applied 16 widely used protein extraction protocols to isolate whole-cell proteomes from HeLa cells and compared their efficacy on the basis of quantitative and qualitative parameters ( Figure 1A). Our experimental setup covered in-solution digest (ISD) protocols and cleanup methods, including commercially available MS sample preparation kits.
For classical ISDs, cells were lysed either at room temperature (urea) or 60°C (GnHCl, SDC) to optimize cell lysis and protein solubilization. We observed similar protein extraction efficiencies for the three applied buffer systems (Supplemental Figure 1A). Lysates were split into three groups of aliquots of equal protein amounts. The first group was directly subjected to proteolytic digestion using trypsin. The protein fractions of the remaining aliquots were additionally purified prior to proteolysis by acetone, ethanol (given that GnHCl is not soluble in acetone), or chloroform/ methanol protein precipitation, respectively ( Figure 1A). Notably, some combinations of buffer systems and precipitation methods, such as urea-based buffer and chloroform/ methanol precipitation, resulted in significant sample losses. The highest yields were observed with SDC-based buffers (Supplemental Figure 1A), which correspond to previous observations. 20,42 Cleanup samples were prepared as previously described [10][11][12]15,21 or according to manufacturer's guidelines, with the exception of SP3, where, additionally to the detergentheavy buffer system, an easy to prepare buffer consisting of 1% SDC in Tris−HCl (see Materials and Methods for further information) was tested (SP3-SDC). The latter was included since this buffer composition delivered high performance in classical ISDs. 1,20 Overall, we obtained similar peptide concentrations after tryptic digestion in all cleanup samples (Supplemental Figure 1B), even though proteolysis differed in the reaction mix composition, reaction time, and peptide-toenzyme ratio (see Materials and Methods).
To achieve equal loading for MS measurements, peptide concentrations of all samples were determined using UV chromatogram peak areas and adjusted accordingly. MS measurements were performed on a quadrupole-orbitrap hybrid MS instrument ( Figure 1A). All 16 experimental conditions were analyzed in three technical replicates, resulting in a total of 48 MS runs that were measured in six consecutive batches. The performance of the LC−MS system was monitored by inspecting retention times, intensities, and peak shapes of spike-in standards (iRT) to ensure similar conditions within and between batches. Non-normalized summed protein group intensities indicated that comparable amounts of peptides were submitted to MS measurements (Supplemental Figure 1C).

Cost and Time Effort
Since the expenditure of time and money is important to consider, we determined the average cost in US dollars and hands-on sample processing times for the applied methods ( Figure 1B). ISD protocols come at very low consumable costs but are, with the exception of SPEED, considerably more timedemanding than commercial kits. EasyPep, iST, SPEED, and S-Trap protocols were found to have similar hand-on times of around 60 min. FASP, on the other hand, is inherently more time-consuming with long centrifugation steps, taking up to 4 h. Costs ranged from 1$ (ISD, SPEED, SP3, and SP3-SDC) to 5$ (FASP), ∼10$ (S-Trap), ∼20$ (iST), or ∼30$ (EasyPep) per sample. From this perspective, SPEED represents a competitive protocol that combines short handling times with low consumable costs.

Global Comparison of Performance
We first compared overall method performance, considering the total numbers of protein groups (protein IDs) and peptides (peptide IDs) identified by LC−MS/MS (Figure 2A, Supplemental Table 1). After filtering data (see Materials and Methods), we retrieved protein IDs ranging from 3500 to 4500 and peptide IDs ranging from 30,000 to 40,000, with SDC-based sample preparations resulting in the highest numbers. ISD protocols based on GnHCl, on the other hand, delivered the lowest numbers of identified peptides and proteins.
Extraction buffers containing chaotropes or detergents are known to interfere with the protease activity of trypsin, 43,44 which results in incomplete protein digestion and consequently in lower proteome coverage due to oversampling of different cleavage forms of abundant peptides. An analysis of missed cleavage frequencies clearly demonstrates strong differences between protocols, with iST and EasyPep showing the highest efficiencies, followed by ISD-SDC protocols (Supplemental Figure 2A). The high efficiency of iST and EasyPep can be most likely explained by the combined use of trypsin and Lys-C in these kits, in contrast to trypsin alone as in the other protocols. This suggests that all methods could probably benefit from the use of both enzymes (as previously described in refs 45 and 46), which needs to be considered when comparing results across protocols.
The differences in cleavage efficiency also help to interpret the results of protein and peptide IDs (Figure 2A). Some methods with high peptide ID numbers show comparably lower protein IDs (e.g., U-A, FASP, SP3). However, when considering peptides with no missed cleavages (Figure 2A, lower panel), it is evident that the lower digestion efficiency in these methods might result in a lower proteome coverage. The excellent performance of ISD-SDC protocols in terms of protein and peptide IDs even without additional use of Lys-C supports the originally reported properties of SDC to enhance trypsin activity and increase digestion efficiency. 47 Notably, the majority of cleanup protocols and the classical ISD-urea protocols and SPEED showed good performance and rather similar numbers of IDs. Conversely, samples prepared in GnHCl-based buffers displayed the lowest numbers of protein and peptide IDs, suggesting interference of GnHCl with trypsin protease activity even at low concentrations, as reported before. 3,48 The values depicted in Figure 2A represent the sum of multiple effects, which hampers an independent evaluation of the impact of single-method parameters, such as protein precipitation. To elucidate the unique impact of variables on the overall performance of ISD protocols, we applied linear regression modeling. In each model, the number of IDs was explained additively by the supposed independent effects of individual precipitation methods and buffer conditions, in addition to batch effects that derive from technical variance during the MS measurements (Supplemental Figure 2B). On the basis of model parameter estimates, we calculated protein and peptide IDs for individual precipitation strategies ( Figure  2B) and buffer conditions ( Figure 2C) that are corrected for the effects of all other model variables.
In general, protein precipitation only minimally affected the efficiency of protocols, with acetone and chloroform− methanol precipitation being slightly advantageous compared to the other methods ( Figure 2B). The strongest impact on method performance is caused by the type of extraction buffer, which confirms that effective protein digestion is a key determinant for proteome coverage. It is possible that there are additional interaction effects between variables. For example, the bimodal data distribution in acetone precipitated samples could hint that acetone precipitation efficiency is influenced by buffer type. However, such potential effects are difficult to resolve statistically with the current study design and with the available number of data points and would require further and more specific experiments. Generally, the SDC-based buffer resulted in the highest numbers of identified proteins and peptides even without precipitation, whereas other methods like urea ISD clearly benefitted from precipitation protocols. Certainly, as mentioned before, these results have been obtained with HeLa cells and might not be directly translatable to other cells, tissues, or organisms with more challenging properties or specific requirements.

Sample Preparation Artifacts
We next tested whether individual sample preparation methods are prone to protein modification artifacts. We reanalyzed the MS raw data by applying an open search strategy with the FragPipe proteomic software package. 26 The open search allows identifying modified peptides from MS data without the need to specify modifications of interest before the analysis. 49,50 We used the number of PSMs to estimate the abundance of modifications and observed that the majority of PSMs (76−80%) originated from unmodified peptides ( Figure  3A). Most of the detected modifications were equally abundant in the different samples ( Figure 3B and Supplementary Table  2), suggesting that they are either naturally occurring PTMs or inevitable, method-independent sample preparation artifacts. Nevertheless, we observed method-specific modifications and adducts, some of which have been previously described. 51−54 Notably, all method-specific modifications were low in abundance (≤1%). For example, peptides in ISD-urea samples showed increased levels of carbamylation ( Figure 3C), a wellknown artifact for this buffer compound. 54 Peptide artifacts deriving from reduction and alkylation steps could be observed in several methods ( Figure 3D). Despite reports on the disadvantages of using dithiothreitol (DTT) and iodoacetamide (IAA), we selected this protocol for the ISD as it is probably the most widely used and because it also allowed comparisons to other standard protocols such as FASP. Interestingly, the alkylation-related artifacts were rather rare and appeared not as problematic as reported in the literature. 55 Although typical artifacts like off-target alkylation or DTT adducts could be detected, they were found to occur at low levels (<0.5% or mostly lower), as also reported by Hains and Robinson. 56 Carbamidomethylated and carboxymethylated methionine or their according neutral losses 55 as well as potential dialkylation with IAA were not detected or occurred at levels below 0.01%. Among the minor effects, EasyPep and iST showed slightly elevated levels of off-target carbamidomethylation (+57.0215 Da predominantly on lysine and histidine), and in addition EasyPep displayed higher levels of unmodified cysteines (−57.0215 Da), suggesting nonoptimal Journal of Proteome Research pubs.acs.org/jpr Article reaction conditions for alkylation of free thiols. 51 Unfortunately, the type and concentration of chemicals used in these kits are not disclosed; however, based on the "one-pot" reaction conditions and published information, 15 it can be assumed that chemicals other than IAA and DDT are used. Nevertheless, their impact on artifacts and general method performance appears to be rather small when compared to the other protocols in this study. ISD-SDC, and to a minor extent S-Trap, resulted in increased levels of DTT adducts on cysteine (+151.9966 Da). We further recorded a modification seemingly specific to acetone precipitation with a delta mass of +40.0313 Da (propionaldehyde) in ISD-urea and especially ISD-SDC samples ( Figure 3C), possibly constituting acetone adducts. 53 Finally, we observed enrichment of a delta mass of +26.0157 Da in EasyPep samples, likely corresponding to Nterminal acetaldehyde Schiff base formation, 52 and a delta mass of +12.00 Da (formaldehyde adduct), previously described to be specific to FASP samples 57 ( Figure 3E). The open search strategy might not exhibit the sensitivity to reveal all modifications and artifacts occurring in the samples. However, it provided a rather unbiased, broad overview and revealed that only a negligible fraction of peptides was affected by method-induced modifications, indicating that artifacts induced by sample preparation pose only a minor problem for the protocols as they were applied in our study.

Proteome Coverage and Qualitative Differences
Apart from the numbers of proteins and potential artifacts, the most important question is certainly whether methods differ in terms of identity and quantity of the proteins they extract. We investigated whether the individual sample preparation methods covered largely similar or distinct fractions of the HeLa proteome ( Figure 4A). Based on this analysis, it appears that overall proteome coverage is rather comparable. We observed a predominant overlap of protein IDs when comparing classical ISD methods and SPEED (3498 proteins, 75.3% overlap). Similar observations were made when comparing the cleanup methods FASP, S-Trap and commercial kits EasyPep, iST (3711 proteins, 78.9% overlap) or when comparing SDC-A, FASP with SP3-based methods (3800 proteins, 81.9% overlap) ( Figure 4A). The overlap of all 16 methods (2989 proteins) was 61.6% (Supplemental Figure 3), but this lack of overlap is certainly also driven to a large extent by missing identifications of rather low abundant peptides due to the stochastic nature of data-dependent acquisition. It is clear though that a simple analysis of overlaps in protein IDs does not allow to reveal specific or more subtle differences.
In contrast, a principal component analysis (PCA) of labelfree quantified protein intensities separated out distinct clusters for the replicates corresponding to the different sample preparation methods, pointing toward qualitative differences in preparation-dependent variables ( Figure 4B). We observed clustering according to buffer and precipitation conditions, with chloroform/methanol precipitation being more distant from other approaches. Distinct grouping of SP3-derived samples was also observed, irrespective of the applied buffer systems, suggesting that the magnetic beadmediated protein pulldown poses a key variable for methodspecific protein extraction. Furthermore, iST and EasyPep clustered close to SDC-ISD protocols, suggesting similarity in their methodology.
To further elucidate method-specific differences systematically, we carried out an explorative k-means cluster analysis and thereby classified variation patterns in protein intensities ( Figure 5A). We first defined the optimal number of clusters using the sum of squares within (SSW) distances to the next cluster center. Our approach defined nine k-means centers of the cluster (k = 9) as the optimal number, each showing a distinct method-dependent signature pattern of centernormalized LFQ intensities (Supplemental Figure 4A). Each cluster therefore consists of an individual set of protein IDs ( Figure 5B and Supplemental Figure 4B). A downshift in center-normalized LFQ intensities suggests a method-dependent decrease in protein isolation efficacy in a given cluster. The opposite is true for observed upshifts. For clusters with a large number of elements, such as clusters 1 (n = 1112) and 2 (n = 1935), we observed similar performance of all sample preparation methods (Supplemental Figure 4B). This suggests that the majority of proteins are effectively extracted independent of the applied protocol, which is also in agreement with the Venn diagrams ( Figure 4A). Methodspecific up-or downshifts in center-normalized LFQ intensities were prominent in clusters of smaller size, such as cluster 9 (n = 45), showing the most profound differences. Shifts in LFQ intensities were generally trending downward. Figure 5C summarizes the relative efficiency of sample preparation methods for each cluster in a heatmap ( Figure 5C) and highlights that all methods display distinct profiles with unique features.
Clear clustering suggests that the respective proteins share common properties. We performed enrichment analysis on protein features that we extracted from selected databases, such as the Human Protein Atlas for subcellular localization; 29 PhosphoSitePlus for known PTMs; 30 PSIPRED for information on the secondary structure; 31 D2P2 providing a score for disordered regions; 32 Pdbtm for transmembrane domains; 33,34 Reactome.org for cellular pathways; 35 and three databases covering protein expression levels, 36 complex information, 37 and aggregation features 38 ( Figure 5A). To determine which properties promote effective extraction by a given sample preparation method, we calculated cluster-specific enrichment for individual protein features ( Figure 5D and Supplemental Table 3). By combining the information in Figure 5C,D, one can determine which methods can be used to purify specific protein features. In detail, Figure 5C (or Supplemental Figure  4B) illustrates whether a given method works well with a cluster (e.g., all ISD-Gnd methods are well suited for purification of proteins of cluster 9), while Figure 5D (or Supplemental Table 3) shows the cluster properties (the only protein feature enriched in the given example is ″ion uptake and transport″).
As stated above, cluster 1 (n = 1112) comprises a high number of proteins that become efficiently isolated by all sample preparation methods (Supplemental Figure 4B). We found several features connected to the histone deacetylase (HDAC) 1 complex to be enriched in cluster 1, indicating that the nuclear fraction of proteins can be purified with all tested methods at equal efficacy. Clusters 4 (n = 114) and 5 (n = 112) showed enrichment of several mitochondrion-associated properties, such as mitochondrial protein import, mitochondrial translation termination, respiratory electron transport, cytochrome c oxidase complex IV, and mitochondrial ribosomal large subunit ( Figure 5D). The fact that CMbased precipitation showed lower center-normalized LFQ intensity levels in clusters 4 and 5 ( Figure 5C and Supplemental Figure 4B) suggests that these protocols should be avoided for mitochondrial proteomics. Conversely, ISD (without CM) and SPEED protocols seem to be well suited for mitochondrial protein extraction, as they resulted in the highest intensity levels ( Figure 5C,D and Supplemental Figure  4B). Cluster 8 (n = 169) showed enrichment of vesicle-and membrane-associated protein properties ( Figure 5D), which is consistent with the good performance of ISD-SDC in this group 58 ( Figure 5C). Finally, proteins associated with iron uptake and transport were exclusively found to be enriched in cluster 9 (n = 45) ( Figure 5D). Successful extraction of this set of proteins seems to be best achieved using ISD protocols based on GnHCl buffers.
Certainly, the efficacy of protein extraction of all applied methods could be further optimized. Here, we provide a basis for doing so, indicating steps in sample preparation protocols that could be further fine-tuned. As suggested in previous reports, 1,4,13,19,20 different combinations of buffer components and buffer systems, reactor types, proteolytic digestion protocols, and the use of nucleases could be implemented. Changes to protocols should, however, be made with caution since cross-compatibility of reagents is not always guaranteed. For example, we occasionally observed gel-like phases in extracts when we used SDC in conjunction with phosphate buffers (unpublished observation). Our data also suggests that omitting a protein precipitation step during MS sample preparation can still result in sufficient proteome coverage for HeLa cells. Yet, we generally advise including a protein precipitation step to avoid carry-over of nonprotein cellular components such as lipids, nucleic acids, metabolites, etc., which could cause problems during later steps of sample preparation.
In general, different cell types and organisms may require different adaptations. To give an example, we observed that using buffer systems containing urea in combination with chloroform−methanol precipitation resulted in significant losses when proteins were extracted from Saccharomyces cerevisiae cells (unpublished observation). Doellinger et al. 4 have shown that the SPEED protocol outperforms other protocols when processing bacterial samples. Furthermore, it is well known that samples from plants or fungi often require specific protocols due to the high level of interfering metabolites.
Previous comparisons of sample preparation methods across species have shown that extraction biases do exist and that therefore a universal method is rather unlikely. 1,4 Our study additionally demonstrates that even within the same sample type there is no one-fits-all protocol because all methods have their own peculiarities. For example, even though the SPEED protocol performs well in many aspects it also exhibits an extraction bias toward certain protein groups, e.g., for proteins associated with the Golgi apparatus and transport to the plasma membrane (see cluster 6 and cluster 8, Figure 5C,D). However, despite these clear differences for specific clusters, our data also show that most methods, with the exception of GnHCl, perform overall rather similar in this cell type, which allows choosing methods rather on other parameters like ease of use, processing times, etc.
In summary, despite similar proteome coverage, we could extract qualitative differences between the different protocols that represent varied purification efficacy for certain sets of proteins. The presented matrix, the underlying data set, and the according methodology may serve as a guideline for the choice of a best-suited sample preparation method for a specific group of proteins of interest.

■ CONCLUSIONS
The present study provides an in-depth and solid comparison of 16 of the most widely used MS sample preparation protocols in a human cell line. Careful attention has been paid to quality control and experimental design to maximize reproducibility and comparability and to allow for unbiased statistical analyses. We demonstrate that the applied protocols had an overall rather similar performance with a low degree of protein modification artifacts and similar protein extraction efficiencies. Our analysis further revealed method-specific protein clusters, and we summarized their features in a guide matrix to assist in choosing an appropriate method. Ureaacetone, SDC-acetone, and FASP protocols perform well in terms of the number of covered protein/peptide IDs and enrichment of all classes of proteins. In addition, these methods are also comparatively cheap. A similar degree of performance was observed for the commercial kits, with the additional benefit that materials and reagents are provided in a standardized manner and handling is straightforward. SPEED delivered in general a good performance and its simplicity and low price make it an attractive alternative. However, our data also showed that several methods (SPEED, FASP, S-Trap, and SP3) could benefit from further refinements (e.g., trypsin and Lys-C digest). Finally, we also highlighted methods preferable for enrichment for specific protein characteristics. For example, ISD in combination with GnHCl buffer is well suited for the isolation of proteins associated with iron uptake and transport, however, at the cost of reduced efficacy of digestion and an overall lower proteome coverage.
Selected quality control steps from our experimental approach (Figure S1), missed cleavages and batch effect ( Figure S2), overlap of identified proteins between all methods ( Figure S3), and k-means cluster centers ( Figure S4) (PDF) Number of proteins, peptides, and missed cleavages (