Mass Spectrometry For Natural Product Discovery: © 2020 Elsevier Ltd. All Rights Reserved
Mass Spectrometry For Natural Product Discovery: © 2020 Elsevier Ltd. All Rights Reserved
Mass Spectrometry For Natural Product Discovery: © 2020 Elsevier Ltd. All Rights Reserved
Natural products chemistry is focused on small organic molecules produced by organisms, especially microbes and plants,
as secondary metabolites. They have proven to be rich sources of medically- and industrially-important bioactive molecules. The
discovery of penicillin, the first commercially used antibiotic, by Alexander Fleming from the culture broth of Penicillium
chrysogenum in 1929 opened up a decades-long era of natural product discovery.1 Since then, many natural products have been
successfully commercialized including antibiotics, for example, the anti-tuberculosis drug streptomycin, the anti-parasite com-
pound avermectin, the broad spectrum spinosyn insecticides, and artemisinin, a drug against malaria, and many more. The mid-
20th century witnessed the “Golden Age” of antibiotics discovery. Historically, scientists screened culture broths or extracts of
microbes and plants for desired activities, such as antimicrobial properties. This was followed by compound isolation, activity
screening, and structural analysis. Although this approach was effective in finding natural products with desired activities, the
discovery of new natural products became harder and harder due to the increasing rediscovery rate of the same compounds after
laborious screening, purification and identification steps. Nevertheless, recent advancements in genomics technologies have
revealed a treasure trove of natural products that remain to be discovered and characterized. Hence, analytical technologies have
also been developed to unveil the chemical world of living organisms, the “parvome” named by Julian Davies,2 and distinguish
known metabolites from novel compounds.
Mass spectrometry technologies have proven invaluable for characterizing a diversity of organic compounds including natural
products. A major initial application for mass spectrometry was in the petroleum industry. In early days, the major method of
ionization of analytes was by electron ionization (EI). Here, analytes are vaporized under high vacuum and pass through a beam of
high energy electrons, which remove a valence shell electron from the analytes. EI often causes fragmentation of molecular ions and
pure samples are required to obtain useful structural information. Therefore, gas chromatography (GC) was coupled with an EI
mass spectrometer to analyze impure samples. This hyphenated technology (GC-MS) became a powerful tool to identify com-
pounds that were adequately volatile and stable at 300 C. Some polar compounds with hydroxyl groups could also be analyzed
by derivatization of hydroxyl groups. As such, GC-MS is most useful to identify volatile natural products with low molecular weight
such as terpenes.
Although detection of many chemical compounds became faster using GC-MS, a number of natural products were still
overlooked due to the inability of GC to analyze non-volatile compounds. Liquid chromatography (LC; often high pressure liquid
chromatography, HPLC), on the other hand, is a robust technology to separate a wide range of chemical compounds. However,
before the development of approaches to coupling LC with mass spectrometry, liquid chromatography was used with ultraviolet-
visible (UV–vis) spectrophotometry such as a photodiode array detector (LC-PDA). LC-PDA is very useful to detect compounds
containing conjugated double bonds. Unfortunately, many natural products lack such features, and further, UV–vis does not
provide any meaningful structural information.
The advent of electrospray ionization (ESI) and atmospheric pressure chemical ionization (APCI) technologies overcame this
issue. Both technologies are complementary, as discussed in the following section on ionization, and ionize analytes under
atmospheric pressure and often generate protonated (or deprotonated) molecular ions without causing significant fragmenta-
tion. Hence, unlike EI mass spectrometry, these soft ionization techniques provide information that can be used to infer the
chemical composition of intact molecules. The invention of collision induced dissociation (CID) tandem mass spectrometry
(MS/MS) enabled generation and detection of fragment ions from a molecular ion generated by ESI, which can be used to infer
the chemical structure of analytes. By integrating LC and MS/MS, rapid detection of a wide variety of features of natural products
present in a sample together with their molecular ions and fragmentation spectra became feasible. ESI mass spectrometry is often
the primary choice for coupling with LC in an LC-MS system due to its ability to detect analytes over a wider range of masses and
polarity.
The ability to infer the structures of natural products based on the fragmentation patterns is one of the very important aspects of
GC-MS and LC-MS analyses. Both database searches of fragmentation spectra and the mass-to-charge ratio (m/z) of molecular ions
are useful for determining whether the query molecule has already been discovered beforehand (dereplication). This is a significant
advantage of mass spectrometry since researchers can avoid spending tremendous amounts of time and effort on “rediscovery” of
known molecules. In addition, related molecules can be grouped based on the similarity of their fragmentation spectra, and analytes
similar to known molecules can be easily identified. Several algorithms and databases have been developed to do this. Widely used
algorithms include MS-Clustering of molecular networking.3 Global Natural Products Social Molecular Networking (GNPS) has
libraries of fragmentation spectra from about 20,000 compounds and implements the molecular networking analysis for
dereplication.4
Isotope patterns can be diagnostic for elemental composition, especially for high resolution mass spectrometry data. For many
elements, the higher isotopes are much lower abundance than the lowest monoisotopic mass (M0). This is true for carbon,
hydrogen, oxygen and nitrogen, and many others. However, some elements have abundant higher mass isotopes, and this can be
Mass Spectrometry for Natural Product Discovery 265
very helpful in chemical formula determination. For example, the halogen atoms chlorine and bromine each have 2 major isotopes
of high abundance, M0 and M+2. Since a large number of natural products contain these halogen atoms (e.g. chloramphenicol),
they are easily identified by analyzing the isotope distribution of the molecular ions. Owing to the high resolution and mass
accuracy of many modern mass spectrometers, it is possible to directly determine the chemical formula for a given natural product,
especially for small molecules of low mass.
As metabolomics using LC-MS technology has become more common, the need for data analysis tools has also increased since
typical LC-MS analyses involve comparing thousands of signals with m/z and retention time tags, and the most intense signals can
also be associated with MS/MS spectra. Several programs have been developed to deconvolute LC-MS signals into features or peaks
and align features among multiple files. Subsequent to this pre-processing, statistical analyses are often required to prioritize any
LC-MS features or strains/conditions that are of interest. Principal component analysis (PCA) can group strains or cultivation
conditions based on the similarity of metabolite production patterns to prioritize metabolically unique strains/conditions.5 PCA
becomes less effective as the data size grows and is often strongly influenced by largest differences. Kohonen self-organizing map
(SOM) analytics is a complementary method to perform comparative metabolomics with multiplexed data.5 Features with similar
trend of fluctuation are grouped together and spotted onto nearby nodes of a map. SOM is improved by increasing the number of
datasets. By comparing the heatmaps of SOM analyses, metabolically unique strains/conditions can be prioritized for subsequent
experiments.
The more recent development of desorption electrospray ionization (DESI) mass spectrometry, especially nanospray desorption
electrospray ionization (nanoDESI) MS, has advanced in situ detection of secreted metabolites on agar plates.6 Although this was
already feasible with matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) mass spectrometry imaging (MSI),7
nanoDESI requires no sample preprocessing and unlike MALDI-TOF, can be applied non-destructively to detect local metabolite
signals over the course of time, enabling time course measurements and visualization of the spatial distribution of metabolite
production. One of the applications of this technology is monitoring communication between plants and microbial or fungal
colonies by detecting secreted metabolites, and under certain conditions, result in the detection and identification of produced
natural products.8,9 Another application of this technology is high-throughput metabolomics. For instance, since DESI can analyze
one sample in 2 s with no sample preprocessing,10 high-throughput screening to find natural products with unique structural
features has become faster and easier.
Mass spectrometry plays a number of critical roles at different stages in the natural product discovery process, ranging from sample
selection, screening and fractionation to identification and dereplication. A generalized analytical workflow for natural product
discovery is shown in Fig. 1, illustrating both a traditional approach and a natural product library approach.10a Both approaches
begin with selecting an appropriate source material with high potential for natural product discovery. Following the traditional
approach, an extracted mixture is screened prior to separation and purification of components, whereas with the library approach,
these steps are reversed as libraries are created from samples and then screened. However, for each, the final step requires
identification of the active compound for potential production or other utility. Both approaches have been successfully applied
to result in discovery of new natural products with desirable properties.
The more traditional approach characterized early natural product discovery efforts in the 1940s to 1970s, or “Golden Age,” and
was based on performing phenotypic and activity screens on extracts from soil bacteria, plants and fungi.11 These extracts could be
either directly screened for activity or fractionated before. An activity screen may include growth inhibition, for instance, by applying
an extract to a susceptible organism’s culture and measuring growth. Extracts found to have desirable activities are then subject to
Mass spectrometry
Sample source LC-ESI/APCI, MS/MS, Identification of active
selection (extracts) MALDI, LC-MS-NMR, MS components
Imaging
Library
approach
Separation into Screen of pure
pure components components
Fig. 1 Role of mass spectrometry in each step of an analytical workflow for natural product discovery. Based on with permission from Gilbert, J. R.; Lewer, P.;
Duebelbeis, D. O.; Carr, A. W. The Central Role of Mass Spectrometry in Natural Products Discovery. In Integrated Strategies for Drug Discovery Using Mass
Spectrometry; Lee M. S. Ed.; Wiley, 2005. https://doi.org/10.1002/0471721034.ch6. Copyright (2005) John Wiley and Sons, Inc.
266 Mass Spectrometry for Natural Product Discovery
a variety of techniques including chromatography, further extraction, and other fractionation steps to purify and identify the active
compound(s). A combination of mass spectrometry and NMR is often finally applied for identification of active compounds,
including steps of dereplication, and structure determination. This traditional approach has led to the identification of over 1000
natural products with either antifungal, antiparasitic or antibacterial activity, as well as anticancer drugs based on screens with
cancer cell lines.
Mass spectrometry can also play an important role in determining which samples or extracts to select and subject to an activity
screen or further pursue with treatments to induce natural product production. For instance, to pre-screen and select samples that
have high potential for activity, extracts may be run first through LC-MS or MALDI-TOF.12,13 Spectra can be analyzed to determine
samples that may be prolific in producing secondary metabolites, or compared to control samples or databases of library spectra to
see which samples have new or infrequently occurring compounds.14 In work by Samat et al.,12 PCA was applied to LC-MS data
collected on 267 extracts to prioritize 27 that contained potentially new photosensitizers, leading to the identification of 2 new
compounds with this activity (Fig. 2). Alternatively, spectra can be quickly analyzed to determine if a recombinant technique has
successfully generated production of natural products, or what part of a biosynthetic pathway has been modified.15 In recent work
by Shih et al.,16 a new gene stacking method, jStack, was validated by confirming violacein production in leaves of a plant after
insertion of bacterial genes (Fig. 3).
From the 1970s through today, new strategies have emerged for finding natural products based on advancements in science and
technology including low-cost genome sequencing, synthetic biology (e.g. recombinant DNA), combinatorial biosynthesis, high-
throughput methods for screening and fractionation, and innovations in mass spectrometry. These have led to the ability to easily
create and screen large libraries of purified or semi-purified compounds from hundreds to thousands in size.
Fig. 2 (A) General workflow screening extracts with LC-MS and analyzing with PCA to identify new photosensitizers. (B) Example PCA plot showing outlier extracts,
with unique masses identified in LC-MS data driving group differences targeted for further investigation. (B) Adapted from Samat, N.; Tan, P. J.; Shaari, K.; Abas, F.;
Lee, H. B. Prioritization of Natural Extracts by LC–MS-PCA for the Identification of New Photosensitizers for Photodynamic Therapy. Anal. Chem. 2014, 86,
1324–1331. https://doi.org/10.1021/ac403709a. Copyright (2014) American Chemical Society.
Mass Spectrometry for Natural Product Discovery 267
Intensity (a.u.)
VioE
Protodeoxyviolaceinic acid
VioD
Protoviolaceinic acid
VioC
Fig. 3 Synthetic biology technique jStack used to engineer bacterial genes into a Nicotiana benthamiana plant for producing violacein, with production in
leaf confirmed using LC-MS. Modified under public license from Shih, P. M.; Vuu, K.; Mansoori, N.; Ayad, L.; Louie, K. B.; Bowen, B. P.; et al. A Robust
Gene-Stacking Method Utilizing Yeast Assembly for Plant Synthetic Biology. Nat. Commun. 2016, 7, 13215. https://doi.org/10.1038/ncomms13215. https://
creativecommons.org/licenses/by/4.0/.
An advantage of using a purified library compared to a simple extract relates to the complexity and diversity of compounds
found in a sample. When all compounds comprising a sample are tested simultaneously, there is a risk of false discovery. It is
possible that a single compound that is effective may not be present in high enough abundance for activity to be observed, activities
of ubiquitous compounds may interfere with or mask the activity of another, or multiple compounds may have the same type of
activity but at different levels or selectivity.17,18 Libraries make it possible to screen each compound individually (or in small
groups) and over a range of concentrations, overcoming these limitations.
Types of libraries vary depending on how they were created. As already discussed, libraries may be comprised of compounds
fractionated and purified from extracts obtained from natural sources. Creation of these libraries is typically performed using
automated equipment performing a series of steps including separation, fractionation and concentration. For example, SepBox is
one of the most widely used automated technologies for creating fraction libraries.19 Here, a sequential HPLC/SPE/HPLC/SPE
system is applied in which a large volume of extract is introduced into a preparative HPLC column (for separation) coupled to
a solid phase extraction system under high pressure (for concentration), with fractionation from the HPLC to SPE triggered either by
time, UV or evaporative light-scattering detection (ELSD). These SPE are coupled to additional HPLC columns for further
separation, fractionation and then concentration on another set of SPE, followed by final collection with elution off the columns
creating a semi-purified library ready for screening. In an approach like this, 600 fractions can be obtained in under 24 h, and further
selectivity can be achieved by using HPLC columns and SPE cartridges with different chemistries, varying mobile phase solvents and
fractionation trigger parameters. These are discussed in more detail in the following section on separation and fractionation.
Once a semi-purified library from an extract has been created, mass spectrometry is important for determining the purity of the
fraction as well as identifying what compounds are in each. Often, laser desorption MS techniques off specialized surfaces, including
MALDI-TOF, nanostructure-initiator mass spectrometry (NIMS), or desorption/ionization on silicon (DIOS), are used for this type
of testing owing to minimal sample processing steps and amenability for high-throughput.10,20,21 With these techniques, high
quality spectra can be generated in seconds or less from a single spotted sample only a few millimeters in diameter. Further, when
used in combination with acoustic printing of a sample onto target surfaces, amounts as small as only 1 nL may be all that is
required for analysis, conserving sample that may have limited availability and also significantly increasing throughput to make
possible screens of at least 10,000 samples/day (Fig. 4A).22 Often, direct injection into various types of MS sources (e.g. ESI, APCI) is
another option. Although less optimized for high-throughput, use of alternative sources have the ability to detect compounds that
may not ionize as well using other methods.23,24 However, with DESI or nanoDESI, this technique advantageously combines ESI
with surface-based sample preparations (Fig. 4B).25 As will be discussed later, depending on the ionization mechanism, different
MS methods are suitable for detecting different kinds of compounds.
Mass-triggered fractionation is another successful strategy for creating purified libraries from a crude extract or combinatorial
library.26,27 Here, a mass spectrometry system is coupled to an HPLC, supercritical fluid chromatography (SFC) or other separations
technology, and fraction collection triggered by mass detection. Simultaneous collection of UV and ELSD measurements can also be
collected for further characterization of each fraction. This technique is ideal for obtaining high purity samples prior to screening.
Additionally, since the composition of each fraction has already been characterized with mass spectrometry, fraction purity and
compound identification does not need to be determined with further processing steps.
While crude extracts from natural sources are one source of generating natural product libraries, synthetic and genetic techniques
have developed for production of natural products, and the knowledge necessary for creating libraries from combinatorial
biosynthesis. This includes mutasynthesis, in which mutated bacteria can be induced to produce analogs of a known natural
product by making modifications or blocks in specific steps of a synthesis pathway.28 A related technology, precursor-directed
268 Mass Spectrometry for Natural Product Discovery
Fig. 4 Desorption based high-throughput MS approaches. (A) Schematic of high-throughput NIMS screening in which samples are acoustically printed and
analyzed by NIMS. (B) DESI source schematic in which a nebulized solvent spray is rastered across a surface and desorbed ions pulled into an MS inlet. Modified
with permission from de Raad, M.; Fischer, C. R.; Northen, T. R. High-Throughput Platforms for Metabolomics. Curr. Opin. Chem. Biol. 2016, 30, 7–13. https://doi.
org/10.1016/j.cbpa.2015.10.012. Copyright (2016) Elsevier.
biosynthesis, relies on disrupting pathways used in synthesis of precursor molecules used in making a specific natural product, and
then adding different precursors to use as a “starting” position resulting in production of analogs.29 With recombinant DNA (rDNA)
technology, genes can be inserted into organisms for production of natural products or other biological modifications.30 Most
recently, clustered regularly interspaced short palindromic repeats (CRISPR)-Cas9 based approaches have been used for accurate
and efficient genome editing of organisms for the discovery, characterization and production of natural products, often by activating
silent biosynthetic gene clusters (BGCs).31–34 As shown in Fig. 5, detection of unique mass spectra peaks present in a modified
(promoter-inserted) versus control (parent) strain leads to identification of new natural products produced upon gene activation.
These and complementary innovations have led to greater understanding of the genes, gene clusters and biochemical pathways
involved in natural product biosynthesis, enabling combinatorial biosynthesis strategies and tools to produce hundreds of
derivatives based on a known structure.35
Whether a library or crude extract, numerous screening techniques are employed to help determine which samples have
desirable activity. Many of these integrate mass spectrometry techniques, either integrated as part of the screen or involved
as part of the fractionation and identification steps. For example, “high resolution screening” (HRS) with MS monitors a change
in fluorescence when molecules elute off an LC column and bind to a target receptor in the effluent stream.36 This active fraction is
directed to the MS for acquisition of spectra for identification and dereplication. Other screening techniques use mass spectrometry
itself as a direct measurement of activity. Many of these rely on detecting mass shifts associated with binding, either covalent or
noncovalent, between target and analyte.37 Multitarget affinity/specificity screening (MASS) is an example of one of these types of
screens, and can be applied for screening large natural product libraries (Fig. 6). Here, when a molecule binds to a target (e.g. ligand
binding to RNA), a new mass, or “shifted mass” resulting from binding to the target, is detected compared to a control solution with
only target or only extract/fraction.38,39
Two other innovative MS-based screening assays include structure-activity relationship (SAR) by MS and detection of
oligonucleotide–ligand complexes by electrospray ionization mass spectrometry (DOLCE-MS). With SAR by MS, a combinatorial
library having a range of classes of motifs may be screened against a target, with binding detected by MS. Swayze et al. used SAR by
MS to find 2 classes of binding motifs with high affinity for a specific RNA target,40 then further performed MS competition
Mass Spectrometry for Natural Product Discovery 269
Fig. 5 (A) CRISPR-Cas9 approach used to insert heterologous promoters to activate a silent BGC in a Streptomyces host strain. (B) Mass spectrometry is used
to detect newly synthesized compounds after promoter insertion into Streptomyces (blue and red spectra) that are not present in the parent strain (black spectra).
This is indicated in the spectra shown with unique mass spectra peaks appearing at m/z 405, 780 and 425 in strains with inserted promoter. Modified with
permission from Zhang, M. M.; Wong, F. T.; Wang, Y.; Luo, S.; Lim, Y. H.; Heng, E.; et al. CRISPR–Cas9 Strategy for Activation of Silent Streptomyces Biosynthetic
Gene Clusters. Nat. Chem. Biol. 2017, 13(6), 607. https://doi.org/10.1038/nchembio.2341. Copyright (2017) Springer Nature.Adapted from Zhang et al.
experiments to distinguish separate binding sites for structurally different ligands as well as competitive and concurrent binding of
different ligands. Seth et al. also used SAR by MS to find a new class of small molecules that bind with high affinity to a subdomain
of the Hepatitis C virus.41 Similarly, DOLCE-MS is able to detect non-covalent binding to single-stranded or double-stranded
oligonucleotides by interpreting mass shifts in mass spectra in combination with ratio of detected single- vs. double-strands. This
technique has shown utility as a secondary screen for compounds against viral-associated oligonucleotides, distinguishing whether
compounds from a “primary” viral screen act by either interacting with a target enzyme, or binding DNA or RNA, a mechanism that
may be more likely to be nonspecific and toxic.42 Alternatively, this method is also useful when screening for drug candidates that
specifically target dsDNA, with alternative ionization techniques possible (e.g. laser spray) that are able to maintain stability of the
noncovalent interaction in solution.43
In the following sections, we will discuss in more detail each of these steps in the natural product discovery process, focusing on
the role of mass spectrometry, as well as more recent innovations in mass spectrometry imaging and genomic analysis that are
currently changing the landscape of natural product discovery as new ways to find and detect and understand their biosynthesis.
6.12.3 Extraction
Typically the first step and primary challenge in natural product analysis workflows is sample preparation, which involves the
successful separation of target metabolites from the sample matrix using an appropriate extraction technique. Due to the great
diversity of raw materials and compounds with different chemical properties, a number of different extraction methods have been
developed. Different solvents and methods will be more appropriate for successfully extracting different types of samples. Extraction
methods are highly diverse and range from maceration, simple liquid-liquid extraction, Soxhlet extraction, supercritical fluid
extraction, and even sublimation. Table 1 lists some of these methods and the operating conditions under which they are applied.44
To achieve high yield, good reproducibility, low cost, and efficiency in the extraction process, various parameters in these methods
are optimized for different samples including solvent, temperature, pressure and time.
(16S + 818)–5
1889.69
3.0e+07 212%
2.0e+07
1889 1891 1893 m/z
0.0e+00
1700 1800 1900 2000 m/z
Fig. 6 Example of noncovalent binding activity screen of natural product library fractions using MASS. (A) Mass shifts detected in a mass spectrometry assay of
natural product fraction 146 with a synthetic 27-mer 16S rRNA A-site. Peaks are detected at m/z 1627 for 16S RNA, as well as m/z 1849 and m/z 1889
corresponding to a mass shift of 616 for paromomycin (PM) and 818 for a new, unknown molecule. (B) Mass spectra showing m/z of natural products detected in
fraction 146 with two m/z values corresponding to the detected mass shifts associated with 16S RNA. Reprinted with permission from Cummins, L. L.; Chen, S.;
Blyn, L. B.; Sannes-Lowery, K. A.; Drader, J. J.; Griffey, R. H.; Hofstadler, S. A. Multitarget Affinity/Specificity Screening of Natural Products: Finding and
Characterizing High-Affinity Ligands From Complex Mixtures by Using High-Performance Mass Spectrometry. J. Nat. Prod., 2003, 66(9), 1186–1190. https://doi.org/
10.1021/np0301137. Copyright (2003) American Chemical Society.
Fig. 7 Standard steps in a mass spectrometry workflow as part of the natural product discovery process.
Mass Spectrometry for Natural Product Discovery 271
Table 1 Methods and associated conditions commonly used for extracting natural products.
Maceration Water, aqueous and Room temperature Atmospheric Long Large Dependent on
non-aqueous solvents extracting solvent
Percolation Water, aqueous and Room temperature, Atmospheric Long Large Dependent on
non-aqueous solvents occasionally under heat extracting solvent
Decoction Water Under heat Atmospheric Moderate None Polar compounds
Reflux extraction Aqueous and non-aqueous Under heat Atmospheric Moderate Moderate Dependent on
solvents extracting solvent
Soxhlet extraction Organic solvents Under heat Atmospheric Long Moderate Dependent on
extracting solvent
Pressurized liquid Water, aqueous and Under heat High Short Small Dependent on
extraction non-aqueous solvents extracting solvent
Supercritical fluid Supercritical fluid (usually Near room temperature High Short None or small Nonpolar to
extraction S-CO2), sometimes with moderate polar
modifier compounds
Ultrasound assisted Water, aqueous and Room temperature, or Atmospheric Short Moderate Dependent on
extraction non-aqueous solvents under heat extracting solvent
Microwave Water, aqueous and Room temperature Atmospheric Short None or moderate Dependent on
assisted non-aqueous solvents extracting solvent
extraction
Pulsed electric field Water, aqueous and Room temperature, or Atmospheric Short Moderate Dependent on
extraction non-aqueous solvents under heat extracting solvent
Enzyme assisted Water, aqueous and Room temperature, or Atmospheric Moderate Moderate Dependent on
extraction non-aqueous solvents heated after enzyme extracting solvent
treatment
Hydro distillation Water Under heat Atmospheric Long None Essential oil (usually
and steam non-polar)
distillation
Modified under public license from Zhang, Q. W.; Lin, L. G.; Ye, W. C. Techniques for Extraction and Isolation of Natural Products: A Comprehensive Review. Chinas Med. 2018, 13,
20. https://doi.org/10.1186/s13020-018-0177-x. https://creativecommons.org/licenses/by/4.0/.
with MS instrumentation and do not require extensive drying times, thermal processing or other procedures that may result in
compound degradation. For example, some of the most common solvents used in MS extraction are methanol, ethanol and water.
Both methanol and ethanol are able to extract compounds with both polar and nonpolar properties, do not interfere with signals
when injected into an MS, and can be quickly and easily removed by evaporation or under low vacuum (e.g. SpeedVac) to
concentrate samples or in preparation for further extraction steps. Water is excellent for extraction of polar compounds,
MS-compatible, and is easily removed by freezing and sublimation techniques (e.g. lyophilizer). In contrast, solvents such
as dimethylsulfoxide (DMSO) and dimethylformamide (DMF), while excellent for solubilizing a wide range of compounds of
varying chemistries, are typically not used due to high boiling points (cannot be removed via SpeedVac or lyophilization), and at
higher concentrations, may cause interference with MS signal (e.g. ion suppression) or instrumentation (e.g. compatibility with
PEEK fittings). Acetone, while also an excellent solvent, may contain contaminants that interfere with mass spectrometry especially
when exposed to some plastics during the extraction procedure.
Health and environmental considerations also play an important role in solvent selection. For example, many chlorinated
solvents such as carbon tetrachloride and chloroform have adverse health effects, and methanol is also toxic. More recently, finding
and applying sustainable (“green”) extraction solvents has gained increasing interest.45 Ethanol, for instance, is completely
biodegradable and can be produced from agricultural plants. As a polar organic solvent, ethanol has been used in extraction of
a wide variety of natural product compounds including phenolics, terpenoids, alkaloids, and carbohydrates.46–48 Water, of course,
is considered a green extractant, and ideal for compounds with polar properties, including protein, sugar and organic acids.
However, low-polarity compounds can also be extracted when applied as pressurized hot water extraction (PHWE), or subcritical
water, in which temperature and pressure are raised to the point that extraction behavior more closely resembles that of ethanol.49
Another green alternative is supercritical CO2, which can be applied for applications in which hexane was previously used, and can
extract weakly polar compounds such as carotenoids and triglycerides.50
In consideration of the above, often the first step in solvent selection is making sure the selected solvent has the appropriate
properties to be capable of dissolving, or removing, the compounds of interest from the sample matrix In “liquid-liquid” and “solid-
liquid” extractions, the octanol-water partition coefficient (Kow) of solvent and target molecule, also known as the law of similarity
and intermiscibility (Similia similibus solventur in Latin),51 or like dissolves like, can be used to predict the solubility of a compound
in a particular solvent. While polar solvents easily dissolve ionic compounds and polar compounds, nonpolar solvents tend to
272 Mass Spectrometry for Natural Product Discovery
Fig. 8 Isoflavones were extracted using a range of solvent mixtures, with different solvents more effective at extracting specific isoflavones than others. Modified
with permission from Luthria, D. L.; Biswas, R.; Natarajan, S. Comparison of Extraction Solvents and Techniques Used for the Assay of Isoflavones From Soybean.
Food Chem. 2007, 105, 325–333. https://doi.org/10.1016/j.foodchem.2006.11.047. Copyright(2007) Elsevier.
dissolve nonpolar covalent compounds better. Solvents commonly used for natural product extraction that are also suitable for MS
sample preparation include n-Hexane, chloroform, ethyl acetate, ethanol, methanol, and water, listed in order of increasing solvent
polarity. As shown in Fig. 8, different solvent systems will be more effective at extracting particular compounds than others.52 Here,
diadzin and genistin, both glycosylated isoflavones, were both extracted well with 90% methanol. However, this solvent was not
able to extract the non-glycosylated, more hydrophobic version of diadzin, daidzein, which required a less polar solvent of 70%
ethanol for extraction. Solvents that were inefficient at extracting isoflavones included the most polar and most nonpolar solvents
tested, water and 58% acetonitrile respectively, likely since isoflavones had a combination of polar and nonpolar chemical
properties.
Nonpolar solvents n-hexane and chloroform are most effective at extracting highly nonpolar natural products, such
as polyphenolic, lipophilic compounds and certain peptides. For example, the antibacterials pediocin, nisin, and bacillin have all
been recovered from bacterial liquid culture using a chloroform extraction.53 Rapinel et al. reported that lipophilic carotenoids and
aromas could be extracted from carrots and caraway seeds using n-hexane or alternatively, liquefied n-butane.54 For extracting
somewhat less nonpolar compounds, such as flavonoids or phenolics, ethyl acetate is a choice solvent, as well as methanol or
ethanol.55 However, ethyl acetate will extract a lower proportion of polar compounds than either methanol or ethanol, especially
when used in conjunction with water, with which it is immiscible, to form a 2-phase system for removing more polar compounds
due to solubilization in the lower water phase. In comparison, both methanol and ethanol are miscible with water and further
purification steps would be necessary to remove polar compounds from an extract. Therefore, for less lipophilic and more polar
compounds, methanol and ethanol are more appropriate. For instance, iridoids are common types of monoterpenoids and can be
effectively extracted with methanol under 50 C overnight,56 while glucosinolates, anionic secondary metabolites involved in the
plant defense against insects, can be extracted with high efficiency in 70%–80% methanol.57,58
Water is efficient for extracting polar or water-soluble compounds, like amino acids, organic acids, and even phenolic
compounds when under high temperature and pressure.45 As mentioned previously discussing PHWE, temperature may dramat-
ically alter the physicochemical properties of a solvent and improve extraction efficiency. Often, increasing temperature will reduce
the viscosity of a solvent, concurrently raising the solubility and increasing diffusivity of metabolites in solution. Although water is
traditionally a polar solvent, it can be used to recover phenolic compounds, volatile oils, and isoflavones from plant materials when
high temperature and pressure is applied with PHWE.49 In another water extraction example, resveratrol and polydatin were
obtained in one simple extraction step by applying ultrasound-assisted extraction to a cyclodextrin (1.5% w/w) water solution.59
Finally, applying a subcritical water extraction technique to coriander seeds allowed essential oils to be extracted, which also
resulted in concentrating more oxygenated forms.60 One potential drawback of water extracts, however, is that depending on the
sample matrix, desalting steps may also be required if a large amount of salt is extracted with target products. Not only can salt
Mass Spectrometry for Natural Product Discovery 273
interfere with MS signal and cause ion suppression, salt can impede efforts to concentrate samples or resolubilization of
a compound in preparation for MS analysis.
For large-scale metabolite profiling, ethanol and methanol are more frequently used as a general extraction solvent since these
provide sufficient solubility over a broad range of compound chemistries, including both polar and nonpolar analytes. Also, further
extraction using either more hydrophobic and hydrophilic solvents can be applied for sample fractionation of dried ethanol and
methanol extracts. For example, in the analysis of plant tricalysiosides, following an initial extraction of Tricalysia dubia leaves with
methanol, concentrated alcoholic extracts were then separated into an ethyl acetate fraction, n-butanol fraction, and water fraction
resulting in the separation of several types of glycosides with differing polarity.61,62
Fig. 9 Generalized workflow for extracting compounds in preparation for LC-MS. Here, three different extraction techniques are combined to create multiple
fractions containing different compound chemistries from leaf starting material. After homogenizing samples, three different solvents are added together for
solid-liquid extraction—methanol, chloroform and water. In liquid-liquid extraction, immiscible phases are collected separately, then the aqueous methanol/water
phase is further subject to SPE prior to LC-MS. Here, different fractions are eluted from an SPE by applying solvents with decreasing polarity—water, methanol and
then acetonitrile. Modified with permission from Maia, M.; Monteiro, F.; Sebastiana, M.; Marques, A. P.; Ferreira, A. E.; Freire, A. P. et al. Metabolite Extraction for
High-Throughput FTICR-MS-Based Metabolomics of Grapevine Leaves. EuPA Open Proteom. 2016, 12, 4–9. https://doi.org/10.1016/j.euprot.2016.03.002.
Copyright (2016) Elsevier.
samples are also boiled. Percolation requires a percolator through which hot solvent is dripped at a moderate rate through dried
powdered samples.
(A) (B)
(C) (D)
Fig. 10 (A) SFE system (B) PFE system (C) MAE system (D) Graphs showing relative extraction efficiencies of various compounds (Naphthalene; F, fluorene; pH,
phenanthrene; Py, pyrene; BP, benzo(a)pyrene; IP, indeno(1,2,3-cd)pyrene) from coal fly ash (a-lignite, b-bituminous) comparing Soxhlet, SFE and PFE systems and
parameters. Modified with permission from Camel, V. Recent Extraction Techniques for Solid Matrices—Supercritical Fluid Extraction, Pressurized Fluid Extraction
and Microwave-Assisted Extraction: Their Potential and Pitfalls. Analyst 2001, 126, 1182–1193. https://doi.org/10.1039/B008243K. Copyright (2001) Royal Society
of Chemistry.
processing time, and much less consumption of solvent.81 Microbial natural products have been considered as an important origin
of antibiotics, immunosuppressive chemicals, and other potential pharmaceuticals.82 The carotenoid zeaxanthin, beta-carotene,
and fatty acids were recovered from microalgal (Synechococcus sp.) biomass using UAE with N,N-dimethylformamide as the
solvent.83
Extracts of natural products are typically complex mixtures consisting of numerous compounds each with different activities,
selectivities and other properties. As discussed earlier, conducting screens with these can be problematic for many reasons including
risk of false discovery, compound interactions or low abundance obscuring activity, and uncertainty in knowing what compound, or
combination of compounds, may be responsible for an effect. For reasons such as these, extracts are often further subjected to
a variety of separation and fractionation techniques, often creating a library, to overcome some of these limitations and enable
screening of individual compounds or smaller groups of compounds. These methods are also commonly used for isolation and
purification of natural products, and many of these are also set up inline with mass spectrometry or NMR systems to characterize
fractions and individual compounds comprising a sample.
UV, ELSD) to characterize properties of eluting compounds (e.g. mass, absorbance) and generate fractions based on time and other
properties determined by the applied separation parameters.
There are a wide range of liquid chromatography used for mass spectrometry based analysis of natural products. The most
common are described in more detail below. In general, these differ primarily in their selectivity for more hydrophobic or more
hydrophilic compounds, or possession of certain functional groups that determines specific interactions with a column chemistry.
In many cases, a single type of chromatography (e.g. stationary and mobile phase combination) may provide sufficient resolving
power for most metabolites and most sample matrices; however, a combination of different LC modes can be used to further
improve separation of metabolites or for more global profiling of natural products present in a sample.98,99
6.12.5.1.1 ESI
ESI was first introduced by Fenn in 1989 to generate intact molecular ions for MS analysis.129 In the ESI interface, eluent passes
through a sharp steel needle to which a high voltage, typically 2–6 kV, is applied.130 Charged eluent droplets are created, then
heated and evaporated to form a Taylor cone under the high electric field. Due to increased tension on the droplet surface, Coulomb
fission occurs, and droplets repeatedly decompose to much smaller droplets. Subsequently, gas phase analyte ions are released from
the droplets and carried into the MS analyzer with dry nitrogen. Typically, charged ions generated in the ESI interface are protonated
or deprotonated, [M + H]+ or [M − H]−, respectively, but can also form a range of other adducts with other elements in the solvent
including sodium, potassium, or ammonium. The ability of the MS to detect an analyte, or sensitivity, is highly dependent on the
ionization efficiency of the analyte, that is, how much of the analyte is ionized into detectable gas phase ions. For this reason, even
when two compounds are present at the same concentration in a sample, the MS signal for one may be orders of magnitude higher
than the other.
The ESI response is also affected by various factors including flow rate, solvent additives, and analyte physicochemical properties.
Excessive amounts of salt should be avoided in the mobile phase and removed if possible since this often leads to severe ion
suppression and a characteristically low MS signal.131 Modifying flow rate can also influence ionization, with one ESI MS study
showing that decreasing flow rate enhanced detection limit of peptides by 25–50 fold.132 Another strategy to improve ionization is
derivatization, or attaching a moiety to target analytes to improve ionization, usually by adding a moiety that will more easily
protonate or deprotonate compared to the original chemical structure.133 Recently, a quantitative structure-property relationship
(QSPR) study was performed that correlated compounds properties with ESI sensitivities and found that molecular volume, log P,
pKa, and surface activities most relevant to the ESI ionization efficiency.134 Another significant property influencing ionization in
ESI is a compound’s capability of being multiply charged. Here, large molecules, like proteins and peptides, can be multiply charged
due to having multiple charge carriers (amino and carboxyl groups). This charge state distribution of the analyte can provide critical
information about molecular conformation. For example, myoglobin is a common protein consisting of 153 residues, and when in
the unfolded conformation, shows a broad distribution of charge state ranging from 8+ to 21+, but in the native folded form, only
two charge states (8+ and 9+).135
6.12.5.1.2 APCI
The APCI interface is a similar ionization source to ESI, but here, high voltage is applied to a separate corona discharge needle and
the eluent is sprayed with dry nitrogen.136 In contrast to ESI, the ionization of analyte occurs in a vapor state rather than in liquid
droplets. Briefly, the analyte is first vaporized to the gas phase and the H3O+ and CH3OH+2 vapor typically serves as a proton donor.
Analytes with high ionization efficiency using APCI are typically volatile and have high gas-phase basicity.137 In a metabolomics
Mass Spectrometry for Natural Product Discovery 279
100
90
80
70
Normalized Area (%)
60
50
40
30
20
10
0
Phenolic Organic Simple Secoiridoids Flavonoids Lignans Fatty acids Pentacyclic Tocopherols Sterols
acids and acids and phenols and and and triterpenes
aldehydes coumarins derivatives derivatives derivatives
Fig. 11 Chart comparing the normalized area of different compound classes detected using various combinations of mass spectrometry sources (ESI and APCI),
mode (positive or negative) and chromatography (GC and LC). Modified under public license from Olmo-Garcia, L.; Kessler, N.; Neuweger, H.; Wendt, K.; Olmo-
Peinado, J. M.; Fernández-Gutiérrez, A.; Baessmann, C.; et al. Unravelling the Distribution of Secondary Metabolites in Olea europaea L.: Exhaustive Characterization
of Eight Olive-Tree Derived Matrices by Complementary Platforms (LC-ESI/APCI-MS and GC-APCI-MS). Molecules 2018, 23, E2419. https://doi.org/10.3390/
molecules23102419. https://creativecommons.org/licenses/by/4.0/.
study comparing ESI and APCI, APCI was found to be preferable for the ionization of carbohydrates and tartaric acid in grape
berries.138 In another study using ESI and APCI interfaces, Rebane et al. compared the ionization of 40 compounds across a wide
range of pKa and logP and found that some of the highly non-polar compounds such as pyrene and anthracene could only be
ionized using the APCI source. The data also showed that ionization efficiency was positively correlated with weighted average
negative sigma, logP and molar volume.137 In another study, a global profiling of secondary metabolites in olive trees using ESI and
APCI in both positive and negative mode (Fig. 11) found that polar metabolites, including phenolic acids, flavonoids, and
secoiridoids, demonstrated highest sensitivity in positive mode ESI, while organic acids and coumarin ionized better in negative
mode ESI.139 In comparison, nonpolar sterols and tocopherols ionized best using APCI.
6.12.5.1.3 DESI
DESI is a similar ionization method to ESI but allows for ambient sampling off solid surfaces such as mineral or biological tissue.
Here, solvent spray is emitted from a capillary that flushes analytes on the solid surface. High voltage applied to the capillary allows
direct ionization of the molecules, and these intact ions are directly transported to the MS analyzer. One advantage of DESI is that
the movement of solvent capillary across the sample surface can be programmed via computer, enabling the simultaneous
collection of spatial information of the molecules from the solid surface to create an MS image.140 DESI has been used for the
measurement of sugars, nucleotides, secondary metabolites, peptides, and protein. > 15 alkaloids were identified in situ on the
surface of plant seeds, roots, stems, leaves, and flowers without any pretreatment using DESI coupled to LTQ MS. The measured
spatial distribution of the alkaloids in the plant tissues was in agreement with the conventional extraction and MS analysis
results.141 Wiseman et al. reported an in vivo DESI imaging application on histological tissues including brain, kidney, and lung.
Concentrations of several key metabolites and the distribution of antipsychotic (clozapine) in rat organs were monitored by tracing
the intact ions of the analytes.142
6.12.5.1.4 MALDI
Matrix Assisted Laser Desorption/Ionization is a fairly mature and widely used technology.143 In MALDI-MS, analytes are
co-crystallized with a matrix material (commonly organic acid compounds) that absorbs laser light and transfers energy to the
analyte, releasing it into the gas phase in a process leading to ionization. The ionized analytes are subsequently introduced into
a mass analyzer for detection. The most commonly used mass analyzer with MALDI is time-of-flight (TOF) mass analysis, but
several other mass analyzers have been used including Fourier transform ion cyclotron resonance (FTICR), ion trap Orbitrap and
quadrupole Orbitrap analyzers.144 MALDI is a versatile ionization platform, combining relative high lateral resolution (typically
tens of microns), high detectable mass range, high speed (laser frequency up to 2 kHz), and molecular specificity.145 As MALDI can
be operated under atmospheric pressure or high vacuum, the source pressure has far-reaching influence on, for example, sample
conditions, detectability of volatile components, and molecular ion stability. The major drawback of MALDI is the interference of
matrix ions in the low mass range, which can obscure analytes <500 Da by abundant matrix cluster ions. This has motivated efforts
to develop matrix-free laser desorption ionization (LDI) approaches, including surface assisted laser desorption ionization (SALDI).
280 Mass Spectrometry for Natural Product Discovery
6.12.5.1.5 SALDI
A variety of matrix-free SALDI techniques have been developed that are based on nanoscopic structures that efficiently utilize the
laser pulse energy for the production of ions exclusively from the adsorbed analytes.146 The majority of these are based on some
form of nanostructured silicon, including desorption/ionization on silicon (DIOS), nanopost arrays (NAPA) and nanostructure-
initiator mass spectrometry (NIMS).147 While both DIOS and NAPA are based on the direct laser desorption/ionization of analytes
directly from these planar substrates, NIMS uses a unique approach to laser desorption in which a liquid “initiator” is coated onto
a nanostructured surface to aid ionization.148 Although the precise mechanism is not well understood, the laser heats the NIMS
surface, leading to the explosive vaporization of the trapped liquid coat and ionization of the adsorbed analytes on the NIMS
surface.
6.12.5.2.1 Quadrupole
One of the most popular and versatile mass analyzers is the quadrupole owing to being relatively low cost, compact, and robust, but
do not generally have very high mass accurace. These are composed of four rods in parallel creating a path through which ions flow
from an initiating source. Typically, both RF and DC fields are applied to the rods to cause oscillation of ions as they flow through.
By changing the RF and DC potentials, ions can be “filtered” by the quadrupole since some ions of particular m/z values will have
unstable trajectories and collide into the rods. Rather than mass filtration, quadrupoles can also be used as an ion transmission
guide (using RF only) to move all ions, for example, from the source into another analyzer. Also, when using only RF and varying
the applied energy, the quadrupole can be used as a collision cell and fragment ions via collision-induced dissociation (CID).
6.12.5.2.5 Orbitrap
The basic orbitrap configuration is a small inner electrode spindle between two hollow concave electrodes to which voltages are
applied. While both orbitrap and FT-ICR analyzers use Fourier transform to convert signals for measuring m/z, orbitraps use
an electric rather than magnetic field to induce oscillation of ions in a trap. Both FT-ICR and orbitraps have very high resolving
power suitable for determining elemental compositions; however, orbitrap resolving power decreases in the low mass range
(decrease proportional to square root of m/z measured) whereas FT-ICR decreases in the higher mass range (e.g. high molecular
weight proteins). Orbitrap analyzers are also advantageous in that they are relatively small and low-maintenance, and commercially
available instruments offer fast scan speeds suitable for processing hundreds to thousands of samples required for many natural
product discovery workflows.
6.12.5.2.7 IM-MS
More recently, the combination of ion mobility analyzers with mass analyzers has made a large impact on natural product discovery
efforts.151,152 Ion mobility is a type of gas-phase electrophoresis, with ions traveling through an electric field in a carrier gas, with
measured collision cross section (CCS) through the gas based upon structural properties (e.g. charge state, size and shape).
By combining accurate mass m/z measurements with ion mobility “drift time”, co-eluting isomers that do not separate using LC,
for example, may still be distinguished based on varying drift times. In natural product discovery, ion mobility can be important for
prioritization of features in which species can be seen to deviate from predicted mobility trendlines. This may indicate a compound
with unexpected physicochemical properties and a potentially new natural product. Additionally, IM is another feature to add to
traditional MS measurements of m/z, fragmentation and retention time to aid in dereplication, identification and structural
elucidation. MALDI-based systems, specifically, or others used in MS imaging applications and fast high-throughput screening,
are typically lacking LC chromatography retention time-based separation and most susceptible to simultaneous ionization of
isomers with the same m/z. These benefit greatly from integration with IM allowing separation and fragmentation of isomers, and
adding a time-dimensionality to acquired mass spectra increasing the depth of information capable of being collected.
an ion’s elution and (2) a higher likelihood of having collected associated MSMS fragmentation spectra for compound identification.
Using a longer LC column and longer LC method would allow for data collection at a rate allowing for higher mass accuracy and may
also improve separation between isomers. Unfortunately, this increases the overall runtime for each sample and may be overly time-
prohibitive when hundreds to thousands of samples need to be analyzed in some types of natural product discovery workflows. Given
these considerations, overall sample runtime needs to be balanced with quality and amount of MS and MSMS data capable of being
collected. For example, one strategy, when screening thousands of samples, could be to implement a fast LC method that collects only
MS1 data at a high acquisition rate but no MSMS, and using the MS1 data to determine which samples require follow-up with
a different LC method and MS/MSMS analysis. Or similarly, a fast LC method that collects low resolution MS1 data with low quality
peak shape but very high amounts of MSMS, and use MSMS as the main determinant of which samples require follow-up.
In collection of fragmentation spectra, a collision energy is specified to apply to either an isolated precursor ion or over a given mass
range of precursors, depending on whether the MSMS is data dependent or independent, respectively. Higher collision energies
typically result in a larger number of fragments with smaller m/z values and lower collision energies with less fragments of higher m/z
values. The fragments present at different collision energies are related to the molecule’s chemical and structural composition. These
are characteristic for individual ions leading to identification when compared to a standard, and in the absence of a standard, can also
be used to determine a molecule’s chemical and structural composition. However, sometimes a molecule fragments poorly and higher
collision energies are necessary to achieve any fragmentation at all. For this reason, when analyzing complex samples, more
information can be attained by applying a ramp or stepped collision energy combination (e.g. 10, 20, then 30 eV), and then measuring
the fragmentation from all energies in the same MSMS spectra. Although more informative to collect MSMS spectra for a compound at
each collision energy, this has the drawbacks of being time-consuming, reduces the quality of peak shape and limits the number of
other ions for which MSMS can be collected. As mentioned elsewhere in this article, fragmentation characteristics of ions are important
not only for identification, but for dereplication and feature prioritization in natural product discovery workflows.
For an isolated precursor, fragment ions are ideally generated and detected from only that compound and used for identification.
However, depending on the instrumentation, selecting a compound for isolation and fragmentation is often associated with a mass
window (e.g. 0.01–2 Da) that, if set too wide, may allow in another co-eluting ion with nearby m/z that also fragments with the
target precursor ion and lead to convoluted MSMS spectra, or if set too narrow, results in poor isolation of the precursor ion and
poor quality, low intensity MSMS spectra. This factor is something to consistently take into consideration when interpreting MSMS
spectra of complex samples.
Fragmentation may occur in either a data dependent or data independent manner, or some combination of both. With data
independent acquisition, all ions within a specified m/z range will be fragmented resulting in a multiplexed spectra from multiple
precursors. While it can be challenging to relate specific fragments to the originating precursor, the fragmentation data nevertheless
is unbiased and will be collected for all eluting ions, especially those that may be missed in a data dependent acquisition scheme.
Deconvolution of this spectra may include assigning fragments to precursors with matching elution patterns, for example. Or, in
some cases, specific fragment m/z values can be indicative of a particular class of compounds (e.g. 184 for phosphocholine lipids).
Data dependent acquisition occurs in two main formats. In a “targeted” format, specific ions will be fragmented from a pre-
determined list comprised of attributes including retention time, m/z, minimum intensity, and polarity, for example. In an
“untargeted” format, fragmentation spectra will be acquired based on MS1 spectra, with the goal often to be to acquire MSMS
data on as many compounds as possible without compromising MS1 data quality. Typical implementations are based on
fragmenting, for example, the x most intense ions detected via MS1, and after fragmenting them, adding these to an exclusion
list from fragmentation for y amount of time, and then continuously fragmenting the x most intense ions in MS1 as long as they are
not currently in the exclusion list. This results in not fragmenting the same ion over and over, and provides the opportunity for less
intense ions to also get fragmented. The main advantage of this type of data dependent acquisition is that in a single run,
fragmentation spectra can be obtained for a large number of individual ions for samples with unknown composition. However,
as discussed previously, obtaining higher amounts of MSMS data to include even the low abundance ions is associated with the
trade-offs of having to increase sample runtime, or lower quality peak shape, or lower mass accuracy MS1 measurements overall.
More recently, instrumentation software has been developed to improve MSMS coverage where ions fragmented in one injection are
then added to an exclusion list for the following injection, which is a re-injection of the same sample. This process can be repeated
until essentially all ions, including low abundance, are fragmented. Numerous other types of data dependent acquisition schemes
are possible and can be tailored to experimental design requirements.153,154
This is just a brief overview of some of the factors to consider when setting up the specific data acquisition scheme in a natural
product discovery workflow. More extensive literature discussing these concepts and more are available.155–157 Specific instrument
utilized (and available software) will be a primary factor determining the capabilities and MSMS options available for data
acquisition.
OCH3
O
HO O
H3CO OH OH
OH O
H3CO
HO O
O O OH O O O
O
O NH
H3CO
O O N O H3CO
O O OH
HOOC O
O
Rapamycin O
OH Rifamycin S
Monensin A Polyketide-peptide
Polyether antibiotic Ansamycin antibiotic immunosuppressant
OCH3
HO
O OCH3
O
O
O O
HO O O
OH
OH
O OH O
O O O
HO O OH OH OH O OH
COOH O
O
O O
OH OH
NH2
OH
Amphotericin B Lovastatin Avermectin B1b
Polyene antifungal Cholesterol-lowering agent Macrolide antiparasitic
HO OH (H3C)2N O
O OCH3
S N(CH3)2
OH O
HO OCH3
O OCH3
HO
N O O
O O
O
O
O O OCH3
O
O OH O
O OH
Epothilone
Polyketide-polypeptide Erythromycin A Spinosyn A
anticancer agent Macrolide antibiotic Macrolide insecticide
Fig. 12 Structural and bioactivity diversity of polyketides. Modified with permission from Weissman, K. J.; Leadlay, P. F. Combinatorial Biosynthesis of Reduced
Polyketides. Nat. Rev. Microbiol. 2005, 3, 925–936. https://doi.org/10.1038/nrmicro1287. Copyright (2005) Spring Nature.
applications such as antibiotics, anticancer drugs and immunosuppressants.159 Polyketides are synthesized by condensation of
acetyl units catalyzed by a multi-domain enzyme, polyketide synthase (PKS), which carries out varying degrees of keto reduction,
dehydration, enoyl reduction and cyclization reactions, followed by post-PKS tailoring reactions.159 The broad substrate specificity
of PKSs for starter and extender units, along with the variety of tailoring of reactions, results in a large diversity of chemical structures
and bioactivities (Fig. 12).159a Mass spectrometry has proven to be a useful tool in the discovery and structural characterization of
284 Mass Spectrometry for Natural Product Discovery
novel polyketides.160 PKSs have been utilized in medicinal chemistry for production of novel polyketides using non-natural
substrates and combinatorial biosynthesis strategies where mass spectrometry plays a vital role in rapid identification of novel
products.161,162 Further, while genome mining may identify novel biosynthetic clusters of polyketides, detection of their products
under natural conditions may be challenging due to the low level expression of biosynthetic genes. Thus study designs incorporating
condition screening and mass spectrometry have been beneficial for discovery of novel, natural scaffolds, such as Streptomyces-
produced formicamycins, linearolides and their analogs.163,164 Another challenge with polyketide identification is the vast array of
structural isomers, sometimes produced by a single organism. In this case, tandem mass spectrometry is a robust technique for
comparison of fragmentation patterns between these types of molecules; for example, the identification of two polyketide isomers
(isolasalocid A and lasalocid A) was achieved using MS3 in high resolution Fourier transform ion cyclotron resonance MS with
electron-induced dissociation and collision-activated dissociation.165 Stereochemical isomers, also common in polyketides,166,167
present an additional challenge that can be addressed by mass spectrometry with the combined use of chiral chromatography
and/or derivatization.
6.12.6.2 Glycosides
Glycosides present another diverse class of secondary metabolites, where a sugar moiety is linked to an aglycone molecule by an O-,
S-, N- or C-glycosidic bond.168–170 In some molecules, the glycosidic residue is necessary for bioactivity171 whereas in others, the
glycosides function as storage molecules from which hydrolysis releases an active aglycone172 or a cytotoxic defense chemical such
as hydrogen cyanide.173,174 Commonly, glycosides are classified by the aglycone residue, such as phenolic glycosides, flavonoid
glycosides, etc. or by bioactivity, such as cardiac glycosides, antibiotic glycosides.171 Glycosidic residues typically increase the
aqueous solubility, may alter bioavailability and are thought to control transport of the aglycone in tissues.171 In a recent review of
16,000 natural products from bacteria attempting to cover all currently known natural products, approximately 20% were
glycosylated, thus representing a significant source for discovery of novel therapeutics (Fig. 13).175 Due to the presence of common
residues in glycosides, neutral loss scanning of MS/MS and product ion scanning are logical analytical approaches, both of which
have been successfully applied in glycoside analysis.176,177 The issue of lability variability in complex mixtures of glycosides may be
addressed with collision energy gradients, thus improving structural information and coverage when screening samples with
a variety of classes of glycosides.178 Fragmentation information can elucidate: glycosidic bond types between sugar moieties and
to the aglycone, type and order of monosaccharides, and length of polysaccharide chains.179 For example, fragmentation pathways
were used for the structural identification of cyanogenic glucosides from Phyllagathis rotundifolia.180 Hydrolysis of the glycosidic
bond, prior to analysis, may also provide better structural information due to the ability to selectively fragment the aglycone moiety;
this technique yielded the identification of six cardiac glycosides including two novel compounds from yellow oleander seed.181
A systematic nomenclature for carbohydrate fragmentation has been developed.182 Consistency in the labeling and identification of
fragments has improved the ability to assign partial or complete identities to detected glycosides based fragments reported in prior
literature.177,180
6.12.6.4 Alkaloids
Alkaloids are a class of chemical compounds that contain nitrogen in a ring structure, such as isoquinoline and piperidine, and
typically derived from amino acids. They are widely distributed in various plants, animals, and microorganisms, and are bioactive
even in extremely low doses.192 Alkaloids have a long history of being used as drugs. Examples include vinblastine and vincristine,
which can be used as an anticancer agent by prohibiting the assembly of microtubules.193 Most alkaloids can be easily dissolved in
organic solvents rather than in water. They are thermally stable and can be easily extracted using conventional techniques with
methanol or ethanol.194,195 Six alkaloids which can inhibit acetylcholinesterase activities were extracted using ethanol from the
Mass Spectrometry for Natural Product Discovery 285
Fig. 13 Neutral loss and fragment analysis for identification and structural elucidation of glycosides. Modified with permission from Barnaba, C.; Dellacassa, E.;
Nicolini, G.; Nardin, T.; Serra, M.; Larcher, R. Non-targeted Glycosidic Profiling of International Wines Using Neutral Loss-High Resolution Mass Spectrometry.
J. Chromatogr. A 2018, 1557, 75–89. https://doi.org/10.1016/j.chroma.2018.05.008. Copyright (2018) Elsevier. Representing the carbohydrate fragmentation
nomenclature as defined by Domon and Costello.182
stems of a native Brazilian plant.195 Although alkaloids are typically weakly basic molecules, they are very diverse with their
polarities. Thus, RPLC, NPLC, HILIC, or IEX have been all successfully applied to the separation of alkaloids depending on the
physicochemical properties of the target compounds. RPLC can be performed under either acidic pH by using formic acid as a buffer
or under basic pH with ammonium.48,196 Alkaloids are mainly in ionic form under lower pH, which may significantly decrease the
retention time and better sharp peaks can be expected.197 Bagheri et al. tested the separation performance of five opium alkaloids on
HILIC columns with five different stationary phases, that is, bare silica, ZIC-HILIC, amino-, diol-, and cyano-using acetonitrile and
water with 60 mM ammonium acetate as mobile phase. Longest retention of the five alkaloids was obtained on the cyano-HILIC
column, but also led to broad peak widths and low peak heights. Best chromatographic responses (signal to noise ratio) was found
on diol-HILIC column and demonstrated a LOD of 13 pg (Fig. 14).198 Isolation of psilocin, psilocybin, morphine, codeine, and
thebaine was achieved using ZIC-HILIC with acidic pH.199,200 IEX is an ideal tool for separating highly polar compounds, and
an orthogonal 2D-HPLC consisting of RPLC and IEX was proposed recently to purify hydrophobic alkaloids and highly polar
alkaloids. This 2D analytical system demonstrated a high degree of orthogonality and the IEX effectively removed the non-basic
coeluted components in the samples. In total, nearly 20 alkaloids were identified from Scopolia tangutica Maxim extracts, including
one novel alkaloid compound. Although the mobile phase may not be ideal for MS detection due to the presence of sodium
phosphate, the 2D-HPLC proved to be an emerging technique for isolation of alkaloids (Fig. 15).201
286 Mass Spectrometry for Natural Product Discovery
8000
2
3
7000 2
1
4
5 E
6000
2 3
5000 1 4
mAU
5 D
2
4000
2
3
1
3000 4 5 C
3
2000
1
4
5
B
1
1000
4 5 A
0 2 4 6 8 10 12 14 16 18 20 22 24 26
Minutes
Fig. 14 Chromatographic separation of opium alkaloids using HILIC columns with five different types of stationary phase: (A) bare silica, (B) ZIC-HILIC, (C) amino,
(D) diol, and (E) cyano. Although elution order remained the same for noscapine, papaverine, thebaine, codeine and morphine, stationary phase chemistry changed
retention time and amount of separation between compounds. Modified with permission from Bagheri, M.; Taheri, M.; Farhadpour, M.; Rezadoost, H.;
Ghassempour, A.; Aboul-Enein, H. Y. Evaluation of Hydrophilic Interaction Liquid Chromatography Stationary Phases for Analysis of Opium Alkaloids. J. Chromatogr.
A 2017, 1511, 77–84. https://doi.org/10.1016/j.chroma.2017.06.072. Copyright (2017) Elsevier.
0.5
0.4
0.3
0.2
0.1
0
0 50
2 45
4 40
6 35
30
8 25
10 20 CX
XCha 12 ge S
rge C
18 14 10
15
XChar
16 5
18 0
Fig. 15 HPLC-UV chromatography of alkaloids separating by a 2D-C18 RPLC/SCX system. Modified with permission from Long, Z.; Guo, Z.; Xue, X.; Zhang, X.;
Liang, X. Two-Dimensional Strong Cation Exchange/Positively Charged Reversed-Phase Liquid Chromatography for Alkaloid Analysis and Purification. J. Sep. Sci.
2013, 36, 3845–3852. https://doi.org/10.1002/jssc.201300863. Copyright (2013) John Wiley and Sons, Inc.
Mass Spectrometry for Natural Product Discovery 287
6.12.6.5 Flavonoids
Flavonoids, while structurally diverse, contain a conserved core, the phenyl benzodihydropyran (flavan) as shown in Fig. 16A where
R groups are commonly hydroxy, methoxy, prenyl or glycosyl groups.76 Flavonoids are pigments produced in plants and function
as visual cues for pollination, as chemical deterrents to herbivory, and as UV-protectants.76 They exhibit pharmacological desirable
properties including anticancer, antioxidant, anti-inflammatory, and anti-bacterial activity.203 A range of liquid and solid phase
extraction techniques have been reported and reviewed in the literature.76,204 Flavonoid analysis from crude extracts may require
hydrolysis release free flavonoids from their glycosylated conjugates.76,205 Additionally, both reverse and normal phase chroma-
tographies have been developed for the separation and retention of flavonoids.204,206,207 Flavonoids absorb UV-vis light in two
bands: Band II between 220 and 280 nm is due to the benzoyl group (A ring) and Band I between 300 and 450 is due to the
cinnamoyl group (B and C rings) thus making incorporation of a spectrophotometer a useful tool for improving detection and
identification.204,208,209 In mass spectrometry, both APCI and ESI are commonly used ionization methods.210 A nomenclature for
flavonoid product ions produced from C-ring fragmentation events was developed by Ma et al., in 1997 (Fig. 16B),202 with
additional aglycone fragmentation events described in detail,211 while the nomenclature of flavonoid glycoconjugates follows the
conventions of Domon and Castello182 and has been expanded upon by Cavaliere et al.212 In fragmentation of aglycone flavonoids,
loss of functional groups and retro-Diels Alder reactions produce unique fragments that are useful in identification of either the
flavonoid or class of flavonoid.204 Additionally, flavonoid glycosides, produce unique fragments due to rearrangement events,
glycoside residues with predicted fragments, and sometimes depend on the type of aglycone at the flavonoid core.213 Recently,
a database of predicted fragments from approximately 7000 know flavonoid structures along with an annotation tool, Flavonoid-
Search, has been developed for annotation of flavonoids by Akimoto et al.214
6.12.6.6 Terpenoids/Terpenes
Terpenes are organic compounds consisting of isoprene, a five-carbon building block. A monoterpene is a cyclic molecule
composed of two isoprene units. Terpenoids are a class of secondary metabolites derived from terpenes with multiple cyclic groups
and oxygen. Both are derived from isopentenyl diphosphate and dimethylallyl diphosphate. Volatile terpenes are believed to play
critical roles in plant defense.215 Moreover, terpenes and terpenoids can be used as flavors, drugs, and fragrances. Terpenes are
typically volatile, while terpenoids may be non-volatile or semi-volatile as they normally contain other polar moieties.216 The
extraction and analysis protocols of terpenoids and terpenes largely depend on their polarities. Volatile or non-polar terpenes are
best extracted with hydrophobic solvent and further measured with GC-MS, but polar terpenoids are suggested to be analyzed with
LC-MS (Fig. 17).216 Unlike alkaloids and flavonoids, hexane is the major solvent used in the terpenes extraction. Terpenes have been
quantified from woods and leave hexane extracts using GC-MS in several studies, and liquid alkane hydrocarbon (dodecane,
pentadecane, and hexadecane) is utilized as an internal standard for quantification.217–219 An offline combination of C18 SPE and
solid phase micro extraction was developed to enable detection of low abundant terpene in the polar fraction.220 Sun et al. reported
a highly sensitive LC-MS based analytical method to quantify the bioactive polar terpenoid lactones in Ginkgo biloba tablets, in
which LOD is 35 fmol for Ginkgolide A and 4.9 fmol for Bilobalide.221 Polar terpenoids can also be identified using similar
analytical methods like flavonoids, for example, C8 or C18 RPLC-ESI-MS under acidic pH.47,222
Rather than going through the complex process of extraction and separation, mass spectrometry imaging (MSI) techniques have
been developed allowing direct examination of the spatial distribution of biological molecules within a material and even construct
two-dimensional (2D) and three-dimensional (3D) visualizations of these. MSI was popularized by Caprioli in the 1990s when he
demonstrated the spatial mapping of m/z values, using a MALDI MS system, to generate ion images of molecules comprising various
mammalian tissue sections.223 Since then, MSI has been used extensively to study the spatial distribution of biomolecules within
tissues, in microbial colonies, in chemical arrays, and other biological media, including the discovery of novel natural
products.147,224–226
MSI enables the analysis of low abundance molecules that may be obscured in bulk analyses and can correlate ions with
observed phenotypes, which may hint at the function of the compound. With MSI, mass spectra are obtained at discrete spatial
points, usually in a raster format, with a mass spectrum measured at each coordinate. From the collected mass spectra an ion image
can be created by calculating the cumulative or maximum intensity over a selection of ions for each position in the dataset, where for
example, each ion is represented by a unique color. MSI datasets can be analyzed by a wide variety of software tools, the choice of
which is determined by the nature of the research. An up-to-date overview of the most common tools for MSI data analysis has been
developed.227 Since MSI datasets are typically quite large, specialized software tools have been developed to enable visualization
and analysis. For example, OpenMSI is a powerful tool that allows for not only the storage of raw MS imaging data, but a web-based
interface with access to high-performance computing resources to efficiently visualize and analyze MSI datasets.228 Recently,
a complementary resource was developed, OpenMSI Arrayed Analysis Toolkit (OMAAT), comprised of a software package specific
for analyzing large arrays of samples acquired as an MS image.229 Together, these provide a mechanism to uncover novel natural
products in materials as well as rapidly generate and screen metabolite profiles in high-throughput for natural product discovery.
288 Mass Spectrometry for Natural Product Discovery
(A)
(B)
(C)
Fig. 16 (A) Common classes of flavonoids and representative compounds with R group substitutions indicated. (B) Mass spectrometry fragmentation in flavonoids.
An example retro-Diels-Alder reaction producing the 1,3A+ and 1,3B+ fragments commonly seen in flavonoids. (C) Additional C-ring fragmentation events
produce additional unique fragments to flavonoids. (A)–(C) modified with permission from de Rijke, E.; Out, P.; Niessen, W. M.; Ariese, F.; Gooijer, C.;
Udo, A. T. Analytical Separation and Detection Methods for Flavonoids. J. Chromatogr. A 2006, 1112, 31–63. https://doi.org/10.1016/j.chroma.2006.01.019.
Copyright (2006) Elsevier. Based on the nomenclature set forth in Ma et al.202
Mass Spectrometry for Natural Product Discovery 289
Fig. 17 A schematic representation of key terpenes and terpenoids differed in volatility and polarity. With increasing of polarity and decreasing of volatility,
compounds are preferably analyzed with LC-MS. Modified under public license from Jiang, Z.; Kempinski, C.; Chappell, J. Extraction and Analysis of Terpenes/
Terpenoids. Curr. Protoc. Plant Biol. 2016, 1, 345–358. https://doi.org/10.1002/cppb.20024. https://creativecommons.org/licenses/by/4.0/.
MSI has become more accessible and increasingly popular in the last 2 decades with the emergence of a large number of
ionization techniques and new sample preparation strategies. A wide array of MSI platforms are now available for the detection of
specific chemical compounds. Each MSI platform has unique benefits and drawbacks including variations in analysis time, what
sample types are amenable to imaging, and a wide range of sample preparation methods. The following section provides
an overview of MSI techniques with a highlight on applications geared towards analyzing natural products. These are summarized
in Table 2.
Table 2 Highlights from published literature showing the application of MSI for the analysis of natural products.
Desorption electrospray ionization (DESI) Microbial colonies and Rhamnolipids, siderophore [230]
fungi
Algae Bromophycolides, and callophycoic acids [231]
(diterpenoids)
Plants (seed and fruit) Chromone alkaloid [232]
nanoDESI Microbial colonies NRPS [233]
yeast, fungi and bacteria Phenazines, quinolones, rhamnolipids, cyclic [234]
lipopeptides
Laser ablation electrospray ionization (LAESI) Microbial colonies Antibiotics (Streptomycin) [235]
Flower petals Anthocyanins, polyphenols [236]
Laser ablation atmospheric pressure photoionization (LAAPPI) Plant leaves Terpenes and terpenoids [237]
Matrix-assisted laser desorption/ionization (MALDI) Fungal and bacterial Actinomycins, macrolides [238]
colonies
Plants (root/stem/leaf/ Glucosinolates, polyphenols, flavonoids among [145]
flower/fruit/seed) others
Plant leaves Sesquiterpene lactones [239]
Laser desorption ionization (LDI) Leaves and pollen Naphthodianthrones (hypericin and [240]
pseudohypericin) and flavonoids
Roots and nematodes Phenylphenalenones [241]
Symbiotic bacteria on Antibiotics (streptochlorin and piericidin [242]
insect cocoons derivatives)
Desorption/ionization on silicon (DIOS) and nanowire-assisted laser Marine molluscs Brominated indoles [243]
desorption ionization (NALDI)
Nanostructure-initiator mass spectrometry (NIMS) Tissues, bacterial colonies Secondary metabolites [244,245]
Cation-enhanced NIMS Plant (stems) Sugars [246]
Secondary-ion mass spectrometry (SIMS) Bacterial colonies Surfactins [247]
Bacterial colonies Pigmented antibiotics [248]
Bacterial biofilms Alkyl-quinolone and alkyl-quinoline N-oxide [249]
signaling molecules
Bacteria colonizing roots Cyclic lipopeptides [250]
The highest resolution imaging approaches use secondary ion mass spectrometry (SIMS), which is based on the rastering of
a focused ion beam. This primary beam bombards the sample, ejecting and ionizing molecules from the sample surface that are
subsequently analyzed in a mass spectrometer.253 As SIMS is operated at high vacuum, samples must be dry and vacuum stable.
SIMS has been applied to visualize secondary metabolites in bacterial colonies, tissues and insects.249,253
Laser desorption ionization is another common technique used in MSI platforms with MALDI being the most widely used.
As previously mentioned, one challenge with MALDI is that abundant matrix ions (<500 Da) can interfere with analysis. However,
due to advances in laser beam technology and application of novel matrices, matrix interference can often be minimized.
Commonly, MALDI-MSI will be performed under high vacuum; however, applications of MALDI at atmospheric pressure have
been reported.144,254 MALDI has been used to image natural products in microorganisms, plants and animal/human
tissues.144,145,255
Other laser desorption ionization (LDI) approaches typically use the laser to directly ablate compounds from the sample surface.
Unlike MALDI, LDI does not require an externally applied chemical matrix and circumvents the issues associated with matrix. Direct
LDI can be performed when the molecules of interest contain chromophores that are capable of absorbing the wavelength of energy
produced by the laser, to assist in the desorption/ionization process.226 As many natural products contain conjugated double-bond
systems like aromatic/heteroaromatic rings and show strong UV absorption at 337 or 355 nm; both levels are emitted by the most
common UV lasers.252 Using LDI, antibiotics, toxins and pigments have been imaged in microorganisms and plants.226,252
A wide-range of matrix-free, surface assisted laser desorption/ionization (SALDI) approaches can be used for MSI. Major SALDI
approaches include NIMS, described earlier, desorption/ionization on silicon (DIOS), NAPA and nanowire-assisted laser desorp-
tion ionization (NALDI). The majority of these approaches are based on the direct laser desorption/ionization of compounds
directly from nanostructured substrates.10 All these SALDI platforms have been applied to study secondary metabolites in plants,
tissues, biological fluids and marine snails.146,148,243
where studies have shown interactions through direct contact and communication via various secondary metabolites, or natural
products. For example, compounds such as acyl-homoserine lactones (AHLs), oligopeptides and autoinducers have been elucidated
as fundamental for processes such as quorum sensing, quorum quenching, kin selection and pathogenicity.256 MSI has proven
a powerful tool for exploring in situ the production and discovery of new natural products by interacting organisms through spatial
mapping of detected ions.
Co-cultivation of different microorganisms in close proximity on a solid surface allows for either direct or indirect sampling by
MSI platforms. Ambient ionization based MS techniques, including DESI, nanoDESI and LAESI, can be used to probe the
microorganisms and their environment directly without any disturbances (e.g. directly from cultures on agar). MS techniques
operated under vacuum, such as MALDI, SIMS and SALDI, typically require sample desiccation prior to MSI analysis. Alternatively,
MSI can be performed indirectly by imprinting or blotting analytes onto a MS compatible surface.245,257
Traxler et al. used both nanoDESI and MALDI MSI to investigate the chemical interactions of the model bacterium Streptomyces
coelicolor with five other actinomycetes.258 In each interaction, the majority of secreted compounds associated with S. coelicolor
colonies were unique, including an extended suite of at least 12 different desferrioxamines with acyl side chains of various lengths.
The production of these natural products was triggered by siderophores made by neighboring strains. In another MSI study using
DESI MSI on seeds, the alkaloid rohitukine could be traced both spatially and temporally throughout seed development, including
modified forms of the compound (Fig. 18A).232 An example of an indirect MSI technique is “replica-extraction-transfer” NIMS,
where a solvent-laden gel is first used to extract metabolites from a microbial sample, such as a biofilm or agar culture, and then the
metabolites are replica “stamped” onto the NIMS surface.245 This approach was used to analyze the metabolite composition of
interacting Shewanella oneidensis and Pseudomonas stutzeri colonies (Fig. 18B).
Fig. 18 Application of MSI. (A) Ion images, specified by m/z, of D. binectariferum seeds during different developmental stages acquired using DESI MS. The ion m/
z 306.2 corresponds to rohitukine, a chromone alkaloid. Figure adapted from Mohana Kumara et al.232 (B) Schematic of a typical REX-NIMS workflow (left), and ion
images comprised of different m/z, indicated by color, of interactions occurring in a microbial co-culture on agar (right). (A) Modified with permission from
Kumara, P. M.; Srimany, A.; Ravikanth, G.; Shaanker, R. U.; Pradeep, T. Ambient Ionization Mass Spectrometry Imaging of Rohitukine, a Chromone Anti-Cancer
Alkaloid, During Seed Development in Dysoxylum Binectariferum Hook.f (Meliaceae). Phytochemistry 2015, 116, 104–110. https://doi.org/10.1016/j.phytochem.
2015.02.031. Copyright (2015) Elsevier. (B) Modified and reprinted with permission from Louie, K. B.; Bowen, B. P.; Cheng, X.; Berleman, J. E.; Chakraborty, R.;
Deutschbauer, A.; et al. ‘Replica-Extraction-Transfer’ Nanostructure-Initiator Mass Spectrometry Imaging of Acoustically Printed Bacteria. Anal. Chem. 2013, 85,
10856–10862. https://doi.org/10.1021/ac402240q. Copyright (2013) American Chemical Society.
292 Mass Spectrometry for Natural Product Discovery
Medicinal use of natural products has been documented as early as the 30th century BCE.262 However, natural products chemistry
rapidly developed with the introduction of modern extraction, purification and analytical approaches such as mass spectrometry
and NMR.263 In the past decade, there has been increasing use of combinatorial libraries and synthesis, but natural products
research still plays a major role in the discovery of novel therapeutic drugs, a process which commonly relies on ethnobotanical
studies, discovery of novel taxonomies and high-throughput screening of microbial/plant/fungal natural products
collections.264,265
Central to the discovery of novel active compounds is the accurate identification of compounds. While mass spectrometry
provides a robust tool for achieving this goal, often complementary methods such as NMR, IR spectroscopy, X-ray diffraction and
crystallography may be used in conjunction to confirm molecular structure, formula, chirality and other chemical properties of
novel compounds. However, in many cases, the molecular formula and either partial or full structural details can be deduced using
only chromatography with mass spectrometry. A stepwise approach employing dereplication strategies and statistical analysis can
be employed to reduce the total number of possible candidates for identification so that the focus remains on identification of novel
natural products (Fig. 19).
Fig. 19 Steps for the identification of novel natural products after dereplication of knowns.
Mass Spectrometry for Natural Product Discovery 293
Fig. 20 Workflow for analysis of biological materials for detection of novel natural products. When possible, incorporation of activity screening, fractionation and
heterologous expression will improve identification capabilities.
294 Mass Spectrometry for Natural Product Discovery
as well as precursors that would inform steps during the assembly process. Using a mass spectrometry instrument with high
resolving power, the products can be observed with a mass difference between the 13C and 12C isotopes (which is n 1.003355 Da
where n is the number of atoms that have been substituted). In another stable isotope labeling application, labeling can be
performed with deuterated solvents to aid in compound identification.272,273 Here, labile hydrogens (e.g. those that can rapidly
exchange with the solvent) equilibrate with a deuterated solvent. A molecule’s detected m/z will shift by the number of labile
hydrogens multiplied by the mass of a hydrogen neutron. Combined with fragmentation studies, both of these approaches can yield
rich insight into both biosynthetic mechanisms and compound identity.
Fig. 21 Metabolite identification strategies. The knowledge of molecular identification between the researcher and the literature as represented in a Johari
window.
Mass Spectrometry for Natural Product Discovery 295
Golm Pure reference standards, also contains Free, online: 29,590 spectra (EI-MS, March 25, http://gmd. [276]
Metabolome metabolomics profiling data of plants/ 2019) mpimp-golm.
Database cyanobacteria mpg.de/
(GMD)
AntiBase— Focused on antimicrobials Commercial: 41,048 MS records (May 2017, https://www. [277]
Wiley ISBN:978-3-527-34359-1) wiley.com/
Spectral Mainly commercial chemical reagents 25,497 MS spectra (March 25, 2019) https://sdbs.db. [278]
Database for aist.go.jp
Organic
Compounds
(SDBS)
GNPS Third-party libraries (e.g. MassBank), spectral Free, online: 72941 spectra (March 25, 2019) https://gnps. [4]
libraries created for GNPS and spectra from the 221,000 MS/MS reference library spectra from ucsd.edu
NP community 18,163 compounds and 8853 MS/MS community
spectra from 5568 unique compounds
ReSpect and Phytochemicals from literature reports and Online (9017 spectra, Dec 14, 2018) http://spectra. [279]
AtMetExpress authentic standards psc.riken.jp/
Human Human metabolites from chemical library (HMDB) Online, free: 318,249 predicted mass spectra; http://www. [280]
Metabolome and third-party spectral databases with 29,665 experimental mass spectra (April 22, 2019) hmdb.ca/
Database experimentally collected, high-resolution MS/MS
(HMDB) and EI spectra
MassBank General small molecules related to life sciences 52,819 spectra, April 29, 2019 http://www. [281]
submitted by community contributors massbank.jp/
MassBank of Spectra from experimental and in-silico libraries, 271,335 spectra, from 77,409 compounds (April 28, http://mona. NA
North America and user contributions 2019) fiehnlab.
(MoNA) ucdavis.edu/
Metlin lipids, amino acids, carbohydrates, toxins, small Experimental spectra from >20,000 compounds and https://metlin. [282,283]
peptides, and natural products, and other small their isotopologues, in-silico spectra additional scripps.edu/
molecule classes known molecular structures contained in the
database(ref )
Spektraris Plant natural compounds Spectra from 500 compounds (May 01, 2019) http:// [284]
langelabtools.
wsu.edu/amt/
Yeast Yeast metabolites 50,598 MS spectra (includes experimental and http://www. [285]
Metabolome predicted MS/MS) from 15,979 compounds (May ymdb.ca/
01, 2019)
Fiehn GCMS Commercial libraries through Agilent and Leco
mzCloud Drugs, cosmetics and commercial chemicals, 4,739,606 spectra from 8779 compounds (includes https://www. NA
endogenous metabolites, natural products and in-silico, experimental spectra) mzcloud.org/
toxins
DrugBank Collection of drug related metabolites, spectrum EI spectra https://www. [286]
data from SDBS and NIST drugbank.ca/
E. coli E. coli metabolites 19,294 NMR and MS spectra (experimental and http://ecmdb.ca/ [287]
Metabolome predicted) for 3098 compounds
Database
(ECMDB)
LIPID MAPS and Lipids—standards, reported in the literature and 30,000 structures, a subset have MS/MS and https://www. [288,289]
LipidFinder computationally generated structures in silico predictions available lipidmaps.org/
NIST Reference standards Commercial: SRD spectra from 267,376 compounds https://www. [290,291]
(EI MS) and 13,808 compounds (MS/MS). Free: nist.gov/srd,
Mass spectra for >33,000 compounds (May 02, https://
2019) webbook.nist.
gov/
interest, as this will reduce the number of features of interest requiring investigation in downstream steps when attempting
identification of unknown unknowns.
A robust dereplication strategy uses a combination of targeted analysis (known knowns) and semi-targeted analysis (knowns
unknowns). A number of online databases and tools are available to support these steps (Table 3) and provide mass spectral
information. Many have significant overlap, either due to automated aggregation from other databases, or duplicate analysis of
compounds.292 For additional databases, including NMR spectral databases, see also Johnson and Lange.293 Chemical and pathway
296 Mass Spectrometry for Natural Product Discovery
databases such as PubChem (http://pubchem.ncbi.nlm.nih.gov/), Super Natural II,294 Dictionary of Natural Products (http://dnp.
chemnetbase.com), MarinLit (http://pubs.rsc.org/marinlit/), MetaCyc,295 and KEGG296 are useful tools for interpretation of natural
product mass spectra results. Additionally, numerous natural products repositories/collections are available online: e.g. https://dtp.
cancer.gov/organization/npb/introduction.htm.
For a recent review of metabolomics analysis tools and resources, see Misra and Mohapatra297 and for a recent review of
metabolomics analysis tools and resources, see Misra and Mohapatra.297
A targeted analysis is used to annotate expected or commonly detected metabolites that are produced by the organism(s) using
metabolite atlases.298,299 Pure reference standards analyzed using the same methods should be used for comparison of adducts,
retention times and fragmentation spectra with sample data. Next, features that are known in the literature but are not known or
present in the researchers own library (known unknowns) are annotated.300,301 Exact mass measurements, collected from high
resolution accurate mass instruments, compared with chemical databases produces a list of possible structures.274 Orthogonal
measurements such as retention time and MS/MS can be used to confirm identification. Retention order prediction302 and retention
indices303 are used to improve confidence of an identification. Comparison of measured sample MS/MS with spectral data-
bases293,304 allows the researcher to predict the most likely structures but may not differentiate between isomers. Comparison of
fragmentation spectra using molecular networking for identification of known compounds additionally provides information of
related analogues that may be useful for the researcher to include in future structural searches (Fig. 22).305 In silico fragmenta-
tion306,307 software is used to predict MS/MS spectra for a predicted structure. Lastly, fragmentation trees308,309 and evaluation of
neutral losses, isotope abundance ratios, and adducts are useful for piecing together substructures and more accurately predicting
identifications. These techniques are also used in the identification of unknown unknowns.
Often, along with the discovery of a novel natural product, the corresponding genes or biosynthetic cluster is reported.322 Following
from the central dogma of biology, enzyme complexes made from proteins and encoded by genes catalyze reactions that transform
metabolites including natural products. Thus, natural products are connected to genes via reactions and in some cases conserved
pathways.323,324 This linkage can facilitate integration of transcriptomics data, lead to expression of the biosynthetic cluster in
another organism, and allow the broad propagation of the learned function by sequence homology.325 In some cases, the mapping
between an observed natural product and the genes responsible for its synthesis is straightforward, but more often it is a laborious
process involving an iterative combination of genetic manipulation, screening, and bioinformatic techniques. Some of the
computational techniques pursue this challenge in a “gene-forward” manner: predict gene function and consequently the products.
Others pursue this challenge in a “compound-forward” manner: predict reactions necessary for synthesizing a product and search
Mass Spectrometry for Natural Product Discovery 297
Fig. 22 Molecular networking combined with MS/MS spectra used to identify and dereplicate two natural products in the same family as the known “seed”
compound carmabin A. Reprinted with permission from Yang, J. Y.; Sanchez, L. M.; Rath, C. M.; Liu, X.; Boudreau, P. D.; Bruns, N.; et al. Molecular Networking
as a Dereplication Strategy. J. Nat. Prod. 2013, 76, 1686–1699. https://doi.org/10.1021/np400413s. Copyright (2013) American Chemical Society.
298 Mass Spectrometry for Natural Product Discovery
+ +
O
-\+
++ O OH + + +
O
-\+ + +++ ++
HO
HO +
+
O
+++ OH
HO + + +
Metabolomics Data Reactions with Reactions catalyzed
Compound structures MAGI algorithm Enzyme Gene Sequence
(Mass Spectrum, m/z, nuetral mass) detected compound by enzyme
Fig. 23 This workflow is described by Erbilgin et al. specific to the Metabolite, Annotation and Gene Integration (MAGI) workflow, but other algorithms that link
metabolites to genes operate with a similar overall architecture. With MAGI, metabolomics data is associated with compounds and consequent reactions and genes
(compound forward) or genes are associated with reactions and consequent expected metabolites (gene forward). Algorithms like MAGI attempt to integrate and
reconcile these two approaches. Reprinted with permission from Erbilgin, O.; Ruebel, O.; Louie, K. B.; Trinh, M.; Raad, M. D.; Wildish, T.; et al. MAGI: A Method for
Metabolite Annotation and Gene Integration. ACS Chem. Biol. 2019. https://doi.org/10.1021/acschembio.8b01107. Copyright (2019) American Chemical Society.
for enzymes capable of performing those reactions. In some cases, simultaneous gene and compound scoring is performed
(Fig. 23).325a
The total diversity of natural products is likely much larger than the current listings in chemical databases. Yet, there are many
times that a sample will contain expected, well characterized natural products and several tools are available for mapping these
metabolites to genes and pathways. Unfortunately, many natural products are typically not easily integrated in this way because of
poor gene annotations and the lack of relevant reactions in databases.326,327 Since reactions serve as the pivotal connection between
metabolites and genes, these issues severely limit direct, explicit integration of metabolomics data with genomic data.
To attempt to overcome these challenges, several computational strategies have been developed to expand reaction space and
chemical databases. MyCompoundID,328,329 IIMDB,330 MINES331 and the ATLAS of biochemistry332 all use methods to enzymat-
ically enlarge compound space, though the enzymatic steps are quite general, for example, using the most basic transformations to
modify compounds (e.g. methylation at any functional group). While this method is quite effective at biochemically enlarging
biochemical compound space, the transformations used are not connected to an explicit reaction (e.g. methylation of cytosine), and
therefore are not connected to an explicit gene and thus not assessed in the context of a given genome. Retrorules333 and rePrime334
enlarge compound space by generating and using generalized reaction “rules” constructed from specific reactions in databases, and
thus are linked to explicit reference gene sequences that may plausibly catalyze the novel reaction, though these strategies have been
developed for retrosynthesis and pathway construction purposes,334,335 and are not currently easily used for interpreting mass
spectrometry data.
In addition to the strategies listed above, chemical networking has emerged as a valuable approach to addressing the dearth of
metabolites represented in reactions. Chemical networking relies on the similarity between metabolites. This similarity can be used
to expand reaction-space based on the probability that a new molecule can be substituted for a well-studied molecule in a chemical
reaction.305,332,336–338 By doing this, chemical networking bridges reaction databases with expansive metabolite databases to aid
integration. Once metabolites are associated with a reaction and a reaction is associated with a gene then putative metabolite
identifications can be assessed using the predicted metabolism of an organism.
The remaining challenge of connecting metabolites with specific gene products is that (like metabolite annotations) gene
annotations are often wrong. This is predominantly due to functional assertions being based on homology to reference sequences
unsupported by experimental validation.339 Specifically, annotation services296,340 attempt to annotate a gene product with
a specific biochemical function, sometimes choosing among equally probable but mutually exclusive functions or leaving them
unhelpfully vague. This practice can lead to false conclusions in the absence of biochemical experiments, since some enzymes can
have multiple substrates, are multifunctional, or have similar homology to several different reactions. Additionally, some annota-
tions are incorrect due to propagation of false annotations.341
Conducting one or more metabolomics experiments and then linking observed metabolites to gene sequences can provide direct
biochemical evidence for a gene product’s biochemical function, bolstering existing bioinformatic-based annotations, correcting
wrong annotations, and making vague annotations more specific. Recently, new approaches have been introduced that bridge the
gap between metabolomics and genomics and take into account enzyme promiscuity. The genomes-to-natural products (GNP)
platform was developed specifically for discovering new nonribosomal peptides (NRPs) and polyketides, and using a gene-forward
strategy, predicts possible chemical structures of NRP and polyketide synthases and then generates predicted MS/MS spectra based
on these (Fig. 24); predicted MS/MS spectra are then used to mine MS data.342 Pep2Path was also developed exclusively for NRPs
and post-translationally modified peptides (RiPPs), and using a Bayesian approach, scores putative NRPs and RiPPs based on gene
Mass Spectrometry for Natural Product Discovery 299
Fig. 24 The GNP. Modified under public license from Johnston, C. W.; Skinnider, M. A.; Wyatt, M. A.; Li, X.; Ranieri, M. R.; Yang, L.; et al. An Automated Genomes-
to-Natural Products Platform (GNP) for the Discovery of Modular Natural Products. Nat. Commun. 2015, 6, 8421. https://doi.org/10.1038/ncomms9421. http://
creativecommons.org/licenses/by/4.0/.
sequences in the assayed organism.343 Finally, a more general approach has been developed where major differences in the mass
spectrometry profiles of a mutant library are assayed, and these differences are then manually annotated using human intuition.344
Metagenomics and single-cell sequencing have enabled, for the first time, glimpses into the vast metabolic potential of Earth’s
collective biological systems. Yet, for the most part we cannot accurately predict nor identify the products of most biosynthetic
pathways. Most of what we know of microbial biochemistry is based on characterization of a few model microorganisms, and these
findings have been extended through sequence correlations to the rest of sequence space. Unfortunately, these extrapolations have
questionable validity for the vast majority of environmental microbes and therefore requires fundamentally different approaches
for directly linking novel sequences to their biochemical functions. The high-throughput and large scale application of genetic
techniques, product screening, and integrated informatic tools promises to deliver a detailed mapping of natural products, their
associated synthetic routes, and experimentally validated gene annotations.
6.12.10 Outlook
We see a very bright future for mass spectrometry approaches for discovery and analysis of natural products. The sensitivity and
specificity of mass spectrometers continues to dramatically improve as well as innovations in sample processing. Coupling mass
spectrometry to microfluidics has tremendous potential to increase throughput and reduce sample volumes. Such devices can be
integrated with DNA synthesis, systems biology tools, and microfluidic natural product production systems to create a relatively low-
cost platform for exploring natural product space. Equally important will be the development of computational tools that can quickly
sift through mass spectrometry data to identify novel natural products. While these will initially use pathway abstraction and chemical
networking approaches as we have described, increasingly they will shift to computational chemistry-based approaches that directly
predict structure de novo from fragmentation spectra. With these integrated capabilities mass spectrometry will continue to be a vital
tool in enabling researchers to explore biochemical space to discover urgently needed new natural products.
300 Mass Spectrometry for Natural Product Discovery
References
1. Fleming, A. On the Antibacterial Action of Cultures of a Penicillium, With Special Reference to Their Use in the Isolation of B. influenzæ. Br. J. Exp. Pathol. 1929, 10, 226.
2. Davies, J.; Ryan, K. S. Introducing the Parvome: Bioactive Compounds in the Microbial World. ACS Chem. Biol. 2012, 7, 252–259.
3. Frank, A. M.; Savitski, M. M.; Nielsen, M. L.; Zubarev, R. A.; Pevzner, P. A. De Novo Peptide Sequencing and Identification With Precision Mass Spectrometry. J. Proteome Res.
2007, 6, 114–123.
4. Wang, M.; et al. Sharing and Community Curation of Mass Spectrometry Data With Global Natural Products Social Molecular Networking. Nat. Biotechnol. 2016, 34,
828–837.
5. Goodwin, C. R.; et al. Structuring Microbial Metabolic Responses to Multiplexed Stimuli Via Self-Organizing Metabolomics Maps. Chem. Biol. 2015, 22, 661–670.
6. Watrous, J.; et al. Metabolic Profiling Directly From the Petri Dish Using Nanospray Desorption Electrospray Ionization Imaging Mass Spectrometry. Anal. Chem. 2013, 85,
10385–10391.
7. Yang, J. Y.; et al. Primer on Agar-Based Microbial Imaging Mass Spectrometry. J. Bacteriol. 2012, 194, 6023–6028.
8. Sica, V. P.; Raja, H. A.; El-Elimat, T.; Oberlies, N. H. Mass Spectrometry Imaging of Secondary Metabolites Directly on Fungal Cultures. RSC Adv. 2014, 4, 63221–63227.
9. Rath, C. M.; et al. Molecular Analysis of Model Gut Microbiotas by Imaging Mass Spectrometry and Nanodesorption Electrospray Ionization Reveals Dietary Metabolite
Transformations. Anal. Chem. 2012, 84, 9259–9267.
10. de Raad, M.; Fischer, C. R.; Northen, T. R. High-Throughput Platforms for Metabolomics. Curr. Opin. Chem. Biol. 2016, 30, 7–13.
10a. Lee, M. S. Integrated Strategies for Drug Discovery Using Mass Spectrometry; John Wiley & Sons, 2005.
11. Katz, L.; Baltz, R. H. Natural Product Discovery: Past, Present, and Future. J. Ind. Microbiol. Biotechnol. 2016, 43, 155–176.
12. Samat, N.; Tan, P. J.; Shaari, K.; Abas, F.; Lee, H. B. Prioritization of Natural Extracts by LC–MS-PCA for the Identification of New Photosensitizers for Photodynamic Therapy.
Anal. Chem. 2014, 86, 1324–1331.
13. Sikora, A. E.; Tehan, R.; McPhail, K. Utilization of Vibrio Cholerae as a Model Organism to Screen Natural Product Libraries for Identification of New Antibiotics. Methods Mol. Biol.
2018, 1839, 135–146.
14. Hou, Y.; et al. Microbial Strain Prioritization Using Metabolomics Tools for the Discovery of Natural Products. Anal. Chem. 2012, 84, 4277–4283.
15. Miao, V.; et al. Genetic Engineering in Streptomyces roseosporus to Produce Hybrid Lipopeptide Antibiotics. Chem. Biol. 2006, 13, 269–276.
16. Shih, P. M.; et al. A Robust Gene-Stacking Method Utilizing Yeast Assembly for Plant Synthetic Biology. Nat. Commun. 2016, 7, 13215.
17. Ingkaninan, K.; von Frijtag Drabbe Künzel, J. K.; IJzerman, A. P.; Verpoorte, R. Interference of Linoleic Acid Fraction in Some Receptor Binding Assays. J. Nat. Prod. 1999, 62,
912–914.
18. Manyi-Loh, C. E.; Clarke, A. M.; Ndip, R. N. Detection of Phytoconstituents in Column Fractions of N-Hexane Extract of Goldcrest Honey Exhibiting Anti-Helicobacter pylori
Activity. Arch. Med. Res. 2012, 43, 197–204.
19. Bhandari, M.; Bhandari, A.; Bhandari, A. Sepbox Technique in Natural Products. J. Young Pharm. 2011, 3, 226–231.
20. Woo, H.-K.; Northen, T. R.; Yanes, O.; Siuzdak, G. Nanostructure-Initiator Mass Spectrometry: A Protocol for Preparing and Applying NIMS Surfaces for High-Sensitivity Mass
Analysis. Nat. Protoc. 2008, 3, 1341–1349.
21. Wei, J.; Buriak, J. M.; Siuzdak, G. Desorption–Ionization Mass Spectrometry on Porous Silicon. Nature 1999, 399, 243–246.
22. Greving, M.; et al. Acoustic Deposition With NIMS as a High-Throughput Enzyme Activity Assay. Anal. Bioanal. Chem. 2012, 403, 707–711.
23. Stapels, M. D.; Barofsky, D. F. Complementary Use of MALDI and ESI for the HPLC-MS/MS Analysis of DNA-Binding Proteins. Anal. Chem. 2004, 76, 5423–5430.
24. Gropengiesser, J.; Varadarajan, B. T.; Stephanowitz, H.; Krause, E. The Relative Influence of Phosphorylation and Methylation on Responsiveness of Peptides to MALDI and ESI
Mass Spectrometry. J. Mass Spectrom. 2009, 44, 821–831.
25. Wleklinski, M.; et al. High Throughput Reaction Screening Using Desorption Electrospray Ionization Mass Spectrometry. Chem. Sci. 2018, 9, 1647–1653.
26. Zeng, L.; Burton, L.; Yung, K.; Shushan, B.; Kassel, D. B. Automated Analytical/Preparative High-Performance Liquid Chromatography–Mass Spectrometry System for the Rapid
Characterization and Purification of Compound Libraries. J. Chromatogr. A 1998, 794, 3–13.
27. Zeng, L.; Xu, R.; Zhang, Y.; Kassel, D. B. Two-Dimensional Supercritical Fluid Chromatography/Mass Spectrometry for the Enantiomeric Analysis and Purification of
Pharmaceutical Samples. J. Chromatogr. A 2011, 1218, 3080–3088.
28. Thomas Shier, W.; Rinehart, K. L.; Gottlieb, D. Preparation of Four New Antibiotics From a Mutant of Streptomyces fradiae. Proc. Natl. Acad. Sci. U. S. A. 1969, 63,
198–204.
29. Denoya, C. D.; et al. A Second Branched-Chain Alpha-Keto Acid Dehydrogenase Gene Cluster (bkdFGH) From Streptomyces avermitilis: Its Relationship to Avermectin
Biosynthesis and the Construction of a bkdF Mutant Suitable for the Production of Novel Antiparasitic Avermectins. J. Bacteriol. 1995, 177, 3504–3511.
30. Baltz, R. H. Streptomyces Temperate Bacteriophage Integration Systems for Stable Genetic Engineering of Actinomycetes (and Other Organisms). J. Ind. Microbiol. Biotechnol.
2012, 39, 661–672.
31. Ren, H.; Wang, B.; Zhao, H. Breaking the Silence: New Strategies for Discovering Novel Natural Products. Curr. Opin. Biotechnol. 2017, 48, 21–27.
32. Tong, Y.; Weber, T.; Lee, S. Y. CRISPR/Cas-Based Genome Engineering in Natural Product Discovery. Nat. Prod. Rep. 2018. https://doi.org/10.1039/c8np00089a.
33. Tao, W.; Yang, A.; Deng, Z.; Sun, Y. CRISPR/Cas9-Based Editing of Streptomyces for Discovery, Characterization, and Production of Natural Products. Front. Microbiol. 2018, 9,
1660.
34. Zhang, M. M.; et al. CRISPR-Cas9 Strategy for Activation of Silent Streptomyces Biosynthetic Gene Clusters. Nat. Chem. Biol. 2017. https://doi.org/10.1038/nchembio.2341.
35. Nguyen, K. T.; et al. Combinatorial Biosynthesis of Novel Antibiotics Related to Daptomycin. Proc. Natl. Acad. Sci. U. S. A. 2006, 103, 17462–17467.
36. Schobel, U.; et al. High Resolution Screening of Plant Natural Product Extracts for Estrogen Receptor Alpha and Beta Binding Activity Using an Online HPLC-MS Biochemical
Detection System. J. Biomol. Screen. 2001, 6, 291–303.
37. Hofstadler, S. A.; Sannes-Lowery, K. A. Applications of ESI-MS in Drug Discovery: Interrogation of Noncovalent Complexes. Nat. Rev. Drug Discov. 2006, 5, 585–595.
38. Hofstadler, S. A.; Griffey, R. H. Mass Spectrometry as a Drug Discovery Platform Against RNA Targets. Curr. Opin. Drug Discov. Devel. 2000, 3, 423–431.
39. Cummins, L. L.; et al. Multitarget Affinity/Specificity Screening of Natural Products: Finding and Characterizing High-Affinity Ligands From Complex Mixtures by Using High-
Performance Mass Spectrometry. J. Nat. Prod. 2003, 66, 1186–1190.
40. Swayze, E. E.; et al. SAR by MS: A Ligand Based Technique for Drug Lead Discovery against Structured RNA Targets. J. Med. Chem. 2002, 45, 3816–3819.
41. Seth, P. P.; et al. SAR by MS: Discovery of a New Class of RNA-Binding Small Molecules for the Hepatitis C Virus: Internal Ribosome Entry Site IIA Subdomain. J. Med. Chem.
2005, 48, 7099–7102.
42. Greig, M. J.; Robinson, J. M. Detection of Oligonucleotide-Ligand Complexes by ESI-MS (DOLCE-MS) as a Component of High Throughput Screening. J. Biomol. Screen. 2000,
5, 441–454.
43. Shi, X.; Takamizawa, A.; Nishimura, Y.; Hiraoka, K.; Akashi, S. Stability Analysis for Double-Stranded DNA Oligomers and Their Noncovalent Complexes With Drugs by Laser
Spray. J. Mass Spectrom. 2006, 41, 1086–1095.
44. Zhang, Q. W.; Lin, L. G.; Ye, W. C. Techniques for Extraction and Isolation of Natural Products: A Comprehensive Review. Chinas Med. 2018, 13, 20.
45. Chemat, F.; Vian, M. A.; Cravotto, G. Green Extraction of Natural Products: Concept and Principles. Int. J. Mol. Sci. 2012, 13, 8615–8627.
46. Duquesnoy, E.; Castola, V.; Casanova, J. Identification and Quantitative Determination of Carbohydrates in Ethanolic Extracts of Two Conifers Using 13C NMR Spectroscopy.
Carbohydr. Res. 2008, 343, 893–902.
47. Kumar, S.; Singh, A.; Kumar, B. Identification and Characterization of Phenolics and Terpenoids From Ethanolic Extracts of Phyllanthus Species by HPLC-ESI-QTOF-MS/MS.
J. Pharm. Anal. 2017, 7, 214–222.
Mass Spectrometry for Natural Product Discovery 301
48. Kumar, S.; et al. Structural Characterization of Monoterpene Indole Alkaloids in Ethanolic Extracts of Rauwolfia Species by Liquid Chromatography With Quadrupole Time-of-
Flight Mass Spectrometry. J. Pharm. Anal. 2016, 6, 363–373.
49. Teo, C. C.; Tan, S. N.; Yong, J. W.; Hew, C. S.; Ong, E. S. Pressurized Hot Water Extraction (PHWE). J. Chromatogr. A 2010, 1217, 2484–2494.
50. Herrero, M.; Cifuentes, A.; Ibanez, E. Sub- and Supercritical Fluid Extraction of Functional Ingredients From Different Natural Sources: Plants, Food-by-Products, algae and
microalgae A review. Food Chem. 2006, 98, 136–148.
51. Chiou, C. T.; Schmedding, D. W.; Manes, M. Partitioning of Organic-Compounds in Octanol-Water Systems. Environ. Sci. Technol. 1982, 16, 4–10.
52. Luthria, D. L.; Biswas, R.; Natarajan, S. Comparison of Extraction Solvents and Techniques Used for the Assay of Isoflavones From Soybean. Food Chem. 2007, 105, 325–333.
53. Burianek, L. L.; Yousef, A. E. Solvent Extraction of Bacteriocins From Liquid Cultures. Lett. Appl. Microbiol. 2000, 31, 193–197.
54. Rapinel, V.; et al. An Original Approach for Lipophilic Natural Products Extraction: Use of Liquefied N-Butane as Alternative Solvent to N-Hexane. LWT-Food Sci. Technol. 2017,
85, 524–533.
55. Anokwuru, C. P.; et al. Effect of Extraction Solvents on Phenolic, Flavonoid and Antioxidant Activities of Three Nigerian Medicinal Plants. Nat. Sci. Sleep 2011, 9, 53–61.
56. Suomi, J.; Sirén, H.; Hartonen, K.; Riekkola, M. L. Extraction of Iridoid Glycosides and Their Determination by Micellar Electrokinetic Capillary Chromatography. J. Chromatogr. A
2000, 868, 73–83.
57. Grosser, K.; van Dam, N. M. A. Straightforward Method for Glucosinolate Extraction and Analysis With High-Pressure Liquid Chromatography (HPLC). J. Vis. Exp. 2017. https://
doi.org/10.3791/55425.
58. Doheny-Adams, T.; Redeker, K.; Kittipol, V.; Bancroft, I.; Hartley, S. E. Development of an Efficient Glucosinolate Extraction Method. Plant Methods 2017, 13, 17.
59. Mantegna, S.; et al. A One-Pot Ultrasound-Assisted Water Extraction/Cyclodextrin Encapsulation of Resveratrol From Polygonum cuspidatum. Food Chem. 2012, 130,
746–750.
60. Eikani, M. H.; Golmohammad, F.; Rowshanzamir, S. Subcritical Water Extraction of Essential Oils From Coriander Seeds (Coriandrum sativum L.). J. Food Eng. 2007, 80,
735–740.
61. He, D. H.; et al. Tricalysiosides A-G: Rearranged ent-Kauranoid Glycosides From the Leaves of Tricalysia dubia. J. Nat. Prod. 2002, 65, 685–688.
62. Song, Y.; et al. Simultaneous Determination of Aconite Alkaloids and Ginsenosides Using Online Solid Phase Extraction Hyphenated With Polarity Switching Ultra-High
Performance Liquid Chromatography Coupled With Tandem Mass Spectrometry. RSC Adv. 2015, 5, 6419–6428.
63. Thurman, E. M.; Mills, M. S. Solid-Phase Extraction: Principles and Practice; Wiley: New York, 1998; Vol. 16.
64. Yung An, C.; et al. Efficiency of Polyphenol Extraction From Artificial Honey Using C18 Cartridges and AmberliteW XAD-2 Resin: A Comparative Study. J. Chem. Chem. Eng.
2016, 2016, 8356739.
65. Mansson, M.; et al. Explorative Solid-Phase Extraction (E-SPE) for Accelerated Microbial Natural Product Discovery, Dereplication, and Purification. J. Nat. Prod. 2010, 73,
1126–1132.
66. Cutignano, A.; et al. Development and Application of a Novel SPE-Method for Bioassay-Guided Fractionation of Marine Extracts. Mar. Drugs 2015, 13, 5736–5749.
67. Bucar, F.; Wube, A.; Schmid, M. Natural Product Isolation—How to Get From Biological Material to Pure Compounds. Nat. Prod. Rep. 2013, 30, 525–545.
68. Ostermann, A. I.; Willenberg, I.; Weylandt, K. H.; Schebb, N. H. Development of an Online-SPE–LC–MS/MS Method for 26 Hydroxylated Polyunsaturated Fatty Acids as Rapid
Targeted Metabolomics Approach for the LOX, CYP, and Autoxidation Pathways of the Arachidonic Acid Cascade. Chromatographia 2015, 78, 415–428.
69. Maia, M.; et al. Metabolite Extraction for High-Throughput FTICR-MS-Based Metabolomics of Grapevine Leaves. EuPA Open Proteom. 2016, 12, 4–9.
70. Azwanida, N. N. A Review on the Extraction Methods Use in Medicinal Plants, Principle, Strength and Limitation. Med. Aromat. Plants 2015, 4, 3.
71. Jovic, O.; Habinovec, I.; Galic, N.; Andrašec, M. Maceration of Extra Virgin Olive Oil With Common Aromatic Plants Using Ultrasound-Assisted Extraction: An UV-Vis
Spectroscopic Investigation. J. Spectrosc. 2018, 2018, 7510647.
72. Cui, H. Y.; et al. Comparison of Conventional and Ultrasound-Assisted Methods for Extraction of Nutraceutical Compounds From Dendrobium candidum. CyTA J. Food 2014, 12,
355–359.
73. Kongkiatpaiboon, S.; Gritsanapan, W. Optimized Extraction for High Yield of Insecticidal Didehydrostemofoline Alkaloid in Stemona Collinsiae Root Extracts. Ind. Crop Prod. 2013,
41, 371–374.
74. Lapornik, B.; Prosek, M.; Wondra, A. G. Comparison of Extracts Prepared From Plant by-Products Using Different Solvents and Extraction Time. J. Food Eng. 2005, 71,
214–222.
75. Hu, Y.; et al. Optimisation of Ethanol-Reflux Extraction of Saponins From Steamed Panax notoginseng by Response Surface Methodology and Evaluation of Hematopoiesis Effect.
Molecules 2018, 23, E1206.
76. Stalikas, C. D. Extraction, Separation, and Detection Methods for Phenolic Acids and Flavonoids. J. Sep. Sci. 2007, 30, 3268–3295.
76a. Camel, V. Recent Extraction Techniques for Solid Matrices—Supercritical Fluid Extraction, Pressurized Fluid Extraction and Microwave-Assisted Extraction: Their Potential and
Pitfalls. Analyst 2001, 126, 1182–1193.
77. Chemat, F.; et al. Ultrasound Assisted Extraction of Food and Natural Products. Mechanisms, Techniques, Combinations, Protocols and Applications. A Review. Ultrason.
Sonochem. 2017, 34, 540–560.
78. Iskalieva, A.; et al. Cavitation Assisted Delignification of Wheat Straw: A Review. Ultrason. Sonochem. 2012, 19, 984–993.
79. Ghafoor, K.; Choi, Y. H.; Jeon, J. Y.; Jo, I. H. Optimization of Ultrasound-Assisted Extraction of Phenolic Compounds, Antioxidants, and Anthocyanins From Grape (Vitis vinifera)
Seeds. J. Agric. Food Chem. 2009, 57, 4988–4994.
80. Agarwal, C.; Mathe, K.; Hofmann, T.; Csoka, L. Ultrasound-Assisted Extraction of Cannabinoids From Cannabis sativa L. Optimized by Response Surface Methodology. J. Food
Sci. 2018, 83, 700–710.
81. Dent, M.; et al. Comparison of Conventional and Ultrasound-Assisted Extraction Techniques on Mass Fraction of Phenolic Compounds From Sage (Salvia officinalis L.). Chem.
Biochem. Eng. Q. 2015, 29, 475–484.
82. Rutledge, P. J.; Challis, G. L. Discovery of Microbial Natural Products by Activation of Silent Biosynthetic Gene Clusters. Nat. Rev. Microbiol. 2015, 13, 509–523.
83. Cardoso, L. C.; Serrano, C. M.; Rodríguez, M. R.; de la Ossa, E. J. M.; Lubián, L. M. Extraction of Carotenoids and Fatty Acids From Microalgae Using Supercritical Technology.
Am. J. Anal. Chem. 2012, 3, 877.
84. Eskilsson, C. S.; Bjorklund, E. Analytical-Scale Microwave-Assisted Extraction. J. Chromatogr. A 2000, 902, 227–250.
85. Azzouz, A.; Ballesteros, E. Determination of 13 Endocrine Disrupting Chemicals in Environmental Solid Samples Using Microwave-Assisted Solvent Extraction and Continuous
Solid-Phase Extraction Followed by Gas Chromatography-Mass Spectrometry. Anal. Bioanal. Chem. 2016, 408, 231–241.
86. Routray, W.; Orsat, V. Microwave-Assisted Extraction of Flavonoids: A Review. Food Bioproc. Tech. 2011, 5, 409–424.
87. Xiao, W.; Han, L.; Shi, B. Microwave-Assisted Extraction of Flavonoids From Radix Astragali. Sep. Purif. Technol. 2008, 62, 614–618.
88. Wang, Y. L.; Xi, G. S.; Zheng, Y. C.; Miao, F. S. Microwave-Assisted Extraction of Flavonoids From Chinese Herb Radix puerariae (Ge Gen). J. Med. Plant Res. 2010, 4,
304–308.
89. Dahmoune, F.; Nayak, B.; Moussi, K.; Remini, H.; Madani, K. Optimization of Microwave-Assisted Extraction of Polyphenols From Myrtus communis L. leaves. Food Chem. 2015,
166, 585–595.
90. Mustafa, A.; Turner, C. Pressurized Liquid Extraction as a Green Approach in Food and Herbal Plants Extraction: A Review. Anal. Chim. Acta 2011, 703, 8–18.
91. Machado, A. P. D. F.; Pasquel-Reátegui, J. L.; Barbero, G. F.; Martínez, J. Pressurized Liquid Extraction of Bioactive Compounds From Blackberry (Rubus fruticosus L.) Residues:
A Comparison With Conventional Methods. Food Res. Int. 2015, 77, 675–683.
92. Zaghdoudi, K.; et al. Accelerated Solvent Extraction of Carotenoids From: Tunisian Kaki (Diospyros kaki L.), Peach (Prunus persica L.) and Apricot (Prunus armeniaca L.). Food
Chem. 2015, 184, 131–139.
302 Mass Spectrometry for Natural Product Discovery
93. Xi, J.; Shen, D.; Li, Y.; Zhang, R. Ultrahigh Pressure Extraction as a Tool to Improve the Antioxidant Activities of Green Tea Extracts. Food Res. Int. 2011, 44, 2783–2787.
94. Khaw, K. Y.; Parat, M. O.; Shaw, P. N.; Falconer, J. R. Solvent Supercritical Fluid Technologies to Extract Bioactive Compounds From Natural Sources: A Review. Molecules
2017, 22, E1186.
95. Herrero, M.; Mendiola, J. A.; Cifuentes, A.; Ibanez, E. Supercritical Fluid Extraction: Recent Advances and Applications. J. Chromatogr. A 2010, 1217, 2495–2511.
96. Knez, Ž.; et al. Industrial Applications of Supercritical Fluids: A Review. Energy 2014, 77, 235–243.
97. da Costa Lopes, A. M.; Brenner, M.; Falé, P.; Roseiro, L. B.; Bogel-Łukasik, R. Extraction and Purification of Phenolic Compounds From Lignocellulosic Biomass Assisted by Ionic
Liquid, Polymeric Resins, and Supercritical CO2. ACS Sustain. Chem. Eng. 2016, 4, 3357–3367.
98. Zhang, T.; Creek, D. J.; Barrett, M. P.; Blackburn, G.; Watson, D. G. Evaluation of Coupling Reversed Phase, Aqueous Normal Phase, and Hydrophilic Interaction Liquid
Chromatography With Orbitrap Mass Spectrometry for Metabolomic Studies of Human Urine. Anal. Chem. 2012, 84, 1994–2001.
99. Zhou, B.; Xiao, J. F.; Tuli, L.; Ressom, H. W. LC-MS-Based Metabolomics. Mol. Biosyst. 2012, 8, 470–481.
100. Zelena, E.; et al. Development of a Robust and Repeatable UPLC-MS Method for the Long-Term Metabolomic Study of Human Serum. Anal. Chem. 2009, 81, 1357–1364.
101. Guillarme, D.; Veuthey, J.-L. Alternative Strategies to Reversed-Phase Liquid Chromatography for the Analysis of Pharmaceutical Compounds. Am. Pharm. Rev. 2017, 20,
46–51.
102. Dorsey, J. G.; Dill, K. A. The Molecular Mechanism of Retention in Reversed-Phase Liquid Chromatography. Chem. Rev. 1989, 89, 331–346.
103. Naser, F. J.; et al. Two Complementary Reversed-Phase Separations for Comprehensive Coverage of the Semipolar and Nonpolar Metabolome. Anal. Bioanal. Chem. 2018, 410,
1287–1297.
104. Valkó, K.; Snyder, L. R.; Glajch, J. L. Retention in Reversed-Phase Liquid Chromatography as a Function of Mobile-Phase Composition. J. Chromatogr. A 1993, 656, 501–520.
105. Zhang, Y.; et al. A Review of the Extraction and Determination Methods of Thirteen Essential Vitamins to the Human Body: An Update From 2010. Molecules 2018, 23, E1484.
106. Higashi, T.; et al. A Specific LC/ESI-MS/MS Method for Determination of 25-Hydroxyvitamin D3 in Neonatal Dried Blood Spots Containing a Potential Interfering Metabolite,
3-epi-25-Hydroxyvitamin D3. J. Sep. Sci. 2011, 34, 725–732.
107. Lee, S. H.; Lee, N.; Hong, Y.; Chung, B. C.; Choi, M. H. Simultaneous Analysis of Free and Sulfated Steroids by Liquid Chromatography/Mass Spectrometry With Selective Mass
Spectrometric Scan Modes and Polarity Switching. Anal. Chem. 2016, 88, 11624–11630.
108. Jandera, P. Liquid chromatography | normal phase. In Reference Module in Chemistry, Molecular Sciences and Chemical Engineering, Elsevier, 2013.
109. Lu, J.; Rustum, A. M. Effect of Trace Amounts of Water in the Mobile Phase of Normal-Phase Enantioselective High-Performance Liquid Chromatography on Selectivity and
Resolution of Optical Isomers. J. Chromatogr. Sci. 2009, 47, 320–323.
110. Hao, Z.; Parker, B.; Knapp, M.; Yu, L. Simultaneous Quantification of Alpha-Tocopherol and Four Major Carotenoids in Botanical Materials by Normal Phase Liquid
Chromatography-Atmospheric Pressure Chemical Ionization-Tandem Mass Spectrometry. J. Chromatogr. A 2005, 1094, 83–90.
111. Heudi, O.; Trisconi, M.-J.; Blake, C.-J. Simultaneous Quantification of Vitamins A, D3 and E in Fortified Infant Formulae by Liquid Chromatography–Mass Spectrometry.
J. Chromatogr. A 2004, 1022, 115–123.
112. Alpert, A. J. Hydrophilic-Interaction Chromatography for the Separation of Peptides, Nucleic Acids and Other Polar Compounds. J. Chromatogr. 1990, 499, 177–196.
113. Buszewski, B.; Noga, S. Hydrophilic Interaction Liquid Chromatography (HILIC)—A Powerful Separation Technique. Anal. Bioanal. Chem. 2012, 402, 231–247.
114. Jandera, P. Stationary Phases for Hydrophilic Interaction Chromatography, Their Characterization and Implementation into Multidimensional Chromatography Concepts. J. Sep.
Sci. 2008, 31, 1421–1437.
115. Kowalska, S.; Krupczynska, K.; Buszewski, B. The Influence of the Mobile Phase pH and the Stationary Phase Type on the Selectivity Tuning in High Performance Liquid
Chromatography Nucleosides Separation. J. Sep. Sci. 2005, 28, 1502–1511.
116. Di Palma, S.; Boersema, P. J.; Heck, A. J.; Mohammed, S. Zwitterionic Hydrophilic Interaction Liquid Chromatography (ZIC-HILIC and ZIC-cHILIC) Provide High Resolution
Separation and Increase Sensitivity in Proteome Analysis. Anal. Chem. 2011, 83, 3440–3447.
117. Baran, R.; et al. Metabolite Identification in Synechococcus sp. PCC 7002 Using Untargeted Stable Isotope Assisted Metabolite Profiling. Anal. Chem. 2010, 82, 9034–9042.
118. Dell’mour, M.; et al. Hydrophilic Interaction LC Combined With Electrospray MS for Highly Sensitive Analysis of Underivatized Amino Acids in Rhizosphere Research. J. Sep. Sci.
2010, 33, 911–922.
119. Neubauer, S.; et al. Mass Spectrometry Based Analysis of Nucleotides, Nucleosides, and Nucleobases—Application to Feed Supplements. Anal. Bioanal. Chem. 2012, 404,
799–808.
120. Hu, Y.; Zheng, Q.; Zhang, S.; Noll, L.; Wanek, W. Significant Release and Microbial Utilization of Amino Sugars and d-Amino Acid Enantiomers From Microbial Cell Wall
Decomposition in Soils. Soil Biol. Biochem. 2018, 123, 115–125.
121. Strege, M. A. Hydrophilic Interaction Chromatography-Electrospray Mass Spectrometry Analysis of Polar Compounds for Natural Product Drug Discovery. Anal. Chem. 1998, 70,
2439–2445.
122. Schafer, M.; Brutting, C.; Baldwin, I. T.; Kallenbach, M. High-Throughput Quantification of More Than 100 Primary- and Secondary-Metabolites, and Phytohormones by a Single
Solid-Phase Extraction Based Sample Preparation With Analysis by UHPLC-HESI-MS/MS. Plant Methods 2016, 12, 30.
123. Hanko, V. P.; Rohrer, J. S. Determination of Carbohydrates, Sugar Alcohols, and Glycols in Cell Cultures and Fermentation Broths Using High-Performance Anion-Exchange
Chromatography With Pulsed Amperometric Detection. Anal. Biochem. 2000, 283, 192–199.
124. Tomiya, N.; Ailor, E.; Lawrence, S. M.; Betenbaugh, M. J.; Lee, Y. C. Determination of Nucleotides and Sugar Nucleotides Involved in Protein Glycosylation by High-Performance
Anion-Exchange Chromatography: Sugar Nucleotide Contents in Cultured Insect Cells and Mammalian Cells. Anal. Biochem. 2001, 293, 129–137.
125. Kaiser, K.; Benner, R. Determination of Amino Sugars in Environmental Samples With High Salt Content by High-Performance Anion-Exchange Chromatography and Pulsed
Amperometric Detection. Anal. Chem. 2000, 72, 2566–2572.
126. Karu, N.; Dicinoski, G. W.; Haddad, P. R. Use of Suppressors for Signal Enhancement of Weakly-Acidic Analytes in Ion Chromatography With Universal Detection Methods.
Trends Anal. Chem. 2012, 40, 119–132.
127. van Dam, J. C.; et al. Analysis of Glycolytic Intermediates in Saccharomyces cerevisiae Using Anion Exchange Chromatography and Electrospray Ionization With Tandem Mass
Spectrometric Detection. Anal. Chim. Acta 2002, 460, 209–218.
128. Saarnio, K.; Teinila, K.; Aurela, M.; Timonen, H.; Hillamo, R. High-Performance Anion-Exchange Chromatography-Mass Spectrometry Method for Determination of Levoglucosan,
Mannosan, and Galactosan in Atmospheric Fine Particulate Matter. Anal. Bioanal. Chem. 2010, 398, 2253–2264.
129. Fenn, J. B.; Mann, M.; Meng, C. K.; Wong, S. F.; Whitehouse, C. M. Electrospray Ionization for Mass-Spectrometry of Large Biomolecules. Science 1989, 246, 64–71.
130. Wilm, M. Principles of Electrospray Ionization. Mol. Cell. Proteomics 2011, 10, M111.009407.
131. Constantopoulos, T. L.; Jackson, G. S.; Enke, C. G. Effects of Salt Concentration on Analyte Response Using Electrospray Ionization Mass Spectrometry. J. Am. Soc. Mass
Spectrom. 1999, 10, 625–634.
132. Wahl, J. H.; Goodlett, D. R.; Udseth, H. R.; Smith, R. D. Use of Small-Diameter Capillaries for Increasing Peptide and Protein-Detection Sensitivity in Capillary-Electrophoresis
Mass-Spectrometry. Electrophoresis 1993, 14, 448–457.
133. Rebane, R.; Rodima, T.; Kutt, A.; Herodes, K. Development of Amino Acid Derivatization Reagents for Liquid Chromatography Electrospray Ionization Mass Spectrometric Analysis
and Ionization Efficiency Measurements. J. Chromatogr. A 2015, 1390, 62–70.
134. Hermans, J.; Ongay, S.; Markov, V.; Bischoff, R. Physicochemical Parameters Affecting the Electrospray Ionization Efficiency of Amino Acids After Acylation. Anal. Chem. 2017,
89, 9159–9166.
135. Konermann, L.; Ahadi, E.; Rodriguez, A. D.; Vahidi, S. Unraveling the Mechanism of Electrospray Ionization. Anal. Chem. 2013, 85, 2–9.
136. Kostiainen, R.; Kauppila, T. J. Effect of Eluent on the Ionization Process in Liquid Chromatography-Mass Spectrometry. J. Chromatogr. A 2009, 1216, 685–699.
137. Rebane, R.; et al. Establishing Atmospheric Pressure Chemical Ionization Efficiency Scale. Anal. Chem. 2016, 88, 3435–3439.
Mass Spectrometry for Natural Product Discovery 303
138. Commisso, M.; Anesi, A.; Dal Santo, S.; Guzzo, F. Performance Comparison of Electrospray Ionization and Atmospheric Pressure Chemical Ionization in Untargeted and Targeted
Liquid Chromatography/Mass Spectrometry Based Metabolomics Analysis of Grapeberry Metabolites. Rapid Commun. Mass Spectrom. 2017, 31, 292–300.
139. Olmo-Garcia, L.; et al. Unravelling the Distribution of Secondary Metabolites in Olea europaea L.: Exhaustive Characterization of Eight Olive-Tree Derived Matrices by
Complementary Platforms (LC-ESI/APCI-MS and GC-APCI-MS). Molecules 2018, 23, E2419.
140. Takats, Z.; Wiseman, J. M.; Gologan, B.; Cooks, R. G. Mass Spectrometry Sampling Under Ambient Conditions With Desorption Electrospray Ionization. Science 2004, 306,
471–473.
141. Talaty, N.; Takats, Z.; Cooks, R. G. Rapid In Situ Detection of Alkaloids in Plant Tissue Under Ambient Conditions Using Desorption Electrospray Ionization. Analyst 2005, 130,
1624–1633.
142. Wiseman, J. M.; et al. Desorption Electrospray Ionization Mass Spectrometry: Imaging Drugs and Metabolites in Tissues. Proc. Natl. Acad. Sci. U. S. A. 2008, 105,
18120–18125.
143. Karas, M.; Bachmann, D.; Bahr, U.; Hillenkamp, F. Matrix-Assisted Ultraviolet Laser Desorption of Non-volatile Compounds. Int. J. Mass Spectrom. Ion Processes 1987, 78,
53–68.
144. Spengler, B. Mass Spectrometry Imaging of Biomolecular Information. Anal. Chem. 2015, 87, 64–82.
145. Qin, L.; et al. Recent Advances in Matrix-Assisted Laser Desorption/Ionisation Mass Spectrometry Imaging (MALDI-MSI) for In Situ Analysis of Endogenous Molecules in Plants.
Phytochem. Anal. 2018, 29, 351–364.
146. Stolee, J. A.; Walker, B. N.; Zorba, V.; Russo, R. E.; Vertes, A. Laser–Nanostructure Interactions for Ion Production. Phys. Chem. Chem. Phys. 2012, 14, 8453.
147. de Rond, T.; Danielewicz, M.; Northen, T. High Throughput Screening of Enzyme Activity With Mass Spectrometry Imaging. Curr. Opin. Biotechnol. 2014, 31C, 1–9.
148. Northen, T. R.; et al. Clathrate Nanostructures for Mass Spectrometry. Nature 2007, 449, 1033–1036.
149. Haag, A. M. Mass Analyzers and Mass Spectrometers. Adv. Exp. Med. Biol. 2016, 919, 157–169.
150. Scigelova, M.; Makarov, A. Advances in Bioanalytical LC–MS Using the Orbitrap™ Mass Analyzer. Bioanalysis 2009, 1, 741–754.
151. Schrimpe-Rutledge, A. C.; Sherrod, S. D.; McLean, J. A. Improving the Discovery of Secondary Metabolite Natural Products Using Ion Mobility-Mass Spectrometry. Curr. Opin.
Chem. Biol. 2018, 42, 160–166.
152. May, J. C.; McLean, J. A. Ion Mobility-Mass Spectrometry: Time-Dispersive Instrumentation. Anal. Chem. 2015, 87, 1422–1436.
153. Yan, Z.; Caldwell, G. W.; Maher, N. Unbiased High-Throughput Screening of Reactive Metabolites on the Linear Ion Trap Mass Spectrometer Using Polarity Switch and Mass Tag
Triggered Data-Dependent Acquisition. Anal. Chem. 2008, 80, 6410–6422.
154. Rudomin, E. L.; Carr, S. A.; Jaffe, J. D. Directed Sample Interrogation Utilizing an Accurate Mass Exclusion-Based Data-Dependent Acquisition Strategy (AMEx). J. Proteome Res.
2009, 8, 3154–3160.
155. Bruderer, R.; et al. Optimization of Experimental Parameters in Data-Independent Mass Spectrometry Significantly Increases Depth and Reproducibility of Results. Mol. Cell.
Proteomics 2017, 16, 2296–2309.
156. Kalli, A.; Smith, G. T.; Sweredoski, M. J.; Hess, S. Evaluation and Optimization of Mass Spectrometric Settings during Data-Dependent Acquisition Mode: Focus on LTQ-Orbitrap
Mass Analyzers. J. Proteome Res. 2013, 12, 3071–3086.
157. Ernst, M.; Silva, D. B.; Silva, R. R.; Vêncio, R. Z. N.; Lopes, N. P. Mass Spectrometry in Plant Metabolomics Strategies: From Analytical Platforms to Data Acquisition and
Processing. Nat. Prod. Rep. 2014, 31, 784–806.
158. Baerson, S. R.; Rimando, A. M. A Plethora of Polyketides: Structures, Biological Activities, and Enzymes. Polyketides 2007, 955, 2–14.
159. Hertweck, C. The Biosynthetic Logic of Polyketide Diversity. Angew. Chem. Int. Ed. Engl. 2009, 48, 4688–4716.
159a. Weissman, K. J.; Leadlay, P. F. Combinatorial Biosynthesis of Reduced Polyketides. Nat. Rev. Microbiol. 2005, 3, 925–936.
160. Bumpus, S. B.; Kelleher, N. L. Accessing Natural Product Biosynthetic Processes by Mass Spectrometry. Curr. Opin. Chem. Biol. 2008, 12, 475–482.
161. Bhan, N.; Cress, B. F.; Linhardt, R. J.; Koffas, M. Expanding the Chemical Space of Polyketides Through Structure-Guided Mutagenesis of Vitis vinifera Stilbene Synthase.
Biochimie 2015, 115, 136–143.
162. Hagen, A.; et al. Engineering a Polyketide Synthase for In Vitro Production of Adipic Acid. ACS Synth. Biol. 2016, 5, 21–27.
163. Qin, Z.; et al. Formicamycins, Antibacterial Polyketides Produced by Streptomyces Formicae Isolated From African Tetraponera Plant-Ants. Chem. Sci. 2017, 8, 3218–3227.
164. Ueki, M.; et al. Isolation of New Polyketide Metabolites, Linearolides A and B, From Streptomyces sp. RK95-74. J. Antibiot. 2013, 66, 333–337.
165. Wills, R. H.; Tosin, M.; O’Connor, P. B. Structural Characterization of Polyketides Using High Mass Accuracy Tandem Mass Spectrometry. Anal. Chem. 2012, 84, 8863–8870.
166. Kwan, D. H.; Schulz, F. The Stereochemistry of Complex Polyketide Biosynthesis by Modular Polyketide Synthases. Molecules 2011, 16, 6092–6115.
167. Weissman, K. J. Polyketide Stereocontrol: A Study in Chemical Biology. Beilstein J. Org. Chem. 2017, 13, 348–371.
168. Paulsen, B. S. Glycosylated natural products. In Encyclopedia of Biophysics; Roberts, G. C. K., Ed.; Springer Berlin Heidelberg, 2013; p 931.
169. Thorson, J. S.; Vogt, T. Glycosylated natural products. In Carbohydrate-Based Drug Discovery; Wong, C.-H., Ed.; Wiley-VCH Verlag GmbH & Co. KGaA, 2005; pp 685–711.
170. Kren, V. Chemical biology and biomedicine of glycosylated natural compounds. In Glycoscience: Chemistry and Chemical Biology I–III; Fraser-Reid, B. O., Tatsuta, K., Thiem, J.,
Eds.; Springer Berlin Heidelberg, 2001; pp 2471–2529.
171. Křen, V. Glycoside vs. aglycon: The role of glycosidic residue in biological activity. In Glycoscience: Chemistry and Chemical Biology; Fraser-Reid, B. O., Tatsuta, K., Thiem, J.,
Eds.; Springer Berlin Heidelberg, 2008; pp 2589–2644.
172. Yu, C.; et al. Pretreatment of Baicalin and Wogonoside With Glycoside Hydrolase: A Promising Approach to Enhance Anticancer Potential. Oncol. Rep. 2013, 30, 2411–2418.
173. Neilson, E. H.; Goodger, J. Q.; Woodrow, I. E.; Moller, B. L. Plant Chemical Defense: At What Cost?Trends Plant Sci. 2013, 18, 250–258.
174. Zagrobelny, M.; et al. Cyanogenic Glucosides and Plant–Insect Interactions. Phytochemistry 2004, 65, 293–306.
175. Elshahawi, S. I.; Shaaban, K. A.; Kharel, M. K.; Thorson, J. S. A Comprehensive Review of Glycosylated Bacterial Natural Products. Chem. Soc. Rev. 2015, 44, 7591–7697.
176. Ding, L.; et al. Systematic Screening and Characterization of Glycosides in Tobacco Leaves by Liquid Chromatography With Atmospheric Pressure Chemical Ionization Tandem
Mass Spectrometry Using Neutral Loss Scan and Product Ion Scan. J. Sep. Sci. 2015, 38, 4029–4035.
177. Barnaba, C.; et al. Non-targeted Glycosidic Profiling of International Wines Using Neutral Loss-High Resolution Mass Spectrometry. J. Chromatogr. A 2018, 1557, 75–89.
178. Qu, J.; Liang, Q.; Luo, G.; Wang, Y. Screening and Identification of Glycosides in Biological Samples Using Energy-Gradient Neutral Loss Scan and Liquid Chromatography
Tandem Mass Spectrometry. Anal. Chem. 2004, 76, 2239–2247.
179. Cabrera, G. M. Mass Spectrometry in the Structural Elucidation of Natural Products: Glycosides. Phytochemistry 2006, 1–22.
180. Hooi Poay, T.; Sui Kiong, L.; Cheng Hock, C. Characterisation of Galloylated Cyanogenic Glucosides and Hydrolysable Tannins From Leaves of Phyllagathis rotundifolia by LC-ESI-
MS/MS. Phytochem. Anal. 2011, 22, 516–525.
181. Kohls, S.; Scholz-Bottcher, B. M.; Teske, J.; Zark, P.; Rullkotter, J. Cardiac Glycosides From Yellow Oleander (Thevetia peruviana) Seeds. Phytochemistry 2012, 75, 114–127.
182. Domon, B.; Costello, C. E. A Systematic Nomenclature for Carbohydrate Fragmentations in FAB-MS/MS Spectra of Glycoconjugates. Glycoconj. J. 1988, 5, 397–409.
183. Süssmuth, R. D.; Mainz, A. Nonribosomal Peptide Synthesis-Principles and Prospects. Angew. Chem. Int. Ed. Engl. 2017, 56, 3770–3821.
184. Finking, R.; Marahiel, M. A. Biosynthesis of Nonribosomal Peptides1. Annu. Rev. Microbiol. 2004, 58, 453–488.
185. Bloudoff, K.; Schmeing, T. M. Structural and Functional Aspects of the Nonribosomal Peptide Synthetase Condensation Domain Superfamily: Discovery, Dissection and Diversity.
Biochim. Biophys. Acta Proteins Proteomics 2017, 1865, 1587–1604.
186. Ibrahim, A.; et al. Dereplicating Nonribosomal Peptides Using an Informatic Search Algorithm for Natural Products (iSNAP) Discovery. Proc. Natl. Acad. Sci. U. S. A. 2012, 109,
19196–19201.
187. Caboche, S.; Leclère, V.; Pupin, M.; Kucherov, G.; Jacques, P. Diversity of Monomers in Nonribosomal Peptides: Towards the Prediction of Origin and Biological Activity.
J. Bacteriol. 2010, 192, 5143–5150.
304 Mass Spectrometry for Natural Product Discovery
188. Awan, A. R.; et al. Biosynthesis of the Antibiotic Nonribosomal Peptide Penicillin in Baker’s Yeast. Nat. Commun. 2017, 8, 15202.
189. Müller, S.; et al. Paenilamicin: Structure and Biosynthesis of a Hybrid Nonribosomal Peptide/Polyketide Antibiotic From the Bee Pathogen Paenibacillus larvae. Angew. Chem. Int.
Ed. Engl. 2014, 53, 10821–10825.
190. Ng, J.; et al. Dereplication and de Novo Sequencing of Nonribosomal Peptides. Nat. Methods 2009, 6, 596–599.
191. Caboche, S.; et al. NORINE: A Database of Nonribosomal Peptides. Nucleic Acids Res. 2008, 36, D326–D331.
192. Aniszewski, T. Alkaloids: Chemistry, Biology, Ecology, and Applications; Elsevier, 2015.
193. Khazir, J.; Mir, B. A.; Pilcher, L.; Riley, D. L. Role of Plants in Anticancer Drug Discovery. Phytochem. Lett. 2014, 7, 173–181.
194. Ptak, A.; et al. LCMS and GCMS for the Screening of Alkaloids in Natural and In Vitro Extracts of Leucojum aestivum. J. Nat. Prod. 2009, 72, 142–147.
195. Cardoso-Lopes, E. M.; et al. Alkaloids From Stems of Esenbeckia leiocarpa Engl. (Rutaceae) as Potential Treatment for Alzheimer Disease. Molecules 2010, 15, 9205–9213.
196. Xia, Y.; Xu, M.; Alexander, R. R.; Bernert, J. T. Measurement of Nicotine, Cotinine and Trans-30 -Hydroxycotinine in Meconium by Liquid Chromatography-Tandem Mass
Spectrometry. J. Chromatogr. B Analyt. Technol. Biomed. Life Sci. 2011, 879, 2142–2148.
197. Petruczynik, A. Analysis of Alkaloids From Different Chemical Groups by Different Liquid Chromatography Methods. Cent. Eur. J. Chem. 2012, 10, 802–835.
198. Bagheri, M.; et al. Evaluation of Hydrophilic Interaction Liquid Chromatography Stationary Phases for Analysis of Opium Alkaloids. J. Chromatogr. A 2017, 1511, 77–84.
199. Acevska, J.; et al. Chemometric Approach for the Development, Optimization and Validation of Hilic Methods Used for the Determination of Alkaloids From Poppy Straw. Maced.
J. Chem. Chem. Eng. 2014, 33, 73–83.
200. Nagy, J.; Veress, T. HPLC Analysis of Hallucinogenic Mushroom Alkaloids (Psilocin and Psilocybin) Applying Hydrophilic Interaction Chromatography (HILIC). J. Forensic Res. 2016, 7, 6.
201. Long, Z.; Guo, Z.; Xue, X.; Zhang, X.; Liang, X. Two-Dimensional Strong Cation Exchange/Positively Charged Reversed-Phase Liquid Chromatography for Alkaloid Analysis and
Purification. J. Sep. Sci. 2013, 36, 3845–3852.
202. Ma, Y. L.; Li, Q. M.; Van den Heuvel, H.; Claeys, M. Characterization of Flavone and Flavonol Aglycones by Collision-Induced Dissociation Tandem Mass Spectrometry. Rapid
Commun. Mass Spectrom. 1997, 11, 1357–1364.
203. Panche, A. N.; Diwan, A. D.; Chandra, S. R. Flavonoids: An Overview. J. Nutr. Sci. 2016, 5, e47.
204. de Rijke, E.; et al. Analytical Separation and Detection Methods for Flavonoids. J. Chromatogr. A 2006, 1112, 31–63.
205. Lin, L. Z.; Harnly, J. M. A Screening Method for the Identification of Glycosylated Flavonoids and Other Phenolic Compounds Using a Standard Analytical Approach for All Plant
Materials. J. Agric. Food Chem. 2007, 55, 1084–1096.
206. Liu, E.-H.; et al. Advances of Modern Chromatographic and Electrophoretic Methods in Separation and Analysis of Flavonoids. Molecules 2008, 13, 2521–2544.
207. De Villiers, A.; Venter, P.; Pasch, H. Recent Advances and Trends in the Liquid-Chromatography-Mass Spectrometry Analysis of Flavonoids. J. Chromatogr. A 2015, 1430,
16–78.
208. Feng, W.; Hao, Z.; Li, M. Isolation and structure identification of flavonoids. In Flavonoids—From Biosynthesis to Human Health; Justino, G. C., Ed.; InTech, 2017.
209. Mabry, T. J.; Markham, K. R.; Thomas, M. B. The Systematic Identification of Flavonoids; Springer Science & Business Media, 1970.
210. de Rijke, E.; Zappey, H.; Ariese, F.; Gooijer, C.; Brinkman, U. A. T. Liquid Chromatography With Atmospheric Pressure Chemical Ionization and Electrospray Ionization Mass
Spectrometry of Flavonoids With Triple-Quadrupole and Ion-Trap Instruments. J. Chromatogr. A 2003, 984, 45–58.
211. Fabre, N.; Rustan, I.; de Hoffmann, E.; Quetin-Leclercq, J. Determination of Flavone, Flavonol, and Flavanone Aglycones by Negative Ion Liquid Chromatography Electrospray Ion
Trap Mass Spectrometry. J. Am. Soc. Mass Spectrom. 2001, 12, 707–715.
212. Cavaliere, C.; Foglia, P.; Pastorini, E.; Samperi, R.; Laganà, A. Identification and Mass Spectrometric Characterization of Glycosylated Flavonoids in Triticum Durum Plants by
High-Performance Liquid Chromatography With Tandem Mass Spectrometry. Rapid Commun. Mass Spectrom. 2005, 19, 3143–3158.
213. Ma, Y. L.; Vedernikova, I.; Van den Heuvel, H.; Claeys, M. Internal Glucose Residue Loss in Protonated O-Diglycosyl Flavonoids upon Low-Energy Collision-Induced Dissociation.
J. Am. Soc. Mass Spectrom. 2000, 11, 136–144.
214. Akimoto, N.; et al. FlavonoidSearch: A System for Comprehensive Flavonoid Annotation by Mass Spectrometry. Sci. Rep. 2017, 7, 1243.
215. Pichersky, E.; Raguso, R. A. Why Do Plants Produce So Many Terpenoid Compounds?New Phytol. 2018, 220, 692–702.
216. Jiang, Z.; Kempinski, C.; Chappell, J. Extraction and Analysis of Terpenes/Terpenoids. Curr. Protoc. Plant Biol. 2016, 1, 345–358.
217. Harman-Ware, A. E.; Sykes, R.; Peter, G. F.; Davis, M. Determination of Terpenoid Content in Pine by Organic Solvent Extraction and Fast-GC Analysis. Front. Energy Res. 2016,
4.
218. Zhao, T.; et al. The Influence of Ceratocystis Polonica Inoculation and Methyl Jasmonate Application on Terpene Chemistry of Norway Spruce, Picea abies. Phytochemistry 2010,
71, 1332–1341.
219. Di Carro, M.; Ianni, C.; Magi, E. Determination of Terpenoids in Plant Leaves by GC-MS: Development of the Method and Application to Ocimum basilicum and Nicotiana
langsdorffii. Anal. Lett. 2013, 46, 630–639.
220. Dziadas, M.; Jelen, H. H. Analysis of Terpenes in White Wines Using SPE-SPME-GC/MS Approach. Anal. Chim. Acta 2010, 677, 43–49.
221. Sun, Y.; Li, W.; Fitzloff, J. F.; van Breemen, R. B. Liquid Chromatography/Electrospray Tandem Mass Spectrometry of Terpenoid Lactones in Ginkgo biloba. J. Mass Spectrom.
2005, 40, 373–379.
222. Albrecht, C. F.; et al. LC–MS-Based Metabolomics Assists With Quality Assessment and Traceability of Wild and Cultivated Plants of Sutherlandia frutescens (Fabaceae). S. Afr.
J. Bot. 2012, 82, 33–45.
223. Caprioli, R. M.; Farmer, T. B.; Gile, J. Molecular Imaging of Biological Samples: Localization of Peptides and Proteins Using MALDI-TOF MS. Anal. Chem. 1997, 69,
4751–4760.
224. Ryan, D. J.; Spraggins, J. M.; Caprioli, R. M. Protein Identification Strategies in MALDI Imaging Mass Spectrometry: A Brief Review. Curr. Opin. Chem. Biol. 2019, 48, 64–72.
225. Silva, L. P.; Northen, T. R. Exometabolomics and MSI: Deconstructing How Cells Interact to Transform Their Small Molecule Environment. Curr. Opin. Biotechnol. 2015, 34,
209–216.
226. Watrous, J. D.; Dorrestein, P. C. Imaging Mass Spectrometry in Microbiology. Nat. Rev. Microbiol. 2011, 9, 683–694.
227. de Raad, M.; Northen, T. R.; Bowen, B. P. Analysis and Interpretation of Mass Spectrometry Imaging Datasets. Compr. Anal. Chem. 2018, 82, 369–386.
228. Rübel, O.; et al. OpenMSI: A High-Performance Web-Based Platform for Mass Spectrometry Imaging. Anal. Chem. 2013, 85, 10354–10361.
229. de Raad, M.; et al. OpenMSI Arrayed Analysis Toolkit: Analyzing Spatially Defined Samples Using Mass Spectrometry Imaging. Anal. Chem. 2017, 89, 5818–5823.
230. Araújo, F. D. S.; et al. Desorption Electrospray Ionization Mass Spectrometry Imaging Reveals Chemical Defense of Burkholderia seminalis Against Cacao Pathogens. RSC Adv.
2017, 7, 29953–29958.
231. Lane, A. L.; et al. Desorption Electrospray Ionization Mass Spectrometry Reveals Surface-Mediated Antifungal Chemical Defense of a Tropical Seaweed. Proc. Natl. Acad. Sci.
U. S. A. 2009, 106, 7314–7319.
232. Mohana Kumara, P.; Srimany, A.; Ravikanth, G.; Uma Shaanker, R.; Pradeep, T. Ambient Ionization Mass Spectrometry Imaging of Rohitukine, a Chromone Anti-Cancer Alkaloid,
During Seed Development in Dysoxylum binectariferum Hook.f (Meliaceae). Phytochemistry 2015, 116, 104–110.
233. Watrous, J.; et al. Mass Spectral Molecular Networking of Living Microbial Colonies. Proc. Natl. Acad. Sci. U. S. A. 2012, 109, E1743–E1752.
234. Hsu, C.-C.; et al. Real-Time Metabolomics on Living Microorganisms Using Ambient Electrospray Ionization Flow-Probe. Anal. Chem. 2013, 85, 7014–7018.
235. Li, H.; Balan, P.; Vertes, A. Molecular Imaging of Growth, Metabolism, and Antibiotic Inhibition in Bacterial Colonies by Laser Ablation Electrospray Ionization Mass Spectrometry.
Angew. Chem. Int. Ed. 2016, 55, 15035–15039.
236. Etalo, D. W.; De Vos, R. C. H.; Joosten, M. H. A. J.; Hall, R. D. Spatially Resolved Plant Metabolomics: Some Potentials and Limitations of Laser-Ablation Electrospray Ionization
Mass Spectrometry Metabolite Imaging. Plant Physiol. 2015, 169, 1424–1435.
237. Vaikkinen, A.; et al. Laser Ablation Atmospheric Pressure Photoionization Mass Spectrometry Imaging of Phytochemicals From Sage Leaves. Rapid Commun. Mass Spectrom.
2014, 28, 2490–2496.
Mass Spectrometry for Natural Product Discovery 305
238. Boya, C. A. P.; et al. Imaging Mass Spectrometry and MS/MS Molecular Networking Reveals Chemical Interactions Among Cuticular Bacteria and Pathogenic Fungi Associated
With Fungus-Growing Ants. Sci. Rep. 2017, 7, 5604.
239. Brentan Silva, D.; et al. Direct Analyses of Secondary Metabolites by Mass Spectrometry Imaging (MSI) From Sunflower (Helianthus annuus L.) Trichomes. Molecules 2017, 22,
774.
240. Hölscher, D.; et al. Matrix-Free UV-Laser Desorption/Ionization (LDI) Mass Spectrometric Imaging at the Single-Cell Level: Distribution of Secondary Metabolites of Arabidopsis
thaliana and Hypericum Species. Plant J. 2009, 60, 907–918.
241. Hölscher, D.; et al. Phenalenone-Type Phytoalexins Mediate Resistance of Banana Plants (Musa spp.) to the Burrowing Nematode Radopholus similis. Proc. Natl. Acad. Sci. U. S.
A. 2014, 111, 105–110.
242. Kroiss, J.; et al. Symbiotic Streptomycetes Provide Antibiotic Combination Prophylaxis for Wasp Offspring. Nat. Chem. Biol. 2010, 6, 261–263.
243. Rudd, D.; Benkendorff, K.; Voelcker, N. H. Solvent Separating Secondary Metabolites Directly From Biosynthetic Tissue for Surface-Assisted Laser Desorption Ionisation Mass
Spectrometry. Mar. Drugs 2015, 13, 1410–1431.
244. Louie, K. B.; et al. Mass Spectrometry Imaging for In Situ Kinetic Histochemistry. Sci. Rep. 2013, 3, 1656.
245. Louie, K. B.; et al. Replica-Extraction-Transfer’ Nanostructure-Initiator Mass Spectrometry Imaging of Acoustically Printed Bacteria. Anal. Chem. 2013, 85, 10856–10862.
246. Patti, G. J.; et al. Detection of Carbohydrates and Steroids by Cation-Enhanced Nanostructure-Initiator Mass Spectrometry (NIMS) for Biofluid Analysis and Tissue Imaging. Anal.
Chem. 2010, 82, 121–128.
247. Debois, D.; et al. In Situ Localisation and Quantification of Surfactins in a Bacillus subtilis Swarming Community by Imaging Mass Spectrometry. Proteomics 2008, 8,
3682–3691.
248. Vaidyanathan, S.; et al. Subsurface Biomolecular Imaging of Streptomyces coelicolor Using Secondary Ion Mass Spectrometry. Anal. Chem. 2008, 80, 1942–1951.
249. Dunham, S. J. B.; et al. Quantitative SIMS Imaging of Agar-Based Microbial Communities. Anal. Chem. 2018, 90, 5654–5663.
250. Nihorimbere, V.; et al. Impact of Rhizosphere Factors on Cyclic Lipopeptide Signature From the Plant Beneficial Strain Bacillus amyloliquefaciens S499. FEMS Microbiol. Ecol.
2012, 79, 176–191.
251. Parrot, D.; Papazian, S.; Foil, D.; Tasdemir, D. Imaging the Unimaginable: Desorption Electrospray Ionization—Imaging Mass Spectrometry (DESI-IMS) in Natural Product
Research. Planta Med. 2018, 84, 584–593.
252. Bartels, B.; Svatoš, A. Spatially Resolved In Vivo Plant Metabolomics by Laser Ablation-Based Mass Spectrometry Imaging (MSI) Techniques: LDI-MSI and LAESI. Front. Plant Sci.
2015, 6, 1–7.
253. Gamble, L. J.; Anderton, C. R. Secondary Ion Mass Spectrometry Imaging of Tissues, Cells, and Microbial Systems. Microscopy Today 2016, 24, 24–31.
254. Kompauer, M.; Heiles, S.; Spengler, B. Atmospheric Pressure MALDI Mass Spectrometry Imaging of Tissues and Cells at 1.4-mm Lateral Resolution. Nat. Methods 2017, 14,
90–96.
255. Bouslimani, A.; Sanchez, L. M.; Garg, N.; Dorrestein, P. C. Mass Spectrometry of Natural Products: Current, Emerging and Future Technologies. Nat. Prod. Rep. 2014, 31,
718–729.
256. Sidebottom, A. M.; Carlson, E. E. A Reinvigorated Era of Bacterial Secondary Metabolite Discovery. Curr. Opin. Chem. Biol. 2015, 24, 104–111.
257. Li, B.; Comi, T. J.; Si, T.; Dunham, S. J. B.; Sweedler, J. V. A One-Step Matrix Application Method for MALDI Mass Spectrometry Imaging of Bacterial Colony Biofilms. J. Mass
Spectrom. 2016, 51, 1030–1035.
258. Traxler, M. F.; Watrous, J. D.; Alexandrov, T.; Dorrestein, P. C.; Kolter, R. Interspecies Interactions Stimulate Diversification of the Streptomyces Coelicolor Secreted Metabolome.
MBio 2013, 4, e00459-13.
259. Fraser, P. D.; Enfissi, E. M. A.; Goodfellow, M.; Eguchi, T.; Bramley, P. M. Metabolite Profiling of Plant Carotenoids Using the Matrix-Assisted Laser Desorption Ionization Time-of-
Flight Mass Spectrometry. Plant J. 2007, 49, 552–564.
260. Manicke, N. E.; Kistler, T.; Ifa, D. R.; Cooks, R. G.; Ouyang, Z. High-Throughput Quantitative Analysis by Desorption Electrospray Ionization Mass Spectrometry. J. Am. Soc. Mass
Spectrom. 2009, 20, 321–325.
261. Xu, F.; et al. A Genetics-Free Method for High-Throughput Discovery of Cryptic Microbial Metabolites. Nat. Chem. Biol. 2019, 15, 161–168.
262. Dias, D. A.; Urban, S.; Roessner, U. A Historical Overview of Natural Products in Drug Discovery. Metabolites 2012, 2, 303–336.
263. Carter, G. T. N. P. MS since 1970: From the Basement to the Bench Top. Nat. Prod. Rep. 2014, 31, 711–717.
264. Butler, M. S. The Role of Natural Product Chemistry in Drug Discovery. J. Nat. Prod. 2004, 67, 2141–2153.
265. Shen, B. A. New Golden Age of Natural Products Drug Discovery. Cell 2015, 163, 1297–1300.
266. Oppong-Danquah, E.; Parrot, D.; Blümel, M.; Labes, A.; Tasdemir, D. Molecular Networking-Based Metabolome and Bioactivity Analyses of Marine-Adapted Fungi Co-cultivated
With Phytopathogens. Front. Microbiol. 2018, 9, 2072.
267. Newman, D. Screening and Identification of Novel Biologically Active Natural Compounds. F1000Res 2017, 6, 783.
268. Jacyna, J.; Kordalewska, M.; Markuszewski, M. J. Design of Experiments in Metabolomics-Related Studies: An Overview. J. Pharm. Biomed. Anal. 2019, 164, 598–606.
269. Beckett, C.; Eriksson, L.; Johansson, E.; Wikström, C. Multivariate data analysis (MVDA). In Pharmaceutical Quality by Design; Schlindwein, W. S., Gibson, M., Eds.; John
Wiley & Sons, Ltd, 2018; Vol. 1, pp 201–225.
270. Sung, A. A.; Gromek, S. M.; Balunas, M. J. Upregulation and Identification of Antibiotic Activity of a Marine-Derived Streptomyces sp. Via Co-cultures With Human Pathogens.
Mar. Drugs 2017, 15, E250.
271. Rinkel, J.; Dickschat, J. S. Recent Highlights in Biosynthesis Research Using Stable Isotopes. Beilstein J. Org. Chem. 2015, 11, 2493–2508.
272. Fischer, C. R.; Wilmes, P.; Bowen, B. P.; Northen, T. R.; Banfield, J. F. Deuterium-Exchange Metabolomics Identifies N-Methyl Lyso Phosphatidylethanolamines as Abundant
Lipids in Acidophilic Mixed Microbial Communities. Metabolomics 2012, 8, 566–578.
273. Ruttkies, C.; et al. Supporting Non-target Identification by Adding Hydrogen Deuterium Exchange MS/MS Capabilities to MetFrag. Anal. Bioanal. Chem. 2019, 411, 4683–4700.
274. Little, J. L.; Cleven, C. D.; Brown, S. D. Identification of ‘Known Unknowns’ Utilizing Accurate Mass Data and Chemical Abstracts Service Databases. J. Am. Soc. Mass Spectrom.
2011, 22, 348–359.
275. Xu, Y.-F.; Lu, W.; Rabinowitz, J. D. Avoiding Misannotation of in-Source Fragmentation Products as Cellular Metabolites in Liquid Chromatography-Mass Spectrometry-Based
Metabolomics. Anal. Chem. 2015, 87, 2273–2281.
276. Hummel, J.; Selbig, J.; Walther, D.; Kopka, J. The Golm Metabolome database: A database for GC-MS based metabolite profiling. In Metabolomics: A Powerful Tool in Systems
Biology; Nielsen, J., Jewett, M. C., Eds.; Springer Berlin Heidelberg, 2007; pp 75–95.
277. Laatsch, H. AntiBase: The Natural Compound Identifier; Wiley-VCH, 2017.
278. National Institute of Advanced Industrial Science and Technology, Spectral Database for Organic Compounds SDBS. SDBSWeb; https://sdbs.db.aist.go.jp.
279. Sawada, Y.; et al. RIKEN Tandem Mass Spectral Database (ReSpect) for Phytochemicals: A Plant-Specific MS/MS-Based Data Resource and Database. Phytochemistry 2012,
82, 38–45.
280. Wishart, D. S.; et al. HMDB 4.0: The Human Metabolome Database for 2018. Nucleic Acids Res. 2018, 46, D608–D617.
281. Horai, H.; et al. MassBank: A Public Repository for Sharing Mass Spectral Data for Life Sciences. J. Mass Spectrom. 2010, 45, 703–714.
282. Guijas, C.; et al. METLIN: A Technology Platform for Identifying Knowns and Unknowns. Anal. Chem. 2018, 90, 3156–3164.
283. Smith, C. A.; et al. METLIN: A Metabolite Mass Spectral Database. Ther. Drug Monit. 2005, 27, 747–751.
284. Cuthbertson, D. J.; et al. Accurate Mass-Time Tag Library for LC/MS-Based Metabolite Profiling of Medicinal Plants. Phytochemistry 2013, 91, 187–197.
285. Ramirez-Gaona, M.; et al. YMDB 2.0: A Significantly Expanded Version of the Yeast Metabolome Database. Nucleic Acids Res. 2017, 45, D440–D445.
286. Wishart, D. S.; et al. DrugBank 5.0: A Major Update to the DrugBank Database for 2018. Nucleic Acids Res. 2018, 46, D1074–D1082.
287. Sajed, T.; et al. ECMDB 2.0: A Richer Resource for Understanding the Biochemistry of E. coli. Nucleic Acids Res. 2015, 44, D495–D501.
306 Mass Spectrometry for Natural Product Discovery
288. Sud, M.; Fahy, E.; Cotter, D.; Dennis, E. A.; Subramaniam, S. LIPID MAPS-Nature Lipidomics Gateway: An Online Resource for Students and Educators Interested in Lipids.
J. Chem. Educ. 2012, 89, 291–292.
289. Fahy, E.; et al. LipidFinder on LIPID MAPS: Peak Filtering, MS Searching and Statistical Analysis for Lipidomics. Bioinformatics 2019, 35, 685–687.
290. NIST Office of Data & Informatics, NIST Chemistry WebBook. National Institute of Standards and Technology; 1997. https://webbook.nist.gov/chemistry/.
291. Stein, S. Mass Spectral Reference Libraries: An Ever-Expanding Resource for Chemical Identification. Anal. Chem. 2012, 84, 7274–7282.
292. Vinaixa, M.; et al. Mass Spectral Databases for LC/MS- and GC/MS-Based Metabolomics: State of the Field and Future Prospects. Trends Anal. Chem. 2016, 78, 23–35.
293. Johnson, S. R.; Lange, B. M. Open-Access Metabolomics Databases for Natural Product Research: Present Capabilities and Future Potential. Front. Bioeng. Biotechnol. 2015, 3,
22.
294. Banerjee, P.; et al. Super Natural II—A Database of Natural Products. Nucleic Acids Res. 2015, 43, D935–D939.
295. Caspi, R.; et al. The MetaCyc Database of Metabolic Pathways and Enzymes. Nucleic Acids Res. 2018, 46, D633–D639.
296. Kanehisa, M.; Sato, Y.; Kawashima, M.; Furumichi, M.; Tanabe, M. KEGG as a Reference Resource for Gene and Protein Annotation. Nucleic Acids Res. 2016, 44, D457–D462.
297. Misra, B. B.; Mohapatra, S. Tools and Resources for Metabolomics Research Community: A 2017–2018 Update. Electrophoresis 2019, 40, 227–246.
298. Bowen, B. P.; Northen, T. R. Dealing With the Unknown: Metabolomics and Metabolite Atlases. J. Am. Soc. Mass Spectrom. 2010, 21, 1471–1476.
299. Yao, Y.; et al. Analysis of Metabolomics Datasets With High-Performance Computing and Metabolite Atlases. Metabolites 2015, 5, 431–442.
300. Tawfike, A. F.; Viegelmann, C.; Edrada-Ebel, R. Metabolomics and Dereplication Strategies in Natural Products. Methods Mol. Biol. 2013, 1055, 227–244.
301. Kind, T.; Fiehn, O. Strategies for Dereplication of Natural Compounds Using High-Resolution Tandem Mass Spectrometry. Phytochem. Lett. 2017, 21, 313–319.
302. Bach, E.; Szedmak, S.; Brouard, C.; Böcker, S.; Rousu, J. Liquid-Chromatography Retention Order Prediction for Metabolite Identification. Bioinformatics 2018, 34, i875–i883.
303. Kind, T.; et al. FiehnLib: Mass Spectral and Retention Index Libraries for Metabolomics Based on Quadrupole and Time-of-Flight Gas Chromatography/Mass Spectrometry. Anal.
Chem. 2009, 81, 10038–10048.
304. Mohimani, H.; et al. Dereplication of Microbial Metabolites Through Database Search of Mass Spectra. Nat. Commun. 2018, 9, 4035.
305. Yang, J. Y.; et al. Molecular Networking as a Dereplication Strategy. J. Nat. Prod. 2013, 76, 1686–1699.
306. Blaženovic, I.; et al. Comprehensive Comparison of In Silico MS/MS Fragmentation Tools of the CASMI Contest: Database Boosting Is Needed to Achieve 93% Accuracy.
J. Chem. 2017, 9, 32.
307. Wolf, S.; Schmidt, S.; Müller-Hannemann, M.; Neumann, S. In Silico Fragmentation for Computer Assisted Identification of Metabolite Mass Spectra. BMC Bioinformatics 2010,
11, 148.
308. Böcker, S.; Dührkop, K. Fragmentation Trees Reloaded. J. Chem. 2016, 8, 5.
309. Vaniya, A.; Fiehn, O. Using Fragmentation Trees and Mass Spectral Trees for Identifying Unknown Compounds in Metabolomics. Trends Anal. Chem. 2015, 69, 52–61.
310. Wu, Q. Q. Multistage Accurate Mass Spectrometry: A ‘Basket in a Basket’ Approach for Structure Elucidation and Its Application to a Compound From Combinatorial Synthesis.
Anal. Chem. 1998, 70, 865–872.
311. Kind, T.; Fiehn, O. Seven Golden Rules for Heuristic Filtering of Molecular Formulas Obtained by Accurate Mass Spectrometry. BMC Bioinformatics 2007, 8, 105.
312. Jensen, P. R. Natural Products and the Gene Cluster Revolution. Trends Microbiol. 2016, 24, 968–977.
313. Medema, M. H.; Fischbach, M. A. Computational Approaches to Natural Product Discovery. Nat. Chem. Biol. 2015, 11, 639–648.
314. Gugisch, R.; et al. MOLGEN 5.0, a molecular structure generator. In Advances in Mathematical Chemistry and Applications, Elsevier, 2015; pp 113–138.
315. Scheubert, K.; Hufsky, F.; Böcker, S. Computational Mass Spectrometry for Small Molecules. J. Chem. 2013, 5, 12.
316. Ma, Y.; Kind, T.; Yang, D.; Leon, C.; Fiehn, O. MS2Analyzer: A Software for Small Molecule Substructure Annotations From Accurate Tandem Mass Spectra. Anal. Chem. 2014,
86, 10724–10731.
317. Duncan, K. R.; et al. Molecular Networking and Pattern-Based Genome Mining Improves Discovery of Biosynthetic Gene Clusters and their Products From Salinispora species.
Chem. Biol. 2015, 22, 460–471.
318. Quinn, R. A.; et al. Molecular Networking as a Drug Discovery, Drug Metabolism, and Precision Medicine Strategy. Trends Pharmacol. Sci. 2017, 38, 143–154.
319. Sumner, L. W.; et al. Proposed Minimum Reporting Standards for Chemical Analysis: Chemical Analysis Working Group (CAWG) Metabolomics Standards Initiative (MSI).
Metabolomics 2007, 3, 211–221.
320. Goodacre, R.; et al. Proposed Minimum Reporting Standards for Data Analysis in Metabolomics. Metabolomics 2007, 3, 231–241.
321. Cheng, F.; Li, W.; Liu, G.; Tang, Y. In Silico ADMET Prediction: Recent Advances, Current Challenges and Future Trends. Curr. Top. Med. Chem. 2013, 13, 1273–1289.
322. Medema, M. H.; et al. Minimum Information About a Biosynthetic Gene Cluster. Nat. Chem. Biol. 2015, 11, 625–631.
323. Dhanasekaran, A. R.; Pearson, J. L.; Ganesan, B.; Weimer, B. C. Metabolome Searcher: A High Throughput Tool for Metabolite Identification and Metabolic Pathway Mapping
Directly From Mass Spectrometry and Using Genome Restriction. BMC Bioinformatics 2015, 16, 62.
324. Li, S.; et al. Predicting Network Activity From High Throughput Metabolomics. PLoS Comput. Biol. 2013, 9, e1003123.
325. Epstein, S. C.; Charkoudian, L. K.; Medema, M. H. A Standardized Workflow for Submitting Data to the Minimum Information About a Biosynthetic Gene Cluster (MIBiG)
Repository: Prospects for Research-Based Educational Experiences. Stand Genomic Sci. 2018, 13, 16.
325a. Erbilgin, O.; et al. MAGI: A Method for Metabolite Annotation and Gene Integration. ACS Chem. Biol 2019. https://doi.org/10.1021/acschembio.8b01107.
326. Caspi, R.; et al. The MetaCyc Database of Metabolic Pathways and Enzymes and the BioCyc Collection of Pathway/Genome Databases. Nucleic Acids Res. 2016, 44,
D471–D480.
327. Morgat, A.; et al. Updates in Rhea—An Expert Curated Resource of Biochemical Reactions. Nucleic Acids Res. 2017, 45, D415–D418.
328. Li, L.; et al. MyCompoundID: Using an Evidence-Based Metabolome Library for Metabolite Identification. Anal. Chem. 2013, 85, 3401–3408.
329. Huan, T.; et al. MyCompoundID MS/MS Search: Metabolite Identification Using a Library of Predicted Fragment-Ion-Spectra of 383,830 Possible Human Metabolites. Anal.
Chem. 2015, 87, 10619–10626.
330. Menikarachchi, L. C.; Hill, D. W.; Hamdalla, M. A.; Mandoiu, I. I.; Grant, D. F. In Silico Enzymatic Synthesis of a 400,000 Compound Biochemical Database for Nontargeted
Metabolomics. J. Chem. Inf. Model. 2013, 53, 2483–2492.
331. Jeffryes, J. G.; et al. MINEs: Open Access Databases of Computationally Predicted Enzyme Promiscuity Products for Untargeted Metabolomics. J. Chem. 2015, 7, 44.
332. Hadadi, N.; Hafner, J.; Shajkofci, A.; Zisaki, A.; Hatzimanikatis, V. ATLAS of Biochemistry: A Repository of All Possible Biochemical Reactions for Synthetic Biology and Metabolic
Engineering Studies. ACS Synth. Biol. 2016, 5, 1155–1166.
333. Duigou, T.; du Lac, M.; Carbonell, P.; Faulon, J.-L. RetroRules: A Database of Reaction Rules for Engineering Biology. Nucleic Acids Res. 2019, 47, D1229–D1235.
334. Kumar, A.; Wang, L.; Ng, C. Y.; Maranas, C. D. Pathway Design Using De Novo Steps Through Uncharted Biochemical Spaces. Nat. Commun. 2018, 9, 184.
335. Delépine, B.; Duigou, T.; Carbonell, P.; Faulon, J.-L. RetroPath2.0: A Retrosynthesis Workflow for Metabolic Engineers. Metab. Eng. 2018, 45, 158–170.
336. Hatzimanikatis, V.; et al. Exploring the Diversity of Complex Metabolic Networks. Bioinformatics 2005, 21, 1603–1609.
337. Li, C.; et al. Computational Discovery of Biochemical Routes to Specialty Chemicals. Chem. Eng. Sci. 2004, 59, 5051–5060.
338. Hattori, M.; Tanaka, N.; Kanehisa, M.; Goto, S. SIMCOMP/SUBCOMP: Chemical Structure Search Servers for Network Analyses. Nucleic Acids Res. 2010, 38, W652–W656.
339. Temperton, B.; Giovannoni, S. J. Metagenomics: Microbial Diversity Through a Scratched Lens. Curr. Opin. Microbiol. 2012, 15, 605–612.
340. Aziz, R. K.; et al. The RAST Server: Rapid Annotations Using Subsystems Technology. BMC Genomics 2008, 9, 75.
341. Wu, C. H.; Huang, H.; Yeh, L.-S. L.; Barker, W. C. Protein Family Classification and Functional Annotation. Comput. Biol. Chem. 2003, 27, 37–47.
342. Johnston, C. W.; et al. An Automated Genomes-to-Natural Products Platform (GNP) for the Discovery of Modular Natural Products. Nat. Commun. 2015, 6, 8421.
343. Medema, M. H.; et al. Pep2Path: Automated Mass Spectrometry-Guided Genome Mining of Peptidic Natural Products. PLoS Comput. Biol. 2014, 10, e1003822.
344. Sévin, D. C.; Fuhrer, T.; Zamboni, N.; Sauer, U. Nontargeted In Vitro Metabolomics for High-Throughput Identification of Novel Enzymes in Escherichia coli. Nat. Methods 2017,
14, 187–194.