Académique Documents
Professionnel Documents
Culture Documents
Journal of Chromatography A
journal homepage: www.elsevier.com/locate/chroma
a r t i c l e i n f o a b s t r a c t
Article history: In this paper is reported the use of the chromatographic profiles of volatiles to determine disease markers
Received 22 November 2012 in plants – in this case, leaves of Eucalyptus globulus contaminated by the necrotroph fungus Teratosphaeria
Received in revised form 3 January 2013 nubilosa. The volatile fraction was isolated by headspace solid phase microextraction (HS-SPME) and
Accepted 4 January 2013
analyzed by comprehensive two-dimensional gas chromatography–fast quadrupole mass spectrome-
Available online 12 January 2013
try (GC × GC–qMS). For the correlation between the metabolic profile described by the chromatograms
and the presence of the infection, unfolded-partial least squares discriminant analysis (U-PLS-DA) with
Keywords:
orthogonal signal correction (OSC) were employed. The proposed method was checked to be independent
Comprehensive two-dimensional gas
chromatography
of factors such as the age of the harvested plants. The manipulation of the mathematical model obtained
Mass spectrometry also resulted in graphic representations similar to real chromatograms, which allowed the tentative iden-
Metabolic profiling tification of more than 40 compounds potentially useful as disease biomarkers for this plant/pathogen
Orthogonal signal correction, Unfolded pair. The proposed methodology can be considered as highly reliable, since the diagnosis is based on the
partial least squares discriminant analysis whole chromatographic profile rather than in the detection of a single analyte.
Solid phase microextraction © 2013 Elsevier B.V. All rights reserved.
0021-9673/$ – see front matter © 2013 Elsevier B.V. All rights reserved.
http://dx.doi.org/10.1016/j.chroma.2013.01.013
L.W. Hantao et al. / J. Chromatogr. A 1279 (2013) 86–91 87
In this context, with the introduction of more modern analytical sections OSC processing prior to unfolded partial least squares-
platforms the study of metabolites – coined metabolome – is being discriminant analysis (U-PLS-DA) will be described as well as some
extensively used for the diagnosis of diseases in plants. However, non-conventional strategies for data interpretation. In addition, a
because of their lack of motility the plants have more pronounced careful inspection and interpretation of biomarkers, as well as their
metabolic responses to stress than mammals [4]. This makes the tentative identification, will be presented.
inspection of the metabolome for the diagnosis of diseases in plants
a particularly interesting and challenging task. 2. Theory
In phytopathology, the volatile and semi-volatile fraction is
frequently assessed and very often terpenes and other classes of The correlation between the chromatograms and presence of
compounds are implicated as potential biomarkers [5]. Hence, the fungal infection on the plants was determined through U-PLS-DA.
use of solid phase microextraction (SPME) is an interesting alterna- PLS is a very useful tool to model dependent variables (denoted
tive since it can be performed in most laboratories without the need by Y) as a function of independent responses (denominated X). In
for specific instrumentation [6]. The feasibility of using SPME for this case, a two-way data set X (I × J), where I corresponds to the
these studies arises from the combination of sampling and sample number of chromatograms and J is the number of scans in the chro-
preparation into a simple and unique step [6]; and under relatively matogram, is decomposed into a score vector t related to the first
mild conditions the isolation of the biogenic volatile organic com- order of the data and in a weight vector w assigned to the sec-
pounds (BVOC) can be performed from the matrix with some degree ond order of the data, as seen in Eqs. (1) and (2). These vectors
of analyte pre-concentration [7,8]. are defined to have the maximum covariance with the dependent
Despite the inherent advantages of SPME for BVOC sampling, variable Y, Eq. (3):
there are essentially two issues that must be met for proper
metabolic profiling. Firstly, due to the amount of sorbent mate-
z
X= tz wzT + E (1)
rial available in the SPME fiber the maximum extracted amount
Z=1
of analytes will be significantly less, when compared to stir-
bar sorptive extraction (SBSE) or needle trap devices (NTD), for
z
instance. Secondly, the structural and functional diversity of the Y= uz qTz + E (2)
volatiles present in the BVOC ranges from aliphatic alcohols, Z=1
aldehydes, methylketones, acids, lactones, esters, terpenoids, fura- ⎡ ⎛ ⎞⎤
nones and pyrones [5]. Consequently, some degree of coelution I J
has to be expected. Hence, comprehensive two-dimensional gas max ⎣cov(tz , y) min ⎝ (xij − tz,i wz,j )⎠ ⎦ (3)
wz
chromatography–mass spectrometry (GC × GC–MS) is probably the i=1 j=1
ideal companion for SPME because it presents higher detectability
– achieved by band compression in the modulator – and sensitivity where tz and uz are scores vector, wz and qz are loading vectors,
offered by the MS. Also, GC × GC–MS provides at least two tunable z is the number of latent variables (LV) and E is the residual matrix
dimensions for separation [9]. Thus, GC × GC has being recognized [16]. In addition, DA-based methods correlate the input data to
as a potential tool for metabolomics. binary or other discrete response sets – i.e., the samples are discrim-
The main advantage of using the metabolic profile obtained inated between two groups separated by a discriminant hyperplane
by SPME and GC × GC–MS is the possibility to diagnose the infec- in the X-space.
tion during the initial stage of the disease. However, many other However, very often much information present in the chro-
sources that can cause stress to the plant will also have an impact matograms is not correlated to the property of interest (Y), because
on the measured chromatographic profile. This fact makes visual several sources contribute to variation present in the experimen-
distinction/identification of biomarkers an extremely difficult and tal measurements. Hence, OSC can be used for preprocessing of
time-consuming task. In this sense, many partial least squares (PLS) the chromatographic data in order to eliminate this non-correlated
based approaches have already been reported for GC × GC data variation from the descriptor variables (X). Briefly, the OSC algo-
[10–13]. Briefly, the use of partial least squares for discriminant rithm uses a PCA/PLS-related solution that removes only so much
analysis (PLS-DA) is an interesting alternative for the differen- of X as is unrelated (orthogonal) to Y, a more comprehensive dis-
tiation of classes based on the metabolic profile, because it is cussion was reported by Wold et al. [17].
able to discriminate the within group variation (12 ) from the
between group variation (22 ) [14]. Hence, if classical principal 3. Materials and methods
component analysis (PCA) was used for the pattern recognition
of metabolic profiles, where 12 > 22 it would potentially fail to Samples. All samples were collected at a commercial forest (Pin-
detect the difference between these classes, while the PLS-DA heiro Machado, RS, Brazil). Leaves from E. globulus, both healthy
approach could be employed successfully [14]. An example of and infected with the fungal pathogen T. nubilosa and from plants
PLS-DA analysis of GC × GC–TOFMS (TOFMS – mass spectrome- with varied ages, were harvested. The detection of the infection, in
try with a time-of-flight mass analyzer) data for metabolomic the E. globulus plants, was performed by the microscopic inspec-
purposes was successfully carried out by Gröger and Zimmer- tion of Mycosphaerella leaf disease’s symptoms [18,19]. Also, the
mann [11]. However, pre-processing techniques such as orthogonal identification, and confirmation, of the pathogen was performed
signal correction (OSC) can also be used for the same purpose, by analyzing its genetic material. The samples were immediately
thus it often provides an optimal platform for PLS modeling frozen with liquid nitrogen (LN2 ) after removal from the plant, to
allowing for better predictive ability and interpretation [15]. cease all physiological processes, and kept under refrigeration (dry
Remarkably, Li et al. [12] reported the use of OSC-PLS-DA for the ice, −78 ◦ C) until analysis.
analysis of GC × GC–TOFMS data to distinguish between healthy Reagents and materials. For the determination of the 1 D lin-
and diabetic patients; and the implication of potentially new ear temperature programmed retention indices (LTPRI), a mix of
biomarkers. C7–C20 n-alkanes was used (Sigma–Aldrich – St. Louis, MO, USA).
In this context, the combination of SPME with GC × GC–qMS For the HS-SPME procedure, glass v-vials (Wheaton Science Prod-
(qMS – mass spectrometry with rapid quadrupolar mass analyzer) ucts – Millville, NJ, USA), magnetic stirrers and appropriate screw
for the metabolic profiling of plants is proposed. So, in the following caps with PTFE/silicon septa (Sigma–Aldrich) were employed, as
88 L.W. Hantao et al. / J. Chromatogr. A 1279 (2013) 86–91
Fig. 2. Illustration of the (A) scores graph for the first and second latent variables; (B) is the predicted response for the samples in the calibration (䊉) and external validation
() data set. The calculated threshold value was of 0.46. A prediction efficiency of 100% was achieved.
in E. globulus is robust to most non-relevant variations, including healthy samples, and above this same value for samples taken from
age. diseased plants. Consequently, it can be seen that for all samples in
Fig. 2(A) presents the score plot from U-PLS-DA modeling. It the calibration and validation set the diagnosis results were consis-
can be readily noticed that the distinction of samples from healthy tent. Thus, the proposed combination of SPME and GC × GC–qMS
E. globulus plants from diseased ones can be performed with the with multivariate analysis can be used as a potential tool for the
information contained in the first LV. In a quantitative sense, a accurate diagnosis of diseases in plants.
threshold value was determined for proper distinction between Additionally, because the information responsible for the differ-
samples taken from healthy and infected plants, at a confidence entiation between healthy and diseased samples from E. globulus
level of 95%. In this sense, in Fig. 2(B) is illustrated a classification was contained in the first LV; a non-conventional method was con-
plot with the predicted values for the E. globulus samples. Through ceived for the analysis of the loadings from this LV. It essentially
the inspection of the calibration set (䊉) and prior knowledge of the consisted of a simple routine that sorts the variables from the load-
samples, it can be observed that a 100% prediction was achieved, as ing by its value. So, variables with positive values were displaced
no sample was misclassified. However, to effectively test the model into a vector l1 while the modules of the negative values were
an external validation set of samples was used. The evaluation of placed into a second vector l2 . However, during the construction
the validation samples () showed that, also, no misclassification of these vectors (l1 and l2 ), whenever the conditional statement
was performed, thus, a prediction efficiency of 100% was obtained. was violated, a value of zero would be placed in the correspond-
Although ideally the fitted prediction values should be zero for ing variable in this new vector. Then, each vector was exported in
healthy samples and one for infected samples, due to presence of ASCII format and the contour plot was constructed using GCImage
non-relevant information a small deviation is to be expected. The software, as illustrated in Fig. 3.
responses provided by the U-PLS-DA model should be compared Remarkably, the proposed visualization of the loadings graph
to the 95% confidence critical cutoff value provided by the algo- allows an easier interpretation of their information because it sorts
rithm. The threshold was determined, by the U-PLS-DA algorithm, the contributions of both classes (healthy and infected samples)
as being equal to 0.46. Hence, the key for the classification by U-PLS- and it resembles a chromatogram – which in turn makes peak
DA is that the predicted value has to be below this 0.46 threshold for assignment a readily accomplishable task. A careful inspection of
Fig. 3. Loadings graph of the first LV. In (A) are illustrated the analytes present in higher intensities in samples from healthy E. globulus; in (B) are the analytes in higher
intensities in the samples from E. globulus infected with T. nubilosa. The tentative identification is illustrated in Table 1.
90 L.W. Hantao et al. / J. Chromatogr. A 1279 (2013) 86–91
Table 1
Tentative identification of the analytes found in the loadings graph. The peak assignment is illustrated in Fig. 3.
Exp. Lit.
M.F., molecular formula; LTPRI, 1 D linear temperature programmed retention indices determined experimentally (exp.) and found in the MS databases (lit.).
a
Analytes whose identity were confirmed with analytical standards.
the peak regions which were implicated, by the model, as potential mass spectral similarity searches and LTPRI comparison. As a
biomarkers showed that the negative values in the original loading result, over 40 biomarkers were accurately determined by the
vector corresponded to the peaks that were in higher intensities proposed model and identified. The tentative identification of the
in the healthy samples [Fig. 3(A)], while the positive values cor- analytes, indicated in Fig. 3(A) and (B), is listed in Table 1.
responded to peaks that were in higher intensities in the infected In addition, to evaluate the role of theses analytes in the defense
samples [Fig. 3(B)]. At first sight, no immediate hypothesis could mechanism of E. globulus, two standard solutions were used in the
be inferred without further knowledge of the system. However, it in vitro assays with spores of the pathogen, T. nubilosa. The first
is known that E. globulus is susceptible to infection by T. nubilosa. solution (#1) consisted of pure ␣-pinene, related to both healthy
Hence, the infection of E. globulus plants by T. nubilosa leads and infected plants (Fig. 3); while the second solution consisted of
to the higher production of the semi-volatile compounds, such namely carvone, ␣-terpineol and geraniol, related to the infected
as sesquiterpenes, when compared to more volatile species, as plants. To evaluate the biological activity of these volatiles, the ger-
terpenes. The last observation could probably be supported by the mination of the spores was analyzed. In Fig. 4(A) is illustrated the
findings reported by Bollina et al. [26] where flavonoids were also germination of the spores in the absence of the analytes in the
implicated as biomarkers; and it is known that heavier terpenoids system’s headspace. In Fig. 4(B) is illustrated the inhibition of the
play an important role in the biological synthesis of flavonoids germination of the pathogen’s spores when the headspace was sat-
[27] and in the defense mechanism [28] of plants. Lastly, most urated with analytes from standard solution #2. In addition, in vitro
analytes were tentatively identified by the combination of the assay with standard solution #1 also inhibited the germination
L.W. Hantao et al. / J. Chromatogr. A 1279 (2013) 86–91 91
Fig. 4. Germination images obtained from the in vitro assays with the pathogen’s spores, T. nubilosa. In (A) is illustrated the growth of the spores with no analytes in the
system’s headspace. In (B) is illustrated the inhibition of the germination of the spores when the headspace was saturated with analytes, from standard solution #2.
of the spores. Consequently, the in vitro assays showed that both References
standard solutions (#1 and 2#) inhibited in 100% the germination
of the spores from T. nubilosa. Hence, it was confirmed that most [1] G.N. Agrios, Plant Pathology, 5th ed., Academic Press, San Diego, 2005.
[2] R. Burns, Plant Pathology: Techniques and Protocols, 1st ed., Humana Press,
likely some terpenes and sesquiterpenes, indicated by the loadings New York, 2008.
graph, are indeed related to the defense mechanism of E. globulus. [3] A.A. Bazzini, C.A. Manacorda, T. Tohge, G. Conti, M.C. Rodriguez, A. Nunes-Nesi,
S. Villanueva, A.R. Fernie, F. Carrari, S. Asurmedi, PLoS One 6 (2011) 1.
[4] A.R. Fernie, Phytochemistry 68 (2007) 2861.
5. Conclusions [5] W. Schwab, R. Davidovich-Rikanati, E. Lewinsohn, Plant J. 54 (2008) 712.
[6] F. Augusto, A.L.P. Valente, Trends Anal. Chem. 21 (2002) 428.
The combination of SPME and GC × GC–qMS was successfully [7] F. Augusto, A. Leite e Lopes, C.A. Zini, Trends Anal. Chem. 22 (2003) 160.
[8] C.A. Zini, F. Augusto, E. Christensen, E.B. Camarão, J. Pawliszyn, J. Agric. Food
used for the assessment of BVOC produced by plants. Additionally, Chem. 50 (2002) 7199.
this combination was successfully employed as a tool for the diag- [9] J.B. Phillips, J. Beens, J. Chromatogr. A 856 (1999) 331.
nosis of T. nubilosa infection in E. globulus. However, the proposed [10] L.A.F. de Godoy, M.P. Pedroso, L.W. Hantao, R.J. Poppi, F. Augusto, Talanta 83
(2011) 1302.
methodology can be extended to most pathosystems available. [11] T. Gröger, R. Zimmermann, Talanta 83 (2011) 1289.
Also, once the OSC-U-PLS-DA model is established, the diagnosis [12] X. Li, Z. Xu, X. Lu, X. Yang, P. Yin, H. Kong, Y. Yu, G. Xu, Anal. Chim. Acta 633
of the disease can be performed within hours after sample collec- (2009) 257.
[13] K.J. Johnson, B.J. Prazen, D.C. Young, R.E. Synovec, J. Sep. Sci. 27 (2004) 410.
tion. In addition, this tool for diagnosis was proven to be robust
[14] A.J. Hobro, J. Kuligowski, M. Döll, B. Lendl, Anal. Bioanal. Chem. 398 (2010) 2713.
against difference in age from the harvested plant material. Even [15] S. Wold, J. Trygg, A. Berglund, H. Antti, Chemom. Intell. Lab. Syst. 58 (2011) 131.
more, the inspection of the loadings graph allowed distinguishing [16] R.G. Brereton, Analyst 125 (2000) 2125.
between BVOC produced in higher amounts by healthy plant from [17] S. Wold, H. Antti, F. Lindgren, J. Öhman, Chemom. Intell. Lab. Syst. 16 (2002)
283.
infected plants. Thus, the proposed model allowed for the tentative [18] G. Pérez, G.C. Hunter, B. Slippers, C. Pérez, B.D. Wingfield, M.J. Wingfield, Eur.
identification of over 40 potential biomarkers and a further under- Plant Pathol. 125 (2009) 109.
standing of defense mechanism of E. globulus during infection by T. [19] P.W. Crous, U. Braun, J.Z. Groenwald, Stud. Mycol. 58 (2007) 1.
[20] M.P. Pedroso, E.C. Ferreira, L.W. Hantao, S. Bogusz Jr., F. Augusto, J. Sep. Sci. 34
nubilosa. (2011) 1547.
[21] S. Bogusz Jr., L.W. Hantao, S.C.G.N. Braga, V.C.R.M. França, M.F. Costa, R.D.
Acknowledgments Hamer, F. Augusto, J. Sep. Sci. 35 (2012) 2438.
[22] G. Tomasi, F. van den Berg, C. Andersson, J. Chemom. 18 (2004) 231.
[23] L.A.F. de Godoy, L.W. Hantao, M.P. Pedroso, R.J. Poppi, F. Augusto, Anal. Chim.
This work was funded by FAPESP (Foundation for Research Sup- Acta 699 (2011) 120.
port of the State of São Paulo) and CNPq (Brazilian National Council [24] M. Barker, W. Rayens, J. Chemom. 17 (2003) 166.
[25] D.L. Massart, B.G.M. Vandeginste, S.N. Deming, Y. Michotte, L. Kaufman, Chemo-
for Research and Technological Development) through the National metrics: A Textbook, 1st ed., Elsevier, Amsterdam, 1988.
Institutes of Science and Technology Program (INCT). Hantao grate- [26] V. Bollina, G.K. Kumaraswamy, C.K. Ajjamada, T.M. Choo, Y. Dion, S. Rioux, D.
fully thanks FAPESP for a doctoral grant. Aleme and Ribeiro thank Faulbert, H. Hamzehzarghani, Mol. Plant Pathol. 11 (2010) 769.
[27] E. Breitmaier, Terpenes: Flavors, Fragrances, Pharmaca, Pheromones, 1st ed.,
CAPES (Brazilian Ministry of Education Agency for Improvement of Wiley-VHC, Weinheim, 2006.
Graduate Personnel) and Passador thanks CNPq for the postdoctoral [28] K.A. Leiss, Y.H. Choi, R. Verpoorte, P.G.L. Klinkhamer, Phytochem. Rev. 10 (2011)
grants. 205.