Académique Documents
Professionnel Documents
Culture Documents
Hypericin-Containing Hypericum
perforatum Leaf Tissues as Revealed by De
Novo Assembly of RNA-Seq
Miroslav Sotk, Odeta Czerankov,
Daniel Klein, Katarna Nigutov, Lothar
Altschmied, Ling Li, Adarsch Jose, Eve
Syrkin Wurtele, et al.
Plant Molecular Biology Reporter
ISSN 0735-9640
Plant Mol Biol Rep
DOI 10.1007/s11105-016-0982-2
1 23
1 23
ORIGINAL PAPER
Eva ellrov
eva.cellarova@upjs.sk
Lothar Altschmied
lothar@ipk-gatersleben.de
Ling Li
liling@iastate.edu
Adarsch Jose
adarshjos@gmail.com
Eve Syrkin Wurtele
mash@iastate.edu
Introduction
The scientific effort to study new bioactive compounds from
medicinal plants is rapidly increasing nowadays. Pure compounds essentially isolated from Hypericum spp., namely,
naphthodianthrones and phloroglucinols, are proven to possess anti-depressive, anti-cancer, anti-viral, anti-inflammatory,
and other activities (Crockett and Robson 2011). Drugs based
on Hypericum perforatum (St. Johns wort) extracts are
among the most widely prescribed pharmaceuticals for depression treatment (Linde and Knuppel 2005; Huang et al.
2011).
H. perforatum leaf contains different secretory structures
such as dark glands, translucent glands, and secretory canals,
similarly to other aerial plant parts, especially flowers. The
dark glands (nodules) accumulate naphtodianthrones
(hypericin and pseudohypericin) including their protoforms
and are localized mostly along the leaf margins (Curtis and
Lersten 1990). These structures are multi-cellular, more nodule-like, and characterized by a cluster of irregularly shaped
specialized cells surrounded by a single- or double-layered
sheath (Ciccarelli et al. 2001). Their secretory activity begins
at early stages of leaf development (Onelli et al. 2002). The
accumulation of naphthodianthrones in the dark glands is
more likely a mechanism to avoid the toxicity of these metabolites toward the adjacent cells (Ciccarelli et al. 2001). The
size and number of these glands correlate positively with the
content of naphtodianthrones. Zobayed et al. (2006) suggested
that the site of the biosynthesis of hypericin is in the dark
glands. Recently, Kusari et al. (2015) analyzed the metabolites
of the leaves by MALDI-HRMS in selected Hypericum spp.
an d re ve aled loc aliza tion of e modi n, h ype ricin ,
pseudohypericin, and their protoforms. Hypericin precursor
emodin was detected throughout the leaf tissue with the
highest concentration in close proximity to the glands.
Hypericin content was proven inside the dark glands but also
around and/or outside the dark glands, particularly around the
bottom of the dark glands (Kusari et al. 2015). This metabolic
study indicates the hypericin biosynthesis is accordingly presented also in dark glands surrounding tissues.
2008). HpPKS2 was found to be an octaketide synthase, specifically expressed in the dark glands accumulating hypericins
(Karppinen et al. 2008).
Next-generation sequencing (NGS), especially RNA-Seq
and de novo transcriptome assembly, as a recently developed
approach for profiling transcriptomes, is the preferred method
to obtain knowledge of genes and their expression in nonmodel organisms (Strickler et al. 2012). RNA-Seq is more
practical than the whole-genome sequencing because of its
cost-effectiveness, high sensitivity, simple sequence annotation, and reduced problems with repetitive sequence elements.
De novo assembly of the sequenced transcriptomes of the
non-model species including H. perforatum allows identification of potential genes associated with the secondary metabolism coding for specific enzymes involved in the respective
metabolic pathways. Despite the global importance of secondary products from H. perforatum and growing number of raw
genomic information provided by different resources, the
knowledge of secondary metabolites and related pathways
remains insufficient. Actually, the main repositories for genomic data of H. perforatum are SRA NCBI containing 37 entries of raw reads. Medicinal Plant Genomic resource provided first assembly and annotation in 2011. The PhytoMetaSyn
Project (www.phytometasyn.ca), dedicated to research on
Experimental Procedures
Plant Material and RNA Isolation
Diploid H. perforatum L. seeds were germinated and the seedlings were cultured in vitro on basal medium containing salts
according to Murashige and Skoog (1962), Gamborgs B5
vitamins (Gamborg et al. 1968), 30 g l1 sucrose, 100 mg l1
myoinositol, 2 mg l1 glycine, and 7 g l1 agar. The pH of the
media was adjusted to 5.6 before autoclaving. The plants were
cultured at 23 C, 40 % relative humidity, under 16/8-h (day/
night) photoperiod and artificial irradiance of
80 mol m2 s1. Leaf tissues from 4-week-old seedlings were
isolated. Each sample contained approximately 10 individual
genetically identical plants representing biological replicates
from one specimen. The dark glands with adjacent leaf tissue
(MSN1, MSN2) and the inner part leaf tissue without dark
glands (MSX1, MSX2) were selected for differential expression analysis (Fig. 2). The samples were processed on ice
under sterile conditions, immediately frozen in liquid nitrogen, and stored at 80 C. Total RNA was extracted in accordance with Spectrum Plant Total RNA Kit (Sigma-Aldrich)
protocol. The samples were homogenized by TissueLyser II
(Qiagen) and placed in denaturing lysis buffer according to the
manufacturers instructions. The quality and quantity of isolated RNA were evaluated on the basis of UVabsorption ratios
(i.e., 260/280 and 260/230 nm) assessed by Nanodrop 2000C
spectrophotometer (Thermo Scientific), and the RNA integrity was tested on 2100 Bioanalyzer (Agilent Technologies).
Transcriptome Sequencing
The messenger RNA (mRNA) from total RNA was enriched
by the use of oligo(dT) magnetic beads for transcriptome analysis. Then, the mRNA was fragmented into short pieces (180
Fig. 2 H. perforatum leaf. (A) Dark glands with adjacent leaf tissue; (B)
the inner part leaf tissue without dark glands
the biological process, molecular function, and cellular component. GO terms taxonomically specified to green plants
(Viridiplantae) were assigned to each query sequence with a
reference to the top Blastx hit against SwissProt database.
Subsequently, to acquire the protein domain information for
the putative sequences and to determine functional motifs, the
embedded feature InterProScan (Zdobnow and Apweiler
2001) was carried out at the default parameters.
InterProScan annotation was also conducted via Blast2GO.
The annotation was refined by running BAugment
Annotation by ANNEX^ function (Myhre et al. 2006).
BValidate annotation^ and BRemove 1st level annotation^
were used to remove all the redundant GO terms for a given
sequence and to assign only the most specific GO terms.
The contigs were searched against the reference canonical
pathways in Kyoto Encyclopedia of Genes and Genomes
(KEGG) to further explore the gene interactions and biological functions in H. perforatum leaves. KEGG PATHWAY
Database (http://www.genome.jp/kegg/pathway.html) is an
important resource for interpreting high-level functions and
utilities of the biological system, such as the cell, the organism, and the ecosystem, from molecular-level information,
especially large-scale molecular datasets generated by genome
sequencing and other high-throughput experimental technologies. KEGG Database represents the collection of manually
drawn pathway maps, and it helps our understanding of the
biological functions of genes (Kanehisa et al. 2014).
into physiological classes to describe large quantity of transcripts and for evaluating functional differences among the
subgroups of sequences. The assembled contigs were used
for similarity searches against public protein databases after
filtering out short-length and redundant sequences. All six
frame translations of the sequences were searched against
the SwissProt protein database using Blastx (E-value 105).
SwissProt database was selected due to our interest in the key
enzymes of polyketide synthesis and the expectation of high
similarity to already verified proteins. Database matches were
found for 66,817 (47.74 %) contigs while 72,999 were blasted
with no hits (Fig. 4). Protein domains and motif information
were retrieved by InterProScan via Blast2GO. The expressed
H. perforatum genes were searched against the GO database
to categorize the standardized gene functions.
Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis was accomplished in order to get an overview of the gene
pathway networks. Annotations were conducted only for
contigs with significant Blast hits below E-value 105, value
of 55 as the annotation cutoff, and value of 5 as the GO
weight. Based on sequence homology searches (Blastx)
against the KEGG database, 45,863 contigs were assigned to
147 metabolic pathways. The most represented pathways
were BPurine metabolism^ (4221 members), BThiamine metabolism (3481 members), BBiosynthesis of antibiotics^ (2173
members), BAminobenzoate degradation^ (1605 members),
BStarch and sucrose metabolism^ (1582 members), and
BDrug metabolism other enzymes^ (1028 members).
Identification of Differentially Expressed Genes Using
RNA-Seq Data
Fig. 7 Blast2GO data annotation score distribution of upregulated genes in the dark glands with adjacent leaf tissue
dark glands with adjacent leaf tissue were found at adjusted P value of <0.05 in 799 sequences (2058 isoforms) (Fig. 6). The Pearson correlation index between
two technical replicates was calculated from raw read
counts for tissues containing nodules (0.973) and for
tissues without nodules (0.978) after adjusting outliers.
Annotation and GO Classification of the Upregulated
Genes in the Dark Glands with Adjacent Leaf Tissue
The upregulated sequences were annotated against the
NCBI non-redundant protein (NR) database and
SwissProt database with threshold E-value 10 5 .
Blast2GO assigned GO terms using a pro-SimilarityHit-Filter value of 15, an annotation cutoff value of
55, and a GO weight value of 5. Approximately 89 %
of the 799 upregulated genes had a Blast hit against the
NR NCBI database (Fig. 7) and 75.95 % against the
SwissProt database. The upregulated genes were further
used for the examination of GO and pathway analysis
with Blast2GO and grouped into 47 GO subcategories
(Fig. 8).
KEGG analysis based on SwissProt annotation revealed that the upregulated contigs were highly associated with metabolism of purines, thiamine, starch and
sucrose, drugs, and biosynthesis of antibiotics (Table 1).
Pathway
Sequences
Purine metabolism
163
Thiamine metabolism
155
65
59
Biosynthesis of antibiotics
Aminobenzoate degradation
51
38
30
29
29
27
26
Glycolysis/gluconeogenesis
26
Phenylpropanoid biosynthesis
Glycerolipid metabolism
26
24
21
20
19
Galactose metabolism
Fructose and mannose metabolism
Tryptophan metabolism
Retinol metabolism
17
17
16
16
Lysine degradation
Arachidonic acid metabolism
Methane metabolism
Pyrimidine metabolism
Pentose phosphate pathway
15
14
14
14
13
Glycerophospholipid metabolism
Cyanoamino acid metabolism
Glyoxylate and dicarboxylate metabolism
Amino sugar and nucleotide sugar metabolism
Flavonoid biosynthesis
13
13
13
12
12
Pyruvate metabolism
Arginine and proline metabolism
Valine, leucine, and isoleucine degradation
Selenocompound metabolism
Tyrosine metabolism
11
10
10
10
10
Seq name
Seq description
Seq
Min E-value
length
Mean
GO terms
similarity
TR25393|c0_
g2_i1
Benzophenone synthase
780
3.33153E-157 86.6 %
TR56958|c0_
g1_i1
Benzophenone synthase
1014
4.68185E-160 86.0 %
TR56958|c0_
g1_i2
Benzophenone synthase
767
5.32363E-120 87.6 %
402
1.42911E-85
824
1.16869E-161 91.4 %
865
5.08618E-161 90.8 %
827
9.13773E-124 83.4 %
489
1.43533E-94
88.6 %
TR37481|c0_
g1_i1
Polyketide synthase
1680
0.0
87.8 %
TR59463|c0_
g1_i1
Putatative chalcone
synthase
361
3.26544E-19
87.6 %
TR5576|c0_
g1_i1
Vinorine synthase-like
1595
8.65539E-111 59.6 %
1420
7.45769E-168 83.8 %
1424
3.49132E-155 83.8 %
254
4.97738E-8
78.4 %
322
9.26818E-8
78.2 %
1229
0.0
90.6 %
TR96435|c0_
g1_i3
3-ketoacyl-synthase 10
2090
0.0
90.6 %
TR96435|c0_
g1_i1
TR14589|c0_
g1_i1
TR82074|c0_
g1_i1
TR96435|c0_
g1_i4
0.0
91.2 %
3-ketoacyl-synthase 11
318
7.05499E-68
99.0 %
3-ketoacyl-synthase 11
333
1.25091E-14
94.6 %
Beta-ketoacyl-synthase
family protein
1989
0.0
91.2 %
89.8 %
Seq description
Seq
Min E-value
length
Mean
GO terms
similarity
long-chain fatty acid metabolic process; P:fatty acid biosynthetic
process; P:cuticle development
TR61322|c1_
g1_i7
Cyclopropane-fatty-acylphospholipid synthase
family protein
3241
0.0
90.8 %
TR61322|c1_
g1_i10
Cyclopropane-fatty-acylphospholipid synthase
family protein
4171
0.0
90.2 %
found in the dataset of DEGs with potential to catalyze condensation or cyclization reactions. Two decarboxylases were
found, namely, lysine decarboxylase family protein isoform 2
and aromatic-amino-acid decarboxylase-like protein supposedly without relation to hypericin biosynthesis. Emodin
anthrone further oxidizes to emodin. Blast2GO identified only
one (five isoforms) aldehyde dehydrogenase enzyme belonging to oxidoreductases and acting on aldehyde or oxo group.
Emodin-anthrone-oxygenase (Chen et al. 1995) was not found
to be differentially expressed. For the conversion of emodin to
hypericin, an initial condensation reaction between emodin
and emodin anthrone takes place, followed by dehydration
to form emodin dianthrone, which may undergo oxidation to
form protohypericin and finally hypericin by irradiation with
visible light (Zobayed et al. 2006; Huang et al. 2014).
Plant secondary products can be induced by elicitors, such
as jasmonic acid and its analogs, via the jasmonate pathway,
involved in the plant stress defense system (Zhao et al. 2005).
Hypericins are considered to be involved in the chemical defense arsenal of plant against herbivores and plant pathogens
while they can be increased in the presence of exogenous
methyl jasmonate (Sirvent and Gibson 2002). According to
GO terms linked to stress and defense, 32 DEGs were computed with functional similarities (Table 3). Three transcripts
(four isoforms) were recognized as phenolic oxidative coupling protein. He et al. (2012) also revealed in complex transcriptome 12 contigs homologous to hyp-1 and by the use of
PKSIIIexplorer 2291 contigs belonging to PKS III proteins.
The gene named hyp-1 (phenolic oxidative coupling protein) has been originally proposed as a gene with role in the
final steps of hypericin biosynthesis, possibly a condensation
followed by dehydration and two phenolic oxidative coupling
reactions (Bais et al. 2003). However, this presumption has
not been later proved. The highest expression level of hyp-1
was found in roots, which contain neither dark glands nor
hypericin (Kouth et al. 2007). The gene was also expressed
in 15 different Hypericum species regardless of whether
hypericins and emodin were detected in the plants (Kouth
et al. 2011). The Hyp-1 protein shares about 50 % sequence
similarity with pathogenesis-related proteins (PR-10) and has
Conclusion
Knowledge on the biosynthetic pathways of unique metabolites is fundamental for their biotechnological commercial
production. Specialized plant metabolites have often complex
biosynthetic pathways, and it is challenging to identify all of
the enzymes that catalyze the numerous metabolic transformations. Our understanding of the biochemical pathways in
H. perforatum L. that synthesize the commercially relevant
Stress-related contigs
Seq name
Seq description
Seq
Min. E-value
length
Mean
GO terms
similarity
TR24220|c0_
g1_i1
TR82269|c0_
g1_i1
TR93881|c0_
g1_i1
TR93881|c0_
g1_i2
TR1044|c0_
g1_i1
TR6737|c0_
g2_i1
915
2.2018E-52
71.6 %
906
1.69115E-50
71.0 %
1026
2.37227E-41
64.4 %
1071
3.33743E-41
64.4 %
1060
9.4963E-33
59.4 %
3688
0.0
95.4 %
TR35073|c1_
g1_i1
1846
0.0
96.8 %
TR35073|c1_
g2_i1
3696
0.0
90.0 %
TR53602|c0_
g1_i1
TR56451|c0_
g1_i2
1090
6.48547E-68
80.6 %
1250
1.26752E-41
85.0 %
TR60524|c0_
g1_i1
411
4.14401E-56
84.6 %
TR60524|c0_
g1_i2
617
6.6804E-70
79.2 %
TR62294|c0_
g1_i1
Gibberellin-regulated protein
14-like
720
6.13342E-20
85.2 %
TR74456|c1_
g1_i1
Peroxiredoxin chloroplastic
1166
2.24936E-105 83.2 %
TR74456|c1_
g1_i2
Peroxiredoxin chloroplastic-like
794
6.70597E-94
91.4 %
1678
0.0
88.8 %
Seq description
Seq
Min. E-value
length
Mean
GO terms
similarity
process; P:brassinosteroid biosynthetic process; P:oxidationreduction process
C:nucleus; C:plant-type cell wall; C:extracellular matrix;
C:apoplast; F:manganese ion binding; F:nutrient reservoir
activity; P:auxin-activated signaling pathway; P:stomatal
complex morphogenesis; P:photosynthesis, light reaction;
P:cellular cation homeostasis; P:defense response to
bacterium; P:divalent metal ion transport
TR5230|c0_
g1_i1
1197
3.08115E-87
85.2 %
TR5230|c0_
g1_i2
1216
1.75685E-88
85.8 %
TR24258|c0_
g2_i5
468
1.93943E-69
95.2 %
TR27746|c0_
g1_i1
425
1.80674E-15
82.2 %
TR36927|c0_
g1_i1
TR36927|c0_
g1_i2
mlo-like protein 4
1744
0.0
81.2 %
mlo-like protein 4
1131
8.08737E-143 83.8 %
951
1.26962E-49
68.8 %
TR53888|c0_
g1_i1
TR53888|c0_
g1_i2
TR77683|c0_
g1_i2
949
6.43926E-49
68.8 %
Auxin-binding protein
abp19a-like
388
1.26965E-15
82.2 %
TR94097|c1_
g1_i1
8.85042E-148 76.2 %
Seq description
Seq
Min. E-value
length
Mean
GO terms
similarity
TR94340|c0_
g1_i1
TR97567|c0_
g1_i1
863
1.23722E-79
87.4 %
892
2.25496E-79
87.4 %
TR99891|c0_
g1_i1
Porphobilinogen deaminase
432
3.98284E-86
95.0 %
TR30252|c0_
g1_i1
1069
1.79908E-76
61.8 %
TR39058|c0_
g1_i2
3633
0.0
76.8 %
TR81320|c0_
g1_i2
6.98096E-12
70.2 %
References
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM,
Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP,
Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE,
Ringwald M, Rubin GM, Sherlock G (2000) Consortium GO: Gene
Ontology: tool for the unification of biology. Nat Genet 25:2529
Austin MB, Noel JP (2003) The chalcone synthase superfamily of type III
polyketide synthases. Nat Prod 20:79110
Bais HP, Vepachedu R, Lawrence CB, Stermitz FR, Vivanco JM (2003)
Molecular and biochemical characterization of an enzyme responsible for the formation of hypericin in St. Johns Wort (Hypericum
perforatum L.). J Biol Chem 278(34):3241322
Banks HJ, Cameron DW, Raverty WD (1976) Chemistry of the
coccoidea. II Condensed polycyclic pigments from two Australian
pseudococcids (Hemiptera). Aust J Chem 29:1509
Chen Z, Fujii I, Ebizuka Y, Sankawa U (1995) Purification and characterization of emodinanthrone oxygenase from Aspergillus terreus.
Phytochemistry 38:299305
Ciccarelli D, Andreucci AC, Pagni AM (2001) Translucent glands and
secretory canals in Hypericum Perforatum L. (Hypericaceae):