Académique Documents
Professionnel Documents
Culture Documents
Deep, Staged Transcriptomic Resources for the Novel Coleopteran Models Atrachya menetriesi
3
4
Matthew A. Benton1+, Nathan J. Kenny2+, Kai H. Conrads1, Siegfried Roth1* and Jeremy A.
Lynch3*
6
7
50674, Germany
9
10
11
Research of the State Key Laboratory of Agrobiotechnology, The Chinese University of Hong
12
Simon F.S. Li Marine Science Laboratory of School of Life Sciences and Center for Soybean
13
14
15
States of America
16
17
18
19
* Co-corresponding authors
20
21
22
23
24
25
26
1
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
27
28
Abstract:
29
Background: Despite recent efforts to sample broadly across metazoan and insect diversity,
30
current sequence resources in the Coleoptera do not adequately describe the diversity of the
31
clade. Here we present deep transcriptomic data generated with Illumina platform sequencing of
32
a number of key stages in the development of two coleopteran species, the false melon beetle
33
Atrachya menetriesi (Faldermann 1835) and the cowpea weevil Callosobruchus maculatus
34
(Fabricius 1775). These species are both agricultural pests and are of great interest to
35
36
genomic analysis.
37
38
Results: By sampling at a range of timepoints covering ovary development and several crucial
39
phases of embryonic development we assembled five and three timed transcriptomes for A.
40
41
transcriptomic resource combining all reads into a single, combined assembly for each species,
42
and we analysed each of these resources in detail. The combined A. menetriesi assembly
43
consists of 228,096 contigs with an N50 of 1,598 bp, while the combined C. maculatus
44
assembly consists of 128,837 contigs with an N50 of 2,263 bp. For each of these assemblies,
45
we identified 78,879 (34.6%) and 41,744 (32.4%) of contigs by BLASTx against the nr database
46
using Blast2GO, and we found that 97% and 98.3% of the BUSCO set of metazoan orthologs
47
were present in these assemblies. We also carried out manual analysis of 38 key
48
developmental genes, and found that nearly all expected genes were present in our
49
transcriptomes. Each of these analyses showed that these transcriptomes are of very high
50
quality. Lastly, we performed read mapping using the individual timed RNA samples, allowing us
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
51
to identify up- and down-regulated contigs at key stages in the development of these beetles for
52
further analysis.
53
54
Conclusions: The data presented here represent a significant increase in the publicly available
55
transcriptomic resources in the Coleoptera. They have also allowed us to note significant
56
changes in expression at important embryonic stages, and will provide a firm basis for a variety
57
of experimentation, both in developmental biology and in wider exploration of the genomic basis
58
59
60
61
62
63
Background:
64
The order Coleoptera is the most speciose clade of animals currently known. Despite the best
65
efforts of generations of biologists, its species are only sparsely sampled and are yet to be
66
67
uncategorized (e.g. [1, 2]). A similar discrepancy exists at the molecular level; while several
68
genomic resources are available in this clade, their number and phylogenetic distribution is only
69
just beginning to accurately sample that of the Coleoptera as a whole. A wide range of
70
transcriptomic data is available in whole organisms (for example [35], among others), specific
71
body parts (such as [6, 7]), and in several cases for staged embryos following RNA interference-
72
mediated gene knock-down [8, 9]. The i5K project [10] will also greatly advance our knowledge
73
of the genomic complement of Coleopterans, with 69 species of this Order listed on that
74
75
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
76
[11]. However, the majority of this information is largely still outside of the public domain, the
77
Coleoptera are still relatively undersampled compared to the Diptera and Hymenoptera, and, in
78
particular, timed embryonic resources (e.g. [12]) are rare in the literature.
79
The true phylogeny of the Coleoptera is still under investigation but, in general, four
80
suborders, 17 superfamilies, and 168 families are recognised [13]. The structure of the
81
coleopteran clade can be seen summarised in Fig 1A. Coleopterans have long been used for
82
research into embryology, and in the pre-molecular era, the Chrysomelidae (summarised
83
phylogeny shown in Fig 1B) was one of the best studied superfamilies. For example, the first
84
functional embryonic experiments in any insect were carried out in the Colorado potato beetle,
85
Leptinotarsa decemlineata (sub-family Chrysomelinae, see Fig 1B), leading to the discovery of
86
the function of pole cells and the existence of germ plasm in insects [14, 15]. Further, the larval
87
cuticle preparation technique, so vital for arthropod developmental biology, was first perfected in
88
the bean beetle, Callosobruchus maculatus [16]. Chrysomelid beetles are also interesting from
89
an ecological and economic viewpoint, as members of the group are usually pest species,
90
perhaps most famously the aforementioned Colorado potato beetle, which is a major pest of
91
92
93
94
PRJNA171749) and C. maculatus [17] are the subject of ongoing genomic sequencing.
95
However, while a number of transcriptomes are planned or published in this clade (e.g. [5, 6,
96
1822]) none yet sample across embryonic time points in a fashion allowing insight into the
97
genetic mechanisms behind key developmental stages and deep sampling of developmentally
98
important genes. Here we present transcriptomic sequences for two species of chrysomelid
99
beetle, the false melon beetle, Atrachya menetriesi, and the aforementioned C. maculatus (each
100
of which are pictured along with their relative phylogenetic positions in Fig 1).
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
101
102
103
pest, and feeds on a variety of plants such as clover and lettuce. Although there has only been
104
a small amount of research carried out on this beetle, the work that has been done has
105
highlighted several interesting developmental traits, including the possibility of generating twin
106
embryos after egg bisection [23], or up to four, seemingly complete, embryos following
107
treatment with low temperatures [23, 24]. Another interesting trait exhibited by this beetle is that
108
almost all eggs enter diapause at a certain stage, and this is only broken in the wild by winter
109
conditions. However, a small proportion of eggs skip this diapause and continue to adulthood.
110
The ratio of diapause to non-diapause eggs varies in different parts of Japan [25], and is a
111
heritable trait [26]. Further, A. menetriesi embryos undergo a very short germ band mode of
112
development [27], contrasting strongly with beetles such as C. maculatus (see below). As the
113
last common ancestor of these two species is estimated to have existed only 80 million years
114
ago [28], how their developmental mechanisms have diverged so greatly is a potentially
115
fascinating area for future study. Research on these topics would greatly benefit from modern
116
molecular studies.
117
C. maculatus (Fabricius) is native to West Africa [29] but is now found worldwide, and is
118
a common pest of stored legumes. It is also known as the southern cowpea weevil, however, it
119
is not a true weevil. As noted above, this beetle was the focus of active developmental research
120
in the pre-molecular era, with special focus on segmentation [30, 31]. Segmentation in C.
121
maculatus has also been studied more recently via immunohistochemistry for the even-skipped
122
protein [32]. This recent work confirmed previous reports that the embryos undergo the long
123
124
hymenopterans like Apis mellifera [33]. It is commonly believed that the long germ mode of
125
126
phylogenetic distribution of short and intermediate germ development within the Coleoptera [34,
5
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
127
35], it seems likely that the long germ mode of development also evolved independently in the
128
129
maculatus and other long germ insects, plus with more closely related species that feature short
130
germ development, such as well studied flour beetle, Tribolium castaneum (super-family
131
Tenebrionoidea, see Fig.1 A) and A. menetriesi, would yield crucial information on how
132
developmental pathways have evolved to generate the long germ mode of development. C.
133
maculatus has also been studied in other fields and is a useful system for undergraduate lab
134
teaching [17].
135
136
investigations in the Coleoptera and beyond, we present here deep, multi-stage transcriptomic
137
resources from a range of key time points in the development of these two beetle species.
138
Using RSEM-based methods we have compared transcript abundance across these life stages,
139
which will allow the investigation of genes that play key roles in developmental changes in these
140
species, particularly at the maternal/zygotic transition. We have carried out extensive searches
141
for key genes in developmental patterning and cell signalling pathways, and from our analyses
142
we conclude that the transcriptomes for both species are of very high coverage, with almost all
143
expected genes being present with full open reading frames. We have already made extensive
144
use of these resources for our own studies on the embryonic development of these two beetles,
145
and are confident that they will be of broad utility to a range of fields in genomics and
146
developmental biology.
147
148
149
150
Ovaries and embryos were collected as described in Materials and Methods, and as seen in Fig
151
2. The chosen time-windows cover a variety of important stages in the development of these
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
152
species, and can be expected to contain the majority of embryonically active transcripts in their
153
expressed complement. Briefly, RNA was collected from dissected ovaries from each species
154
and from four embryonic time windows for A. menetriesi and two embryonic time windows for C.
155
maculatus. Samples were sequenced on the Illumina HiSeq platform (one lane per species),
156
adaptor trimming was performed, along with preliminary assessment of read quality, and data
157
158
159
160
Table 1 summarises initial read information, and these reads are available from the NCBI SRA,
161
162
data, FastQC [36] was run on all read data. This showed Phred quality scores were high, with
163
median scores always exceeding 28 through to the 101st base, and generally in the mid-30s. A
164
slight bias was found in initial nucleotide sequence. To ensure that Illumina adaptors were
165
removed in their entirety, Trimmomatic v.0.32 [37] was used to check for residual adaptor
166
sequence, but none were observed. We therefore posit that this bias is due to known biases in
167
168
169
170
read assemblies empirically found them to be less well assembled than those using un-trimmed
171
reads, with a shorter N50 and less overall sequence recovery. Trimmed assemblies were
172
therefore discarded in favour of untrimmed assemblies, which were used for all further analyses.
173
Assays of initial assemblies noted a small amount of fungal contamination in the data for
174
both species. DeconSeq [40] was therefore run on the Trinity output for both species, with high
175
stringency as noted in the methods. A total of 336 and 206 contigs with some homology to
176
fungal sequence was removed from the A. menetriesi and C. maculatus total assemblies as a
177
result, before read mapping was performed. In the A. menetriesi assembly, we observed
7
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
178
particularly high contamination with the protists, notably Dictyostelium discoideum and
179
Naegleria gruberi, and some of this almost certainly remains in the transcriptome. Metrics for
180
final assemblies after the removal of contamination can be seen in Table 2, alongside the
181
182
183
and 128,837 contigs for A. menetriesi and C. maculatus respectively, contain a large number of
184
well-assembled transcripts, with the number of contigs greater than 1 kb in length (52,217 in A.
185
186
maculatus) indicating a very well assembled dataset. This size is sufficient to span most protein
187
coding domains, allowing easy inference of homology, and will be the full length of many
188
transcripts. It is important to note that many of the contigs in our assembly will represent splice
189
variants of single genes, and some genes will have multiple splice variants, which will affect
190
these statistics. However, excellent recovery of splicing variation itself will be useful for a range
191
of later analysis. GC content of the final assembled transcriptomes closely mirrors that of reads
192
193
on library).
194
195
ran TransDecoder v 2.01 [41] to identify open reading frames (ORFs) and filtered for the results
196
that were at least 100 amino acids long. For the A. menetriesi assembly, this analysis yielded
197
71,961 raw and 51,912 filtered contigs, while for the C. maculatus assembly, the analysis
198
yielded 65,433 raw and 36,535 filtered contigs. The mean average length of the predicted
199
polypeptides, after the filtering step, was 351 (A. menetriesi) and 448 (C. maculatus) amino
200
acids. This is long enough for us to be confident that our assembly adequately spans coding
201
regions, as the average eukaryotic protein length is 361 amino acids [42]. Together, these
202
analyses suggest that we have recovered the vast majority of coding sequence in our combined
203
assemblies, with sufficient length to adequately span ORFs, a conclusion further supported by
8
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
204
gene annotation data as discussed further below. The numbers of ORFs presented here are
205
considerably more than the 16,404 gene models observed in the T. castaneum genome
206
(Tribolium Genome Sequencing Consortium, 2008), and this is likely due to both spurious ORFs
207
in our dataset and to multiple splice variants. All assemblies are available from Figshare online
208
209
10.6084/m9.figshare.2056467).
210
211
212
Key developmental stages for both A. menetriesi and C. maculatus can be easily observed
213
following fixation and nuclei staining (data not shown). Briefly, egg lay to uniform blastoderm
214
215
formation, gastrulation and elongation occur from 24-100 hours in A. menetriesi, and from 7-24
216
217
characteristic stages of short germ development pertinent to our research interests. These
218
stages, along with assayed sample periods, are shown diagramatically in Fig 2. As well as being
219
used for combined assemblies as described earlier, RNA extracted from mature ovaries and
220
from embryos collected during the aforementioned time periods was also individually assembled
221
using Trinity.
222
The timed transcriptome assemblies for each species often possess better assembly
223
quality when compared to the combined assemblies by metrics such as N50 and mean contig
224
length (Fig 2, Table 2). As a result of being made up of a subset of the total reads, the individual
225
assemblies do not possess the breadth of the combined assemblies, with fewer contigs,
226
especially at long contig length (e.g. 1kb <). As such, the staged transcriptomes were used
227
solely for comparison of expression levels across time, while the combined assemblies were
228
used for gene family analyses. We emphasise that no technical replicate was performed for
229
these comparisons, and any conclusions drawn from them should hold this consideration in
9
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
230
regard. With this limitation in mind, we carried out differential expression analyses and observed
231
232
First, we generated matrices from general comparison between time points in order to
233
find the most similar samples (Fig 3 A,B). Generally, these results are congruent with steady
234
changes in gene expression across the course of development, with most time points being
235
most similar to those immediately preceding and following them. However, a split can be seen in
236
A. menetriesi (Fig 3 A,B) between the ovary and 0-24 hour samples and all others, with the
237
three later samples resembling each other more closely than the 0-24 hour dataset resembles
238
the 24-48 hour sample. This could be due to the maternal:zygotic transition, which will occur at
239
some point in this time frame. This cannot be seen for C. maculatus, and could be due to more
240
admixture of RNA within the last 7-24 hour sample. Focused analysis on when the
241
242
Next, we clustered the results of our differential expression calculations (Fig 3 C,D). The
243
results shown are those with RSEM considering each isoform separately (rather than taking into
244
account clustering into genes performed by Trinity). Numerous up- and down-regulated contigs
245
can be seen at each time point, with some time points more obviously possessing or lacking a
246
247
While replicates have not been performed and we have not analysed up and down
248
regulated transcripts in detail, these data are available to download from Additional Files 6 and
249
7 attached to this document online. These data will act as a good initial guide for those
250
interested in tracking differential expression of specific genes across development and will be
251
252
253
254
To gain an understanding of the depth of coverage of our datasets we used the BUSCO library
255
of well-annotated genes [43], which are known to be highly conserved in single copy across the
10
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
256
Metazoa, as a basis for comparison with our combined, assembled transcriptomes. Of the
257
BUSCO set of 843 metazoan orthologs, the A. menetriesi assembly possesses 801 (95%)
258
complete (of which 225, 26%, are duplicated), 16 fragmented (1.8%) and 26 missing genes
259
(3.0%), for a total recovery of 97% of the BUSCO dataset. The C. maculatus assembly contains
260
815 (96%) complete (with 202, 23%, duplicated), 13 (1.5%) fragmented and 15 (1.7%) missing
261
genes (98.3% recovery). This extremely high level of recovery gives us confidence that at least
262
all housekeeping genes expected to be present in these species are found in our datasets,
263
which strongly suggests that these transcriptomes contain the vast majority of the gene cassette
264
of these species.
265
The number of duplicate genes in our BUSCO analyses likely reflects the construction of
266
our transcriptomes from mixed RNA samples, with the allelic variation that this implies.
267
Discerning true duplicates from allelic and splice variant data is largely contingent on the
268
269
our assemblies underlines that our RNA sequencing and assembly was of good depth and
270
quality (respectively). With this information in-hand, a range of investigations will be made
271
possible, particularly into the regulation and expression of developmentally important genes.
272
A further understanding of the content of our assemblies was gained from Blast2GO
273
analysis [44, 45]. Genes were annotated using b2gpipe, on the basis of BLASTx (E value cutoff,
274
10-3) comparisons made against the nr database as downloaded on the 26 January 2015. Of
275
228,096 (A. menetriesi) and 128,837 (C. maculatus) contigs in each assembly, 78,879 (34.6%)
276
and 41,744 (32.4%) possessed a hit in the nr databases above the threshold. After further
277
annotation with ANNEX and Interproscan, a total of 36,315 (15.9%) and 13,096 (10.2%) contigs
278
were assigned to one or more GO categories. These numbers, while only a fraction of the total
279
number of contigs within our transcriptomes, more closely reflect the expected eukaryotic
280
protein complement in number. Blast2GO annotations are given in Additional Files 2 and 3
11
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
281
Fig 4A shows the distribution of species best hit by BLASTx comparison of contigs from
282
A. mentriesi and C. maculatus with the nr database. For both species, T. castaneum is the best
283
represented species - a reflection of both the phylogenetic position of this species and its well
284
annotated genome. The fact that contigs in our transcriptomes match T. castaneum and other
285
coleopteran and insect species more closely than those of other species suggests that gene
286
orthology will be easy to assign in many cases, and that a higher than normal rate of molecular
287
288
The distribution of GO terms within our datasets are shown in Fig 4B, alongside those of
289
the well-annotated D. melanogaster and Mus musculus proteomes. In general, our datasets
290
resemble one another more than they mirror that of the two sequenced genomes noted. Our
291
292
293
294
Given our results for more targeted investigations, found below, we feel this is likely a result of
295
poor annotation of these by Blast2GO, rather than true absence from the transcriptome.
296
297
appears more similar to that seen in the two model species than in Molecular Function or
298
Biological Process categories, although this has not been tested statistically. This would
299
suggest these well-conserved structural components were more readily assigned GO categories
300
than the developmentally interesting categories listed above. While not all GO categories are as
301
well-annotated by this process as may be desired, the broad classification of our data into a
302
303
annotations of our data are a useful starting point for more focussed investigations and
304
305
306
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
307
308
analysis of individual, developmentally important gene families. Both the Hox family of
309
transcription factors and the TGF- cassette were exceptionally well recovered in our dataset.
310
The Hox genes, and in particular the ANTP HOXL class, which pattern the anterior-posterior
311
body axis, are recovered almost in their entirety in both species when compared to well-
312
catalogued databases (e.g. [46]). This can be seen in Fig 5, which shows the phylogenetic
313
distribution of recovered ANTP HOXL class sequences from our transcriptomes alongside
314
315
In A. menetriesi, sequences for all the ANTP HOXL class genes are recovered in our
316
transcriptome, as can be seen in Fig 5, with sequences and alignments in Additional File 1. We
317
note, however, that the A. menetriesi Hox 2 / maxillopedia (mxp)/ proboscipedia (pb) sequence
318
bears some BLAST similarity with Hox 4 / Dfd sequences, while the putative Hox 4 / Deformed
319
(Dfd) recovered does not include the homeodomain sequence - whether it has lost this crucial
320
domain, or if this portion of the sequence is simply not recovered in our assembly is at present
321
unknown. The putative A. menetriesi Hox 4 / Dfd sequence is given in Additional File 1,
322
although it is not shown in the phylogeny in Fig 5 due to its truncated length.
323
While 2 (A. menetriesi) and 4 (C. maculatus) Hox 3 / zerknllt (zen) variants are seen in
324
our species, these are identical at the coding level, and therefore seem to be splice or allelic
325
variants, rather than the two paralogous genes seen in T. castaneum [47]. Similarly, no
326
evidence of the caudal (cad) duplication seen in T. castaneum [48] can be found in our
327
transcriptomic assemblies, suggesting this is perhaps specific to the flour beetle lineage. Both
328
zen and caudal are important embryonic patterning genes, and comparison of these genes in
329
our two species and Tribolium would be an excellent situation in which to study how sub- and
330
neo-functionalisation occurs. Likely allelic or splice variants are also observed for other HOXL
331
genes in both A. menetriesi and C. maculatus. It should be noted that these could represent
332
very recent duplications, or the effect of gene conversion, although this can only be tested fully
13
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
333
with the advent of a complete genomic resource. No Pdx/Xlox gene was seen, adding further
334
circumstantial evidence for the broad scale loss of this gene across the Arthropoda.
335
Our C. maculatus transcriptome also contains the full complement of ANTP HOXL
336
genes, although, similar to the case of Hox 4 / Dfd in A. menetriesi, the 4 recovered abdominal-
337
A (abd-A) homologs lack the whole homeodomain sequence, with several residues missing.
338
These truncations excluded them from the phylogeny shown in Fig 5, but the sequences for
339
340
341
variants are noted in C. maculatus, particularly for the Hox 6/8 superfamily and Hox 9-13 /
342
Abdominal-B (Abd-B) gene family. Of the Hox 6/8 gene superfamily, normally represented by
343
four genes in the beetle (prothoraxless (ptl), fushi tarazu (ftz), Ultrathorax (Utx) and abd-A), 12
344
ptl, 1 ftz, 4 Utx and 4 abd-A representatives were found in our analysis. Furthermore, up to 26
345
different potential allelic or splice variants of abdB are recorded. As our transcriptome is made
346
of mixed embryonic samples, it is perhaps not surprising that a diversity of splice/allelic variants
347
are observed, but the excellent recovery of this data confirms the deep coverage provided by
348
349
The TGF- cassettes of the insects have been very well described previously (e.g. [49,
350
50]). Our datasets recover almost the full expected complement of the Coleoptera. A slight
351
exception to this is Activin (Act), a partial sequence of which is recovered for both species: a
352
portion of the propeptide which does not span the mature signal peptide sequence. Whether this
353
is a consequence of loss of the mature domain in these species or low levels of expression at
354
the sampled timepoints remains to be established. A BMPx ortholog of clear homology to genes
355
of that family can be found in C. maculatus, but has been excluded from the tree seen in Fig 6
356
as it is incomplete in length. Its sequence can be found in Additional File 1, and we have no
357
doubt as to its identity due to high levels of sequence conservation between it and the T.
358
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
359
The glass bottom boat (gbb) duplication observed in T. castaneum cannot be found in
360
our data, but we can recover a range of splice or allelic variants for other genes, especially in A.
361
menetriesi. These do not differ in the protein coding regions, which leads us to suspect that
362
these are not from gene duplications (unless the duplication(s) occurred very recently). The
363
phylogeny shown in Fig 6 confirms the homology of all genes, and splice/allelic variant numbers
364
observed are given there in brackets, with all sequences available in Additional File 1.
365
We also note the discovery of an additional TGF- ligand in C. maculatus. This gene has
366
367
it is truly of this family, which is also known as GDF1/3/Univin/Vg1, this would be a surprise, as
368
its presence outside the Deuterostomia is controversial [51]. If proof could be found for this
369
being a bona fide GDF1/3/Univin/Vg1, the presence of this gene in more than one coleopteran
370
could suggest that this might in fact be ancestrally present in all bilaterian species, but further
371
investigation is warranted before strong conclusions can be drawn in this regard. We could gain
372
no phylogenetic support for placing either of these beetle sequences in the GDF1/3 clade, and it
373
374
Our recovery of not only the full expected complements of these vital developmental
375
genes, but also a remarkable diversity of alternative variants, demonstrates the depth of our
376
assemblies as a resource. Whether used as the basis for simple cloning or more sophisticated
377
analysis of patterns of gene variation and diversification, these transcriptomes will be of wide
378
379
380
Pathway Recovery
381
382
commonly studied in insects [52]. This allows us to both note how well-recovered such
383
pathways are in our species as a measure of transcriptome utility, as well as note interesting
384
differences between these pathways in our species when compared to others. We did this using
15
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
385
both automated (KEGG KAAS mapping) and manual (BLAST based) methods. Some
386
representative results of KEGG KAAS mapping can be seen in Fig 7, and all KEGG annotations
387
388
KEGG KAAS mapping uses BLAST results to annotate known pathways, and gives a
389
rapid overview of the recovery of these. Here we have shown the well-known Wnt, Notch and
390
Hedgehog pathways to indicate the depth of our transcriptomes, and show how they may be
391
useful for future research. However, these maps often use terminology based on vertebrate
392
nomenclature, and contain genes known to be absent from particular clades. We have therefore
393
indicated in Fig 7 (using unshaded boxes as shown in the Key) genes that may be absent
394
ancestrally in the Coleoptera, based on their absence from the T. castaneum pathway. Of genes
395
expected to be present in the Coleoptera we find almost total recovery in our transcriptomes. In
396
the three pathways examined, only three genes noted to be present in T. castaneum were noted
397
as absent from both of our transcriptome datasets, all in the Wnt cascades (Fig 7A). The
398
expected Hedgehog cassette was recovered in toto (Fig 7B) and in the Notch signalling
399
cascade (Fig 7C), only APH-1 was noted to be absent, and only from the C. maculatus
400
transcriptome. We must note that these may not be true absences - KEGG KAAS mapping is
401
based on automatic BLAST assignation, and if these sequences are divergent in our
402
403
We also examined pathways manually, using reciprocal BLAST hits and closer manual
404
investigation to confirm identity of individual genes, the results of which can be seen in Table 3.
405
The anterior-posterior patterning genes cad (mentioned earlier) and hunchback (hb) are present
406
in both species. Of the germline establishment and localization genes examined, nanos was
407
surprisingly absent in both species, while bruno (bru; also known as arrest), exuperantia (exu),
408
tudor (tud; 2 copies in A. menetriesi), oskar (osk), vasa (vas) and valois (vls) were present.
409
410
across two contigs), while A. menetriesi possesses a total of seven copies. The different A.
16
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
411
menetriesi pum copies varied both at the nucleotide level and in their amino acid sequences,
412
strongly suggesting that they are in fact paralogs. In depth analysis of these genes is required to
413
uncover why they have undergone several rounds of duplication. Orthologs of the Drosophila
414
gene swallow (swa) could not be found in either of our transcriptome resources, nor is it present
415
in several other insects (data not shown) and we suggest it may therefore be a schizophoran
416
novelty.
417
Canonical gap genes Krppel (Kr), knirps (kni), giant (gt), huckebein (hkb), tailless (tll; 2
418
paralogs in C. maculatus), buttonhead (btd), empty spiracles (ems) and both orthodenticle
419
orthologs (Otd and Otd2) were recovered in both species examined here. The C. maculatus
420
specific duplication of tll would be another excellent opportunity to study gene duplication and
421
evolution. The pair rule genes even skipped (eve), hairy (h), fushi tarazu (ftz), odd paired (opa),
422
odd skipped (odd), paired (prd), runt (run) and sloppy paired 1 (slp1) were present in single
423
copy. The segment polarity gene cassette was also present in both species, with the notable
424
absence of gooseberry-neuro (gsb-n) from our datasets. The genes armadillo (arm), cubitis
425
interruptus (ci), Engrailed (En), fused (fu), gooseberry (gsb), hedgehog (hh), pangolin (pan),
426
patched (ptc) and wingless (wg) were all present, and their sequences can be found in
427
Additional File 1.
428
All of these pathways are commonly studied in insects, and the annotations provided
429
here, along with preliminary timed expression data, will provide a basis for a wide range of
430
targeted investigations into the embryonic development of these two species, and how these
431
pathways have changed over the course of evolution. Furthermore, the excellent recovery of
432
these pathways by both automated (KEGG-KAAS) and manual annotation gives us high
433
confidence in the completeness of our transcriptomic resources. This confirms the results of our
434
BUSCO analysis, and our datasets are therefore likely to contain the vast majority of transcribed
435
genes in these two species, with only lowly expressed and temporally restricted genes absent
436
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
437
438
Conclusions:
439
Our production of deep transcriptomic sequence data for A. menetriesi and C. maculatus will
440
assist in the inference of character gain and loss across the Coleoptera, aid in future
441
phylogenetic efforts, and allow a range of investigations into the embryonic development of
442
these species at the molecular level. The status of these organisms as common agricultural
443
pests also suggests that such resources may allow targeted control mechanisms to be
444
developed for these species. This data will be another key building block in our understanding of
445
the transcriptomic basis to embryological development, and provide a window into the basic
446
447
448
Methods:
449
Animal Husbandry
450
A. menetriesi eggs were kindly provided by Dr Yoshikazu Ando, and were reared at 25C on wet
451
sand or soil and fed fresh lettuce. C. maculatus beetles were kindly provided by Dr Joel Savard,
452
453
454
455
RNA was extracted from ovary and timed embryonic samples using a TRIzol RNA extraction kit
456
according to the manufacturers protocols. RNA quantity and quality was checked using a
457
Thermo Scientific Nanodrop 2000C Spectrophotometer and 1 g was sent for sequencing by
458
the Cologne Centre for Genomics. Adaptor trimming and initial quality control was performed by
459
the commercial provider according to their internal standards. This cleaned data was then made
460
available to us for download from an external server. Paired end read quality after sequencing
461
18
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
462
463
464
Assemblies used in our final analysis were made using Trinity version 2013_08_14 [39], with the
465
default settings. Full assemblies were made using reads from all time points, and individual
466
assemblies were then constructed using each sampled time point individually. All assemblies
467
468
469
run on full assemblies with settings -i 95 -c 95, using the bact, fungi, hsref, and prot databases.
470
Comparative expression analyses were performed using RSEM [53] as packaged in the Trinity
471
module, and results shown here are the as-isoform data, although as-gene data is provided in
472
Additional Files. BUSCO v1.1b1 [43] was used to assess gene complement completeness.
473
474
Functional Annotation
475
Our combined assemblies were automatically assigned homologs and annotated according to
476
gene ontology (GO) terms using Blast2GO [44, 45]. Initially, BLASTx was run using BLAST
477
2.2.29+ against the NCBI nr database as downloaded to a local server on the 17/01/2015, with
478
479
melanogaster and H. sapiens genomes were downloaded from B2GO-FAR [54] and calculated
480
using the Combined Graph function of Blast2GO. KEGG KAAS mapping was automatically
481
482
best hit with default BLAST settings, and with the eukaryote dataset as a basis for annotation.
483
484
Gene Identification
485
Gene sequences were manually identified and their homology confirmed by independently using
486
tBLASTn [55] searches using gene sequences of known homology downloaded from the NCBI
487
nr database as queries against standalone databases created on a local server using BLAST
19
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
488
2.2.29+ or the CLC Main workbench. Genes putatively identified using this method were
489
reciprocally BLASTed against the online NCBI nr database using BLASTx to confirm their
490
identity. Where identity was uncertain, phylogenetic analysis was used to confirm identity.
491
492
493
Sequences were aligned using MAFFT 7 [56] unless otherwise stated under the L-INS-i
494
strategy. Alignments were then saved and exported to MEGA 6, where regions of poor
495
alignment were manually excluded and maximum likelihood phylogenetic trees were
496
constructed using the LG model, 1000 bootstrap replicates as indicated, 4 gamma categories
497
and invariant sites, and all other default prior settings [57].
498
499
500
501
BLAST: Basic Local Alignment Search Tool, BUSCO: Benchmarking Universal Single-Copy
502
503
and Genomes - Automatic Annotation Server, MAFFT: Multiple Alignment using Fast Fourier
504
Transform, NCBI: National Centre for Biotechnology Information,ORF: Open Reading Frame,
505
506
507
508
Competing Interests:
509
510
511
Authors' contributions:
20
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
512
MAB, KC and JAL maintained animals, extracted RNA and arranged sequencing. MAB and NJK
513
assembled transcriptomes, analysed raw data and wrote the manuscript. MAB, NJK, KC, JAL
514
and SR designed experiments and contributed to gene family analyses presented in this
515
516
517
Acknowledgements:
518
The authors would like to thank the members of their laboratories for their support and
519
discussions. In addition, we thank Dr Y Ando for providing A. menetriesi eggs and advice on
520
establishing cultures, Dr J Savard for providing C. maculatus, and Dr F Marletaz for help in
521
running BUSCO analyses. The efforts of editors and reviewers in considering this manuscript
522
are gratefully acknowledged. MAB was supported by a Humboldt Fellowship for Postdoctoral
523
Researchers. KHC, SR and JAL were supported by the DFG Collaborative Research Center
524
grant 680.
525
526
527
The datasets supporting the conclusions of this article are available in the NCBI SRA repository
528
529
530
531
532
Bibliography:
533
1. Stork NE: Insect diversity - facts, fiction and speculation. Biol J Linn Soc 1988, 35:321
534
337.
535
536
21
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
537
4. Keeling CI, Henderson H, Li M, Yuen M, Clark EL, Fraser JD, Huber DPW, Liao NY, Docking
538
TR, Birol I, Chan SK, Taylor GA, Palmquist D, Jones SJM, Bohlmann J: Transcriptome and
539
full-length cDNA resources for the mountain pine beetle, Dendroctonus ponderosae
540
Hopkins, a major insect pest of pine forests. Insect Biochem Mol Biol 2012, 42:525536.
541
542
543
544
545
546
tremulae reveals new gene families in Coleoptera. Insect Biochem Mol Biol 2009, 39:403
547
413.
548
549
550
551
552
signaling integrates patterning and metabolism of the insect growth zone. Development
553
2014, 141:47404750.
554
9. Jacobs CGC, Braak N, Lamers GEM, van der Zee M: Elucidation of the serosal cuticle
555
556
Knickkopf1, Retroactive and Laccase2. Insect Biochem Mol Biol 2015, 60:712.
557
10. i5k-Consortium: The i5K Initiative: advancing arthropod genomics for knowledge,
558
559
11. Richards S, Gibbs R a, Weinstock GM, Brown SJ, Denell R, Beeman RW, Gibbs R, Bucher
560
561
Zdobnov EM, Muzny D, Attaway T, Bell S, Buhay CJ, Chandrabose MN, Chavez D, Clerk-
562
Blankenburg KP, Cree A, Dao M, Davis C, Chacko J, Dinh H, Dugan-Rocha S, Fowler G, et al.:
22
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
563
The genome of the model beetle and pest Tribolium castaneum. Nature 2008, 452:94955.
564
12. Yin A, Pan L, Zhang X, Wang L, Yin Y, Jia S, Liu W, Xin C, Liu K, Yu X, Sun G, Al-hudaib K,
565
566
567
568
569
14. Hegner RW: Effects of Removing the Germ-Cell Determinants from the Eggs of Some
570
571
15. Hegner RW: The origin and early history of the germ-cells in some chrysomelid
572
573
16. Meer JM Van Der: Optical clean and permanent whole mount preparation for phase-
574
contrast microscopy of cuticular structures of insect larvae. Dros Inf Serv 1977, 52.
575
576
[http://www.beanbeetles.org/]
577
578
encoding novel plant cell wall degrading enzymes. PLoS One 2010, 5:e15635.
579
19. Kirsch R, Wielsch N, Vogel H, Svatos A, Heckel DG, Pauchet Y: Combining proteomics
580
581
582
20. Flagel LE, Bansal R, Kerstetter RA, Chen M, Carroll M, Flannagan R, Clark T, Goldman BS,
583
Michel AP: Western corn rootworm (Diabrotica virgifera virgifera) transcriptome assembly
584
585
21. Strauss AS, Wang D, Stock M, Gretscher RR, Groth M, Boland W, Burse A: Tissue-
586
specific transcript profiling for ABC transporters in the sequestering larvae of the
587
588
22. Chi YH, Salzman R a, Balfe S, Ahn J-E, Sun W, Moon J, Yun D-J, Lee SY, Higgins TJ V,
23
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
589
590
591
2009, 18:97110.
592
593
594
595
596
597
598
599
600
601
26. Ando Y, Miya K: Diapause character in the false melon beetle, Atrachya menetriesi
602
Faldermann, produced by crossing between diapause and non diapause strains. Bull Fac
603
604
605
606
607
28. Gmez-Zurita J, Hunt T, Kopliku F, Vogler AP: Recalibrated tree of leaf beetles
608
609
610
29. Tran BMD, Credland PF: Consequences of inbreeding for the cowpea seed beetle,
611
612
30. Meer J van der: of metameric order in the insect Callosobruchus maculatus
613
614
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
615
31. Meer JM Van Der: Parameters influencing reversal of segment sequence in posterior
616
617
32. Patel NH, Condron BG, Zinn K: Pair-rule expression patterns of even-skipped are found
618
619
33. Osborne PW, Dearden PK: Expression of Pax group III genes in the honeybee (Apis
620
621
622
Insects, Vol 1. Edited by Counce SJ, Waddinton CH. New York: Academic Press; 1972:165
623
242.
624
35. Davis GK, Patel NH: SHORT, LONG, AND BEYOND: Molecular and Embryological
625
626
36. FastQC: A quality control tool for high throughput sequence data
627
[http://www.bioinformatics.babraham.ac.uk/projects/fastqc/]
628
37. Bolger AM, Lohse M, Usadel B: Trimmomatic: a flexible trimmer for Illumina sequence
629
630
38. Hansen KD, Brenner SE, Dudoit S: Biases in Illumina transcriptome sequencing caused
631
632
39. Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L,
633
634
635
assembly from RNA-Seq data without a reference genome. Nat Biotechnol 2011, 29:644
636
652.
637
638
639
41. Haas BJ, Papanicolaou A, Yassour M, Grabherr M, Blood PD, Bowden J, Couger MB,
640
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
641
Westerman R, William T, Dewey CN, Henschel R, Leduc RD, Friedman N, Regev A: De novo
642
transcript sequence reconstruction from RNA-seq using the Trinity platform for reference
643
644
42. Brocchieri L, Karlin S: Protein length in eukaryotic and prokaryotic proteomes. Nucleic
645
646
43. Simao FA, Waterhouse RM, Ioannidis P, Kriventseva E V, Zdobnov EM: BUSCO:
647
648
649
44. Conesa A, Gotz S, Garcia-Gomez JM, Terol J, Talon M, Robles M: Blast2GO: a universal
650
651
652
45. Gotz S, Garcia-Gomez JM, Terol J, Williams TD, Nagaraj SH, Nueda MJ, Robles M, Talon
653
M, Dopazo J, Conesa A: High-throughput functional annotation and data mining with the
654
655
46. Zhong Y-F, Butts T, Holland PWH: HomeoDB: a database of homeobox gene diversity.
656
657
47. van der Zee M, Berns N, Roth S: Distinct Functions of the Tribolium zerknullt Genes in
658
659
48. Schulz C, Schrder R, Hausdorf B, Wolff C, Tautz D: A caudal homologue in the short
660
germ band beetle Tribolium shows similarities to both, the Drosopila and the vertebrate
661
662
49. Van der Zee M, da Fonseca RN, Roth S: TGFbeta signaling in Tribolium: vertebrate-like
663
664
50. Ozuak O, Buchta T, Roth S, Lynch JA: Ancient and diverged TGF-beta signaling
665
666
51. Kenny NJ, Namigai EKO, Dearden PK, Hui JHL, Grande C, Shimeld SM: The
26
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
667
668
669
52. Gilbert SF (Ed): Developmental Biology. 10th edition. Sunderland, MA: Sinauer Associates,
670
Inc.; 2013.
671
53. Li B, Dewey CN: RSEM: accurate transcript quantification from RNA-Seq data with or
672
673
674
675
2011, 27:919924.
676
55. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool.
677
678
56. Katoh K, Standley DM: MAFFT multiple sequence alignment software version 7:
679
680
681
682
683
Figure legends
684
Figure 1: A) Cladogram of Coleoptera simplified from that determined by Hunt et al. (2007) [13].
685
Red asterisk indicates the superfamily in which the Chrysomelidae are located. B) Cladogram of
686
Chrysomelidae simplified from that determined by Gomez-Zurita et al. (2007) [28]. Black and
687
grey asterisks indicate the sub-families in which A. menetriesi and C. maculatus are located
688
689
690
Figure 2: Summary of RNA sources, life stages sampled and sequencing results, with
691
27
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
692
693
Figure 3: Overview of results of differential expression analysis performed by RSEM within the
694
Trinity framework. A) and B) show the sample correlation matrix for A. menetriesi and C.
695
696
697
698
Figure 4: Blast2GO results A) Distribution of BLASTx best hits by species, showing metazoans
699
700
a percentage of annotated contigs which were assigned a term within each of the three
701
702
703
704
MEGA 6 using the LG+Freqs model with 4 gamma categories and invariant sites, based on a 59
705
amino acid alignment spanning the homeodomain. Numbers at base of nodes represent
706
bootstrap percentages of 1000 replicates. Scale bar at base of phylogeny gives substitutions
707
per site at given unit distance. Red underline indicates A. menetriesi and C. maculatus
708
sequences, coloured boxes are used to delineate known gene families (and in the case of Hox
709
6/7/8, a superfamily).
710
711
Figure 6: Maximum Likelihood Phylogeny of TGF- ligands, as determined using MEGA under
712
the LG+Freqs model with 4 gamma categories and invariant sites, on the basis of a 72 amino
713
acid alignment of mature peptide sequences. The given scale depicts the number of
714
substitutions per site per unit length. Bootstrap percentages (of 1000 replicates) are given at
715
base of nodes. A. menetriesi and C. maculatus sequences are underlined in red. Coloured
716
boxes represent known gene families with representatives in our transcriptomic resources, while
717
all gene families, including those not found in our datasets, are indicated at right.
28
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
718
719
Figure 7: KEGG style pathway maps showing recovery in our transcriptome resources of A) the
720
Wnt signalling pathway in canonical and non-canonical contexts, B) the Hedgehog signalling
721
pathway and C) the Notch signalling pathway. Coloration of genes indicates presence, absence
722
or ancestral absence from the Coleoptera as detailed in the key, which also gives other
723
information as noted.
724
725
Tables
726
Library: (hpf=
C. maculatus
Ovary
0-24 hpf
24-48 hpf
48-72 hpf
72-100 hpf
Ovary
0-7 hpf
7-24 hpf
44,695,199
56,139,626
39,911,618
46,217,614
45,213,862
67,026,913
61,060,357
55,273,927
101
101
101
101
101
101
101
101
7.3
9.5
6.8
7.8
7.7
10.8
9.8
8.8
38
36
37
37
37
43
41
40
hours post
fertilization)
Number of
Paired-end
Reads
Read Length
(bp)
Amount of
Data
(Gigabytes)
GC content
(%)
727
728
729
730
731
29
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
732
Table 2: Assembly metrics for individual time point and combined transcriptomic resources
A. menetriesi
Ovary
C. maculatus
0-24 hpf
24-48 hpf
48-72 hpf
72-100
Combined
Ovary
0-7 hpf
7-24 hpf
Combined
hpf
Number
100,084
106,548
120,497
140,868
126,400
228,096
98,082
57,879
80,665
128,837
20,759
17,765
18,398
17,591
17,180
20,757
30,742
22,941
22,956
29,933
983.95
1027.36
924.27
879.03
914.2
844.66
1074.86
1216.55
1106.56
991.04
471
490
453
426
448
401
416
526
474
391
1,894
1,993
1,721
1,643
1,686
1,598
2,450
2,570
2,308
2,263
14,196
15,200
17,511
19,814
18,105
31,196
11,695
7,874
10,978
15,155
28,255
31,944
32,307
34,520
33,002
52,217
27,763
19,919
25,529
33,250
# bases,
98,477,17
109,463,3
111,371,6
123,827,2
115,555,1
192,664,2
105,424,2
70,412,72
89,260,49
127,682,5
total
62
55
26
46
13
20
07
# bases in
68,737,07
78,526,34
75,208,10
80,796,00
77,199,41
122,986,5
78,519,05
55,032,54
67,296,56
91,466,09
contigs >
47
of contigs
Max
contig
length
(bp)
Mean
contig
length
(bp)
Median
contig
length
(bp)
N50
contig
length
(bp)
# contigs
in N50
# contigs
> 1kb
1kb
30
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
GC
33.54
33.14
33.14
33
33.17
32.82
39.26
39.21
38.59
38.7
Content
% (2dp)
733
734
735
736
737
738
A. menetriesi
C.maculatus
Maternal Effect:
Pathway/Gene
A. menetriesi
C.maculatus
Pair rule:
caudal
present
present
even skipped
present
present
hunchback
present
present
fushi tarazu
hairy
present
present
odd paired
present
present
Germline:
bruno/arrest
present
present
odd skipped
present
present
exuperantia
present
present
paired
present
present
nanos
absent
absent
runt
present
present
oskar
present
present
sloppy paired 1
present
present
pumilio
present - 7 copies
present
swallow
absent
absent
Segment Polarity:
tudor
present - 2 copies
present
armadillo
present
present
valois
present
present
cubitis interruptus
present
present
vasa
present
present
Engrailed
present
present
31
bioRxiv preprint first posted online Jan. 6, 2016; doi: http://dx.doi.org/10.1101/035998. The copyright holder for this preprint (which was not
peer-reviewed) is the author/funder. It is made available under a CC-BY-NC-ND 4.0 International license.
Gap genes:
fused
present
present
gooseberry
present
present
buttonhead
present
present
gooseberry-neuro
absent
absent
empty spiracles
present
present
hedgehog
present
present (but on 3
contigs)
giant
present
present
pangolin
present
present
huckebein
present
present
patched
present
present
knirps
present
present
wingless
present
present
Krppel
present
present
orthodenticle 1
present
present
orthodenticle 2
present
present
tailless
present
present - 2 copies
739
740
741
Additional Files:
742
Additional File 1: Sequences of all genes referred to in text, and alignments used in
743
744
745
746
747
748
749
32
A)
Coleoptera
B)
Chrysomelidae
C)
Atrachya
menetriesi
2 cm
D)
Callosobruchus
maculatus
1 cm
Atrachya menetriesi
Developing Gamete
Ovarian
Development
Callosobruchus maculatus
Early Egg to
Blastoderm Stage
Embryo Formation,
Gastrulation,
Germband Elongation
Final, Combined
Assemblies
02
06
CoVa
orue
Key
go
Co or Key
yeK royeoK
C ro oC
mp
o
on m
mp
mono pm o
pm
m no
oom u
dm
og
god
m dm o o
u moo oom
m
u
A achya mene es
Samp e Co e a on Ma x
mp
oom
001_27
27_84
6 02 0
eu yaVeu aV
ova
00 22 0 4 0
0 66 0 8 1
1
20
Vaa ue
Re
ve
S m ar y
ova y
24_48
ova
y
72_100
72_100
24_48
72_100
24_48
0_24
0-24 hour
D)
-5 0
0 55 10
10
Vaatue
Re
ve
express on
7_24
7-24 hour
Co or Key
samp es vs ea u es
Ca
osobruchus
macu atus
so o msc us e ed ma x og2 cen e ed
Cont g Re at ve Express on
0_24
-10
10
0_7
0-7 hour
ova y
ovary
72_100
24_48
ova y
24_48
0_24
ovary
48_72
24_48
ova y
0_24
48_72
ova y
24_48
0_24
24_48
samp es vs ea u es
Atrachya
menetr es
so o msc us e ed ma x og2 cen e ed
Cont g Re at ve Express on
0-7 hour
0_7
ova y
ova y
72_100
0_24hour
72-100
0_24
48_72hour
48_72
0-24
24_48
ovary
84_42
og
84_42
dm
on m
27_84
24_48
72_100
ova
y
48_72
ovary
27_84
u
o
ova y
72_100
0_24
24-48 hour
48_72
0_24
24_48
001_27
001_27
y avo
y avo
Co or
Co
orKey
Key
-5
5
0 55
0
Re
Vaatueve
express on
7-24 hour
7_24
0-7 hour
0_7
0_24
72-100 hour
ovary
48-72 hour
48_72
0-24 hour
24_48
ovary
comp86476_c0_s
comp111618_c1_seq10
comp111618_c1_seq12
comp111618_c1_seq13
comp111618_c1_seq9
comp111618_c1_seq1
comp111618_c1_seq3
comp111618_c1_seq14
comp111618_c1_seq6
comp86579_c2_seq1
comp92303_c0_seq1
comp109712_c0_seq1
comp73271_c0_seq1
comp111925_c0_seq22
comp111618_c1_seq2
comp63201_c0_seq1
comp100088_c0_seq1
comp99616_c0_seq3
comp54614_c0_seq1
comp97104_c0_seq1
comp103563_c2_seq2
comp109922_c1_seq6
comp36407_c0_seq1
comp101297_c0_seq2
comp97615_c0_seq6
comp19619_c1_seq1
comp91877_c0_seq2
comp91877_c0_seq1
comp94921_c0_seq1
comp78259_c0_seq1
comp94921_c0_seq2
comp111698_c2_seq2
comp97214_c0_seq3
comp80407_c0_seq1
comp108382_c0_seq3
comp109142_c1_seq1
comp109142_c0_seq1
comp97293_c0_seq4
comp94097_c0_seq1
comp92193_c0_seq1
comp103118_c0_seq12
comp90155_c0_seq1
comp79141_c0_seq1
comp108292_c3_seq1
comp83140_c1_seq1
comp109210_c0_seq1
comp99761_c0_seq2
comp104281_c0_seq2
comp110523_c0_seq5
comp112032_c1_seq2
comp112318_c0_seq1
comp112318_c0_seq2
comp112032_c1_seq3
comp105249_c2_seq1
comp102392_c0_seq1
comp111014_c0_seq1
comp95123_c0_seq2
comp107616_c3_seq1
comp112669_c0_seq1
comp112901_c0_seq1
comp112901_c0_seq2
comp110388_c0_seq5
comp112535_c0_seq5
comp89538_c0_seq1
comp105267_c0_seq4
comp108665_c0_seq3
comp106426_c0_seq1
comp104777_c0_seq1
comp99083_c1_seq1
comp110641_c0_seq2
comp90591_c0_seq3
comp103124_c0_seq1
comp96887_c1_seq8
comp98158_c1_seq4
comp112516_c0_seq4
comp90591_c0_seq1
comp101683_c0_seq1
comp105106_c1_seq3
comp96125_c0_seq6
comp112483_c0_seq6
comp101909_c0_seq3
comp106285_c0_seq4
comp105040_c2_seq2
comp96838_c0_seq2
comp92433_c0_seq1
comp101922_c0_seq4
comp101922_c0_seq2
comp111965_c0_seq1
comp105781_c0_seq4
comp112483_c0_seq11
comp112483_c0_seq7
comp97193_c0_seq1
comp112483_c0_seq4
comp109204_c1_seq1
comp108304_c0_seq2
comp79350_c0_seq1
comp107274_c0_seq2
comp105627_c2_seq4
comp105565_c0_seq2
comp101714_c1_seq1
comp89770_c0_seq1
comp112375_c0_seq6
comp97909_c0_seq1
comp104183_c0_seq1
comp110155_c0_seq5
comp93561_c0_seq1
comp103737_c0_seq1
comp75591_c0_seq1
comp106635_c0_seq1
comp112375_c0_seq4
comp105781_c0_seq5
comp112483_c0_seq9
comp112516_c0_seq2
comp112483_c0_seq2
comp96749_c0_seq1
comp88969_c0_seq1
comp103439_c0_seq3
comp106297_c0_seq1
comp97734_c1_seq1
comp96749_c0_seq2
comp108913_c0_seq3
comp108913_c0_seq1
comp96033_c0_seq1
comp102823_c5_seq2
comp55179_c0_seq1
comp98640_c0_seq1
comp84657_c0_seq4
comp84657_c0_seq5
comp90725_c0_seq1
comp90741_c0_seq1
comp112655_c0_seq1
comp107793_c1_seq1
comp100296_c0_seq3
comp94579_c0_seq6
comp95553_c0_seq1
comp91580_c1_seq2
comp111960_c0_seq4
comp112483_c0_seq3
comp101103_c0_seq1
comp102740_c0_seq1
comp99435_c1_seq1
comp107972_c0_seq2
comp97867_c0_seq1
comp97902_c0_seq3
comp105794_c0_seq8
comp106310_c0_seq7
comp106310_c0_seq10
comp79975_c0_seq2
comp105023_c0_seq4
comp106310_c0_seq8
comp86745_c0_seq1
comp107891_c0_seq2
comp111473_c0_seq1
comp104370_c0_seq5
comp83099_c0_seq1
comp111151_c0_seq1
comp98829_c2_seq4
comp89402_c0_seq1
comp96678_c0_seq1
comp106310_c0_seq3
comp102598_c0_seq2
comp97423_c0_seq1
comp98515_c0_seq1
comp108868_c2_seq3
comp97077_c0_seq1
comp97077_c0_seq4
comp101067_c0_seq1
comp85025_c0_seq1
comp97077_c0_seq3
comp107267_c1_seq2
comp108782_c0_seq3
comp106227_c1_seq1
comp83044_c0_seq1
comp92257_c0_seq2
comp105310_c2_seq5
comp97055_c0_seq1
comp97055_c0_seq2
comp100405_c2_seq2
comp86527_c0_seq1
comp109035_c0_seq1
comp102403_c0_seq10
comp92372_c0_seq1
comp88651_c0_seq1
comp111543_c0_seq7
comp111422_c3_seq1
comp109237_c1_seq1
comp75880_c0_seq1
comp103824_c0_seq11
comp99703_c1_seq1
comp73244_c0_seq1
comp105794_c0_seq6
comp106192_c1_seq1
comp104264_c0_seq2
comp104894_c3_seq1
comp106245_c2_seq3
comp104825_c0_seq4
comp104894_c3_seq2
comp100032_c0_seq1
comp96783_c1_seq1
comp98693_c0_seq2
comp96783_c1_seq3
comp111174_c1_seq5
comp109161_c1_seq15
comp84657_c0_seq6
comp103743_c1_seq2
comp98729_c0_seq1
comp86914_c0_seq1
comp103743_c1_seq1
comp94958_c0_seq1
comp92377_c0_seq1
comp109784_c1_seq5
comp109784_c1_seq1
comp93383_c0_seq2
comp97895_c0_seq8
comp95597_c0_seq1
comp106510_c0_seq1
comp95597_c0_seq2
comp101554_c0_seq1
comp78259_c0_seq2
comp104284_c0_seq4
comp102354_c0_seq1
comp106999_c0_seq1
comp108106_c1_seq5
comp104233_c4_seq4
comp104233_c4_seq9
comp92765_c0_seq1
comp71953_c0_seq1
comp92961_c0_seq3
comp104701_c0_seq1
comp97599_c0_seq2
comp98637_c0_seq1
comp111111_c2_seq1
comp103332_c0_seq3
comp104737_c0_seq1
comp110155_c0_seq1
comp104284_c0_seq1
comp111543_c0_seq3
comp105501_c0_seq13
comp111111_c1_seq2
comp91664_c0_seq1
comp104284_c0_seq2
comp101985_c0_seq4
comp104425_c0_seq1
comp85712_c0_seq1
comp97803_c0_seq1
comp64793_c0_seq1
comp71566_c0_seq1
comp93814_c0_seq3
comp99517_c0_seq2
comp93814_c0_seq1
comp105247_c0_seq1
comp111230_c0_seq6
comp101985_c0_seq2
comp98829_c2_seq3
comp103167_c0_seq1
comp97520_c0_seq1
comp94248_c0_seq1
comp97599_c0_seq1
comp104284_c0_seq3
comp104602_c0_seq1
comp104602_c0_seq4
comp104602_c0_seq2
comp98048_c1_seq2
comp103489_c0_seq1
comp102278_c0_seq2
comp110439_c0_seq1
comp105676_c0_seq1
comp107770_c0_seq2
comp103350_c0_seq1
comp104602_c0_seq3
comp103588_c1_seq1
comp103588_c0_seq1
comp111646_c0_seq2
comp112516_c0_seq1
comp95190_c0_seq1
comp106424_c0_seq3
comp111646_c0_seq5
comp104231_c0_seq1
comp111646_c0_seq3
comp101696_c0_seq5
comp104231_c0_seq7
comp86017_c0_seq1
comp97909_c0_seq2
comp105886_c1_seq1
comp108888_c0_seq5
comp84704_c0_seq1
comp99952_c3_seq1
comp97520_c0_seq2
comp106424_c0_seq2
comp105683_c0_seq3
comp104231_c0_seq3
comp106424_c0_seq1
comp105274_c1_seq4
comp106099_c0_seq1
comp97741_c0_seq1
comp108843_c0_seq1
comp104386_c0_seq7
comp110213_c0_seq3
comp97077_c0_seq2
comp106691_c0_seq4
comp96615_c0_seq1
comp105310_c2_seq1
comp107334_c2_seq6
comp111664_c0_seq8
comp94483_c0_seq1
comp108998_c0_seq3
comp108843_c0_seq2
comp108782_c0_seq2
comp105886_c1_seq3
comp104370_c0_seq1
comp97215_c0_seq1
comp105310_c2_seq13
comp92598_c0_seq1
comp97754_c0_seq2
comp99413_c0_seq1
comp105794_c0_seq5
comp94483_c1_seq1
comp108846_c0_seq1
comp112880_c0_seq1
comp111664_c0_seq3
comp109398_c1_seq1
comp104059_c0_seq2
comp104140_c0_seq1
comp109970_c1_seq2
comp109970_c1_seq1
comp101369_c0_seq1
comp111664_c0_seq2
comp111664_c0_seq5
comp111664_c0_seq4
comp111432_c2_seq3
comp111664_c0_seq7
comp111664_c0_seq1
comp111664_c0_seq6
comp111664_c0_seq9
comp108626_c0_seq5
comp103758_c0_seq2
comp103758_c0_seq1
comp109564_c0_seq3
comp93478_c0_seq1
comp101442_c0_seq1
comp107955_c0_seq15
comp103830_c0_seq1
comp111807_c0_seq7
comp102452_c0_seq3
comp111865_c1_seq1
comp111865_c1_seq2
comp112752_c0_seq14
comp82755_c0_seq1
comp92232_c3_seq1
comp100021_c0_seq1
comp89440_c1_seq1
comp108292_c0_seq3
comp112982_c0_seq2
comp110739_c0_seq1
comp101804_c0_seq8
comp107439_c0_seq1
comp105310_c2_seq6
comp97767_c0_seq1
comp109908_c2_seq2
comp32549_c0_seq1
comp100439_c2_seq9
comp112413_c0_seq11
comp97989_c0_seq1
comp112413_c0_seq4
comp112413_c0_seq16
comp111692_c0_seq19
comp90532_c0_seq2
comp104425_c0_seq2
comp111692_c0_seq28
comp101571_c0_seq1
comp110445_c0_seq1
comp106459_c0_seq4
comp110213_c0_seq1
comp105794_c0_seq1
comp101804_c0_seq17
comp108721_c0_seq10
comp102094_c2_seq6
comp34039_c0_seq1
comp102419_c3_seq3
comp89279_c0_seq1
comp109035_c0_seq3
comp106133_c0_seq2
comp106691_c0_seq7
comp100519_c0_seq3
comp101442_c0_seq2
comp109035_c0_seq2
comp81109_c1_seq2
comp106003_c0_seq12
comp104430_c0_seq3
comp111432_c2_seq2
comp89474_c0_seq1
comp93580_c0_seq1
comp108626_c0_seq11
comp108626_c0_seq8
comp108626_c0_seq7
comp108016_c0_seq4
comp98124_c0_seq1
comp86021_c0_seq1
comp87413_c0_seq1
comp88224_c0_seq4
comp108779_c1_seq2
comp92066_c0_seq1
comp108754_c2_seq6
comp112595_c0_seq2
comp110445_c0_seq2
comp88832_c0_seq2
comp105794_c0_seq7
comp105090_c0_seq6
comp101510_c0_seq1
comp89236_c0_seq1
comp98413_c0_seq1
comp109775_c1_seq2
comp103425_c0_seq3
comp100817_c1_seq1
comp92257_c0_seq1
comp100807_c0_seq2
comp103425_c0_seq2
comp94456_c1_seq1
comp109822_c0_seq1
comp106225_c0_seq5
comp100172_c0_seq2
comp108281_c0_seq1
comp109199_c1_seq2
comp90622_c0_seq1
comp51844_c0_seq1
comp96365_c0_seq3
comp100133_c0_seq1
comp96329_c0_seq21
comp89006_c0_seq1
comp71373_c0_seq1
comp57994_c0_seq1
comp101872_c0_seq1
comp106029_c1_seq1
comp105132_c0_seq2
comp93651_c0_seq1
comp87132_c0_seq2
comp38217_c0_seq1
comp104244_c0_seq1
comp109775_c1_seq11
comp106008_c0_seq1
comp59400_c0_seq1
comp105429_c0_seq2
comp96014_c0_seq1
comp104826_c0_seq1
comp107485_c0_seq11
comp98941_c0_seq1
comp28348_c0_seq1
comp108937_c0_seq16
comp111377_c0_seq5
comp104826_c0_seq3
comp105429_c2_seq1
comp93660_c0_seq2
comp100040_c0_seq3
comp100040_c0_seq4
comp98941_c0_seq2
comp98941_c0_seq3
comp107485_c0_seq10
comp107485_c0_seq17
comp108937_c0_seq14
comp101389_c1_seq8
comp90017_c0_seq1
comp101389_c1_seq10
comp100040_c0_seq2
comp112315_c0_seq5
comp98389_c0_seq1
comp104991_c0_seq1
comp105482_c0_seq1
comp101389_c1_seq11
comp112221_c0_seq2
comp93230_c0_seq3
comp111377_c0_seq6
comp101757_c0_seq1
comp79223_c0_seq1
comp93391_c0_seq1
comp107485_c0_seq8
comp107485_c0_seq20
comp104679_c0_seq2
comp96884_c0_seq1
comp104679_c0_seq1
comp111533_c3_seq8
comp104679_c0_seq4
comp104679_c0_seq3
comp111533_c3_seq1
comp82488_c0_seq1
comp100040_c0_seq1
comp112221_c0_seq3
comp105697_c1_seq1
comp102996_c0_seq2
comp111377_c0_seq4
comp107485_c0_seq5
comp104679_c0_seq5
comp107485_c0_seq19
comp107485_c0_seq21
comp84718_c0_seq1
comp107485_c0_seq15
comp114490_c0_seq1
comp96526_c2_seq3
comp106558_c1_seq1
comp109765_c0_seq7
comp94652_c0_seq1
comp105176_c0_seq2
comp110908_c0_seq1
comp100857_c0_seq2
comp101447_c0_seq2
comp111533_c3_seq4
comp66561_c0_seq1
comp111533_c3_seq9
comp111533_c3_seq3
comp90540_c0_seq1
comp86295_c0_seq1
comp91917_c1_seq1
comp111533_c3_seq5
comp104231_c0_seq6
comp55567_c0_seq1
comp107485_c0_seq12
comp99259_c3_seq4
comp111087_c0_seq3
comp111116_c0_seq1
comp103788_c0_seq25
comp92523_c0_seq1
comp111377_c0_seq1
comp88490_c0_seq1
comp106840_c0_seq3
comp55343_c0_seq1
comp99338_c0_seq1
comp80556_c0_seq1
comp98375_c0_seq1
comp98375_c0_seq2
comp109188_c0_seq3
comp23992_c0_seq1
comp109685_c0_seq1
comp103788_c0_seq16
comp93867_c0_seq1
comp109685_c0_seq3
comp110506_c0_seq10
comp78733_c0_seq1
comp104982_c0_seq1
comp110506_c0_seq9
comp102417_c0_seq4
comp87979_c0_seq1
comp87979_c0_seq2
comp102417_c0_seq1
comp51377_c0_seq1
comp102417_c0_seq2
comp97234_c0_seq2
comp79066_c0_seq1
comp43486_c0_seq1
comp97234_c0_seq1
comp77944_c1_seq2
comp69318_c0_seq1
comp98752_c1_seq1
comp104231_c0_seq8
comp86733_c0_seq1
comp92821_c0_seq2
comp91917_c0_seq1
comp108121_c0_seq7
comp92332_c0_seq1
comp104910_c0_seq3
comp107546_c1_seq6
comp107546_c1_seq8
comp105898_c0_seq4
comp110077_c0_seq1
comp104518_c2_seq1
comp102076_c0_seq1
comp104518_c2_seq2
comp103630_c2_seq1
comp89604_c0_seq1
comp102076_c0_seq2
comp106249_c0_seq2
comp92332_c0_seq2
comp92821_c0_seq1
comp111934_c0_seq9
comp96222_c1_seq5
comp111934_c0_seq5
comp108221_c0_seq1
comp88587_c1_seq1
comp70520_c0_seq1
comp103903_c0_seq4
comp103539_c1_seq6
comp115096_c0_seq1
comp88846_c1_seq3
comp86154_c1_seq1
comp104287_c1_seq8
comp64719_c0_seq1
comp111934_c0_seq8
comp91917_c2_seq1
comp99324_c0_seq2
comp100501_c0_seq1
comp105314_c1_seq4
comp97536_c1_seq2
comp108496_c0_seq5
comp106839_c0_seq7
comp105898_c0_seq2
comp101021_c0_seq1
comp104256_c0_seq2
comp99945_c1_seq3
comp86172_c0_seq1
comp111533_c3_seq7
comp111533_c3_seq2
comp111533_c3_seq6
comp84380_c1_seq1
comp93100_c0_seq1
comp95459_c0_seq1
comp108407_c1_seq3
comp97169_c0_seq2
comp108834_c0_seq3
comp104471_c0_seq1
comp105898_c0_seq1
comp105898_c0_seq3
comp99778_c0_seq1
comp108256_c0_seq2
comp106839_c0_seq2
comp70131_c0_seq1
comp92875_c0_seq1
comp56265_c0_seq2
comp56265_c0_seq4
comp99047_c0_seq8
comp56265_c0_seq3
comp71038_c0_seq2
comp107177_c0_seq1
comp80417_c0_seq2
comp77832_c0_seq2
comp34280_c0_seq1
comp94542_c1_seq1
comp100875_c2_seq1
comp88270_c0_seq1
comp102777_c0_seq1
comp95178_c0_seq2
comp36910_c0_seq1
comp51736_c1_seq1
comp82641_c0_seq1
comp107383_c3_seq2
comp88901_c0_seq1
comp107648_c0_seq1
comp110962_c1_seq1
comp81810_c0_seq1
comp101071_c3_seq6
comp60179_c1_seq1
comp69042_c0_seq1
comp114600_c0_seq1
comp21942_c0_seq1
comp53269_c0_seq1
comp109226_c0_seq10
comp95041_c0_seq1
comp90471_c0_seq1
comp103556_c1_seq6
comp108587_c0_seq1
comp71560_c0_seq7
comp34622_c0_seq1
comp109789_c0_seq3
comp109946_c0_seq9
comp104257_c0_seq2
comp23555_c0_seq1
comp21299_c0_seq1
comp114675_c0_seq1
comp86773_c0_seq2
comp86736_c0_seq1
comp92461_c0_seq4
comp100403_c0_seq1
comp106260_c0_seq2
comp91432_c0_seq2
comp111749_c1_seq1
comp93281_c1_seq3
comp71241_c0_seq1
comp35161_c0_seq1
comp92461_c0_seq3
comp92128_c0_seq1
comp104287_c1_seq7
comp97615_c0_seq5
comp88587_c1_seq2
comp101939_c0_seq3
comp92821_c0_seq5
comp98512_c0_seq1
comp98720_c0_seq8
comp92821_c0_seq4
comp20700_c0_seq1
comp79845_c0_seq1
comp24724_c0_seq1
comp89598_c0_seq1
comp78961_c0_seq2
comp99284_c0_seq1
comp100134_c0_seq2
comp92461_c0_seq2
comp28379_c0_seq1
comp35921_c0_seq1
comp93156_c0_seq9
comp98375_c0_seq4
comp109570_c0_seq4
comp35174_c0_seq1
comp95667_c1_seq1
comp66635_c0_seq1
comp96353_c0_seq1
comp102056_c0_seq20
comp97027_c0_seq19
comp106184_c0_seq1
comp29830_c0_seq1
comp109429_c2_seq13
comp83260_c0_seq1
comp102446_c0_seq1
comp110481_c1_seq2
comp99506_c1_seq13
comp69873_c0_seq1
comp99259_c3_seq8
comp84690_c0_seq2
comp111797_c1_seq14
comp32172_c0_seq1
comp85171_c0_seq2
comp103986_c0_seq1
comp61529_c0_seq1
comp102983_c0_seq3
comp92821_c0_seq3
comp72295_c0_seq1
comp104616_c0_seq3
comp93383_c0_seq4
comp104616_c0_seq2
comp94223_c0_seq1
comp33476_c0_seq1
comp106375_c0_seq1
comp102748_c0_seq5
comp63924_c0_seq1
comp57280_c0_seq1
comp102748_c0_seq2
comp99527_c0_seq1
comp99768_c0_seq3
comp32656_c0_seq1
comp53366_c0_seq1
comp81300_c0_seq1
comp102748_c0_seq6
comp23630_c0_seq1
comp102748_c0_seq4
comp97171_c0_seq1
comp99600_c0_seq16
comp110372_c0_seq1
comp97439_c0_seq4
comp104231_c0_seq9
comp100447_c0_seq1
comp56158_c0_seq1
comp106972_c0_seq8
comp89576_c0_seq1
comp103332_c0_seq7
comp97356_c0_seq1
comp98829_c2_seq10
comp108668_c0_seq2
comp90404_c0_seq1
comp95463_c0_seq1
comp100677_c0_seq1
comp109334_c0_seq1
comp103539_c1_seq8
comp101902_c0_seq4
comp80758_c0_seq1
comp103539_c1_seq12
comp103539_c1_seq1
comp103539_c1_seq13
comp103539_c1_seq11
comp101594_c0_seq4
comp30060_c0_seq1
comp11344_c0_seq1
comp114539_c0_seq1
comp59447_c0_seq1
comp59445_c0_seq1
comp52857_c0_seq1
comp54285_c0_seq1
comp51847_c0_seq1
comp63126_c0_seq1
comp73554_c0_seq1
comp111790_c0_seq6
comp34608_c0_seq1
comp89875_c0_seq1
comp93502_c0_seq2
comp93156_c0_seq3
comp81645_c0_seq1
comp104231_c0_seq2
comp77273_c0_seq1
comp114301_c0_seq1
comp66723_c0_seq2
comp106217_c0_seq5
comp77832_c0_seq1
comp64675_c0_seq2
comp78585_c0_seq1
comp98267_c0_seq1
comp115787_c0_seq1
comp340_c0_seq1
comp77187_c0_seq1
comp88736_c0_seq3
comp98098_c2_seq2
comp88736_c0_seq1
comp61003_c0_seq1
comp82945_c0_seq1
comp5225_c0_seq1
comp60478_c0_seq2
comp114729_c0_seq1
comp27769_c0_seq1
comp98606_c1_seq7
comp88086_c0_seq3
comp67568_c0_seq1
comp100156_c0_seq1
comp54641_c0_seq1
comp64564_c0_seq1
comp35129_c0_seq1
comp93502_c0_seq1
comp81391_c0_seq2
comp95520_c0_seq11
comp52313_c0_seq1
comp93156_c0_seq4
comp112618_c0_seq29
comp106777_c0_seq5
comp106092_c0_seq3
comp43523_c0_seq1
comp60347_c0_seq1
comp103539_c1_seq4
comp60067_c0_seq1
comp89365_c0_seq1
comp111790_c0_seq3
comp91819_c0_seq1
comp115614_c0_seq1
comp78733_c0_seq2
comp110506_c0_seq5
comp108834_c0_seq8
comp108221_c0_seq2
comp86154_c0_seq1
comp109718_c1_seq2
comp88587_c1_seq5
comp109429_c2_seq14
comp103539_c1_seq10
comp114260_c0_seq1
comp103539_c1_seq5
comp77101_c0_seq1
comp34639_c0_seq1
comp89823_c0_seq1
comp86962_c0_seq1
comp85107_c0_seq1
comp93711_c0_seq1
comp72497_c0_seq1
comp114282_c0_seq1
comp89771_c1_seq1
comp93262_c0_seq1
comp100875_c0_seq1
comp91938_c0_seq2
comp100875_c0_seq2
comp108687_c0_seq8
comp79020_c0_seq1
comp91938_c0_seq1
comp73999_c0_seq1
comp60209_c0_seq1
comp91819_c0_seq3
comp93013_c0_seq2
comp81645_c0_seq2
comp74977_c0_seq1
comp74003_c0_seq1
comp107574_c0_seq1
comp80567_c0_seq1
comp106705_c1_seq2
comp110059_c0_seq2
comp97202_c0_seq5
comp108221_c0_seq5
comp97202_c0_seq1
comp105751_c1_seq1
comp108221_c0_seq4
comp97202_c0_seq2
comp88225_c0_seq1
comp91071_c0_seq1
comp91071_c0_seq2
comp107509_c0_seq1
comp111597_c0_seq4
comp96021_c0_seq1
comp83408_c0_seq1
comp108395_c0_seq1
comp88347_c0_seq1
comp93078_c0_seq1
comp104018_c0_seq2
comp46292_c0_seq1
comp110043_c0_seq13
comp110413_c0_seq5
comp81717_c0_seq1
comp102848_c0_seq4
comp103332_c0_seq1
comp110413_c0_seq4
comp102998_c1_seq10
comp110413_c0_seq3
comp97423_c0_seq2
comp100978_c0_seq2
comp102303_c0_seq3
comp102395_c0_seq1
comp100978_c0_seq1
comp106192_c1_seq4
comp73753_c0_seq1
comp87864_c0_seq1
comp97350_c0_seq1
comp100399_c0_seq1
comp104387_c2_seq1
comp106192_c1_seq5
comp70718_c0_seq1
comp100439_c2_seq7
comp105886_c1_seq2
comp100439_c2_seq4
comp104897_c0_seq1
comp107539_c0_seq10
comp104897_c0_seq2
comp100090_c0_seq1
comp99616_c0_seq2
comp110413_c0_seq1
comp105607_c0_seq3
comp104717_c0_seq2
comp86298_c0_seq1
comp111507_c1_seq1
comp99372_c0_seq2
comp92212_c0_seq1
comp101708_c0_seq3
comp111673_c0_seq12
comp69232_c0_seq1
comp92212_c1_seq1
comp102139_c0_seq5
comp90409_c0_seq1
comp97156_c0_seq1
comp35671_c0_seq1
comp103780_c0_seq2
comp104913_c0_seq4
comp104913_c0_seq6
comp104913_c0_seq5
comp104913_c0_seq1
comp103054_c0_seq3
comp108486_c0_seq1
comp105607_c0_seq1
comp90804_c0_seq1
comp105850_c0_seq8
comp100022_c0_seq3
comp52271_c0_seq1
comp24197_c0_seq1
comp98256_c0_seq2
comp21981_c0_seq1
comp94480_c0_seq2
comp5813_c0_seq1
comp98606_c1_seq4
comp96250_c0_seq1
comp100206_c0_seq1
comp73924_c1_seq1
comp98867_c0_seq2
comp101708_c0_seq1
comp69215_c0_seq1
comp75689_c0_seq2
comp95275_c3_seq8
comp103167_c0_seq2
comp97439_c0_seq1
comp108292_c0_seq1
comp103905_c0_seq4
comp104231_c0_seq10
comp106389_c0_seq3
comp110413_c0_seq2
comp106972_c0_seq7
comp98351_c1_seq10
comp106972_c0_seq5
comp65879_c0_seq2
comp99256_c1_seq3
comp97418_c0_seq2
comp107527_c0_seq8
comp104140_c0_seq5
comp106638_c0_seq4
comp100381_c0_seq1
comp102277_c0_seq1
comp100447_c0_seq2
comp84657_c0_seq1
comp105753_c0_seq4
comp105753_c0_seq2
comp107552_c2_seq13
comp89440_c0_seq1
comp103066_c1_seq1
comp111629_c0_seq7
comp107552_c2_seq6
comp103863_c2_seq3
comp103863_c2_seq2
comp33054_c0_seq1
comp110249_c1_seq1
comp105356_c0_seq2
comp38615_c0_seq2
comp106638_c0_seq3
comp84657_c0_seq2
comp84657_c0_seq3
comp100812_c0_seq5
comp109784_c1_seq3
comp109784_c1_seq4
comp103905_c0_seq7
comp111678_c3_seq1
comp106375_c0_seq2
comp92584_c0_seq2
comp105859_c0_seq2
comp105859_c0_seq1
comp97615_c0_seq1
comp92584_c0_seq1
comp110059_c1_seq1
comp106972_c0_seq1
comp89146_c0_seq3
comp97202_c0_seq3
comp110059_c1_seq2
comp110413_c0_seq7
comp102748_c0_seq1
comp94837_c1_seq1
comp91565_c0_seq1
comp111543_c0_seq1
comp95712_c2_seq1
comp111131_c1_seq2
comp94731_c0_seq1
comp105274_c1_seq10
comp105274_c1_seq2
comp102848_c0_seq5
comp103332_c0_seq6
comp105274_c1_seq11
comp110903_c0_seq4
comp90163_c0_seq1
comp110903_c0_seq7
comp95991_c0_seq1
comp110903_c0_seq2
comp99430_c0_seq2
comp107374_c0_seq1
comp95002_c0_seq1
comp102983_c0_seq4
comp101593_c0_seq2
comp79772_c0_seq3
comp91313_c0_seq1
comp88470_c0_seq1
comp102748_c0_seq8
comp110675_c1_seq1
comp104018_c0_seq1
comp101621_c0_seq1
comp84955_c0_seq1
comp56309_c0_seq1
comp103584_c0_seq3
comp108982_c0_seq3
comp579_c0_seq1
comp87790_c1_seq1
comp102431_c0_seq3
comp105451_c0_seq2
comp59108_c0_seq1
comp83228_c0_seq1
comp109207_c1_seq5
comp93552_c0_seq1
comp90016_c0_seq1
comp98375_c0_seq6
comp106267_c1_seq2
comp35602_c0_seq1
comp104927_c1_seq5
comp94551_c0_seq1
comp114799_c0_seq1
comp92068_c0_seq2
comp90016_c0_seq2
comp2408_c0_seq1
comp23773_c0_seq1
comp107527_c0_seq26
comp101593_c0_seq1
comp94067_c0_seq1
comp99164_c0_seq1
comp108779_c1_seq4
comp98256_c0_seq1
comp106926_c0_seq8
comp44561_c0_seq1
comp48527_c0_seq1
comp75050_c0_seq1
comp98207_c0_seq1
comp92510_c0_seq2
comp60628_c1_seq1
comp86865_c0_seq1
comp86865_c0_seq3
comp105917_c0_seq1
comp86384_c0_seq1
comp35636_c0_seq1
comp118194_c0_seq1
comp24142_c0_seq1
comp97202_c0_seq4
comp74988_c0_seq1
comp100574_c0_seq3
comp2002_c0_seq1
comp95178_c0_seq3
comp116128_c0_seq1
comp69982_c0_seq1
comp115934_c0_seq1
comp115992_c0_seq1
comp102150_c0_seq1
comp104440_c0_seq1
comp80046_c0_seq2
comp43906_c0_seq1
comp81391_c0_seq1
comp108876_c0_seq4
comp76512_c0_seq1
comp107385_c0_seq2
comp69828_c0_seq1
comp104913_c0_seq7
comp108834_c0_seq13
comp93876_c0_seq1
comp103141_c0_seq8
comp108876_c0_seq6
comp78718_c0_seq1
comp102921_c0_seq7
comp82618_c0_seq1
comp105850_c0_seq13
comp101262_c3_seq1
comp68167_c0_seq1
comp74710_c0_seq2
comp103054_c0_seq7
comp91059_c0_seq1
comp63402_c0_seq1
comp99257_c0_seq1
comp110903_c0_seq8
comp85660_c0_seq1
comp108307_c0_seq2
comp106840_c0_seq2
comp101662_c1_seq6
comp110506_c0_seq1
comp110506_c0_seq8
comp109188_c0_seq4
comp22223_c0_seq1
comp103054_c0_seq6
comp108486_c0_seq2
comp60478_c0_seq1
comp57280_c0_seq2
comp92461_c0_seq1
comp103141_c0_seq6
comp67623_c0_seq1
comp20691_c0_seq1
comp111951_c0_seq10
comp102613_c2_seq2
comp34276_c0_seq2
comp95806_c0_seq1
comp90044_c0_seq3
comp91016_c0_seq1
comp89886_c0_seq1
comp89484_c0_seq1
comp51736_c0_seq1
comp71089_c0_seq1
comp104913_c0_seq2
comp96721_c0_seq2
comp1899_c0_seq1
comp95806_c3_seq1
comp109789_c0_seq2
comp100574_c0_seq4
comp107383_c3_seq3
comp61109_c0_seq1
comp81391_c0_seq3
comp21602_c0_seq1
comp62932_c0_seq2
comp89771_c0_seq1
comp2438_c0_seq1
comp96396_c0_seq1
comp90173_c0_seq1
comp84309_c0_seq1
comp92209_c0_seq2
comp111461_c0_seq1
comp18018_c0_seq1
comp74679_c0_seq1
comp98207_c0_seq2
comp87426_c0_seq1
comp118469_c0_seq1
comp83981_c0_seq1
comp73871_c0_seq2
comp104511_c0_seq2
comp106217_c0_seq6
comp54039_c0_seq1
comp111727_c0_seq4
comp96721_c0_seq1
comp112618_c0_seq20
comp98238_c0_seq1
comp111790_c0_seq1
comp88060_c0_seq1
comp103539_c1_seq2
comp106217_c0_seq4
comp98098_c2_seq1
comp89790_c1_seq1
comp70661_c0_seq1
comp90044_c0_seq2
comp89717_c0_seq6
comp85793_c0_seq1
comp35828_c0_seq1
comp72098_c0_seq1
comp41475_c0_seq1
comp79135_c0_seq1
comp80274_c0_seq1
comp111597_c0_seq3
comp112618_c0_seq14
comp112618_c0_seq6
comp112618_c0_seq32
comp112618_c0_seq24
comp112618_c0_seq8
comp112618_c0_seq19
comp112618_c0_seq5
comp112618_c0_seq13
comp112618_c0_seq3
comp112618_c0_seq7
comp112618_c0_seq11
comp108468_c0_seq7
comp112618_c0_seq30
comp112618_c0_seq27
comp112618_c0_seq21
comp112618_c0_seq12
comp112618_c0_seq22
comp112618_c0_seq16
comp112618_c0_seq1
comp112618_c0_seq15
comp112618_c0_seq26
comp112618_c0_seq33
comp112618_c0_seq17
comp112618_c0_seq18
comp112618_c0_seq31
comp112618_c0_seq28
comp112618_c0_seq25
comp112618_c0_seq23
comp112618_c0_seq2
comp112618_c0_seq4
comp112618_c0_seq10
comp112618_c0_seq34
comp103734_c2_seq4
comp116816_c0_seq1
comp93013_c0_seq1
comp44610_c0_seq1
comp114811_c0_seq1
comp112618_c0_seq9
comp89717_c0_seq4
comp101662_c1_seq5
comp89717_c0_seq3
comp71081_c0_seq1
comp103398_c1_seq4
comp23668_c0_seq1
comp91032_c0_seq1
comp102840_c1_seq1
comp103054_c0_seq9
comp111930_c0_seq1
comp98048_c2_seq2
comp108307_c0_seq8
comp106571_c0_seq1
comp104913_c0_seq3
comp109712_c0_seq2
comp107439_c2_seq1
comp109715_c0_seq5
comp109715_c0_seq6
comp88224_c0_seq2
comp100133_c1_seq1
comp112048_c1_seq4
comp109970_c0_seq4
comp108756_c0_seq8
comp104047_c1_seq1
comp106981_c0_seq4
comp62290_c0_seq2
comp105686_c1_seq4
comp97409_c0_seq2
comp109712_c0_seq8
comp109712_c0_seq4
comp106981_c0_seq3
comp108756_c0_seq4
comp109970_c0_seq1
comp109199_c0_seq1
comp111146_c1_seq1
comp104195_c0_seq1
comp96223_c0_seq1
comp94356_c0_seq1
comp108193_c0_seq2
comp106015_c0_seq1
comp111845_c0_seq2
comp91243_c0_seq1
comp106133_c0_seq1
comp101969_c1_seq1
comp100510_c0_seq1
comp113647_c0_seq1
comp108756_c0_seq6
comp101969_c1_seq5
comp77651_c0_seq1
comp100439_c2_seq8
comp111673_c0_seq16
comp89770_c1_seq1
comp97527_c0_seq1
comp108888_c0_seq6
comp111543_c0_seq4
comp108756_c0_seq2
comp97855_c0_seq1
comp104840_c0_seq2
comp105274_c1_seq8
comp105274_c1_seq15
comp99616_c0_seq1
comp105274_c1_seq7
comp113847_c0_seq1
comp105274_c1_seq14
comp105274_c1_seq9
comp105274_c1_seq3
comp108888_c0_seq4
comp105274_c1_seq13
comp64379_c0_seq1
comp103612_c1_seq4
comp105274_c1_seq12
comp105274_c1_seq6
comp105274_c1_seq1
comp88224_c0_seq6
comp101148_c0_seq3
comp97641_c0_seq1
comp112724_c1_seq1
comp99122_c0_seq1
comp79848_c0_seq2
comp103485_c0_seq1
comp105274_c1_seq5
comp105850_c0_seq6
comp105459_c0_seq1
comp106981_c0_seq1
comp91154_c0_seq1
comp62696_c0_seq1
comp105683_c0_seq1
comp98325_c0_seq1
comp110213_c0_seq6
comp100990_c0_seq1
comp105030_c2_seq2
comp109359_c0_seq2
comp112091_c1_seq15
comp112091_c1_seq7
comp112091_c1_seq2
comp112091_c1_seq18
comp107955_c0_seq6
comp100990_c0_seq2
comp99164_c0_seq2
comp114802_c0_seq1
comp107527_c0_seq7
comp104927_c1_seq1
comp104927_c1_seq3
comp104927_c1_seq2
comp106972_c0_seq4
comp70492_c0_seq1
comp104697_c0_seq2
comp103857_c0_seq2
comp111223_c0_seq1
comp110903_c0_seq5
comp76434_c0_seq2
comp121306_c0_seq1
comp110397_c0_seq5
comp110397_c0_seq17
comp74580_c0_seq2
comp67976_c0_seq1
comp103857_c0_seq1
comp72080_c0_seq2
comp86243_c0_seq2
comp100812_c0_seq1
comp81687_c0_seq1
comp109926_c0_seq1
comp92480_c0_seq1
comp93599_c0_seq3
comp86440_c0_seq1
comp111165_c0_seq10
comp55443_c0_seq1
comp63445_c1_seq1
comp100788_c0_seq1
comp43508_c0_seq1
comp83399_c0_seq1
comp60346_c0_seq1
comp112091_c1_seq8
comp106197_c0_seq3
comp69967_c1_seq1
comp112091_c1_seq9
comp68361_c0_seq1
comp65727_c0_seq1
comp112091_c1_seq22
comp107527_c0_seq13
comp106197_c0_seq1
comp101593_c0_seq3
comp89300_c0_seq1
comp108868_c5_seq1
comp89005_c0_seq2
comp102136_c0_seq2
comp104840_c0_seq1
comp90948_c0_seq1
comp90616_c0_seq1
comp101593_c0_seq4
comp112091_c1_seq11
comp112091_c1_seq3
comp114597_c0_seq1
comp95516_c0_seq1
comp99899_c2_seq2
comp86800_c0_seq1
comp83961_c0_seq1
comp105176_c0_seq3
comp81547_c0_seq1
comp96558_c0_seq1
comp105176_c0_seq1
comp34335_c0_seq1
comp111165_c0_seq21
comp100484_c0_seq1
comp112795_c0_seq11
comp112795_c0_seq10
comp100574_c0_seq1
comp105451_c0_seq3
comp117082_c0_seq1
comp84535_c0_seq1
comp101662_c1_seq9
comp27769_c0_seq2
comp103612_c0_seq5
comp89717_c0_seq1
comp103905_c0_seq5
comp100210_c0_seq3
comp44105_c0_seq1
comp83228_c0_seq2
comp77872_c0_seq1
comp118090_c0_seq1
comp118635_c0_seq1
comp103539_c1_seq3
comp85623_c0_seq1
comp88736_c0_seq2
comp43908_c0_seq1
comp78535_c0_seq1
comp66035_c0_seq2
comp103612_c0_seq15
comp86351_c0_seq1
comp88861_c0_seq1
comp106571_c0_seq2
comp81070_c0_seq1
comp80148_c0_seq1
comp105664_c0_seq2
comp105694_c0_seq5
comp78118_c0_seq1
comp33782_c0_seq2
comp103893_c0_seq2
comp67667_c0_seq1
comp103893_c0_seq1
comp27815_c0_seq1
comp85028_c0_seq1
comp24445_c0_seq1
comp28884_c0_seq1
comp100410_c0_seq1
comp67426_c0_seq1
comp65816_c0_seq1
comp111461_c0_seq2
comp109519_c0_seq8
comp103788_c0_seq7
comp111461_c0_seq3
comp103788_c0_seq11
comp103788_c0_seq24
comp103788_c0_seq2
comp87475_c0_seq1
comp103788_c0_seq27
comp92172_c0_seq1
comp108834_c0_seq6
comp103788_c0_seq19
comp103788_c0_seq14
comp107485_c0_seq18
comp103788_c0_seq17
comp103788_c0_seq20
comp29458_c0_seq1
comp89085_c0_seq1
comp106705_c1_seq3
comp104826_c0_seq2
comp98492_c0_seq3
comp109926_c0_seq7
comp108834_c0_seq10
comp110908_c0_seq2
comp110506_c0_seq12
comp22796_c0_seq1
comp69134_c0_seq1
comp103788_c0_seq21
comp107485_c0_seq6
comp107485_c0_seq9
comp96558_c0_seq3
comp109519_c0_seq12
comp74574_c0_seq1
comp112379_c0_seq3
comp100748_c0_seq3
comp72069_c0_seq1
comp108007_c2_seq1
comp112705_c0_seq4
comp112379_c0_seq4
comp112379_c0_seq1
comp109922_c0_seq3
comp110313_c1_seq5
comp110313_c1_seq1
comp100545_c0_seq2
comp106972_c0_seq2
comp97439_c0_seq3
comp83272_c0_seq1
comp108132_c1_seq2
comp98409_c1_seq1
comp58782_c0_seq1
comp107770_c0_seq7
comp103804_c1_seq1
comp107770_c0_seq6
comp102840_c1_seq2
comp108610_c0_seq1
comp88884_c0_seq2
comp85874_c0_seq1
comp83133_c0_seq2
comp75494_c0_seq1
comp98704_c0_seq6
comp101317_c0_seq1
comp110469_c0_seq2
comp63008_c0_seq1
comp100784_c0_seq1
comp109230_c0_seq2
comp78022_c0_seq1
comp109973_c0_seq3
comp109230_c0_seq3
comp109230_c0_seq4
comp108114_c0_seq7
comp91705_c0_seq2
comp81861_c0_seq1
comp109922_c1_seq3
comp102446_c0_seq2
comp109819_c0_seq4
comp94212_c0_seq1
comp78970_c0_seq1
comp111698_c2_seq14
comp101317_c0_seq2
comp111698_c2_seq10
comp102765_c0_seq1
comp111797_c1_seq13
comp102765_c0_seq2
comp104109_c0_seq1
comp96499_c3_seq1
comp73220_c0_seq1
comp106586_c0_seq12
comp82475_c0_seq1
comp71272_c0_seq1
comp30096_c0_seq1
comp94106_c0_seq1
comp97600_c1_seq1
comp102417_c1_seq1
comp84065_c0_seq1
comp112823_c0_seq1
comp108721_c0_seq3
comp89717_c0_seq2
comp90047_c0_seq1
comp91705_c0_seq3
comp96564_c1_seq2
comp102996_c1_seq3
comp111698_c2_seq8
comp104287_c1_seq2
comp100588_c0_seq1
comp113819_c0_seq1
comp81097_c1_seq1
comp103533_c0_seq4
comp94837_c2_seq1
comp105305_c0_seq2
comp94716_c0_seq1
comp90420_c0_seq2
comp97625_c1_seq3
comp109778_c0_seq1
comp92882_c0_seq1
comp97293_c0_seq2
comp100621_c1_seq3
comp97293_c0_seq3
comp97293_c0_seq5
comp78253_c0_seq1
comp88691_c0_seq1
comp108114_c0_seq1
comp108114_c0_seq10
comp100936_c0_seq1
comp111255_c0_seq10
comp94779_c0_seq1
comp97981_c0_seq2
comp90420_c0_seq1
comp108114_c0_seq11
comp111749_c0_seq1
comp86145_c0_seq1
comp97795_c0_seq2
comp103905_c0_seq1
comp97795_c0_seq1
comp99988_c0_seq1
comp98704_c0_seq3
comp109819_c0_seq7
comp38615_c0_seq1
comp106883_c0_seq1
comp107583_c4_seq2
comp87941_c0_seq1
comp63942_c0_seq1
comp109922_c1_seq5
comp98295_c0_seq1
comp108114_c0_seq6
comp80955_c0_seq1
comp86212_c0_seq1
comp73950_c0_seq1
comp100826_c0_seq2
comp94212_c2_seq1
comp100595_c1_seq6
comp96623_c0_seq3
comp102630_c0_seq4
comp85806_c0_seq1
comp100439_c2_seq2
comp105886_c2_seq1
comp110739_c0_seq4
comp110739_c0_seq3
comp100439_c2_seq3
comp112038_c0_seq6
comp108846_c0_seq4
comp102610_c0_seq1
comp74979_c0_seq1
comp68737_c0_seq1
comp98371_c0_seq5
comp106192_c1_seq6
comp73222_c0_seq1
comp105886_c0_seq4
comp97941_c0_seq2
comp95637_c0_seq1
comp100439_c2_seq6
comp104387_c1_seq2
comp100439_c0_seq1
comp110739_c0_seq5
comp108846_c0_seq2
comp98392_c0_seq1
comp105886_c3_seq1
comp29085_c0_seq1
comp98048_c0_seq1
comp109630_c0_seq1
comp94694_c0_seq1
comp100439_c1_seq1
comp84086_c0_seq2
comp24966_c0_seq1
comp95230_c0_seq1
comp107955_c0_seq7
comp89911_c0_seq1
comp98369_c1_seq1
comp104537_c0_seq1
comp91345_c0_seq1
comp105900_c2_seq12
comp104252_c0_seq1
comp71450_c0_seq1
comp105900_c1_seq1
comp111373_c1_seq1
comp95175_c0_seq2
comp95175_c0_seq1
comp107898_c3_seq3
comp103070_c2_seq1
comp98869_c0_seq13
comp88058_c0_seq1
comp107898_c3_seq1
comp98829_c2_seq11
comp104825_c0_seq3
comp62990_c0_seq1
comp108531_c0_seq3
comp82519_c0_seq1
comp108531_c0_seq1
comp97669_c1_seq6
comp82997_c0_seq1
comp108531_c0_seq4
comp108531_c0_seq2
comp112490_c0_seq1
comp76718_c0_seq1
comp87017_c0_seq1
comp89557_c0_seq2
comp82522_c0_seq1
comp104537_c0_seq3
comp109775_c1_seq3
comp109775_c1_seq5
comp88857_c0_seq1
comp105625_c0_seq2
comp93592_c0_seq1
comp103425_c0_seq1
comp109198_c0_seq1
comp109198_c0_seq2
comp108754_c2_seq8
comp110861_c1_seq1
comp110104_c0_seq3
comp110104_c0_seq4
comp107844_c2_seq1
comp95928_c2_seq7
comp90102_c0_seq1
comp81259_c0_seq2
comp78342_c0_seq1
comp103749_c0_seq1
comp106057_c0_seq4
comp97951_c0_seq7
comp101885_c0_seq1
comp98797_c1_seq1
comp88857_c0_seq2
comp106995_c0_seq4
comp102403_c0_seq1
comp102709_c0_seq3
comp86699_c0_seq1
comp96746_c0_seq2
comp110413_c1_seq2
comp96746_c0_seq1
comp112483_c0_seq1
comp109237_c0_seq1
comp55050_c0_seq1
comp101186_c0_seq2
comp95883_c0_seq1
comp100839_c0_seq1
comp110921_c0_seq2
comp105106_c0_seq1
comp99765_c0_seq2
comp109161_c1_seq12
comp106148_c0_seq2
comp99765_c0_seq1
comp96893_c0_seq1
comp109865_c0_seq1
comp103318_c0_seq2
comp110362_c1_seq1
comp110362_c1_seq2
comp109545_c0_seq3
comp104765_c0_seq3
comp89606_c0_seq1
comp93317_c0_seq1
comp85989_c0_seq1
comp105037_c0_seq3
comp106057_c0_seq1
comp105037_c0_seq2
comp101186_c0_seq1
comp107891_c0_seq7
comp100657_c0_seq1
comp106353_c0_seq21
comp97659_c0_seq1
comp104321_c0_seq10
comp106148_c0_seq1
comp96579_c0_seq2
comp108846_c0_seq5
comp106848_c0_seq1
comp65783_c0_seq1
comp89040_c0_seq1
comp105723_c2_seq8
comp110413_c0_seq6
comp110413_c0_seq10
comp101672_c0_seq15
comp112360_c0_seq16
comp96677_c1_seq1
comp105958_c0_seq1
comp102403_c0_seq7
comp93090_c0_seq1
comp105404_c2_seq1
comp107476_c0_seq1
comp94434_c0_seq1
comp69962_c0_seq1
comp111318_c0_seq4
comp108960_c0_seq2
comp105841_c2_seq1
comp93164_c0_seq1
comp103527_c0_seq1
comp96450_c0_seq2
comp110319_c0_seq1
comp95107_c0_seq4
comp100948_c0_seq1
comp91756_c0_seq1
comp101673_c1_seq1
comp104765_c0_seq6
comp92390_c0_seq1
comp105771_c0_seq2
comp103675_c1_seq4
comp93951_c0_seq1
comp105916_c0_seq3
comp85989_c1_seq1
comp89355_c0_seq4
comp107048_c2_seq2
comp108106_c1_seq2
comp107757_c0_seq3
comp100874_c1_seq1
comp108499_c0_seq3
comp87480_c0_seq1
comp98158_c1_seq2
comp87480_c0_seq2
comp107757_c0_seq1
comp106669_c1_seq1
comp105676_c0_seq2
comp98797_c0_seq1
comp91966_c0_seq1
comp88887_c0_seq2
comp106669_c0_seq1
comp80690_c0_seq1
comp105310_c2_seq10
comp101573_c0_seq2
comp96928_c0_seq2
comp112010_c2_seq76
comp101947_c0_seq3
comp101732_c0_seq1
comp104822_c1_seq1
comp61516_c0_seq1
comp73123_c0_seq1
comp92563_c1_seq1
comp93991_c1_seq1
comp97530_c0_seq1
comp106007_c0_seq2
comp94577_c0_seq1
comp105528_c3_seq4
comp105528_c3_seq2
comp111955_c0_seq3
comp99739_c0_seq1
comp89737_c0_seq1
comp113115_c0_seq1
comp74177_c0_seq1
comp88595_c0_seq1
comp74543_c0_seq1
comp97967_c0_seq1
comp60793_c0_seq1
comp59780_c0_seq1
comp78950_c2_seq1
comp59625_c0_seq1
comp92169_c1_seq2
comp93621_c1_seq1
comp91296_c0_seq1
comp92489_c0_seq1
comp59818_c0_seq1
comp82810_c0_seq3
comp37636_c0_seq1
comp93621_c0_seq1
comp108820_c3_seq10
comp105372_c0_seq2
comp108820_c3_seq9
comp37203_c0_seq1
comp19501_c0_seq1
comp89162_c0_seq1
comp91297_c0_seq1
comp90992_c0_seq1
comp98331_c0_seq2
comp98331_c0_seq1
comp89199_c1_seq1
comp105114_c0_seq1
comp83626_c0_seq1
comp79426_c0_seq1
comp79824_c0_seq1
comp84131_c0_seq1
comp78365_c0_seq2
comp78365_c0_seq1
comp87978_c0_seq1
comp106950_c0_seq4
comp87978_c0_seq3
comp57610_c0_seq1
comp39914_c0_seq1
comp97077_c1_seq2
comp61123_c0_seq1
comp16543_c0_seq1
comp94267_c0_seq3
comp111921_c3_seq14
comp93365_c0_seq1
comp93158_c0_seq1
comp92724_c0_seq1
comp86738_c0_seq1
comp91987_c0_seq1
comp107922_c2_seq3
comp107225_c0_seq1
comp106426_c0_seq7
comp107225_c0_seq2
comp105030_c1_seq2
comp110318_c0_seq5
comp110077_c0_seq3
comp112163_c0_seq1
comp107898_c3_seq2
comp100521_c0_seq2
comp90391_c0_seq1
comp83617_c0_seq1
comp110028_c1_seq3
comp97335_c0_seq2
comp108523_c0_seq2
comp108523_c0_seq1
comp72302_c0_seq1
comp104544_c2_seq1
comp104544_c1_seq1
comp92984_c0_seq2
comp92984_c0_seq1
comp81230_c0_seq1
comp100223_c1_seq11
comp97468_c0_seq1
comp79662_c0_seq2
comp111921_c3_seq50
comp97127_c0_seq6
comp110413_c0_seq9
comp104517_c1_seq2
comp71560_c0_seq8
comp110413_c0_seq8
comp71560_c0_seq4
comp88343_c0_seq1
comp93968_c0_seq1
comp113543_c0_seq1
comp39195_c0_seq1
comp62522_c0_seq1
comp85852_c0_seq1
comp110377_c0_seq6
comp85981_c0_seq3
comp95526_c0_seq1
comp75902_c0_seq1
comp69813_c0_seq1
comp82777_c0_seq4
comp102777_c0_seq4
comp71560_c0_seq2
comp108868_c1_seq1
comp109429_c0_seq5
comp79045_c0_seq1
comp79045_c0_seq4
comp85981_c0_seq1
comp106014_c0_seq7
comp113824_c0_seq1
comp95778_c0_seq1
comp95834_c0_seq6
comp95778_c0_seq2
comp101801_c1_seq1
comp85981_c0_seq4
comp105658_c2_seq5
comp105658_c2_seq4
comp95625_c0_seq1
comp73276_c0_seq1
comp80410_c0_seq1
comp87183_c0_seq1
comp77755_c0_seq1
comp90718_c0_seq1
comp82122_c0_seq4
comp83449_c0_seq1
comp105755_c1_seq1
comp105910_c0_seq1
comp104477_c0_seq5
comp99495_c0_seq2
comp99495_c0_seq9
comp78001_c0_seq2
comp102875_c2_seq6
comp110083_c2_seq6
comp102521_c1_seq8
comp87485_c0_seq2
comp104801_c0_seq1
comp94046_c1_seq2
comp112732_c2_seq1
comp105658_c2_seq1
comp106366_c1_seq7
comp94267_c0_seq2
comp94790_c0_seq6
comp95279_c0_seq1
comp104104_c2_seq6
comp70982_c0_seq2
comp38495_c1_seq1
comp81562_c0_seq1
comp86606_c0_seq2
comp94079_c0_seq1
comp86560_c1_seq2
comp106335_c0_seq2
comp106335_c0_seq1
comp83400_c0_seq2
comp88587_c1_seq6
comp39092_c0_seq1
comp98512_c0_seq2
comp98512_c0_seq4
comp91568_c0_seq2
comp95398_c0_seq1
comp109941_c0_seq2
comp77723_c0_seq1
comp49972_c0_seq1
comp82777_c0_seq3
comp104357_c0_seq2
comp101071_c2_seq1
comp89538_c1_seq1
comp68274_c0_seq1
comp99495_c0_seq1
comp85454_c0_seq1
comp93520_c0_seq1
comp86606_c0_seq1
comp68524_c0_seq1
comp112646_c0_seq51
comp100875_c3_seq1
comp89846_c0_seq1
comp113438_c0_seq1
comp104517_c1_seq1
comp107110_c0_seq2
comp78509_c0_seq1
comp104939_c0_seq1
comp40330_c0_seq1
comp85123_c0_seq1
comp85350_c2_seq1
comp101223_c1_seq2
comp39164_c0_seq1
comp75892_c0_seq1
comp111636_c0_seq10
comp83546_c0_seq1
comp91423_c0_seq2
comp94908_c0_seq4
comp79254_c1_seq1
comp71560_c0_seq3
comp89908_c0_seq1
comp109429_c2_seq2
comp102040_c0_seq2
comp109429_c2_seq6
comp101059_c0_seq2
comp101059_c0_seq1
comp108687_c0_seq5
comp105755_c2_seq8
comp102521_c1_seq7
comp92870_c0_seq1
comp109226_c0_seq11
comp99495_c0_seq10
comp113877_c0_seq1
comp88912_c0_seq1
comp79382_c1_seq1
comp92413_c0_seq1
comp82639_c0_seq1
comp108684_c0_seq2
comp95834_c2_seq1
comp70265_c0_seq1
comp110032_c0_seq2
comp85981_c0_seq2
comp109429_c2_seq11
comp109429_c2_seq1
comp105723_c2_seq16
comp109429_c2_seq4
comp81821_c0_seq2
comp95547_c0_seq1
comp109429_c2_seq15
comp109429_c0_seq1
comp104722_c2_seq1
comp109429_c2_seq5
comp81136_c0_seq1
comp90999_c0_seq1
comp99495_c0_seq5
comp102777_c0_seq2
comp108587_c0_seq2
comp108194_c0_seq8
comp28387_c0_seq1
comp113586_c0_seq1
comp109226_c0_seq8
comp92870_c0_seq2
comp109429_c2_seq10
comp78001_c0_seq1
comp81821_c0_seq1
comp73276_c0_seq2
comp95988_c0_seq1
comp111673_c0_seq10
comp85338_c0_seq1
comp106366_c1_seq13
comp109718_c0_seq1
comp111432_c1_seq5
comp106366_c1_seq9
comp106366_c1_seq11
comp84511_c0_seq2
comp87188_c1_seq1
comp105658_c2_seq8
comp100595_c1_seq10
comp92648_c0_seq1
comp103903_c0_seq11
comp98047_c0_seq1
comp105755_c0_seq1
comp106951_c0_seq5
comp81392_c1_seq1
comp71560_c0_seq5
comp71038_c0_seq3
comp87485_c0_seq1
comp102094_c2_seq2
comp93920_c0_seq2
comp105658_c2_seq9
comp84997_c1_seq1
comp109429_c2_seq3
comp84493_c0_seq1
comp102816_c0_seq2
comp86560_c1_seq1
comp110917_c0_seq10
comp84768_c0_seq1
comp94837_c0_seq1
comp111934_c0_seq4
comp110258_c0_seq1
comp96561_c0_seq2
comp105658_c2_seq11
comp105658_c2_seq7
comp105658_c2_seq6
comp113784_c0_seq1
comp79308_c0_seq1
comp105658_c2_seq2
comp66891_c0_seq1
comp109147_c0_seq5
comp100595_c1_seq5
comp93920_c0_seq1
comp91768_c0_seq1
comp91768_c0_seq2
comp74727_c0_seq1
comp96561_c0_seq1
comp93265_c0_seq1
comp111934_c0_seq6
comp111934_c0_seq3
comp95757_c2_seq5
comp87467_c0_seq2
comp88092_c0_seq1
comp84768_c0_seq3
comp79662_c0_seq1
comp98606_c3_seq2
comp70982_c0_seq1
comp113599_c0_seq1
comp92848_c0_seq1
comp101380_c1_seq5
comp95251_c0_seq1
comp36491_c1_seq1
comp98101_c0_seq2
comp78435_c0_seq1
comp74346_c0_seq1
comp100826_c0_seq3
comp98594_c0_seq2
comp92136_c0_seq1
comp109456_c0_seq2
comp104544_c0_seq4
comp102765_c1_seq3
comp92256_c0_seq1
comp110903_c0_seq6
comp100963_c0_seq1
comp102765_c1_seq2
comp78406_c0_seq1
comp94790_c0_seq2
comp99514_c0_seq2
comp93649_c0_seq1
comp93937_c0_seq1
comp95322_c0_seq6
comp99165_c0_seq1
comp105146_c0_seq1
comp93216_c0_seq1
comp86157_c0_seq1
comp94790_c0_seq5
comp94191_c0_seq1
comp88920_c0_seq1
comp60251_c0_seq1
comp91581_c1_seq1
comp99510_c0_seq3
comp85473_c0_seq1
comp75567_c0_seq1
comp97009_c0_seq1
comp60221_c0_seq1
comp105985_c2_seq5
comp105985_c2_seq7
comp106950_c0_seq5
comp105372_c0_seq1
comp82016_c0_seq1
comp76040_c0_seq1
comp104801_c0_seq3
comp95348_c0_seq1
comp96638_c0_seq1
comp95162_c0_seq1
comp113368_c0_seq1
comp108868_c0_seq1
comp90581_c0_seq1
comp95162_c0_seq2
comp67874_c0_seq1
comp81716_c0_seq1
comp109364_c0_seq3
comp101565_c2_seq1
comp88239_c0_seq1
comp109364_c0_seq2
comp44579_c0_seq2
comp90341_c0_seq1
comp80561_c0_seq1
comp87972_c0_seq1
comp93606_c0_seq1
comp92169_c1_seq4
comp104357_c0_seq3
comp101565_c1_seq1
comp100091_c0_seq1
comp71479_c0_seq1
comp101438_c0_seq1
comp104357_c0_seq4
comp108684_c0_seq1
comp104357_c0_seq1
comp104801_c0_seq2
comp109429_c0_seq3
comp113321_c0_seq1
comp101801_c0_seq1
comp19408_c0_seq1
comp112104_c0_seq6
comp75559_c0_seq1
comp61329_c0_seq1
comp107770_c0_seq12
comp76152_c0_seq1
comp102313_c1_seq4
comp113271_c0_seq1
comp26807_c0_seq1
comp113240_c0_seq1
comp37011_c0_seq1
comp93216_c0_seq2
comp106426_c0_seq9
comp106426_c0_seq3
comp106426_c0_seq11
comp106426_c0_seq14
comp113218_c0_seq1
comp106426_c0_seq4
comp69484_c1_seq1
comp95865_c1_seq1
comp72957_c0_seq1
comp85350_c0_seq1
comp36387_c0_seq1
comp85550_c0_seq1
comp86039_c0_seq3
comp72579_c0_seq1
comp80561_c0_seq2
comp96588_c0_seq1
comp94118_c0_seq2
comp96653_c1_seq12
comp85350_c1_seq1
comp29772_c0_seq1
comp24504_c0_seq1
comp107387_c0_seq3
comp96653_c1_seq8
comp86039_c0_seq1
comp97941_c0_seq1
comp69484_c0_seq1
comp113279_c0_seq1
comp96653_c0_seq1
comp100452_c0_seq3
comp36242_c0_seq1
comp72738_c0_seq1
comp107972_c0_seq3
comp93058_c0_seq1
comp87978_c0_seq2
comp99495_c0_seq12
comp87978_c0_seq4
comp85350_c3_seq1
comp106597_c1_seq3
comp97199_c0_seq1
comp98057_c0_seq1
comp107386_c2_seq2
comp104517_c3_seq1
comp106950_c0_seq1
comp104517_c0_seq1
comp7525_c0_seq1
comp94118_c0_seq1
comp96969_c0_seq6
comp104722_c0_seq1
comp94579_c0_seq2
comp75875_c0_seq1
comp90644_c0_seq1
comp69417_c0_seq1
comp100793_c0_seq11
comp98512_c0_seq3
comp84768_c0_seq2
comp91581_c0_seq1
comp79247_c0_seq2
comp108820_c3_seq6
comp79247_c0_seq1
comp113304_c0_seq1
comp104517_c4_seq1
comp36491_c0_seq1
comp104093_c0_seq1
comp87326_c0_seq1
comp9232_c0_seq1
comp94150_c0_seq1
comp106918_c0_seq5
comp88299_c1_seq1
comp83400_c0_seq3
comp87649_c0_seq1
comp105429_c0_seq1
comp97536_c1_seq1
comp93068_c0_seq1
comp77363_c0_seq1
comp78075_c0_seq1
comp98729_c2_seq1
comp100628_c5_seq4
comp92141_c0_seq2
comp88208_c0_seq1
comp110418_c1_seq1
comp93537_c0_seq1
comp89199_c1_seq2
comp99164_c0_seq3
comp32489_c0_seq1
comp111955_c0_seq2
comp108604_c1_seq1
comp103439_c0_seq6
comp100521_c0_seq1
comp104333_c0_seq2
comp80422_c0_seq1
comp101341_c0_seq2
comp15488_c0_seq2
comp104504_c0_seq4
comp111710_c0_seq6
comp112862_c0_seq2
comp103743_c2_seq1
comp109784_c1_seq2
comp72379_c0_seq1
comp85624_c0_seq1
comp108419_c3_seq11
comp25172_c0_seq1
comp38596_c0_seq1
comp37409_c0_seq1
comp108713_c1_seq6
comp73857_c0_seq1
comp107110_c0_seq3
comp110880_c0_seq2
comp95707_c1_seq1
comp110880_c0_seq4
comp70355_c0_seq1
comp94888_c0_seq1
comp39125_c0_seq1
comp83892_c0_seq1
comp107507_c2_seq1
comp98351_c1_seq3
comp86386_c0_seq1
comp110880_c0_seq3
comp92655_c0_seq1
comp107313_c0_seq1
comp97595_c0_seq1
comp107313_c0_seq2
comp109945_c0_seq2
comp109945_c0_seq1
comp109945_c0_seq3
comp95547_c0_seq6
comp108687_c0_seq10
comp99284_c0_seq2
comp90047_c0_seq2
comp78961_c0_seq3
comp91819_c0_seq4
comp94747_c0_seq1
comp103903_c0_seq8
comp109789_c0_seq4
comp102875_c2_seq4
comp106177_c0_seq1
comp89889_c0_seq2
comp67270_c0_seq1
comp109789_c0_seq1
comp34026_c0_seq1
comp100875_c2_seq2
comp93768_c0_seq1
comp95667_c1_seq2
comp87115_c0_seq1
comp115020_c0_seq1
comp95667_c1_seq4
comp107383_c3_seq1
comp27921_c1_seq1
comp93125_c0_seq4
comp88736_c0_seq4
comp103539_c1_seq9
comp59447_c0_seq2
comp80043_c0_seq1
comp98720_c0_seq3
comp86929_c0_seq1
comp82122_c0_seq1
comp98940_c0_seq1
comp3745_c0_seq1
comp76779_c0_seq1
comp69042_c0_seq2
comp102458_c0_seq2
comp88086_c0_seq2
comp70267_c0_seq1
comp109570_c0_seq2
comp54147_c0_seq1
comp85171_c0_seq1
comp98940_c0_seq2
comp103759_c0_seq1
comp73648_c0_seq1
comp88715_c0_seq3
comp62012_c0_seq1
comp71560_c0_seq1
comp72693_c0_seq1
comp95667_c1_seq3
comp109429_c2_seq9
comp109429_c2_seq7
comp83449_c0_seq2
comp84511_c0_seq1
comp44579_c0_seq1
comp82704_c0_seq2
comp71560_c0_seq6
comp93698_c0_seq1
comp95041_c0_seq2
comp102777_c0_seq5
comp99942_c0_seq4
comp96969_c0_seq9
comp110083_c2_seq2
comp67602_c0_seq1
comp108868_c5_seq2
comp56265_c0_seq1
comp102458_c0_seq1
comp32656_c0_seq2
comp88798_c0_seq1
comp92413_c0_seq4
comp114569_c0_seq1
comp35654_c0_seq1
comp21791_c0_seq1
comp108868_c5_seq3
comp93768_c1_seq1
comp35473_c0_seq1
comp95959_c0_seq3
comp85678_c1_seq1
comp92413_c0_seq2
comp21473_c0_seq1
comp85343_c0_seq1
comp97127_c0_seq12
comp108687_c0_seq6
comp106236_c0_seq1
comp87188_c0_seq1
comp109429_c0_seq2
comp85678_c0_seq1
comp88587_c1_seq4
comp20986_c0_seq1
comp86773_c0_seq1
comp80417_c0_seq1
comp81136_c0_seq2
comp82260_c0_seq1
comp109429_c2_seq12
comp109429_c0_seq4
comp109429_c0_seq6
comp108687_c0_seq9
comp97127_c0_seq11
comp103616_c0_seq2
comp105030_c1_seq1
comp89394_c0_seq1
comp82122_c0_seq2
comp87236_c0_seq1
comp98606_c1_seq5
comp32124_c0_seq1
comp84645_c0_seq1
comp94604_c0_seq1
comp92655_c0_seq2
comp99559_c0_seq1
comp101733_c1_seq1
comp105114_c0_seq3
comp108668_c0_seq3
comp67297_c0_seq1
comp93281_c1_seq1
comp108668_c0_seq4
comp104737_c0_seq2
comp103332_c0_seq2
comp103332_c0_seq5
comp103332_c0_seq4
comp92961_c0_seq1
comp95389_c0_seq2
comp69951_c0_seq1
comp104737_c0_seq3
comp78027_c0_seq1
comp107429_c0_seq3
comp99987_c0_seq13
comp95389_c0_seq1
comp109359_c0_seq4
comp98729_c1_seq1
comp93658_c1_seq1
comp108668_c0_seq1
comp70262_c0_seq1
comp91561_c0_seq1
comp100179_c0_seq3
comp103616_c0_seq1
comp94200_c0_seq2
comp109926_c0_seq3
comp72320_c0_seq1
comp110077_c0_seq4
comp109334_c0_seq2
comp110077_c0_seq2
comp108604_c0_seq1
comp79254_c0_seq1
comp98351_c1_seq12
comp100215_c0_seq1
comp100215_c0_seq2
comp108493_c0_seq6
comp19227_c0_seq1
comp97261_c0_seq1
comp32611_c0_seq1
comp109379_c1_seq3
comp85870_c0_seq1
comp106192_c1_seq3
comp108774_c1_seq1
comp108774_c1_seq3
comp108774_c1_seq2
comp106192_c1_seq2
comp107955_c0_seq3
comp106192_c1_seq8
comp98351_c1_seq5
comp105247_c0_seq2
comp69048_c0_seq1
comp110658_c0_seq1
comp103082_c2_seq2
comp91055_c0_seq1
comp83105_c0_seq1
comp97127_c0_seq7
comp92141_c0_seq1
comp91987_c0_seq2
comp107499_c1_seq1
comp91423_c0_seq1
comp110880_c0_seq1
comp98453_c0_seq1
comp105627_c2_seq9
comp111553_c0_seq1
comp101994_c0_seq1
comp103317_c0_seq1
comp99757_c3_seq1
comp107803_c0_seq11
comp94954_c2_seq7
comp100636_c1_seq7
comp106406_c0_seq1
comp91970_c0_seq1
comp108929_c0_seq1
comp107757_c0_seq2
comp82807_c0_seq1
comp93998_c0_seq2
comp93998_c0_seq1
comp107269_c0_seq3
comp101673_c0_seq1
comp98789_c0_seq3
comp111895_c3_seq5
comp112667_c0_seq2
comp102976_c0_seq1
comp110982_c0_seq2
comp101863_c1_seq1
comp104997_c0_seq1
comp92570_c0_seq8
comp95107_c0_seq5
comp95408_c0_seq1
comp92789_c0_seq1
comp71308_c0_seq1
comp107284_c0_seq2
comp83094_c0_seq1
comp78869_c0_seq1
comp106045_c0_seq1
comp112183_c0_seq3
comp106270_c0_seq5
comp106270_c0_seq2
comp101331_c2_seq1
comp95815_c1_seq1
comp107867_c1_seq1
comp99077_c2_seq6
comp108801_c0_seq1
comp107795_c0_seq1
comp71751_c0_seq1
comp103187_c1_seq5
comp99591_c0_seq1
comp110561_c0_seq2
comp110561_c0_seq6
comp110561_c0_seq4
comp113034_c0_seq1
comp88532_c1_seq1
comp92564_c0_seq1
comp111350_c0_seq3
comp104604_c0_seq5
comp96487_c0_seq1
comp93792_c0_seq1
comp99687_c0_seq1
comp89957_c0_seq1
comp105799_c0_seq1
comp105103_c0_seq3
comp109767_c0_seq1
comp112183_c0_seq2
comp112183_c0_seq4
comp108241_c0_seq2
comp108801_c1_seq1
comp107844_c1_seq1
comp102204_c0_seq2
comp108241_c0_seq3
comp100077_c3_seq10
comp100077_c3_seq3
comp100077_c3_seq8
comp100077_c3_seq15
comp100077_c3_seq13
comp95815_c1_seq3
comp100077_c3_seq1
comp100077_c3_seq9
comp100077_c3_seq5
comp113294_c0_seq1
comp96488_c0_seq1
comp59901_c0_seq1
comp104964_c0_seq1
comp112654_c1_seq2
comp106613_c0_seq2
comp106558_c2_seq2
comp104928_c3_seq1
comp112654_c1_seq1
comp110561_c0_seq9
comp110561_c0_seq1
comp110561_c0_seq5
comp110561_c0_seq8
comp95815_c1_seq4
comp85823_c1_seq1
comp94842_c0_seq1
comp99820_c0_seq1
comp99687_c0_seq2
comp105579_c1_seq2
comp91986_c0_seq1
comp97694_c0_seq1
comp108280_c1_seq11
comp109436_c0_seq3
comp105666_c0_seq1
comp98844_c0_seq1
comp109380_c0_seq1
comp91932_c0_seq2
comp91120_c0_seq1
comp105498_c0_seq3
comp95222_c0_seq1
comp95364_c0_seq1
comp106272_c1_seq4
comp109726_c0_seq1
comp93369_c0_seq1
comp99651_c0_seq1
comp112710_c0_seq1
comp88359_c0_seq1
comp103680_c0_seq1
comp104824_c0_seq2
comp103899_c0_seq1
comp86129_c0_seq2
comp36419_c0_seq1
comp86269_c0_seq1
comp109726_c0_seq2
comp100541_c0_seq1
comp102204_c0_seq1
comp95373_c0_seq1
comp95559_c0_seq1
comp94957_c0_seq1
comp96093_c1_seq3
comp95559_c1_seq1
comp93297_c0_seq5
comp101210_c1_seq11
comp106270_c0_seq1
comp106270_c0_seq3
comp101210_c1_seq9
comp63648_c0_seq1
comp72249_c1_seq2
comp72249_c1_seq1
comp84271_c0_seq1
comp101344_c0_seq1
comp101210_c1_seq4
comp101210_c1_seq10
comp104690_c0_seq10
comp104690_c0_seq8
comp101210_c1_seq8
comp104690_c0_seq5
comp101210_c1_seq2
comp101210_c1_seq6
comp101210_c1_seq5
comp88076_c0_seq2
comp88076_c0_seq1
comp104690_c0_seq19
comp104690_c0_seq6
comp94905_c0_seq1
comp101210_c1_seq13
comp101210_c1_seq3
comp101210_c1_seq1
comp101210_c0_seq1
comp72743_c0_seq1
comp104690_c0_seq4
comp104690_c0_seq16
comp104690_c0_seq20
comp104690_c0_seq15
comp104690_c0_seq11
comp104690_c0_seq3
comp104690_c0_seq12
comp104690_c0_seq1
comp104690_c0_seq18
comp13992_c0_seq1
comp86976_c0_seq1
comp86158_c0_seq1
comp37458_c1_seq1
comp104690_c0_seq2
comp106270_c0_seq6
comp102947_c0_seq4
comp72335_c0_seq1
comp104690_c0_seq13
comp99132_c0_seq1
comp101210_c1_seq12
comp87445_c0_seq1
comp97638_c0_seq1
comp87445_c0_seq2
comp99133_c0_seq1
comp108348_c1_seq1
comp103318_c0_seq1
comp109178_c0_seq4
comp108913_c0_seq2
comp109178_c0_seq2
comp103261_c0_seq1
comp105267_c0_seq1
comp111816_c0_seq1
comp105941_c0_seq1
comp95123_c0_seq1
comp105356_c0_seq1
comp109757_c0_seq2
comp109757_c0_seq1
comp106388_c1_seq2
comp98821_c0_seq2
comp104548_c1_seq2
comp106489_c0_seq1
comp104824_c0_seq1
comp98821_c0_seq1
comp111020_c0_seq1
comp92411_c0_seq1
comp103957_c0_seq1
comp108280_c1_seq10
comp91932_c0_seq1
comp101947_c2_seq1
comp99090_c0_seq1
comp110974_c1_seq1
comp78963_c0_seq1
comp78963_c1_seq1
comp71539_c0_seq1
comp93329_c0_seq1
comp95214_c0_seq1
comp112862_c0_seq1
comp100622_c0_seq1
comp97615_c0_seq3
comp94419_c0_seq1
comp93973_c0_seq2
comp93973_c0_seq1
comp103000_c0_seq1
comp111020_c1_seq1
comp111020_c0_seq2
comp110804_c0_seq3
comp108248_c1_seq4
comp71539_c1_seq1
comp72335_c0_seq2
comp108248_c0_seq1
comp99514_c0_seq5
comp106485_c1_seq1
comp97486_c0_seq1
comp99214_c0_seq2
comp99214_c0_seq1
comp57165_c0_seq1
comp88048_c0_seq1
comp110961_c1_seq1
comp99127_c0_seq4
comp99127_c0_seq2
comp94430_c0_seq1
comp111798_c1_seq2
comp96790_c0_seq1
comp105037_c0_seq1
comp95639_c0_seq1
comp88732_c0_seq1
comp61236_c0_seq1
comp102403_c0_seq8
comp100829_c0_seq2
comp103064_c1_seq1
comp108163_c0_seq1
comp98980_c3_seq4
comp106054_c0_seq1
comp106995_c0_seq2
comp102403_c0_seq4
comp97697_c0_seq1
comp71686_c0_seq1
comp92937_c0_seq1
comp97326_c0_seq1
comp99147_c0_seq1
comp100104_c3_seq2
comp110413_c1_seq1
comp94302_c1_seq1
comp98606_c1_seq8
comp99495_c1_seq6
comp102947_c0_seq3
comp61731_c0_seq1
comp88684_c0_seq1
comp102403_c0_seq13
comp37500_c1_seq1
comp15383_c0_seq1
comp78115_c0_seq1
comp87713_c0_seq2
comp105528_c3_seq6
comp87713_c0_seq3
comp87713_c0_seq4
comp87713_c0_seq1
comp113065_c0_seq1
comp93430_c0_seq1
comp88108_c0_seq1
comp37471_c0_seq1
comp20156_c0_seq1
comp15575_c0_seq1
comp102061_c0_seq2
comp61006_c0_seq1
comp78116_c0_seq1
comp88752_c0_seq1
comp91491_c0_seq1
comp37352_c0_seq1
comp99776_c0_seq1
comp99776_c0_seq2
comp84382_c0_seq1
comp108601_c0_seq1
comp92526_c0_seq1
comp84300_c0_seq1
comp61062_c0_seq1
comp88506_c0_seq1
comp88506_c0_seq2
comp15519_c0_seq1
comp84327_c1_seq1
comp15523_c0_seq1
comp106657_c1_seq2
comp106657_c1_seq1
comp72247_c0_seq1
comp92582_c1_seq1
comp84327_c1_seq2
comp15564_c0_seq1
comp37411_c0_seq1
comp107387_c0_seq6
comp107387_c0_seq5
comp107387_c0_seq4
comp107387_c0_seq7
comp107387_c0_seq1
comp99404_c0_seq1
comp84316_c0_seq1
comp72623_c0_seq2
comp72623_c0_seq1
comp11873_c0_seq1
comp108589_c0_seq1
comp102313_c1_seq2
comp106950_c0_seq3
comp77718_c0_seq1
comp38193_c0_seq1
comp107522_c2_seq3
comp91910_c0_seq1
comp113232_c0_seq1
comp113259_c0_seq1
comp107041_c0_seq1
comp107387_c0_seq2
comp109364_c1_seq1
comp93704_c0_seq1
comp97600_c2_seq2
comp17431_c0_seq1
comp110659_c0_seq1
comp78058_c0_seq1
comp88656_c0_seq1
comp82564_c0_seq1
comp72176_c0_seq2
comp107522_c0_seq1
comp63774_c0_seq1
comp107522_c3_seq1
comp60251_c1_seq1
comp106426_c0_seq8
comp106426_c0_seq13
comp113198_c0_seq1
comp112397_c1_seq4
comp61761_c0_seq1
comp101176_c0_seq1
comp17861_c0_seq1
comp101901_c0_seq1
comp72351_c0_seq1
comp37502_c0_seq1
comp83885_c0_seq2
comp96868_c1_seq1
comp83885_c0_seq1
comp63774_c1_seq1
comp113133_c0_seq1
comp72176_c0_seq1
comp72351_c0_seq3
comp95818_c0_seq1
comp37290_c0_seq1
comp72351_c0_seq4
comp89253_c0_seq1
comp102977_c0_seq1
comp72579_c0_seq2
comp11899_c0_seq1
comp113104_c0_seq1
comp99032_c1_seq3
comp110659_c1_seq4
comp14961_c0_seq2
comp102313_c0_seq1
comp107972_c0_seq1
comp14961_c0_seq1
comp15488_c0_seq1
comp61354_c0_seq1
comp15323_c0_seq1
comp113070_c0_seq1
comp72313_c0_seq1
comp106471_c5_seq1
comp71948_c0_seq1
comp108820_c3_seq8
comp108820_c3_seq1
comp72302_c1_seq1
comp108820_c3_seq4
comp60221_c0_seq2
comp37377_c0_seq1
comp108601_c1_seq2
comp102947_c0_seq2
comp102313_c1_seq1
comp103221_c0_seq1
comp20532_c0_seq1
comp105985_c0_seq1
comp88864_c0_seq1
comp37502_c0_seq2
comp89595_c0_seq1
comp105985_c3_seq1
comp72176_c0_seq3
comp72351_c0_seq2
comp100632_c0_seq1
comp72351_c0_seq5
comp60959_c0_seq1
comp90730_c0_seq1
comp100621_c1_seq7
comp78845_c0_seq1
ovary
ovary
72_100
ova
y
48_72
72_100
7-24 hour
7_24
og
48-72 hour
48_72
oom
mp
72_100
72-100 hour0_24
on m
dm
ova y
42_0
42_0
06
48_72
16 0
0_24
48_72
0_24
84_42 84_42
Va ue
o
u
48_72
0_24
24_48
ova y
72_100
48_72
24_48
Ca osob
uchus
macu a us
mp
o
on m
o o me Co
u
d
Samp
ema onogMa x
og
0_24
48_72
ova y
48_72
24_48
72_100
06
Va ue
001_27001_27
27_84 27_84
mp
oom
og
02
24_48
Co or Key
24_48
0_24
72_100
72_100
72_100
on m
dm
48_72
ova y
o
u
y avo y avo
C)
B)
0_24
og
ova y
Co or Key
42_0 42_0
02
on m
dm
Va ue
mp
oom
on m
dm
48_72
ova yhour
48-72
ova y
06
Va ue
48_72
02
06
84_42
1
00 22 0 4 00 66 0 8 1
02
Va ue
Re a ve
S m ar y
o
0_24
o
u
42_0
Va ue
y avo
06
Co or
Key
72_100
24-48 hour
A)
02
24_48
72_100hour
24-48
72_100
Si
gn C a
t
al
(T aly
ra
&
ns c(A
du
c&
St
c
v
ru
ct Rec er(A ity(
ur
c
e
&
p
al
(M tor vit
(A y(
o
Tr lecu c&v
an
i
l
sp e(A ty(
or c&
v
te
r(A ity(
c&
vi
En
ty
zy
m Pro Bin (
e(
t
d
e
Ox Reg in(B ing(
id ula in
or
d
ed tor( ing
(
uc Ac
&v
t
T
a
r
Io
an se
i
n(
(A ty(
sf
Tr
c
e
&
an
Hy rase vit
sm
y
(
dr
em
ol Ac& (
as
br
vi
ty
e
an
Lig (Ac& (
e(
Tr
an ase vit
sp (A y(
or c&
v
t
Ch er(A ity(
an c&
ne vit
l(A y(
c&
Ex
vi
tra
ty
(
ce
l
Ex lul
tra ar(
ce Re
llu gio
la
r(S n(
pa
ce
(
Ce
Nu ll(
Ch
c
ro leu
m
s
os (
M
Cy om
ac
e
t
op (
ro
l
m
ol Me asm
M
ec
m
(
ul
u
&c
la bra
r(C ne
el
lu
om (
la
r (O
pl
ex
rg
(
an
ism
T
r
al
(D ans
M eve por
et
l
t
ab opm (
o
Ca lic ent
(
ta (Pr
Bi
o
b
os oli ces
yn c(P
s(
th
r
M
e& oce
ac
Ce c ( P ss(
ro
llu roc
m
C
ol
ec ell( lar(P ess(
D
ul
e( ie roc
M
re ess
Re
e
t
gu
ab n&a (
la
ol
&
&o
ic( on
Pr
n(
(
Of
oc
(B
es
io Se
s
Re log cre (
sp
i
&
on cal( on
(
se Pr
B
Ce iol (To oce
llu ogi (S& ss(
Ox lar ca
l( mu
(
id
a& Me Reg lus(
on tab ula
o
IR
&
ed lic( on
uc Pro (
&o
ce
s
n(
Pr s(
oc
es
s(
Percentage(of(GO(Category((BP,(CC(or(MF)(with(Given(Annota&on((
B)
St
ro
en
Molecular Function
Cellular Component
ng
an
ch
io
vi
90"
80"
70"
Biological Process
rid
ta
ae
le
O
th
e
flo
pt
i
gy
te
ae
ni
at
us
ur
lla
a
st
om
ite
rp
is
te
r
as
en
og
ns
s
tio
pu
na
ip
r
to
r
ito
ar
is
lta
de
hi
tri
p
an
de
ap
Ae
sa
va
pu
ia
tu
s
tro
on
el
ne
lg
us
ro
i
an
m
ol
ex
ec
pl
de
vu
rid
bi
tri
Number'of'Top'Hits'
A)
Br
en
oc
s
m
si
la
as
hi
N
op
s
ex
au
yr
m
an
ra
at
ho
iti
yd
pl
gn
op
m
m
te
r
ro
s
yl
Zo
o
ro
Ac
ar
pe
ro
ic
flo
ci
or
hy
ac
tu
s
ap
in
or
um
su
ar
by
ae
os
eu
pi
im
os
on
ph
Bo
ia
er
no
ph
si
hu
yp
po
am
od
er
st
an
nd
ca
po
us
iu
th
o
yr
Ac
ol
to
n
oc
Tr
ib
dr
St
eg
30000
A. menetriesi
25000
C. maculatus
20000
15000
10000
5000
Species'Hit'
Atrachya(menetriesi(
Callosobruchus(maculatus(
Drosophila(melanogaster(
Mus(musculus(
60"
50"
40"
30"
20"
10"
0"
0 .2
Hox6/7/8
Utx/Ubx
Abd-A
Ptl/Antp
Ftz
Hox5 / Cx/Scr
Hox4 / Dfd
Hox3 / Zen
Hox9-13 / Abd-B
Cdx / Cad
Hox2 / Mxp/Pb
Hox1 / Lab
Gsx / Ind
Mox / Btn
Evx / Eve
Gbx / Unpg
Mnx / Exex
37
66
50
38
70
Dpp / BMP2/4
26
42
23
13
99
GDF 5/6/7
78
92
47
28
21
30
54
25
32
51
86
15
6
44
66
49
65
99
74
98
GDF 1/3
83
37
ADMP / BMP3
47
56
87
97
99
Nodal
95
Maverick
97
100
83
63
TGF
100
11
50
88
39
26
51
57
39
81
Inhibin / Activin
88
10
87
38
16
37
99
66
43
100
99
65
0 .5
Lefty
Inhibin / Activin
Dawdle / ALP
Unknown Derriere-Like
BMP15 / GDF9
A) Wnt Signalling:
MAPK Signalling
Pathway
Hip
Canonical Pathway:
PS-1
+P
TAK1
NLK
Gas1
PKA
IDAX
c-myc
+P
Duplin
FRP
Cer1
Hedgehog
Ptc
Wnt
c-jun
CKI
Frizzled
Scaffold
GBP
Porc
Dvl
Pontin52
+P
GSK3
-catenin
TCF/LEF
Wnt
CK2
LRP5/6
Axin
DNA
SMAD4
CKI
APC
XSox17
CtBP
-TrCP
p53
Fu
Cholesterol
Su(fu)
Zic2
Rab23
PKA
Slmb
Uterine
GSK3
CK1
Phosphorylation
Independence
SIP
Cul1
Skp1
Rbx1
C) Notch Signalling:
Ubiquitin Mediated
Proteolysis
+u
Co-activators
Proteolysis
TBL1
Dvl
Fringe
MAML
Numb
Focal Adhesion
Frizzled
RhoA
Cytoskeletal
Changes
ROCK2
PSE2
Serrate
Dvl
Rac
NCSTN
Gene
Transcription
JNK
PSEN
CtBP
Groucho
Co-repressors
Via MAPK/RAS
HDAC
MAPK Signalling
Wnt/Ca2+ Pathway:
CaMKII
Frizzled
PLC
Ca2+
CaN
G Protein?
PKC
-P
NFAT
DNA
Onward Effects
Key:
Present
Both
Present
A.menetriesi
Cell
Membrane
Present
C. maculatus
Internal
Membrane
Absent
Both
Coleopteran
Absence (?)
Up-regulation
or Movement
Other
Pathway
Down-regulation
State
Change
HES1/5
PreT
SMRT
Others
APH-1
MAPK Signalling
Pathway
Wnt5
DNA
CSL
Hairless
Secretase
Complex
TASE
Knypek
SKIP
Notch
Intracellular
Domain
Notch
Delta
Prickle
Daam1
Wnt11
HATS
Delta
Sfom
Wnt Signalling
TGF Signalling
Ptc
Megalin
Skp1
Nkd
Phosphorylation
Independence
Dpp
TGF Signalling
Siah-1
APC
Ci
Cell Cycle
+P
p53 Signalling
Pathway
PAR-1
Cos2
Smo
PPAR8
Groucho
BAMBI
Dkk
fra-1
cyc-D
SMAD3
ICAT
Axam
SOST
Microtubule
CBP
Dally
Wif-1
B) Hedgehog Signalling:
Adherens
Junctions
Alzheimers
Disease
Disassociation
CIR