Académique Documents
Professionnel Documents
Culture Documents
UNIVERSIT DI BOLOGNA
FACOLT DI SCIENZE MATEMATICHE, FISICHE E NATURALI
CORSO DI LAUREA MAGISTRALE IN BIOINFORMATICA
Candidato:
Id Antonino
Relatore:
Prof. Giovanni Perini
Abstract
two
Transcription Factor Binding Site (TFBS) and identify the differences amongst
the sequences according to the differing amount of fatty acids in these strain.
Index
1 Introduction ..........................................................................................................................1
1.1Statement of the problem ..................................................................................................1
2 Background information .....................................................................................................5
2.1 Algae ............................................................................................................................5
2.2 Fatty acid composition...................................................................................................9
2.3 Biosynthesis of Fatty acids and triaciyglycerols ............................................................14
2.4 Regulation of fatty acid Synthesis .................................................................................18
2.5 Knowledge in other species............................................................................................19
2.6 Feedback regulation .......................................................................................................23
2.7What Controls Promoter Activity of FAS Genes?...........................................................24
3 State of the art ...................................................................................................................25
3.1 Comparison of lipid metabolism in algae and higher plants..........................................25
3.2 Factor affecting tryacilglycerolipids accumulation and fatty acids composition............26
3.2.1 Nutrients..................................................................................................................27
3.2.2Temperature.............................................................................................................28
3.2.3Light intensity..........................................................................................................29
3.2.4Growth phase and Physiological status ...................................................................30
3.2.5Physiological roles of triacylglycerol accumulation ...............................................32
3.3Algae genomic and proposed model system in biofuel production ................................33
3.4Acetil CoA carboxylase protein and genetic characterization ........................................38
4 Methods................................................................................................................................40
4.1 Genetic source ...............................................................................................................40
4.2 Sequence analysis...........................................................................................................43
4.3 Computational methodologies........................................................................................44
4.3.1Bioprospector ..........................................................................................................46
4.3.1.1Scoring segment with background Markov dependency..................................47
4.3.1.2Using motif score distribution to measure goodness of a motif.......................48
4.3.2GALF_P ..................................................................................................................48
4.3.2.1Representation .................................................................................................49
4.3.2.2Fitness Evaluation............................................................................................50
4.3.2.3Selection and Genetic Operators .....................................................................51
4.3.2.4Local filtering Operator ..................................................................................52
4.3.2.5Replacement Strategy.......................................................................................53
4.3.2.6Shift Operator ..................................................................................................53
4.3.3Implementation........................................................................................................54
4.4 Primer design ................................................................................................................55
4.5Experimental procedure .................................................................................................58
5 Result and future perspective.............................................................................................65
1 Introduction
1.1 Statement of the problem
The current global energetic crisis, and the possibility of an ever decreasing oil
and gasoline production, leads to the necessity to research and explore effective
alternatives. Among these effective alternatives biodiesel stands out. Biodiesel
can be defined as the ester derived from oils and fats extracted from renewable
biological sources (1)
vegetable oil", "WVO", "used vegetable oil", "UVO", "pure plant oil", "PPO"(3).
It is virtually free of sulphur, therefore reducing the quantity of sulfur oxides
normally produced during combustion. The emission of hydrocarbons, carbon
monoxide and particulates during combustion are also significantly reduced in
comparison to emission from petroleum diesel. These properties make biodiesel
effective and in full compliance with the Kyoto Protocol concerning the
greenhouse gas emission. An additional benefit of biodiesel is that it is thought
to be a non-toxic, biodegradable fuel and provides essentially the same energy
content and power output as petroleum-based diesel fuel while reducing
emissions.
Biodiesel production is constantly increasing, with an average annual growth
rate of over 40% during the 2002-2006 period (4). In 2006, the amount of
biodiesel produced in the world ranged 5-6 million tonnes, with 4.9 million
tonnes processed in Europe (of which 2.7 million tonnes in Germany), and great
part of the remaining quantity processed in the USA (5). the sole European
fabrication increased to 5.7 million tonnes (6).Accordingly, the volume of
biodiesel produced in Europe in 2008 has been calculated for a total of 16
million tonnes. These figures have to be considered as part of the circa 490
million tonnes (147 billion gallons) demand for diesel fuel in the US and
Europe(6). Moreover, the global production of vegetable oil for all purposes in
2005/06 touched 110 million tonnes, of which about 34 million tonnes of palm
increasing oil demand along with the high costs of good quality
2 Background information
2.1
Dunaliella bardawill.(22) in the latter case, the chloroplastic lipid bodies are
referred to as plastoglobuli.
that can be found in algae at quantities generally <5% DCW.(23) Only the
colonial green algae Botryococcus Brauni, has been shown to produce, under
adverse environmental condition, large quantities (up to 80% DCW) of very long
chain (C23-C40) hydrocarbons, similar to those found in petroleum, and thus has
been explored over the decades as a feedstock for biofuel and biomaterials.(24)
As many algal species have been found to grow rapidly and produce substantial
amounts of TAG or oil, and are thus referred to as oleaginous algae, it has long
been postulated that algae could be employed as a cell factories to produce oils
and other lipids for biofuel and biomaterial (25)(26)
The potential advantages of algae as feedstock for biofuel and biomaterial
include their ability to:
synthesize and accumulate large quantities of neutral lipids/oil (20-50% DCW),
grow at high rate (e.g. 1-3 doubling per day),
thrive in the saline/brackish water/coastal seawater for which there are few
competing demands
tolerate marginal lands (e.g. desert, arid and semiarid lands) that are not
countries, with interest and funding growing and waning according to the
fluctuations of the world petroleum oil market over the past few decades, no
effort in algae based biofuel production have proceeded beyond rather small
laboratory or field testing stage. The lipid yields obtained from algal mass
culture effort performed to date fall short of the theoretical maximum (at least
10-20 times lower), and have historically made algal oil production technology
prohibitively
expensive.(27)(28)
Recent soaring oil prices, diminishing world oil reserves, and the environmental
deterioration associated with fossil fuel consumption have generated renewed
interest in using algae as an alternative and renewable
production. However, before this concept can become commercial reality, many
fundamental biological questions relating to the biosynthesis and regulation of
fatty acid and TAG in algae need to be fully answered. Clearly, physiological and
genetic manipulations of growth and lipid metabolism must be readily
implementable, and critical engineering breakthroughs related to alga mass
culture and down-stream processing are necessary.
2.2
10
Algae synthesize fatty acids as building blocks for the formation of various types
of lipids. The most commonly synthesized fatty acids have chain lengths that
range from C16 to C18 (Table1), similar to those of higher plants (29). Fatty
acids are either saturated or unsaturated, and unsaturated fatty acids may vary in
the number and position of double bonds on the carbon chain backbone. In
general, saturated and mono-unsaturated fatty acids are predominant in most
algae examined (26). Specifically, the major fatty acids are C16:0 and C16:1 in
the Bacillariophyceae, C16:0 and C18:1 in the Chlorophyceae, C16:0 and C18:1
in the Euglenophyceae, C16:0, C16:1 and C18:1 in the Chrysophyceae, C16:0
and C20:1 in the Cryptophyceae, C16:0 and C18:1 in the Eustigmatophyceae,
C16:0 and C18:1 in the Prasinophyceae, C16:0 in the Dinophyceae, C16:0,
C16:1 and C18:1 in the Prymnesiophyceae, C16:0 in the Rhodophyceae, C14:0,
C16:0 and C16:1 in the Xanthophyceae, and C16:0, C16:1 and C18:1 in
cyanobacteria (30).Polyunsaturated fatty acids (PUFAs) contain two or more
double bonds. Based on the number of double bonds, individual fatty acids are
named dienoic, trienoic, tetraenoic, pentaenoic and hexaenoic fatty acids. Also,
depending on the position of the first double bond from the terminal methyl end
() of the carbon chain, a fatty acid may be either an 3 PUFA (i.e. the third
carbon from the end of the fatty acid) or an 6 PUFAs (i.e. the sixth carbon
from the end of the fatty acid). The major PUFAs are C20:53 and C22:63 in
Bacillarilophyceae, C18:2 and C18:33 in green algae, C18:2 and C18:3 3 in
11
the
diatom
Phaeodactylum
tricornutum
and
the
dinoflagellate
12
It should be noted that much of the data provided previously comes from the
limited number of species of algae that have been examined to date, and most of
the analyses of fatty acid composition from algae have used total lipid extracts
rather than examining individual lipid classes. Therefore, these data represent
generalities, and deviations should be expected. This may explain why some
fatty acids seem to occur almost exclusively in an individual algal taxon. In
addition, the fatty acid composition of algae can vary both quantitatively and
qualitatively with their physiological status and culture conditions.
The properties of biodiesel are largely determined by the structure of its
component fatty acid esters (35).The most important characteristics include
ignition quality (i.e. cetane number), cold-flow properties and oxidative stability.
While saturation and fatty acid profile do not appear to have much of an impact
on the production of biodiesel by the trans-esterification process, they do affect
the properties of the fuel product. For example, saturated fats produce a
biodiesel with superior oxidative stability and a higher cetane number, but rather
poor low-temperature properties. Biodiesels produced using these saturated fats
are more likely to gel at ambient temperatures. Biodiesel produced from
feedstocks that are high in PUFAs, on the other hand, has good cold-flow
properties. However, these fatty acids are particularly susceptible to oxidation.
Therefore, biodiesel produced from feedstocks enriched with these fatty acid
species tends to have instability problems during prolonged storage.
13
14
Table 1: Abbreviation of algal species: B.a., Biddulphia aurica (Orcuut and patterson,
1975); C.sp. Chaetoceros sp. (Renaud et al., 2002); N.sp., Nannochloropsis sp.
(Sukenik,1999); M.s., Monodus subterraneus (Cohen,1999); C.s., Chlorella sorokiniana
(Patterson,1970); C.v., Chlorella vulgaris (Herris et al.,1965); P.i., Parietochloris incise
(Khonizin-Goldberg et al., 2002); E.h., Emiliania huxleyi (Volkman et al., 1981); I.g.,
Isochrysis galbana (Volkman et al., 1981); P.p., Phaeomonas parva (Kawachi et al., 2002);
G.c., Glossomastrix chrysoplasta (Kawachi et al., 2002); A.sp., Aphanocapsa sp. (Kenyon,
1972); S.p., Spirulina platensis (Mhling et al., 2005); T.e., Trichodesmium erythraeum
(Parker et al ., 1967); H.b., Hemiselmis brunescens (Chuecas and Riley, 1969); R.l.,
Rhodomonas lens (Beach et al., 1970); G.s., Gymnodinium sanguineum; S.sp., Scrippsiella
sp. (Mansour et al., 1999).
15
2.3
Lipid metabolism, particularly the biosynthetic pathways of fatty acids and TAG,
has been poorly studied in algae in comparison to higher plants. Based upon the
sequence homology and some shared biochemical characteristics of a number of
genes and/or enzymes isolated from algae and higher plants that are involved in
lipid metabolism, it is generally believed that the basic pathway of fatty acids
and TAG biosynthesis in algae are directly analogous to those demonstrated in
higher plants. It should be noted that because the evidence obtained from algal
lipid research is still fragmentary, some broad generalization are made in this
section based on limited experimental data.
In algae, the de novo synthesis of fatty acid occur primarily in the chloroplast. A
generalized scheme for fatty acids biosynthesis is show in (Figure 2). Overall,
the pathway produces a 16- or 18-carbon fatty acids or both. These are then used
as a precursor for the synthesis of chloroplast and other cellular membranes as
well as for the synthesis of neutral storage lipids, manly TAGs. The committed
step in fatty acid synthesis is the conversion of acetyl CoA to malonyl CoA,
catalyzed by acetyl CoA carboxylase (Acetyl-CoA carboxylase). In the
16
17
All subsequent reactions of the pathway involve ACP until the finished products
are ready for transfer to glycerolipids or export from the chloroplast. The
malonyl group of malonyl ACP participates in a series of condensation reactions
with acyl ACP (or acetyl CoA) acceptors. The first condensation reaction forms
a four-carbon product, and is catalyzed by the condensing enzyme, 3-ketoacyl
ACP synthase III (KAS III). Another condensing enzyme, KAS I, is responsible
for producing varying chain lengths (616 carbons). Three additional reactions
occur after each condensation. To form a saturated fatty acid the 3-ketoacyl ACP
product is reduced by the enzyme
3-ketoacyl ACP reductase, dehydrated by hydroxyacyl ACP dehydratase and
then reduced by the enzyme enoyl ACP reductase (Figure 2). These four
reactions lead to a lengthening of the precursor fatty acid by two carbons. The
fatty acid biosynthesis pathway produces saturated 16:0- and 18:0-ACP. To
produce an unsaturated fatty acid, a double bond is introduced by the soluble
enzyme stearoyl ACP desaturase. The elongation of fatty acids is terminated
either when the acyl group is removed from ACP by an acyl-ACP thioesterase
that hydrolyzes the acyl ACP and releases free fatty acid or acyltransferases in
the chloroplast transfer the fatty acid directly from ACP to glycerol-3-phosphate
or monoacylglycerol-3-phosphate. The final fatty acid composition of individual
algae is determined by the activities of enzymes that use these acyl ACPs at the
termination phase of fatty acid synthesis.
18
Figure
2.
Fatty
acid
de
novo
synthesis
pathway
in
chloroplasts.
Acetyl CoA enters the pathway as a substrate for acetyl CoA carboxylase (Reaction 1) as well
as a substrate for the initial condensation reaction (Reaction 3). Reaction 2, which is catalyzed
by malonyl CoA:ACP transferase and transfers malonyl from CoA to form malonyl ACP.
Malonyl ACP is the carbon donor for subsequent elongation reactions. After subsequent
condensations, the 3-ketoacyl ACP product is reduced (Reaction 4), dehydrated (Reaction 5)
and reduced again (Reaction 6), by 3-ketoacyl ACP reductase, 3-hydroxyacyl ACP dehydrase
and enoyl ACP reductase, respectively (adapted from Ohlrogge and Browse 1995).
19
All algae must produce fatty acids, and this synthesis must be tightly controlled
to the balance supply and demand for acyl chains. This need can be highly
variable and it depend on
the
20
2.5
Comparing algae with plants and other species, and based upon sequence
homology of gene and enzyme from algae and plants involved in the lipid
pathway, some generalization hypothesis have been made about to understand
which enzyme regulate fatty acids synthesis. Acetyl-CoA carboxylase is
frequently considered the first committed step in the fatty acid biosynthetic
pathway.
In animals(38) and in yeast (39) there is evidence for Acetyl-CoA carboxylase as
a major regulatory enzyme in fatty acid production. Also for plants this enzyme
was proposed to be rate determining for fatty acid biosynthesis. Several lines of
21
22
Most of the substrates and intermediates of plant Fatty acids synthesis are
attached to acyl carrier protein (ACP) (Figure 2). Analysis of acyl-ACPs is aided
because the chain length of fatty acids attached to ACP alters the mobility of the
protein in native or urea PAGE gels. Because of these alterations in mobility,
most of the acyl-ACP intermediates of fatty acid synthesis can be resolved, and
when transferred to nitrocellulose, antibodies to ACP can provide sensitive
detection at nanogram levels. Although the acyl-ACP intermediates have a half
life in vivo of only a few seconds (43) (44), by rapidly freezing tissues in liquid
nitrogen it has been possible to determine the relative concentrations of free,
nonacylated ACPs and of the individual acyl-ACPs. Analysis of acyl- ACP pools
has been used to study regulation of fatty acids synthesis in spinach leaf and
seed (45) in chloroplasts in developing castor seeds and in tobacco suspension
cultures (46)(47) . The initial examination of the composition of the acyl-ACP
pools provided information about the potential regulatory reactions in plant fatty
acid biosynthesis. The various saturated acyl-ACP intermediates between 4:0
and 14:0 occur in approximately equal concentrations. Because the 3-ketoacylACPs, enoyl-ACPs, or 3-hydroxyacyl-ACPs, which are substrates for the two
reductases and dehydrase reactions, were not detected, it is likely that these
reactions are close to equilibrium and that the in vivo activities of these enzymes
are in excess. Thus it is not likely that these enzymes are regulatory. In contrast,
the concentration of acetyl-ACP was considerably above that of malonyl-ACP.
23
This result suggests that the acetyl-CoA carboxylase reaction, which has an
equilibrium constant slightly favouring malonyl-CoA formation, is significantly
displaced from equilibrium and therefore potentially regulatory (43). The
condensing enzymes can also be considered displaced from equilibrium because
of the concentration of malonyl-ACP and the saturated acyl-ACPs. To obtain
more information on sites of regulation, the changes in pool sizes when flux
through the fatty acid biosynthetic pathway changes were examined. The rate of
spinach leaf fatty acid biosynthesis in the dark is approximately one sixth the
rate observed in the light (49).
In the light, the predominant form of ACP was the free, nonacylated form,
whereas acetyl-ACP represented about 56% of the total ACP (46). In the dark,
the level of acetyl-ACP increased substantially with a corresponding decrease in
free ACP, such that acetyl-ACP was now the predominant form of ACP. In
similar experiments, when chloroplasts are shifted to the dark, malonyl-ACP and
malonyl-CoA disappear within a few seconds, and acetyl-ACP levels increase
over a period of several minutes. The rapid decrease in malonyl-ACP and
malonyl-CoA when fatty acid synthesis slows, together with the increase in
acetyl-ACP and lack of change in other intermediate acyl-ACP pools all lead to
the conclusion that Acetyl-CoA carboxylase activity is the major determinant of
light/dark control over fatty acids synthesis rates in leaves.
24
The above experiments on acyl-ACP and acyl-CoA pools have been carried out
with dicot plants. Gramineae species such as maize and wheat have a
substantially different (homodimeric) structure of Acetyl-CoA carboxylase. (50)
by a different approach toward evaluating metabolic control. They
took
25
26
additional controls.
Understanding how cells regulate the production of these fatty acids and direct
them toward their different functions is thus central to understanding a large
range of fundamental questions in algae biology. We now have convincing
evidence that Acetyl-CoA carboxylase is one enzyme that is involved in
regulating fatty acid synthesis rates, this is only the beginning. But what
molecules regulate Acetyl-CoA carboxylase by feedback or other mechanisms
and what metabolic signals or mechanisms control those molecules? We don't
have information about the nature of these controls. Thus, understanding
regulation of fatty acid synthesis is a rich and relatively unexplored field with
much work left to be done.
27
Although algae generally share similar fatty acid and TAG synthetic pathways
with higher plants, there is some evidence that differences in lipid metabolism
do occur. In algae, for example, the complete pathway from carbon dioxide
fixation to TAG synthesis and sequestration takes place within a single cell,
whereas the synthesis and accumulation of TAG only occur in special tissues or
organs (e.g. seeds or fruits) of oil crop plants. In addition, very long PUFAs
above C18 cannot be synthesized in significant amounts by naturally occurring
higher plants, whereas many algae (especially marine species) have the ability to
synthesize and accumulate large quantities of very long PUFAs, such as
eicosapentaenoic acid (C20:53), docosahexaenoic acid (C22:63) and
arachidonic acid (C20:46). Recently, annotation of the genes involved in lipid
metabolism in the green alga C.reinhardtii has revealed that algal lipid
metabolism may be less complex than in Arabidopsis, and this is reflected in the
presence and/or absence of certain pathways and the apparent sizes of the gene
families that represent the various activities (55)
28
Although the occurrence and the extent to which TAG is produced appear to be
species/strain-specific, and are ultimately controlled by the genetic make-up of
individual organisms, oleaginous algae produce only small quantities of TAG
under optimal growth or favourable environmental conditions (57). Synthesis
and accumulation of large amounts of TAG accompanied by considerable
alterations in lipid and fatty acid composition occur in the cell when oleaginous
algae are placed under stress conditions imposed by chemical or physical
environmental stimuli, either acting individually or in combination. The major
chemical stimuli are nutrient starvation, salinity and growth-medium pH. The
major physical stimuli are temperature and light intensity. In addition to
chemical and physical factors, growth phase and/or aging of the culture also
affects TAG content and fatty acid composition.
3.2.1
Nutrients
Of all the nutrients evaluated, nitrogen limitation is the single most critical
nutrient affecting lipid metabolism in algae. A general trend towards
29
lipid
content,
mainly
TAG,
in
Monodus
subterraneus
30
3.2.2 Temperature
Temperature has been found to have a major effect on the fatty acid composition
of algae. A general trend towards increasing fatty acid unsaturation with
decreasing temperature and increasing saturated fatty acids with increasing
temperature has been observed in many algae and cyanobacteria (64) . It has
been generally speculated that the ability of algae to alter the physical properties
and thermal responses of membrane lipids represents a strategy for enhancing
physiological acclimatization over a range of temperatures, although the
underlying regulatory mechanism is unknown (65). Temperature also affects the
total lipid content in algae. For example, the lipid content in the chrysophytan
Ochromonas danica (66) and the eustigmatophyte Nannochloropsis salina (67)
31
32
that low light favors the formation of PUFAs, which in turn are incorporated into
membrane structures. On the other hand, high light alters fatty acid synthesis to
produce more of the saturated and mono-unsaturated fatty acids that mainly
make up neutral lipids.
33
34
35
under stress. TAG synthesis may also utilize PC, PE and galactolipids or toxic
fatty acids excluded from the membrane system as acyl donors, thereby serving
as a mechanism to detoxify membrane lipids and deposit them in the form of
TAG.
36
nutrient acquisition, and the biosynthesis and functions of lipids. The advantage
of C.reinhardtii as a model for oxygenic photosynthesis derives mainly from its
ability to grow either photo-, mixo- or heterotrophically (in the dark and in the
presence of acetate) while maintaining an intact, functional photosynthetic
apparatus. This property has allowed researchers to study photosynthetic
mutations that are lethal in other organisms. Moreover, C.reinhardtii spends most
of its life cycle as a haploid organism of either mating type + or .
Gametogenesis is triggered by environmental stresses, particularly nitrogen
deprivation , and its occurrence can be synchronized by light/dark periods of
growth. During its haploid stage, C. reinhardtii can be genetically engineered
and single genotypes easily generated. Additionally, different phenotypes can be
obtained by crossing two haploid mutants of different mating types carrying
different genotypes. Conversely, single-mutant genotypes can be unveiled by
back-crossing mutants carrying multiple mutations with the wild-type strain of
the opposite mating type.
Global expression profiling of Chlamydomonas under conditions that produce
biofuels (H2 in this case) (78) has been reported using second-generation
microarrays with 10 000 genes of the over 15 000 genes predicted (77). Much of
the information that was reported involves fermentative metabolism. No
concerted effort to characterize up- and downregulation of genes associated with
37
14C-labeling
techniques) were
in
anaerobic
Chlamydomonas
cells(78).
Unfortunately
lipid
metabolism was not studied. Finally, researchers are starting to ask whether
Chlamydomonas and other green algae have the required metabolic pathways to
38
Genus species
Sequencing status
reference
green algae
complete assembly release,
v 3.0
Chlamydomonas reinhardtii
JGI genome project
complete assembly release
Chorella spNC64A
JGI genome project
v 1.0
Chorella vulgaris
in progress
genbank
complete assembly release
v.2.0
Ostreococcus lucimaris
JGI genome project
Complete assembly release
Ostreococcus tauri
JGI genome project
v2.0,
Complete assembly release
v1.0,
Volvox carteri
JGI genome project
Scenedesmus obliquus
minimal
genbank
Dunaliella salina
minimal
genbank
diatom
complete assembly release
Phaeodactylum tricornutum
v2.0
JGI genome project
complete assembly release
v 3.0
Thalassiosira pseudonana
JGI genome project
red algae
Cyanidioschyzon
merolae
Cyanidioschyzon merolae
complete
project
brown algae
Complete assembly release
Aureococcus anophagefferens v1.0,
JGI genome project
39
To characterized the first committed step in the fatty acid biosynthetic pathway,
different study have been conducted to seek for Acetyl-CoA carboxylase but just
in few species it is know .
The gene that encodes Acetyl-CoA carboxylase in Cyclotella cryptica has been
isolated and cloned [82]. The gene was shown to encode a polypeptide
composed of 2089 amino acids, with a molecular mass of 230 kDa. The deduced
amino acid sequence exhibited strong similarity to the sequences of animal and
yeast Acetyl-CoA carboxylases in the biotin carboxylase and carboxyltransferase
domains. Less sequence similarity was observed in the biotin carboxyl carrier
protein domain, although the highly conserved Met-Lys-Met sequence of the
biotin binding site was present. The N-terminus of the predicted Acetyl-CoA
carboxylase sequence has characteristics of a signal sequence, indicating that the
40
enzyme may be imported into chloroplasts via the endoplasmic reticulum. Also
the protein has been purified and kinetically characterized from two unicellular
algae, the diatom Cyclotella cryptica [82] and the prymnesiophyte Isochrysis
galbana [83]. Native Acetyl-CoA carboxylase isolated from Cyclotella cryptica
has a molecular mass of approximately 740 kDa and appears to be composed of
four identical biotin-containing subunits. The molecular mass of the native
Acetyl-CoA carboxylase from I.galbana was estimated at 700 kDa. This
suggests that Acetyl-CoA carboxylases from algae and the majority of AcetylCoA carboxylases from higher plants are similar in that they are composed of
multiple identical subunits, each of which are multi-functional peptides
containing domains responsible for both biotin carboxylation and subsequent
carboxyl transfer to acetyl CoA [83].
Investigated changes in the activities of various lipid and carbohydrate
biosynthetic enzymes in the diatom Cyclotella cryptica in response to silicon
deficiency. The activity of Acetyl-CoA carboxylase increased approximately
two- and fourfold after 4 and 15 h of silicon-deficient growth, respectively,
suggesting that the higher enzymatic activity may partially result from a covalent
modification of the enzyme. As the increase in enzymatic activity can be
blocked by the addition of protein synthesis inhibitors, it was suggested that the
enhanced Acetyl-CoA carboxylase activity could also be the result of an increase
in the rate of enzyme synthesis[82].
41
No more experimental result are available about the sequence of this gene, but
from the algae's genome know some sequence are predicted by comparison with
other species
4 Methods
4.1
Genetic source
The purpose of this thesis is to analyse the gene (ACCase) and the amount of
fatty acids in algae. Thus the main source of information is represented by
amount of fatty acids, (TAGs), predicted genes and related information on them.
As mentioned above, only one gene from Cyclotella cryptica has been
experimental characterized. To retrieve the most part of this information we
used different databases, principally AlgaeBase and Joint Genome Institute
databases, although Genebank and EMBL- BANK were also used. A brief
description of the less known databases are give below.
AlgaeBase is a database of information on algae that includes terrestrial, marine
42
and freshwater organisms. At present, the data for the marine algae, particularly
seaweeds, are the most complete and also include sea-grasses. Unfortunately it
is also a work in progress and much of the data is incomplete. This database was
used particularly to collect information about the quantity of fatty acids and
(TAGs) from previously published articles.
JGI is a comprehensive database of the genomes of Eukaryotic species and it
has the most complete genomic data of algae strains, data is organized following
the scheme whereby the main object is the genome; this is organized into
portions of the genomic sequence reconstructed from the end sequence , called
the scaffold. They are composed of contigs and gaps. One chromosome may be
represented by many scaffolds, depending on the extend of the genome
information. The database holds a list of prediction genes that can be retrieved
by browser (KOG) EuKaryotic Orthologous Group. Using a KOG tool for a JGI
sequence organism provides a way to find predicted genes by functional
classification or identified number as well as protein sequence and related. the
genes are referred to using Internal and external cross-referenced annotations.
The data was found using a query on the KOG browser by functional KOG
database and by following the cross reference annotation to NCBI genebank
stored in local. More details are available in the (table 2)
43
44
Genus species
Alpha
Beta
Biotin
Lipids
Source
Clamidomonas reinhardtii
ID:5722616
ID:5728708
ID:5727859
18-24% DW
NCBI
Chorella spNC64A
ID: 36222
-----
-----
29% DW
GJI
Chorella Protothecoides
-----
-----
15-55% DW
Xu et al. 2006
Volvox Carteri
ID: 106840*
Scenedesmus obliquus
ID:EC187118**
Scenedesmus TR84
-----
-----
ID: 82311
-----
-----
GJI
-----
-----
-----
EBI
-----
-----
Locus :CR954208
Ostreococcus Tauri
Sheehan et al.1998
-----
NCBI GenBank
ID:4999505
-----
NCBI
-----
-----
EBI
Ostreococcus Lucimaris
-----
Dunaliella Salina
-----
Dunadiella Tertioleca
-----
-----
-----
36-42% DW
Sticoccus
-----
-----
-----
33% DW
sheehan et al.1998
Ankistrodesmus TR84
-----
-----
-----
28-40 DW
Botryococcus Braunii
-----
-----
-----
29-75% DW
Parietochloris Incisa
-----
-----
-----
35% DW
Roessler (1988)
Thalassiosira Pseudonana
-----
-----
21-31% DW
GJI
Cyclotella Criptica
21% DW
NCBI GenBank
Cyclotella Di-35
Nitzschia TR-114
Hantzschia DI 160
M Phaeodactylum Triconutum
-----
45% DW
ID:EF363909*
ID: 6770
Locus :L20784
-------------
-----
-----
-----
42% DW
Sheehan et al.1998
-----
-----
28-50% DW
-----
-----
11-31% DW
Sheehan et al.1998
11-31% DW
GJI
-----
ID:55209
Table2:of gene sequence and quantity of fatty acids (*cDNA sequence; ** besed on proteine
aligment
45
4.2
Sequence analysis
In order to find a data set as reliable as possible, different strategies were applied
these depended on the database employed. Nevertheless, in most cases the use of
a Basic Local Alignment sequence tool (Blast) in comparison to the nucleotide
or protein resulted in being able to see what was relevant and of particularly
importance. BLAST uses statistical theory to produce a bit-score and expect
value (E-value) for each alignment pair. The E-value gives an indication of the
statistical significance of a given pairwise alignment and reflects the size of the
database and the scoring system used. The lower the E-value, the more
significant the hit. A sequence alignment that has an E-value of 0.05 means that
this similarity has a 5 in 100 (1 in 20) chance of occurring by chance alone. A
strict, high E-value threshold (1-80) has been applied in order to keep the most
reliable sequences. Following these strategies we managed to seek out ACCase
predicted genes and some sequences that point out high E-value beyond the
threshold, furthermore to evaluate meaningful sequences ClustalW in local was
used. This perform a global-multiple sequences alignment by the progressive
method with the following steps: perform pair-wise alignment of all sequences
by dynamic programming, use the alignment scores to produce a phylogenetic
46
4.3
Computational methodologies
47
The methodology that was used to carry out this study was supported by
bioinformatic tools to find out the promoter gene sequence. In the first place,
the theoretical idea will be introduced and after that the tools that were used
Computational methods for the discovery of the conserved TFBSs can be split
into two broad categories: the 'single species, many genes' approach and the
'single gene, many species' one [84]. In the first case, a set of regions (i.e.,
promoters) from co-regulated genes were analysed to look for over-represented
motifs, that is, the TFBSs responsible for the co-regulation of the genes; while in
the second approach, known as phylogenetic footprinting , a single gene was
investigated and non coding regions flanking were compared to their homologs
in other species. Non coding sequence elements that were found to be conserved
by evolution are likely to be involved in the regulation of the expression of the
gene.
Clearly, the two approaches can be merged and each set of co-regulated genes
can be compared both to its homologs and to each other, this analysis can also be
performed on a full-genomic scale . Given that an ever increasing number of
annotated genomic sequences are available, phylogenetic footprinting has
become more and more widely used since it avoids the need to assemble a set of
co-regulated genes, this in turn implies the need to build a reliable dataset, and
allows for the investigation of a single gene.
48
The available methods that are more commonly used first build an alignment,
either local or global, of the sequences investigated, or to take advantage of the
pre-computed full genomic alignments now available. One simple solution is to
identify conserved functional elements by using descriptors of the binding
specificity of TFBSs I.e
example, in the TRANSFAC database, and also to look for conserved aligned
regions fitting the descriptor. This approach can be used for the detection of
single TFBSs.
Methods of this kind need reliable descriptors of the binding specificity of the
different TFs. Usually, PWMs yield a large number of false positive matches and
in requiring a match to be conserved throughout different sequences it is
possible to reduce them, however, the problem of defining whether a match is
significant in all the species considered remains. Secondly, and more
importantly, is the need to have a reliable alignment of the sequences
investigated. TFBSs tend to be quite short (815 nucleotides), in comparison
with a normally analysed region of 5001000 bp, in these cases the sequences
aligned are too divergent resulting in the possibilities that conserved TFBSs
may be missed simply because they are not aligned correctly.
To overcome this, different solution were identified : Bioprospector [85] and
Galf_P [86]were used. Both of them used a heuristic method and both seek for
TFBSs without any prior preset knowledge, a brief description of both programs
is given in the following paragraphs.
49
4.3.1 Bioprospector
This examines the upstream region of genes in the same gene expression pattern
group and looks for conservative sequence motifs. BioProspector uses zero to
third-order Markov background models, whose parameters are either given by
the user or estimated from a specified sequence file. The significance of each
motif found is judged based on a motif score distribution .
It takes the following input parameters; a file where the flanking region are
stored, a file with the background distribution using calculations on the bases of
the input file and the widths of the motif. At the end of the bioprospector run the
following results can be obtained : The motif score, significance value and the
number of the significance alignments segments. A regular expression of the
motif consensus and degenerates, as well as a probability matrix expression of
the motif. The number of segments each input sequence contributes to the motif,
the starting position and the sequence of each segment.
Within each run of BioProspector, a process called threshold sampler is
performed a number of times. Threshold sampler initializes a motif probability
matrix by a random alignment of the input sequences and improves the matrix
iteratively and stochastically
50
51
4.3.2 GALF_P
This is a genetic algorithm which combines and utilises both, position-led and
consensus-led motif representations.
The consensus-led approach is the one in which an individual is encoded as
potential consensus represented by (A,T,G,C). The position-led represent in a
array the possible starting position of TFBS in each sequence. Both methods
present advantages and disadvantages. The individuals of the consensus method
can be randomly generated or picked up randomly from the subsequences with
motif length among all the input sequences. One disadvantage of consensus-led
approaches is that they require the scanning of all the sequences to align each
one to the consensus when evaluating a single individual. this imposes an
intensive computational load. Additionally, when the consensus happens to be a
shifted version of the true one, they have no easy techniques to correct the
consensus. Position-led approaches have more flexibility to move around the
search space compared with consensus-led ones, because there is freedom to
change any starting position, which consequently changes the motif
configuration, it is easy to shift the motif by changing all positions with the
same small step. However, the representation cannot provide a global view of the
quality of each TFBS position and cannot remove a small portion of the
unsuitable positions easily.
This algorithm not only used both of the representations to take advantage, but
also, to some extend, limits the drawbacks by the implementation of a filtering
operator, which employs consensus-led representation and avoids the tendency
of position-led to accumulate false positives within an individual. Consequently
52
4.3.2.1Representation
For an individual, the basic representation is the position- led one, which is an
array storing the starting position in each sequence. Starting from each position
in an individual, a subsequence with the motif width can be extracted and is
called a motif instance. The consensus of an individual is represented by a
Position-specific Weight Matrix (PWM) generated from the motif instances.
Each cell in the PWM indicates the normalized frequency of the nucleotide in a
particular position of the motif instances
4.3.2.2Fitness Evaluation
The fitness function adopt for each individual is its information content. For
each position i in the extracted motif instances, the positional information
content is :
53
where bi is the nucleotide in position i of the motif instance, and PWM (bi , i) is
54
55
One of the feature operators in GALF is the local filtering operator, which can
filter out the false positives in a position-led individual in terms of the motif
instances' similarities to the consensus represented by PWM.
Firstly, the motif instances within an individual is ranked by their similarity
scores to the consensus. Secondly, the sequence containing the instance with the
lowest similarity score is scanned. Among all the possible starting sites of the
instance, the one giving the best similarity to the consensus is chosen. If the rank
does not change, which means this best instance is not, in fact, better than its
original preceding instance based on the other sequence in terms of similarity
score. In this case the local filtering stops, otherwise the preceding instance
becomes the worst, the sequence containing it is then selected and scanned as in
the first step. This is iterated until the rank does not change or the sequence
containing the original second best instance is scanned. It is notice that the
PWM will not be updated before the local filtering is finished for two purposes;
one is to save the computational load compared with the on-line update, the
other purpose is to try not to be too greedy. In order to keep the contribution of
evolutionary process, the filtering operator is only triggered once after certain
generation intervals and applied to those only newly generated
4.3.2.5Replacement Strategy
56
Replacement is applied to keep the population size constant after the increase of
individuals during reproduction. Before replacement, all duplicate individuals
will be removed to avoid a take-over rate that is too fast . This is done by
assigning an arbitrarily low fitness to those duplicates. Each individual competes
with K randomly chosen from other individuals and scores a win if its fitness is
higher than its competitor. K is user defined and fixed at 10 . The number of
wins of each individual are recorded and ranked, when there is a tie in the
number of wins between the two different individuals. They are then re-ranked
by their fitness. Those whose final rankings are beyond the desired population
size will be eliminated.
4.3.2.6Shift Operator
When the individual with best fitness stagnates, which means it does not change
after certain generations ( the generations of stagnation), a small number of
shifts (all of the positions of the individual are moved in either
direction by the same bases) are tested for improvement of fitness. Based on the
gain of fitness, the smallest shift with positive gain will be chosen. If both
directions of the same shift number achieve improvement, the direction with
better gain is chosen. This moderate shift operator is to prevent a drastic shift
which may drag the solutions to local optima too fast before convergence.
57
4.3.3 Implementation
Both softwares were run with the same dataset, consisting of the upstream alpha
fragment region of Clamidomonas reindhardtii; Volvox Carterii ; Chorella sp
NC64A. It was necessary to keep those sequences as no other sequences had
upstream parts available.
To keep the parameters as the author suggested for GALF_P the population is
initialised by randomly generating the start position in each sequence. The
population size was set at 500 and the offspring 250, whilst for BioProspector
a background sequence for the whole genome of Clamidomonas Reinardhii and
40 time reinitializes options that try to avoid a local maximal were set. These
were ran 5 time with different motif width (8; 10; 12; 14; 16; ) to underline
the shared motif. Only 3 shared motifs were found, these occurred several times
showing a high motif score or fitness evaluation as shown in table3.
Sequences
Bioprospector + Bg
GALF_P
TGTTTT(N)C
Fitness : 3.344442
AA(N)CCTGCA(N)
Fitness : 2.900421
AATC(N)TGC(N)C
Fitness : 3.004174
58
4.4
Primer design
Once the sequences were found, a primer was design to characterise the ACCase
in Scenedesmus Protuberans. To reach this purpose, the first step consisted in
using the sequences among the homologous strains phylogenetically as closely
as possible, in order to find the sub-sequences highly conserved. Although few
sequences were available for this purpose, a script in python was written to use
the power of regular expression in order to calculate a potential sequences
primer. The script takes an input-file from Clustalw in multi-alignment format.
In this way a list of potential primers was obtained, derived from the length of
the alignment sequences, without mismatch, with a threshold of extension on
the temperature of annealing that had been set out with the formula (Tm = 2(A +
T) + 4(G + C) ). Also the percentage of GC
stop or start signal and the possible matches between the primer sequences were
proofed. The mismatch extension was allowed if the sequences did not reach the
temperature threshold. An example of output in (table 4) is shown below .
59
Forward 5'-3'
Reverse 3'-5'
Forward 5'-3'
Reverse 3'-5'
Forward 5'-3'
Reverse 3'-5'
Forward 5'-3'
Reverse 3'-5'
Forward 5'-3'
Reverse 3'-5'
Forward 5'-3'
Reverse 3'-5'
Forward 5'-3'
Reverse 3'-5'
Forward 5'-3'
Reverse 3'-5'
Forward 5'-3'
Reverse 3'-5'
Forward 5'-3'
Reverse 3'-5'
Forward 5'-3'
Reverse 3'-5'
Forward 5'-3'
Reverse 3'-5'
Forward 5'-3'
Reverse 3'-5'
Forward 5'-3'
Reverse 3'-5'
Forward 5'-3'
Reverse 3'-5'
Forward 5'-3'
Reverse 3'-5'
Forward 5'-3'
Reverse 3'-5'
Primer design
ipotetical primer
lenght %GC Cmelting Pb conservative GAP
GTATCGGCAGCATCAATGG
19 52,63
58
15
GTCACTATTGATGTCTCATCT
21 38,09
58
ACGAGCGTTGAGCTCAACG
19 57,89
60
CTTCTTGTCTTCGTACGGTT
20
45
58
GTATCGGCAGCATCAATGG
19 52,63
58
CTTCTTGTCTTCGTACGGTT
20
45
58
GTGCGTTGCCACGAGCGT
18 66,66
60
13
CGCGAACCGACCCGCG
16 81,25
58
11
ATCGGCAGCATCAATGGCA
19 52,63
58
13
CCCACCGTGCGCGCCA
16 81,25
58
12
TAGCGATAATGAGAGCAGGG
20
50
60
12
GCCCTCATTCACTTGCGTG
19 57,89
60
13
TCGCCGCAACTTCGGCAT
18 61,11
58
12
GCTCCTCCTCTAGTTCGAC
19 57,89
60
12
TAACCCACTTGAGGAGCAC
19 52,63
58
12
CCAGGTTCCGACACATGC
18 61,11
58
11
CGCGTGCGTGGCCGCG
16
87,5
60
11
CCAACGTCGTACCACGCG
18 66,66
60
14
TAAGCGCATCAAGGAGGTG
19 52,63
58
13
CACTGTGCTGACAGCTGTT
19 52,63
58
14
ATGCTCCCTGTGGGCACA
18 61,11
58
12
TGGGTGCTGGGCGGGC
16 81,25
58
11
TGACGAAGACTCAGATTGTAT
21 38,09
58
14
CGACCGTCCCACGGCTC
17 76,47
60
13
GCGTCTGCAAGTCGCTCG
18 66,66
60
14
CCGCCTCGTTATTGCTTAC
19 52,63
58
13
TTTCGCGCCAAAAGCTATCT
20
45
58
13
GGGGGCGGTCGTCGTTG
17 76,47
60
13
CCCACTTGAGGAGCACCT
18 61,11
58
12
GTAACGGGTTCTCGTCTATG
20
50
60
13
CACCACGACGCTTGAGTTT
19 52,63
58
14
GCGAAAGTACCGACCGCG
18 66,66
60
12
TGGATCTGGAGCAATTGGC
19 52,63
58
15
GGCCGAGCGGTCGCGG
16
87,5
60
12
AGCACCAGCTGGCGGGA
17 70,58
58
13
TTGCCGCGGCAGCAGTTG
18 66,66
60
12
fragment
3
1
3
1
5
1
0
1
2
1
1
0
0
0
0
1
1
0
2
1
1
0
1
1
0
2
1
1
0
0
2
2
2
1
2
1
4002
4002
4279
4279
3889
3889
3920
3920
2950
2950
2442
2442
1796
1796
4794
4794
4441
4441
4229
4229
4152
4152
3911
3911
3549
3549
2776
2776
4697
4697
4008
4008
3615
3615
3030
3030
strat/ stop
ATG(16)
stop*
TGA(9)
ATG(14)
ATG(16)
ATG(14)
N. start
N.stop
ATG(14)
N.stop
start/ stop*
TGA(11);TAA(7)
N. start
N.stop
TGA(10);TAA(1)
N.stop
N. start
ATG(10)
TAA(1)
TGA(2)
ATG(0)
N.stop
TGA(1)
N. start
ATG(17);TAA(11)
N. start
N.stop
TGA(6)
N.stop
TGA(13)
ATG(8)
N. start
N.stop
N. start
N.stop
Unfortunately the alignments did not give reliable primers because the
alignments were not
60
61
62
63
From that alignments it was possible to design some perfect matches and
degenerate primer sequences as shown below :
SceneForward 5'ACCTGCCTGGACATCATCCTNAACATCAC 3' Tm=64 GC=51,7 no dimer
Scene Reverse 5'TACCGGAGCGGGACCGGGTCGA 3'
64
65
Compound
500 mL
Filtered Seawater
75 mg
NaNO3
8.83 x 10-4 M
5 mg
3.63 x 10-5 M
1 mL
Tracemetal solution
---
0.6 mL
Vitamin solution
----
Molar
concetration in
final Medium
(50 mL)
1575 mg
FeCl3 . 6 H2O
1 x 10-5 M
2180 mg
Na EDTA . 2 H2O
1 x 10-5 M
0.49 mg
Cu SO4 . 5 H2O
4 x 10-8 M
0.315 mg
Na Mo O4. 2 H2O
3 x 10-8 M
1.1 mg
Zn SO4. 7 H2O
8 x 10-8 M
0.5 mg
Co Cl2 . 6 H2O
5 x 10-8 M
5 mg
Mn Cl2 . 4 H2O
9 x 10-9 M
Vitamin solution
2.5 l
Biotin
2 x 10-9 M
This recipe was autoclavate and stored in refrigerator, NA2 SiO3 . 9 H2O were
omitted as the author suggested to avoid silica precipitation form, in the same
way the second medium was prepared and glycerol added.
Unfortunately the culture from Ege Biotecnology was contaminated resulting in
66
the algae growing rate being too slow. To eliminate this unknown contamination
the antibiotic as Spectinoycin was added, however this was not successful. An
extraction of DNA by (Qiagen KIT ) was attempted from the culture but as was
predicted beforehand the algae DNA was not obtained as shown in the photo of
electrophoresys analysis below.
67
To carry out the experiment a new sample was ordered from (EBILTEM Ege
University ) this time the algae was already grown so that it was possible to
directly extracted a DNA by a standard phenol- chloroform extraction obtaining
a pure and large amount of DNA for each sample (ng/ul 1 : 163,28 ; 2 : 158,49 ;
3 : 214,47 ) , as shown in the photo of electrophoresys gel.
68
Using this sample two different PCR analysis were obtained which identified
this sequence.
69
The first one, PCR analysis, was conducted with the following thermal cycle,
initial denaturation 94 C for 2 min, denaturation 94 C for 20 sec , Annealing
61,5 C 10 sec, extention 72 C 15 sec final extention 72C 5 min. The cycle was
repeat 30 times. According to primer temperature and the length of the expected
product
The second one, was carried out with a gradient PCR and a different primer
reverse (Rcre), in this way it was possible to optimised the melting temperature
and avoid unspecific product. The reaction was conducted with 8 samples at
different temperatures, calculated from the average annealing temperature 58.5
C as show in the picture :
70
The next analysis was used to check if a digestion by restriction enzyme gave
the expect fragment by gel purification of the last 3 upper bands from the last
gradient PCR done. A SphI was used for this in accordance with the current
71
assets of the laboratory and the restriction site on the map, having just one on
the sequence. Unfortunately no negative control was possible due to the lack of
enzyme
72
and the final phenotypes of an organism. Typically, when certain TFs bind to the
TFBS in the promoter region of the corresponding sequence, the transcriptional
process is signalled and initiated. On the other hand, when other competing
molecules interacted with the binding site the transcription factor fails to bind
and the transcriptional process, in the worst case scenario, was inhibited The
molecule and hence the TFBS can cause modulation in the transcriptional
process which in turn produces more or less mRNA. Therefore it was necessary
to identify the TFBSs sequence to be able to decipher the mechanism of gene
regulation. Based upon this a hypothesis was made to prove that algae with a
high fatty acid content would have stronger promoters for ACCase genes.
Therefore it was necessary to be able to indentify a sequence which conserved
the common factors for those strains which produce a significant quantity of
fatty acids that might be lacking in other strains. Unfortunately due to the
limitations of the genomic data and a lack of quantified data in respect of the
quantity of fatty acids in algae currently available it was not possible to carry out
this study. Instead the upper part of the alpha fragment homologous predicted
gene was used to search for the generic conservative sequence was shared. To
explore this field a bio-informatics approach was used; although a biological
experiment method such as DNA foot printing and gel electrophoresis would
have been more reliable and accurate to identify TFBSs, this had the
disadvantage of being very expensive and time consuming. Consequently two
73
74
position
Bioprospector + Bg
TGTTTT(N)C
Fitness : 3.344442
Fitness : 2.900421
AATC(N)TGC(N)C
Fitness : 3.004174
GALF_P
At this time it is not possible to make any firm conclusions regarding the TBFS
factor but it would not be unreasonable to consider this a sound proposal for
further investigative research.
On the other hand, in the experimental phase using the information collect
allowed for the design primers in order to identify the gene from Scenedesmus
Protuberans, a green algae already discussed in a previous study, which has the
ability to store a large amount of fatty acids in its cells. The degenerated primers
sequence was determined, using Clustalw by means of alignment of the
sequences of Scenedescmus Obliqus and Chlamidomonas Reinhatii in the Alpha
part. These primers gave the possibility to search for the gene fragment in the
75
strain in question but still the results from the analysis remained ambiguous
because the unspecific PCR products were found not even with restriction
analysis of the sequence and therefore it was not possible to recognise the
fragment that was searched for. It is only possible to determine with some
certainty the gene fragments identity by sequencing of the fragments found. At
the present moment these results are not yet available.
Bibliography
1: Agarwal A.K. Biodiesel development and characterization for use as biofuel J.Eng 2001
2: Durrett, T., Benning, C. and Ohlrogge, J, Plant triacylglycerols as feedstocks for the
production of biofuels., 2008
3: National Biodiesel bord, National Biodiesel Board, 2008
4: Exposy, exposy news 2007, 2007
5: Martinot, Eric, Renewables 2007 Global status Report, 2008
6: National Biodiesel Board, Statistic. the EU biodiesel industry, 2008
7: Biopower london, Biodiesel to drive up the price of the coosking oil, 2006
8: Herer, Jack, the Emperor Wears no Clothes, 1985
9: Klass, Donald ., Biomass for renewable Energy Fuels, and Chemicals.,
10: Kitani, Osamu, Energy and Biomass Engineering, 1999
11: Enviromental news, Biofuel some numbers,
12: Purdue, Purdue report,
13: American fuel, Biodiesel Yield even Higher Energy Balance,
14: United stated Department of Energy, Biodiesel just the basic, 2007
15: Australia Brodcasting Corporation, Biofuel Demend markes fried food expensive in
indonesia,
16: Lester Brown, How Food and Fuel Compete for land,
17: Falkowski, P.G. and Raven, J.A, Aquatic Photosynthesis, 1997
18: Van den Hoek, C., Mann, D.G. and Jahns, H.M., Algae: An Introduction to Phycology,
1995
19: Guschina and Harwood,, Lipids and lipid metabolism in eukaryotic algae, 2006
20: Thompson, G.A., Lipids and membrane function in green algae., 1996
21: Wada and Murata,, Membrane lipids in cyanobacteria. In Lipids in Photosynthesis:
Structure,, 1998
22: Ben-Amotz, A., Shaish, A. and Avron, M. , Mode of action of the massively accumulated
-carotene of Dunaliella bardawil in protecting the alga against damage by excess irradiation.,
23: Bigogno, C., Khozin-Goldberg, I., Boussiba, S., Vonshak, A. and Cohen, Z, Lipid and
fatty acid composition of the green oleaginous alga Parietochloris incisa, the richest plant
source of arachidonic aci, 1971
24: metzger and Largeau, Botryococcus braunii: a rich source for hydrocarbons and related
ether lipids, 2005
25: Benemann, J.R., Pursoff, P. and Oswald, W.J, Microalgae as a Source of Liquid Fuels,
1982
26: Borowitzka, M, Fats, oils and hydrocarbons. In Microalgal Biotechnology , 1988
27: Hu, Q., Zhang, C.W. and Sommerfeld, M, Biodiesel from Algae: Lessons Learned Over
the Past 60 Years and Future Perspectives, 2006
28: Sheehan, J., Dunahay, T., Benemann, J. and Roessler, P.G., A Look Back at the US
Department of Energy's Aquatic Species Program Biodiesel from Algae, Close Out Report ,
1998
29: Ohlrogge and Browse, , 1995
30: Cobelas, M.A. and Lechado, J.Z., ,
31: Basova, M.M, Fatty acid composition of lipids in microalgae, 2005
32: Parker, P.L., van Baalen, C. and Maurer, L., , 1967
33: Bigogno, C., Khozin-Goldberg, I., Boussiba, S., Vonshak, A. and Cohen, Z, A.R, , 1971
34: De Swaaf, M.E., de Rijk, T.C., Eggink, G. and Sijtsma, L., Optimisation of
62: Sato, N., Hagio, M., Wada, H. and Tsuzuki, M. , Environmental effects on acidic lipids of
thylakoid membranes. In Recent Advances in the Biochemistry of Plant Lipids, 2000
63: Saha, S.K., Uma, L. and Subramanian, G, Nitrogen stress induced changes in the marine
cyanobacterium Oscillatoria willei BDU 130511, 2003
64: Renaud, S.M., Thinh, L.V., Lambrinidis, G. and Parry, D.L., Effect of temperature on
growth, chemical composition and fatty acid composition of tropical Australian microalgae
grown in batch cultures., 2002
65: Somerville, C., Direct tests of the role of membrane lipid composition in low-temperatureinduced photoinhibition and chilling sensitivity in plants and cyanbacteria., 1995
66: Aaronson, S, Effect of incubation temperature on the macromolecular and lipid content of
the phytoflagellate Ochromonas danica,
67: Boussiba, S., Vonshak, A., Cohen, Z., Avissar, Y. and Richmond, A., Lipid and biomass
production by the halotolerant microalga Nanochloropsis salina. Biomass, , 1987
68: Patterson, G., Effect of temperature on fatty acid composition of Chlorella sorokiniana.
Lipids, ,
69: Falkowski, P.G. and Owens,, Lightshade adaptation: two strategies in marine
phytoplankton., 1980
70: Khotimchenko, S.V. and Yakovleva, I.M. , Lipid composition of the red alga Tichocarpus
crinitus exposed to different levels of photon irradiance., 2005
71: Fabregas, J., Maseda, A., Dominquez, A. and Otero, A. , ) The cell composition of
Nannochloropsis sp. changes under different irradiances in semicontinuous culture, 2004
72: Sukenik, A., Yamaguchi, Y. and Livne, A, .Alterations in lipid molecular species of the
marine eustigmatophyte Nannochloropsis sp. J., 1993
73: Tonon, T., Larson, T.R. and Graham, I.A. , Long chain polyunsaturated fatty acid
production and partitioning to triacylglycerols in four microalgae., 2002
74: Alonso, D.L., Belarbi, E.H., Fernandez-Sevilla, J.M., Rodriguez-Ruiz, J. and Grima, E.M.,
Acyl lipid composition variation related to culture age and nitrogen concentration in
continuous culture of the microalga Phaeodactylum tricornutum. , 2000
75: Liang, Y., Beardall, J. and Heraud, P., Changes in growth, chlorophyll fluorescence and
fatty acid composition with culture age in batch cultures of Phaeodactylum tricornutum and
Chaetoceros muelleri (Bacillariophyceae). Bot. Mar. , 2006
76: Zhekisheva, M., Boussiba, S., Khozin-Goldberg, I., Zarka, A. and Cohen, Z. ,
Accumulation of oleic acid in Haematococcus pluvialis (Chlorophyceae) under nitrogen
starvation or high light is correlated with that of astaxanthin esters. , 2002
77: Merchant, S.S., Prochnik, S.E., Vallon, O. et al. , The Chlamydomonas genome reveals
the evolution of key animal and plant functions., 2007
78: Mus, F., Dubini, A., Seibert, M., Posewitz, M.C. and Grossman, A.R., Anaerobic
adaptation in Chlamydomonas reinhardtii: anoxic gene expression, hydrogenase induction and
metabolic pathways, 2007
79: Blling, C. and Fiezhn, O. , Metabolite profiling of Chlamydomonas reinhardtii under
nutrient deprivation. , 2005
80: Ghirardi, M.L., Posewitz, M.C., Maness, P.C., Dubini, A., Yu, J. and Seibert, M.,
Hydrogenases and hydrogen photoproduction in oxygenic photosynthetic organisms. , 2007
81: Stauber, E.J. and Hippler, M., Chlamydomonas reinhardtii proteomics. , 2004
82:Roessler, P.G. and Ohlrogge, J.B. Cloning and characterization of the gene that encodes
acetyl-coenzymeA carboxylase in the alga Cyclotella cryptica. J. Biol. Chem.1993
83:Livne, A. and Sukenik, A. (1990) Acetyl coenzyme A carboxylase from the marine
Prymnesiophyte Isochrysis galbana. Plant Cell Physiol. 31, 851858.
84: Giulio,P. Federico Z. and Graziano P. algorithms for finding conserved regulatory motif
and region in homologous sequences. BMC Bioinformatics 2007
85: Bioprospector http://ai.stanford.edu/~xsliu/BioProspector/
86:Galf_P http://appsrv.cse.cuhk.edu.hk/~tmchan/GALFP/
Acknowledgements
First of all, I would like to thank at my supervisor prof. dr. Ugur Sezerman
accepted me as one of his students and for his precious comments and remarks
for all the long talks and urgent e-mails and document too.
Thanks also to the other phD students I have met at Sabancy University: Alper
kucukural and Gunseli Akcapinar
Thanks to all the erasmus students who have invited me to parties and lunches
and thanks to Paolo that shared with me this experience. Especially thanks to
Ceyda and Eda for the fun we had together for the free english lessons and for
having made me feel like at home. Thanks also to the internet for all the free
calls to home.
Finally, many thanks to my family and Anna that keep supporting me all the
time.