Académique Documents
Professionnel Documents
Culture Documents
Molecular Biology
Computational Biology
Bioinformatics
Computer Science
Genomics
Genomics
Sub-fields of genomics:
1. Structural genomics-genetic and physical mapping of
genomes.
2. Functional genomics-analysis of gene function (and nongenes).
3. Comparative genomics-comparison of genomes across
species.
Evolutionary genomics.
COMPARATIVE
GENOMICS
Brief Review
Definition
A comparison of gene numbers , gene
locations & biological functions of gene, in
the genomes of different organisms is
known as comparative genomics.
Major objective : to identify gene or group
of genes that play a unique biological role
in a particular organism.
Few Terminologies
Homology :- Homology is the relationship of any
two characters ( such as two proteins that have
similar sequences ) that have descended,
usually through divergence, from a common
ancestral character.
Homologues are thus components or characters
(such as genes/proteins with similar sequences)
that can be attributed to a common ancestor of
the two organisms during evolution.
Analogues
Analogues are non-homologues
genes/proteins that have descended
convergently from an unrelated ancestor.
They have similar functions although they
are unrelated in either sequence or
structure.
Methods
A DNA walk of a genome represents how the
frequency of each nucleotide of a pairing nucleotide
couple changes locally.
This analysis implies measurement of the local
distribution of Gs in the content of GC and of Ts in the
content of TA.
Lobry was the first to propose this analysis (1996,
1999). Two complementary representations can be
derived from the DNA walk: the cumulative TA- and
the GC-skew analysis.
1) DNA walk
1.1) Drawing a DNA walk by reading a sequence file nucleotide
by nucleotide.
A simple algorithm can be used to draw a DNA walk by simply
assigning a direction to each nucleotide.
Lets T, C, A, and G correspond the E(ast), S(outh), W(est), and
N(orth) directions, respectively. Reading the nucleotide
sequence nucleotide by nucleotide, and following the rule, a
path clearly emerges on the graph.
Figure 1:
1: DNA walk of the sequence
GTCTGGTGTCTGGAGTTCCTGGGTCTTGAGACCACAGGACC
CACCAGGGACCCAGGACCC
Starting from the bottom left (bold blue line), the curve end at the bottom left (pink line)
Methods (dry)
Bioinformatics.
Its tools (software)
Computational analysis
Shannon entropy is a measure of variation
or change over a time series.Genes that
exhibit significant changes are regarded
as good target candidates.
Clustering is a method for grouping
patterns by similarities in their shapes.
GCG tools
Founded in 1982 as a service of the Department of
Genetics at the University of Wisconsin, GCG became a
private company in 1990 and was acquired by Oxford
Molecular Group in 1997. The company was one of the
pioneers of bioinformatics and its Wisconsin Package
sequence analysis tools are widely used and well regarded
throughout the pharmaceutical and biotechnology
industries and in academia. To support enterprise
bioinformatics efforts, GCG developed SeqStore, its
Oracle-based data management system. Desktop solutions
are delivered to bench scientists through products such as
MacVector and OMIGA
PAUP* version 4.0 is a major upgrade and new release of the software
package for inference of evolutionary trees, for use in Windows,
UNIX/VMS, or DOS-based formats.
Target Validation
Target validation involves taking steps to prove that a
DNA, RNA, or protein molecule is directly involved in a
disease process and is therefore a suitable target for
development of a new therapeutic compound.
Genes that do not belong to an established family are
critical to many disease processes and also need to be
validated as potential targets.
Outcomes/ Benefits
Provides first pass information on the function
of the putative protein based on the existence of
conserved protein sequence motifs.
Advancements in computer software
technologies (Bioinformatics) has made
comparative analysis of genomes an extremely
powerful approach for functional genomics too.
These studies can also reveal insights into the
recruitment of enzymes in a pathway
Outcomes/ Benefits
It will help us to understand the genetic basis of diversity
in organisms, both speciation & variation, events that are
important aspects of evolutionary biology.
Comparative genomics provides a powerful way in which
to analyze sequence data.
Indeed, there is already a long list of 'model' organisms,
which allow comparative analyses in a variety of ways.