Académique Documents
Professionnel Documents
Culture Documents
Genomics can be said to have appeared in the 1980s, and took off in the 1990s with the
initiation of genome projects for several species. A major branch of genomics is still
concerned with sequencing the genomes of various organisms, although the knowledge of
full genomes have created the possibility for the field of functional genomics, mainly
concerned with patterns of gene expression during various conditions. The most
important tools here are microarrays and bioinformatics.
Proteomics is the large-scale study of protein, particularly their structures and functions.
It is also the study of the full set of proteins in a cell type or tissue, and the changes
during various conditions. This term was coined to make an analogy with genomics, and
while it is often viewed as the "next step", proteomics is much more complicated than
genomics. Most importantly, while the genome is a rather constant entity, the proteome
differs from cell to cell and is constantly changing through its biochemical interactions
with the genome and the environment. One organism has radically different protein
expression in different parts of its body, in different stages of its life cycle and in different
environmental conditions.
The word is derived from PROTEins and by genOME, since proteins are expressed by
the genome. The proteome refers to all the proteins produced by an organism, much like
the genome is the entire set of genes. The human body may contain more than 2 million
different proteins, each having different functions. Thus, proteomics is the study of the
composition, structure, function, and interaction of the proteins directing the activities of
each living cell. As the main components of the physiological pathways of the cells,
proteins serve as vital functions in the body.
Comparative genomics exploits both similarities and differences in the proteins, RNA,
and regulatory regions of different organisms to infer how selection has acted upon these
elements. Those elements that are responsible for similarities between different species
should be conserved through time (stabilizing selection), while those elements
responsible for differences among species should be divergent (positive selection).
Finally, those elements that are unimportant to the evolutionary success of the organism
will be unconserved (selection is neutral).
Having come a long way from its initial use of finding functional proteins, comparative
genomics is now concentrating on finding regulatory regions and siRNA molecules.
Recently, it has been discovered that distantly related species often share long conserved
stretches of DNA that do not appear to code for any protein. It is unknown at this time
what function such ultra-conserved regions serve.
Drug design is the approach of finding drugs by design, based on their biological targets.
Typically a drug target is a key molecule involved in a particular metabolic or signaling
pathway that is specific to a disease condition or pathology, or to the infectivity or
survival of a microbial pathogen.
Some approaches attempt to stop the functioning of the pathway in the diseased state by
causing a key molecule to stop functioning. Drugs may be designed that bind to the active
region and inhibit this key molecule. However these drugs would also have to be
designed in such a way as not to affect any other important molecules that may be similar
in appearance to the key molecules. Sequence homologies are often used to identify such
risks.
The structure of the drug molecule that can specifically interact with the biomolecules
can be modeled using computational tools. These tools can allow a drug molecule to be
constructed within the biomolecule using knowledge of its structure and the nature of its
active site. Construction of the drug molecule can be made inside out or outside in
depending on whether the core or the R-groups are chosen first. However many of these
approaches are plagued by the practical problems of chemical synthesis.
Newer approaches have also suggested the use of drug molecules that are large and
proteinaceous in nature rather than as small molecules. There have also been suggestions
to make these using mRNA. Gene silencing may also have therapeutical applications.
There are over 1,000 public and commercial biological databases. These biological
databases usually contain genomics and proteomics data, but databases are also used in
taxonomy. The data are nucleotide sequences of genes or amino acid sequences of
proteins. Biological databases have become an important tool in assisting scientists to
understand and explain a host of biological phenomena from the structure of
biomolecules and their interaction, to the whole metabolism of organisms and to
understanding the evolution of species. This knowledge helps facilitate the fight against
diseases, assists in the development of medications and in discovering basic relationships
amongst species in the history of life.
By far the most important resource for biological databases is a special (yearly) issue of
the journal "Nucleic Acids Research" (NAR). The Database Issue is freely available, and
categorizes all the publicly available online databases related to computational biology
(or bioinformatics).
The term "sequence analysis" in biology implies subjecting a DNA or peptide sequence
to sequence alignment, sequence databases, repeated sequence searches, or other
bioinformatics methods on a computer.
In biology, phylogenetics (Greek: phylon = tribe, race and genetikos = relative to birth,
from genesis = birth) is the study of evolutionary relatedness among various groups of
organisms (e.g., species, populations). Also known as phylogenetic systematics,
phylogenetics treats a species as a group of lineage-connected individuals over time.
Phylogenetic taxonomy, which is an offshoot of, but not a logical consequence of,
phylogenetic systematics, constitutes a means of classifying groups of organisms
according to degree of evolutionary relatedness.
Phylogeny (or phylogenesis) is the origin and evolution of a set of organisms, usually a
set of species. A major task of systematics is to determine the ancestral relationships
among known species (both living and extinct). The most commonly used methods to
infer phylogenies include parsimony, maximum likelihood, and MCMC-based Bayesian
inference. Distance-based methods construct trees based on overall similarity which is
often assumed to approximate phylogenetic relationships. All methods depend upon an
implicit or explicit mathematical model describing the evolution of characters observed
in the species included, and are usually used for molecular phylogeny where the
characters are aligned nucleotide or amino acid sequences.
Molecular dynamics (MD) is a form of computer simulation where atoms and molecules
are allowed to interact for a period of time under known laws of physics. Because in
general molecular systems consist of a large number of particles, it is impossible to find
the properties of such complex systems analytically. MD simulation circumvents this
problem by using numerical methods. It represents an interface between laboratory
experiments and theory and can be understood as a virtual experiment.
Even though we know matter consists of interacting particles in motion at least since
Boltzmann in the 19th Century, many still think of molecules as rigid museum models.
Richard Feynman said in 1963 that "everything that living things do can be understood in
terms of the jiggling and wiggling of atoms." [1] One of MD's key contributions is creating
awareness that molecules like proteins and DNA are machines in motion. [2] MD probes
the relationship between molecular structure, movement and function.
Molecular dynamics is a multidisciplinary field. Its laws and theories stem from
mathematics, physics and chemistry. MD employs algorithms from computer science and
information theory. It was originally conceived within theoretical physics in the 1950's,
but is applied today mostly in materials science and biomolecules.
• Goals
– Creation of methods for manipulating structural data
– Application of these methods to solving problems in biology and discovery of new
patterns.