Académique Documents
Professionnel Documents
Culture Documents
CHEMOINFORMATICS
HISTORY OF CHEMOINFORMATICS
Cheminformatics (also known as chemoinformatics, chemioinformatics and
chemical informatics) is the use of computer and informational techniques
applied to a range of problems in the field of chemistry. Chem(o)informatics is a
generic term that encompasses the design, creation, organisation, storage,
management, retrieval, analysis, dissemination, visualisation and use of
chemical information, not only in its own right, but as a surrogate or index for
other data, information and knowledge.
The word chemoinformatics is rather new, but papers that fall under this field
date back to the mid-1960s, where structureactivity relationships (SAR) were
proposed based on the work of Hansch and Fujita (1964) and Fujita et al. (1964)
and the earlier work of Hammett and Taft. The first textbooks on chemoinformics
have been published recently (Leach and Gillet, 2003; Gasteiger and Engel, 2003;
Gasteiger, 2003; Bajorath, 2004).
The related field bioinformatics was fully established in the 1990s, and has
become an integrated activity in most major pharmaceutical companies. The basis
behind the success of bioinformatics was the access to a vast amount of
experimental data, together with the structured nature of genetic information.
The developments in information technology in the last decades of the 20th
century have fundamentally changed the way in which scientific information is
being communicated and used. A scientific discipline where the impact of these
changes has been particularly significant is (bio)chemistry. Up to less than 25
years ago, molecular modeling was a hardly-existent computational chemistry
niche, only practiced at those few institutes that could afford the very expensive
specialised hardware. Also rapid access to not only the primary literature but,
possibly even more importantly, to the factual primary data about millions of
chemical compounds, to reactions, structures, and spectra, and to the genomic
data of various organisms including humans, can only be provided by digital
storage and retrieval techniques.
The roots of what we now call cheminformatics began very early in the history of
computing: 1950's for statistical models, 1960's for first computer
representations, mainly by curious chemists. However the term
"cheminformatics" wasn't adopted until the early 1990's (the spelling of this cheminformatics or chemoinformatics - is still in dispute). The bulk of the
foundational work was done in the 70's and 80's, and was strongly supported by
the pharmaceutical industry and the need for computational drug discovery
research.
The term chemoinformatics was defined by F.K. Brown in 1998. With the advent
of computers and the ability to store and retrieve chemical information, serious
efforts to compile relevant databases and construct information retrieval systems
began. One of the first efforts to have substantial long term impact was to collect
crystal structure information for small molecules by Olga Kennard.
Chemoinformatics is the mixing of those information resources to transform data
into information and information into knowledge for the intended purpose of
making better decisions faster in the area of drug lead identification and
optimization. Since then, both spellings have been used, and some have evolved
to be established as Cheminformatics, while European Academia settled in 2006
for Chemoinformatics.
The first, and still the core, journal for the subject, The Journal of Chemical
Documentation, started in 1961( the name changed to The Journal of
Information and Computer Science in 1975)
The first book appeared in 1971 (Lynch, Harrison, Town and Ash, Computer
Handling of Chemical Structure Information)
CURRENT STATUS OF
CHEMOINFORMATICS
In recent years, there has been an explosion in the availability of publicly
accessible chemical information, including chemical structures of small molecules,
structure-derived properties and associated biological activities in a variety of
assays. These data sources present us with a significant opportunity to develop
and apply computational tools to extract and understand the underlying
structure-activity relationships. Furthermore, by integrating chemical data
sources with biological information (protein structure, gene expression and so
on), we can attempt to build up a holistic view of the effects of small molecules in
biological systems. Equally important is the ability for non-experts to access and
utilize state of the art cheminformatics method and models.
The chemoinformatics field continues to evolve at the interface between
computer science and chemistry. Chemical information and computational
approaches in pharmaceutical research are major focal points of
chemoinformatics. However, the boundaries of this discipline are rather fluid and
the chemoinformatics spectrum is difficult to delineate.
In the area of methodology development, recent work on characterizing
structure-activity landscapes, Quantitative Structure Activity Relationship (QSAR)
model domain applicability and the use of chemical similarity in text mining has
been done. In the area of infrastructure, a distributed web services framework
that allows easy deployment and uniform access to computational (statistics,
cheminformatics and computational chemistry) methods, data and modelshas
been done. The development of PubChem derived databases and highlight
techniques allow us to scale the infrastructure to extremely large compound
collections, by use of distributed processing on Grids.
SCOPE OF CHEMOINFORMATICS
APPLICATIONS OF CHEMOINFORMATICS
Storage and retrieval
The primary application of cheminformatics is in the storage, indexing and search
of information relating to compounds. The efficient search of such stored
information includes topics that are dealt with in computer science as data
trend in bioinformatics five to ten years ago. It should be mentioned that this
graph is based on one database only, PubMed, and is intended to give an idea
about the development in publishing frequency in these areas, and not as a
complete overview.
CONCLUSION
Chemoinformatics is a rapidly growing field, with a huge application potential.
Chemoinformatics concerns the gathering and systematic use of chemical
information, and the use of those data to predict the behavior of unknown
compounds in silico.