Académique Documents
Professionnel Documents
Culture Documents
AN OVERVIEW OF BIOINFORMATICS
I. TEXT 1. Introduction
Biology is in the middle of a major paradigm shift driven by computing technology. Although it is already an informational science in many respects, the field has been rapidly becoming much more computational and analytical. Rapid progress in genetics and biochemistry research combined with the tools provided by modern biotechnology has generated massive volumes of genetic and protein sequence data. Bioinformatics has been defined as a means for analysing, comparing, graphically displaying, modeling, storing, systemising, searching, and ultimately distributing biological information, which includes sequences, structures, function, and phylogeny. Thus bioinformatics may be defined as a discipline that generates computational tools, databases, and methods to support genomic and postgenomic research. It comprises the study of DNA structure and function, gene and protein expression, protein production, structure and function, genetic regulatory systems, and clinical applications. Bioinformatics needs the expertise from Computer Science, Mathematics, Statistics, Medicine, and Biology.
global information sets, for example, transcriptomes or proteomes change as systems are perturbed. The genomes of the model organisms yeast, worm, fly etc., have demonstrated the fundamental conservation among all living organisms of the basic informational pathways. Hence systems can be perturbed in model organisms to gain insight into their functioning, and these data will provide fundamental insights into human biology. From the genome, the information pathways and networks can be extracted to begin understanding their logic of life. Further more, different genomes can be compared to identify similarities and differences in the strategies for the logic of life and these provide fundamental insights into development, physiology and evolution. The first eukaryotic genome that has been fully sequenced and annotated is Saccharomyces cerevisiae. This highly helps to develop biological and computational tools for genomic and postgenomic research. In the era of automated DNA sequencing and revolutionary advances in DNA sequence analysis, the attention of many researchers is now shifting away from the study of single genes or small gene clusters to whole genome analyses. Knowing the complete sequence of a genome is only the first step in understanding how the myriad of information contained within the genes is transcribed and ultimately translated into functional proteins. In the post genomic era, functional genomic and proteomic studies helps to obtain an image of the dynamic cell.
7. System Biology
Biology is a highly informational science. There are mainly two types of biological information.
The information of genes or proteins, which are the molecular machines of life The information of the regularity networks that coordinate and specify the expression patterns of the genes and proteins.
All biological information is hierarchical. Initially DNA will change over to mRNA, which in turn goes to protein. Proteins enacts protein interactions, which creates some informational pathways. These pathways form informational networks, which in turn become cells. Now cells forms networks of cells. Finally an individual is a collection of cells. A host of individuals forms population and a variety of populations becomes ecologies. This evolution brings a primary challenge for researchers and scientists to create tools and mechanisms to capture and integrate these different levels of biological information and integrate it towards gaining insight of their curious functionings. All of these paradigm shift lead to the view that the major challenges for biology and medicine in this new century will be the study of complex systems and the approach necessary for studying these biological complexities. Here comes a viable approach.
i. ii. iii.
iv. v.
Identify all elements, such as sequence of genomes in the system with currently available discovery tools Use current knowledge of the sytem to formulate a model predicting its behavior Perturb the system in a model organism using biological, genetic or environmental perturbations, capture information at all relevant levels, such as DNA, mRNA, protein, protein interactions, etc. and integrate the collected information Compare theoretical predictions and experimental data, carry out additional perturbations to bring theory and experiment into closer apposition, integrate new data into model, Iterate steps iii) and iv) till the mathematical model can predict the structure of the system and its systems or emergent properties given particular perturbations.
The Integration of technology, biology, and computation. The integration of the various levels of biological information and the modeling . The proper annotation of biological information and its its storage and integration in databases. The inclusion of other molecules, large and small, in the systems approach. The integration imperatives of systems biology presents many challenges to industry and academia.
9. Conclusion
With the confluence of biology and computer science, the computer applications of molecular biology are drawing a greater attention among the life science researchers and scientists these days. As it becomes imperative for biologists to seek the help of information technology professionals to accomplish the ever growing computational requirements of a host of exciting and needy biological problems, the synergy between modern biology and computer science is to blossom in the days to come. Thus the research scope for all the mathematical techniques and algorithms coupled with software programming languages, software development and deployment tools are to get a real boost. In addition, information technologies such as databases, middleware, graphical user interface (GUI) design, distributed object computing, storage area networks (SAN), data compression, network and communication and remote management are all set to play a very critical role in taking forward the goals for which the bioinformatics field came into existence.
NCBI Home
Established in 1988 as a national resource for molecular biology information, NCBI creates public databases, conducts research in computational biology, develops software tools for analyzing genome data, and disseminates
biomedical information - all for the better understanding of molecular processes affecting human health and disease.
MaizeDB
The goals of this project are to provide a central repository for public maize information and present it in a way that creates intuitive biological connections for the researcher with minimal effort as well as provide a series of computational tools that directly address the questions of the biologist in an easy-to-use form.
Barley Genomics
AREAS Of RESEARCH: Barley Genome Mapping , Map-Based Cloning, Molecular Breeding, Mutant Isolation & Characterization, Functional Genomics, BAC Address Calculator, Developmental Mutants.
PathDB Pathways
PathDB is a beta level research tool for scientists interested in analyzing their experimental or computational data in the context of biological pathways and networks.
proteins involved in nucleic acid metabolism and/or interactions. It enables the rapid publication of papers under the following categories: chemistry, computational biology, genomics, molecular biology, RNA and structural biology. A Survey and Summary section provides a format for brief reviews. The first issue of each year is devoted to biological databases, and an issue in July is devoted to papers describing web-based software resources of value to the biological community.
II. Vocabulary
Accomplish ['kmpli] v. Analyse ['nlaiz] v. = analyze Analytical [,n'litikl] adj.
= Analytic
Hon thnh, lm xong Phn tch (C tnh/ thuc) phn tch Ch gii, ch thch ng dng, trnh ng dng Th nghim, th nghim, phn tch Thuc th mc, chng mc Thch thc, th thch, nhim v kh khn (Thuc hay c lin quan n) khm hay iu tr bnh, lm sng Cng ngun gc, h hng gn, cng bn cht So snh C s dng my tnh, thuc v khoa hc my tnh Phi hp, sp xp Ch yu, ct yu, quyt nh C s d liu C nhim v, bao gm vic Chng minh, gii thch Trin khai, dn trn S thit k, v Ph bin, gieo rc Thuc con s, k thut s Ngnh kin thc, mn hc, quy tc Hin th, ph by ra, s hin th Phn phi, phn b, phn loi Ni bt, r nt S xc nhn, xc nh Cp thit, cp bch Thc thi, thc hin y S suy ra, kt lun S i mi, s cch tn S thu hiu, hiu bit su sc Kt hp, ho hp, hp nht Gi thuyt, l thuyt S thnh tho, s tinh thng Trch, trch xut, chit ra 8
Annotate ['nouteit] v. Application [,pli'kein] n. Assay ['sei] n. v. Bibliographic [,bibli'grfik] adj. Challenge ['tlind] n. Clinical ['klinikl] adj. Cognate ['kgneit] adj. Compare [km'pe] v. Computational [kmpju:'teinl] adj. Co-ordinate [kou':dineit] v. Crucial ['kru:l] adj. Database ['deitbeiz] n Deal (with) v. Demonstrate ['demnstreit] v. Deploy [di'pli] v. Designing [di'zaini] n. Disseminate [di'semineit] v. Digital ['diditl] adj. Discipline ['disiplin] n. Display [dis'plei] v., n. Distribute [dis'tribju:t] v. Emergent [i'm:dnt] Identification [ai,dentifi'kein] n. Imperatives [im'pertiv] Adj. Implement ['implimnt] v. Inference ['infrns] n. Innovation [,inou'vein] n. Insight ['insait] n. Integrate ['intigreit] v. Hypothesis [hai'pisis] n. Expertise [,eksp'ti:z] n. Extract ['ekstrkt - iks'trkt] v.
Framework ['freimw:k] n. Mandatory ['mndtri] adj. Manually ['mnjulli] adv. Middleware ['midl'wer] n. Model ['mdl] v., n. Monitor ['mnit] v. Network ['netw:k] Paradigm ['prdaim] n. Perturb [p't:b] v. Phylogeny [fai'ldni] n.
=phylogenesis [,fail'dineisis]
Khun kh; c cu t chc, ct li C tnh bt buc (Lm g ) bng tay, th cng Middleware is a computer software that connects software components or applications. Phn kt ni trung gian. M hnh ho, m hnh, kiu mu Gim st, ch huy Mng li, h thng H c s l thuyt (ca mt mn khoa hc), nn tng Lm xo trn, lm ri tung ln S pht sinh loi, chng loi pht sinh Tin on, d bo Xc nh s lng, nh lng (c tnh) iu khin, iu ho Ly li, khi phc li, gi ra (thng tin c lu tr) Tm kim, s tm kim nh r, ch r Thng k hc S d tr, ct tr, kho H thng ho, sp xp theo h thng Nhim v, ngha v Thuc/c tnh l thuyt Sn lng, nng sut Tnh bin thin, tnh hay thay i
Predict [pri'dikt] v. Quantitation ['kwntitn] Regulatory [regju:'leitri] adj. = regulator [regju:'leit] Retrieve [ri'tri:v] v. Search [s:t] v., n. Specify ['spesifai] v. Statistics [st'tistiks] n. Storing [st:] n. = store, repository Systemize ['sistmaiz] v.
= systematize ['sistmtaiz]
Task [t:sk] n. Theoretical [,i'retikl] adj. Throughput ['ru:put] n. = Output or production Variability [,veri'bilti] n.
Compounding Sentences
A compound sentence consists of two or more independent clauses. That means that there are at least two units of thought within the sentence, either one of which can stand by itself as its own sentence. The clauses of a compound sentence are either separated by a semicolon (relatively rare) or connected by a coordinating conjunction (which is, more often than not, preceded by a comma). And the two most common coordinating conjunctions are and and but. (The others are or, for, yet, and so.) This is the simplest technique we have for combining ideas:
Meriwether Lewis is justly famous for his expedition into the territory of the Louisiana Purchase and beyond, but few people know of his contributions to natural science. Lewis had been well trained by scientists in Philadelphia prior to his expedition, and he was a curious man by nature. Notice that the and does little more than link one idea to another; the but also links, but it does more work in terms of establishing an interesting relationship between ideas. The and is part of the immediate language arsenal of children and of dreams: one thing simply comes after another and the logical relationship between the ideas is not always evident or important. The word but (and the other coordinators) is at a slightly higher level of argument.
10
(Please review the rules of comma usage when you combine two independent clauses with a coordinating conjunction.)
Objects: When the subject(s) is/are acting upon two or more things in parallel, the objects can be combined.
President Jefferson believed that the headwaters of the Missouri reached all the way to the Canadian border. He also believed that meant he could claim all that land for the United States. President Jefferson believed that the headwaters of the Missouri might reach all the way to the Canadian border and that he could claim all that land for the United States. Notice that the objects must be parallel in construction: Jefferson believed that this was true and that was true. If the objects are not parallel (Jefferson was convinced of two things: that the Missouri reached all the way to the Canadian border and wanted to begin the expedition during his term in office.) the sentence can go awry. (Please review the principles of parallelism.) Verbs and verbals: When the subject(s) is/are doing two things at once, ideas can sometimes be combined by compounding verbs and verb forms.
He studied the biological and natural sciences. He learned how to categorize and draw animals accurately. He studied the biological and natural sciences and learned how to categorize and draw animals accurately. Notice that there is no comma preceding the "and learned" connecting the compounded elements above.
11
In Philadelphia, Lewis learned to chart the movement of the stars. He also learned to analyze their movements with mathematical precision. In Philadelphia, Lewis learned to chart and analyze the movement of the stars with mathematical precision. OR In Philadelphia, Lewis learned to chart the stars and analyze their movements with mathematical precision. (Notice in this second version that we don't have to repeat the "to" of the infinitive to maintain parallel form.) Modifiers: Whenever it is appropriate, modifiers such as prepositional phrases can be compounded.
Lewis and Clark recruited some of their adventurers from river-town bars. They also used recruits from various military outposts. Lewis and Clark recruited their adventurers from river-town bars and various military outposts. Notice that we do not need to repeat the preposition from to make the ideas successfully parallel in form.
William Clark was not officially granted the rank of captain prior to the expedition's departure. Captain Lewis more or less ignored this technicality and treated Clark as his equal in authority and rank. Although William Clark was not officially granted the rank of captain prior to the expedition's departure, Captain Lewis more or less ignored this technicality and treated Clark as his equal in authority and rank.
The explorers approached the headwaters of the Missouri. They discovered, to their horror, that the Rocky Mountain range stood between them and their goal, a passage to the Pacific. As the explorers approached the headwaters of the Missouri, they discovered, to their horror, that the Rocky Mountain range stood between them and their goal, a passage to the Pacific.
12
When we use subordination of clauses to combine ideas, the rules of punctuation are very important. It might be a good idea to review the definition of clauses at this point and the uses of the comma in setting off introductory and parenthetical elements.
Notice that in the second sentence, above, Sacagawea's name is a parenthetical element (structurally, the sentence adequately identifies her as "a pregnant, fifteen-yearold Indian woman"), and thus her name is set off by commas; Charbonneau's name, however, is essential to the meaning of the sentence (otherwise, which fur-trader are we talking about?) and is not set off by a pair of commas.
Captain Lewis allowed his men to make important decisions in a democratic manner. This democratic attitude fostered a spirit of togetherness and commitment on the part of Lewis's fellow explorers. Allowing his men to make important decisions in a democratic manner, Lewis fostered a spirit of togetherness and commitment among his fellow explorers. In the sentence above, the participial phrase modifies the subject of the sentence, Lewis. Phrases like this are usually set off from the rest of the sentence with a comma.
The expeditionary force was completely out of touch with their families for over two years. They put their faith entirely in Lewis and Clark's leadership. They never once rebelled against their authority.
13
Completely out of touch with their families for over two years, the men of the expedition put their faith in Lewis and Clark's leadership and never once rebelled against their authority.
Lewis's fame and fortune was virtually guaranteed by his exploits. Lewis disappointed the entire world by inexplicably failing to publish his journals. His fame and fortune virtually guaranteed by his exploits, Lewis disappointed the entire world by inexplicably failing to publish his journals.
Lewis's long journey was finally completed. His men in the Corps of Discovery were dispersed. Lewis died a few years later on his way back to Washington, D.C., completely alone. His long journey completed and his men in the Corps of Discovery dispersed, Lewis died a few years later on his way back to Washington, D.C., completely alone.
14
V. PRACTICE
Combining sentences into one: 1. Over the past few decades, major advances in the field of molecular biology have led to an explosive growth in the biological information. The advances in genomic technologies have also contributed to the explosive growth in biological information. The explosive growth in the biological information have been generated by scientific community.
2. A biological database is a large body of persiatent data. A biological database is a organized body of persistent data. A biological database is usually associated with computerized software. The software is designed to update and query components of the data stored within the system. The software is also designed to retrieve components of the data stored within the system.
3. A simple data might be a single file The file contains many records of information. Each of the records includes the same set of information.
4. Molecular modeling may not be as accurate at determining a proteins structure as experimental methods. Molecular modeling is still extremely helpful in proposing and testing various biological hypotheses.
15