Académique Documents
Professionnel Documents
Culture Documents
3
© The Author 2013. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of DOI: 10.1093/aje/kws449
Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com. Advance Access publication:
January 7, 2013
Commentary
Initially submitted August 14, 2012; accepted for publication November 8, 2012.
The first phase of the Human Microbiome Project communities of bacteria, archaea, viruses, and fungi inter-
(HMP) was recently completed; the results of this compre- act directly and indirectly with the host, shaping the
hensive analysis of microbial life existing in and on the immune system, developing tissues, and resisting or en-
healthy human body appeared in June 2012 issues of the hancing invasion by pathogens. The gut microbiota are as
journals Nature (1–4), Science (5–14), and PLoS (15–24). metabolically active as our livers, making them effectively
These studies demonstrated that 1) the composition and another organ (28). Gut microbiota modulate lipid metabo-
function of human microbiota contribute to metabolic, reg- lism and glucose homeostasis, absorb dietary fats and lipid-
ulatory, and morphogenetic networks in the human host, soluble vitamins, and detoxify xenobiotics among other
making humans a “superorganism” that relies on human functions. Gut microbiota can activate (or inactivate) drugs,
and extrahuman genomes for functioning (25); 2) the com- create a toxic by-product from a drug or food, or alter host
position of microbiota is essentially unique to the individu- drug metabolism by changing host gene expression (13).
al; and 3) the functions of microbiota are conserved across There are hints that the composition of gut microbiota can
individuals. This strongly suggests that characterizing mediate the effectiveness of oral vaccines (29) and modify
human microbiota structure and function in well-designed risk of chronic diseases such as asthma (30).
epidemiologic studies will give new insight into disease eti- All microbiota are known to resist pathogen invasion,
ology, help to explain variability in host susceptibility, and modulate pH and other functions, and respond rapidly to
be a source for novel diagnostic and prognostic markers. changes in their environment. Therefore, changes in the
On the human body, there are 10 bacterial cells for every types or relative amounts of bacteria present or changes in
human cell and an estimated 10 viral particles for every their functional output might be used as markers of expo-
bacterial cell (26). Microbes are not limited to the skin, sure to environmental hazards or of disease risk. In a rat
mouth, and gastrointestinal tract. Instead, every body model, the composition of intestinal microbiota changes
surface that has a portal to the external environment is following radiation, making it potentially a more sensitive
colonized by microbes, including the lung (27). These marker of biological dose (31). Gut microbial profiles are
Am J Epidemiol. 2013;177(3):197–201
Implications of the Human Microbiome Project 199
(metagenomes); the functional potential of these microor- example, the read length generated by the Roche 454 plat-
ganisms (metatranscriptomics); the actual functions being forms (Roche Diagnostics Corporation, Indianapolis,
performed (metaproteomics); and what cellular by-products Indiana) is a lengthy 400–700 base pairs, yet their error
were produced (metabolomes) (Web Figure 1 available at rate, mostly because of insertions and deletions, is approxi-
http://aje.oxfordjournals.org/). Currently, the primary method mately 1%. Conversely, the read length generated by the
for characterizing microbiota uses metagenomics, which we Illumina HiSeq platforms (Illumina, Inc., San Diego, Cali-
briefly describe. For more in-depth discussions of these fornia) is around 200–300 base pairs, and their error rate,
techniques, including metagenomics, the reader is referred mostly because of substitutions, is just over 0.1% (44).
to several excellent recent reviews (40–43). For metage- Analysis of the resulting sequences involves filtering
nomics, the most common technique uses polymerase reads to remove artifacts introduced by polymerase chain re-
chain reaction followed by sequencing of a segment of the action and sequencing errors, sequence assembly (if applicable),
ribosomal gene that is present in all cells (16S RNA for sequence alignment, taxonomic classification, functional as-
bacteria and 18S RNA for fungi). The advantage of using signment of reads and identification of metabolic pathways
segments of the ribosomal gene is that there are several da- (if applicable), and use of phylogenetic and/or taxonomic
tabases available to which the sequences can be mapped. binning approaches for microbial community characteriza-
This makes it possible to determine which phylum, genus, tion. Several analytical tools, mostly open source and each
or species is present, but the resolution is dependent on the with its advantages and disadvantages, have been and con-
segment(s) amplified. It is also possible to sequence all tinue to be developed to address the challenges of handling
DNA present by using a comprehensive shotgun metagenom- very large, multidimensional data (45). Although demand-
ic approach, whereby DNA from the collective genome of ing the novice user to overcome a learning curve in terms of
microorganisms is sheared and sequenced in small frag- ecological theory and familiarity with command line inter-
ments. This approach enables detection of known genes face, most include an integrated pipeline for comprehensive
and thus determination of the functional potential. In either analysis that is not insurmountable to the first-time user.
case, sample preparation upon DNA extraction is required, However, if preferred, most large research institutions, espe-
involving reduction of sample contaminants (e.g., removal cially those having sequencing core centers, are now start-
of host DNA), amplification, attachment of barcodes, and ing to provide bioinformatics support; this partnership
so on. For whole-genome metagenomic sequencing, further should be fostered.
processing involves fragmentation of DNA and construc- The resulting data structure (e.g., relative abundance
tion of paired-end libraries. Several sequencing platforms of taxa per sample) and some of the analytical software
exist based on different technologies (e.g., capillary, pyro- may be unfamiliar, but to epidemiologists—who are used
sequencing, integrated semiconductors), each bringing its to large data sets that integrate multiple different types
own level of expected throughput, bias, and error rate. For of measures—this should be a tractable problem. As in
Am J Epidemiol. 2013;177(3):197–201
200 Foxman and Rosenthal
genetic studies, there are multiple variables; analytical susceptibility to disease, tolerance to environmental
techniques include ordination techniques (e.g., principal hazards, and response to treatment? Thinking of microbes
component analysis), cluster analysis, factor analysis, regres- as self provides another lever to pry open the proverbial
sion-based techniques, and analysis-of-variance–based tech- black box.
niques comparing microbial community matrices to one another.
As a starting point, readers are directed to the Human Mi-
crobiome Project website, that includes the software, online
resources, and standard operating protocols used as part of the ACKNOWLEDGMENTS
Human Microbiome Project (http://www.hmpdacc.org/tools_
protocols/tools_protocols.php). Author affiliation: Center for Molecular and Clinical Epi-
Adding microbiota assessments to ongoing epidemiologic demiology of Infectious Diseases, Department of Epidemi-
studies where samples are already collected only marginally ology, University of Michigan School of Public Health,
increases the costs of sample processing. For metagenomics and Ann Arbor, Michigan (Betsy Foxman, Mariana Rosenthal).
metatranscriptomics, the costs of DNA and RNA extraction The authors contributed equally to the work.
and preparation for sequencing are approximately $100 per This work was supported by the Interdisciplinary Train-
sample, but as high-throughput procedures are implement- ing Program in Infectious Diseases (T32 AI049816 (M.R.)
ed, these costs are falling rapidly. Once samples are pre- and R01 DE014899 (B.F.)).
We thank Dr. Sara Adar for her helpful comments on an
Am J Epidemiol. 2013;177(3):197–201
Implications of the Human Microbiome Project 201
16. Flores R, Shi J, Gail MH, et al. Association of fecal microbial 33. Farrell JJ, Zhang L, Zhou H, et al. Variations of oral
diversity and taxonomy with selected enzymatic functions. microbiota are associated with pancreatic diseases including
PLoS One. 2012;7(6):e39745. pancreatic cancer. Gut. 2012;61(4):582–588.
17. Jumpstart Consortium Human Microbiome Project Data 34. Landy J, Al-Hassi HO, McLaughlin SD, et al. Review article:
Generation Working Group. Evaluation of 16S rDNA-based faecal transplantation therapy for gastrointestinal disease.
community profiling for human microbiome research. PLoS Aliment Pharmacol Ther. 2011;34(4):409–415.
One. 2012;7(6):e39315. 35. Rosenthal M, Goldberg D, Aiello A, et al. Skin microbiota:
18. Aagaard K, Riehle K, Ma J, et al. A metagenomic approach microbial community structure and its potential association
to characterization of the vaginal microbiome signature in with health and disease. Infect Genet Evol. 2011;11(5):
pregnancy. PLoS One. 2012;7(6):e36466. 839–848.
19. Wylie KM, Truty RM, Sharpton TJ, et al. Novel bacterial 36. Gordon JI. Honor thy gut symbionts redux. Science.
taxa in the human microbiome. PLoS One. 2012;7(6):e35294. 2012;336(6086):1251–1253.
20. Li K, Bihan M, Yooseph S, et al. Analyses of the microbial 37. Srinivasan U, Ponnaluri S, Villareal L, et al. Gram stains: a
diversity across the human microbiome. PLoS One. 2012; resource for retrospective analysis of bacterial pathogens in
7(6):e32118. clinical studies. PLoS One. 2012;7(10):e42898.
21. Cantarel BL, Lombard V, Henrissat B. Complex carbohydrate 38. Claesson MJ, Wang Q, O’Sullivan O, et al. Comparison of
utilization by the healthy human microbiome. PLoS One. two next-generation sequencing technologies for resolving
2012;7(6):e28742. highly complex microbiota composition using tandem
22. Liu B, Faller LL, Klitgord N, et al. Deep sequencing of the variable 16S rRNA gene regions. Nucleic Acids Res. 2010;
Am J Epidemiol. 2013;177(3):197–201