Nature 10242

ARTICLE
doi:10.1038/nature10242
An integrated semiconductor device enabling non-optical genome sequencing

Jonathan M. Rothberg1, Wolfgang Hinz1, Todd M. Rearick1, Jonathan Schultz1, William Mileski1, Mel Davey1, John H. Leamon1, Kim Johnson1, Mark J. Milgrew1, Matthew Edwards1, Jeremy Hoon1, Jan F. Simons1, David Marran1, Jason W. Myers1, John F. Davidson1, Annika Branting1, John R. Nobile1, Bernard P. Puc1, David Light1, Travis A. Clark1, Martin Huber1, Jeffrey T. Branciforte1, Isaac B. Stoner1, Simon E. Cawley1, Michael Lyons1, Yutao Fu1, Nils Homer1, Marina Sedova1, Xin Miao1, Brian Reed1, Jeffrey Sabina1, Erika Feierstein1, Michelle Schorn1, Mohammad Alanjary1, Eileen Dimalanta1, Devin Dressman1, Rachel Kasinskas1, Tanya Sokolsky1, Jacqueline A. Fidanza1, Eugeni Namsaraev1, Kevin J. McKernan1, Alan Williams1, G. Thomas Roth1 & James Bustillo1
The seminal importance of DNA sequencing to the life sciences, biotechnology and medicine has driven the search for more scalable and lower-cost solutions. Here we describe a DNA sequencing technology in which scalable, low-cost semiconductor manufacturing techniques are used to make an integrated circuit able to directly perform non-optical DNA sequencing of genomes. Sequence data are obtained by directly sensing the ions produced by template-directed DNA polymerase synthesis using all-natural nucleotides on this massively parallel semiconductor-sensing device or ion chip. The ion chip contains ion-sensitive, field-effect transistor-based sensors in perfect register with 1.2 million wells, which provide confinement and allow parallel, simultaneous detection of independent sequencing reactions. Use of the most widely used technology for constructing integrated circuits, the complementary metal-oxide semiconductor (CMOS) process, allows for low-cost, large-scale production and scaling of the device to higher densities and larger array sizes. We show the performance of the system by sequencing three bacterial genomes, its robustness and scalability by producing ion chips with up to 10 times as many sensors and sequencing a human genome.
DNA sequencing and, more recently, massively parallel DNA sequencing14 has had a profound impact on research and medicine. The reductions in cost and time for generating DNA sequence have resulted in a range of new sequencing applications in cancer5,6, human genetics7, infectious diseases8 and the study of personal genomes911, as well as in fields as diverse as ecology12,13 and the study of ancient DNA14,15. Although de novo sequencing costs have dropped substantially, there is a desire to continue to drop the cost of sequencing at an exponential rate consistent with the semiconductor industrys Moores Law16 as well as to provide lower cost, faster and more portable devices. This has been operationalized by the desire to reach the $1,000 genome17. To date, DNA sequencing has been limited by its requirement for imaging technology, electromagnetic intermediates (either X-rays18, or light19) and specialized nucleotides or other reagents20. To overcome these limitations and further democratize the practice of sequencing, a paradigm shift based on non-optical sequencing on newly developed integrated circuits was pursued. Owing to its scalability and its low power requirement, CMOS processes are dominant in modern integrated circuit manufacturing21. The ubiquitous nature of computers, digital cameras and mobile phones has been made possible by the low-cost production of integrated circuits in CMOS. Leveraging advances in the imaging fieldwhich has produced large, fast arrays for photonic imaging22we sought a suitable electronic sensor for the construction of an integrated circuit to detect the hydrogen ions that would be released by DNA polymerase23 during sequencing by synthesis, as opposed to a sensor designed for the detection of photons. Although a variety of electrochemical detection methods have been studied24,25, the ion-sensitive field-effect transistor (ISFET)26,27 was most applicable to our chemistry and scaling requirements because of
1
its sensitivity to hydrogen ions, and its compatibility with CMOS processes2831. Previous attempts to detect both single-nucleotide polymorphisms (SNPs)32 and DNA synthesis33 as well as sequence DNA electronically34 have been made. However, none of them produced de novo DNA sequence, addressed the issue of delivering template DNA to the sensors, or scaled to large arrays. In addition, previous efforts in ISFETs were limited in the number of sensors per array, the yield of working independent sensors and readout speed35,36, and encountered difficulty in exposing the sensors to fluids while protecting the electronics37. Here, we overcome previous limitations with electronic detection and enable the production of chips with a large number of fast, uniform, working sensors. Our focus has been on the development of these ion chips, as well as the biochemical methods, supporting instrumentation and software needed to enable de novo DNA sequencing for applications requiring millions to billions of bases (Supplementary Fig. 1). A typical 2-h run using an ion chip with 1.2 M sensors generates approximately 25 million bases. The performance of the ion chips and overall sequencing platform is demonstrated through whole-genome sequencing of three bacterial genomes. The scalability of our chip architecture is demonstrated by producing chips with up to 10 times the number of sensors and producing a low-coverage sequence of the genome of Gordon Moore, author of Moores law16.
A CMOS integrated circuit for sequencing

We have developed a simple, scalable ISFET sensor architecture using electronic addressing common in modern CMOS imagers (Supplementary Fig. 2). Our integrated circuit consists of a large array of sensor elements, each with a single floating gate connected to an underlying ISFET (Fig. 1a). For sequence confinement we rely on a
Ion Torrent by Life Technologies, Suite 100, 246 Goose Lane, Guilford, Connecticut 06437, USA.
3 4 8 | N AT U R E | V O L 4 7 5 | 2 1 J U LY 2 0 1 1
2011 Macmillan Publishers Limited. All rights reserved
ARTICLE RESEARCH
a
dNTP Well dNTP + + H+ pH Row select register Sensor array
c
Selected cell
DNA template Bead Metal-oxide-sensing layer Sensor plate Floating metal gate Bulk Drain Source Silicon substrate
Q Column select register V To column receiver Output driver To reader board Column receivers and multiplexer
Figure 1 | Sensor, well and chip architecture. a, A simplified drawing of a well, a bead containing DNA template, and the underlying sensor and electronics. Protons (H1) are released when nucleotides (dNTP) are incorporated on the growing DNA strands, changing the pH of the well (DpH). This induces a change in surface potential of the metal-oxide-sensing layer, and a change in potential (DV) of the source terminal of the underlying field-effect
transistor. b, Electron micrograph showing alignment of the wells over the ISFET metal sensor plate and the underlying electronic layers. c, Sensors are arranged in a two-dimensional array. A row select register enables one row of sensors at a time, causing each sensor to drive its source voltage onto a column. A column select register selects one of the columns for output to external electronics.
3.5-mm-diameter well formed by adding a 3-mm-thick dielectric layer over the electronics and etching to the sensor plate (Fig. 1b). A tantalum oxide layer provides for proton sensitivity (58 mv pH21; ref. 38). High-speed addressing and readout are accomplished by the semiconductor electronics integrated with the sensor array (Fig. 1c). The sensor and underlying electronics provide a direct transduction from the incorporation event to an electronic signal. Unlike light-based sequencing technology, we do not use the elements of the array to collect photons and form a larger image to detect the incorporation of a base; instead we use each sensor to independently and directly monitor the hydrogen ions released during nucleotide incorporation. Ion chips are manufactured on wafers (Fig. 2a), cut into individual die (Fig. 2b) and robotically packaged with a disposable polycarbonate flow cell that isolates the fluids to regions above the sensor array and away from the supporting electronics to provide convenient sample loading as well as electrical and fluidic interfaces to the sequencing instrument (Fig. 2c). Chips were designed and fabricated with 1.5 M, 7.2 M and 13 M ISFETs (Supplementary Fig. 3). On the basis of the placement of the flow cell on the sensor array, 1.2 M, 6.1 M and 11 M
a b
wells and sensors are exposed to fluids, with 99.9% of the sensors sensitive to pH and usable for DNA sequencing (Supplementary Fig. 4). Increasing the numbers of sensors per chip was first achieved by increasing the die area, from 10.6 mm 3 10.9 mm to 17.5 mm 3 17.5 mm, and then by increasing the density of the sensors by reducing the number of transistors per sensor from three to two. Chip density is limited by the selection of the CMOS node and the number of transistors per sensing element. Using a 0.35 mm CMOS node the minimum spacing for a three-transistor sensor is 5.1 mm and for a two-transistor sensor it is 3.8 mm (Supplementary Fig. 5). To understand further the limits on density, we show that 1.3 mm wells are readily manufactured, can be aligned to sensors, enable the generation of high-quality sequence (Supplementary Fig. 6) and can, using a 110 nm node, be fabricated with a spacing as small as 1.68 mm (Supplementary Fig. 7).
Sequencing on a semiconductor device

The all-electronic detection system used by the ion chip simplifies and greatly reduces the cost of the sequencing instrument (Supplementary
c
Sensor interface and readout
Row select registers
Sensor array
Sensor interface and readout
Figure 2 | Wafer, die and chip packaging. a, Fabricated CMOS 899 wafer containing approximately 200 individual functional ion sensor die. b, Unpackaged die, after automated dicing of wafer, with functional regions
indicated. c, Die in ceramic package wire bonded for electrical connection, shown with moulded fluidic lid to allow addition of sequencing reagents.
2 1 J U LY 2 0 1 1 | V O L 4 7 5 | N AT U R E | 3 4 9
RESEARCH ARTICLE
Fig. 8). The instrument has no optical components, and is comprised primarily of an electronic reader board to interface with the chip, a microprocessor for signal processing, and a fluidics system to control the flow of reagents over the chip (Supplementary Fig. 9). Genomic DNA is prepared for sequencing as described in Supplementary Methods. Briefly, DNA is fragmented, ligated to adapters, and adaptor-ligated libraries are clonally amplified onto beads. Template-bearing beads are enriched through a magneticbead-based process. Sequencing primers and DNA polymerase are then bound to the templates and pipetted into the chips loading port. Individual beads are loaded into individual sensor wells by spinning the chip in a desktop centrifuge. A 2 mm acrylamide bead was chosen to deliver sufficient copies of the template to the sensor well to achieve a high signal-to-noise ratio (SNR) (800 K copies, SNR, 10; Supplementary Methods and Supplementary Fig. 10), while well depth was selected to allow only a single bead to occupy a well. In ion sequencing, all four nucleotides are provided in a stepwise fashion during an automated run (Supplementary Methods). When the nucleotide in the flow is complementary to the template base directly downstream of the sequencing primer, the nucleotide is incorporated into the nascent strand by the bound polymerase. This increases the length of the sequencing primer by one base (or more, if a homopolymer stretch is directly downstream of the primer) and results in the hydrolysis of the incoming nucleotide triphosphate, which causes the net liberation of a single proton for each nucleotide incorporated during that flow. The release of the proton produces a shift in the pH of the surrounding solution proportional to the number of nucleotides incorporated in the flow (0.02 pH units per single base incorporation). This is detected by the sensor on the bottom of each well, converted to a voltage and digitized by off-chip electronics (Fig. 3). The signal generation and detection occurs over 4 s (Fig. 3b). After the flow of each nucleotide, a wash is used to ensure nucleotides do not remain in the well. The small size of the wells allows diffusion into and out of the well on the order of a one-tenth of a second and eliminates the need for enzymatic removal of reagents1. applied and fit to the raw trace from each well and the incorporation signals are extracted. A base caller corrects the signals for phase and signal loss, normalizes to the key, and generates corrected base calls for each flow in each well to produce the sequencing reads (Fig. 3c and Supplementary Fig. 12). Next, each read is sequentially passed through two signal-based filters to exclude low-accuracy reads. The first filter measures the fraction of flows in which an incorporation event was measured. When this value is unusually large (greater than 60% of the first 60 flows) the read is not clonal. The second filter measures the extent to which the observed signal values match those predicted by the phasing model. When there is poor agreement (median absolute difference more than 0.06 over the first 60 flows) between the two, it corresponds to higher error rates. Lastly, per-base quality values are predicted using an adaptation of the Phred method39 that quantifies the concordance between the phasing model predictions and the observed signal. These ab initio scores track closely with post-alignment derived quality scores, and are used to trim back low-quality sequence from the 39 end of a read (Supplementary Fig. 13).
Sequencing bacterial genomes

Bacterial genome sequencing and signal processing was performed as described earlier. We succeeded in sequencing all three genomes fivefold to tenfold in individual runs using the small ion chip, covering 96.80% to 99.99% of each genome, with genome-wide consensus accuracies as high as 99.99% (Table 1 and Supplementary Fig. 14). Escherichia coli sequencing with three successively larger ion chips produced 46 to over 270 megabases of sequence (Table 1). To characterize run quality, we aligned each read to the corresponding reference genome (Supplementary Fig. 15). The per-base accuracy was observed to be 99.569% 6 0.001% within the first 50 bases and 98.897% 6 0.001% within the first 100 bases (Supplementary Fig. 16a). This accuracy is similar at 50 bases and higher at 100 bases than light-based methods using modified nucleotides (1.1% versus 5% error40). The per-base accuracy in calling a homopolymer of length 5 is 97.328% 6 0.023% (Supplementary Fig. 16b) and higher than pyrosequencing-based sequencing methods1,41. For each genome, the observed distribution of per-base coverage matches closely with the theoretical Poisson distribution reflecting the uniform nature of the coverage (Supplementary Fig. 17). The distribution of coverage was also relatively unbiased across GC content (Supplementary Fig. 18). Ion sequencing technology has allowed the routine acquisition of 100-base read lengths, and perfect read lengths exceeding 200 bases (Supplementary Fig. 19). At present, 2040% of the sensors in a given
c
Signal processing and base calling

To change raw voltages into base calls, signal-processing software converts the raw data into measurements of incorporation in each well for each successive nucleotide flow using a physical model. Sampling the signal at high frequency relative to the time of the incorporation signal allows signal averaging to improve the SNR. The physical model takes into consideration diffusion rates, buffering effects and polymerase rates (Supplementary Fig. 11). The model is
a
50 45 40 35 30 25 20 15 10 5 5 10 15 20 25 30 35 40 45 50 0 20 0 Count 80 60 40 20
b
100
4 3
Bases
2 1 0 4 0 20 Flow 40
Bases 1 2 3 4 Time (s) 5 6 7 8
3 2 1 0 50 70 Flow 90
Figure 3 | Data collection and base calling. a, A 50 3 50 region of the ion chip. The brightness represents the intensity of the incorporation reaction in individual sensor wells. b, 1-nucleotide incorporation signal from an individual sensor well; the arrow indicates start of incorporation event, with the physical
3 5 0 | N AT U R E | V O L 4 7 5 | 2 1 J U LY 2 0 1 1
model (red line) and background corrected data (blue line) shown. c, The first 100 flows from one well. Each coloured bar indicates the corresponding number of bases incorporated during that nucleotide flow.
ARTICLE RESEARCH
Table 1 | Vibrio fisheri, E. coli, Rhodopseudomonas palustris and Homo sapiens
V. fisheri R. palustris E. coli E. coli E. coli H. sapiens
GC content Genome size Number of runs x ion chip size
38% 4.2 Mb 1 3 1.2 M
65% 5.5 Mb 1 3 1.2 M
51% 4.7 Mb 1 3 1.2 M
51% 4.7 Mb 1 3 6.1 M
51% 4.7 Mb 1 3 11 M
Fold coverage Coverage Reads $21 bases Reads $50 bases Reads $100 bases Mapped bases
6.2-fold 96.80% 261,313 233,049 156,391 26.0 Mb
6.9-fold 99.64% 444,750 399,360 160,726 37.8 Mb
11.3-fold 99.99% 507,198 487,420 400,743 47.6 Mb
36.2-fold 100.00% 1,852,931 1,698,852 1,012,918 169.6 Mb
58.4-fold 100.00% 2,594,031 2,343,880 1,779,237 273.9 Mb
41% 2.9 Gb 1,601 3 1.2 M 267 3 6.1 M 28 3 11.1 M 10.6-fold 99.21% 366,623,578 306,042,650 139,624,090 30.2 Gb
Coverage shows percentage of genome covered based on one or more reads mapping to each base of the reference genome. Reads align with 98% or greater accuracy.
Number of bases
To illustrate the scalability of semiconductor sequencing we produced whole-genome sequence data from an individual, G. Moore42 (Fig. 4). Written consent was provided by G. Moore to sequence and publish his genome and resulting findings. Reads from his genome were deposited in the Sequence Read Archive (SRA) under accession number ERP000682. The mean coverage of the G. Moore genome was 10.6-fold (Table 1). The degree to which the observed distribution of reads conforms to a Poisson distribution is indicative of a general lack of bias in coverage depth (Fig. 4b). We found 2,598,983 SNPs in the G. Moore genome, of which 3.08% were found to be novel, consistent with previous reports4,9,11 (Supplementary Methods). To confirm the accuracy of our analysis, we also sequenced the G. Moore genome using ABI SOLiD Sequencing43 to 15-fold coverage and validated 99.95% of the heterozygous and 99.97% of the homozygous genotypes (Supplementary Tables 1 and 2). We used the Online Mendelian Inheritance in Man database44 and the 23andMe functional SNP collection (https://www.23andme.com) to identify a subset of validated SNPs known to be involved in human disease and interesting phenotypes (Supplementary Table 3). We also examined the G. Moore sequence for the 7,693 deletions and inversions discovered by the 1000 Genomes Consortium and computationally found 3,413 of them in the G. Moore genome at a 99.94% positive predictive value (Supplementary Methods, Supplementary Table 4 and Supplementary Fig. 20). To determine G. Moores maternal ancestry, reads were also mapped to human mitochondrial DNA45 for a mean coverage of 732-fold. G. Moores mitochondria belong to haplogroup H, the most common in Europe46.
Ch r18
Ch
r2
Post-light sequencing of G. Moore
Ch r2 0 Ch r1 9
Chr1
run yield mappable reads. The gap between the number of sensors on a chip and the number yielding sequence is primarily the result of incomplete loading of the chip, poor amplification of a fragment onto the bead, and lack of clonality of the template. With continued improvements in loading and template preparation, along with improvements in signal processing and base calling, it is expected that the percentage of sensors yielding reads, the average read length and read accuracy will all improve significantly, as it has for other sequencing technologies14,911.
comprising about one billion sensors. By demonstrating the ability to make larger and denser arrays, use fewer transistors per sensor, and sequence from wells as small as 1.3 mm, our work suggests that readily available CMOS nodes should enable the production of one-billionsensor ion chips and low-cost routine human genome sequencing.
ChrY
ChrX
rM Ch r22 Ch 21 r Ch
Chr1
Chr16
Chr
Chr15
Chr4
Chr14
Chr5
Ch
r13
Ch
hr
12
r6
r11
Ch
Ch
r7
Chr8
Chr1
b
3.5 108 3.0 108 2.5 108 2.0 108 1.5 108 1.0 108 5.0 107 0.0 100 0
Discussion
We have demonstrated the ability to produce and use a disposable integrated circuit fabricated in standard CMOS foundries to perform, for the first time, post-light genome sequencing of bacterial and human genomes. With fifty billion dollars spent per year on CMOS semiconductor fabrication and packaging technologies, our goal was to leverage that investment to make a highly scalable sequencing technology. Using the G. Moore genome we demonstrated the feasibility of sequencing a human genome. The G. Moore genome sequence required on the order of a thousand individual ion chips
10 20 Per-base coverage
Chr9
30
Figure 4 | G. Moore genome. a, Circular genome plot. The average sequencing coverage (green) and average GC content (red) within 100-kb intervals is shown. b, Distribution of the observed per-base coverage depth along the genome (red) compared with the distribution expected from random coverage (green).
2 1 J U LY 2 0 1 1 | VO L 4 7 5 | N AT U R E | 3 5 1
RESEARCH ARTICLE
Received 8 March; accepted 26 May 2011. 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15. 16. 17. 18. 19. 20. 21. 22. 23. 24. 25. 26. 27. 28. 29. 30. 31. Margulies, M. et al. Genome sequencing in microfabricated high-density picolitre reactors. Nature 437, 376380 (2005). Shendure, J. et al. Accurate multiplex polony sequencing of an evolved bacterial genome. Science 309, 17281732 (2005). Bentley, D. R. et al. Accurate whole human genome sequencing using reversible terminator chemistry. Nature 456, 5359 (2008). Drmanac, R. et al. Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays. Science 327, 7881 (2010). Thomas, R. K. et al. Sensitive mutation detection in heterogeneous cancer specimens by massively parallel picoliter reactor sequencing. Nature Med. 12, 852855 (2006). Ley, T. J. et al. DNA sequencing of a cytogenetically normal acute myeloid leukaemia genome. Nature 456, 6672 (2008). Ng, S. B. et al. Exome sequencing identifies the cause of a mendelian disorder. Nature Genet. 42, 3035 (2010). Andries, K. et al. A diarylquinoline drug active on the ATP synthase of Mycobacterium tuberculosis. Science 307, 223227 (2005). Wheeler, D. A. et al. The complete genome of an individual by massively parallel DNA sequencing. Nature 452, 872876 (2008). Lupski, J. R. et al. Whole-genome sequencing in a patient with CharcotMarie Tooth neuropathy. N. Engl. J. Med. 362, 11811191 (2010). Schuster, S. C. et al. Complete Khoisan and Bantu genomes from southern Africa. Nature 463, 943947 (2010). Venter, J. C. et al. Environmental genome shotgun sequencing of the Sargasso Sea. Science 304, 6674 (2004). Sogin, M. L. et al. Microbial diversity in the deep sea and the underexplored rare biosphere. Proc. Natl Acad. Sci. USA 103, 1211512120 (2006). Noonan, J. P. et al. Genomic sequencing of Pleistocene cave bears. Science 309, 597599 (2005). Green, R. E. et al. Analysis of one million base pairs of Neanderthal DNA. Nature 444, 330336 (2006). Moore, G. E. Cramming more components onto integrated circuits. Electronics 38, 114117 (1965). Davies, K. The $1,000 Genome (Free Press, 2010). Sanger, F., Nicklen, S. & Coulson, A. R. DNA sequencing with chain-terminating inhibitors. Proc. Natl Acad. Sci. USA 74, 54635467 (1977). Smith, L. M. et al. Fluorescence detection in automated DNA sequence analysis. Nature 321, 674679 (1986). Metzker, M. L. Sequencing technologiesthe next generation. Nature Rev. Genet. 11, 3146 (2010). Wanlass, F. M. Low stand-by power complementary field effect circuitry. U.S. patent 3,356,858: (1967). Theuwissen, A. J. P. CMOS image sensors: state-of-the-art. Solid-State Electron. 52, 14011406 (2008). Sakurai, T. & Husimi, Y. Real-time monitoring of DNA polymerase reactions by a micro ISFET pH sensor. Anal. Chem. 64, 19961997 (1992). Fritz, J., Cooper, E. B., Gaudet, S., Sorger, P. K. & Manalis, S. R. Electronic detection of DNA by its intrinsic molecular charge. Proc. Natl Acad. Sci. USA 99, 1414214146 (2002). Drummond, T. G., Hill, M. G. & Barton, J. K. Electrochemical DNA sensors. Nature Biotechnol. 21, 11921199 (2003). Bergveld, P. Development of an ion-sensitive solid-state device for neurophysiological measurements. IEEE Trans. Biomed. Eng. 17, 7071 (1970). Bergveld, P. Thirty years of ISFETOLOGY. What happened in the past 30 years and what may happen in the next 30 years. Sens. Actuators B Chem. 88, 120 (2003). Yeow, T., Haskard, M., Mulcahy, D., Seo, H. & Kwon, D. A very large integrated pHISFET sensor array chip compatible with standard CMOS processes. Sens. Actuators B Chem. 44, 434440 (1997). Bausells, J., Carrabina, J., Errachid, A. & Merlos, A. Ion-sensitive field-effect transistors fabricated in a commercial CMOS technology. Sens. Actuators B Chem. 57, 5662 (1999). Milgrew, M., Hammond, P. & Cumming, D. The development of scalable sensor arrays using standard CMOS technology. Sens. Actuators B Chem. 103, 3742 (2004). Hizawa, T., Sawada, K., Takao, H. & Ishida, M. Fabrication of a two-dimensional pH image sensor using a charge transfer technique. Sens. Actuators B Chem. 117, 509515 (2006). 32. Purushothaman, S., Toumazou, C. & Ou, C. P. Protons and single nucleotide polymorphism detection: A simple use for the ion sensitive field effect transistor. Sens. Actuators B Chem. 114, 964968 (2006). 33. Pourmand, N. et al. Direct electrical detection of DNA synthesis. Proc. Natl Acad. Sci. USA 103, 64666470 (2006). 34. Sakata, T. & Miyahara, Y. DNA sequencing based on intrinsic molecular charges. Angew. Chem. 118, 22832286 (2006). 35. Milgrew, M. J., Riehle, M. O. & Cumming, D. R. S. A large transistor-based sensor array chip for direct extracellular imaging. Sens. Actuators B Chem. 111112, 347353 (2005). 36. Milgrew, M. J. & Cumming, D. R. S. Matching the transconductance characteristics of CMOS ISFET arrays by removing trapped charge. Electron Devices. IEEE Trans. Electron Devices 55, 10741079 (2008). 37. Hammond, P. & Cumming, D. Encapsulation of a liquid-sensing microchip using SU-8 photoresist. Microelectron. Eng. 7374, 893897 (2004). 38. Mikolajick, T., Kuhnhold, R. & Ryssel, H. The pH-sensing properties of tantalum pentoxide films fabricated by metal organic low pressure chemical vapor deposition. Sens. Actuators B Chem. 44, 262267 (1997). 39. Ewing, B. & Green, P. Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res. 8, 186194 (1998). 40. Claesson, M. J. et al. Comparison of two next-generation sequencing technologies for resolving highly complex microbiota composition using tandem variable 16S rRNA gene regions. Nucleic Acids Res. 38, e200 (2010). 41. Huse, S. M., Huber, J. A., Morrison, H. G., Sogin, M. L. & Mark Welch, D. Accuracy and quality of massively-parallel DNA pyrosequencing. Genome Biol. 8, R143 (2007). 42. Krzywinski, M. et al. Circos: an information aesthetic for comparative genomics. Genome Res. 19, 16391645 (2009). 43. McKernan, K. J. et al. Sequence and structural variation in a human genome uncovered by short-read, massively parallel ligation sequencing using two-base encoding. Genome Res. 19, 15271541 (2009). 44. Hamosh, A., Scott, A. F., Amberger, J. S., Bocchini, C. A. & McKusick, V. A. Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res. 33, D514D517 (2005). 45. Andrews, R. M. et al. Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA. Nature Genet. 23, 147 (1999). 46. Kloss Brandstatter, A. et al. HaploGrep: a fast and reliable algorithm for automatic classification of mitochondrial DNA haplogroups. Hum. Mutat. 32, 2532 (2011). Supplementary Information is linked to the online version of the paper at www.nature.com/nature. Acknowledgements We want to thank G. Moore for his willingness to participate in this study. We thank G. Fergus, M. Jain, J. Kole, L. Stevens and the ION team for supporting our efforts, and H. Peckman, V. Tadigotla, D. Holloway and S. Mclaughlin for help on the variant analysis, and M. Ross of the Broad Institute for help on quality scores. This research was supported, in part, by a grant from the National Human Genome Research Institute (NHGRI), RFA-HG-08-008, Revolutionary Genome Sequencing TechnologiesThe $1000 Genome. Grant number: R01 HG005094. Author Contributions J.M.R. conceived the technology, supervised the project and wrote the manuscript with input from co-authors. K.J., M.J.M. and J.B. designed chips. J.F.D., M.A., D.L., J.W.M., J.F.S., E.N., M.S., X.M., A.B., T.A.C., M.H., I.B.S., B.R., J.S., E.F., M.S., J.A.F., K.J.M. and J.H.L. developed methods. M.D., J.T.B., M.E., J.H., N.H., T.M.R., B.P.P., S.E.C., M.L., Y.F. and A.W. wrote software and analysed data. W.H., J.S., W.M., D.M., J.R.N. and G.T.R. designed the instrument. E.D., D.D., R.K. and T.S. sequenced the human sample. Author Information Sequences for Homo sapiens, Escherichia coli, Vibrio fisheri and Rhodopseuomanas palustris were deposited in the Sequence Read Archive (SRA) under accession numbers ERP000682, ERP000541, ERP000542 and ERP000543 respectively. Reprints and permissions information is available at www.nature.com/ reprints. This paper is distributed under the terms of the Creative Commons Attribution-Non-Commercial-Share Alike Licence, and is freely available to all readers at www.nature.com/nature. The authors declare competing financial interests: details accompany the full-text HTML version of this paper at www.nature.com/nature. Readers are welcome to comment on the online version of this article at www.nature.com/nature. Correspondence and requests for materials should be addressed to J.M.R. (Jonathan.Rothberg@Lifetech.com).
3 5 2 | N AT U R E | V O L 4 7 5 | 2 1 J U LY 2 0 1 1

Nature 10242

Transféré par

Informations du document

Description originale:

Copyright

Formats disponibles

Partager ce document

Partager ou intégrer le document

Options de partage

Avez-vous trouvé ce document utile ?

Ce contenu est-il inapproprié ?

Droits d'auteur :

Formats disponibles

Nature 10242

Transféré par

Droits d'auteur :

Formats disponibles

ARTICLE

An integrated semiconductor device enabling non-optical genome sequencing

A CMOS integrated circuit for sequencing

2011 Macmillan Publishers Limited. All rights reserved

Sequencing on a semiconductor device

Sensor interface and readout

Row select registers

Sensor interface and readout

2011 Macmillan Publishers Limited. All rights reserved

Sequencing bacterial genomes

Signal processing and base calling

Bases 1 2 3 4 Time (s) 5 6 7 8

2011 Macmillan Publishers Limited. All rights reserved

GC content Genome size Number of runs x ion chip size

38% 4.2 Mb 1 3 1.2 M

65% 5.5 Mb 1 3 1.2 M

51% 4.7 Mb 1 3 1.2 M

51% 4.7 Mb 1 3 6.1 M

6.2-fold 96.80% 261,313 233,049 156,391 26.0 Mb

6.9-fold 99.64% 444,750 399,360 160,726 37.8 Mb

11.3-fold 99.99% 507,198 487,420 400,743 47.6 Mb

36.2-fold 100.00% 1,852,931 1,698,852 1,012,918 169.6 Mb

58.4-fold 100.00% 2,594,031 2,343,880 1,779,237 273.9 Mb

Post-light sequencing of G. Moore

2011 Macmillan Publishers Limited. All rights reserved

2011 Macmillan Publishers Limited. All rights reserved

Vous aimerez peut-être aussi