Académique Documents
Professionnel Documents
Culture Documents
Since the first demonstration of single-molecule characterization by a biological nanopore two decades ago (11), interest
has grown in using nanopores as sensors for DNA base discrimination. One approach is strand sequencing, in which each
base is identified as it moves through an ion-conducting channel,
ideally producing a characteristic current blockade event for
each base. Progress in nanopore sequencing has been hampered
by two physical limitations. First, single-base translocation can be
too rapid for detection (13 s per base), and second, structural
similarities between bases make them difficult to identify unambiguously (12). Some attempts to address these issues have
used enzymes as molecular motors to control single-stranded
DNA (ssDNA) translocation speeds but still rely on identifying
multiple bases simultaneously (1315). Other approaches used
Significance
DNA sequencing has been dramatically expanding its scope in
basic life science research and clinical medicine. Recently, a set of
polymer-tagged nucleotides were shown to be viable substrates
for replication and electronically detectable in a nanopore. Here,
we describe the design and characterization of a DNA polymerasenanopore protein construct on an integrated chip. This
system incorporates all four tagged nucleotides and distinguishes singletagged-nucleotide addition in real time. Coupling protein catalysis and nanopore-based detection to an
electrode array could provide the foundation of a highly scalable, single-molecule, electronic DNA-sequencing platform.
www.pnas.org/cgi/doi/10.1073/pnas.1608271113
Author contributions: P.B.S., M. Palla, S. Kalachikov, J.N., S. Kumar, I.M., A. Bibillo, R.C., R.D.,
J.J.R., C.W.F., S.R., J.J., and G.M.C. designed research; P.B.S., M. Palla, S. Kalachikov, J.N., M.D.,
A.T., S. Kumar, M. Porel, M.C., C.T., I.M., Z.L., S.S., A. Aberra, C.A., A.Y., A. Aguirre, E.T.H., and
C.W.F. performed research; P.B.S., M. Palla, S. Kalachikov, M.D., S. Kumar, D.K., J.P., A. Bhat,
D.G., A. Bibillo, and R.C. contributed new reagents/analytic tools; P.B.S., M. Palla, and D.K.
analyzed data; and P.B.S. and M. Palla wrote the paper.
Reviewers: J.H.G., University of Washington; A.M., Boston University; and M.W.,
Northeastern University.
Conflict of interest statement: The Nanopore SBS technology has been exclusively licensed
by Genia. In accordance with the policy of Columbia University, the coinventors (S. Kumar,
M.C., C.T., Z.L., S. Kalachikov, J.J.R., and J.J.) are entitled to royalties through this license.
G.M.C. is a member of the Scientific Advisory Board of Genia, other potential conflicts are
described here: arep.med.harvard.edu/gmc/tech.html.
This article is a PNAS Direct Submission.
Freely available online through the PNAS open access option.
1
APPLIED BIOLOGICAL
SCIENCES
Edited by Stephen T. Warren, Emory University, Atlanta, Georgia, and approved September 28, 2016 (received for review August 12, 2016)
5
Template
5
(A) tag 1
(T) tag 2
(C) tag 3
Primer
Polymerase
Linkage
CIS
(G) tag 4
dNTP
Tag
pA
Nanopore
Lipid bilayer
TRANS
tag 2
tag 3
tag 1
tag 4
Time (s)
Fig. 1. Principle of single-molecule DNA sequencing by a nanopore using
tagged nucleotides. Each of the four nucleotides carries a different polymer
tag (green square, A; red oval, T; blue triangle, C; black square, G). During
SBS, the complementary nucleotide (T shown here) forms a tight complex
with primer/template DNA and the nanopore-coupled polymerase. As the
tagged nucleotides are incorporated into the growing DNA template, their
tags, attached via the 5-phosphate, are captured in the pore lumen, which
results in a unique current blockade signature (Bottom). At the end of the
polymerase catalytic reaction, the tag is released, ending the current
blockade, which returns to open-channel reading at this time. For the purpose of illustration, four distinct tag signatures are shown in the order of
their sequential capture. A large array of such nanopores could lead to
highly parallel, high-throughput DNA sequencing.
APPLIED BIOLOGICAL
SCIENCES
Fig. 2. Assembly of the porinpolymerase construct. (A) Protein constructs used to form the porinpolymerase conjugate include unmodified HL with a
Strep-tag, HL with a C-terminal SpyTag peptide and 6-His-tag, and 29 with a C-terminal SpyCatcher domain. (B) Assembly steps. HLSpyTag6-His and
unmodified HL are oligomerized with lipid, and the 1:6 SpyTag:unmodified assembled porin is purified. Addition of 29SpyCatcher to the 1:6 pore yields
one polymerase per HL pore. (C) A molecular model generated with Rosetta using the determined structures for 29 polymerase (PDB ID code 2PYJ), HL
(PDB ID code 7AHL), and SpyCatcher/SpyTag (PDB ID code 2X5P). Colors of the proteins match the cartoon representations in A and B. The expected tag exit
site on the polymerase and the opening to the nanopore can be in close proximity with distances as short as 46 in some models. (D) The stoichiometry in
solution of the porin subunits was confirmed by SDS/PAGE without boiling. To confirm the assembly, excess 29SpyCatcher was added to 1:6 pore. The
combination yields only pores with one polymerase attached.
Stranges et al.
PNAS PLUS
Fig. 3. Representative current versus time traces for the various stages of the pore assembly. (A) When neither tagged nucleotide nor polymerase is present,
only stable open-channel current is observable. (B) Attachment of polymerase does not change the mean open-channel current. The current root-meansquare fluctuation (RMSF) increase in B may be an indication of the polymerase coupled to the pore. (C) When no polymerase is attached to the pore and
tagged nucleotide is introduced, transient events are observed. (D) When polymerasetemplate is attached to the pore and the complementary base dG6PdT30 is added, there are prolonged capture events as well as transient events as observed in C.
PNAS PLUS
APPLIED BIOLOGICAL
SCIENCES
Fig. 4. Tagged-nucleotide discrimination on a semiconductor chip array. All measurements were taken on a porepolymerasetemplate complex under
noncatalytic conditions where the first base on the template is complementary to the added tagged nucleotide. (A) Current versus dwell time (duration of
each current blockade) plots for captures of all tagged nucleotides. Capture events cluster into distinct current and dwell time regions for each tagged
nucleotide. (B) Representative single-pore traces of tagged-nucleotide capture shown in A. Current blockade levels for each are marked in red. The blockades
demonstrate unique, single-molecule events corresponding to the four distinct tag captures.
residual current (SI Appendix, Fig. S10). TCCs were differentiated from background captures by requiring their dwell time to
be greater than 10 ms. Then, we used a classification algorithm
derived from the characteristic dwell time and residual current
intervals for each set of ternary capture experiments to estimate
the accuracy with which one could call a given TCC event. We
Stranges et al.
96.77
2.15
1.08
0.00
14.38
78.77
2.05
4.80
0.78
0.00
99.22
0.00
0.00
0.00
1.61
98.39
A
Template complement > TATGATGATCCCAGTAGTAGTCCCGCGCTCGAG
G
10 s
G
A A
T
C
G
A
G
(A)
G
A
1 s
Fig. 5. Representative examples of real-time detection of numerous successive tagged-nucleotide incorporations into a self-priming DNA hairpin
template catalyzed by nanopore-bound polymerase on the Genia chip.
(A) Two base captures of tagged C and G nucleotides with standard A and T
nucleotides. Part of the template sequence is shown in red (SI Appendix, Fig.
S6). The only captures observed in the trace match the expected levels for
dG6P-T30 and dC6P-dSp3. (B) Four-base sequencing. Events with dwell time
>10 ms were categorized by manually assigning current blockade events to
their respective tag capture boxes (Methods). Homopolymer regions in the
template and raw sequencing reads were considered a single base for local
sequence alignment. A 12-bp section of such an alignment is shown in red.
Stranges et al.
Methods
Protein Expression and Purification. The 29 DNA polymeraseSpyCatcher
construct with an N-terminal Strep-tag was expressed in BL21 DE3 Star cells by
growing them in Magic Media (Invitrogen) at 37 C until OD 0.6, followed by
overnight growth at 25 C. Cells were resuspended and lysed by sonication in
Polymerase Buffer (PolBuff): 50 mM Tris, pH 7.5, 150 mM NaCl, 0.1 mM EDTA,
0.05% (vol/vol) Tween 20, and 5 mM 2-mercaptoethanol. Benzolase nuclease
was added after cell lysis to remove excess bound DNA. The protein was purified using Streptactin columns per the manufacturers instructions (IBA). Purified protein was eluted with PolBuff with added desthiobiotin. Both HLStreptag and HLSpyTag-6-His were expressed in BL21 DE3 Star pLys-S cells grown
in Magic Media for 8 h at 37 C. Each was lysed by sonication in 50 mM Tris,
pH 8.0, 200 mM NaCl. Strep-tagged HL was purified on Streptactin columns
and eluted in the same buffer with desthiobiotin. His-tagged HL was purified
with a cobalt column and eluted with 300 mM imidazole.
1:6 Porin Assembly Formation and Isolation. To form a 1:6 SpyTag:unmodified
HL pore, purified HL proteins were mixed in a ratio of 1:6 SpyTag construct:unmodified. The lipid 1,2-diphytanoyl-sn-glycero-3-phosphocholine
(DPhPC) was added to a final concentration of 5 mg/mL, followed by in-
Stranges et al.
PNAS PLUS
APPLIED BIOLOGICAL
SCIENCES
1. Shendure J, Lieberman Aiden E (2012) The expanding scope of DNA sequencing. Nat
Biotechnol 30(11):10841094.
2. Lander ES (2011) Initial impact of the sequencing of the human genome. Nature
470(7333):187197.
3. Soon WW, Hariharan M, Snyder MP (2013) High-throughput sequencing for biology
and medicine. Mol Syst Biol 9(1):640.
4. Chen CY (2014) DNA polymerases drive DNA sequencing-by-synthesis technologies:
Both past and present. Front Microbiol 5(JUN):305.
5. Fuller CW, et al. (2009) The challenges of sequencing by synthesis. Nat Biotechnol
27(11):10131023.
6. Perkins TT, Quake SR, Smith DE, Chu S (1994) Relaxation of a single DNA molecule
observed by optical microscopy. Science 264(5160):822826.
7. Smith SB, Cui Y, Bustamante C (1996) Overstretching B-DNA: The elastic response of
individual double-stranded and single-stranded DNA molecules. Science 271(5250):
795799.
8. Rief M, Clausen-Schaumann H, Gaub HE (1999) Sequence-dependent mechanics of
single DNA molecules. Nat Struct Biol 6(4):346349.
9. Harris TD, et al. (2008) Single-molecule DNA sequencing of a viral genome. Science
320(5872):106109.
10. Eid J, et al. (2009) Real-time DNA sequencing from single polymerase molecules.
Science 323(5910):133138.
11. Kasianowicz JJ, Brandin E, Branton D, Deamer DW (1996) Characterization of individual polynucleotide molecules using a membrane channel. Proc Natl Acad Sci USA
93(24):1377013773.
12. Feng Y, Zhang Y, Ying C, Wang D, Du C (2015) Nanopore-based fourth-generation
DNA sequencing technology. Genomics Proteomics Bioinformatics 13(1):416.
13. Cherf GM, et al. (2012) Automated forward and reverse ratcheting of DNA in a
nanopore at 5- precision. Nat Biotechnol 30(4):344348.
14. Manrao EA, et al. (2012) Reading DNA at single-nucleotide resolution with a mutant
MspA nanopore and phi29 DNA polymerase. Nat Biotechnol 30(4):349353.
15. Laszlo AH, et al. (2014) Decoding long nanopore sequencing reads of natural DNA.
Nat Biotechnol 32(8):829833.
16. Clarke J, et al. (2009) Continuous base identification for single-molecule nanopore
DNA sequencing. Nat Nanotechnol 4(4):265270.
17. Astier Y, Braha O, Bayley H (2006) Toward single molecule DNA sequencing: Direct
identification of ribonucleoside and deoxyribonucleoside 5-monophosphates by using an engineered protein nanopore equipped with a molecular adapter. J Am Chem
Soc 128(5):17051710.
18. Ayub M, Hardwick SW, Luisi BF, Bayley H (2013) Nanopore-based identification of
individual nucleotides for direct RNA sequencing. Nano Lett 13(12):61446150.
19. Kumar S, et al. (2012) PEG-labeled nucleotides and nanopore detection for single
molecule DNA sequencing by synthesis. Sci Rep 2:684.
20. Fuller CW, et al. (2016) Real-time single-molecule electronic DNA sequencing by
synthesis using polymer-tagged nucleotides on a nanopore array. Proc Natl Acad Sci
USA 113(19):52335238.
21. Zakeri B, et al. (2012) Peptide tag forming a rapid covalent bond to a protein, through
engineering a bacterial adhesin. Proc Natl Acad Sci USA 109(12):E690E697.
22. Howorka S, Cheley S, Bayley H (2001) Sequence-specific detection of individual DNA
strands using engineered nanopores. Nat Biotechnol 19(7):636639.
23. Davis R, Chen R, Bibillo A, Korenblum D, Dorwart M (2014) Nucleic acid sequencing
using tags. US Patent Application 14/073,445.
24. Song L, et al. (1996) Structure of staphylococcal alpha-hemolysin, a heptameric
transmembrane pore. Science 274(5294):18591866.
25. Berman AJ, et al. (2007) Structures of phi29 DNA polymerase complexed with substrate: The mechanism of translocation in B-family polymerases. EMBO J 26(14):
34943505.
26. Leaver-Fay A, et al. (2011) ROSETTA3: An object-oriented software suite for the
simulation and design of macromolecules. Methods Enzymol 487:545574.
27. Santos E, Lzaro JM, Prez-Arnaiz P, Salas M, de Vega M (2014) Role of the LEXE motif
of protein-primed DNA polymerases in the interaction with the incoming nucleotide.
J Biol Chem 289(5):28882898.
28. Robinson-Mosher A, Shinar T, Silver PA, Way J (2013) Dynamics simulations for engineering macromolecular interactions. Chaos 23(2):025110.
29. Reddy Chichili VP, Kumar V, Sivaraman J (2013) Linkers in the structural biology of
protein-protein interactions. Protein Sci 22(2):153167.
30. Klein JS, Jiang S, Galimidi RP, Keeffe JR, Bjorkman PJ (2014) Design and characterization of structured protein linkers with differing flexibilities. Protein Eng Des Sel
27(10):325330.
31. Derrington IM, et al. (2015) Subangstrom single-molecule measurements of motor
proteins using a nanopore. Nat Biotechnol 33(10):10731075.
32. Nivala J, Marks DB, Akeson M (2013) Unfoldase-mediated protein translocation
through an -hemolysin nanopore. Nat Biotechnol 31(3):247250.
33. Merolla PA, et al. (2014) Artificial brains. A million spiking-neuron integrated circuit
with a scalable communication network and interface. Science 345(6197):668673.
8 of 8 | www.pnas.org/cgi/doi/10.1073/pnas.1608271113
Stranges et al.
Wyss Institute for Biologically Inspired Engineering at Harvard University, Boston, MA 02115
Center for Genome Technology and Biomolecular Engineering, Department of Chemical Engineering, Columbia
University, New York, NY 10027
d
S1
SI APPENDIX METHODS
S3
1. Protein Modeling
S3
S7
S9
S12
S18
S29
S32
8. Chip Reusability
S34
SI APPENDIX REFERENCES
S36
S2
SI APPENDIX METHODS
1. Protein Modeling
A model of the HL-SpyTag-29-SpyCatcher conjugate was made by hand using PyMOL
(Schrdinger, LLC). Structures for HL, 29 DNA polymerase and, FBAB-B (which forms the
SpyTag/SpyCatcher complex) (PDB codes 7AHL, 2PYJ, and 2X5P respectively) were arranged
in an expected tagged nucleotide capturing orientation and Gly/Ser linkers were built to join the
HL-SpyTag and 29-SpyCatcher (Fig. S1). This final structure was then repacked/minimized in
Rosetta to remove clashes. The conformational freedom of the conjugate was explored using the
FloppyTail backbone sampling protocol in Rosetta (1). The backbone torsion angles of the
linkers between HL and SpyTag, and 29 and SpyCatcher were allowed freedom while the
backbone of the rest of the protein was held fixed. The results of backbone sampling are shown if
Fig. S2.
Sample submission code follows:
./FloppyTail.mpi.linuxgccrelease
-database /path/to/main/database
-s 7AHL_SpyTAG_phi29_SpyCatch_assemble_repacked.pdb
-nstruct 5000
-in:file:movemap movemap_file
-packing:repack_only
-AnchoredDesign:refine_repack_cycles 30
-AnchoredDesign:perturb_cycles 15000
-AnchoredDesign:refine_cycles 3000
The MoveMap file represents the residue numbers of the linkers. All other backbone angles are
held constant.
S3
>HL-SpyTag-His
MADSDINIKTGTTDIGSNTTVKTGDLVTYDKENGMHKKVFYSFIDDKNHNKKLLVIRTKGTIAGQYRVYS
EEGANKSGLAWPSAFKVQLQLPDNEVAQISDYYPRNSIDTKEYMSTLTYGFNGNVTGDDTGKIGGLIGAN
VSIGHTLKYVQPDFKTILESPTDKKVGWKVIFNNMVNQNWGPYDRDSWNPVYGNQLFMKTRNGSMKAAEN
FLDPNKASSLLSSGFSPDFATVITMDRKASKQQTNIDVIYERVRDDYQLHWTSTNWKGTNTKDKWTDRSS
ERYKIDWEKEEMTNGGSSGGSSGGAHIVMVDAYKPTKKGHHHHHH
Color assignment: HL linker SpyTag 6xHis
Fig. S1. Sequences of 29 DNA polymerase and HL constructs with colored annotations for the various protein
sequence regions.
S4
Fig. S2. Linker sampling of the nanopore-polymerase complex in Rosetta. Backbone torsion angles were sampled
on the linkers highlighted using Rosettas FloppyTail application. Various final models are shown in different
colors. The majority of the models place the expected tag site >100 away. A small fraction of models place the tag
exit site within 50 , demonstrating that these linker lengths make it possible for a polymerase-bound tagged
nucleotide to be captured by the pore.
S5
Fig. S3.
stoichiometries on MonoS column and the isolation of 1:6 (SpyTag:unmodified) HL complex. Monomeric
unmodified HL and SpyTag-6-His HL were mixed at a ratio of 6:1 respectively and oligomerized by addition of
lipid. After oligomerization, the lipid vesicles were broken with detergent and the oligomers were separated on a
MonoS column (GE Healthcare) with a linear NaCl gradient from 0 M to 2 M. The ratio of the SpyTag-6-His HL
to unmodified HL is indicated near the corresponding peak on the chromatographic profile. The fraction
representing 1:6 pore assembly was further characterized by polyacrylamide gel electrophoresis under denaturing
conditions, assayed for ability to bind only one SpyCatcher fusion polymerase and used to assemble the sequencing
complexes in further experiments. FT = flow-through.
S6
S7
Fig. S4. Rolling circle amplification (RCA) of purified HL-29 conjugates. (a) SDS/PAGE gel of purified HLSpyTag-His and 29-SpyCatcher-StrepTag for evaluation by RCA. Additional HisTrap pulldown by magnetic beads
(Qiagen) was performed to ensure no free 29-SpyCatcher was present. (b) Agarose gel of RCA using purified
proteins from a. Samples are numbered as in a. Negative control contained no polymerase. Positive control
contained wild-type 29 (NEB). Amplified products can be seen in all samples where 29-SpyCatcher is present,
indicating that polymerase activity is maintained when fused to SpyCatcher and when conjugated to HL-SpyTag.
S8
1
() = ( ) 2
=1
where T is the total number of time steps during the nanopore measurment; It is the measured
current level at time step t and is the time-averaged current level of the same pore. Then the
mean and standard deviation of RMSF current was computed for that experiment. This algorithm
was implemented in MATLAB (2014b, MathWorks, Natick, MA).
S9
Fig. S5. Structures of the four polymer tagged nucleotides. Nucleotides used in this study are 5-hexaphosphates
(red) connected to a common linker (green) and an oligonucleotide tag (black) consisting of Cy3 and either an
unmodified oligonucleotide chain (dT 30) or oligonucleotides with a variety of modifications including runs of abasic
nucleotides (dSp8 and dSp30) or two thymidines substituted with fluorescein (FL). Notation used throughout: dG6PT30, dA6P-FL, dC6P-dSp3, dT6P-dSp30.
S10
JAMN
5' - TTT TT(G CGC TCG AGA TCT CCG TAA GGA GAT CTC GAG CGC) GGG ACT ACT ACT
GGG ATC ATC ATN (GCC ACC TCA GCT GCA CGT AAG TGC AGC TGA GGT GGC) - 3'
Reverse complement:
5NATGATGATCCCAGTAGTAGTCCCGCGCTCGAGATCTCCTTACGGAGATCTCGAGCGCAAAAA-3
Reverse complement with homopolymers reduced to a single base:
5-NATGATGATCAGTAGTAGTCGCGCTCGAGATCTCTACGAGATCTCGAGCGCA-3
Fig. S6. General sequence and predicted secondary structure of the DNA hairpins used as templates. The first query
base (highlighted in yellow) probes complementary base N for tag discrimination. The N is replaced with the four
bases (A, T, C, G) in the experimental oligonucleotides. Structure predicted by UNAFold (2). The reverse
complement (bases that would be added by the nanopore-bound polymerase) is shown below the JAMN sequence.
In addition, the complementary strand with homopolymer runs reduced to a single base is shown.
S11
With the established 1:6 pore assembly, we evaluated the background signal due to tagged
nucleotides in the absence of template DNA. After incorporating the porin-polymerase conjugate
into the membrane the tagged nucleotides were added under non-catalytic (Ca2+ ion containing)
buffer conditions with a constant 100 mV potential. Addition of each tagged nucleotide caused
current fluctuations more often than in the channel without tagged nucleotides. The deflections in
the open channel current were evaluated for the amount of time spent below 70% of open
channel current (dwell time) and the mean current over the same time. While each tagged
nucleotide produced a wide distribution of deflection current, all dwell times were below ~10
ms, consistent with rapid translocations through the nanopore, and readily distinguishable from
longer polymerase/template/nucleotide ternary captures (Figs. S7-9).
To further test the viability of our approach for real-time sequencing, we evaluated the
tagged nucleotide background in Mn2+ containing buffer with the established 1:6 pore assembly.
After inserting the pore into the membrane the tagged nucleotides were added and a constant
potential of 100 mV was applied across the membrane. Addition of each tagged nucleotide
resulted in more frequent current deflections than in the channel without tagged nucleotides
similar to what was observed for the Ca2+ buffer experimental counterpart. As before, all tagged
nucleotides produced similar dwell times below ~10 ms (Fig. S15). By using frequency analysis,
we found that there was only a 0.00-1.20% probability of falsely identifying a background event
as a true ternary complex capture event, which reflects a very low false positive rate due to
background even under catalytic conditions (Table S3) required for sequencing.
S12
Fig. S7. Residual current versus dwell time scatter plots of current blockade events for ternary complex captures
(red) and tagged nucleotide background (blue). Ternary complex capture experiments were run at 100 mV with Ca 2+
as the divalent cation to enable the polymerase to bind a nucleotide. Each tagged nucleotide was added to a
nanopore-polymerase-template complex where the first base in the template was the complementary base to the
nucleotide. In a separate experiment, background events were measured by adding the individual tagged nucleotides
to the chip, after the 1:6 HL pore insertion. A constant 100 mV potential was applied and the background signals of
tagged nucleotides were observed. Buffer contained 300 mM NaCl, 3 mM CaCl2, and 20 mM HEPES pH 7.5.
Individual tagged nucleotides were added at 3 M.
S13
Fig. S8. Dwell time distributions of the four tagged nucleotides within the tag capture current band. Current band
levels were selected based on mean capture level distributions (Fig. S9). Both capture and background distributions
are normalized by the total number of counts. Dwell times for captures show a shift to longer times than the
background current blockade events. Most background captures are shorter than 10 ms while ternary complex
captures last longer than 10 ms.
S14
Fig. S9. Comparison of residual current and dwell time characteristics for the four tagged nucleotides. Box-andwhisker plots were used to discriminate the four tagged nucleotides (dG6P-T30 = G, dA6P-FL = A, dC6P-dSp3 = C,
dT6P-dSp30 = T) during ternary complex capture (TCC a and c) and tagged nucleotide background (TNB b and
d) experiments. The central red mark represents the median, while the bottom and top blue edges of the box are the
1st and 3rd quartile median values respectively. The whiskers extend to the lowest and highest values within 1.5 IQR
of the 1st and 3rd quartile medians. Events are collected over at least 3 experiments for both TCC and TNB
experiments. Number of events (n) for captures: nG = 376, nA = 229, nC = 512, nT = 243 and for background: nG =
7688, nA = 6844, nC = 4068, nT = 3069.
S15
Table S1. Residual current statistics of ternary complex capture events with complete protein
construct for each of the four tagged nucleotides (dG6P-T30 = G, dA6P-FL = A, dC6P-dSp3 =
C, dT6P-dSp30 = T) under non-catalytic (Ca2+ ion containing) buffer conditions.a
Tagged Nucleotide
Residual Current
Median
0.2108
0.2269
0.3132
0.4768
0.2108
0.2254
0.3141
0.4735
0.0119
0.0086
0.0102
0.0202
Buffer containing 300 mM NaCl, 3 mM CaCl2 and 20 mM HEPES pH 7.5 was used to create a buffer condition to
inhibit polymerase catalysis for preventing sequential nucleotide incorporation on the DNA hairpin template. b Tag
capture box was defined as described in Methods (see Classification of Capture Events). Median, mean and
standard deviation of dwell and residual current was calculated from dwell and residual current data presented in
Fig. S9.
S16
Fig. S10. Residual current histogram for all current blockade events with dwell time greater than 10 ms during
ternary complex capture of individual tagged nucleotides. All experiments were performed as described in Fig. S7.
Buffer contained 300 mM NaCl, 3 mM CaCl2, and 20 mM HEPES pH 7.5. Individual tagged nucleotides were
added at 3 M.
S17
S18
Fig. S11. Characteristic two-level current blockade signature of captured dA6P-FL. When the A-tag containing
two bulky FL groups enters the pore, the open channel current reading drops to a stable current blockade level of
~9 pA (state i) due to the partial ion current blockade. The pore lumen is not wide enough to smoothly translocate
this tag, so it folds up into a knot-like structure. This knot significantly blocks the current flow at this point, so the
stable current blockade level drops even closer to baseline level (state ii). The voltage-driven back-pressure builds
up in the pore lumen exerting enough force on the folded A-tag to push it into the trans side of the lipid bilayer. At
this point, the signal returns to open-current reading indicating pore clearing and completion of the A-tag
translocation (state iii). Color assignment: maroon DNA template, blue primer, orange SpyCather-SpyTag
linker, red A-tag, black FL group, grey 29 polymerase, dark green HL, light green lipid bilayer.
S19
Table S2. Percent probability of identifying the tagged nucleotide background events as ternary
complex captures for each of the four tagged nucleotides (dG6P-T30 = G, dA6P-FL = A, dC6PdSp3 = C, dT6P-dSp30 = T) under non-catalytic (Ca2+ ion containing) buffer conditions.a
Total Eventsb
3884
1711
4068
3069
15
64
51
29
% of G Capture Events
0.08
1.17
0.17
0.39
% of A Capture Events
0.08
0.88
0.39
0.36
% of C Capture Events
0.10
1.23
0.42
0.13
% of T Capture Events
0.13
0.47
0.27
0.07
Ca2+
Buffer containing 300 mM NaCl, 3 mM CaCl2 and 20 mM HEPES pH 7.5 was used to create non-catalytic buffer
conditions. b Capture events were measured at 100 mV applied potential and identified as described in Methods (see
Event Detection and Data Analysis). c Tag capture box was defined as described in Methods (see Classification of
Capture Events). There is only a 0.07-1.23% probability of inaccurately identifying a background event as a true
ternary complex capture event, which reflects a very low false positive rate due to background.
S20
Table S3. Percent probability of identifying the tagged nucleotide background events as ternary
complex captures for each of the four tagged nucleotides (dG6P-T30 = G, dA6P-FL = A, dC6PdSp3 = C, dT6P-dSp30 = T) under catalytic (Mn2+ ion containing) buffer conditions.a
Total Eventsb
751
917
3230
963
10
24
26
10
% of G Capture Events
0.00
0.33
0.15
0.10
% of A Capture Events
0.27
0.87
0.28
0.21
% of C Capture Events
0.67
1.20
0.12
0.42
% of T Capture Events
0.40
0.22
0.25
0.31
Mn2+
Buffer containing 300 mM NaCl, 0.1 mM MnCl2 and 20 mM HEPES pH 7.5 was used to create catalytic buffer
conditions. b Capture events were measured at 100 mV applied potential and identified as described in Methods (see
Event Detection and Data Analysis). c Tag capture box was defined as described in Methods (see Classification of
Capture Events).
S21
Fig. S12. Ternary complex capture of one tagged nucleotide in the presence of all tagged nucleotides. A nanoporepolymerase-JAMG template complex (Fig. S6) was inserted into lipid bilayers. All four tagged nucleotides were
added at equimolar concentrations (3 M) and a constant 100 mV potential was applied. Expected tag capture levels
are highlighted on the right side of the plot with complementary C level in bold. Buffer contained 300 mM NaCl, 3
mM CaCl2, and 20 mM HEPES pH 7.5.
S22
Table S4. Percent probability of identifying the correct ternary complex capture event of the
complementary tagged nucleotide in the presence of all tagged nucleotides (dG6P-T30 = G,
dA6P-FL = A, dC6P-dSp3 = C, dT6P-dSp30 = T) under non-catalytic (Ca2+ ion containing)
buffer conditions.a
JAMGb | C
Total Eventsc
1087
256
144
% of G Captures in TCB
20.1390
% of A Captures in TCB
6.9444
% of C Captures in TCB
68.7500
% of T Captures in TCB
4.1667
Buffer containing 300 mM NaCl, 3 mM CaCl2 and 20 mM HEPES pH 7.5 was used to create buffer conditions to
inhibit polymerase catalysis for preventing sequential nucleotide addition to the DNA template. b DNA hairpin with
a G as the first position on the strand to be replicated (Fig. S6). c Capture events were measured at 100 mV applied
potential and identified as described in Methods (see Event Detection and Data Analysis). d Background dwell time
cutoff of 10-2 s was used as determined by tagged nucleotide background analysis (Fig. S9d). e Tag capture box was
defined as described in Methods (see Classification of Capture Events).
S23
Fig. S13. Two base capture under catalytic conditions. A nanopore-polymerase-template complex was loaded into
the bilayer followed by addition of 3 M of dC6P-dSp3, dG6P-T30, dTTP, and dATP in 300 mM NaCl, 0.1 mM
MnCl2, and 20 mM HEPES pH 7.5. A constant 100 mV potential was applied. (a) Current versus dwell time scatter
plot for two tag captures. Expected current levels for dC6P-dSp3, dG6P-T30 are shown. (b) Current versus time
trace for one pore in this experiment. Expected tag capture levels for dC6P-dSp3, and dG6P-T30 are shown. (c)
Dwell time histogram for all current blockade events for the tagged G/C nucleotide capture. (d) Residual current
histogram for all observed blockade events below 0.70 times the open channel. Expected G and C levels indicated
above the respective peak.
S24
Fig. S14. Example current versus time traces for captures of dG6P-T30 and dC6P-dSp3 during strand elongation.
Equimolar dATP, dTTP, dG6P-T30 and dC6P-dSp3 in Mn2+ containing buffer were added to pore-polymerasetemplate complex. Expected current blockade levels for captured dG6P-T30 and dC6P-dSp3 are highlighted on the
traces. Captures of the tagged G and C nucleotides can be observed in all traces. Each trace represents signal from a
different pore.
S25
Table S5. Frequency count of ternary complex capture events with full protein construct during
sequential nucleotide additions using two tagged nucleotides (dG6P-T30 = G, dC6P-dSp3 = C)
along with natural dATP and dTTP nucleotidesa under catalytic (Mn2+ ion containing) buffer
conditions.b
Translocation Signal
Frequency (cnt)
Total Eventsc
1159
824
749
% of G Captures in TCB
55.27
% of C Captures in TCB
44.73
Natural nucleotides also present (dATP, dTTP) did not produce capture events. b Buffer containing 300 mM NaCl,
0.1 mM MnCl2 and 20 mM HEPES pH 7.5 was used to create buffer conditions to promote polymerase catalysis for
sequential nucleotide incorporation on the DNA hairpin template. c Capture events were measured at 100 mV applied
potential and identified as described in Methods (see Event Detection and Data Analysis). d Background dwell time
cutoff of 10-2 s was used as determined by tagged nucleotide background analysis (Fig. S15). e Tag capture box was
defined as described in Methods (see Classification of Capture Events).
S26
Fig. S15. Comparison of residual current versus dwell time scatter plots for tagged nucleotide background events in
catalytic (magenta) and non-catalytic (green) conditions. After the 1:6 HL pore insertion, individual tagged
nucleotides were added to the chip. A constant 100 mV potential was applied and the background signal of tagged
nucleotides was observed. Buffer contained 300 mM NaCl, 20 mM HEPES pH 7.5 and either 0.1 MnCl2 (calalytic)
or 3 mM CaCl2 (non-catalytic) conditions. Individual tags were added at 3 M. The divalent cation had little effect
on the signal from free tagged nucleotides.
S27
Fig. S16. Examples of addition of all four tagged nucleotides. All tagged nucleotides in Mn 2+ containing buffer
were added to the pore-polymerase-template complex and the resulting current blockades were observed.
Deflections in current were observed at all four expected levels based on previously observed individual tag
captures. All panels represent different pores, except for the two at the bottom as the highlight shows. Current
blockade events were manually categorized as tag captures based on previously characterized current blockade level
and dwell time.
S28
As an initial test to discriminate between tagged nucleotide switching events and repeated tag
captures of the same nucleotide, we conducted a frequency analysis of ternary complex capture
event transitions using two tagged nucleotides (dG6P-T30 and dC6P-dSp3) along with natural
dATP and dTTP nucleotides under catalytic (Mn2+) sequencing conditions. Capture events were
measured at 100 mV applied potential and identified as described in Methods (see Event
Detection and Data Analysis). Our statistical analysis of wait time distributions concluded that
the wait time for NN transitions were clearly shorter than for the NM transitions, with
S29
average wait times of ~0.07 s compared to ~1.41 s respectively (Fig. S17). These results could
prove important for future base calling algorithms to correctly identify incorporated nucleotide
signal.
S30
Fig. S17. Comparison of wait time characteristics of ternary complex capture events during sequential addition of
nucleotides. All experiments were performed as described in Fig. S13. Box-and-whisker plots were used to
discriminate the four possible transitions (NN = {GG, CC}, NM = {GC, CG}) during template
progression. The central red mark represents the median, while the bottom and top blue edges of the box are the 1 st
and 3rd quartile median values respectively. The whiskers extend to the lowest and highest values within 1.5 IQR of
the 1st and 3rd quartile medians. Events are collected over at least 3 independent experiments (Table S5).
S31
S32
Fig. S18. Pore stability during nanopore experiments. (a) During the first 200 s of data acquisition tag capture
events are present throughout, indicating a functional pore complex. (b) After the first voltage cycle, the pore seems
to clog, which is indicated by a low ion current signal (~1 pA). (c) After the second voltage cycle, the pore clog was
eliminated by the voltage fluctuation, possibly ejecting a jammed tag because if the polarity change. (d) After the
third voltage cycle, the pore seems to be clogged again similarly to (b), but after about 125 s the jammed entity
leaves the pore, indicated by the return of the stable open channel current and the frequent capture events. (e) After
the fourth voltage cycle, the pore shows capture events for the first 80 s, but after that it is clogged again. (f) Finally,
after the fifth voltage cycle, the pore is functional again for the full 200 s as indicated by multiple tag capture events.
Note, that capture events are present even after 40 min indicating that this ON/OFF voltage protocol can maintain
pore life during sequencing data acquisition.
S33
8. Chip Reusability
We have tested the reusability of our metal-oxide-semiconductor (CMOS) based electrode arrays
for nanopore measurements by regenerating the chip using an automated cleaning protocol.
Briefly, after a full experiment composed of (1) lipid bilayer formation, (2) pore complex
insertion, (3) tagged nucleotide addition and finally (4) data acquisition while applying 100 mV
constant voltage for 5 min. An automated syringe pump (Tecan, Mnnedorf, Switzerland) was
utilized to deliver reagents into the microfluidic chamber of the CMOS chip. First, 50 L of
hexane (Sigma-Aldrich, St. Louis, MO) was pumped into the chamber to remove all lipid bilayer
residues from the electrode surface. This was then followed by a 50 L wash with pure ethanol
(Crystalgen, Commack, NY). Finally, the chip chamber was washed with 100 L buffer solution
(300 mM NaCl, 3 mM CaCl2 and 20 mM HEPES pH 7.5). This three-stage washing cycle was
repeated five times. All injection flow rates were at 100 L/s. Software control was implemented
in Python, which interfaced with the pump via RS 232 communication protocol.
After these cleaning steps, the success of regeneration was evaluated by demonstrating that new
lipid membranes can be formed over the electrode arrays and by confirming viable nanoporepolymerase construct insertions into this membrane. Next, by applying a trapezoidal waveform
(stage 1: linearly increasing voltage ramp from -50 mV to +100 mV with a rate of +75 mV/s,
stage 2: constant +100 mV DC for 3 s, stage 3: linearly decreasing voltage ramp from +100 mV
to -50 mV with a rate of -75 mV/s), we also confirmed successful pore complex insertion into the
membrane by observing stable open channel current (~30 pA) at the majority of the electrodes
indicating single pore insertions. We performed three successive regeneration cycles for the
same CMOS chip obtaining 105 single pores in the first cycle, 79 in the second and 21 in the
S34
third cycle. Our results indicate that the number of single pores inserted in the membrane
decreases with each regeneration cycle, suggesting that improved regeneration methods may be
beneficial.
S35
SI APPENDIX REFERECNES
1.
Kleiger G, Saha A, Lewis S, Kuhlman B, Deshaies RJ (2009) Rapid E2-E3 assembly and
disassembly enable processive ubiquitylation of cullin-RING ubiquitin ligase substrates.
Cell 139(5):95768.
2.
Zuker M (2003) Mfold web server for nucleic acid folding and hybridization prediction.
Nucleic Acids Res 31(13):34063415.
S36