Académique Documents
Professionnel Documents
Culture Documents
High-Speed I/O
Mark Horowitz
Computer Systems Laboratory
Stanford University
horowitz@stanford.edu
Copyright © 2007 by Mark Horowitz, with material from
Stefanos Sidiropoulos, and Vladimir Stojanovic
• Readings
– Techniques for High-speed Implementation of Nonlinear
Cancellation, Sanjay Kasturia and Jack H. Winters
• Overview:
– Your project will be the design of a circuit that processes the
input data from a high-speed I/O. This processing is
generally done in a mixed signal manner today, but your job
will be to build a digital implementation of the algorithm. This
lecture will try to give you some background about why I/O
rates are important, and what issues need to be resolved to
achieve high performance. The next lecture will discuss the
operation of the circuit you need to build.
CPU
FSB, HT
DDR, RDRAM
AGP, PCI-E FBDIMM
DVI, HDMI
Graphics System
Display Controller
Memory
Controller
>1GB/s >4GB/s >4GB/s
PCI-X
PCI-E
>0.1GB/s
Storage I/O
Network Controller
I/O
RTERM RTERM
Tx Rx
Channel
1 0 0 1 0 1 0
tbit /2
M Horowitz EE371 Lecture 2 6
Transmission Lines
H(s)
(transfer
function)
Frequency
• PCB Loss : skin & dielectric loss
– Skin Loss ∝ √f
– Dielectric loss ∝ f : a bigger issue at higher f
M Horowitz EE371 Lecture 2 8
Dealing With Current Return/References
• Wire Utilization:
– Single Ended
shared signal return path
– Differential -
+
explicit signal return path
ref
-
– “Pseudo” Differential +
Z2 – Z 1 2Z2
-------------------- --------------------
Z 1 + Z2 Z 1 + Z2
Z1 Z2
Z2
Z1
Z2
Package
On-chip parasitic
Package
(termination resistance and via
Line card trace device loading capacitance)
Backplane via
• Line attenuation
• Reflections from stubs (vias)
• Loss is variable
0
Attenuation [dB]
– Same backplane
– Different lengths -10 9" FR4
1
(skin-effect, dielectric loss)
0.8
0.6 Tsymbol=160ps
• Reflections – long latency
0.4 (impedance mismatches –
connectors, via stubs,
0.2 device parasitics, package)
0
0
M Horowitz 1 2 3
EE371 Lecture 2 15
ns
ISI
Error!
0.8
Amplitude
0.6
0.4
0.2
0
0 2 4 6 8 10 12 14 16 18
Symbol time
• Middle sample is corrupted by
– 0.2 trailing ISI (from the previous symbol),
– 0.1 leading ISI (from the next symbol) resulting in 0.3 total ISI
• As a result middle symbol is detected in error
M Horowitz EE371 Lecture 2 16
Equalization For Loss :
Goal is to Flatten Response
• Channel is band-limited
• Equalization : boost high-frequencies; or attenuate low freq
M Horowitz EE371 Lecture 2 17
Equalization Mechanisms
0.6
1
No equalization Tx equalization
0.8 0.4
Amplitude
Amplitude
0.6 0.2
0.4 0
0.2 -0.2
0
-0.4
0 2 4 6 8 10 12 14 16 18 0 2 4 6 8 10 12 14 16 18
Symbol time Symbol time
• Tx equalization
– Pre-filter the pulse with the inverse of the channel
– Filters the low freq. to match attenuation of high freq.
• Rx feedback equalization
– Subtract the error from the signal
M Horowitz EE371 Lecture 2 18
Removing ISI
Channel
50Ω 50Ω TapSel
Causal outP Logic
taps outN
d d
Decision-feedback equalizer
I eq 0
Attenuation [dB]
Anticausal taps
Peak power constraint
unequalized
Tx -5
Data
-10
equalized
-15
Channel
-20
Causal frequency [GHz]
taps -25
0 0.5 1 1.5 2 2.5
• Many sources
– On-chip
– Package
– PCB traces
– Inside connector
• Differential signaling can help
– Minimize xtalk generation & make effects common-mode
• Both NEXT & FEXT
– NEXT very destructive if RX and TX pairs are adjacent
• Full swing-TX coupling into attenuated RX signal
• Effect on SNR is multiplied by signal loss
– Simple solution : group RX/TX pairs in connector
– NEXT typically 3-6%, FEXT typically 1-3%
– No attenuation Feedback
0.8
equalization
Amplitude
0.6
0.4
Channel
50Ω 50Ω TapSel
Causal outP Logic
taps outN
d d
Decision-feedback equalizer
I eq 0
−1 + α −1 + α −α d n | d n −1 = 0
−1
−1 − α −1 − α dClk
Data_E
in
ref pre latch
Input
Data_O
10
Data_E
01 2 3 4 5
bit time (normalized to FO4)
d1
CKX
CKX
DI
CKC1 CKC2
d2 CKC1
on-chip
logic CKC2
DI
PLL/DLL
CKX
CKC CKX
DI
on-chip
logic
DI CKC
VCO VCDL
clk
clk
÷N
PD PD
ref ref
clk clk
Filter Filter
• Synchronous:
Same frequency and phase
t t
• Conventional buses
F0
• Mesochronous
Same frequency, unknown phase
• Fast memories
tA tB
• Internal system interfaces
• MAC/Packet interfaces tA≠ tB F0
• Plesiochronous:
Almost the same frequency
– Mostly everything else today F1 F2
F1≈ F2
data rcvr
logic
ref
CKSRC
data D0 D1 D2 D3
CKRCV
rcvr
logic D0 D1
DIN
CKR
DIN
CKR
CDR