Académique Documents
Professionnel Documents
Culture Documents
Applied DSP and VLSI Research Group, Medical BU, 10815 Rancho BemardoRd, Suite 210,
115 New Cavendish Street, London, WIW 6 U W ,UK San Diego, CA 92 127, USA.
FMY(.L”Z
V-358
A HB ND TDL structure was used to realise the two halfband 3. DSP ARCHITECTURE
allpass filters. These structures are the same as the first-order The computational tasks of the CODEC can be partitioned into
allpass structures shown in Figure6 with the single delayors those operating at the modulator rate (1.28 MHz) and those
replaced by double delayors. The outputs of each allpass filter are operating at significantly lower speeds (between 2 0 W z and
separately squared then summed to produce the AGC output. 8OWz). The former are implemented by custom processors
whose architecture has the same form as their behavioural
2.6 Interpolator structure. The latter processing are undertaken by the Low-Speed
The interpolator translates the highpass-filtered and scaled input Processor (LSP) so named because it takes care of all the lower-
!?om the 20 ksps input rate by a factor of 4 to 80 ksps in two stages rate processing with the exception of the Slink differencer which
each implemented using polyphase filters. A simple zero-order- requires a 23-hit wordlength bigger than the 20 bits of the LSP.
hold (ZOH) register effects the final interpolation of 8. The overall block diagram of the architecture of the CODEC DSP
Both stages use the structure shown in Figure 1I to increase the is shown in Figure 13.
sample rate by a factor of 2. This is based on a halfband two-path
polyphase lowpass filter operating at the output rate. Since this is
equivalent to zero-insertion interpolation (ZII), there is a system r"S""I rn<nr.rr Slink
gain of 0.5. This is compensated by the removal of the 0.5 scaling
TO SrrYl ,..rhri
factor normally at the output of the filter. Because of the ZII and
sample offset in one path, the final addition is merely an
interleaving of the outputs !?om each allpass filter.
Allpas Filler
J+
v-359
multiplier utilisation! The two-point averager allows decimator (20 kHz) so the toggling of locations is effected by locating the
recombination and HPF gain adjustment to be done on-the-fly delayor pairs adjacent to each other on even address boundaries.
without wasting cycles in the main DMAC. A simplified view of For these address locations, the LSB of the address is
the architecture of this processor is shown in Figure 14. The basic automatically toggled at the end of each processing frame.
datapath is 20 bits wide which is necessary to keep the round-off
3.3 Control
noise well below the noise floor of the modulators. To remove any
The operation of the LSP is controlled by a microcode sequence
danger of word wrap, a guard bit is added at the output of the
which decodes the cycle counter of the Master Timer. A separate
differencer and another at the output of the multiplier. The two
microcode counter is not used. Since the LSP operates at a
guard bits are removed by the limiters which clip any overshoot at
640 kHz rate. there is a maximum of 32 processor cycles per
the output of the DMAC and averager. The output of the
Nyquist-rate sample operating at 20 kHz. Of these, 28 are actually
multiplier is rounded back to 19 fractional bits by the rounder
which used either the round-to-zero or the convmgent rounding used. For the remaining cycles the processor is suspended to
reduced power dissipation. The microcode is synthesised to gates
algorithm under microprogram control. The implicit division by 2
to minimise power and area.
wired into the averager uses convergent rounding. The state
At I .25 V it consumes only 80 pl per cycle when implemented on
variables are stored in a triple-port RAM (TRAM) with two
readlwrite addresses and one read-only address. Since only 29 the Zarlink 0.35 vm mixed-signal process. The whole processor,
locations are required, this is easily implemented in standard logic. including the digital Z-A modulator, consumes 90 pA at 1.25 V.
lnpuf fmm Scnd Inkrfncc loput hom Slink Ar~umuIalor
3.4 Design-for-Test Circuitry
Two areas of the LSP were identified as being difficult to test: the
TPRAM and the DMAC. Therefore, additional Dff hardware was
added to improve testabilitj' of these. In both cases, the test signal
input are generated intemally and the outputs are compressed to a
digital signature which are output serially. Both circuits are tested
using the normal clock strobes allowing at-speed testing and the
detection of timing-related faults. Since both circuits operate at a
640 WHZ rate with the T R A M read on the first half cycle and the
DMAC read during the second half cycle, the two serial outputs
are multiplexed to give a single serial signature output operating at
1280 Wz. Since the generation of the signatures is done with
dedicated circuitry, the signature analysis can mn during normal
operation and provide additional observability.
The digital loopback path is extremely useful for functionally
testing the digital circuitry in that the loop is entirely digital and
therefore the response can be predicted right down to the LSB.
These techniques achieved a fault coverage of over 98%.
4. POWER SAVING TECHNIQUES
The following general comments can be made:
Extremely effective multirate filtering can be achieved using
the ND-TDL allpass-based polyphase structure.
The DMAC processor architecture is capable of implementing
not only allpass filters hut also resonators and highpass filters
in a single cycle.
Clock gating two-phase latch-based timing offer lower power
Figure 14: LSP Architecture
and area compared with fully synchronous solutions.
In the allpass filters used in the differential Hilbert filter and the Signature-based BIST techniques provide a high degree of
NHB allpass filters used in the first suh-stage of the second stage fault coverage without the need for power-hungry scan paths.
decimator, the delayors occur in pairs. It is necessary to toggle the
mapping between the state variables and the RAM pair locations REFERENCES
on each cycle of the filters. Thus the second delayor is read on [ I ] Morling, R, I. Kale and C.W. Tsang, "The design of a sigma-
each cycle and the new feedfonvard or feedback sample is written delta CODEC for mobile telephone applications", Proc 2nd
back later in the same cycle. On the next execution of the filter, Int. Conf on Advanced A-D and D-A Conv. Techniques &
the addresses are swapped so that the first delayor becomes the their Appls. (ADDA '94),pp, 11-17. Cambridge, UK, July 94.
second and vice versa. Consequently, the content of the first [2] Kale, I., N.P. Murphy and M. V. Patel, "On establishing the
delayor is automatically available as the second delayor without bounds for binary scaled coefficients of fifth and seventh
the need for housekeeping transfers between memory locations. order polyphase halfband filters", Proc. IEEE Int. Symp. on
For the MiB filter operating at 40 Wz, this is simply implemented Ccts.&Sysfs.(lSCAS'94). vol.2, pp.473-476, London, May 94.
in the microcode by reversing the RAM locations in the second (31 Morling, R, & I Kale, "Dynamic range of allpass filter
invocation of the filter in the microcode. The differential Hilbert stmctures", Proc. IEEE In!. Symp. on Ccfs. & Sysfs.
filters, however, are only executed once per program cycle (ISCAS'O2). vol. 4, pp. 433-436, Phoenix, May 2002.
V-360