Vous êtes sur la page 1sur 13

Indian Journal of Engineering & Materials Sciences

Vol. 19, October 2012, pp. 307-319

VLSI implementation of high throughput MIMO OFDM transceiver for


4th generation systems
J Raja* & M Kannan
Department of Electronics Engineering, MIT, Anna University, Chennai 600 044, India
Received 24 February 2012; accepted 13 August 2012

This paper aims to maximize throughput by minimizing power as minimum as possible. Scores of optimization
techniques such as FFT, IFFT and memory optimization are available for reducing power of mobile OFDM systems. An
approach for achieving reduction in power of MIMO OFDM system by optimizing FFT architecture is addressed in this
paper. Memory references in MIMO OFDM transceiver are costly due to their long delay and high power consumption. To
implement fast Fourier transform (FFT) algorithms on MIMO OFDM, involves many memory references to access butterfly
inputs and twiddle factors. Conventional FFT implementations require unused memory references to load the same twiddle
factors for butterflies from different stages in FFT diagrams. To minimize memory references due to twiddle factors for
implementing FFT algorithms in MIMO OFDM systems, memory reference reduction method is incorporated here. Twiddle
factor is calculated using binary scaling technique. The proposed FFT structure is the combination of memory reference
reduction method with binary scaling technique and Radix-4 booth multiplier. Here multipliers in FFT are realized with
shifters, adders and subtractors. The proposed structure is evaluated using performance parameters such as BER and SNR.
Structural realization and analysis pertaining to timing, power and throughput are implemented in Virtex-4 and analysis is
carried out in Altera respectively.

Keywords: Multiple input multiple output (MIMO), Orthogonal frequency division multiplexing (OFDM),
Fast Fourier transform (FFT), Inverse fast Fourier transform (IFFT), Inter symbol interference (ISI)

Multiple input multiple output (MIMO) system section for wireless specification, but the difficulty is
consists of multiple antennas at the transmitter and hardware complexity3. The reported architecture4 is
receiver ends to improve link reliability and data rates implemented in matrix inversion circuit in MIMO
of the wireless communication system. Orthogonal detection wireless specification, the problem is the
frequency division multiplexing is efficient in increase in hardware complexity and throughput is
synchronizing the received signal under fading less compared with conventional approach. Variable
environment and has been used in past times in length FFT processor design based on Radix 2, Radix
applications that require a huge data rate. Fast Fourier 4 and Radix 8 with complex multipliers containing 3
transform (FFT)/ inverse FFT (IFFT) processors are multiplications described earlier5, consumes more
proposed for multiple-input multiple-output power.
orthogonal frequency division multiplexing based ASIC based MIMO for OFDM provides data rate
IEEE 802.11n. Here the processor not only supports of 192 Mbps with a 20 MHz bandwidth for IEEE
the operation of FFT/IFFT but also provides sufficient 802.11a standard. Here the paper mainly focuses on
throughput rates but the drawback is hardware silicon complexity of MIMO OFDM system.
complexity is more, compared with conventional Throughput of the system is very less compared with
approaches1. FFT architecture which is optimized for other systems6. MIMO OFDM Base band transceiver
a processor with a memory system containing a cache implementation is based on ASIC. Verification is
is discussed elsewhere2. The processor uses a cached based on testbed and GUI monitor. Here the authors
memory with increased efficiency and performance have developed separate testbeds for MIMO OFDM
for MIMO system, but the achieved throughput is system for verification purpose but haven’t
very less. The architecture is mainly focused on concentrated on throughput and BER7. In order to
minimum mean square error detector in decoding meet IEEE 802.11n requirements8,9 the processor not
_______________________
only supports the operation of FFT/IFFT in 128 points
*Corresponding author (E-mail: s.t.jayamani@gmail.com) and 64 points but can also provide different
308 INDIAN J. ENG. MATER. SCI., OCTOBER 2012

throughput rates for simultaneous data sequences. The The data is modulated into waveform at inverse
difficulty is number of point’s increases more and fast Fourier transform (IFFT) stage of the transmitter.
more. Hence, the cost and complexity of the system is The IFFT is used to modulate each sub-channel onto
also increased. IEEE 802.11a established for WLAN the appropriate carrier. The receiver performs the
standard, provides 54 Mbps throughput using SISO vice-versa process of the transmitter. FFT is used to
OFDM transceiver. Here the throughput of SISO demodulate the data.
OFDM is very less; to increase the throughput, One of the main drawback in wireless
MIMO OFDM is preferred10. The IEEE 802-11n communication systems is ISI which is caused due to
provides data rate up to 600 Mbps with transmission the multi-path reflections in the channel. In order to
speed of 80 MHz. Latency is incurred by reduce ISI a cyclic prefix is added. The added cyclic
pre-processing the channel matrices for MIMO prefix is a part of the first stage of a symbol which is
detection11. Different approaches for design and described in Fig. 2. This addition is necessary because
implementation of 4×4 MIMO OFDM transceiver it enables multi-path representations of the source
which provides date rate of 1 Gbps is discussed signal to fade, so that it won’t interfere with the
elsewhere12-15. Here MMSE MIMO detector is used to subsequent symbol.
reduce the latency but the disadvantage is that it After the addition of cyclic prefix to the sub-carrier
requires larger circuit for high speed transmission. In channels, it must be transmitted as a single signal.
any n bit 2’s complement multiplier the maximum Thus, the process of parallel to serial conversion stage
number of addition and subtraction operation is n/216. is done by adding all sub-carriers and combining them
However, the above mentioned results are still into a single signal. At the end of this process all
insufficient in day to day requirements like sub-carriers are generated perfectly in a simultaneous
transmission speed and power. In this paper, the VLSI manner.
implementation of MIMO OFDM transceiver suitable
for low power application has been studied. The FFT MIMO OFDM
optimization helps to reduce the delay time, power MIMO OFDM (multiple input multiple output,
thereby simultaneously increasing speed. orthogonal frequency division multiplexing) is a
technology that combines MIMO and OFDM together
Orthogonal Frequency Division Multiplexing to transmit data in wireless communications, in order
Orthogonal frequency division multiplexing to deal with frequency selective channel effect. The
(OFDM) is a technique which is a subset of frequency OFDM signal on each subcarrier can overcome
division multiplexing in which a single channel uses narrowband fading. Therefore, OFDM can transform
multiple sub-carriers on neighbouring frequencies.
The sub-carriers of an OFDM system overlap each
other. Normally the neighbouring channels which
overlap may obstruct one another. Since the sub-
carriers in an OFDM system are exactly orthogonal to
one another, they are capable of overlapping without
interfering. As a result, OFDM systems are capable of
maximizing spectral efficiency without causing
neighbouring channel interference. Since multiple
sub-carriers are used to transmit in an individual
channel, an OFDM communication system must Fig. 1 OFDM structure
perform several steps, which are described in Fig.1.
In an OFDM system, each channel is divided into a
number of sub-carriers. A serial bit stream is converted
into several parallel bit streams. After the bit stream
division among the individual sub-carriers, each sub-
carrier is modulated. The receiver performs the vice-
versa process, which first divides the incoming signal
into appropriate sub-carriers and then demodulate them
individually before recovering the original bit stream. Fig. 2 Cyclic prefix insertion format
RAJA & KANNAN: VLSI IMPLEMENTATION OF HIGH THROUGHPUT MIMO OFDM TRANSCEIVER 309

frequency-selective fading channels into parallel flat Calculation of FFT is done using decimation in
ones. Then by combining MIMO and OFDM time (DIT) or decimation in frequency (DIF)
technology together, MIMO algorithms can be algorithm. If it is DIT, first twiddle factor is
applied in broadband transmission multiplied and later it is summed up. It is computed
A MIMO OFDM system transmits data modulated with the help of Eq. (2). The butterfly diagram
by OFDM from multiple antennas simultaneously. At pertaining to DIT is shown in Fig. 4.
the receiver, after OFDM demodulation, the signals
N −1 N −1 N −1
are recovered by decoding each sub-channel from all X ( k ) = ∑ x (n)W Nnk = ∑ x(n)W nk
N + ∑ x(n)W nk
N
transmitting antennas. n=0 n = 0 ( even ) n = 0 ( odd )
N N
In the basic structure of MIMO OFDM shown in 2
−1
2
−1 … (2)
Fig. 3, the signals are modulated by OFDM = ∑ x(2r )W
r =0
2 rk
N + ∑ x( 2r + 1)W N( 2 r +1) k
r =0
modulator, then they are transmitted by MIMO N
−1
N
−1
system, finally, the signals are recovered by the 2 2

OFDM demodulator.
= ∑ x (r )W
r =0
1
rk
N +W k
N ∑x
r =0
2 ( r )W Nrk
2 2

Therefore, MIMO OFDM achieves spectral = X 1 ( k ) + W Nk X 2 ( k ) (k = 0,1,2,  N − 1)


efficiency, increased throughput and thus the
inter-symbol interference (ISI) can be reduced. If it is DIF, first the input is summed up and then
the twiddle factor is multiplied. It is computed using
Fast Fourier Transform the Eq. (3). The butterfly diagram pertaining to DIF is
Conventional FFT algorithm shown in Fig. 5.
The modulation of OFDM can be done using an
N
IDFT. The fast implementation of IDFT is N −1 2
−1
N −1

accomplished by deploying IFFT, which reduces


X (k ) = ∑ n=0
x ( n )W Nnk = ∑
n=0
x ( n )W Nnk + ∑ N
x ( n )W Nnk
n=
2

processing time and hardware usage. The N


2
−1
N
2
−1 N
N
∑ ∑
(n+ )k

demodulation is through an DFT or better by using = x ( n )W Nnk + x(n + )W N 2


n=0 n=0 2
FFT, which is an efficient way of implementation. N
2
−1
 N
k N  nk
In order to reduce numerous calculations involved = ∑ 2
 x(n ) + W N x (n + ) W N
2 
n=0 
in calculation of DFT, FFT is used. To achieve the N
2
−1
 N  nk
increased efficiency an additional step is involved = ∑  x ( n ) + ( − 1)
n=0
k
x (n + ) WN
2 
( k = 0 ,1,  N − 1)

(to reverse order the data). These additional steps … (3)


won’t increase the computational complexity of the
FFT calculation. As a result, FFT is an extremely
efficient algorithm that provides a good The conventional N point FFT, requires (N/2) log2
implementation in hardware.
N multiplication operations and N log2 N addition
If N is very large, the computation efficiency is
operations. If it is N bit multiplier, it will generate 2N
increased by dividing DFT successively in the smaller
partial products. The entire FFT butterfly module of
calculation. The DFT of discrete signal x(n) can be
computed directly using Eq. (1).
N −1
X ( K ) = ∑ x(n)W nkN , k=0,1,2,3............N-1 … (1)
n=0

Fig. 3 MIMO OFDM structure Fig. 4 Radix 2 DIT FFT butterfly diagram
310 INDIAN J. ENG. MATER. SCI., OCTOBER 2012

Fig. 6 Modified DIT FFT butterfly diagram

For example, in DIT butterfly diagram shown in


Fig. 5– Radix 2 DIF FFT butterfly diagram
Fig. 3, the butterflies with twiddle factor W160 appears
in Stage 1, Stage 2, Stage 3 and Stage 4. Hence, these
twiddle factors are grouped together and computed in
any order in the 16-pt radix-2 DIT FFT diagram. This
can be achieved using the below identities (5)-(8).
m N/ 4 m−N / 4 m−N / 4
W =W .W
N N N
= − jWN m (N/4, N/2) … (5)
=W .W = −W
N/2
N
m−N / 2
N
m−N / 2
N
m (N/2, 3N/4) … (6)
=W3NN/4.WmN−3N/4=jWmN−3N/4 m (3N/4, N) … (7)
=W m
N
m (0, N/4) … (8)
The butterflies with twiddle factor which W164
appears in Stage 4, Stage 2 and Stage 3 are computed.
Hence, twiddle factors are grouped together and
computed in any order in the 16-pt radix-2 DIT FFT
Fig. 5a Block diagram of conventional FFT butterfly module
diagram. Similarly, the butterflies with twiddle factors
conventional FFT is shown in Fig. 5a. Due to the W162 and W166 appears only in Stage3 and Stage4 of the
partial products the delay is increased and it consumes 16-pt radix-2 DIT FFT diagram. These twiddle factors
more power. Hence, it will affect the throughput of are grouped together and computed, without affecting
the mobile system. Thus, there is a need for reducing the computations of other butterflies. The butterflies
the consumed power with decreased delay thereby can be computed together by loading only one twiddle
dragging down the throughput of the system for m
which we propose modified FFT algorithms which
factor W N
m and it is described in Eqs (9)-(11).
helps to overcome the above mentioned issues. m = (n mod N/2i-1)x2i-1 … (9)
( n + ( N / 2 i +1 )) mod( N / 2 i −1 )) x 2 i −1 k ( n mod N / 2 i −1 )) x 2 i −1
Modified fast Fourier transform algorithm W N
=− jW N … (10)
During FFT implementation identical twiddle ( n mod N / 2 i −1
)) x 2 i −1 ( n mod N / 2 i −1 )) x 2 i −1
factors occupies enormous memory. In this paper W N
=− jW N … (11)
identical twiddle factors are grouped together and
computed, before computing the other twiddle factors. where i is the number of stages.
The twiddle factors are fed in to the structure only Finally, twiddle factors W161, W163, W165 and W167
once and the amount of memory due to the identical appear only in Stage 4. These twiddle factors are
twiddle factors can be minimized. The twiddle factors grouped together and computed, without affecting the
are computed using the binary scaling method. In computations of other butterflies. Each twiddle factor is
binary scaling the twiddle factors are calculated by loaded only once and redundant memory references for
simply shifting the bits. The entire multiplication identical twiddle factors are removed which is shown in
operation is completely replaced by shifting Fig. 4. Twiddle factor W165 can be replaced by -jW161
operation. with a simple derivation as described in Eq. (12).
RAJA & KANNAN: VLSI IMPLEMENTATION OF HIGH THROUGHPUT MIMO OFDM TRANSCEIVER 311

W16 5 = W16 –j*(2π/16)*5 The above problems are nullified by using


= (W16 –j*(2π/16)*4 ) * (W16 –j*(2π/16)*1) modified Radix-4 booth algorithm. It requires only
= -jW16 1 … (12) N/3 partial product. The procedure of Radix-4 booth
6 7 algorithm is described as: (i) if N is even then extend
Similarly, twiddle factors W16 and W16 can be
the sign bit, (ii) append a 0 to the LSB of the
replaced by -jW162 and -jW163, respectively.
multiplier and (iii) based on the block value, each
The entire FFT butterfly module of modified FFT
partial product will be 0, +X, -X, +2X and -2X. The
is shown in Fig. 6a. Modified FFT is the combination
negative values of X are obtained by taking the 2’s
of binary scaling, which will convert floating point in
complement. Based on the above result the booth
to fixed point and Radix 4 multiplier, which will
encoded table is developed and tabulated in the Table 2.
generates N/3 partial product.
The recoding is done by encoding three bits of
Here, binary scaling technique is used to calculate
multiplier X.
the twiddle factor. Binary scaling is a programming
The proposed FFT processor described above has
technique used mainly in DSP to perform a pseudo
been implemented using Xilinx Virtex-4 FPGA.
floating point using integer arithmetic. It is both faster
Based on the analysis, device utilization summary is
and more accurate than directly using floating point.
given in Table 3. From the results, we can observe
In binary scaling the floating point is multiplied with
that the modified FFT algorithm requires only 10% of
256, which will convert floating point in to fixed
slices, 11% of flip flops and 7.5% of LUTs compared
point and it is given in Table 1.
to the conventional FFT Algorithm.
Radix-4 multiplier
Table1 Binary scaled value
The complexity of conventional FFT is dominated
by multiplier. Ordinary multipliers generate 2N partial S.No Twiddle factor Binary scaled value
products. Due to the partial products, the latencies and 1 1+j0 256+j0
2 0+j1 0+j256
power consumption of the MIMO OFDM transceiver 3 0-j1 0-j256
are increased. The Radix-4 booth multiplier generates 4 0.707+j0.707 181+j181
N/3 partial product. It consumes less power and short 5 -0.707-j0.707 -181-j181
delay because of its high speed parallel multiplier. The Table 2 Radix-4 encoded table
proposed FFT structure is the combination of binary
scaling technique and Radix-4 booth multiplier. Block Partial product
In general Radix-4 booth multiplier is preferred rather 000 0*X
than Radix-2 booth multiplier because of the following 001 1*X
disadvantages of Radix-2 booth multiplier: (i) the number 010 1*X
011 2*X
of addition, subtraction and shift operations in Radix-2 100 -2*X
booth algorithm are variable and it is very difficult to 101 -1*X
incorporate in designing of parallel multipliers, (ii) the 110 -1*X
Radix-2 multiplier becomes suitable only when there are 111 0*X
isolated 1’s and (iii) the number of partial product is N. Table 3 Device utilization summary

Modified FFT Conventional FFT


algorithm algorithm
Logic Used Available Used Available
Utilization
No of slices 74 6144 782 6144
No of slices 92 12288 1040 12288
Flip flop
No of 4 input 142 12288 1080 12288
LUTS
No. of 187 240 321 240
bonded IOBs
No of 1 32 1 32
GCLKS
No. of DSP 12 32 32 32
Fig. 6a Block diagram of modified FFT Butterfly Module 48s
312 INDIAN J. ENG. MATER. SCI., OCTOBER 2012

Throughput of the mobile system entirely depends periodically adjusting the phase and amplitude. Four-
on the power consumption and delay. The proposed QAM uses four combinations of phase and amplitude.
system consumes less power and short delay. Hence, Moreover, each combination is assigned a 2-bit digital
the throughput of the mobile system is increased. pattern. For example, to generate the bit stream
Based on the results, the performance of the MIMO (1,1,0,0,1,1). Because each symbol has a unique 2-bit
OFDM is compared and given in Table 4. digital pattern, these bits are grouped in two’s so that
they can be mapped to the corresponding symbols. In
Results of Simulation our example, the original bit stream (1,1,0,0,1,1)
Code development and simulation were carried out grouped into three symbols (11,00,11). The
on a system running on Intel P4 configuration having constellation plot in Fig. 7 shows the symbol map of
4 GB RAM working in windows 7 Platform. The 4-QAM with each possible phase (Θ) and amplitude
MIMO-OFDM structure for determination of (A) of a carrier signal in polar coordinate form.
constellation points and BER were coded and Bit error rate (BER) is defined as the ratio of
simulated in MATLAB. Structural realisation of number of error bits and total number of bits. There
MIMO OFDM transceiver is implemented in Xilinx. are many ways of reducing BER. Here, we focused on
Analysis pertaining to timing, frequency is done in Alamouti code and modulation techniques. In this
Altera. The experimental results and analysis obtained study we have taken AWGN is communication
using Matlab, Xilinx and Altera are presented and channel where noise gets spread over the entire
discussed. spectrum of frequencies. BER has been calculated by
MATLAB implementation output comparing the transmitted signal with the received
QAM able to carry higher data rates than the signal and computing the error count over the total
other schemes. QAM sends digital information by number of bits. For any given modulation, the BER is
normally expressed in terms of signal to noise ratio
Table 4– Performance Comparisons (SNR). The bit error rate is improved from 10-6 with
S.No Parameter MIMO Mixed Radix Modified the almost need of 10 dB of SNR as shown in Fig. 8.
OFDM FFT FFT
architecture architecture FPGA implementation output
based based Table 5 shows the percentage minimization of
MIMO MIMO power, memory references, space in bytes and clock
OFDM OFDM
cycles, and Table 6 shows the reduction in number of
1 Power dissipation 597 mW 540 mW 111 mW slices, Flip flops, LUT’s and I/O blocks.
2 Number of 32 24 15
memory reference
3 Storage space in 32 30 12
bytes
4 No. of clock cycle 698 600 540
5 Throughput 600 Mbps 700 Mbps 1.4 Gbps
6. Delay 9.43 ns 8.89 ns 4.04 ns
7. Logic elements 9940 9813 5844
Table 5 Percentage reduction of parameters compared with
conventional approaches
Sl. No Parameter A B C
1 % of power 90 20 5
dissipation
2 % of 75 45 11
memory
reference
3 % of space 94 38 9
in bytes
4 % of clock 86 77 19
cycle
A – Mixed Radix FFT + MIMO OFDM Transceiver;
B – Modified FFT + MIMO OFDM Transceiver
C – Modified FFT + SISO OFDM Transceiver Fig. 7 QPSK signal constellation points
RAJA & KANNAN: VLSI IMPLEMENTATION OF HIGH THROUGHPUT MIMO OFDM TRANSCEIVER 313

Table 6 Device utilization expressed in percentage Synthesis output


SISO OFDM RTL view
Sl. No Utilization A B The OFDM structure was synthesized and
1 % of Slices 3 13 simulated in Vertex-4 as shown in Fig. 9. The
2 % of Slices flip flop 1 8 proposed method can achieve average of 11%
3 No. of 4 Input LUTS 1.1 9 reduction in number of memory references, 9%
4 % of bonded IOBs 78 99.99 saving of memory space due to the twiddle factor and
A – Modified FFT; B– Conventional FFT average of 19% reduction in number of clock cycles.
MIMO OFDM RTL view
MIMO OFDM structure was synthesized and simulated
in Vertex-4 FPGA as shown in Fig. 10. The proposed
structure was constructed using four transmitting and four
receiving antennas. Transmitter outputs and receiver inputs
are combined using spatial multiplexing technique. The
proposed method can achieve average of 20% reduction in
number of memory reference, 38% saving of memory
space due to twiddle factor and average of 77% reduction
in the number of clock cycle as compared with
conventional approaches which is evident in Table 5.
FFT RTL View
To implement the conventional FFT method, it
requires 1040 number of FFs, 321 numbers of I/O
blocks and 1080 number of LUTs. Number of FFs
and number of LUTs decide the complexity and cost
Fig. 8 SNR versus BER which is shown in Fig. 11.

Fig. 9 OFDM transceiver RTL output


314 INDIAN J. ENG. MATER. SCI., OCTOBER 2012

Fig. 10 MIMO OFDM transceiver RTL output

Fig. 11 FFT RTL output

Proposed FFT RTL View Analysis output


To implement the proposed FFT method, it requires Power analysis
92 numbers of FFs, 187 numbers of I/O blocks and Power consumed by the MIMO OFDM
142 numbers of LUTs. When compared with conventional transceiver system is 111 mW is shown in Fig. 13.
method, the proposed method requires only 1% of FFs, 78% Total power is the sum of the static power, dynamic
of IOBs and 1.1% of LUTs. This is depicted in Fig. 12. power and I/O power.
RAJA & KANNAN: VLSI IMPLEMENTATION OF HIGH THROUGHPUT MIMO OFDM TRANSCEIVER 315

Fig. 12 Modified FFT with binary scaling technique and radix-4 Booth multiplier RTL output

Fig. 13 Power analysis Output


Timing analysis Simulation output
FFT Simulation output
Fmax of MIMO OFDM system is 235 Mhz. Based Simulation output of FFT is shown in Fig. 15. The
on the Fmax Throughput of the system is calculated binary inputs in0, in1, in2, in3, in4, in5, in6 and in7
as 1.4 Gbps. Fmax of MIMO OFDM transceiver is are passed through the butterfly units. Finally, the real
shown in the Fig. 14. part outputs are taken from outr0,outr1,outr2,outr3,
316 INDIAN J. ENG. MATER. SCI., OCTOBER 2012

Fig. 14 Timing analysis Output

Fig. 15 FFT simulation output

outr4,outr5,outr6 and outr7 and imaginary part reference reduction technique and higher radix
outputs are taken from outi0,outi1,outi2,outi3, multiplier. Finally, the real part outputs are taken
outi4,outi5,outi6 and outi7. from outr0,outr1, outr2, outr3, outr4, outr5, outr6 and
Modified FFT simulation output
outr7 and imaginary part outputs are taken from outi0,
outi1, outi2,outi3, outi4, outi5, outi6 and outi7.
Simulation output of modified FFT with binary
scaling radix-4 booth multiplier technique is shown in Simulation output
the Fig. 16. The binary inputs in0, in1, in2, in3, in4, Simulation output 4×4 MIMO OFDM transceiver
in5, in6 and in7 are passed through the memory is shown in Fig. 17. The inputs in, in2, in3 and in4 are
RAJA & KANNAN: VLSI IMPLEMENTATION OF HIGH THROUGHPUT MIMO OFDM TRANSCEIVER 317

Fig. 16 Modified FFT with binary technique and radix-4 booth multiplier simulation output

Fig. 17 MIMO OFDM transceiver Simulation output

passed through the modified FFT algorithm in the proportional to throughput. Maximum throughput is
transmitter section. The transmitted outputs are achieved by minimizing power as minimum as
spatially multiplexed and the transmitted data are possible. The proposed system power is 111 mW
received by receiver using MRC (maximum ratio which is minimum. Hence, throughput of the system
combining) techniques. Finally, the outputs are taken is increased as shown in Fig. 18.
from out1, out2, out3 and out4. Delay analysis
Performance analysis Proposed system generates short delay which is
Power analysis very less compared with other systems. Delay is
Proposed system requires less power which is very inversely proportional to the throughput. Maximum
less compared with other systems. Power is inversely throughput is achieved by minimizing delay as
318 INDIAN J. ENG. MATER. SCI., OCTOBER 2012

minimum as possible. The gate delay of proposed hardware. When compared with other system the
system is 4.04 ns which is very small. Hence, the proposed system requires very minimum number of
throughput of the system can be increased as shown in logic elements, hence the cost and complexity of the
Fig. 19. system is reduced as shown in Fig. 21.
Throughput analysis Power dissipation versus throughput
The throughput of the proposed system is 1.4 Gbps, Maximum throughput is obtained at minimum
this can be achieved with the help of modified FFT power that is 111 mW power. When the power
architecture. Modified FFT architecture reduces increases more and more correspondingly throughput
redundant memory reference, number of partial of the system gradually decreases as shown in Fig. 22.
products and number of non zero digits as shown in From the graph we conclude that the proposed system
Fig. 20. consumes less power and transmits maximum number
Logic elements analysis
bits per second.
Only 58% of logic elements is required to Memory reference versus performance parameters of different
implement the proposed system. The number of logic MIMO OFDM systems
elements determines the cost and complexity of the The main objective of this paper is to reduce the
power and increase the throughput. Memory reference

Fig. 18 Power versus different MIMO OFDM systems

Fig. 20 Throughput versus different MIMO OFDM systems

Fig. 21 Logic element analysis versus different MIMO OFDM


Fig. 19 Delay versus different MIMO OFDM systems systems
RAJA & KANNAN: VLSI IMPLEMENTATION OF HIGH THROUGHPUT MIMO OFDM TRANSCEIVER 319

throughput. The circuit was implemented in a 28 nm


(transistor gate size) library with less circuit area and
evaluated in lower power dissipation. This can be
achieved by optimizing FFT architecture (memory
reference reduction and binary scaling methods are
used to minimize redundant memory references) and
Radix-4 booth multiplier algorithm (Radix-4
multiplier technique is used to reduce the power
dissipation) which reduces number of partial products.
References
1 Fu Bo & Ampadu Paul, J Signal Process Syst, 56(1) (2009)
59-68.
2 Chang Y & Park S C, IEICE Trans Fundamentals, E87-A
Fig. 22 Power versus throughput (11) (2004) 3020-3024.
3 Kim Hun Seok, Zhu Weijun, Bhatia Jatin, Mohammed
Karim, Shah Anish & Daneshrad Babak, EURASIP J Adv
Signal Process, 2008.
4 LaRoche Isabelle & Roy Sebastien, An efficient regular
matrix inversion circuit architecture for MIMO processing,
IEEE Int Symp on Circuits and Systems (ISCAS), May 2006,
pp.4819-4822.
5 Lin Y T, Tsai P Y & Chiueh T D, IEE Proc Comput Digit
Technol, 152(4) (2005) 499-506.
6 Perels D, Haene S, Luethi P, Burg A, Felber N, Fichtner W
& Bolcskei H, IEEE Trans VLSI Syst, 5 (2005) 215-218.
7 Greisen Pierre, Haene Simon & Burg, EURASIP J Embedded
Syst, 2008, Article ID242584.
8 Reisis D & Vlassopoulos N, IEEE Trans Circuits Syst I,
55(11) (2008) 3438-3447.
9 Radhouane R, Liu P & Modlin C, in Proc. IEEE Int Symp
Circuits Syst, 1 (May 2000) 116-119.
10 Yoshizawa Shingo & Miyanaga Yoshikazu, VLSI
implementation of SISO-OFDM and MIMO-OFDM
Fig. 23 Memory Reference versus Performance parameters of transceivers, IEEE Int Symp Communtions and Information
Different MIMO OFDM Systems Technologies (ISCIT), No. T2D-4, Oct 2006.
11 Yoshizawa Shingo, Yamauchi Yasushi & Miyanaga
in FFT is costly due to the long delay and high power Yoshikazu, A complete pipelined MMSE detection
consumption. Memory reference reduction method is architecture in a 4x4 MIMO-OFDM receiver, IEEE Int
Symp on Circuits and Systems (ISCAS), May 2008, pp.
used in FFT to eliminate redundant memory 1248-1251.
reference. Based on the result the graph is plotted in 12 Yoshizawa Shingo, Yamauchi Yasushi & Miyanaga
Fig. 23. From the graph we conclude that the Yoshikazu, VLSI Architecture of a 4x4 MIMO-OFDM with
proposed method delivers 1.4 Gbps maximum an 80-MHz Channel Bandwidth Transceiver, IEEE Int Symp
throughput with minimal number of memory on Circuits and Systems (ISCAS), May 2009, pp. 4244-4247.
13 Yoshizawa Shingo, Yamauchi Yasushi & Miyanaga
references of 15 with the corresponding delay of Yoshikazu, VLSI Implementation of a 4x4 MIMO-OFDM
4.04 ns and the power is 111 mW. Transceiver for 1Gbps Data Transmission, IEEE Int Symp
on Circuits and Systems (ISCAS), May 2010, pp. 1743-1746.
14 Lamarca Rey F & Vazquez M G, IEEE Trans Signal
Conclusions
Process, 53 (3) (2009) 1741-1755.
In this work, we have presented a VLSI 15 Shin M & Lee H, A high-speed four-parallel radix-2 4
implementation of high throughput MIMO OFDM FFT/IFFT processor for UWB applications, Proc. IEEE Int.
transceiver for 4G system which achieves 1.4 Gbps Symp. Circuits and Systems, May 2008, pp.960-963.
16 Ma G K & Taylor F J, IEEE ASSP Mag, Jan 1990, pp. 6-20.

Vous aimerez peut-être aussi