Vous êtes sur la page 1sur 5

2016 2nd International Conference on Control, Instrumentation, Energy & Communication (CIEC)

QPSK Modulation and Demodulation Implementation on


CEVA XC-321 DSP for IEEE 802.11b Wi-Fi Phone
Application
Somnath Mondal, Santu Sardar and K. Ananda babu
Scientist, ANURAG, DRDO, Hyderabad, India
{somnath, santusardar, kababu}@anurag.drdo.in

AbstractThis work aims to design and implement Application Specic Integrated Circuit (ASIC) chips are
Quadrature Phase Shift Keying (QPSK) baseband modulation- used in WLANs to meet requests of reliability, low power
demodulation (MODEM) system on the CEVA-XC321 xed point consumption and low cost. In literature [6], a chip is realized
digital signal processor (DSP) and to compare its performance
with industry standard DSP processors and FPGA based in 0.25 m, 40 GHz BiCMOS technology for the WLANs
implementations. The modem function was developed in Vec-C standards 802.11a/b/g. A ip chip single die WiFi FEM is
language by utilizing two Vector Computation Units (VCU) of developed using (HBT+EID-PHEMT) technology for smart
CEVA and veried on CEVA-XC321 based emulation platform. phone application in [7]. In [8], an ASP (application specic
The implementation of the modem functions includes sampling, processor) with 512-bit SIMD (Single Instruction Multiple
NRZ encoding, QPSK modulation, data packet formation using
payload and packet header, receiver matched ltering, QPSK Data) and 192-bit VLIW (Very Long Instruction Word) archi-
demodulation and decision device. We have considered additive tecture optimized for wireless baseband processing was imple-
white Gaussian Noise (AWGN) channel to analyze the effect of mented in 130nm CMOS technology. This baseband processor
the channel on the transmitted data packet. The implementation provides 100 GOPS computing ability at 117.6MHz. But these
results and the theoretical results are consistent and ascertained chips dont have sufcient programming features and their
the correctness of this design. The modem parameters are
selected in accordance with IEEE 802.11b Wi-Fi standard. We functionality cant easily be changed or improved once the
have tested the QPSK modem implementation in conformance product is realized. Also they incur a huge development time-
to 802.11b Wi-Fi performance metrics and established its cycle. FPGAs may be an alternative ( [9][11]) but they are not
applicability in Wi-Fi phone system. The results showed that used in performance and power critical mobile applications.
the proposed DSP based QPSK implementation can greatly They can only be used for testing in pre-assessment phase
improve the performance efciency with less development time
and cost for high data rate wireless local area network (WLAN) [12].
applications.
In this paper, we present an optimized DSP based im-
Keywords - AWGN, CEVA, DSP, FPGA, modem, QPSK, VCU, plementation of QPSK baseband MODEM based on IEEE
Vec-C, Wi-Fi. 802.11b specications [13] and its implementation on CEVA-
DSP processor [14] with the aim to achieve optimum data-
I. I NTRODUCTION
rate as required by Wi-Fi phone application. CEVA-XC321
During early deep space program, phase shift keying (PSK) is a xed point DSP processor core which is being used
was developed and still being widely used in military and com- to develop a System on Chip (SoC) [14] by ANURAG. In
mercial applications [1] because it offers lowest probability of this SoC, CEVA core will be used for DSP intensive digital
error. QPSK and Binary PSK (BPSK) are two variants which baseband operations. Compared to work done on FPGAs in
offer low power and low BER data transmission rate, primary [9] or DSP chips in ( [6][8], [15]), proposed design in DSP
requirements of wireless and satellite communications. At core has advantages of ease of programmability, portability
same Bit Error Rate (BER) and xed bandwidth, QPSK can and short development life cycle, which are inuential to
transmit twice the data rate compared to BPSK. Therefore, the performance and capabilities of MODEM. Power aware
it is more spectrally efcient. In addition, QPSK requires features in the design and exibility to operate the chip at
less transmitter power than Quadrature Amplitude Modulation higher operating frequency has made the SoC suitable for Wi-
(QAM) to achieve same BER [2] and therefore it is more Fi hand-held applications.
power efcient scheme as well. With applications of WLANs
in various elds, there is an increasing demand for reliable, Rest of the paper is organized as follows. Section II gives
higher data rate WLAN at low cost ( [3][5]). The IEEE the functional description of CEVA-XC321 DSP. The details
802.11 standards committee published 802.11b standard in of QPSK baseband MODEM and its implementation method-
2.4 GHz (2.4 to 2.4835 GHz) ISM band [4], which provides ology on CEVA-DSP are presented in section III. Results are
WLANs Medium Access Control (MAC) and Physical Layer provided in section IV and concluding remarks are given in
(PHY) specications. section V.

978-1-5090-0035-7/16/$31.002016IEEE 476
2016 2nd International Conference on Control, Instrumentation, Energy & Communication (CIEC)

equation (1), where E= Bit Energy, T= bit time and S(t) stands
for symbol and is the carrier angular frequency.

2 2E
S(t) = cos(t + (t)) (1)
T
Here (t) is the phase as a function of information content,
and can have values of 45o , 135o , 225o , and 315o . And the
value of (t) is:

(t) = atan[sin(t + Im )/cos(t + Re )] (2)


where Re is the initial phase of the real channel and Im is
initial phase of the imaginary channel.
The QPSK baseband system requirements are evolved in
Fig. 1: CEVA XC321 Architecture Block Diagram accordance with IEEE 802.11b Wi-Fi standard (table I) and
as per Wi-Fi phone modem functions [13]. The design was
carried out for a carrier frequency of 11 MHz to achieve 2
II. F UNCTIONAL D ESCRIPTION OF CEVA DSP Mbps target data rate. The block diagram of QPSK modulation
CEVA-XC DSP is a fully programmable solution, which and demodulation system is shown in gure 3. Input binary
supports multiple air interfaces in software. It is a mix of data is generated as a pseudo-random (PN) sequence. In
VLIW and SIMD [17] architectures. The functional block dia- modulator, the baseband data is converted into even and odd
gram of the DSP is shown in gure 1. CEVA-XC321 combines unipolar data streams which are changed into bipolar by using
a General Computation Unit (GCU) with two Vector Compu- non return to zero (NRZ) encoding technique. This NRZ
tation Units (VCUs). Each GCU consists of: one 16x16-bit encoded data is combined with the carrier generated from
signed and unsigned multiplier, one 40-bit arithmetic unit, one (Digital Data Synthesizer) DDS and resulting in-phase (I) and
40-bit logical unit, one 40-bit bit-manipulation unit, including quadrature-phase (Q) data are added to generate QPSK signal.
a full barrel shifter and an exponent unit, one 40-bit data A BPSK modulated known symbol sequence of 24 Bytes, as
pack and unpack unit and eight 40-bit accumulators. Vector header, was appended in the beginning of the waveform for
Arithmetic (VA) unit, Vector Bit-manipulation (VB) unit, phase recovery and detection of start of data at the receiver.
Vector Move and Pack data (VM) unit, Vector register le
and optional vector maximum likelihood detector (VD) unit TABLE I. Specications of QPSK Modem
Sl. Parameter Value
are provided in each VCU. 1 Input Character (ASCII) used 32 - 128
CEVA provides speed-ups up to 1025x over pure-software 2 Input Length (Digital) used 256 - 1024 Bits
implementations. It can support WiFi, WiMAX, LTE and W- 3 Frequency Band 2.4 GHz
CDMA [19]. Tensilica ConnX Baseband Engine [20] and 4 MODEM Technique QPSK
5 Number of bit per symbol 2
NXP CoolFlux [21] are also licensable fully synthesizable 6 Targeted Bit Rate 2 Mbps
DSP cores for baseband processing. In comparison, Ceva uses 7 Channel Model AWGN
single-core to reduce design complexity. A single Ceva-XC 8 Channel Bandwidth 2 MHz
core can support next-generation 4G/LTE phones up to LTE 9 Samples/bit 8-12
10 Filter Type Low pass (LPF)
CAT5 (300Mb/s) [22]. 11 Filter taps 129
12 Filter cut-off frequency 4KHz (Speech Signal)
III. P ROPOSED QPSK BASED C OMMUNICATION S YSTEM 13 Matched Filter length 1200
14 Header Length 24 Bytes
QPSK uses equi-spaced four points around a circle on the 15 Payload Length 64 - 1518 bytes
constellation diagram and can therefore encode two bits per 16 Output Length (digital) 32 - 128
symbol as shown in gure 2. The bit-error probability of the 17 Output Character (ASCII) 256 - 1024 Bits
QPSK signal is:
 The modulated waveform is then passed through a channel
1 Eb
Pb = erf c[ 2 ] model ( [23], [24]) which injects AWGN noise with different
2 N0
signal to niose ratio (SNR) values and then fed to the demod-
In this equation, Pb stands for probability of bit-error, ulator to demodulate the modulated data.
erf c(.) is complementary error function, Eb is energy per bit The received signal at the demodulator is initially passed
and N0 is noise power spectral density (PSD). on to the matched lter of length 1200 to detect the start
QPSK can be seen as viewed as two independently mod- of the frame and extraction of the payload. The extracted
ulated quadrature carriers each being BPSK modulated [15]. symbols are sent sequentially to demodulation block. Here,
The four waveforms of different phases are obtained using the symbol sequence is rst given to I and Q channel of

477
2016 2nd International Conference on Control, Instrumentation, Energy & Communication (CIEC)

the demodulator to multiply with locally generated carriers. SNR values, the constellation diagram at the Receiver along
The mixed outputs are then passed through a low pass lter with the Constellation diagram at the transmitter are shown
of 129 taps to reject high frequency components. We have in gure 5. It can be observed from the constellation diagram
selected lter cut-off frequency of 4KHz to utilize it for that, as the SNR value decreases, more and more phase error
speech communication also. We used nite impulse response gets accumulated at the receiver. We plotted the eye diagram
(FIR) lter due to its linear phase characteristics within pass- also for the above situations in gure 6 since eye diagrams
band. The recovered waveform is then passed to the decision make it easy to nd the worst-case signaling conditions at the
device. In the transmitter all the input values were sampled receiving end. To maximize noise margins we must select the
twelve times. Therefore, at the receiver, decision is performed best sample point at the widest point in the eye and pick the
for all the sampled values and then aggregated to form the best digitization threshold as half-way across width. The eye is
nal output. This procedure reduces the bit-error rate (BER) more open in case higher SNR values which means that lower
compared to the case of without sampling. I and Q channel bits BER will be obtained for higher SNR. ISI effect is reduced
are concatenated after the decision block to get the recovered here because after every 12 samples the next symbol starts and
bit stream. therefore one symbol is not affecting the next one [9]. Figure
7 shows the plot of BER versus Eb /N0 for the QPSK system.
The red curve is the theoretical BER and the blue curve is
the obtained BER for our implementation. The theoretical and
practical values of BER are close to each other which veries
the functionality of the QPSK MODEM implementation.

Fig. 2: Constellation Diagram of a QPSK Modulated Carrier

The modulation and demodulation functions are imple-


mented in xed point and achieved 9.27x performance im-
provement compared to corresponding oating point imple-
mentation on the DSP because CEVA is a xed point DSP.
We have used Vec-C language to implement the FIR lter.
Complex lter input is considered where the real part is
considered as the I value and the imaginary part is the Q
value. Accordingly the real lter output is the ltered I output
and imaginary output is the ltered Q output because the
lter coefcients are all real values. Vec-C implementation Fig. 4: (a) Transmitted Signal at the Transmitter, (b)
achieved 19786.7x less clock cycles compared to correspond- Received Signal at the Receiver for SNR = 35 dB, (c)
ing C implementation. This is due to the fact that, Vec- Received Signal at the Receiver for SNR = 20 dB, (d)
C implementation utilizes the 2 VCUs, optimized for DSP Received Signal at the Receiver for SNR = 1 dB
operations whereas C implementation uses only the GCU.
We have taken measures to optimize the code in terms of
cycle count and code size. For CEVA compiler, level 04 cycle
count optimization and level 03 code size optimization are
used. To overcome stack overhead, global variables were used.
Moreover, we minimized loop content by removing any non-
dependent contents from loops. Memory accesses that dont
change throughout the loop are copied to local variables and
copied back if necessary. Known limits to loops are used by (a) (b) (c)
copying loop limit to local variable. Simplied conditions are
used inside the loops. To optimize the code memory and data
memory, similar loops are merged together and minimized the
usage of if else statement. Large functions are split into
small and simple ones.
(d)
IV. R ESULTS Fig. 5: Constellation Diagram: (a) At the Transmitter, (b)
The transmitted waveform at the transmitter is shown in Received Signal for SNR = 35 dB, (c) Received Signal for
gure 4(a). The received signal at the receiver for SNR value SNR = 20 dB, (d) Received Signal for SNR = 1 dB
of 35 dB after AWGN noise insertion by the channel, is shown
in gure 4(b). The received signal for SNR of 20dB and 1 The baseband SoC will be used for running Wi-Fi stack
dB are shown in gure 4(c)and 4(d) respectively. For these based on IEEE 802.11b standards where PHY layer will

478
2016 2nd International Conference on Control, Instrumentation, Energy & Communication (CIEC)

Fig. 3: Block Diagram of QPSK based MODEM

in the form of cycle count, is calculated. The CEVA-XC


emulation platform is based on Altera Stratix-4 Device (Al-
tera EP4SE530F43C4) with 256KB Internal Program, 512KB
Internal Data in 4 Blocks and it uses 80% of the device
(a) (b)
logic 8. The board provides a standalone minimal system for
software development. It includes 6 Hi-Speed connectors to
interface with other CEVA emulation platforms or base boards.
The CEVA-XC operates using 50 MHz DSP clock with 256
KB L1 program memory and 512 KB L1 data memory. The
emulation platform contains 8 MB external FLASH memory
(c) (d) CEVA- XC JTAG, Altera JTAG conguration port, LEDs and
Fig. 6: Eye Diagram for the I and Q components of the Dip-switches. The emulation platform is connected to JBOX2
QPSK signal: (a) At the Transmitter, (b) Received Signal for which is useful for real-time emulation support and supports
SNR = 35 dB, (c) Received Signal for SNR = 20 dB, (d) ethernet and USB connection to host PC. It will be used on
Received Signal for SNR = 1 dB nal SoC with JTAG interface to the DSP.
The input length to the QPSK modulator is taken as 3072
bits and accordingly the output length after QPSK modulation
becomes 1536 symbols. The results are summarized in tableII.
As the DSP core estimated operating frequency is 400 MHz in
the SoC, module execution time needs 110.32 s. Therefore,
the achieved data rate is 13.9231 Mbps. The QPSK demodu-
lation requires 221531 clock cycles i.e. completes in 553.8275
s achieving a data rate of 2.9901 Mbps. The selected param-
eter values and the measurements indicate that the CEVA-
XC321 Core could handle the baseband QPSK MODEM
functionality and deliver acceptable real-time performance for
Fig. 7: BER versus Eb/N0 IEEE 802.11b Wi-Fi applications.

be implemented in CEVA core. According to IEEE 802.11b


specications QPSK can be used for modulation-demodulation
to achieve 2 Mbps data rate [12]. Also the payload size in a
transmission packet varies between 32 bytes to 2048 bytes, 30
to 32 bytes being most common ( [25], [26]). In our case, we
have considered a payload size of 384 bytes.
The CEVA core is operating at 400 MHz in the SoC RTL
simulation model and targeted the chip to work at the same Fig. 8: CEVA emulation platform Board
frequency. We have ported and veried the proposed QPSK
MODEM on CEVA-XC emulation Platform using the CEVA Compared to licensable DSP cores, CEVA is most suitable
integrated development platform (IDE). CEVA-proler tool in for Wi-Fi applications in terms of computation efciency,
the CEVA-IDE is used to prole the design and performance, speed and power ( [14], [17], [20], [22]). Compared to TI DSP

479
2016 2nd International Conference on Control, Instrumentation, Energy & Communication (CIEC)

TABLE II. QPSK Modem Implementation Results [5] IEEE Standard 802.11g, 2003.
No. of Clock Cy- Processing Data Rate [6] T. Ruhlicke, M. Zannoth, and B. U. Klepser, A highly integrated, dual-
Sl. Module
cles Required time (s) Achieved (Mbps) band, multi-mode wireless lan transceiver, Proceedings of ESSCIRC03,
1 Modulator 44128 110.32 13.9231 pp. 229232, 2003.
2 Demodulator 221531 553.8275 2.9901 [7] C. Yuen, K. Laursen, D. Chu, Y.-C. Pao, and A. Chernyakov, A compact
ip chip single die wi fem for smart phone application, in IMS IEEE
conference, vol. 10, 2010, pp. 10981101.
[8] Z. Ziyuan, T. Shan, S. Yongtao, H. Juan, S. Gang, and S. Jinglin, A
100 gops asp based baseband processor for wireless communication,
core, in [27], Angelis et. al. implemented QPSK transmitter in IEEE conference on EDAA, 2013.
on TMS320C6711DSP, which can operate at a maximum [9] A. Zafar and S. Z. Farooq, Implementation and analysis of qpsk &
frequency of 200 MHz and achieved a maximum data-rate 16qam modulator & demodulator, in Proceedings of ICAST, vol. 2,
2008, pp. 6468.
of 1840 kbps. [10] A. Corporation, Qpsk modem reference design, Application Note,
QPSK modulator is implemented on FPGA devices( [28], vol. 2, no. 281, pp. 132, 2003.
[29]). In [28], 512 Kbps data-rate is achieved using Actel [11] S. Popescu, A.S.Gontean, and D.Ianchis, Implementation of a qpsk
system on fpga, 2011 IEEE 9th International Symposium on Intelligent
ProASIC 3000 kit whereas in [29] attained data-rate is only Systems and Informatics, pp. 365370, 2011.
500 Kbps on Xilinx FPGA. In contrast, we have achieved more [12] T. Chunjiang, Z. Xin, L. Bo-an, and C. hongyi, The design of 802.11b
than 2 Mbps for QPSK MODEM. wlan baseband processor, in ASIC 2003 Proceedings, vol. 2, October
2003, pp. 852855.
CEVA Integrated Development Environment (IDE) supports [13] F. Daneshgaran, M. Laddomada, F. Mesiti, and M. Mondin, On
high level language for application development [17]. This the behavior of the distributed coordination function of ieee 802.11
feature facilitated us to develop parameterized design in ANSI- with multirate capability under general transmission conditions, IEEE
Transactions on Wireless Communications, pp. 118, 2007.
C language, which shortened the development life-cycle. The [14] CEVA.Inc., Ceva news - ceva makes it mobile, CEVA Newsletter,
parameterization makes the code reusable for other applica- vol. 9, no. 1, pp. 116, 2010.
tions as well. This CEVA emulation platform can be used for [15] P. Chandan, T. H. Bahubali, G. L. Mohanaraj, and P. S. Gowra,
Implementation of qpsk based communication system on tms320c6713
developing multiple hi-end applications [19]. This reduces the dsk, World Research Journal of Ad Hoc and Ubiquitous Computing,
development cost since we dont need to incur high equipment vol. 1, no. 1, pp. 1115, 2012.
expenditure to migrate from one generation to next [16]. [16] S. D. Deshpande, Software implementation 0f ieee 802.11b wireless lan
standard, Proceeding of the SDR 04 Technical Conference and Product
V. C ONCLUSION Exposition, 2004.
[17] CEVA.Inc., ceva xc321T M architecture specication, CEVA Ar-
We have proposed a QPSK based transmitter and receiver chitecture Specication, vol. 1, no. 2, pp. 1163, 2010.
and implemented it on CEVA-XC321 DSP. CEVA is selected [18] Tensilica.Inc., Xtensa lx2, Product Brief, pp. 17, 2007.
[19] M. Bansal, J. Mehlman, S. Katti, and P. Levis, Openradio: A pro-
due to its superior performance among other licensable soft grammable wireless dataplane, HotSDN-12, Helsinki, Finland., pp.
DSP cores as discussed in this paper. We have utilized Vec- 109114, 2012.
C programming language to use the VCU of CEVA for [20] T. R. Halfhill, Tensilica plays baseband: New connx dsp core aims for
low-power wireless communications, Reed Electronics Group, pp. 19,
better performance as demonstrated. AWGN channel model is 2009.
developed in Matlab for exhaustive verication of the system [21] N. Semiconductors, The coolux bsp: the novel low cost ultra low
for a range of SNR values. We analyzed the constellation plot, power processor core for software dened digital basebands, pp. 122,
2007.
eye diagram as part of performance evaluation of the system. [22] K. Williston, Ceva dsp does lte in software, DSP DcsignLine, 2009.
The MODEM was implemented in the CEVA-XC321 based [23] P. Barsocchi, G. Oligeri, and F. Potorti, Frame error model in rural
emulation platform connected to JBOX2, useful to interface to wi- networks, in Modeling and Optimization in Mobile, Ad Hoc and
Wireless Networks and Workshops, 2007. WiOpt 2007. 5th International
the DSP in the nal SoC. The proposed system improves the Symposium on, 2007, pp. 16.
developing efciency, shorten developing period and reduce [24] M. Khosroshahy, Study and implementation of ieee 802.11 physical
costs. We established the applicability of this modem for IEEE channel model in yans (ns3 prototype) network simulator, INRIA
Report, pp. 161, 2006.
802.11b Wi-Fi phone application. The performance of this [25] M. A. M. M. El-Bendary, A. E. Abou-El-azm, N. A. El-Fishawy,
system is compared with other COTS based implementations F. Shawki, F. E. Abd-ElSamie, M. A. R. El-Tokhy, and H. B. Kazemian,
as published in literature. Performance of the audio signals transmission over wireless networks
with the channel interleaving considerations, EURASIP Journal on
In the future work, this proposed system will be ported on Audio, Speech and Music Processor, vol. 4, no. 1, pp. 114, 2012.
to the baseband SoC evaluation board. To further improve the [26] M. Veeraraghavan, N. Cocker, and T. Moors, Support of voice services
performance, we can model and use assembly optimized Vec- in ieee 802.11 wireless lans, in INFOCOM 2001. Twentieth Annual
Joint Conference of the IEEE Computer and Communications Societies.
C functions. Error correcting capability will be included in Proceedings. IEEE, vol. 1, 2001, pp. 488497 vol.1.
the modem system to improve the BER performance. Spread [27] C. T. Angelis and A. Lambros, An qpsk 3g transmitter based on the
Spectrum (SS) can be used in this system to increase the tms320c6711dsp, Rom. Journ. Phys., vol. 50, no. 7-8, pp. 669677,
2005.
security of the system. [28] K. A. Monpara and S. B. Parmar, Design and implementation of qpsk
modulator using digital sub-carrier, Journal of Information, Knowledge
R EFERENCES and Research in Electronics and Communication Engineering, vol. 2,
[1] A. S. Ozlem and K. S. Hacer, An x-band transmitter for satellite no. 2, pp. 569573, 2012.
communication, pp. 110. [29] T. Sakla, D. Jain, and S. Gautam, Implementation of digital qpsk
[2] A. Zafar and S. Z. Farooq, Advances in space technologies, 2008. modulator by using vhdl/matlab, International Journal of Engineering
[3] IEEE Standard 802.11a, 1999. Service and Technology, vol. 2, no. 9, pp. 48274831, 2010.
[4] IEEE Standard 802.11b, 1999.

480

Vous aimerez peut-être aussi