Vous êtes sur la page 1sur 37

Department of Instrumentation

Technical seminar on

Mahitha.G USN:1RV12LBI07 M.TECH II-sem BMSPI.


6/9/2013
DEPT. OF IT-BMSP&I

Under the guidance of Dr.K.V.Padmaja Associate professor &Dean Dept. of IT,RVCE.


1

INTRODUCTION
Voice Recognition Systems are those systems which can recognize the voices of the individuals. Voice Recognition is the process of translating spoken words into text words on the computer. Voice recognition is an alternative to typing on a key board.
6/9/2013
DEPT. OF IT-BMSP&I

How Humans do This????????

Articulation produces sound waves which the ear conveys to the brain for processing.
6/9/2013
DEPT. OF IT-BMSP&I

3/34

How might computers do this????


Acoustic waveform
Acoustic signal

Digitization Acoustic analysis of the speech signal Linguistic interpretation


6/9/2013
DEPT. OF IT-BMSP&I

Speech recognition

4/34

Isolated Systems requires a brief pause between the spoken words. While Continuous Systems doesnt.
Speaker Dependent Systems recognize speech from only one speaker. While Speaker-Independent Systems can recognize anyones speech.

6/9/2013

DEPT. OF IT-BMSP&I

How to create a Voice Recognition System


Speech Acquisition(Collection). Speech Analysis. User Interface Development.

6/9/2013

DEPT. OF IT-BMSP&I

Speech Acquisition
For training purpose the speech is acquired using the microphone, for the analysis. The sound card of PC converts the Analog Speech input into digital format for further analysis.

6/9/2013

DEPT. OF IT-BMSP&I

Speech Analysis
The First important step in Speech Analysis is to separate each word from the ambient noise. Further each spoken word is compared with the inbuilt acoustic model or dictionary which is created during the training session. The above step is done with the help of an efficient speech detection algorithm.

6/9/2013

DEPT. OF IT-BMSP&I

User Interface Development


Final Step is to Develop a User Interface, so that all users can use these system with ease. For Example: Speech Recognition System of Windows 7 looks so compact & is as shown below.

6/9/2013

DEPT. OF IT-BMSP&I

Developing a voice recognition system

2013-14

Block diagram of a voice recognition system

6/9/2013

DEPT. OF IT-BMSP&I

10

Process of Speech Recognition

Voice Input

Analog to Digital

Acoustic Model

Language Model

Feedback

Display

Speech Engine

6/9/2013

DEPT. OF IT-BMSP&I

11

Speech recognition

How it works?

Record voice command (Time domain) Transform into frequency domain using Fourier Transform and get the magnitude spectrum Compare spectrum of voice commands

6/9/2013

DEPT. OF IT-BMSP&I

12

Program block diagram


Voice Command Fourier Transform

Frequency Spectrum

NO

Compare with stored voice commands

Do they match?

YES

Command 6/9/2013
DEPT. OF IT-BMSP&I

13

Fourier transform

Speech Signal (time domain)

Frequency domain

6/9/2013

DEPT. OF IT-BMSP&I

14

APPLICATIONS
Telephony and Other Domains. People with Disabilities. Training Air-Traffic Controller. High Performance Fighter Air-Craft. Electronic Medical Records etc

6/9/2013

DEPT. OF IT-BMSP&I

15

People with Disability


This picture shows how Voice Recognition System is helping a Disable man to complete his work.

6/9/2013

DEPT. OF IT-BMSP&I

16

Voice Recognition Gadgets


Voice Recognition Systems can be embedded with modern gadgets. Example:-TV etc

6/9/2013

DEPT. OF IT-BMSP&I

17

Voice Recognition in HighPerformance Fighter Aircrafts


Voice Recognition System has substantially added in High-Performance Fighter Aircrafts. This System helps the pilot to control the various subsystem in an effective manner.
F-35 is the first US Fighter Aircraft with Voice Recognition System able to hear the pilot spoken commands to manage various aircraft subsystems, such as communications and navigation.

6/9/2013

DEPT. OF IT-BMSP&I

18

Developing a voice recognition system

Speech Recognition: MS Office 2003 Open MS Word Tools Speech This enables the language bar for both speech-to-text and text-to-speech options You will be guided through training needed to create a user voice profile (15 minutes) You will need a microphone Can dictate directly into MS Office, not other applications
6/9/2013

Speech Recognition Vista and Windows 7 Built into the Operating System :

2013-14

Open Speech Recognition by clicking the Start button , clicking Control Panel, clicking Ease of Access, and then clicking Speech Recognition. Click Set up microphone, follow the instructions in the wizard.
19

DEPT. OF IT-BMSP&I

2013-14

Microsoft Speech Recognition Windows 7

6/9/2013

DEPT. OF IT-BMSP&I 20 http://www.microsoft.com/enable/products/windowsvista/speech.aspx

Developing a voice recognition system

2013-14

Dictating and Correcting

6/9/2013

DEPT. OF IT-BMSP&I

21

Voice Recognition System: Flaws and Weakness


Low signal-to-noise ratio. Overlapping speech. Differentiation b/w Homonyms. Intensive use of computer power.

6/9/2013

DEPT. OF IT-BMSP&I

22

How to Remove Flaws and Weakness of VRS


Using High Quality Microphone. Use Good Quality of Sound Cards. System must be trained properly. If possible work in quiet environment.

6/9/2013

DEPT. OF IT-BMSP&I

23

HUMAN PERFORMANCE
(According to a paper written by Lippmann)

Digits
0.8 0.6 0.4 Digits 7 6 5 4 3 2 0 Machine Human 1 0

Letters

Letters

0.2

Machine Humans
Digits Error Rate 6/9/2013
DEPT. OF IT-BMSP&I

Word Error Rate 24

Developing a voice recognition system

2013-14

BENEFITS AND CHALLENGES Spelling attention Ergonomics Hands-free use Endurance Concentration and Reading and speech Pronunciation & articulation

6/9/2013

DEPT. OF IT-BMSP&I

25

Developing a voice recognition system

2013-14

Recent Improvements in SR

Faster training ~10 min. Better recognition ~95% More compatible software Better system control/command

6/9/2013

DEPT. OF IT-BMSP&I

26

Developing a voice recognition system

2013-14

Future of SR SUI Speech-based User Interface Improvements needed: - Greater accuracy - Greater system control/command - More compatible software

6/9/2013

DEPT. OF IT-BMSP&I

27

CONCLUSION
Human performance figures suggests that we still have enormous room for improvement. At present several new algorithms are developed to implement voice recognition system.

6/9/2013

DEPT. OF IT-BMSP&I

28

IEEE PAPERS ON VOICE RECOGNITION SYSTEM


6/9/2013 29

[1] An Interactive and Efficient Voice Processing System for Embedded Applications
The objective of this paper is to present the design of an embedded system that will be helpful for the physically impaired individuals in their day to day life. This paper proposes a speech recognition and colour sensing technique based on Formant frequency and Euclidean distance analysis for embedded systems. The complete system consists of three subsystems, the speech recognition system, a central controller and the robotic arm. The experimental and simulation results show that the proposed algorithm makes a good balance between the computational complexity and recognition accuracy, and thus is more useful for embedded systems. speech recognition is successfully implemented on Atmels Atmega16 microcontroller. When compared to the existing system, the proposed system provides robustness and reliability.
6/9/2013
DEPT. OF IT-BMSP&I

30

[2]Hardware-Software co design of automatic speech recognition system for embedded Real-time applications
The system consists of a standard microprocessor and a hardware accelerator for Gaussian mixture model (GMM) emission probability calculation implemented on a field-programmable gate array.

Experiments on widely used benchmark data show that the real-time factor of the proposed system is 0.62, which is about three times faster than the pure software-based baseline system, while the word accuracy rate is preserved at 93.33%. As a part of the recognizer, a new adaptive beam-pruning algorithm is also proposed and implemented, which further reduces the average real-time factor to 0.54 with the word accuracy rate of 93.16%. The proposed speech recognizer is suitable for integration in various types of voice (speech)-controlled applications.

6/9/2013

DEPT. OF IT-BMSP&I

31

[2]Hardware-Software co design of automatic speech recognition system for embedded Real-time applications(Contd..)
The proposed ASR system shows much better real-time factors than the other approaches without decreasing the word accuracy rate. Other advantages of the proposed approach include rapid prototyping, flexibility in design modifications, and ease of integrating ASR with other applications. These advantages, both quantitative and qualitative, suggest that the proposed co processing architecture is an attractive approach for embedded ASR. Aside from better word accuracy and timing performance, power consumption is also another important issue for embedded applications. The proposed architecture is not tied to any specific target technology.
6/9/2013
DEPT. OF IT-BMSP&I

32

6/9/2013

33

Developing a voice recognition system

2013-14

REFERENCES
[1] Robert Keefer, Yan Liu, and Nikolaos Bourbakis, THE DEVELOPMENT AND EVALUATION OF AN EYES-FREE INTERACTION MODEL FOR MOBILE READING DEVICES, ieee transactions on human-machine systems, vol. 43, no.1, january 2013.

[2] Vinayak nayyar department of electronics and instrumentation. Abhinav Kumar department of electronics and communication SRM university chennai,India AN INTERACTIVE AND EFFICIENT VOICE PROCESSING SYSTEM FOR EMBEDDED APPLICATIONS Mediterranean Conference on Embedded Computing MECO 2012.
[3] yujing si, ta li, shang cai, jielin pan, yonghong yan, RECURRENT NEURAL NETWORK LANGUAGE MODEL IN MANDARIN VOICE INPUT SYSTEM 2012 8th international conference on natural computation (ICNC 2012).

6/9/2013

DEPT. OF IT-BMSP&I

34

Developing a voice recognition system

2013-14

REFERENCES( Contd)
[4] Octavian Cheng, Member, IEEE, Waleed Abdulla, Member, IEEE, and Zoran Salcic, Senior Member, IEEE HARDWARESOFTWARE CO DESIGN OF AUTOMATIC SPEECH RECOGNITION SYSTEM FOR EMBEDDED REAL-TIME APPLICATIONS,Ieee transactions on industrial electronics, vol. 58, no. 3, march 2011. [5] Chee cheun huang1,2and julien epps1,school of electrical engineering and telecommunications,the university of new south wales, sydney, NSW 2052, australia ,national ICT australia (NICTA), australian technology park, sydney, NSW 1430, australia , A STUDY OF AUTOMATIC PHONETIC SEGMENTATION FOR FORENSIC VOICE COMPARISON ICASSP 2012. [6] Afeez Olalekan, Alex Page, Ying Sun, PhD Department of Electrical, Computer and Biomedical Engineering, University of Rhode Island, Kingston, RI 02881-0805 USA Optimizing the Functionality of a Voice Recognition System for Assistive Technology.
6/9/2013
DEPT. OF IT-BMSP&I

35

Developing a voice recognition system

2013-14

REFERENCES( Contd)
[7] Kurzweil Applied Intelligence, Inc Developing Continuous Speech Recognition Technology that Uses Natural Language Processing Commands Research and data for status report 93-01-0101 were collected during july september 2001 and april - june 2002. [8] Luke Makischuk,Abderahmane Sebaa ,Ameneh Sadat,Yazdaninik Asma Faizi SPEECH RECOGNITION AND ITS APPLICATION IN VOICEBASED ROBOT CONTROL SYSTEM. [9] Agus Trihandoyo, Adam Belloum and Kun-Mean Hou Heudiasyc CNRS URA 817, UniversitC de Technologie de Compikgne B.P. 649, 60206 Compikgne Cedex, FRANCE A REAL-TIME SPEECH RECOGNITION ARCHITECTURE FOR A MULTI-CHANNEL INTERACTIVE VOICE RESPONSE SYSTEM 1995 IEEE. [10] Yfan Gong and Yu-Hung Kao,Texas Instruments Incorporated,P.O. Box 6601 99, MS-8649, Dallas, TX 75266, USA IMPLEMENTING A HIGH ACCURACY SPEAKER-INDEPENDENT CONTINUOUS SPEECH RECOGNIZER ON A FIXED-POINT DSP ,2000 IEEE.

6/9/2013

DEPT. OF IT-BMSP&I

36

6/9/2013

DEPT. OF IT-BMSP&I

37

Vous aimerez peut-être aussi