Voice Recognition System Seminar

Department of Instrumentation
Technical seminar on
Mahitha.G USN:1RV12LBI07 M.TECH II-sem BMSPI.

6/9/2013
DEPT. OF IT-BMSP&I
Under the guidance of Dr.K.V.Padmaja Associate professor &Dean Dept. of IT,RVCE.

1
INTRODUCTION
Voice Recognition Systems are those systems which can recognize the voices of the individuals. Voice Recognition is the process of translating spoken words into text words on the computer. Voice recognition is an alternative to typing on a key board.
6/9/2013
DEPT. OF IT-BMSP&I
How Humans do This????????
Articulation produces sound waves which the ear conveys to the brain for processing.
6/9/2013
DEPT. OF IT-BMSP&I
3/34
How might computers do this????

Acoustic waveform
Acoustic signal
Digitization Acoustic analysis of the speech signal Linguistic interpretation

6/9/2013
DEPT. OF IT-BMSP&I
Speech recognition
4/34
Isolated Systems requires a brief pause between the spoken words. While Continuous Systems doesnt.
Speaker Dependent Systems recognize speech from only one speaker. While Speaker-Independent Systems can recognize anyones speech.
6/9/2013
DEPT. OF IT-BMSP&I
How to create a Voice Recognition System

Speech Acquisition(Collection). Speech Analysis. User Interface Development.
6/9/2013
DEPT. OF IT-BMSP&I
Speech Acquisition
For training purpose the speech is acquired using the microphone, for the analysis. The sound card of PC converts the Analog Speech input into digital format for further analysis.
6/9/2013
DEPT. OF IT-BMSP&I
Speech Analysis
The First important step in Speech Analysis is to separate each word from the ambient noise. Further each spoken word is compared with the inbuilt acoustic model or dictionary which is created during the training session. The above step is done with the help of an efficient speech detection algorithm.
6/9/2013
DEPT. OF IT-BMSP&I
User Interface Development

Final Step is to Develop a User Interface, so that all users can use these system with ease. For Example: Speech Recognition System of Windows 7 looks so compact & is as shown below.
6/9/2013
DEPT. OF IT-BMSP&I
Developing a voice recognition system
2013-14
Block diagram of a voice recognition system
6/9/2013
DEPT. OF IT-BMSP&I
10
Process of Speech Recognition
Voice Input
Analog to Digital
Acoustic Model
Language Model
Feedback
Display
Speech Engine
6/9/2013
DEPT. OF IT-BMSP&I
11
Speech recognition
How it works?

Record voice command (Time domain) Transform into frequency domain using Fourier Transform and get the magnitude spectrum Compare spectrum of voice commands
6/9/2013
DEPT. OF IT-BMSP&I
12
Program block diagram

Voice Command Fourier Transform
Frequency Spectrum
NO
Compare with stored voice commands
Do they match?
YES
Command 6/9/2013
DEPT. OF IT-BMSP&I
13
Fourier transform
Speech Signal (time domain)
Frequency domain
6/9/2013
DEPT. OF IT-BMSP&I
14
APPLICATIONS
Telephony and Other Domains. People with Disabilities. Training Air-Traffic Controller. High Performance Fighter Air-Craft. Electronic Medical Records etc
6/9/2013
DEPT. OF IT-BMSP&I
15
People with Disability

This picture shows how Voice Recognition System is helping a Disable man to complete his work.
6/9/2013
DEPT. OF IT-BMSP&I
16
Voice Recognition Gadgets

Voice Recognition Systems can be embedded with modern gadgets. Example:-TV etc
6/9/2013
DEPT. OF IT-BMSP&I
17
Voice Recognition in HighPerformance Fighter Aircrafts

Voice Recognition System has substantially added in High-Performance Fighter Aircrafts. This System helps the pilot to control the various subsystem in an effective manner.
F-35 is the first US Fighter Aircraft with Voice Recognition System able to hear the pilot spoken commands to manage various aircraft subsystems, such as communications and navigation.
6/9/2013
DEPT. OF IT-BMSP&I
18
Speech Recognition: MS Office 2003 Open MS Word Tools Speech This enables the language bar for both speech-to-text and text-to-speech options You will be guided through training needed to create a user voice profile (15 minutes) You will need a microphone Can dictate directly into MS Office, not other applications
6/9/2013
Speech Recognition Vista and Windows 7 Built into the Operating System :
2013-14
Open Speech Recognition by clicking the Start button , clicking Control Panel, clicking Ease of Access, and then clicking Speech Recognition. Click Set up microphone, follow the instructions in the wizard.
19
DEPT. OF IT-BMSP&I
2013-14
Microsoft Speech Recognition Windows 7
6/9/2013
DEPT. OF IT-BMSP&I 20 http://www.microsoft.com/enable/products/windowsvista/speech.aspx
2013-14
Dictating and Correcting
6/9/2013
DEPT. OF IT-BMSP&I
21
Voice Recognition System: Flaws and Weakness

Low signal-to-noise ratio. Overlapping speech. Differentiation b/w Homonyms. Intensive use of computer power.
6/9/2013
DEPT. OF IT-BMSP&I
22
How to Remove Flaws and Weakness of VRS

Using High Quality Microphone. Use Good Quality of Sound Cards. System must be trained properly. If possible work in quiet environment.
6/9/2013
DEPT. OF IT-BMSP&I
23
HUMAN PERFORMANCE
(According to a paper written by Lippmann)
Digits
0.8 0.6 0.4 Digits 7 6 5 4 3 2 0 Machine Human 1 0
Letters
Letters
0.2
Machine Humans
Digits Error Rate 6/9/2013
DEPT. OF IT-BMSP&I
Word Error Rate 24
2013-14
BENEFITS AND CHALLENGES Spelling attention Ergonomics Hands-free use Endurance Concentration and Reading and speech Pronunciation & articulation
6/9/2013
DEPT. OF IT-BMSP&I
25
2013-14
Recent Improvements in SR
Faster training ~10 min. Better recognition ~95% More compatible software Better system control/command
6/9/2013
DEPT. OF IT-BMSP&I
26
2013-14
Future of SR SUI Speech-based User Interface Improvements needed: - Greater accuracy - Greater system control/command - More compatible software
6/9/2013
DEPT. OF IT-BMSP&I
27
CONCLUSION
Human performance figures suggests that we still have enormous room for improvement. At present several new algorithms are developed to implement voice recognition system.
6/9/2013
DEPT. OF IT-BMSP&I
28
IEEE PAPERS ON VOICE RECOGNITION SYSTEM

6/9/2013 29
[1] An Interactive and Efficient Voice Processing System for Embedded Applications
The objective of this paper is to present the design of an embedded system that will be helpful for the physically impaired individuals in their day to day life. This paper proposes a speech recognition and colour sensing technique based on Formant frequency and Euclidean distance analysis for embedded systems. The complete system consists of three subsystems, the speech recognition system, a central controller and the robotic arm. The experimental and simulation results show that the proposed algorithm makes a good balance between the computational complexity and recognition accuracy, and thus is more useful for embedded systems. speech recognition is successfully implemented on Atmels Atmega16 microcontroller. When compared to the existing system, the proposed system provides robustness and reliability.
6/9/2013
DEPT. OF IT-BMSP&I
30
[2]Hardware-Software co design of automatic speech recognition system for embedded Real-time applications
The system consists of a standard microprocessor and a hardware accelerator for Gaussian mixture model (GMM) emission probability calculation implemented on a field-programmable gate array.
Experiments on widely used benchmark data show that the real-time factor of the proposed system is 0.62, which is about three times faster than the pure software-based baseline system, while the word accuracy rate is preserved at 93.33%. As a part of the recognizer, a new adaptive beam-pruning algorithm is also proposed and implemented, which further reduces the average real-time factor to 0.54 with the word accuracy rate of 93.16%. The proposed speech recognizer is suitable for integration in various types of voice (speech)-controlled applications.
6/9/2013
DEPT. OF IT-BMSP&I
31
[2]Hardware-Software co design of automatic speech recognition system for embedded Real-time applications(Contd..)
The proposed ASR system shows much better real-time factors than the other approaches without decreasing the word accuracy rate. Other advantages of the proposed approach include rapid prototyping, flexibility in design modifications, and ease of integrating ASR with other applications. These advantages, both quantitative and qualitative, suggest that the proposed co processing architecture is an attractive approach for embedded ASR. Aside from better word accuracy and timing performance, power consumption is also another important issue for embedded applications. The proposed architecture is not tied to any specific target technology.
6/9/2013
DEPT. OF IT-BMSP&I
32
6/9/2013
33
2013-14
REFERENCES
[1] Robert Keefer, Yan Liu, and Nikolaos Bourbakis, THE DEVELOPMENT AND EVALUATION OF AN EYES-FREE INTERACTION MODEL FOR MOBILE READING DEVICES, ieee transactions on human-machine systems, vol. 43, no.1, january 2013.
[2] Vinayak nayyar department of electronics and instrumentation. Abhinav Kumar department of electronics and communication SRM university chennai,India AN INTERACTIVE AND EFFICIENT VOICE PROCESSING SYSTEM FOR EMBEDDED APPLICATIONS Mediterranean Conference on Embedded Computing MECO 2012.
[3] yujing si, ta li, shang cai, jielin pan, yonghong yan, RECURRENT NEURAL NETWORK LANGUAGE MODEL IN MANDARIN VOICE INPUT SYSTEM 2012 8th international conference on natural computation (ICNC 2012).
6/9/2013
DEPT. OF IT-BMSP&I
34
2013-14
REFERENCES( Contd)
[4] Octavian Cheng, Member, IEEE, Waleed Abdulla, Member, IEEE, and Zoran Salcic, Senior Member, IEEE HARDWARESOFTWARE CO DESIGN OF AUTOMATIC SPEECH RECOGNITION SYSTEM FOR EMBEDDED REAL-TIME APPLICATIONS,Ieee transactions on industrial electronics, vol. 58, no. 3, march 2011. [5] Chee cheun huang1,2and julien epps1,school of electrical engineering and telecommunications,the university of new south wales, sydney, NSW 2052, australia ,national ICT australia (NICTA), australian technology park, sydney, NSW 1430, australia , A STUDY OF AUTOMATIC PHONETIC SEGMENTATION FOR FORENSIC VOICE COMPARISON ICASSP 2012. [6] Afeez Olalekan, Alex Page, Ying Sun, PhD Department of Electrical, Computer and Biomedical Engineering, University of Rhode Island, Kingston, RI 02881-0805 USA Optimizing the Functionality of a Voice Recognition System for Assistive Technology.
6/9/2013
DEPT. OF IT-BMSP&I
35
2013-14
REFERENCES( Contd)
[7] Kurzweil Applied Intelligence, Inc Developing Continuous Speech Recognition Technology that Uses Natural Language Processing Commands Research and data for status report 93-01-0101 were collected during july september 2001 and april - june 2002. [8] Luke Makischuk,Abderahmane Sebaa ,Ameneh Sadat,Yazdaninik Asma Faizi SPEECH RECOGNITION AND ITS APPLICATION IN VOICEBASED ROBOT CONTROL SYSTEM. [9] Agus Trihandoyo, Adam Belloum and Kun-Mean Hou Heudiasyc CNRS URA 817, UniversitC de Technologie de Compikgne B.P. 649, 60206 Compikgne Cedex, FRANCE A REAL-TIME SPEECH RECOGNITION ARCHITECTURE FOR A MULTI-CHANNEL INTERACTIVE VOICE RESPONSE SYSTEM 1995 IEEE. [10] Yfan Gong and Yu-Hung Kao,Texas Instruments Incorporated,P.O. Box 6601 99, MS-8649, Dallas, TX 75266, USA IMPLEMENTING A HIGH ACCURACY SPEAKER-INDEPENDENT CONTINUOUS SPEECH RECOGNIZER ON A FIXED-POINT DSP ,2000 IEEE.
6/9/2013
DEPT. OF IT-BMSP&I
36
6/9/2013
DEPT. OF IT-BMSP&I
37

Voice Recognition System Seminar

Transféré par

Informations du document

Description originale:

Titre original

Copyright

Formats disponibles

Partager ce document

Partager ou intégrer le document

Options de partage

Avez-vous trouvé ce document utile ?

Ce contenu est-il inapproprié ?

Droits d'auteur :

Formats disponibles

Voice Recognition System Seminar

Transféré par

Droits d'auteur :

Formats disponibles

Department of Instrumentation

Mahitha.G USN:1RV12LBI07 M.TECH II-sem BMSPI.

Under the guidance of Dr.K.V.Padmaja Associate professor &Dean Dept. of IT,RVCE.

How Humans do This????????

How might computers do this????

Digitization Acoustic analysis of the speech signal Linguistic interpretation

How to create a Voice Recognition System

User Interface Development

Developing a voice recognition system

Block diagram of a voice recognition system

Process of Speech Recognition

Program block diagram

Compare with stored voice commands

Speech Signal (time domain)

People with Disability

Voice Recognition Gadgets

Voice Recognition in HighPerformance Fighter Aircrafts

Developing a voice recognition system

Microsoft Speech Recognition Windows 7

DEPT. OF IT-BMSP&I 20 http://www.microsoft.com/enable/products/windowsvista/speech.aspx

Developing a voice recognition system

Dictating and Correcting

Voice Recognition System: Flaws and Weakness

How to Remove Flaws and Weakness of VRS

Word Error Rate 24

Developing a voice recognition system

Developing a voice recognition system

Developing a voice recognition system

IEEE PAPERS ON VOICE RECOGNITION SYSTEM

Developing a voice recognition system

Developing a voice recognition system

Developing a voice recognition system

Vous aimerez peut-être aussi