Outline
Introduction
Project Description
Motivation
Sample Output
Automatic Speech Recognition Data Files
Future Work
Strategic Approach
Introduction
Project Description
This project represents one of the many design and development activities offered to
undergraduate students in the area of Science, Technology, Engineering and
Mathematics.
This presentation describes the design of an automatic speaker recognition system
in the MATLAB software environment. The work was part of a NASA Langley
Research collaboration through the Chesapeake Information Based Aeronautic
Consortium (CIBAC).
Motivation
Speech Recognition Process
Feature Extraction: The process of extracting unique information from speech files that can
later be used to identify the speaker.
Feature Matching: The process of actually identifying the speaker, which involves comparing
the unknown voice data with the voiceprints of known speakers stored in the system's database.
A voiceprint represents the most basic, yet unique, features of the speech command
in the frequency domain.
A voiceprint is merely a matrix of numbers in which each number represents the
energy or average power that is heard in a particular frequency band during a specific
interval.
During the feature extraction stage a database of voiceprints is created in order to
be used as a reference in the feature matching stage.
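The voiceprint described above can be sketched in a few lines. This is a minimal illustration, not the project's MATLAB code: the function name `voiceprint`, the frame length, the number of bands, the non-overlapping frames, and the naive DFT are all assumptions chosen to keep the example short and dependency-free.

```python
import cmath
import math


def voiceprint(samples, frame_len=64, n_bands=4):
    """Return a matrix with one row per time frame and one column per band.

    Each entry is the average spectral power of that frame in that band.
    frame_len and n_bands are illustrative values, not taken from the project.
    """
    n_bins = frame_len // 2              # frequency bins below Nyquist
    bins_per_band = n_bins // n_bands    # equal-width bands (an assumption)
    matrix = []
    # Split the signal into non-overlapping frames (specific intervals).
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        # Naive DFT power spectrum of the frame.
        power = []
        for k in range(n_bins):
            acc = sum(
                x * cmath.exp(-2j * math.pi * k * n / frame_len)
                for n, x in enumerate(frame)
            )
            power.append(abs(acc) ** 2 / frame_len)
        # Average the power inside each frequency band.
        row = [
            sum(power[b * bins_per_band:(b + 1) * bins_per_band]) / bins_per_band
            for b in range(n_bands)
        ]
        matrix.append(row)
    return matrix


# Synthetic test tone that falls in the third band (index 2).
tone = [math.sin(2 * math.pi * 20 * n / 64) for n in range(128)]
vp = voiceprint(tone)
```

For the synthetic tone, `vp` has two rows (two frames) and four columns (four bands), and almost all of the energy lands in the third column. A practical system would normally use an FFT with overlapping, windowed frames.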
Signal Models
Deterministic models generally exploit some known, specific properties of the signal;
only the values of the signal model's parameters need to be determined.
Statistical models try to characterize only the statistical properties of the signal
via a stochastic process. The likelihood of an unknown signal is then computed under
a given model.
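As a rough sketch of how such a likelihood can be computed, the example below scores one observation sequence against two speaker models and keeps the higher-scoring one. It assumes a discrete hidden Markov model and uses the forward algorithm; the speaker names, the model parameters, and the function `forward_likelihood` are hypothetical and are not taken from the project.

```python
def forward_likelihood(obs, start, trans, emit):
    """Return P(obs | model) for a discrete HMM using the forward algorithm.

    start[s]    : probability of starting in state s
    trans[p][s] : probability of moving from state p to state s
    emit[s][o]  : probability of observing symbol o in state s
    """
    n_states = len(start)
    alpha = [start[s] * emit[s][obs[0]] for s in range(n_states)]
    for o in obs[1:]:
        alpha = [
            sum(alpha[p] * trans[p][s] for p in range(n_states)) * emit[s][o]
            for s in range(n_states)
        ]
    return sum(alpha)


# Hypothetical reference models for two speakers (toy numbers).
models = {
    "speaker_a": {
        "start": [0.6, 0.4],
        "trans": [[0.7, 0.3], [0.4, 0.6]],
        "emit": [[0.5, 0.4, 0.1], [0.1, 0.3, 0.6]],
    },
    "speaker_b": {
        "start": [0.5, 0.5],
        "trans": [[0.5, 0.5], [0.5, 0.5]],
        "emit": [[0.1, 0.1, 0.8], [0.1, 0.1, 0.8]],
    },
}

unknown = [0, 1, 2]  # quantized observation symbols from the unknown speaker
scores = {
    name: forward_likelihood(unknown, m["start"], m["trans"], m["emit"])
    for name, m in models.items()
}
best = max(scores, key=scores.get)
```

With these toy numbers the likelihood is 0.03628 under `speaker_a` and 0.008 under `speaker_b`, so `best` is `"speaker_a"`. For long sequences the products underflow, so real implementations work with scaled or log probabilities.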
Given the model parameters created for the reference speakers, the
most likely sequence of hidden states that could have generated a particular
unknown output sequence, and its probability, is computed using the Viterbi algorithm.
The Viterbi algorithm makes one key assumption:
the most likely hidden sequence up to a certain point t depends only on
the observed event at point t and the most likely sequence up to point t − 1.
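The recursion implied by that assumption can be sketched as follows. This is a generic textbook version of the Viterbi algorithm for a discrete hidden Markov model, not the project's MATLAB implementation; the two-state model and its probabilities are toy values chosen for illustration.

```python
def viterbi(obs, start, trans, emit):
    """Return (most likely hidden state path, its joint probability)."""
    n_states = len(start)
    # delta[s]: probability of the best path that ends in state s at time t.
    delta = [start[s] * emit[s][obs[0]] for s in range(n_states)]
    backptr = []  # backptr[t][s]: best predecessor of state s at step t + 1
    for o in obs[1:]:
        new_delta, ptrs = [], []
        for s in range(n_states):
            # Only the best path up to t - 1 and the current observation matter.
            best_prev = max(range(n_states), key=lambda p: delta[p] * trans[p][s])
            ptrs.append(best_prev)
            new_delta.append(delta[best_prev] * trans[best_prev][s] * emit[s][o])
        delta = new_delta
        backptr.append(ptrs)
    # Trace back from the best final state.
    last = max(range(n_states), key=lambda s: delta[s])
    path = [last]
    for ptrs in reversed(backptr):
        path.append(ptrs[path[-1]])
    path.reverse()
    return path, delta[last]


# Toy two-state model with three observation symbols (illustrative only).
start = [0.6, 0.4]
trans = [[0.7, 0.3], [0.4, 0.6]]
emit = [[0.5, 0.4, 0.1], [0.1, 0.3, 0.6]]

path, prob = viterbi([0, 1, 2], start, trans, emit)
```

For the observation sequence `[0, 1, 2]`, the most likely state path is `[0, 0, 1]` with probability 0.01512. Because each step keeps only the best path into every state, the cost grows linearly with the sequence length instead of exponentially.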
Proposed MATLAB-based Speaker Recognition System
Training
Testing
Graphical Output
acoustic_data.xls: workbook that contains the voiceprints of each speaker; analogous to a database.
test_data.xls, unknown.wav, unknown.mat (not pictured): contain the voiceprint, waveform, and
Hidden Markov model of the unknown speaker, respectively. These files are deleted immediately
after the testing stage.
Future Work
Using an additional program, the MATLAB Compiler, we will convert the
application into self-contained C code.
Implement the speech recognition program on Texas Instruments DSP hardware
(TMS320).