Gender Classification by Speech Analysis

Human listeners can extract information beyond just language from speech, such as the speaker's gender, age, and emotional state. Gender classification is useful for speech and speaker recognition systems, as models that account for gender differences can improve word error rates. However, identifying acoustic features that indicate gender while also accommodating differences between individuals is challenging. Physiological differences between male and female vocal tracts lead to differences in pitch and formant frequencies that influence gender perception. Both time and frequency domain analyses of speech signals can be used to extract features for gender classification.

Uploaded by irfanangp

• Human listeners can extract information from the acoustic signal beyond the linguistic message
---- the speaker's personality, emotional state, gender, age, dialect, and state of health

• Gender classification is useful in speech and speaker recognition
---- gender-dependent acoustic-phonetic models have been reported to improve performance, reducing the word error rate of a baseline speech recognition system by 1.6%
No!

• Gender classification is limited by the difficulty of identifying acoustic features that are sensitive to the task yet robust to speaker articulation differences, vocal tract differences, and prosodic variations
• The selected features should be time-invariant, phoneme-independent, and identity-independent for speakers of the same gender
• Some noise is always present in the signal
• Physiological
---- The vocal tract of females is shorter than that of males
---- Differences in physiological parameters lead to differences in acoustical parameters
• Acoustic
---- Pitch
---- Formant frequencies
---- Zero crossing rate …
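• The vocal tract can be modeled as a resonance tube whose shape varies with the phoneme uttered
• The resonance frequencies of a tube are inversely proportional to its length, so the shorter female vocal tract produces higher formant frequencies
• Pitch (fundamental frequency) is set by the vocal folds, which are on average shorter and lighter in females
• So, pitch of female > pitch of male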
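The resonance-tube model can be made concrete with the standard quarter-wavelength formula for a uniform tube closed at one end (glottis) and open at the other (lips), F_n = (2n - 1) c / (4 L). The sketch below is illustrative only: the function name `tube_resonances`, the speed of sound of 350 m/s, and the two tract lengths are assumptions, not values from the slides.

```python
def tube_resonances(length_m, count=3, speed_of_sound=350.0):
    """Resonances (Hz) of a uniform tube closed at one end and open at the other."""
    return [(2 * n - 1) * speed_of_sound / (4.0 * length_m)
            for n in range(1, count + 1)]

# Illustrative vocal tract lengths in metres (assumed, not measured data).
male = tube_resonances(0.175)
female = tube_resonances(0.145)

for n, (f_male, f_female) in enumerate(zip(male, female), start=1):
    print(f"resonance {n}: male {f_male:.0f} Hz, female {f_female:.0f} Hz")
```

Because the length appears in the denominator, every resonance of the shorter tube is higher by the same ratio. A real vocal tract is not a uniform tube, so these numbers only indicate the trend.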

• The perception of voice gender relies primarily on the fundamental frequency (f0), which is on average about an octave higher in female than in male voices; yet pitch overlaps considerably between male and female voices
• Although voice pitch and gender are linked, other information is also used to recognise an individual's gender from his/her voice
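Since f0 is the primary cue, it helps to see how it might be measured. The sketch below uses a plain autocorrelation peak search, which is one common approach and not necessarily the method used by the authors. The function name `estimate_f0`, the 60 to 400 Hz search range, and the synthetic 200 Hz test tone are all assumptions made for illustration.

```python
import math

def estimate_f0(signal, sample_rate, f_min=60.0, f_max=400.0):
    """Estimate f0 (Hz) as the lag with the largest autocorrelation value."""
    lag_min = int(sample_rate / f_max)
    lag_max = int(sample_rate / f_min)
    best_lag, best_value = lag_min, float("-inf")
    for lag in range(lag_min, lag_max + 1):
        # Unnormalised autocorrelation at this lag.
        value = sum(signal[n] * signal[n + lag]
                    for n in range(len(signal) - lag))
        if value > best_value:
            best_lag, best_value = lag, value
    return sample_rate / best_lag

# Synthetic voiced frame: a pure 200 Hz tone sampled at 8 kHz (assumed values).
sample_rate = 8000
tone = [math.sin(2 * math.pi * 200.0 * n / sample_rate) for n in range(800)]
f0 = estimate_f0(tone, sample_rate)
print(f"estimated f0: {f0:.1f} Hz")
```

Real speech would need voiced/unvoiced detection and some smoothing across frames. Because male and female pitch ranges overlap, a single f0 threshold would misclassify some speakers.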
• The techniques used to process speech signals can be classified as
---- Time domain analysis
---- Frequency domain analysis
• In time domain analysis, measurements are performed directly on the speech signal to extract information
• In frequency domain analysis, information is extracted from the frequency content of the signal, that is, from its spectrum
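The two families of analysis can be contrasted on a single frame. In this minimal sketch the zero crossing rate is computed directly on the samples (time domain), while the dominant frequency is read from a naive DFT magnitude spectrum (frequency domain). The function names, the 8 kHz sample rate, and the 250 Hz test tone are assumptions chosen for illustration.

```python
import cmath
import math

def zero_crossing_rate(frame):
    """Time domain: fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

def magnitude_spectrum(frame):
    """Frequency domain: magnitudes of a naive DFT for bins 0 .. N/2."""
    n = len(frame)
    return [abs(sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(frame)))
            for k in range(n // 2 + 1)]

# Synthetic frame: 256 samples of a 250 Hz tone at 8 kHz (assumed values).
sample_rate = 8000
frame = [math.sin(2 * math.pi * 250.0 * i / sample_rate + 0.1) for i in range(256)]

zcr = zero_crossing_rate(frame)
spectrum = magnitude_spectrum(frame)
peak_bin = max(range(len(spectrum)), key=spectrum.__getitem__)
peak_hz = peak_bin * sample_rate / len(frame)
print(f"zero crossing rate: {zcr:.4f}, spectral peak: {peak_hz:.1f} Hz")
```

The naive DFT is quadratic in the frame length and is used here only to keep the example dependency-free; an FFT routine would normally replace it.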
Formant features:
• The distinction between men and women has been represented by the location in the frequency domain of the first 3 formants for vowels.
• Hence the formant feature set comprises statistics of the 4 formant frequency contours: the mean, minimum, maximum, and variance of each of the first four formants.
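The formant feature set described above amounts to four statistics per contour, giving a 16-dimensional vector. The sketch below shows one way to assemble it. The contour values are made-up placeholders, the function name `formant_features` is hypothetical, and the use of population variance (`pvariance`) rather than sample variance is an assumption, since the slides do not say which is meant.

```python
from statistics import mean, pvariance

def formant_features(contours):
    """Return [mean, min, max, variance] for each formant contour, concatenated."""
    features = []
    for contour in contours:
        features += [mean(contour), min(contour), max(contour), pvariance(contour)]
    return features

# Placeholder contours in Hz for F1..F4, one value per analysis frame.
contours = [
    [510.0, 530.0, 520.0, 540.0],      # F1
    [1480.0, 1500.0, 1520.0, 1500.0],  # F2
    [2450.0, 2500.0, 2550.0, 2500.0],  # F3
    [3400.0, 3500.0, 3600.0, 3500.0],  # F4
]
features = formant_features(contours)
print(len(features), features[:4])
```

The contours themselves would come from a formant tracker, which is outside the scope of this sketch.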

LPC coefficients and Cepstral Coefficients

• Short-time energy (STE) is the energy associated with the signal in the time domain
• It is given by
$E(n) = \sum_{m=-\infty}^{\infty} \left[ s(m)\, w(n-m) \right]^2$, where
E(n) is the short-time energy,
s(m) is the speech signal,
w(n) is the window
• The average short-time energy for female voices is observed to be greater than that for male voices
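The short-time energy definition can be evaluated directly for a finite window. The sketch below assumes the window is nonzero only for indices 0 .. N-1, so the infinite sum reduces to the N most recent samples. The function name `short_time_energy`, the rectangular window, and the toy signal are illustrative assumptions.

```python
def short_time_energy(signal, window):
    """E(n) = sum over m of [s(m) * w(n - m)]^2 for a finite window w(0..N-1)."""
    energies = []
    for n in range(len(signal)):
        total = 0.0
        for k, w in enumerate(window):  # k plays the role of n - m
            m = n - k
            if 0 <= m < len(signal):    # samples outside the signal count as zero
                total += (signal[m] * w) ** 2
        energies.append(total)
    return energies

signal = [1.0, -2.0, 3.0, 0.0, -1.0]
window = [1.0, 1.0, 1.0]  # rectangular window of length 3 (an assumption)
energy = short_time_energy(signal, window)
print(energy)
```

With a rectangular window each E(n) is simply the sum of squares of the last three samples; a tapered window such as Hamming would weight recent samples differently.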
Thank You!
