Bienvenue sur Scribd !

The Human Speechome Project: ASRU 2007

Transféré par

0% ont trouvé ce document utile (0 vote)

26 vues41 pages

The Human Speechome Project aims to Advance our understanding of how children acquire language in natural contexts. It uses longitudinal, ultra-dense, in vivo recordings + data mining and behavioral modeling. Two orders of magnitude more behavioral data than previous studies, far fewer observer effects, new analysis tools. The Goal is to illuminate language acquisition, behavioral phenotyping, video search, smart homes, parenting aids, security, retail.

Description originale:

Titre original

P5-Roy

Copyright

Formats disponibles

PDF, TXT ou lisez en ligne sur Scribd

Partager ce document

Partager ou intégrer le document

Options de partage

Avez-vous trouvé ce document utile ?

Ce contenu est-il inapproprié ?

Signaler ce document

Droits d'auteur :

Attribution Non-Commercial (BY-NC)

Formats disponibles

Téléchargez comme PDF, TXT ou lisez en ligne sur Scribd

Signaler comme contenu inapproprié

0% ont trouvé ce document utile (0 vote)

26 vues41 pages

The Human Speechome Project: ASRU 2007

Transféré par

balajis_empathy620

Droits d'auteur :

Attribution Non-Commercial (BY-NC)

Formats disponibles

Téléchargez comme PDF, TXT ou lisez en ligne sur Scribd

Signaler comme contenu inapproprié

Passer à la page

Vous êtes sur la page 1sur 41

Rechercher à l'intérieur du document

The Human Speechome Project

Brandon Roy MIT Media Lab

ASRU 2007

The Human Speechome Project

MIT Media Lab
Cognitive Machines Group

Northeastern University
Communication Analysis and Design Laboratory

Prof. Deb Roy

Prof. Rupal Patel

Philip Decamp

Brandon Roy

Alexia Salata

Jethran Guinness

Rony Kubat

HUMAN SPEECHOME PROJECT

Goal: Advance our understanding of how children acquire language in natural contexts

HUMAN SPEECHOME PROJECT

Goal: Advance our understanding of how children acquire language in natural contexts Approach: Longitudinal, ultra-dense, in vivo recordings + data mining and behavioral modeling

HUMAN SPEECHOME PROJECT

Goal: Advance our understanding of how children acquire language in natural contexts Approach: Longitudinal, ultra-dense, in vivo recordings + data mining and behavioral modeling Differentiators: Two orders of magnitude more behavioral data than previous studies, far fewer observer effects, new analysis tools

HUMAN SPEECHOME PROJECT

Speech

in the

home

Speechome

DATA COLLECTION

A house in the Boston area

Looking up at the ceiling

Recording control system

Recording Infrastructure

14 Microphones

Preamps Audio/Video Processing

Ethernet GbS Ethernet 11 Cameras Control Server

Wireless Hubs 11 PDAs 5TB Raid

Storage at the Media Lab

250,000 GB capacity 80,000 hrs video 120,000 hrs audio ~2 yrs of recording

48 KHz audio, 1 MP video, ~15 fps

DATA ANALYSIS

Tracing the Birth of a Word

Transcribe all speech heard and produced by child

Tracing the Birth of a Word

Transcribe all speech heard and produced by child Annotate video surrounding all uses of target word

Tracing the Birth of a Word

Transcribe all speech heard and produced by child Annotate video surrounding all uses of target word Analyze role of contextual factors in word learning

Speech Transcription

Semi-automatic speech transcription

automate

Semi-automatic speech transcription

Evolution of water

Video Annotation

Tracking algorithms to follow people throughout the house

Video Annotation

Tracking algorithms to follow people throughout the house Estimate head pose to help determine focus of attention

Video Annotation

Tracking algorithms to follow people throughout the house Estimate head pose to help determine focus of attention Combine automatic and manual methods for tracking and head pose annotation

Using head pose to estimate focus of attention

Summary

Build a system to make it easy to collect lots of multimodal data

Summary

Build a system to make it easy to collect lots of multimodal data Interesting speech and language modeling challenges

Summary

Build a system to make it easy to collect lots of multimodal data Interesting speech and language modeling challenges Issues of privacy management, making the system cheaper, smarter recording strategies...

Summary

Vous aimerez peut-être aussi

Divine Manifestation PDF
Document31 pages
Divine Manifestation PDF
Keeran Raj Mahendran
100% (1)
Burt Goldman Quantum Jumping
Document295 pages
Burt Goldman Quantum Jumping
Cristian Catalina
100% (38)
Letting Go of Our Adult Children
Document153 pages
Letting Go of Our Adult Children
Ted Hussey
Pas encore d'évaluation
Building Resilience
Document8 pages
Building Resilience
Ramovic Elvir-Beli
Pas encore d'évaluation
Consumer Buying Behavior
Document36 pages
Consumer Buying Behavior
Susil Kumar Sa
Pas encore d'évaluation
Fallacies of Insufficient Evidence
Document7 pages
Fallacies of Insufficient Evidence
SM
Pas encore d'évaluation
Solution Manual For Project Management A Systems Approach To Planning Scheduling and Controlling 11th Edition by Kerzner
Document7 pages
Solution Manual For Project Management A Systems Approach To Planning Scheduling and Controlling 11th Edition by Kerzner
yaueys
0% (2)
Seven Habits of Highly Effective People
Document93 pages
Seven Habits of Highly Effective People
jocamanuel
100% (2)
Speech Recognition Report
Document20 pages
Speech Recognition Report
Ramesh k
100% (1)
376PAGESadrian Johnston A New German Idealism Hegel Zizek and Dialectical Materialism PDF
Document376 pages
376PAGESadrian Johnston A New German Idealism Hegel Zizek and Dialectical Materialism PDF
xan30tos
Pas encore d'évaluation
Natural Language Processing with Python: Natural Language Processing Using NLTK
D'Everand
Natural Language Processing with Python: Natural Language Processing Using NLTK
Frank Millstein
Évaluation : 3.5 sur 5 étoiles
3.5/5 (4)
Socio Cultural Anthropology
Document10 pages
Socio Cultural Anthropology
Dibini Jap
100% (1)
Group Work (FINAL)
Document9 pages
Group Work (FINAL)
Roselle Barcelon
56% (9)
Applied Natural Language Processing with Python: Implementing Machine Learning and Deep Learning Algorithms for Natural Language Processing
D'Everand
Applied Natural Language Processing with Python: Implementing Machine Learning and Deep Learning Algorithms for Natural Language Processing
Taweh Beysolow II
Pas encore d'évaluation
Strategies of Genius Vol I - Aristotle
Document70 pages
Strategies of Genius Vol I - Aristotle
Molox
100% (7)
7 Steps To Fearless Speaking
Document159 pages
7 Steps To Fearless Speaking
tailieu2015
Pas encore d'évaluation
Synopsis
Document18 pages
Synopsis
KHUSHBOO PAL
Pas encore d'évaluation
Forms & Conventions of Film & Moving Pictures (Characterization) .Docx (Revised) 2
Document3 pages
Forms & Conventions of Film & Moving Pictures (Characterization) .Docx (Revised) 2
Harley Laus
100% (2)
Natural Language Processing: All You Need To Know About
Document45 pages
Natural Language Processing: All You Need To Know About
Chaitanya Sai
Pas encore d'évaluation
Natural Language Processing
Document49 pages
Natural Language Processing
Bharti Gupta
100% (3)
A Framework For Deepfake V2
Document24 pages
A Framework For Deepfake V2
Abdullah fawaz altulahi
Pas encore d'évaluation
Chapter 1 Introduction
Document4 pages
Chapter 1 Introduction
Sabahat Irfan
Pas encore d'évaluation
Visual Assist
Document53 pages
Visual Assist
keerthan9105
Pas encore d'évaluation
Automatic Subtitle Generator
Document25 pages
Automatic Subtitle Generator
ravi060791
0% (1)
A Brief Introduction To Automatic Speech Recognition
Document22 pages
A Brief Introduction To Automatic Speech Recognition
Pham Thanh Phu
Pas encore d'évaluation
Computer Based Automatic Speech Processing: Pham Van Tuan
Document70 pages
Computer Based Automatic Speech Processing: Pham Van Tuan
hondaitodung
Pas encore d'évaluation
Call Development 2013
Document17 pages
Call Development 2013
Hisyam Ahmad
Pas encore d'évaluation
Speech Recognition: Fundamentals and Applications
D'Everand
Speech Recognition: Fundamentals and Applications
Fouad Sabry
Pas encore d'évaluation
Natural Language Understanding: Fundamentals and Applications
D'Everand
Natural Language Understanding: Fundamentals and Applications
Fouad Sabry
Pas encore d'évaluation
Preliminary Synopsis Report 2021-22
Document12 pages
Preliminary Synopsis Report 2021-22
NEWBIE HU
Pas encore d'évaluation
Research Papers On Speech Recognition System
Document8 pages
Research Papers On Speech Recognition System
fzg6pcqd
100% (1)
Automatic Lip Reading Classification of
Document5 pages
Automatic Lip Reading Classification of
Hajar Bouchama
Pas encore d'évaluation
SPEECH RECOGNITION SYSTEM Final
Document16 pages
SPEECH RECOGNITION SYSTEM Final
Mard Geer
Pas encore d'évaluation
Application and Development Prospect of AI Speech Recognition Technology
Document5 pages
Application and Development Prospect of AI Speech Recognition Technology
mounteverest276
Pas encore d'évaluation
Theoretical Work On Voice Recognition (Speech Recognition)
Document10 pages
Theoretical Work On Voice Recognition (Speech Recognition)
Maksym Akimov
Pas encore d'évaluation
Research Papers On Speech Recognition 2013
Document6 pages
Research Papers On Speech Recognition 2013
gvw6y2hv
100% (1)
Speech Recognition: - Shetul Chothani
Document19 pages
Speech Recognition: - Shetul Chothani
Sunil Pillai
Pas encore d'évaluation
Speech Recognition Report
Document87 pages
Speech Recognition Report
Kapil Dev Sharma
Pas encore d'évaluation
CASE STUDY - Speech Recognition
Document25 pages
CASE STUDY - Speech Recognition
naina nautiyal
Pas encore d'évaluation
Tejaswini Group Report
Document18 pages
Tejaswini Group Report
Riya
Pas encore d'évaluation
Voice Communication With Computers (VanNostrand) (1993)
Document342 pages
Voice Communication With Computers (VanNostrand) (1993)
Andrea Matti
Pas encore d'évaluation
Archivo - 01 (Outra Cópia)
Document5 pages
Archivo - 01 (Outra Cópia)
SRT MLops
Pas encore d'évaluation
V2S: Voice To Sign Language Translation System For Malaysian Deaf People
Document10 pages
V2S: Voice To Sign Language Translation System For Malaysian Deaf People
Anca Dragoi
Pas encore d'évaluation
Archivo - 01 (Cópia)
Document5 pages
Archivo - 01 (Cópia)
SRT MLops
Pas encore d'évaluation
Voice Recognition Technology 1-1
Document14 pages
Voice Recognition Technology 1-1
Kola Keerthana
Pas encore d'évaluation
Sign Language Recognition System Using Machine Learning
Document6 pages
Sign Language Recognition System Using Machine Learning
IJRASETPublications
Pas encore d'évaluation
Dissertation Speech Recognition
Document6 pages
Dissertation Speech Recognition
PaySomeoneToWriteAPaperForMeFargo
100% (1)
Dreu Paper
Document5 pages
Dreu Paper
api-330056374
Pas encore d'évaluation
Synthesised Voice
Document4 pages
Synthesised Voice
gbv8rcfq
100% (1)
Voice Recognition Thesis Topic
Document8 pages
Voice Recognition Thesis Topic
sandysimonsenbillings
100% (2)
Personal Voice Assistant in Python
Document14 pages
Personal Voice Assistant in Python
Movies Villa
Pas encore d'évaluation
Research Papers On Voice Recognition System
Document4 pages
Research Papers On Voice Recognition System
rtggklrif
100% (1)
Book Reading System For BlindPeople
Document60 pages
Book Reading System For BlindPeople
yashvant
Pas encore d'évaluation
Project New
Document2 pages
Project New
AAKASH
Pas encore d'évaluation
Building A Data Corpus For Audio-Visual Speech Recognition: March 2012
Document6 pages
Building A Data Corpus For Audio-Visual Speech Recognition: March 2012
Ahsan Thoriq
Pas encore d'évaluation
Literature Review On Speech Recognition System
Document4 pages
Literature Review On Speech Recognition System
ea7e9pm9
100% (1)
Accessible Technology For Interactive Systems: A New Approach To Spoken Language Research
Document5 pages
Accessible Technology For Interactive Systems: A New Approach To Spoken Language Research
Candida Noronha
Pas encore d'évaluation
AI With Python Â - Speech Recognition
Document10 pages
AI With Python Â - Speech Recognition
Chandu Chandrakanth
Pas encore d'évaluation
Individual Project - Mason Leary
Document15 pages
Individual Project - Mason Leary
api-273094620
Pas encore d'évaluation
Amharic ASR Project Proposal
Document7 pages
Amharic ASR Project Proposal
Addis Belayhun
Pas encore d'évaluation
Speech Recognition Report
Document20 pages
Speech Recognition Report
Ramesh k
Pas encore d'évaluation
Text To Speech
Document5 pages
Text To Speech
Abdul Rehaan
Pas encore d'évaluation
Mini Project
Document19 pages
Mini Project
oopmbcoe
Pas encore d'évaluation
SPEECH
Document8 pages
SPEECH
sam
Pas encore d'évaluation
Natural Language User Interface: Fundamentals and Applications
D'Everand
Natural Language User Interface: Fundamentals and Applications
Fouad Sabry
Pas encore d'évaluation
Natural Language Processing, The Jewel On The Crown: Abstract
Document5 pages
Natural Language Processing, The Jewel On The Crown: Abstract
Kevin Zhang
Pas encore d'évaluation
Contenido
Document5 pages
Contenido
Rei
Pas encore d'évaluation
A Review of Conversational System Framework - Final - Submitted
Document12 pages
A Review of Conversational System Framework - Final - Submitted
dani wafa
Pas encore d'évaluation
v1 Covered
Document32 pages
v1 Covered
mrunal shethiya
Pas encore d'évaluation
Radha Govind Engineering College, Meerut
Document11 pages
Radha Govind Engineering College, Meerut
akanksha91
Pas encore d'évaluation
Speech Recognition As Emerging Revolutionary Technology
Document4 pages
Speech Recognition As Emerging Revolutionary Technology
bbaskaran
Pas encore d'évaluation
JETIR1902381
Document4 pages
JETIR1902381
Vinay Singh
Pas encore d'évaluation
Key Application: - Audrey System - The First Speech Recognition System Introduced by Bell Laboratories in 1952
Document8 pages
Key Application: - Audrey System - The First Speech Recognition System Introduced by Bell Laboratories in 1952
sam
Pas encore d'évaluation
Features: Digital Assistant
Document8 pages
Features: Digital Assistant
sam
Pas encore d'évaluation
Introduction To Probability and Random Signals
Document139 pages
Introduction To Probability and Random Signals
prapun
100% (9)
Cruseoprocessor 100111224228 Phpapp02
Document17 pages
Cruseoprocessor 100111224228 Phpapp02
balajis_empathy620
Pas encore d'évaluation
Optical Computing Technology
Document16 pages
Optical Computing Technology
balajis_empathy620
Pas encore d'évaluation
The Automobile: Where It's Been
Document24 pages
The Automobile: Where It's Been
balajis_empathy620
Pas encore d'évaluation
r05220205 Control Systems
Document9 pages
r05220205 Control Systems
Srinivasa Rao G
100% (3)
Final Exam Ap Psyc 2018
Document2 pages
Final Exam Ap Psyc 2018
api-391411195
Pas encore d'évaluation
Middle Life Cycle Numbers
Document3 pages
Middle Life Cycle Numbers
Irina Cebanu
Pas encore d'évaluation
Three Aspects of Interpersonal Trust
Document18 pages
Three Aspects of Interpersonal Trust
Cts Kenny
Pas encore d'évaluation
245 Q4 AR Extra Practice
Document9 pages
245 Q4 AR Extra Practice
KausarAtique
Pas encore d'évaluation
English 219 Project 3 - Recommendation Report
Document4 pages
English 219 Project 3 - Recommendation Report
api-251582078
Pas encore d'évaluation
Skinner and His Contributions
Document29 pages
Skinner and His Contributions
Jorge Andrés Gaitán Bohórquez
Pas encore d'évaluation
Shigeru Ban
Document1 page
Shigeru Ban
Karen Aguilar
Pas encore d'évaluation
Ubd Lesson Plan Template
Document4 pages
Ubd Lesson Plan Template
Kristine Yeckley-Torres
Pas encore d'évaluation
The Importance of Leadership and Teamwork in School
Document7 pages
The Importance of Leadership and Teamwork in School
Gumunnan Raja
Pas encore d'évaluation
Ethics and Liability Slides Notes
Document134 pages
Ethics and Liability Slides Notes
Sudin Pradhan
Pas encore d'évaluation
The Phonological History of English Consonant Clusters Is Part of The Phonological History of The English Language
Document6 pages
The Phonological History of English Consonant Clusters Is Part of The Phonological History of The English Language
Maria Arce
Pas encore d'évaluation
Forces Operating Within A Group in Social Interaction
Document15 pages
Forces Operating Within A Group in Social Interaction
JISHA
0% (1)
Reflexive Monism
Document2 pages
Reflexive Monism
ruin_2832
Pas encore d'évaluation
13 Satanic Rules of Business
Document2 pages
13 Satanic Rules of Business
Robert Ing
100% (1)
Effective Climate Communication in An Organisation
Document14 pages
Effective Climate Communication in An Organisation
Dulcet Lyrics
Pas encore d'évaluation
Bruce Tuckman Theory
Document3 pages
Bruce Tuckman Theory
Roel Asi
Pas encore d'évaluation