and we are moving from the visual paradigm to the voice paradigm. The voice browser is the technology for entering this paradigm. A voice browser is a device that interprets a (voice) markup language and is capable of generating voice output and/or interpreting voice input, and possibly other input/output modalities.
verbally present to the user as well as when to present each piece of information.
Speech Recognition
Speech Synthesis
VoiceXML
Speech Synthesis, Speech Recognition, Speech Grammars, Semantic Interpretation, Stochastic Language Models
telephony applications, where users are restricted to voice and DTMF (touch tone) input.
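As a rough illustration of a telephony dialog that accepts both voice and DTMF input, the Python sketch below assembles a minimal VoiceXML 2.0 document. The form id, field name, and prompt wording are hypothetical, and the built-in "digits" field type is assumed to be supported by the target voice browser.

```python
import xml.etree.ElementTree as ET

VXML_NS = "http://www.w3.org/2001/vxml"


def build_pin_dialog() -> str:
    """Return a small VoiceXML document that asks the caller for a PIN."""
    vxml = ET.Element("vxml", {"version": "2.0", "xmlns": VXML_NS})
    # Hypothetical form and field names, chosen only for this example.
    form = ET.SubElement(vxml, "form", {"id": "login"})
    # The built-in "digits" type lets the caller speak the digits
    # or key them in with touch tones (DTMF).
    field = ET.SubElement(form, "field", {"name": "pin", "type": "digits"})
    ET.SubElement(field, "prompt").text = "Please say or key in your PIN."
    # <filled> runs once the field has received a value.
    filled = ET.SubElement(field, "filled")
    ET.SubElement(filled, "prompt").text = "Thank you."
    return ET.tostring(vxml, encoding="unicode")


if __name__ == "__main__":
    print(build_pin_dialog())
```

A voice browser would fetch a document like this from the web server in the same way a visual browser fetches an HTML page.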
[Figure: architecture diagram. Labels: User, Browser (text.html), text.vxml, Internet, Web Server.]
The specification defines a markup language for prompting users via a combination of prerecorded speech, synthetic speech and music. We can select voice characteristics (name, gender and age) and the speed, volume, pitch, and emphasis. There is also provision for overriding the synthesis engine's default pronunciation.
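The controls described for the synthesis markup (prerecorded audio, voice characteristics, speed, volume, pitch, emphasis, and pronunciation overrides) can be sketched in Python by building a small SSML 1.0 prompt. The file name welcome.wav, the prompt text, and the attribute values are illustrative assumptions, not taken from the slides.

```python
import xml.etree.ElementTree as ET

SSML_NS = "http://www.w3.org/2001/10/synthesis"


def build_prompt() -> str:
    """Return an SSML prompt mixing prerecorded and synthetic speech."""
    speak = ET.Element(
        "speak",
        {"version": "1.0", "xmlns": SSML_NS, "xml:lang": "en-US"},
    )
    # Prerecorded audio; the element text is spoken if the file cannot be played.
    ET.SubElement(speak, "audio", {"src": "welcome.wav"}).text = "Welcome."
    # Voice characteristics: gender and age (a name could be given as well).
    voice = ET.SubElement(speak, "voice", {"gender": "female", "age": "30"})
    # Speed, volume, and pitch are set on <prosody>.
    prosody = ET.SubElement(
        voice, "prosody", {"rate": "slow", "volume": "loud", "pitch": "high"}
    )
    prosody.text = "Your balance is "
    emphasis = ET.SubElement(prosody, "emphasis", {"level": "strong"})
    emphasis.text = "forty dollars"
    emphasis.tail = ". "
    # Override the engine's default pronunciation of an abbreviation.
    sub = ET.SubElement(voice, "sub", {"alias": "World Wide Web Consortium"})
    sub.text = "W3C"
    return ET.tostring(speak, encoding="unicode")


if __name__ == "__main__":
    print(build_prompt())
```

How closely a given engine honors each attribute varies, so the values here should be read as requests to the synthesizer rather than guarantees.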
encourage the user to answer in a form that matches context-free grammar rules. Speech grammars allow authors to specify rules covering the sequences of words that users are expected to say in particular contexts. These contextual clues allow the recognition engine to focus on likely utterances, improving the chances of a correct match.
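To make the idea of a speech grammar concrete, here is a minimal Python sketch of a context-free grammar and a matcher that checks whether an utterance is covered by its rules. The rule names and vocabulary (ORDER, SIZE, DRINK) are invented for illustration, and a real recognizer works on audio rather than on typed words.

```python
# Nonterminals are upper case; anything else is a literal word.
GRAMMAR = {
    "ORDER": [["i", "want", "a", "SIZE", "DRINK"], ["SIZE", "DRINK"]],
    "SIZE": [["small"], ["large"]],
    "DRINK": [["coffee"], ["tea"]],
}


def match(symbol, words, start, grammar):
    """Return every position at which `symbol` can end when it begins at `start`."""
    if symbol not in grammar:  # terminal: must equal the next word
        if start < len(words) and words[start] == symbol:
            return {start + 1}
        return set()
    ends = set()
    for alternative in grammar[symbol]:
        positions = {start}
        for part in alternative:
            positions = {
                end for pos in positions for end in match(part, words, pos, grammar)
            }
            if not positions:
                break  # this alternative cannot match
        ends |= positions
    return ends


def accepts(utterance, grammar=GRAMMAR, root="ORDER"):
    """True if the whole utterance is derivable from the root rule."""
    words = utterance.lower().split()
    return len(words) in match(root, words, 0, grammar)


if __name__ == "__main__":
    print(accepts("I want a large tea"))
    print(accepts("coffee large"))
```

Because the grammar lists only a handful of word sequences, the recognizer has far fewer candidates to choose between than it would with unconstrained speech. This simple matcher assumes the grammar has no left-recursive rules.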
In some applications it is appropriate to use open-ended prompts ("How can I help?"). In these cases, context-free grammars are of little use.
The solution is to use a stochastic language model. Such models specify the probability that one word occurs following certain others. The probabilities are computed from a corpus of utterances collected from many users.
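One simple kind of stochastic language model is a bigram model, which estimates the probability of a word given the word just before it. The Python sketch below computes those estimates from a toy corpus. The utterances are made up for illustration, and practical models use far more data along with smoothing for unseen word pairs.

```python
from collections import Counter, defaultdict


def train_bigrams(utterances):
    """Count how often each word follows each other word."""
    counts = defaultdict(Counter)
    for utterance in utterances:
        # <s> and </s> mark the start and end of an utterance.
        words = ["<s>"] + utterance.lower().split() + ["</s>"]
        for previous, following in zip(words, words[1:]):
            counts[previous][following] += 1
    return counts


def probability(counts, previous, following):
    """Estimate P(following | previous) as a relative frequency."""
    followers = counts.get(previous, Counter())
    total = sum(followers.values())
    return followers[following] / total if total else 0.0


if __name__ == "__main__":
    corpus = ["i need help", "i need a refund", "i want help"]
    model = train_bigrams(corpus)
    print(probability(model, "i", "need"))
    print(probability(model, "need", "help"))
```

In this toy corpus "need" follows "i" in two of the three utterances, so the estimate for that pair is 2/3.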
speech grammar, building a parse tree as a byproduct. There are two approaches to harvesting semantic results from the parse tree:
1. Annotating grammar rules with semantic interpretation tags.
2. Representing the result in XML.
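The two approaches can be sketched together in Python: words in the grammar carry semantic tags, and the collected result is then written out as XML. This is a simplified stand-in that spots tagged words instead of walking a real parse tree, and the slot names, tag values, and the result element are hypothetical, not the syntax of any W3C specification.

```python
import xml.etree.ElementTree as ET

# Each slot maps a recognized word to the semantic value attached to it.
TAGGED_RULES = {
    "size": {"small": "S", "large": "L"},
    "drink": {"coffee": "COFFEE", "tea": "TEA"},
}


def interpret(utterance):
    """Collect the semantic value of every tagged word in the utterance."""
    result = {}
    for word in utterance.lower().split():
        for slot, tagged_words in TAGGED_RULES.items():
            if word in tagged_words:
                result[slot] = tagged_words[word]
    return result


def to_xml(result):
    """Represent the interpretation as a small XML document."""
    root = ET.Element("result")
    for slot, value in result.items():
        ET.SubElement(root, slot).text = value
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    print(to_xml(interpret("I want a large coffee")))
    # prints: <result><size>L</size><drink>COFFEE</drink></result>
```

The application then works with the values L and COFFEE and does not need to know which exact words the user spoke.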
when the page is accessed. May or may not produce voice feedback.
browsing.
Lower space requirements.
Portable voice browsers can also be implemented.
A practical interface for functionally blind users.
Users can browse the web while keeping their hands and eyes free for other tasks.
Voice browsing will become visual (multimodal).
Can be integrated into an OS.
Can be integrated into every application.
and we are moving from the visual paradigm to the voice paradigm. The voice browser is the technology for entering this paradigm. A voice browser is a device that interprets voice input and generates voice output.