Runtime Environment
Installation and Configuration of ASR
and TTS
For OCMP Release 2.4
Fourth Edition
Notice
The information contained in this document is subject to change without
notice.
Hewlett-Packard makes no warranty of any kind with regard to this
material, including, but not limited to, the implied warranties of
merchantability and fitness for a particular purpose. Hewlett-Packard
shall not be liable for any errors contained herein, or for incidental or
consequential damages in connection with the furnishing, performance
or use of this material.
This document contains proprietary information which is protected by
copyright. All rights reserved. No part of this document may be
photocopied, reproduced, or translated into another language without
the prior written consent of Hewlett-Packard Company.
2005 Copyright Hewlett-Packard Development Company, L.P.
Reproduction, adaptation or translation without prior written
permission is prohibited, except as allowed under the copyright laws.
Printing History
First Edition
Second Edition
Third Edition
Fourth Edition
Trademarks
The following are trademarks or registered trademarks of
Hewlett-Packard: HP-UX
Microsoft Windows and Microsoft Internet Explorer are either
registered trademarks or trademarks of Microsoft Corporation in the
United States and/or other countries.
Java and all Java-based marks are trademarks or registered trademarks
of Sun Microsystems, Inc. in the United States and other countries.
Hewlett-Packard Company
OpenCall Business Unit
38053 GRENOBLE Cedex 9
France
1. Introduction
  Scope
  System Overview
  Resource Allocation and Management
  ASR and TTS Language
  Planning and Dimensioning
  Pre-requisites
  Related Information
  Abbreviations
2. Installation
  ASR Installation
  ScanSoft OpenSpeech Recognizer 3.0.1 MRCP ASR
  Nuance MRCP ASR
  TTS Installation
  ScanSoft Telecom RealSpeak MRCP TTS
  Nuance Vocalizer 3.0 MRCP TTS
3. Configuration Overview
  Overview
4. Configuring OCMP-Base for ASR and TTS
  Creating Audio Services and Audio Servers
  Creating an Audio Service
  Creating an Audio Server
5. Configuring MRCP ASR
  Non-SRGS/XML Grammars, Precompiled Grammars and Grammars Referenced by File URIs
  Configuring OSR 3 ASR
  Result Structure Configuration
  Semantic Interpretation in OSR 3 Grammars
  Audio Logging
  Starting OSR 3 MRCP
  Known issues
  Determining the Language in OSR 3
  Language Mapping
Introduction
This section provides an introduction to an OCMP system running ASR
and TTS.
Chapter 1
Scope
This document describes how to install, configure, and troubleshoot
Automatic Speech Recognition (ASR) and Text-To-Speech (TTS) for the
HP OpenCall Media Platform (OCMP). The ASR and TTS installations
target Microsoft Windows servers. The OCMP supports ASR and TTS
through the Media Resource Control Protocol (MRCP), so any ASR or TTS
engine that supports MRCP should work in principle. However,
implementations may differ from vendor to vendor. Interoperability tests
between the OCMP and the products described in this document have
therefore been performed to confirm that they work together.
Verification of ASR and TTS products is ongoing; please contact HP for
the most recent information on supported products.
Installation and configuration are described for the following MRCP
server products:
Nuance Vocalizer 3.0 - using Nuance MRCP Server v1.0 service pack 7
This guide should be used when installing and configuring ASR and TTS
in a standard OCMP system that includes the OCMP Browser Runtime
Environment (OCMP-BRE). However, it can also be used when setting
up a system that includes only the OCMP Base Platform (OCMP-Base)
with ASR and TTS. In this case, the sections relating to the OCMP-BRE
systems can be disregarded.
System Overview
The OCMP-BRE is an application implemented on top of the
OCMP-Base platform connecting via the CCAPI. When using the
OCMP-BRE, the VoiceXML interpreter (OCMP-VoiceXML), the Resource
Manager (OCMP-RM) and the Operation and Maintenance (OCMP-OM)
systems are available.
In the OCMP-OM, the number of ASR and TTS resources allocated to a
service can be defined. The OCMP-RM ensures that the allocations
defined in the OCMP-OM are respected. The maximum number of ASR
and TTS resources that can be used is limited by the OCMP-BRE
license. However, the primary task of the OCMP-RM and OCMP-OM is
to allocate and manage resources, not to police licenses.
Most ASR and TTS products provide their own license enforcement
functionality.
Figure 1-1 shows an example of a full OCMP system.
Figure 1-1. Example of a full OCMP system. Labels in the figure:
Telephony, ISUP, OCMP Base, CCXML interpreter, OCMP-OM, OCMP VoiceXML,
OCMP-RM, MRCP, TTS, ASR, http, CCXML document server, VoiceXML
document server.
The OCMP-Base connects the audio streams from the caller and the
MRCP server so that the result of TTS can be played back on the
telephone line and the incoming audio can be processed by the ASR
in the MRCP server.
The ASR results are returned from the MRCP server to the
OCMP-VoiceXML node via the OCMP-Base.
Resource Allocation and Management
ASR and TTS Language
Planning and Dimensioning
Pre-requisites
The pre-requisites for the installation of the ASR and TTS components
are described in the vendor documentation. Generally, Windows 2000
with Service Pack 3 is required. Also check whether a specific type of
processor is required.
Related Information
The following documents contain information that may be useful to read
along with this document. These documents are referenced throughout
the rest of this guide. The vendor specific documentation is provided by
the ASR and TTS vendors and is generally supplied with the software
installation.
Abbreviations
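Abbreviation      Meaning
OCMP-Base         OCMP Base Platform
OCMP-BRE          OCMP Browser Runtime Environment
OCMP-VoiceXML     VoiceXML interpreter
OCMP-RM           Resource Manager
OCMP-OM           Operation and Maintenance
MRCP              Media Resource Control Protocol
<configdir>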
Installation
This section covers installation of the MRCP, ASR, and TTS servers.
Chapter 2
ASR Installation
This section covers the installation of the MRCP and ASR servers. The
procedure varies slightly depending on which MRCP server is used.
ScanSoft OpenSpeech Recognizer 3.0.1 MRCP ASR
OSR should preferably be installed before the MRCP Server software.
Previous versions of OSR and SWMS should be uninstalled prior to
installing the software. See the OSR 3 Installation Guide text file for
details on the installation. To install:
Step 1. Run the Setup.exe file provided in the software installation for OSR.
The OSR server component should preferably not be installed: in the
GUI installation, click the plus sign for the OSR feature and deselect the
server. OSR 3 and SWMS currently run only in all-in-one (standalone)
mode. Whether to install the license manager feature is a system
planning issue; see the OpenSpeech Recognizer Licensing Handbook.
If a license manager has already been installed during a
RealSpeak TTS installation, do not install it again.
Step 2. Install the target language: run Setup.exe from the Language supplied
with the software installation.
Step 3. Update /bin/srinet.dll in the OSR installation with the version
which is provided separately.
NOTE
Step 4. Run the SWMS installer by launching the Setup.exe file provided in the
software installation. In the first menu, press Install Products, then
SpeechWorks Media Server.
In the SWMS distribution, a web server can be installed to provide
access to stored wave forms. Consider any existing web server
installations before accepting this installation, and consider potential
port clashes.
Known Issue:
"Error - Ordinal not located in DLL file": An error message may pop up
indicating that an ordinal could not be located in a dynamic link library.
If this occurs, please refer to SpeechWorks MediaServer 3.1.2 - Getting
Started Guide, page 45 for instructions.
Nuance MRCP ASR
Read the Nuance MRCP document for information on how to prepare the
installation for languages other than US English. To install:
Step 1. Install the Nuance System v8.5.0 SP7, including the Nuance MRCP
Server: run the Setup.exe file provided in the software installation. The
Nuance 8.5.0 installation sets the environment variables %NUANCE% and
%MRCP%.
Step 2. Install any required language packages. American English is installed as
the default; if any other languages are required, they must be installed
manually. Uncompress the appropriate language module packages and
extract to the path the %NUANCE% environment variable is set to.
Step 3. Verify that the default MRCP package has compiled correctly after
installation by checking that the directory %MRCP%/mrcp-en-us has
been created.
Install the components following the instructions in the Nuance System
Installation Guide and Nuance MRCP Server: Users Guide.
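The checks described in Steps 1 and 3 can be scripted. The following Python sketch is illustrative only: it assumes that the NUANCE and MRCP environment variables hold directory paths, and the function name check_nuance_install is hypothetical (it is not part of any Nuance tool).

```python
import os
from pathlib import Path


def check_nuance_install(env=None):
    """Return a list of problems found; an empty list means the checks passed."""
    env = os.environ if env is None else env
    problems = []
    # Step 1: the installer is expected to set both variables.
    for name in ("NUANCE", "MRCP"):
        if not env.get(name):
            problems.append(f"environment variable %{name}% is not set")
    # Step 3: the default MRCP package directory should exist.
    mrcp_root = env.get("MRCP")
    if mrcp_root and not (Path(mrcp_root) / "mrcp-en-us").is_dir():
        problems.append(f"directory {mrcp_root}/mrcp-en-us was not created")
    return problems


if __name__ == "__main__":
    for problem in check_nuance_install():
        print("WARNING:", problem)
```

Passing a dictionary instead of the real environment makes it possible to try the check without a Nuance installation.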
TTS Installation
This section covers the installation of the MRCP and TTS servers. Refer
to the supplier's documentation for more detail.
ScanSoft Telecom RealSpeak MRCP TTS
Follow this procedure to install:
Step 1. Install Telecom RealSpeak MRCP. If a license manager has already been
installed with OSR, this feature should not be installed again for TTS.
Whether the license manager feature should be installed or not is a
system planning issue: see the OpenSpeech Recognizer Licensing
Handbook.
Step 2. Install one or more RealSpeak voices.
Step 3. If you are also installing ScanSoft OSR 3 ASR, do so before installing the
MRCP Server.
Step 4. Install SWMS. Installing this component creates a Windows service
called SpeechWorks Media Server that can be used to start and stop the
server.
Nuance Vocalizer 3.0 MRCP TTS
Follow this procedure to install:
Step 1. Install the Nuance System v8.5.0 SP7, including the Nuance MRCP
Server. Choose typical when prompted. The installation opens a
command line console that compiles a package needed for the MRCP
server. Do not press finish in the main wizard before this console has
finished; otherwise the package may have to be compiled manually
later. When installation is complete, a service called Nuance Watcher
Daemon has been created. This service can be used to start and stop
the server.
Step 2. Install Nuance Vocalizer 3.0.
Step 3. Install at least one Vocalizer voice.
Configuration Overview
This section gives an overview of the configuration needed to run the
OCMP with ASR and TTS.
Chapter 3
Overview
The following must be configured in an OCMP system in order to run
ASR and TTS:
MRCP servers. The requests are made in a round robin fashion. Note
that the properties shown in the example ocmp_provider.xml file are
not complete XML.
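Round-robin distribution of requests over several MRCP servers can be pictured as follows. This Python sketch is purely illustrative and does not reproduce the OCMP implementation; the server names are hypothetical.

```python
from itertools import cycle


def round_robin(servers):
    """Return an iterator that repeats the servers in a fixed order."""
    if not servers:
        raise ValueError("at least one MRCP server is required")
    return cycle(servers)


# Hypothetical server names, used only for illustration.
picker = round_robin(["mrcp-a", "mrcp-b", "mrcp-c"])
first_five = [next(picker) for _ in range(5)]
print(first_five)  # ['mrcp-a', 'mrcp-b', 'mrcp-c', 'mrcp-a', 'mrcp-b']
```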
Figure 3-1
Chapter 4
To be more descriptive, you can also include the ASR or TTS product
name in the Audio Service name.
The value of the property ocmp.asr.mrcpserver.<id> in the
OCMP-BRE configuration file <configdir>/ocmp_provider.xml must
match the name of the Audio Service. See the sections "Configuring
OCMP-BRE for ASR" on page 48 and "Configuring OCMP-BRE for TTS"
on page 50 for information on defining this type of property.
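A mismatch between an ocmp.asr.mrcpserver.<id> value and the Audio Service names is easy to overlook. The Python sketch below shows one way to cross-check the two once the property values have been read into a dictionary. The function name and all sample values are hypothetical, and how the properties are read from ocmp_provider.xml is deliberately left out.

```python
ASR_PREFIX = "ocmp.asr.mrcpserver."


def unmatched_asr_properties(properties, audio_service_names):
    """Return the ocmp.asr.mrcpserver.<id> properties whose value is not
    the name of a known Audio Service."""
    known = set(audio_service_names)
    return {
        name: value
        for name, value in properties.items()
        if name.startswith(ASR_PREFIX) and value not in known
    }


# Hypothetical values, for illustration only.
properties = {
    "ocmp.asr.mrcpserver.1": "MRCP_OSR",
    "ocmp.asr.mrcpserver.2": "MRCP_Nuanse",  # misspelled on purpose
}
mismatches = unmatched_asr_properties(properties, ["MRCP_OSR", "MRCP_Nuance"])
print(mismatches)  # {'ocmp.asr.mrcpserver.2': 'MRCP_Nuanse'}
```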
rtsp://<ASRservername>:4900/media/speechrecognizer is the
default for OSR 3
rtsp://<TTSservername>:4900/media/speechsynthesizer for
RealSpeak running under SWMS
The actual port number should be set according to the MRCP server
configuration.
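The two URIs above share one pattern. As a minimal sketch, the following Python helper assembles such a URI from its parts. The function name and the host name are hypothetical, and the /media/ path is simply the one shown above for OSR 3 and for RealSpeak under SWMS; other MRCP servers may use a different path.

```python
def mrcp_uri(host, resource, port=4900):
    """Build an rtsp URI of the form shown above.

    resource is 'speechrecognizer' (ASR) or 'speechsynthesizer' (TTS).
    """
    if resource not in ("speechrecognizer", "speechsynthesizer"):
        raise ValueError(f"unknown MRCP resource: {resource}")
    return f"rtsp://{host}:{port}/media/{resource}"


# 'asr-host' is a placeholder server name.
print(mrcp_uri("asr-host", "speechrecognizer"))
# rtsp://asr-host:4900/media/speechrecognizer
```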
Step 5. Enter a description.
Step 6. Enter the required number of concurrent ASR sessions (the maximum
number of sessions that the OCMP-Base server should be allowed to
run). Set Pre-Opened Sessions to 0.
NOTE
Step 7. Click Create Server. The server should start automatically. Check that
there are no red lights in the Fault Management View of the OCMP
Platform Monitoring view.
An audio server can be stopped and started by selecting the server from
the list of servers in the Audio Server Management view.
NOTE
If several audio servers are defined for the same audio service, they
should only reference resources of the same type; that is, ASR or TTS
(rtsp URIs).
Chapter 5
Audio Logging
The default configuration for OSR 3 creates logs of the audio data sent to
the recognizer. The audio logs are very useful for development and
tuning but require a lot of disk space. For this reason, it is recommended
to switch off the audio logs for production systems except when
performing tuning. Audio logging is disabled by setting the OSR
configuration parameter swirec_suppress_waveform_logging to 1. For
information on SWMS and OSR text logs, please refer to the section
Logs on page 59.
Starting OSR 3 MRCP
Run %SWISVRSDK%/bin/startserver.bat to start the MRCP server
from the command line, or start the Windows Service SpeechWorks
Media Server. The license service is called OSR Licensing Server.
The default name of the license file is osr.lic; it should be placed in
the %SWISRSDK%/flexlm/license folder.
Optionally, start the Apache web server if it was installed with OSR 3.
Known issues
There is a known problem in the installation of OSR 3 related to the error
message "Error - Ordinal not located in DLL file". This message indicates
that an ordinal could not be located in a dynamic link library. If this
occurs, please refer to the SpeechWorks MediaServer 3.1 Getting Started
Guide for instructions.
Watcher daemon
The Nuance watcher daemon is a process that can start, stop, and
supervise all the Nuance processes in the system. By configuring the
watcher daemon startup script, all the processes needed for an MRCP
ASR system can be started.
In general, the following processes need to be started in the system:
resource-manager
recserver
compilation-server
mrcp-server
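When checking a system, it helps to compare the processes that are actually running against the list above. The Python sketch below does only that comparison. How the list of running processes is obtained (for example, by reading it off the watcher daemon) is left open, and the function name is hypothetical.

```python
REQUIRED_PROCESSES = (
    "resource-manager",
    "recserver",
    "compilation-server",
    "mrcp-server",
)


def missing_processes(running):
    """Return the required processes absent from `running`, in list order."""
    running = set(running)
    return [name for name in REQUIRED_PROCESSES if name not in running]


missing = missing_processes(["recserver", "mrcp-server"])
print(missing)  # ['resource-manager', 'compilation-server']
```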
Language Configuration
To add another language, see the section "Configuring a multilingual
server" in the Nuance MRCP Server: Users Guide.
Unless explicitly set to another language, the Nuance engine uses the
default language package at runtime for grammar compilation and
recognition; it does not determine the language from the xml:lang
attribute in the grammar. The com.hp.mrcp.asrspeechlanguage
property can be used in the VoiceXML page to set the language. For
instance, to set French:
NOTE
The Nuance server still has to be configured for the language in question
as described in the Nuance MRCP Server: Users Guide.
Other Configurations
In the file %MRCP%/mrcp-config:
Verify that the property mrcp.RtspPort is set to match the rtsp URL
specified for this Audio Server on the OCMP-Base that receives the
call.
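The port comparison can be automated. The following Python sketch extracts the port from the Audio Server rtsp URL and compares it with the value of mrcp.RtspPort. The function name and the sample host name are hypothetical; reading the value out of mrcp-config is not shown.

```python
from urllib.parse import urlsplit


def rtsp_port_matches(audio_server_uri, mrcp_rtsp_port):
    """Return True if the port in the rtsp URI equals mrcp.RtspPort."""
    parts = urlsplit(audio_server_uri)
    if parts.scheme != "rtsp":
        raise ValueError(f"not an rtsp URI: {audio_server_uri}")
    if parts.port is None:
        raise ValueError("the URI does not specify a port")
    return parts.port == int(mrcp_rtsp_port)


# 'asr-host' is a placeholder server name.
print(rtsp_port_matches("rtsp://asr-host:4900/media/speechrecognizer", "4900"))
```

The second argument is accepted as a string because configuration values are typically read as text.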
Language Mapping
Nuance provides the file %MRCP%\mrcp-lang-map to map arbitrary
language codes to installed language resources. The encoding to be used
can also be defined in this file. For details, refer to the section "Using
mrcp-lang-map" in the Nuance MRCP Server 1.0 User's Guide.
Logging
For information on Nuance text logs, please refer to the section Logs on
page 59.
NOTE
The above section only defines the Arabic particulars. The Nuance
MRCP Server still has to be prepared for the language resource in the
regular way as described in the section "Configuring a multilingual
server" in the Nuance MRCP Server: Users Guide.
Chapter 6
NOTE
A line starting the Nuance license manager for the MRCP Server:
nlm <licenseFilePath>
A line starting the Nuance license manager for the Vocalizer:
nlm <license>
for example, nlm %VOCALIZER%/license.txt
There should be a line for each mode starting the Vocalizer itself:
%VOCALIZER%\bin\win32\vocalizer.exe -voice persona name
-encoding <alaw-or-mulaw> tts.Port=32323 -dictionary_port
22552 -num_channels <NumChannels> -text_type SSML
lm.Addresses=localhost:8471 -voices_from_disk
config.LogFileRootDir=%MRCP%/mrcp/logs
Verify that the property mrcp.RtspPort is set to match the rtsp URL
specified for this Audio Server on the OCMP-Base that receives the
call.
Chapter 7
Troubleshooting
This section provides information on how to troubleshoot an OCMP
system running ASR and TTS.
Chapter 8
General Troubleshooting
If ASR or TTS does not work, check the items below for some common
types of errors. The items are in no particular order, since the error may
be a configuration, installation, or application error.
Do calls that do not require ASR or TTS work? If they do, and they run
on the same set of OCMP servers, the error is specific to ASR and TTS
usage.
Is the MRCP server started? If the ASR or TTS runs in client-server
mode, also check that the back-end server is started. For instance,
in the case of Nuance, the watcher web interface at
http://<host>:7080 can be used to verify that all the required
processes are started.
Configuration Dependencies - Checklist
<property>
<name>ocmp.tts.mode.RealSpeak_Jennifer</name>
<value>ScanSoft,MRCP_RealSpeak,Jennifer,female,adult,en,MRCP</value>
</property>
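The value of an ocmp.tts.mode property is a comma-separated list. To make the individual fields easier to compare against the rest of the configuration, the value can be split as in the Python sketch below. This is an illustration under stated assumptions: the field names are guesses based on the example value above, not names taken from the OCMP documentation, and the sketch assumes every value has the same seven fields as the example.

```python
from typing import NamedTuple


class TtsMode(NamedTuple):
    # Field names are guesses based on the example value; they are
    # hypothetical and not taken from the OCMP documentation.
    vendor: str
    audio_service: str
    voice: str
    gender: str
    age: str
    language: str
    protocol: str


def parse_tts_mode(value):
    """Split an ocmp.tts.mode.* value into its comma-separated fields."""
    # strip() also removes line breaks introduced by wrapping the value.
    fields = [field.strip() for field in value.split(",")]
    if len(fields) != len(TtsMode._fields):
        raise ValueError(
            f"expected {len(TtsMode._fields)} fields, got {len(fields)}"
        )
    return TtsMode(*fields)


mode = parse_tts_mode("ScanSoft,MRCP_RealSpeak,Jennifer,female,adult,en,\nMRCP")
print(mode.voice, mode.language, mode.protocol)  # Jennifer en MRCP
```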
must match one of the numbers configured in the OCMP-Base. In
http://<ocmp-base-host>:4242 | Services, press View/Modify
for VoiceXML_enabler:
Service Addresses: 1234567
Logs
The different MRCP servers, as well as ASR and TTS components,
provide a number of logs which are valuable for finding errors.
The OCMP platform provides the following sources. The level of detail
depends on configurations in the platform; see the HP OpenCall Media
Platform - BRE Operations Guide for more information: