Académique Documents
Professionnel Documents
Culture Documents
discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/266629722
READS
201
4 authors, including:
Luc Lamontagne
Francois Laviolette
Laval University
Laval University
SEE PROFILE
SEE PROFILE
Richard Khoury
Lakehead University Thunder Bay Campus
39 PUBLICATIONS 63 CITATIONS
SEE PROFILE
ABSTRACT
This paper describes a framework to support the
construction of intelligent virtual assistants (IVAs). An
IVA agent is a software assistant capable of interacting
with a user to support sense-making tasks, to determine
information needs, to provide relevant information and to
improve its performance based on user feedbacks.
Currently, there is no integrated software environment
available to develop such agents. We are exploring how
we can integrate machine learning and natural language
processing technologies, available as open source
software, to support the construction of intelligent virtual
assistants. The framework relies on a combination of
question answering (Q/A), information extraction (IE) and
user modeling components. In this paper, we present an
overview of the work that is being conducted to build a
prototype of the framework.
KEY WORDS
Intelligent virtual assistant, question answering systems,
information extraction, topic modeling, machine learning,
natural language processing.
1. Introduction
The motivation for this research effort is the development
of an Intelligent Virtual Assistant (IVA) capability for
supporting the intelligence community in their
sensemaking tasks. The design of such systems relies on
intelligent personal assistant technology [1], i.e. software
agents capable of performing tasks with minimum
guidance from its users. Intelligent virtual assistants
combine artificial intelligence technologies to support
users, to organize information and to adapt to changing
situations.
The field of intelligent assistants is currently an
active area of research and development with some
products being much publicized:
CALO and PAL were projects lead by the
Stanford Research Institute (SRI) to investigate
how machine learning can help to design
software agents performing in an office
automation environment. Various modules were
developed during this project for categorization
2.
3.
Figure 1. Main components of the IVA framework
The three components of the framework are further
described in the next sections. Moreover, we then explain
some of the interactions taking place among these
components.
3.2
4. Preliminary experiments
We conducted some preliminary experiments early in the
project to determine the baseline performance of the IVA
framework for our military intelligence application
domain.
The annotation component relies on two extraction
technologies. We first evaluated the NER extraction
component that we have selected for the project (Stanford
NER). We obtained accuracy greater than 80% on a
corpus of geopolitical news reports representative of our
military application domain. We estimate that these
results fully meet our expectations. However additional
training could be performed on our domain corpus if
higher levels of accuracy are required for some other
application domains.
We also evaluated the performance of REVERB [6]
as relation extractor for our framework. Experimentations
conducted on our corpus of geopolitical news reports
resulted in an accuracy of approximately 60% of the
relations present in these documents. Moreover, most of
the relations extracted by the system were valid and could
be made to contribution in the framework. This result
suggests that most of the important relations of an
application domain can be identified and exploited as part
of the Q/A process.
(2)
6. Conclusion
This paper gave an overview of the efforts we are
devoting to the development of a framework for building
intelligent virtual assistants. We described the three main
components of it and explained how they are in the
framework. Moreover, we discussed some of the practical
issues we are facing. We have discussed three adaptation
techniques that are being added to the framework in order
to improve its performance. Details were provided on
technical decision made during the implementation.
Future work on this project will include a global
evaluation of the IVA system including the adaptation
scheme presented in Section 5 of the paper. And further
experimentations will also be required to explore how to
Acknowledgement(s)
We would like to thank the Natural Sciences and
Engineering Research Council of Canada (NSERC) and
the Canadian Department of National Defence for the
financial support of the work through their joint DNDNSERC research program.
References
[1] K. Myers, P. Berry, J. Blythe, K. Conley, M.
Gervasio, D. McGuinness, D. Morley, A. Pfeffer, M.
Pollack, & M. Tambe, An Intelligent Personal Assistant
for Task and Time Management, AI Magazine, 28(2),
2007, 47-61.
[2] D. Gouin, V. Lavigne, & A. Bergeron-Guyard,
Human-computer interaction with an intelligence virtual
analyst, KSCO, 2012.
[3] D. Poussart, Future intelligence analysis capability towards a cohesive R&D program definition, Technical
Report, TM 2012-9999, DRDC Valcartier, 2012.
[4] G. Michaud, A. Burkov, M. Brub, L. Lamontagne,
& F. Laviolette, Identification of iVAC capabilities,
Technical report, CR 2011-243, DRDC Valcartier, 2012.
[5] D. Nadeau & S. Sekine, A survey of named entity
recognition
and
classification,
Linguisticae
Investigationes, 30(1), 2007, 3-26.
[6] J. R. Finkel, T. Grenager & C. Manning,
Incorporating Non-local Information into Information
Extraction Systems by Gibbs Sampling, Proceedings of
the 43nd Annual Meeting of the Association for
Computational Linguistics, 2005, 363-370.
[7] M. Marrero, S. Sanchez-Cuadrado, J. Morato & Y.
Andreadakis, Evaluation of Named Entity Extraction
Systems, Research in Computer Science, vol. 41, 2009,
47-58.
[8] M. Banko, M. Cafarella, S. Soderland, M.
Broadhead, & O. Etzioni, Open information extraction
from the Web, Proceedings of IJCAI, 2009, 2670-2676.
[9] A. Fader, S. Soderland, & O. Etzioni, Identifying
relations for open information extraction, Proceedings of
EMNLP, 2011, 1535-1545.
[10] F. Wu & D. S. Weld. Open information extraction
using Wikipedia. Proceedings of the ACL, 2010, 118-127.
[11] M. Mausam, R. Schmitz, R. Bart, S. Soderland, & O.
Etzioni, Open language learning for information
extraction, Proceedings of the 2012 Joint Conference on
Empirical Methods in Natural Language Processing and
Computational Natural Language Learning, 2012, 523534.
[12] D. Gildea & D. Jurafsky, Automatic Labeling of
Semantic Roles, Computational Linguistics, 28(3), 2002,
245-288.
[13] T. Mitsumori, M. Murata, Y. Fukuda, K. Doi & H.
Doi, Semantic Role Labeling Using Support Vector