Académique Documents
Professionnel Documents
Culture Documents
Introduction - Ontology
An ontology is a formal and explicit specification of a shared conceptualization.
Ontology consists of :
Classes
Properties (Taxonomic and Non-taxonomic)
Individuals
Values
Axioms- used to verify the consistency of ontology.
E.g. sorting algorithm can be considered as an algorithm if and only if it solves a certain Computer Science problem
Problem
Solve
Algorithm
Has
Complexity
Is a
Sorting
Algorithm
2
Noise
Experts have different
Viewpoints
Assumptions
Needs regarding the same domain
Ontology Learning
Rules
Relations
Concept Hierarchy
Concepts
Synonyms
Terms
Contrastive Corpus
Bio Medical
Cricket
Other Domains
Rules
Relations
Concept Hierarchy
Concepts
Synonyms
Terms
Rules
Relations
Concept Hierarchy
Concepts
Synonyms
Terms
Relations
Concept Hierarchy
Concepts
Synonyms
Terms
Terms
Selecting and
Organizing corpora
Corpus Annotation
Selecting Corpora
Select corpora that are good in lexical richness
Occurrence (normalized by length)%
Frequency
of Words
MK
NUS
FAO
1.31
2.23
1.69
2.46
0.08
20.82
0.39
0.55
0.81
0.63
6.49
0.19
0.25
0.39
0.27
2.26
3.09
0.12
0.15
0.26
0.17
0.12
1.82
0.08
0.10
0.19
0.11
1.26
Total
2.11
3.31
3.63
3.66
2.48
33.50
10
Organizing Corpuses
Target domain is iteratively selected
Contrastive Domain
Target Domain
Mikalai
Krapivin
GENIA
Computer
Science
Bio Medical
GENIA
Mikalai
Krapivin
Cricinfo RSS
Bio Medical
Computer
Science
Contrastive Domain
Mikalai Krapivin
Cricinfo RSS
Computer Science
11
Corpus Annotation
GENIA
Cricinfo
Linguistic rules
Domain Weight Calculation for each term t(e.g. processor computational complexity etc.)
GENIA
Cricinfo
ai
arg max (term ) ai
Domain Weight Calculation for each term t(e.g. processor computational complexity etc.)
14
Our approach
Existing approaches
Top
700
52.5%
55%
47%
28%
Evaluation of the best 300 simple terms and best 300 complex terms for the Bio Medical domain
Our approach
Existing approaches
Top
300
62%
80%
55%
32%
15
Conclusion
Our contribution in term extraction for ontology learning
Implemented a mechanism to select corpora and discussed an approach
to organize corpora
16
Questions ?
Thank You
17