Académique Documents
Professionnel Documents
Culture Documents
Imprecise Data
Automobile
All
Sedan Truck
Location
All
West East
CA TX NY MA
Measures, Facts, and Queries Auto = Truck
Automobile Loc = East
ALL
SUM(Repair) = ?
Auto = F150 Sedan Truck
p2
East
p1
NY
ALL
Location
p8 p6 p4
TX
p7 p5
West
p3 Cell
CA
Extend
Extend the
the OLAP
OLAP model
model to
to handle
handle data
data
ambiguity
ambiguity
Imprecision
Imprecision
Uncertainty
Uncertainty
Imprecision
Automobile ALL
p11 p9
East
NY
p1
ALL
Location
p8 p6 p10 p4
TX
p7 p5
West
p3
CA
Representing Imprecision using Dimension
Hierarchies
F150 Sierra
p3 p4
We
We propose
propose desiderata that
that enable
MA
desiderata
p5 enable
appropriate
appropriate definition
definition of
of query
query
East
semantics
semantics for
for imprecise
imprecise data
data
NY
p1 p2
Desideratum I: Consistency
Truck Consistency
specifies the
F150 Sierra relationship
p3 p4 between answers
MA
p5 to related queries
on a fixed data
East
set
NY
p1 p2
Desideratum II: Faithfulness
Data Set 1 Data Set 2 Data Set 3
F150 Sierra F150 Sierra F150 Sierra
p5 p5 p5
MA
MA
MA
p3 p4 p3 p4 p3 p4
NY
NY
NY
p1 p2 p1 p2 p1 p2
MA
p3 p4
NY
p1 p2
F150 Sierra
F150 Sierra
MA
p3
p5 p4
w1
MA
p3
p4
p5 w4
w2
NY
w3 p2
NY
p2 p1
p1
Possible
MA
MA
p4 p5 p4
Worlds p5
[Kripke63,…]
p3 p3
NY
NY
p2 p2
p1 p1
Possible Worlds Query Semantics
p3 p4
4 3 F150 NY 150 0.4
East
p1 p2
6 5 F150 MA 100 0.5
Measure Correlation
Ignored Ignored Used
Correlation
Dimension
Uniform
Count EM
Used
Results on Query Semantics
APPROXIMATE AVERAGE
E[SUM] / E[COUNT] instead of
E[SUM/COUNT]
simpler and more efficient
satisfies consistency
extends to aggregation operators for
uncertain measures
Uncertainty