Académique Documents
Professionnel Documents
Culture Documents
USC Linguistics
2 1 1 8 3 3 2
R(−ed) = I( , , 0) + I( , , ) = .2 ∗ 1 + .8 ∗ 1.56 = 1.45 (5)
10 2 2 10 8 8 8
1 9 4 3 2
R(−s) = I(0, 1, 0) + I( , , ) = .1 ∗ 0 + .9 ∗ 1.53 = 1.38 (6)
10 10 9 9 9
1
Notice immediately, that there is no way to disambiguate the written representation “They read”;
unless, perhaps, the past is more likely to be eventive, and more likely to want a direct object.
1
2 8 4 4
R(will) = I(0, 0, 1) + I( , , 0) = .2 ∗ 0 + .8 ∗ 1 = .8 (7)
10 10 8 8
2 1 1 8 3 3 2
R(be) = I( , , 0) + I( , , ) = .2 ∗ 1 + .8 ∗ 1.56 = 1.45 (8)
10 2 2 10 8 8 8
6 3 2 1 4 1 2 1
R([sing]) = I( , , ) + I( , , ) = .6 ∗ 1.45 + .4 ∗ 1.5 = 1.475 (9)
10 6 6 6 10 4 4 4
<<<< ∩ ∩ ∩∩ >>
will:T will:F
>> <<<< ∩ ∩ ∩∩
4 4
I( , ) = 1 (10)
8 8
2 1 1 6 3 3
R(−ed) = I( , ) + I( , ) = .25 + .75 = 1 (11)
8 2 2 8 6 6
1 7 4 3
R(−s) = I(0, 1) + I( , ) = 0 + .86 = .86 (12)
8 8 7 7
2 1 1 6 3 3
R(be) = I( , ) + I( , ) = .25 + .75 = 1 (13)
8 2 2 8 6 6
5 3 2 3 1 2
R([sing]) = I( , ) + I( , ) = .61 + .34 = .95 (14)
8 5 5 8 3 3
<<<< ∩ ∩ ∩∩ >>
will:T will:F
>> <<<< ∩ ∩ ∩∩
-s:T -s:F
∩ <<<< ∩ ∩ ∩
2
-ed -s w-1 [sing] t(e)
He read. F F - T t(e)<ST
He was hungry. F F was T t(e)<ST
He ran. F F - T t(e)<ST
They liked it T F - F t(e)<ST
He reads. F T - T t(e)∩ST
He is happy F F is T t(e)∩ST
They need it. T F - F t(e)∩ST
They want it. F F - F t(e)∩ST
He will read. F F will T t(e)>ST
They will get it. F F will F t(e)>ST
6 3 3 1 1 2
R(w − 1) = I( , , 0) + I(1, 0, 0) + I(0, 1, 0) + I(0, 0, 1) = .6 (15)
10 6 6 10 10 10
<<<< ∩ ∩ ∩∩ >>
3 2 1 1 3 1 2 1 2
R(w − 1) = I( , , 0) + I(1, 0, 0) + I( , , 0) + I(0, 1, 0) + I(0, 0, 1) (16)
10 3 3 10 10 3 3 10 10
3
Venkataraman2 credits Quinlan with the concept of GainRatio:
Gain(A)
GainRatio(A) = X (18)
−P (v)log2 P (v)
v∈A
1 χ2
• “Don’t use low numbers, especially not zero.”
T otal(row) ∗ T otal(col)
E= (22)
N
‘Expected’ by the “independent hypothesis”3 :
Observed Expected
< ∩ > < ∩ >
- 30 30 0 60 - 24 24 12 60
was 10 0 0 10 was 4 4 2 10
is 0 10 0 10 is 4 4 2 10
will 0 0 20 20 will 8 8 4 20
total 40 40 20 100 total 40 40 20 100
X (O − E)2
χ2 = (23)
E
4
(0 − 4)2 (10 − 4)2 (0 − 2)2 (0 − 8)2 (0 − 8)2 (20 − 4)2
+ + + + + = 125 (25)
4 4 2 8 8 4
for integers:
Γ(n) = (n − 1)! (28)
gamma pdf:
λα α−1 −λx
p(x) = x e (29)
Γ(α)
χ2 :
f◦ 1
α= ,λ = (30)
2 2
f◦
( 12 ) 2 f◦
−1 1 2
2
p(χ ) = ◦ x 2 e− 2 χ (31)
Γ( f2 )
0.2
0.1
0
0 2 4 6 8 10
5
Observed Expected
-s < ∩ > -s < ∩ >
T 0 10 0 10 T 5 5 0 10
F 40 30 0 70 F 35 35 0 70
total 40 40 0 80 total 40 40 0 80
Observed Expected
-s < ∩ > -s < ∩ >
T 0 1 0 1 T .5 .5 0 1
F 4 3 0 7 F 3.5 3.5 0 7
total 4 4 0 8 total 4 4 0 8