Item          Relation   Total “I”   Corpus (Tokens)
I (MAC)       1.13%      37,127      3,300,000
I (BNC/C)     3.28%      132,397     4,022,428
I (COLT)      2.90%      14,868      511,834
I (LTT)       2.60%      323         12,406
I (SCO)       2.26%      2,693       119,079
I (WSC)       2.27%      27,691      1,218,957
I (SEC rec.)  2.42%      1,343       55,561
Table 1: I use in different spoken corpora
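The Relation column in Table 1 appears to be the number of occurrences of I divided by the corpus size in tokens, expressed as a percentage. The following is a minimal sketch under that assumption; the function name `relative_frequency` and the dictionary `table1` are illustrative and do not come from the source.

```python
def relative_frequency(count: int, corpus_tokens: int) -> float:
    """Return `count` as a percentage of `corpus_tokens`."""
    if corpus_tokens <= 0:
        raise ValueError("corpus_tokens must be positive")
    return 100.0 * count / corpus_tokens


# Figures copied from Table 1: (occurrences of "I", corpus size in tokens).
table1 = {
    "MAC": (37_127, 3_300_000),
    "BNC/C": (132_397, 4_022_428),
    "COLT": (14_868, 511_834),
    "LTT": (323, 12_406),
    "SCO": (2_693, 119_079),
    "WSC": (27_691, 1_218_957),
    "SEC rec.": (1_343, 55_561),
}

for name, (count, tokens) in table1.items():
    # Two decimal places, matching the precision used in the table.
    print(f"{name}: {relative_frequency(count, tokens):.2f}%")
```

Recomputed this way, the rows agree with the table to two decimal places except BNC/C, which comes out at about 3.29% rather than 3.28%; the printed figure may have been truncated rather than rounded.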
Top 10 collocates MAC & BNC/C
MAC Col.   %      Total    BNC/C Col.   %      Total
IT         21.0   7,627    TO           18.1   23,984
YOU        18.0   6,684    IT           17.5   23,231
AND        15.8   5,860    THE          17.4   23,099
TO         14.1   5,224    AND          17.3   22,895
KNOW       13.5   4,987    YOU          17.1   22,660
THE        13.2   4,882    KNOW         15.4   20,386
THAT       12.3   4,578    A            13.9   18,389
DON'T      11.9   4,399    DON'T        12.9   17,130
THINK      11.4   4,215    THAT         12.5   16,488
A          10.9   4,042    THINK        12.3   16,262
Table 2: 10 most frequent collocates of I in MAC and BNC/C
I’M NOT IN
Clear differences: long clusters.
MAC            Occ.   COLT            Occ.   WSC              Occ.   SEC             Occ.
I DON'T KNOW   1412   I DON'T KNOW    674    I DON'T KNOW     1571   I THINK THE     22
I MEAN I       1238   YEAH I KNOW     324    I DON'T THINK    459    I WANT TO       22
I I I          946    I DON'T THINK   240    YOU KNOW I       357    I DON'T THINK   18
I THINK I      742    NO I DON'T      160    DON'T KNOW I     333    WHEN I WAS      13
I KNOW I       614    YOU KNOW I      160    I DON'T KNOW I   319    I DON'T KNOW    12
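Cluster lists of this kind are commonly produced by counting contiguous word sequences (n-grams) that contain the node word. The following is a minimal sketch, assuming whitespace-tokenized input and case-insensitive matching; the function name `count_clusters` and the sample utterance are hypothetical and are not drawn from any of the corpora above, nor is this necessarily the tool or method the author used.

```python
from collections import Counter


def count_clusters(tokens, node="I", n=3):
    """Count contiguous n-word sequences that contain the node word."""
    node = node.upper()
    upper = [t.upper() for t in tokens]  # normalize case before matching
    counts = Counter()
    for i in range(len(upper) - n + 1):
        gram = upper[i:i + n]
        if node in gram:  # keep only clusters that include the node word
            counts[" ".join(gram)] += 1
    return counts


# Invented sample utterance, for illustration only.
sample = "i don't know i mean i don't know what i think".split()
clusters = count_clusters(sample)

# List the clusters from most to least frequent.
for cluster, occurrences in clusters.most_common():
    print(cluster, occurrences)
```

Setting `n=4` would also pick up longer clusters such as I DON'T KNOW I, which appears in the WSC column.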