Vous êtes sur la page 1sur 4

Wednesday December 12 17:08:02 2018 Page 1

Statistics/Data Analysis

name: <unnamed>
log: D:\bukub3\etlan1.smcl
log type: smcl
opened on: 12 Dec 2018, 17:07:09

1 . clear all

2 . set more off

3 .
4 . //merging//
5 . use D:\bukub3\b3b_km.dta

6 . keep pidlink hhid14 pid14 km02a km04 km08 km12 km14 km15

7 . merge 1:1 pidlink using D:\bukub3\b3a_tk1.dta

Result # of obs.

not matched 203


from master 5 (_merge==1)
from using 198 (_merge==2)

matched 34,266 (_merge==3)

8 . keep if _merge==3
(203 observations deleted)

9 . keep pidlink hhid14 pid14 km02a km04 km08 km12 km14 km15 tk01a tk01

10. merge m:m hhid14 using D:\bukub3\bk_ar1.dta

Result # of obs.

not matched 6,389


from master 0 (_merge==1)
from using 6,389 (_merge==2)

matched 82,993 (_merge==3)

11. keep if _merge==3


(6,389 observations deleted)

12. keep pidlink hhid14 pid14 km02a km04 km08 km12 km14 km15 tk01a tk01 ar07 ar08a ar13

13. describe

Contains data from D:\bukub3\b3b_km.dta


obs: 82,993
vars: 14
size: 9,461,202

storage display value


variable name type format label variable label

pid14 double %12.0g 2014 PID


km02a double %12.0g km02a INTERVIEWER NOTE: CIGARETTE/CIGARS?
km04 double %12.0g km04 Do you still have the habit or have you
totally quit?
km08 double %12.0g In one day about how many
cigars/cigarettes did you consume?
km12 double %12.0g km12 Do you find it difficult to refrain from
smoking in places where it is
forbidden
km14 double %12.0g km14 Do you smoke/chew tobacco more
frequently during the first hours
after waking th
km15 double %12.0g km15 When you are so ill that you are in bed
most of the day, do you smoke/chew
Wednesday December 12 17:08:03 2018 Page 2

tobac
hhid14 str8 %8s 2014 Household ID
pidlink str10 %10s Pidlink
tk01a double %12.0g tk01a During the past week
tk01 double %12.0g tk01 Primary activity during past week
ar13 double %12.0g ar13 Marital status
ar07 double %12.0g Sex
ar08a double %12.0g Age in the last interview

Sorted by:
Note: Dataset has changed since last saved.

14. sum

Variable Obs Mean Std. Dev. Min Max

pid14 82,993 3.434217 2.988384 1 27


km02a 29,832 1.113502 .4627407 1 3
km04 29,832 1.284862 .6989949 1 3
km08 28,906 12.29291 8.677566 0 98
km12 26,787 2.356106 .9392451 1 9

km14 26,787 2.576436 .8226214 1 9


km15 26,787 2.549408 .8409141 1 9
hhid14 0
pidlink 0
tk01a 82,993 1.713036 .9599262 1 8

tk01 82,993 4.391322 13.91916 1 95


ar13 76,826 1.837516 1.033199 1 9
ar07 76,826 2.009111 .999965 1 3
ar08a 60,971 27.37137 27.08917 0 999

15.
16. //recode//
17. recode km02 km04 km12 km14 km15 tk01a ar07 (3 = 0)
(km02a: 1693 changes made)
(km04: 4249 changes made)
(km12: 18143 changes made)
(km14: 21094 changes made)
(km15: 20732 changes made)
(tk01a: 29557 changes made)
(ar07: 38763 changes made)

18. recode tk01 (2 3 4 5 7 95 = 0)


(tk01: 36240 changes made)

19. recode ar13 (2 = 1) (1 3 4 5 6 = 0)


(ar13: 76340 changes made)

20.
21. //cleaning//
22. drop if km02a==.
(53,161 observations deleted)

23. drop if km04==.


(0 observations deleted)

24. drop if km08==.


(926 observations deleted)
Wednesday December 12 17:08:03 2018 Page 3

25. drop if km12==.


(2,681 observations deleted)

26. drop if km12==9


(5 observations deleted)

27. drop if km14==.


(0 observations deleted)

28. drop if km15==.


(0 observations deleted)

29. drop if tk01a==.


(0 observations deleted)

30. drop if tk01==.


(0 observations deleted)

31. drop if ar08a==.


(6,944 observations deleted)

32. drop if ar08a==999


(6 observations deleted)

33. drop if ar08a==998


(5 observations deleted)

34.
35. //logit//
36. logit tk01 km02a km08 km15 ar08a ar13

Iteration 0: log likelihood = -9840.5655


Iteration 1: log likelihood = -9680.788
Iteration 2: log likelihood = -9677.8985
Iteration 3: log likelihood = -9677.8969
Iteration 4: log likelihood = -9677.8969

Logistic regression Number of obs = 19,265


LR chi2(5) = 325.34
Prob > chi2 = 0.0000
Log likelihood = -9677.8969 Pseudo R2 = 0.0165

tk01 Coef. Std. Err. z P>|z| [95% Conf. Interval]

km02a -.2678389 .1013966 -2.64 0.008 -.4665726 -.0691051


km08 .0373327 .0025466 14.66 0.000 .0323416 .0423239
km15 -.0925511 .0446927 -2.07 0.038 -.1801471 -.0049551
ar08a -.0085241 .0011205 -7.61 0.000 -.0107202 -.006328
ar13 .1738157 .0313379 5.55 0.000 .1123946 .2352367
_cons 1.306258 .1076737 12.13 0.000 1.095221 1.517294

37. estat classification

Logistic model for tk01

True
Classified D ~D Total

+ 15265 4000 19265


- 0 0 0

Total 15265 4000 19265


Wednesday December 12 17:08:03 2018 Page 4

Classified + if predicted Pr(D) >= .5


True D defined as tk01 != 0

Sensitivity Pr( +| D) 100.00%


Specificity Pr( -|~D) 0.00%
Positive predictive value Pr( D| +) 79.24%
Negative predictive value Pr(~D| -) .%

False + rate for true ~D Pr( +|~D) 100.00%


False - rate for true D Pr( -| D) 0.00%
False + rate for classified + Pr(~D| +) 20.76%
False - rate for classified - Pr( D| -) .%

Correctly classified 79.24%

38.
39. quietly logit tk01 km02a km08 km15 ar08a ar13

40. margins, dydx (*)

Average marginal effects Number of obs = 19,265


Model VCE : OIM

Expression : Pr(tk01), predict()


dy/dx w.r.t. : km02a km08 km15 ar08a ar13

Delta-method
dy/dx Std. Err. z P>|z| [95% Conf. Interval]

km02a -.0433624 .0164099 -2.64 0.008 -.0755252 -.0111996


km08 .0060441 .0004084 14.80 0.000 .0052436 .0068445
km15 -.0149838 .0072341 -2.07 0.038 -.0291623 -.0008053
ar08a -.00138 .0001808 -7.63 0.000 -.0017344 -.0010257
ar13 .0281403 .0050665 5.55 0.000 .0182102 .0380704

41. margins, dydx (*) atmeans

Conditional marginal effects Number of obs = 19,265


Model VCE : OIM

Expression : Pr(tk01), predict()


dy/dx w.r.t. : km02a km08 km15 ar08a ar13
at : km02a = .9656372 (mean)
km08 = 12.28944 (mean)
km15 = .2216974 (mean)
ar08a = 26.68829 (mean)
ar13 = .6707501 (mean)

Delta-method
dy/dx Std. Err. z P>|z| [95% Conf. Interval]

km02a -.0431448 .0163265 -2.64 0.008 -.0751443 -.0111454


km08 .0060137 .0004025 14.94 0.000 .0052248 .0068026
km15 -.0149086 .0071973 -2.07 0.038 -.029015 -.0008022
ar08a -.0013731 .0001799 -7.63 0.000 -.0017258 -.0010204
ar13 .0279991 .0050332 5.56 0.000 .0181341 .0378641

42. log off


name: <unnamed>
log: D:\bukub3\etlan1.smcl
log type: smcl
paused on: 12 Dec 2018, 17:07:19

Vous aimerez peut-être aussi