Académique Documents
Professionnel Documents
Culture Documents
UNIVERSITY OF MUMBAI
CERTIFICATE
SUNETRA JOGAL
RUPESH KHANEKAR
SAYALI PARKAR
NITIN PAWAR
AMRUTA PRADHAN
ADITYA SAWANT
RAHUL SINGH
Have satisfactory completed the Group Project entitled ARE SUBWAYS
(UNDERGROUND PATHWAYS) ACROSS MUMBAI & THANE SAFE FOR WOMEN?
towards partial fulfillment of the Post Graduate Diploma in Applied Statistics with
Software as laid by University of Mumbai.
1. ACKNOWLEDGEMENT
2. INTRODUCTION
3. OBJECTIVES
4. METHODOLOGY
5. QUESTIONNAIRE
6. TECHNIQUESUSED
7. ANLYSIS
i) GRAPHSSHOWINGCOMMUTINGPREFER,GENDER,AGE
HARRESMENTFACED
ii) VENNDIAGRAM
iii) PARETOCHARTS
iv) CHISQUARETESTOFINDEPENDANCE
v) NONPARAMETRICTEST
vi) MULTIPLELOGISTICREGRESSION
8. OVERALLCONCLUSIONANDRECOMMENDATIONS
9. BIBLIOGRAPHY
1) ACKNOWLEDGEMENTS
We would also like to thank all the professors and the entire
support staff of the Department of Statistics for their constant
help, support and encouragement.
Mumbaiisthecitywhichneversleeps.Itisthecitywhichisoccupiedbydiverse
setsofpeoplewithdiverseneeds.Thequalityofthecityisjudgedbywhatit
offers to the people living here. For instance, the right to live, right to move
aroundwithdignityandsafety.Millionsofpeoplehereusepublictransporton
adailybasis.Morethan80%ofpeopleinMumbaitravelthroughtrains.
There are many subways (or underground tunnels) that have been located in
thevicinityofrailwaystations.Thesesubwaysareusedbyalmost75%ofthe
people travelling through trains and by many other people who find it less
distant.
The first task of the Project was to explore the issue that whether these
subways are safe enough for commuters. The Project looked for any
informationthatcouldbefoundonwomen'ssafety,howitrelatedtosubways
and what had been done by public bodies such as councils and police, to
improve it.Women's safety involves strategies, practices and policies which
aim to reduce gender based violence or violence against women, including
womensfearofcrime.Womenandgirlsarefrequentlysubjecttoviolenceand
abuse from physical and verbal harassment to assault and rape on city
streets, public transportation or in their own neighborhoods. It is rarely
acknowledged but can have serious long term effect such as depression,
anxietyandalostsenseofsafety.
Many women have lived with feeling unsafe while commuting through
subways at certain times of the day. Commuters have become absolutely
discontentedwiththesesubways.Gettingthroughthesubwayswithoutvisual
or physical harassment during peak hours or when it is crowded is more of a
challengetogirlsandwomencommuters.
Over80%ofwomenexperiencegenderbasedharassmentfromunknownmen
in public including whistling, sexuallyexplicit comments, staring, etc. Such
dailyoccurrenceslimittherightsandfreedomsofwomenasequalcitizensto
enjoytheirneighborhoodsandcities.
METHODOLOGY
The following are the steps involved in the research process:
Demographics
Preferences and comparisons
Awareness
3) PILOT SURVEY
4) DATA COLLECTION
A sample of 667 was collected from across Mumbai and
Thane. For analysing, data was categorised area wise, into
eastern suburbs, western suburbs and thane city.
7) FINAL REPORT
QUESTIONNAIRE
ARE SUBWAYS (UNDERGROUND PATHWAYS) ACROSS MUMBAI & THANE SAFE FOR WOMEN?
The following questionnaire is designed to study the problems faced by commuters using subways
AGE GROUP
Occupation :- Business [ ]
Service [ ]
Housewife [ ]
Student [ ]
Unemployed [ ]
Retired [ ]
Others __________________
7) What will be your reaction when you see girls/women being harassed?
a) Oppose the culprit [ ]
b) Call the police [ ]
c) Support the victim [ ]
d) Get public support [ ]
e) Prefer not to get involved [ ]
a) Lighting [ ]
b) CCTV Cameras [ ]
c) Security [ ]
d) Public Phones [ ]
e) Dustbins [ ]
12) Who do you think can play a vital role in addressing safety to the common masses?
a) Police [ ]
b) Political Parties [ ]
c) NGOs [ ]
d) Yuvak Sangh [ ]
e) Street Plays [ ]
13) What kind of changes in the society would make you feel safer in the near future?
Suggest your views. (Optional)
__________________________________________________________________________________________________
__________________________________________________________________________________________________
__________________________________________________________________________________________________
__________________________________________________________________
TABULATION AND ANLYSING
GENDER
MALE 119 17.84%
CONCLUSION: - Our sample consists of 119, i.e 17.84% of males and 548 i.e 82.16% of
females.
2) GRAPH OF DISTRIBUTION OF AGE-GROUP
AGE GROUP
AGE TOTAL Frequency
16-20 116 17%
21-30 314 47%
31-40 150 22%
41-60 87 13%
TOTAL 667 100%
CONCLUSION:- Most of our sample i.e 47.08% of crowed lies in the age group 21.30
3) DISTRIBUTION OF OCCUPATION
OCCUPTION
OCCUPTION TOTAL FREQUENCY
BUSINESS 69 10.38%
UNEMPLOYED 11 1.65%
RETIRED' 5 0.75%
SERVICE 370 55.64%
STUDENT 160 24.06%
HOUSEWIFE 50 7.52%
TOTAL 665 100.00%
Income
40.00%
30.00%
20.00%
10.00%
0.00%
BELOW1LAC
1TO3LAC
3TO5LAC
ABOVE5LAC
FREQUENCY
Conclusion:37.22%ofoursampleearnsintherangeof1lakh3lakhs.
5) GRAPHICL REPRESENTATION OF MOBILE OPERATOR
CONCLUSION :- Almost 70% of our sample prefers SUBWAY over OVERHEAD BRIDGES.
7) CommutingmeansMale
MALE TOTAL FREQUENCY
SUBWAY 84 68.29%
OVERHEADBRIDGE 39 31.71%
Conclusion:68.29%ofMalecrowduseSubway.
8) CommutingmeansFemale
FEMALE TOTAL FREQUENCY
SUBWAY 392 70.00%
ReasonOfCommutingSubway
SAFEENOUGH
LESSDISTANCE
COMP.PREFERISCLOSEBY
LESSCROWDED
0.00%
5.00%
10.00%
15.00%
20.00%
25.00%
30.00%
35.00%
10) Reasons for Preferring subways (Male)
MALE
RESONS COUNT FREQUENCY
LESS CROWDED 39 34.82%
COMP.PREFER IS CLOSE BY 29 25.89%
LESS DISTANCE 27 24.11%
SAFE ENOUGH 17 15.18%
TOTAL 112 100.00%
11) ReasonforpreferringSubwayFemale
FEMALE
RESONS COUNT FREQUENCY
LESS CROWDED 128 24.15%
COMP.PREFER IS CLOSE BY 76 14.34%
LESS DISTANCE 169 31.89%
SAFE ENOUGH 157 29.62%
TOTAL 530 100.00%
ReasonsOfCommutingSubway
(FEMALE)
LESSCROWDED
SAFEENOUGH 24%
30%
COMP.PREFERIS
CLOSEBY
14%
LESSDISTANCE
32%
Conclusion:31.89%ofFemalespreferSubwaysbecausetheyareless
crowded
12) Graphical representation of frequency of using subways
CONCLUSION:- 66.27% of our MALE sample & 62.051% of our Female sample use Subways
Every day. 21.69% & 22.051% respectively MALE & FEMALE sample use subways on
weekdays. 12.05% of Male sample & 15.897% of Female sample use subways on weekends.
13) HARRESMENT FACED BY FEMALES
FEMALE
TYPES OF HARRESMENT FACED COUNT FREQUENCY
VERBAL 245 33.06%
PHYSICAL 138 18.62%
VISUAL 202 27.26%
NEVER FACED 156 21.05%
TOTAL 741 100.00%
Conclusion:42.88%ofFemalesapproachtopeoplenearbytoseekhelp
whenharassed
15 )SeekingHelpFemale
FEMALE
RECEIVED HELP COUNT FREQUENCY
YES 306 63.62%
NO 175 36.38%
TOTAL 481 100.00%
Conclusion:63.62%ofFemaleshavereceivedhelpwhenaskedfor.
16)HelpMALE
MALE
EVER HELPED COUNT FREQUENCY
YES 66 62%
NO 41 38%
TOTAL 107 100.00%
EverHelped(Male's)
YES NO
38%
62%
Conclusion:62%ofMalehavehelpedthepeoplewhowereorarebeing
harassed.
17)HelpFEMALE
FEMALE
EVER HELPED COUNT FREQUENCY
YES 337 63.58%
NO 193 36.42%
TOTAL 530 100.00%
EverHelped(Female's)
YES NO
36%
64%
Conclusion:63.58%ofFemaleshavehelpedthepeoplewhowereorarebeing
harassed.
18)ReactionwhenharassedMALE.
MALE
REACTION COUNT FREQUENCY
OPPOSE THE CULPRIT 35 21.47%
CALL THE POLICE 46 28.22%
SUPPORT THE VICTIM 37 22.70%
GET PUBLIC SUPPORT 29 17.79%
PREFER NOT TO GET INVOLVED 16 9.82%
Conclusion:28.22%ofMalehaveapproachedPoliceForcetoseekhelp
from,22.07%ofthemhavesupportedthevictimwhereas21.47%ofthem
haveopposedthevictim
19)ReactionwhenharassedFEMALE.
FEMALE
REACTION COUNT FREQUENCY
OPPOSE THE CULPRIT 115 14.268%
CALL THE POLICE 197 24.442%
SUPPORT THE VICTIM 276 34.243%
GET PUBLIC SUPPORT 177 21.960%
PREFER NOT TO GET INVOLVED 41 5.087%
TOTAL 806 100.000%
Conclusion: 34.243% of Females have supported the victim when harassed
whereas 24.442% of them seek help from Police Force
20)ServiceProviderMALE.
MALE
NETWORK COUNT FREQUENCY
YES 48 9.11%
NO 44 8.35%
DEPENDS 29 5.50%
TOTAL 121 22.96%
Conclusion:39.67%ofMalesreceivegoodnetworkaccesswhileinSubways
36.36%ofthemsaytheydonotgetnetworkwhereas23.97%malessay
thattheymayormaynotgetgoodnetworkwhileinSubway.
21)ServiceProviderFEMALE.
FEMALE
NETWORK COUNT FREQUENCY
YES 185 33.82%
NO 222 40.59%
DEPENDS 140 25.59%
TOTAL 547 100.00%
Conclusion:33.82%ofFemalesreceivegoodnetworkaccesswhileinSubways
40.59%saythattheydonotgetnetworkwhereas25.59%ofthemsaythey
mayormaynotgetgoodnetworkwhileinSubway.
22)NeedofSafetyAlarm.
SafetyAlarm
YES NO
6%
94%
Conclusion:94%ofoursamplefeelsthatthereisaneedofSafetyAlarmin
SubwayssoastoensureSecurity
23)NeedofCanteen.
NEED OF CANTEEN
CANTEEN TOTAL FREQUENCY
YES 419 63.68%
NO 239 36.32%
TOTAL 658 100.00%
Canteen
YES NO
36%
64%
Conclusion:64%ofoursamplefeelhatthereisaneedofCanteenin
Subwayssoastokeepitcrowdedallthetime.
24) GRAPHICAL REPRESENTATION OF ADDRESSING SAFTY
CONCLUSION :- 31.88%ofoursamplefeelsthatNGOsplayavitalrolein
addressingsafety.
4) VENN DIAGRAM
BOTH
2.70%
OVERHEADBRIDGE
USER
SUBWAY USER
28.49%
68.81%
CONCLUSION:
We can see that more than half the population i.e. 68.81% prefer commuting through
subways while a relatively smaller section of 28.49% prefers overhead bridges 2.70% are
comfortable with both i.e. subways as well as overhead bridges
3) Pareto Analysis
Introduction:
A Pareto chart is used to graphically summarize and display the relative importance of the
differences between groups of data.The Pareto Chart is a very simple but effective tool for
prioritizing problem causes.
Pareto Analysis aims at highlighting those elements which demand attention and should be
examined first. It conveys that, tackle the Vital Few and ignore the Trivial Many.
It is essentially a special form of a vertical bar chart that puts items in order (from the highest to the
lowest) relative to some measurable effect of interest such as frequency, cost or time. The Pareto
principle describes a phenomenon in which 80 percent of variation observed can be explained by a
mere 20 percent of the causes of that variation.
The Pareto Chart also include a line graph that indicates the cumulative percentage of occurrences
at each bar of the bar graph. This line graph, referred to as the 'cumulative percentage line', is used
to determine which of the bars belong to the 'vital few' and which ones are relegated to the 'trivial
many.'
The Pareto curve makes it clear as to where effort must be concentrated so as to give maximum
effect.
1st Use : Now , to find the major kind of harassment faced by female commuters while
commuting through subway, we applied Pareto analysis . We calculated the frequency of all
the commuters who have faced harassment while commuting through subway.
FEMALE
500
80
400
60
Percent
COUNT
300
40
200
20
100
0 0
TYPES OF HARRESMENT FACED VERBAL VISUAL PHYSICAL
COUNT 245 202 138
Percent 41.9 34.5 23.6
Cum % 41.9 76.4 100.0
CONCLUSION :-
The first two factors contribute nearly 80% of the harassment faced.
The first major type of harassment that is faced by women is verbal which includes whistling,
commenting etc. Women feel that they face such kind of harassment very often.
The second major type of harassment faced by women is visual which includes starring,
gesticulating etc.
2nd Use of Pareto :-
To find the most trustable authorities according to sample collected from female
commuters.
We calculated the frequency of the female commuters who have approached for help from
authorities while facing harassment.
FEMALE
VENDORS 96 18.22%
300 60
200 40
100 20
0 0
APPROCHED Y S S
RB CE
E END DOR OR
SS RI N F
PA /F VE CE
LI
ES PO
IV
LA
T AY
W
RE IL
RA
COUNT 226 116 96 89
Percent 42.9 22.0 18.2 16.9
Cum % 42.9 64.9 83.1 100.0
CONCLUSION :-
The first three factors contribute to more than 80% of the sample who approached.
The first major help is from Passerby. When women are alone and they face harassment, they seek
help from people passing by.
The second major help is from Relatives & friends. Women seldom call up their friends and
relatives when they face harassment.
The Third major help is from Vendors. Many times women feel that the vendors in the subway can
help them in some or the other way when they are being harassed.
4) CHI-SQUARE TEST OF INDEPENDENCE
The Chi Square Test of Independence compares frequencies (counts) of
nominalorordinalleveldatafortwosamplesacrosstwoormoresubgroups
displayed in a crosstabulation table. This test is more common and more
flexiblethanztestsofproportions.
Assumptions:
1.Independentrandomsampling.
2.Nominal/Ordinalleveldata.
3.Noemptycells
i)Genderv/sPreferredmodeofcommuting:
ATTRIBUTES:
ATTRIBUTE1:GENDER
ATTRIBUTE2:PREFFREDMODEOFCOMMUTING
HYPOTHESIS:
NULL(Ho):Thereisanassociationbetweenattribute1and2.
ALTERNATE(H1):Thereisnoassociationbetweenattribute1and2.
TheTestStatistic:
2=((oijeij)2/eij)
i,j=1,2.
WhereOi=anobservedfrequencyofithgroup.
Ei=anexpected(theoretical)frequencyforithgroup.
K=totalnumberofobservations.
Thechisquarestatisticcanthenbeusedtocalculateapvaluebycomparing
thevalueofthestatistictochisquaredistribution.Thenumberofdegreesof
freedomisequalto(c1)*(r1)where,candrarethelevelsoftwoattributes.
SettheRejectionCriteria:
Achisquaredprobabilityoflessthanorequalto0.05(orthechisquared
statisticbeingatorlargerthanthe0.05criticalpoint)leadstorejectionof
thenullhypothesisH0infavourofH1.
CHECKINGASSOCIATIONBETWEEN:
GENDERANDPREFERREDMODEOFCOMMUTING
CROSSTABULATIONOFGENDERANDPREFRRED
MODEOFCOMMUTING
GENDER SUBWAYS OVERHEADBRIDGES
ChiSquareTest:SUBWAYS,OVERHEADBRIDGES
OVERHEAD
SUBWAYSBRIDGES Total
1392 168560
390.28169.72
0.0080.017
28439123
85.7237.28
0.0350.080
Total476207683
ChiSq=0.139,DF=1,PValue=0.709
Thepvalue=0.709>0.05.i.eitislarge
Hencewedonotrejectthenullhypothesis.
Thusgenderandpreferredmodeofcommutingthroughsubwayare
independent.
CONCLUSION:
Thusweseefromchisquareteststhatpreferredmodeofcommutingis
independentofgender.
ii)ReasonsofCommutingthroughsubwaysv/sGender:
ATTRIBUTES:
Attribute1:CommutingReasons
Attribute2:Gender
HYPOTHESIS:
NULL(Ho):Thereisanassociationbetweenattribute1and2.
ALTERNATE(H1):Thereisnoassociationbetweenattribute1and2.
TheTestStatistic:
2=((oijeij)2/eij)
i,j=1,2.
WhereOi=anobservedfrequencyofithgroup.
Ei=anexpected(theoretical)frequencyforithgroup.
K=totalnumberofobservations.
Thechisquarestatisticcanthenbeusedtocalculateapvaluebycomparing
thevalueofthestatistictochisquaredistribution.Thenumberofdegreesof
freedomisequalto(c1)*(r1)where,candrarethelevelsoftwoattributes.
SettheRejectionCriteria:
Achisquaredprobabilityoflessthanorequalto0.05(orthechisquared
statisticbeingatorlargerthanthe0.05criticalpoint)leadstorejectionof
thenullhypothesisH0infavourofH1.
CHECKINGASSOCIATIONBETWEEN:
GENDERANDREASONSOFCOMMUTINGTHROUGHSUBWAYS
CROSSTABULATIONOFGENDERANDREASONS
OFCOMMUTINGTHROUGHSUBWAYS:
REASONFORPREFERRINGSUBWAY
RESONS Total
LESSCROWDED 167
COMP.PREFERISCLOSEBY 105
LESSDISTANCE 196
SAFEENOUGH 174
TOTAL 642
ChiSquareTest:GENDER,REASONS
malefemaleTotal
lesscrowded39128167
29.13137.87
3.3410.706
Comp.prefer
iscloseby2976105
18.3286.68
6.2291.316
Lessdistance27169196
34.19161.81
1.5130.320
Safeenough17157174
30.36143.64
5.8761.242
Total112530642
ChiSq=20.543,DF=3,PValue=0.000
Thepvalue=0.00<0.05
Hence,werejectthenullhypothesis.
Thusgenderandreasonsofcommutingthroughsubwayaredependent.
CONCLUSION:
Thusweseefromchisquareteststhatmodeofcommutingthroughsubway
dependsgender.
iii)ReasonsofCommutingthroughoverheadbridgesv/sGender:
ATTRIBUTES:
Attribute1:Reasonsofcommutingthroughoverheadbridges
Attribute2:Gender
HYPOTHESIS:
NULL(Ho):Thereisanassociationbetweenattribute1and2.
ALTERNATE(H1):Thereisnoassociationbetweenattribute1and2.
TheTestStatistic:
2=((oijeij)2/eij)
i,j=1,2.
WhereOi=anobservedfrequencyofithgroup.
Ei=anexpected(theoretical)frequencyforithgroup.
K=totalnumberofobservations.
Thechisquarestatisticcanthenbeusedtocalculateapvaluebycomparing
thevalueofthestatistictochisquaredistribution.Thenumberofdegreesof
freedomisequalto(c1)*(r1)where,candrarethelevelsoftwoattributes.
SettheRejectionCriteria:
Achisquaredprobabilityoflessthanorequalto0.05(orthechisquared
statisticbeingatorlargerthanthe0.05criticalpoint)leadstorejectionof
thenullhypothesisH0infavourofH1.
CHECKINGASSOCIATIONBETWEEN:
GENDERANDREASONSOFCOMMUTINGTHROUGHOVERHEADBRIDGES.
REASONSFORCOMMUTINGSUBWAYS
CROSSTABULATIONOFGENDERANDREASONS
OFCOMMUTINGTHROUGHOVERHEADBRIDGES:
malefemaleTotal
safeenough30120150
29.05120.95
0.0310.007
Tocheckthe
trainstatus1984103
19.9583.05
0.0450.011
Total49204253
ChiSq=0.094,DF=1,PValue=0.759
Thepvalue=0.759>0.05.i.eitislarge
Hencewedonotrejectthenullhypothesis.
Thusgenderandreasonofcommutingthroughoverheadbridgeare
independent.
CONCLUSION:
Thusweseefromchisquareteststhatreasonofcommutingthrough
overheadbridgeisindependentofgender.
Kruskal-Wallis Test
M ean 131.44
S tDev 74.56
V ariance 5559.26
S kew ness 1.47560
Kurtosis 1.16658
N 25
M inimum 46.00
1st Q uartile 80.00
M edian 110.00
3rd Q uartile 150.50
50 100 150 200 250 300 M aximum 302.00
95% C onfidence Interv al for M ean
100.66 162.22
95% C onfidence Interv al for M edian
83.40 138.04
95% C onfidence Interv al for S tD ev
9 5 % C onfidence Inter vals
58.22 103.72
Mean
Median
AsthePvalueislessthan0.05,hencethenormalitytestdoesnothold,thereforewecannotproceed
furtherforparametrictest.HencewewilluseNonParametricTesti.eFriedmanTest
FriedmanTest:
Friedman test is a nonparametric analysis of a randomized block experiment, and thus provides an
alternativetotheTwowayanalysisofvariance.Thehypothesesare:
H0:alltreatmenteffectsarezeroversusH1:notalltreatmenteffectsarezero
Randomizedblockexperimentsareageneralizationofpairedexperiments,andtheFriedmantestisa
generalization of the paired sign test. Additivity (fit is sum of treatment and block effect) is not
requiredforthetest,butisrequiredfortheestimateofthetreatmenteffects.
Output
Minitabprintstheteststatistic,whichhasanapproximatelychisquaredistribution,andtheassociated
degreesoffreedom(numberoftreatmentsminusone).Iftherearetieswithinoneormoreblocks,the
averagerankisused,andateststatisticcorrectedfortiesisalsoprinted.Iftherearemanyties,the
uncorrectedteststatisticisconservative;thecorrectedversionisusuallycloser,butmaybeeither
conservativeorliberal.Minitabdisplaysanestimatedmedianforeachtreatmentlevel.Theestimated
medianisthegrandmedianplusthetreatmenteffect.
Hypothesis:
H0:Facilityeffectsaredifferentfrom0.
against
H1:Facilityeffectsarenotdifferentfrom0.
S = 4.96 DF = 4 P = 0.291
Est Sum of
Facility N Median Ranks
CCTv 5 100.0 17.0
Dustbin 5 85.0 13.0
Lighting 5 277.6 21.0
Public Phones 5 84.8 12.0
Security 5 84.6 12.0
Result:
Theteststatistic,S,hasapvalueof0.291,unadjustedforties,Foralevels0.05or0.10,thereis
insufficientevidencetorejectH0becausethepvalueisgreaterthanthealphalevel.ThereforeWe
concludethatthedatadonotsupportthehypothesisthatanyofthefacilityeffectsaredifferentfrom
zero.
BinaryLogisticRegression:
TheBinarylogisticregressionmodelis,
(X)=
Where,
(X):conditionalprobabilitythattheoutcomeispresent.
i.e.Pr(Y=1|X)
Y:responsevariable,
X:vectorofindependentvariables.
Weusethetransformationcalledlogit,whichforcesthepredictionequation
topredictvaluesbetween0and1.
Logittransformationofabovemodel:
g(x)=0+jXj
where,
g(x):logittransformationoftheprobabilityoftheevent.
0:interceptoftheregressionline
j:slopeoftheithregressionline
I. ProceduretocarryoutBinaryRegressionAnalysis:
i. Globaltesting:
ii. Stepwiseselectionprocedure:
Itbeginswithnovariablesintheequationandsubsequentlyincreases
thenumberofvariablesinthemodeluntilthemodelissatisfactory.
The order of the variables to be included in the model is determined
basedonpartialcorrelationcoefficientwhichmeasurestheimportance
ofthevariablesnotyetintheequation.Itispossiblethatonevariable
whichwasincludedinthemodelatearlierstageisremovedduetoits
relationshipwithothervariablesinthemodel.
Ateachstepanappropriatestatisticaltestforeachvariablecurrentlyin
the model will be performed to determine whether the variable has
significantcontributiontothemodel.
Anegativeresultmaysuggesttheremovalofthevariable.
When no variables in the current model can be removed and no new
variables are suggested to be added to the model, the selecting
procedurestops.
ANALYSIS
Dependentvariable:
Thedependentvariableisbinary
Y=1;Yes(Femalecommuterswhohavefacedharassment)
Y=0;No(Femalecommuterswhohavenotfacedharassment)
Independentvariables:
X1:Maritalstatus
X2:Occupation
X3:MobileOperator
X4:ReceivedHelpfromanyone
X5:Haveyoueverhelpanyone
X6:NetworkConnectivity
X7:NecessityofCanteen
X8:SafetyAlarm
X9:Lighting
X10:CCTV
X11:Security
X12:PublicPhones
X13:Dustbins
SASprogramforbinarylogisticregression:
odshtml;
proclogisticdata=surveydescending;
classmarital_status(ref="1")occupation(ref="1")mobile_operator(ref="1")
received_help_from_anyone(ref="1")ever_help(ref="1")
network_connectivity(ref="1")necessity_canteen(ref="1")
safety_alarm(ref=1)lighting(ref=1)cctv(ref=1)security(ref=1)public_
phones(ref=1)dustbins(ref=1)
/param=ref;
modelreg=marital_statusoccupationmobile_operator
received_help_from_anyoneever_helpnetwork_connectivity
necessity_canteensafety_alarmlightingcctvsecuritypublic_phones
dustbins/ctable
selection=stepwise
lackfitdetailssle=0.1sls=0.1;
outputout=survey1p=prreschi=resi1;
run;
/*codeforclassifyingthepredictedprobabilities*/
datapred;
setsurvey1;
ifpr>0.05thenpredicted="1";
elseifpr<0.05thenpredicted="0";
run;
/*codefortheclassificationtable*/
procfreqdata=pred;
tableHarresment*predicted/chisq;
run;
Logisticregressionmodel:
g(x)=0+jXj(j=1,2..)
where,
g(x):logittransformationoftheprobabilityoftheevent.
0:interceptoftheregressionline
j:slopeoftheithregressionline(j=1,2..13)
Designvariables:
Theindependentvariablesareconvertedtodesignvariable.
2 1 0 0 0 0
3 0 1 0 0 0
4 0 0 1 0 0
5 0 0 0 1 0
6 0 0 0 0 1
Mobile_Operator 1 0 0 0 0 0 0 0
Class Level Information
Class Value Design Variables
2 1 0 0 0 0 0 0
3 0 1 0 0 0 0 0
4 0 0 1 0 0 0 0
5 0 0 0 1 0 0 0
6 0 0 0 0 1 0 0
7 0 0 0 0 0 1 0
8 0 0 0 0 0 0 1
Network_connectivity 1 0 0
2 1 0
3 0 1
0 1 0 0 0 0
Lighting 1 0 0 0 0 0
2 0 1 0 0 0
3 0 0 1 0 0
4 0 0 0 1 0
5 0 0 0 0 1
Cctv 1 0 0 0 0
2 1 0 0 0
3 0 1 0 0
4 0 0 1 0
5 0 0 0 1
Class Level Information
Class Value Design Variables
Security 1 0 0 0 0
2 1 0 0 0
3 0 1 0 0
4 0 0 1 0
5 0 0 0 1
public_phones 0 1 0 0 0 0
1 0 0 0 0 0
2 0 1 0 0 0
3 0 0 1 0 0
4 0 0 0 1 0
5 0 0 0 0 1
Dustbins 1 0 0 0 0
2 1 0 0 0
3 0 1 0 0
4 0 0 1 0
5 0 0 0 1
Hypothesis:
For the stepwise selection procedure, we kept sle and sls criteria
at 10.%. The summary of stepwise selection procedure is as
follows:
Summary of Stepwise Selection
H1: not Ho
14.9843 21 0.8237
g(x)=0+3iX3i+4iX4i+6iX6i+7i X7i+11iX11i+12i
X12i+13iX13i
where,
X3: Mobile Operator
X4: Received Help from anyone
X6: Network Connectivity
X7: Necessity of Canteen
X11: Security
X13: Dustbin
H0: i = 0 ; i = 3,4,6,7,11,12,13
H1: not H0
Conclusion :
Independent
variable X
X=1 X=0
Total 1 1
(0)/1-(0)
H1 : Not H0
Where,
ok: Total number of respondents in the kth group for whom Y=1.
Reject H0 for large values of Chi Square i.e. if p value < 0.05
4.0862 8 0.8493
CONCLUSION:
X11: Security
X13: Dustbins
v) Classification Table:-
0 1
1 70 386 456
1) Mobile Operator
2) Received Help from anyone
3) Network Connectivity
4) Necessity of Canteen
5) Security
6) Dustbins
Conclusions:
3. When women approach to some units for seeking help, NGOs are
the ones who address safety and make an attempt to take some
action. Also police force, political parties are some others who
address safety.
Recommendations:
Subways must be kept clean for avoiding the diseases that can
spread.
8.BIBLIOGRAPHY
BOOKS REFERRED TO :-
WEBSITES REFERRED TO :-
GOOGLE.COM
WIKIPEDIA.COM
SOFTWARES USED :-
1) MINITAB
3) Microsoft Excel
PROJECT BY:
DEVEN BHOSLE
SUNETRA JOGAL
RUPESH KHANEKAR
SAYALI PARKAR
NITIN PAWAR
AMRUTA PRDHAN
ADITYA SAWANT
RAHUL SINGH