Académique Documents
Professionnel Documents
Culture Documents
The first treatment that will be discussed here is the highdose treatment. Here's the
summary of the highdose treatment:
with(subset(geneexpression,Treatment=="Highdose"),summary(Luminosity))
##
##
with(subset(geneexpression,Treatment=="Highdose"),boxplot(Luminosity))
The spread of the highdose treatment is very small compared to the other treatments (notice
the numbers on the y-axis) and there are no outliers.
The second treatment is Control 1.
with(subset(geneexpression,Treatment=="Control 1"),boxplot(Luminosity))
with(subset(geneexpression,Treatment=="Control 1"),summary(Luminosity))
##
##
The spread of the Control 1 treatment is large compared to the highdose treatment but
relatively small compared to the Control 3 treatment. There are 3 outliers.
The third treatment is Control 2.
with(subset(geneexpression,Treatment=="Control 2"),boxplot(Luminosity))
with(subset(geneexpression,Treatment=="Control 2"),summary(Luminosity))
##
##
The spread of the Control 2 treatment is a little bit larger than that of Control 2 but still
relatively small when compared to that of Control 3. There is 1 outlier.
The last treatment is Control 3.
with(subset(geneexpression,Treatment=="Control 3"),boxplot(Luminosity))
with(subset(geneexpression,Treatment=="Control 3"),summary(Luminosity))
##
##
The spread of the Control 3 treatment is that largest of all but it has no outliers.
Below are the same set of data but summarized and graphed with log(Luminosity) instead
of Luminosity.
with(subset(geneexpression,Treatment=="Highdose"),boxplot(log(Luminosity)))
with(subset(geneexpression,Treatment=="Highdose"),summary(log(Luminosity)))
##
##
with(subset(geneexpression,Treatment=="Control 1"),boxplot(log(Luminosity)))
with(subset(geneexpression,Treatment=="Control 1"),summary(log(Luminosity)))
##
##
with(subset(geneexpression,Treatment=="Control 2"),boxplot(log(Luminosity)))
with(subset(geneexpression,Treatment=="Control 2"),summary(log(Luminosity)))
##
##
with(subset(geneexpression,Treatment=="Control 3"),boxplot(log(Luminosity)))
with(subset(geneexpression,Treatment=="Control 3"),summary(log(Luminosity)))
##
##
with(wine, summary(sugar))
##
##
a)
The distribution of sugar is slightly skewed to the left with a couple of outlers above
3. These are separated from the main distribution and not considered when describing
the left-skewed distribution.
b)
c)
d) Removing the outliers, the distribution of the sugar is still slightly skewed to the left.
The spread is smaller without the outliers and the mean/median is lower too.
e) The mean is bigger than the median in part a because part a takes into account the
outliers too, therefore affecting the mean.
2-13
means=do(1000)*mean(sugar,data=resample(wine))
hist(means$result)
a)
b)
95% of the time the mean of a sample of 40 is between 2.32 and 3.39.
Because 2.54 is between the confidence range using the bootstraping method.