Vous êtes sur la page 1sur 17

Abstract In this report you will find analysis based on data I collected.

You will find questions answered to if there is a relationship between a persons height and weight, and ring or show size. The most interesting question I will be analyzing, based on my data, is the big question everyone is always arguing about do males earn greater gross income then females? Also you will find if there are relationships between peoples political party affiliations and how they feel about certain topics. For example, if they believe President Obama will be re-elected for a second term, or if they are in favor of the death penalty. All the graphs and tables that you will see were all done in TC stats.

Introduction As a research scientist I was hired by the Apple Company to conduct a research project. A selected group of research scientists were chosen by this company because of our knowledge is the statistical field. The scientists collected data and came together to compare it. I was supposed to get ten subjects and have them take a twenty-three question survey. This survey consisted of basic question like their gender, age, and ethnicity. I will be focusing on different types of categories between female and males. But I will not be as interested in the different categories in ring and shoe size, and height and weight. Some of the graphs I will provide will be both male and females data combined. The annual gross income is the one I will be separating and looking specifically at what males and females earn. I believe that males will be the ones that earn a little bit more than females.

Methods After everyone had turned in their data, form their surveys, I took a simple random sample of 36 out of the whole entire data of 2627. I took my sample on TC Stats (iPad software created by math professors at Taft College). I then cleaned up my data and changed any values that were typed in wrong. For example, for height it needed to be typed in inches and not feet. I changed a sample that said 6ft. to 72 inches. Also for incomes of my subjects I got rid of dollar signs or commas they put. I only wanted to work with information that will lead me to a right conclusion. Out of my sample of 36, 19 are females and 17 are males. I will be looking if there is a difference in gross income based on gender. I will also be seeing if there is a relationship between combination of a persons height, weight, and ring or shoe size. For all my tests I will be using a Confidence interval of 95% and my alpha will be 0.05.

Results The below box and whisker plot is for males and females. By looking at it you can see the median falls at 67. Also the minimum is at 52 and the maximum height is at 76.

Figure 1

Table 1: 5 number summary of height. The box and whisker plot for height below shows (figure2) the weight of males and females in my sample. The median was 160 the minimum is 98 and the maximum was 290.

Figure 2

Table 2: Table 2 shows the five number summary for weight.

Now I will be looking to see if there is a relationship between the combination of a persons height and weight. I will be looking for a relationship between two variables, so I will produce a scatter plot to see if my data has a linear trend to it.

The scatter plot above is showing us that my data for height and weight is reasonably linear so I will be using Persons Correlation. I used height as my x-axis variable and weight as my y-axis variable. My TC stats program gave me a Pearsons r of 0.6736 and a p-value of 3.097E-06, which I round of to 0.0000. With this p-value I will reject my null hypotheses, thus leading to suggest that there is enough evidence to suggest a relationship between a person height and weight. My 95% Confidence Interval values are (0.4431, 0.8205). This tells me that I am 95% confidant that the true correlation falls between 0.4431 and 0.8205.

Box and Whicker Plot for Ring and Shoe size for male and female.

Figure 3

As I looked for a relationship in height and weight, I will now do the same and see if there is a relationship between a persons ring and shoe size. Fist I will produce a scatter plot of shoe and ring size. I will be using shoe size as my x-axis variable and ring size as my y-axis.

Judging by my scatter plot there seems to be some curvature going on in it. So instead of using Pearsons Correlation I will be taking a nonparametric approach and uses Spearmans Correlation, which is computed in the ranks of the data. I calculated a Spearmans r of 0.6493 and I got a p-value of 0.0002, so I will reject my null hypothesis. My conclusion is that there is enough evidence to suggest a relationship between a persons shoe size and ring size. My 95% confidence intervals for Spearmans are (0.3500, 0.8283).

The Histogram above shows water consumed by both males and females according to what was reported.

The first table below shows the Summary Statistics of my sample of males income that was reported in my sample of 17 males. By looking at the table we can see the median average earnings for the males was 30k. Also the mean of my 17 male subjects was 56270.

Table 3: Summary statistics for male annual gross income.

The box and whisker plot below is the annual gross income for males. Looking at it we can see that my data is skewed right. By looking at the table above of the summary statistics and comparing it to the box and whisker plot you can see that the mean fall at 5627, the minimum is 0 because in my data it was reported that some have an income of 0. The maximum income I had in my data was 263,000. Figure 4

The table below is the Summary Statistics for annual income for females. I had 19 females in my data but as you can see below my value for n is 17 but the reason for that is because two of my female annual income slots were left blank which is why it only shows a sample of 17. Table 4: Annual gross income for females.

The box and whisker plot shows what the summary statistics has shown us but puts it in a visual way to see. The median falls at 21,492. My max earning for female in my data was 80,000. Which if you compare to the males maximum earning there is a dramatic change. Figure 5

What I did above was separate males and females gross incomes. Now what I will be looking for is if there is a difference in gross income based in gender. I will be comparing two groups (males and females) so I will want to use a 2-Sample T-test. So first off I will be looking at 2 normal plots, one plot for males and the other for females. Both normal plots will have to be normal so I can use a 2-sample t-test. If just one graph violets my assumptions I will use Wilcoxon RankSum test.

Normal Plot for Males Gross Income

Normal Plot for Females Gross Income

As we look at both normal plots we see that both normal plots have gross violations, which violates my assumption of the data being distributed normal for males and females. So I will switch to using a Wilcoxon Rank-Sum test. When I run this test I get a p-value of 0.1046 which I will FTR (fail to reject) my null hypothesis. My conclusion then is that there is not enough evidence to suggest a difference in gross income based on gender.

Out of my 36 subjects I drew a table of the breakdown of their Political Party Affiliation. Democrat Republican Independent Other Total 8 16 7 5 36

For this section I will now be looking into and seeing if there is a relationship between political party and see how respondents responded to President Obama being re-elected, health care bill being passed, and the death penalty.

Table 5: Political party and answer for Obama re-election Yes Republicans Democrat Independent Other 1 3 1 1 No 12 2 2 3 Undecided 3 3 4 1

Table 5 shows the breakdown of party affiliation and how the respondents think about Obama being re-elected for a second term. For this I will be using a Chi-Squared Test of Independence.

My p-value came out to be 0.1557 which I will FTR (fail to reject). My conclusion will be there is not enough evidence to suggest a relationship between political parties in the decision of if President Obama will be re-elected.

Table 6: Political party and Health Care Bill Yes Republican Democrat Independent Other 3 3 2 No 12 1 3 2 Undecided 1 4 2 3

Table 6 is the breakdown for political party and how parties feel about the health care bill being passed. I also used a Chi-Squared Test of Independence and I got a p-value of 0.0508 which I will FTR. Which tells me that there is not enough evidence to suggest a relationship between party affiliation and the health care bill being passed.

Table 7: Political party and answer to the Death Penalty. Yes Republicans Democrat Independent Other 13 5 5 1 No 3 2 2 2 2 1 Undecided

For this test I used a Chi-Squared Test of Independence. Table 7 shows the breakdown of political party affiliation and there answer to the death penalty. I ran the test and got a p-value of 0.0876, so I will FTR my null hypothesis. My conclusion is that there is not enough evidence to suggest a relationship between political party and whether or not they are in favor of the death penalty.

Table 8 Yes Left Right Ambidextrous 4 20 8 1 No Undecided 1 2

Table 8 is a breakdown of my samples handedness and if they are in favor of the death penalty. I will be using a Chi-Squared Test of Independence. I did not find any statistical evidence (p-value = 0.2548) to suggest a relationship between headedness and whether or not they are in favor of the death penalty. An add on question that I will find is if there is a relationship between political party and if the respondent is in favor of Proposition 8. Proposition 8 is a proposition in California that eliminates the rights to same-sex couples to get married. This proposition was passed on the November 2008 elections and remains in effect to this day.

Table 9 Yes Republican Democrat Independent Other 10 4 2 1 No 6 4 5 4

Table 9 show the breakdown of political party and how they answered to the question of if they are in favor of proposition 8. I used a Chi-Squared Test of Independence. I got a p-value of 0.2629 and I came to the conclusion of that there is not enough statistical evidence to suggest a relationship between party affiliation and if they are in favor of Proposition 8. The next comparison I will be looking at is if there a relationship between gender and weight. The first normal plot will be for weight of males, and the second will be for females. I will want to conduct a 2-sample T-test, and both of my normal plots need to be reasonably normal. If just one normal plot has gross violations I will use Wilcoxon Rank-Sum test. Normal Plot for Male Water Consumption

Normal Plot for Female Water Consumption

Looking at both normal plots there are many gross violations going on. So instead of using a 2-sample t-test I will be using instead the Wilcoxon Rank-Sum test. There is enough evidence (p-value = 0.0002) to suggest a relationship between gender and weight.

Discussion At the end of all my results I failed to reject most of my p-values. Like for the annual gross income for males and females I did not have enough statistical evidence to suggest that males earned more than females. The reason could be that I didnt have enough samples to really compare. Also most of my political affiliation tests came to a conclusion that I didnt have enough evidence. Now that could mean that party affiliation may not play a role in what Americans views are in the death penalty and Proposition 8. If I were to do this research project again I would change how many samples I used. For this project I used a sample of 36, next time I would use a bit more like 50.

Vous aimerez peut-être aussi