
Describing data by numerical methods means summarizing the data with numbers, that is, specific values, rather than with graphs or tables.

There are two groups of measures: 1) measures of location, which measure the center of the data; 2) measures of variability, which describe the variability or dispersion of the data.

Look at these values: if we lay the values out like this, we have a set of values. What is the best single value to give an impression of all the values? Answer: the center of the data is the best way to describe and summarize the data.

We are going to measure the center. We will deal with three measures of central tendency: 1) mean, 2) median, 3) mode. We are already familiar with these, so the doctor said we will go through them quickly.

This data is the monthly income of 70 diabetic patients. We would like to know whether the average income of diabetics is higher or lower than that of the general population.
425 440 450 465 480 510 575
430 440 450 470 485 515 575
430 440 450 470 490 525 580
435 445 450 472 490 525 590
435 445 450 475 490 525 600
435 445 460 475 500 535 600
435 445 460 475 500 549 600
435 445 460 480 500 550 600
440 450 465 480 500 570 615
440 450 465 480 510 570 615

Can you give information about the average income of diabetics? Just looking at the raw data makes that hard, so we are going to summarize it. First of all we will use the mean to summarize the data.

MEAN: We are talking about a special type of mean, the arithmetic mean, also known as the average. The arithmetic mean is calculated by totaling the results of all observations and dividing by the total number of observations.

EX: The heights of seven girls are as follows: (141, 141, 143, 144, 145, 146, 155) cm. Calculate the average:
Total: 1015
Mean: 1015/7 = 145 cm
Remember to state the AVERAGE together with its UNIT.

* If the data are from a sample, the mean is denoted by x̄ (x-bar):

\bar{x} = \frac{\sum x_i}{n}

Ex: if we have the heights of 7 girls, x1 is the height of the first girl, x2 is the height of the second girl, and so on. The mean is the sum of the x's divided by n.
* If the data are from a population, the mean is denoted by μ (mu):

\mu = \frac{\sum x_i}{N}
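The arithmetic mean described above can be sketched in a few lines of Python. This is only an illustration using the heights example from these notes; the function name `arithmetic_mean` is a hypothetical name chosen here, not something from the lecture.

```python
def arithmetic_mean(values):
    """Total of all observations divided by the number of observations."""
    if not values:
        raise ValueError("the mean of an empty data set is undefined")
    return sum(values) / len(values)


# Heights of the seven girls from the example, in cm.
heights_cm = [141, 141, 143, 144, 145, 146, 155]

total = sum(heights_cm)                 # 1015
mean_cm = arithmetic_mean(heights_cm)   # 1015 / 7 = 145.0

print(f"Total = {total}, mean = {mean_cm} cm")
```

The same function serves for a sample or a population; only the notation changes (x-bar with n, or mu with N).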

In these formulas, n is the sample size and N is the population size.

Look at this data:

This data represents the cholesterol levels (mg/dl) of 20 subjects:

190, 199, 198, 196, 192, 199, 198, 196, 193, 199, 198, 196, 196, 190, 199, 198, 198, 196, 400, 480

What is strange about this data? All of the values are close to each other except two, which are extreme. The average of all the values is 217.2. What do you think about this value? It does not summarize the two extreme values, and it does not summarize the rest of the values either. So the mean is not a good measure in this case. Why? Because we have extreme values.

If we exclude the extreme values, the mean becomes 196 mg/dl. But we cannot exclude these two values, because they are REAL values. So we have to look for another measure, which is the median.

Median: the value that divides the distribution (the data) into two equal halves: 50% of the values are below or equal to it, and 50% are above or equal to it.

To calculate the median we need to:
1) List all observations from lowest to highest.
2) Count the number of observations (n).
3) Determine the position of the median, which is (n+1)/2.
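The three steps above can be sketched in Python as follows. This is a minimal illustration, applied to the cholesterol values exactly as they are listed in these notes; the function name `median_by_position` is a hypothetical name chosen here.

```python
def median_by_position(values):
    """Median via the (n + 1) / 2 position rule from the steps above."""
    ordered = sorted(values)        # step 1: lowest to highest
    n = len(ordered)                # step 2: count the observations
    if n == 0:
        raise ValueError("the median of an empty data set is undefined")
    position = (n + 1) / 2          # step 3: 1-based position of the median
    lower = int(position)           # whole part of the position
    if position == lower:
        # Odd n: a single middle value.
        return ordered[lower - 1]
    # Even n: average the two values around the position.
    return (ordered[lower - 1] + ordered[lower]) / 2


# Cholesterol levels as listed in the notes (mg/dl).
cholesterol = [190, 199, 198, 196, 192, 199, 198, 196, 193, 199,
               198, 196, 196, 190, 199, 198, 198, 196, 400, 480]

cholesterol_median = median_by_position(cholesterol)
print(f"Median = {cholesterol_median} mg/dl")  # 198.0
```

With n = 20 the position is 10.5, so the result is the average of the 10th and 11th ordered values. The two extreme values sit at the end of the ordered list and do not move the median away from the bulk of the data.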

EX1: The weights of seven girls are as follows: (47, 42, 44, 40, 43, 72, 41) kg. Calculate the median:
1) Arrange the data in ascending order: (40, 41, 42, 43, 44, 47, 72)
2) The position of the median is (n+1)/2 = (7+1)/2 = 4
So the median is the fourth value, which is 43 kg.
* Don't forget to arrange the data.

EX2: The weights of eight girls are as follows: (47, 42, 44, 40, 43, 50, 72, 41) kg. Calculate the median:
1) Arrange the data in ascending order: (40, 41, 42, 43, 44, 47, 50, 72)
2) The position of the median is (n+1)/2 = (8+1)/2 = 4.5
So the median lies between the fourth and fifth positions, and we take the average of these two values: (43+44)/2 = 43.5 kg.
* Don't forget to arrange the data.
** When the number of values is even, we have two middle values, so we take their average.
** When the number of values is odd, we have a single middle value for the median.

Mode: another measure of central tendency. It is the most frequent value in the data set. It is useful for categorical data.

Example: the weights of 7 girls are as follows: (47, 44, 44, 40, 43, 72, 44) kg.
Solution:
1) Arrange the data: 40, 43, 44, 44, 44, 47, 72
2) The mode is 44 kg.

Look at this figure. What do we call it? This is a histogram, and the curve is a normal curve. The difference between a histogram and a bar diagram is that the bars in a histogram are adjacent to each other.
[Figure: histogram of VAR00001 with a normal curve; Mean = 11.1, Std. Dev = 3.17, N = 39]

* Sometimes we cannot find a mode in the data, as in this EXAMPLE: 1, 2, 3, 4, 5, 6, 7.

The mode here is: { NO MODE }. It is not ZERO, because zero is a value, and the mode could be zero.
* Sometimes you will find two values that occur equally often. Ex: what is the mode of the following data: (1, 2) (2, 3)? We have two modes, so we call this data bimodal. The curve for bimodal data has two peaks.

* Sometimes you will have more than two modes; in that case the data are multimodal.
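The cases above (one mode, no mode, bimodal, multimodal) can be sketched with a small Python function. This is only an illustration: the name `modes` is hypothetical, the bimodal list is made-up data, and the rule "every value occurs once means no mode" is the convention assumed from these notes.

```python
from collections import Counter


def modes(values):
    """Return all most-frequent values; an empty list means NO MODE."""
    counts = Counter(values)
    if not counts:
        return []
    highest = max(counts.values())
    if highest == 1:
        # Assumed convention from the notes: nothing repeats, so no mode.
        return []
    return sorted(v for v, c in counts.items() if c == highest)


weights_kg = [47, 44, 44, 40, 43, 72, 44]    # example from the notes
no_repeats = [1, 2, 3, 4, 5, 6, 7]           # example from the notes
two_peaks = [1, 2, 2, 3, 3]                  # made-up data for illustration

print(modes(weights_kg))   # [44]
print(modes(no_repeats))   # []  -> no mode (which is different from a mode of 0)
print(modes(two_peaks))    # [2, 3] -> bimodal
```

Returning a list keeps "no mode" (an empty list) distinct from a mode whose value happens to be zero, which would be `[0]`.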

Percentiles: Percentiles provide information about how the data are spread over the interval from the smallest value to the largest value. The 95th percentile means that 5% of the data lie above it and 95% lie below it. The 50th percentile means that 50% of the data lie above it and 50% lie below it; that is, it is the median.

Example: For adults we can calculate the body mass index directly, but for children we cannot, because they grow very fast, so we use percentiles to assess their body mass index.

To calculate the position of the pth percentile:
i = (p/100)n
where i is the index (position) and n is the number of observations.

* If i is not an integer, round it up (e.g. 7.1 or 7.9 becomes 8); the percentile is the value in that position.
* If i is an integer, the percentile is the average of the values in positions i and i+1.

Quartiles
Quartiles are specific percentiles:
First quartile = 25th percentile
Second quartile = 50th percentile = median
Third quartile = 75th percentile

These examples are from the slides, using the 70 monthly income values:

90th percentile
i = (p/100)n = (90/100)70 = 63
Averaging the 63rd and 64th data values: 90th percentile = (580 + 590)/2 = 585

Third quartile
Third quartile = 75th percentile
i = (p/100)n = (75/100)70 = 52.5, rounded up to 53
Third quartile = the 53rd data value = 525
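The percentile rule and the two slide examples can be checked with a short Python sketch. It follows the rule as stated in these notes (statistical packages often use other interpolation rules and may give slightly different answers). The function name `percentile` is hypothetical, and the sketch assumes p is a whole number between 1 and 99.

```python
def percentile(values, p):
    """pth percentile using the position rule i = (p/100) * n."""
    if not 0 < p < 100:
        raise ValueError("this sketch assumes 0 < p < 100")
    ordered = sorted(values)
    n = len(ordered)
    # Whole-number arithmetic avoids rounding trouble in (p/100) * n.
    whole, remainder = divmod(p * n, 100)
    if remainder == 0:
        # i is an integer: average the values in positions i and i + 1.
        return (ordered[whole - 1] + ordered[whole]) / 2
    # i is not an integer: round up and take the value in that position.
    return ordered[whole]  # 0-based index `whole` is 1-based position whole + 1


# The 70 monthly income values from the notes.
incomes = [
    425, 440, 450, 465, 480, 510, 575, 430, 440, 450, 470, 485, 515, 575,
    430, 440, 450, 470, 490, 525, 580, 435, 445, 450, 472, 490, 525, 590,
    435, 445, 450, 475, 490, 525, 600, 435, 445, 460, 475, 500, 535, 600,
    435, 445, 460, 475, 500, 549, 600, 435, 445, 460, 480, 500, 550, 600,
    440, 450, 465, 480, 500, 570, 615, 440, 450, 465, 480, 510, 570, 615,
]

p90 = percentile(incomes, 90)   # i = 63, average of 63rd and 64th values
q3 = percentile(incomes, 75)    # i = 52.5, rounded up to the 53rd value

print(p90, q3)  # 585.0 525
```

Because the second quartile is the 50th percentile, `percentile(incomes, 50)` gives the median of the same data under this rule.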

Look at this data:
A: 8 9 10 10 10 11 12
B: 1 5 10 10 10 15 19
What are the mean, median, and mode of each data set? The mean, median, and mode all equal 10 for each data set. What is the difference between the two data sets? The values in A are close together, but in B they are far from the center.
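The comparison of A and B can be reproduced with Python's standard `statistics` module. This is only a sketch: the helper name `summarize` is hypothetical, and the range (largest value minus smallest value) is used here as one simple way to show the spread, which is an assumption since this part of the notes does not name a specific measure of variability.

```python
from statistics import mean, median, mode


def summarize(values):
    """Three measures of center plus the range as a simple look at spread."""
    return {
        "mean": mean(values),
        "median": median(values),
        "mode": mode(values),
        "range": max(values) - min(values),  # assumed illustration of spread
    }


data_a = [8, 9, 10, 10, 10, 11, 12]
data_b = [1, 5, 10, 10, 10, 15, 19]

summary_a = summarize(data_a)
summary_b = summarize(data_b)

print("A:", summary_a)
print("B:", summary_b)
```

Both summaries show the same center (10 for mean, median, and mode), while the range is 4 for A and 18 for B, so measures of center alone cannot tell the two data sets apart.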

More examples :

Find the mode of: 9, 3, 3, 44, 17, 17, 44, 15, 15, 15, 27, 40, 8
Put the numbers in order for ease: 3, 3, 8, 9, 15, 15, 15, 17, 17, 27, 40, 44, 44
The mode is 15 (15 occurs the most, at 3 times).

Find the median of: 9, 3, 44, 17, 15 (odd amount of numbers)
Line up your numbers: 3, 9, 15, 17, 44 (smallest to largest)
The median is 15 (the number in the middle).

Find the median of: 8, 3, 44, 17, 12, 6 (even amount of numbers)
Line up your numbers: 3, 6, 8, 12, 17, 44
Add the 2 middle numbers and divide by 2: 8 + 12 = 20, 20 ÷ 2 = 10
The median is 10.
