Académique Documents
Professionnel Documents
Culture Documents
Chapter 17
Statistics
Statistics is a body of methods used for collecting, organizing, presenting,
analyzing and interpreting numerical data which are used for making better,
informed and wise decisions.
17.1 Data
Data are raw facts or results of observations which have not been processed or
arranged in a particular order. For example, scores of students in a class and ages
of students in a particular school are examples of numerical data.
A set of given data can be arranged or organized into a frequency table, e.g., 8, 8,
8, 8, 8, 9, 10, 10, 10, 10, 10, 10, 11, 12, 12, 20, 21, 25, 25 can be arranged or
organized into a frequency table using Tally system.
Frequency Table
400
CHAPTER 17: STATISTICS
Find below the cumulative frequency of the values in the frequency table.
The table below shows the frequencies of baby girls born into 4 families in a
week.
401
CHAPTER 17: STATISTICS
Simple Bar Chart showing baby girls born into four families.
12
10
8
Frequency
6
4
2
0
A B C D
Family
2. Multiple Bar Charts show how two or more set of inter-related data are
represented using different bars. The bars are differentiated using different
shades or colours.
3. Component Bar Chart is used to show the dimensions of a whole bar into
component bars.
Name of Scores
Student Mathematics Biology English
Language
Ade 10 15 5
Bello 15 20 10
Chris 10 20 5
The information can be used to draw a multiple and component bar charts.
402
CHAPTER 17: STATISTICS
25
20
Test Scores
15 Maths
Biology
10 Eng. Lang
0
Ade Bello Chris
Students' Names
50
45
40
Test Scores
35
30 Eng. Lang
25 Biology
20 Maths
15
10
5
0
Ade Bello Chris
Students' Names
17.2.4 Histogram
This is a diagrammatic way of representing a frequency distribution by blocks or
rectangles without gaps between them
The table below shows the number of customers attended to by a shop attendant
from Monday to Friday in a particular week.
403
CHAPTER 17: STATISTICS
Number of customers 8 5 10 15 20
Weekdays Mon Tue Wed Thu Fri
20
18
16
Number of Customers
14
12
10
8
6
4
2
0
Mon Tue Wed Thur Fri
Weekdays
17.2.5 Pie Chart
When numerical data are represented by sectors of a circle, the angle of each of
the sectors is made to be proportional to the corresponding frequency it
represents.
In other words, Pie Charts are used to illustrate proportions of frequencies which
are converted to degrees (i.e., to form sectors).
To find each sector, divide each of the frequencies by their total and then multiply
by 360 .
404
CHAPTER 17: STATISTICS
Food - N5,000
Clothing - N2,000
Saving - N10,000
Rent - N1,000
Using a pair of compasses and a pencil with a protractor, you can easily draw the
required Pie Chart as shown below.
Saving
200
40 20
100
Rent
Clothing
Food
405
CHAPTER 17: STATISTICS
x i
sum of values
Mean x i 1
n total nuber of values
x is the mean;
x is the value or number;
n is the number of values for which the mean is being calculated;
i is equal to each value of x.
For example, the mean of the students’ scores given in the table below can be
calculated using the procedure earlier given.
Since the scores and their frequencies have been given, the mid value of each
class interval is obtained by dividing the sum of lower and upper limits of each
class by 2. For example, for 10–12, the mid-value
406
CHAPTER 17: STATISTICS
10 12 22
11.
2 2
Others are specified in the table below
427
M ean score
29
43.72
The method of assumed mean can be used to calculate the mean of these
ungrouped data: 10, 12, 15, 25, 30, 40.
Total deviation = 42
The mean = x assumed mean + mean of deviation
Assumed mean = 15
407
CHAPTER 17: STATISTICS
42
Assume mean 15, Mean of deviation 7
6
x 15 7
22
The same method of assumed mean can also be used to calculate the mean of
grouped data shown below:
Find the mid-values of the class intervals to get the assumed mean.
( x A)
Note: is used to calculate the assumed mean of ungrouped data while
n
f ( x A)
is used for grouped data.
f
408
CHAPTER 17: STATISTICS
The median can be found by first of all arranging the values in ascending order:
1, 2, 4, 5, 8.
1
M edian (n 1)th term
2
1
(5 1)th
2
6
3rd term
2
4.
This is applicable to ungrouped data.
We can consider the table below and then find the median.
N 25
12.5 th item
2 2
The value lies in the interval 60 –70
(i c)
Median L
f
409
CHAPTER 17: STATISTICS
N
L = 60, N = 25, C = 2
, i = 10 and f = 10
10 (12.5 5)
60
10
10 (7.5)
60
10
75
60
10
60 7.5
67.5
The Mode
This is the most frequent value. It occurs the greatest number of times in a
distribution. For example, the mode of 10, 8, 8, 8, 12, is 8.
This is because 8 occur 3 times which is the greatest number of times.
If there are two modes, the distribution is said to be bimodal, e.g., 8 and 10 are the
two modes in 8, 10, 8, 2, 10, 12, 9.
But then, if there are three modes, the distribution is said to be tri-modal and
multi-modal for many modes.
The mode of grouped data can be estimated from a histogram by joining the ends
of the rectangular area of the modal class to that of two adjacent classes. The
point of intersection of the two lines drawn is the mode estimate.
The mode can be found by drawing a histogram and then estimate it.
The modal class is between 21 – 40.
410
CHAPTER 17: STATISTICS
The
mode estimate = 28.5
1. The Range:
This is the difference between the highest and lowest values in a set of data.
Consider 10, 12, 8, 20, 11, 12. The range is 20–8 = 12.
2. Mean Deviation
This is the mean of the absolute deviations of the values from the mean of the
group. The deviation of values from the mean is 0 and that brings about the use of
absolute value of the deviations.
411
CHAPTER 17: STATISTICS
xx
for ungrouped data
n
and
f x x
for grouped data.
f
x mean of the given items;
x value of each item;
n the number of items;
f the frequencies of the items.
The mean deviations of the following distributions can be found using any of the
formulae given above.
Solution:
10 20 8 5 15 12
(a) The mean =
6
70
x 11.67.
6
x xx xx
10 –1.67 1.67
20 8.33 8.33
8 –3.67 3.67
5 –6.67 6.67
15 3.33 3.33
12 0.33 0.33
x x 24
xx
Mean deviation
n
24
4
6
412
CHAPTER 17: STATISTICS
189.34
fx 475
The mean x
f 30
15.03
f x x
The mean deviation
f
189.34
30
6.31.
3. Variance
This is the mean of the squares of each observation from the mean of a set of data.
f x x
2
Variance .
f
4. Standard Deviation
This is the square root of the variance.
f x x
2
Standard deviation .
f
(a) 2, 3, 5, 6, 8, 9, 10.
413
CHAPTER 17: STATISTICS
(b)
Class interval 0–10 10–20 20–30 30–40 40–50
Frequency 6 10 15 2 4
Solution:
Values Frequency x x x x 2
(x) (f)
2 1 –4.14 17.14
3 1 –3.14 9.86
5 1 –1.14 1.30
6 1 –0.14 0.02
8 1 1.86 3.46
9. 1 2.86 8.18
10 1 3.86 14.90
f x x
2
f 7
54.86
fx 2 3 5 6 8 9 10
The mean
f 7
43
6.14
7
f x x
2
Variance
f
54.86
7.84
7
54.867
Standard deviation
7
7.84 2.8
5. Quartiles
These are points or values that divide a particular distribution into four parts.
These are:
(a) Lower Quartile (First quartile): This is 14 of the way up the distribution.
It is written as Q1 14 (n)th term.
1
(b) Median (second quartile): This is 2 of the way up the distribution. It is
written as Q2 12 (n)th term.
414
CHAPTER 17: STATISTICS
(c) Upper Quartile (third quartile): This is 34 of the way up the distribution.
It is depicted by Q3 34 (n)th term.
(d) Semi-Interquartile Range: This is the half of the difference between the
upper quartile and lower quartile. It is expressed as 12 Q3 Q1 .
(e) Interquartile Range: Is the difference between the upper quartile and
lower quartile Q3 Q1 .
(f) Percentile: This is the pth percentile i.e., (pth%) of the way up the
distribution. It divides a distribution into hundred equal parts.
Solution:
(a) Cumulative frequency table
415
CHAPTER 17: STATISTICS
75%
50%
25%
(c) (i)
1
Lower quartile (n)th term
4
1
(30)th term
4
7.5th term
Q1 9.5.
(ii)
1
Second quartile (n)th term
2
1
(30)th term
2
15.0th term
Q2 35.5.
416
CHAPTER 17: STATISTICS
(iii)
3
Second (upper)quartile (n)th term
4
3
(30)th term
4
22.5th term
Q3 52.5.
(iv)
Interquartile range Q3 Q1
52.5 9.5
43.0.
(v)
Semi - Interquartile Range
1
Q3 Q1
2
1
(43.0)
2
21.5.
(vi)
80
80th percentile of 30
100
80
30
100
24th term (read the value of the mark at 24 on cumulative frequency axis)
54.5
17.6 Exercises
1. The table below shows how a company’s Sales Manager spent his 1995
annual salary.
Food - 30%
Rent - 18%
Car maintenance - 25%
Savings - 12%
Taxes - 5%
Others - 10%
417
CHAPTER 17: STATISTICS
Find the:
(a) (i) mean (ii) median (iii) mode of the marks;
(b) percentage of the students who scored at least 8 marks.
418
CHAPTER 17: STATISTICS
7. The following table shows the frequency distribution of the ages, in years,
of 50 people at a bus stop.
Find the:
(a) mean (b) standard deviation of the grouped distribution.
Marks 4 5 6 7 8 9
No. of Candidates 5 8 5 6 4 2
Find the mean score to the nearest whole number.
10. The table below shows the weekly profit in Naira from a Mini Market.
419
CHAPTER 17: STATISTICS
11. The following are the lengths (in centimeters) of fifty planks, cut by a
machine in a saw-mill:
33 49 60 58 59 71 42 88 68 91
54 32 81 59 41 55 38 56 86 62
50 69 50 84 77 33 71 42 69 93
61 51 23 76 63 96 26 70 66 80
44 52 46 33 68 39 61 71 48 66
(a) Using class interval 21–30, 31–40, …
(i) construct the frequency table;
(ii) draw the histogram for the distribution.
(b) (i) Identify the modal class;
(ii) Use your histogram to estimate the mode of the distribution.
Items Expenditure in
thousand Naira
Wages and salaries 25
Fuel and power 15
Raw materials 65
Maintenance 5
Miscellaneous 10
(a) Draw a pie chart to illustrate the information.
(b) What percentage of the total expenditure goes on fuel and power?
420
CHAPTER 17: STATISTICS
(a)
JS.2 JS.1
o
48
oo
56
48 SS.3
44o
JS.3 o
96
52o
o
64
SS.2
SS.1
Fig. 17.1
421
CHAPTER 17: STATISTICS
16 8
(b) Pr(selecting a SS1 student) or 0.178
90 45
2. The following are the ages of 50 workers in a factory some years back.
71, 79, 46, 35, 25, 28, 56, 82, 52, 68,
68, 64, 93, 95, 78, 43, 58, 72, 57, 60,
50, 98, 62, 63, 70, 73, 53, 44, 86, 69,
72, 68, 88, 51, 32, 59, 72, 73, 46, 85,
40, 55, 52, 61, 96, 67, 82, 72, 48, 59,
422
CHAPTER 17: STATISTICS
(b)
(c)
(i) Median = 63.5
(ii) Q1 52.5, Q3 76.5, Inter-quartile range = 76.5 – 52.5 = 24
9 10 5 28 14
(d) Pr(picking a worker who is more than 60 years) or 0.56
450 50 25
Sores 1 2 3 4 5 7 8 9 10
No. of Pupils 1 1 5 3 k2+1 6 2 3 4
423
CHAPTER 17: STATISTICS
Solution:
x 1 2 3 4 5 7 8 9 10
f 1 1 5 3 k2+1 6 2 3 4
fx 1 2 15 12 5k2+5 42 16 27 40
fx 160 5k , f 26 k
2 2
(a) (i)
fx x
f
160 5k 2
6
26 k 2
160 5k 2 6(26 k 2 )
160 5k 2 156 6k 2
160 156 6k 2 5k 2
4 k2
2k
k 2
57
(iii) Median 6
2
424
CHAPTER 17: STATISTICS
(b)
Calculate the:
(a) mean;
(b) standard deviation, correct to two decimal places.
(WASSCE June 2007)
425
CHAPTER 17: STATISTICS
Solution:
Intervals x f fx fx2
0–4 2 1 2 4
5–9 7 5 35 245
10 – 14 12 10 120 1440
15 – 19 17 9 153 2601
20 – 24 22 5 110 2420
30 420 6710
fx fx
2 2
Height (cm) 60 – 64 65 – 69 70 – 74 75 – 80 – 84 85 – 89
79
Frequency 8 13 17 24 11 7
426
CHAPTER 17: STATISTICS
x
fx 5950 74.375
(a) Mean height f 80
74.4 (1 dec. place)
(b) .
427
CHAPTER 17: STATISTICS
Mark 1–5 6 – 10 11 – 15 16 – 20 21 – 25
Frequency 4 6 11 8 1
Solution:
428
CHAPTER 17: STATISTICS
Marks 11 – 20 21 – 30 31 – 40 41 – 50 51 – 60 61 – 70 71 – 80 81 – 90 91 – 100
Number of 9 25 43 49 29 22 10 8 5
Candidates
(a) (i)
Marks f Cf Upper Boundaries
11 – 20 9 9 20.5
21 – 30 25 34 30.5
31 – 40 43 77 40.5
41 – 50 49 126 50.5
51 – 60 29 155 60.5
61 – 70 22 177 70.5
71 – 80 10 187 80.5
(ii)
81 – 90 8 195 90.5
91 – 100 5 200 100.5
429
CHAPTER 17: STATISTICS
Q1 Q3
1
(b) (i) Q1 200 50 th from graph = 34.5
4
3
Q3 200 150 th 58.5
4
Interquartile range is 58.5-34.5 = 24.0
(ii) Minimum pass mark = 83.5
Marks 1 – 10 11 – 20 21 – 30 31 – 40 41 – 50 51 – 60 61 – 70 71 – 80 81 – 90 91-100
Frequency 4 6 9 12 20 15 7 5 0 2
430
CHAPTER 17: STATISTICS
Solution:
(b) .
Q1
1
(c) (i) Median 80 40th 44.5
2
431
CHAPTER 17: STATISTICS
1
(ii) Lower quartile (Q1 ) 80 20 th 31.5
4
80 75 5 1
(d) (i) Prob. (getting 75% and above) 0.0625
80 80 16
30 3
(ii) Prob. (failing if 40% is pass mark) 0.375
80 8
9. The table below shows how a man spends his income in a month.
432
CHAPTER 17: STATISTICS
House Rent
Provision 54o
45o 81o Food
Transportation
21 37 49 27 49 42 26 33 46 40
50 29 23 24 29 31 36 22 27 38
30 26 42 39 34 23 21 32 41 46
46 31 33 29 28 43 47 40 34 44
26 38 34 49 45 27 25 33 39 40
433
CHAPTER 17: STATISTICS
Solution:
(b)
434
CHAPTER 17: STATISTICS
11.
Class Interval Frequency
30 – 39 5
40 – 49 12
50 – 59 18
60 – 69 24
70 – 79 19
80 – 89 14
90 – 99 8
Solution:
435
CHAPTER 17: STATISTICS
(a)
Variance
f (x x) 2
25604
256.04
f 100
436
CHAPTER 17: STATISTICS
Standard deviation
f ( x x) 2
256.04 16.00
f
12. The table shows the number of children per family in a community.
No. of Children 0 1 2 3 4 5
No. of Families 3 5 7 4 3 2
(b) If a pie chart were to be drawn for the data, what would be the
sectorial angle representing families with one child?
(WASSCE June 2009)
Solution:
No. of Children No. of Families Cumulative frequency
(x) (f) cf
0 3 3
1 5 8
2 7 15
3 4 19
4 3 22
5 2 24
(a) (i) From the table, the mode is 2 children (has the highest frequency)
3
(ii) The third quartile is 24 18th which is 3 children
4
(iii) The probability that a family has at least 2 children is
7 4 3 2 16 2
or 0.67
24 24 3
5
(b) Sectorial angle representing families with one child is 360 o 75o
24
437
CHAPTER 17: STATISTICS
60 54 40 67 53 73 37 55 62 43
44 69 39 32 45 58 48 67 39 51
46 59 40 52 61 48 23 60 59 47
65 56 74 47 40 59 68 51 50 50
71 51 26 36 38 70 46 40 51 42
Solution:
(a)
(b) Mean =
fx 2535 50.7
f 50
(c) Percentage of students who scored more than 60%
8 3 11
100 22%
50 50
14.
62 54 53 44 46 55 46 56 68 63
59 61 66 54 39 48 47 53 59 57
50 35 40 30 46 44 36 49 54 51
57 56 45 33 38 41 40 45 53 58
51 45 48 34 36 46 43 49 63 52
(a) Using the class intervals of 30 – 34, 35 – 39, 40 – 44, ... Construct the
frequency distribution table.
(b) Calculate the;
(i) mean;
438
CHAPTER 17: STATISTICS
Solution:
(i) Mean
fx 2455 49.1 49.10 (2d . p)
f 50
f (x x
2
60 54 40 67 53 73 37 55 62 43
44 69 39 32 45 58 48 67 39 51
46 59 40 52 61 48 23 60 59 47
65 58 74 47 40 59 68 51 50 50
71 51 26 36 38 70 46 40 51 42
439
CHAPTER 17: STATISTICS
(b)
440
CHAPTER 17: STATISTICS
9
From the graph, the no. of students is 9. Hence, prob. is
50
16. The frequency distribution of the weight of 100 participants in a high jump
competition is as shown below:-
Weight (kg) 20 – 29 30 – 39 40 – 49 50 – 59 60 – 69 70 – 79
Number of
Participant 10 18 22 25 16 9
Solution:
Weight (kg) Upper Boundary Frequency Cumulative
20 – 29 29.5 10 10
30 – 39 39.5 18 28
40 – 49 49.5 22 50
50 – 59 59.5 25 75
60 – 69 69.5 16 91
70 – 79 79.5 9 100
441
CHAPTER 17: STATISTICS
(b)
Q3
Q1
17. The table below shows the distribution of scores, in percentage obtained
by 20 students in a class test.
Scores 20 25 30 35 40 45 50
Number of Students 5 3 1 6 2 1 2
442
CHAPTER 17: STATISTICS
Solution:
(a) .
(b) (i) The mode is the score with the highest no. of students ie. 35
(ii) The median is the middle score when arranged in ascending or
descending order. The mid score falls between 10th and 11th
35 35
students ie. 35
2
(c) The mean
(20 5) (25 3) (30 1) (35 6) (40 2) (45 1) (50 2)
20
100 75 30 210 80 45 100
20
640
20
32
443
CHAPTER 17: STATISTICS
18. The table gives the distribution of marks for 360 candidates who sat for an
examination.
Mark% 0-9 10-19 20-29 30-39 40-49 50-59 60-69 70-79 80-89
Number of
Candidates 20 48 60 72 80 40 25 10 5
Solution:
444
CHAPTER 17: STATISTICS
(a)
Q3
Q2
Q1
445