Académique Documents
Professionnel Documents
Culture Documents
HAMIDULLAH
Organized by Waqas Ahmad
Waqas Ahmad
waqaskanju@gmail.com
http://www.wsk.host56.com
Introduction to Statistics:
The word Statistics comes from the Italian word “Statista” which means “political state “. Statistics is the
branch of knowledge which establishes the numerical facts of so many problems that arise in human
life.
Statistics been an exciting discipline has wide applications nearly in all phases of human endeavor.
Dealing with affairs of state to the influence in business, social, & pure sciences, medical sciences
agricultural and engineering, it has played a basic rule in developing information technology.
Today statistics is a scientific study of handling qualitative information it provides a methodology for
collection, classification, description and interpretation of data obtained through a conduct of surveys
and experiments. The main purpose is to describe and draw inferences about the characteristics of
population.
Definition of Statistics: The term statistics is used in a singular sense or plural sense.
In Singular sense it refers to the science comparison method used in the collection, presentation,
analysis and interpretation of data and their applications.
In plural sense it conveys two meanings. Firstly the word statistics means an aggregate of numerical
facts (data) collected for the purpose of some information.
For example the height of First year Students, the annual rain fall in Mingora city for the last ten years.
Etc.
Secondly, it stands for any value calculated from sample data. For example the average height of First
year students, the average rain fall in Mingora city for last ten years.
Population or Universe (N): Population is a statistical term, which is used to define, a finite or infinite
number of similar or like.
For example the population of students in UOS, the height of students of Govt. Jahanzeb College,
children in a city are the example of population with finite size.
On the other hand population of starts in the sky, all heights between 1 & 3 meters and the population
of trees in swat are the examples of infinite population.
Population Parameter: A numerical value such as mean, standard deviation calculated from
population is knows as population parameter.
Sample (n): sample is a representative part of a population or a subset from a set of units, which is
provided by some process and it is denoted by (n).
The population are generally large or infinite so we use samples drawn from these populations to study
the desirable characteristics and properties of the population.
Page 1 of 51
http://www.wsk.host56.com
Sampling: The process of drawing a part from a whole/population is called sampling.
Sample statistics: The numerical value such as mean and standard deviation calculated from sample is
know is sample statistics.
Mean 𝑋− and standard deviation (S) are the sample statistics or simple statistics. The value of
parameter is fixed or constant where is the value of statistics varies from sample to sample.
Lecture 2
Random Sampling: When every unit of a universe/population has an equal and independent chance of
being chosen for a sample. The sample is called random sampling.
1. Descriptive Statistics: The branch of statistics devoted to the summarization and representation
of dataset is called descriptive statistics. // Need 5 examples home work
For example, college teacher compute an average grade for has statistics class. The average grade
describes the performance of that particular class but do not make a generalization about other
classes, we can say that the college teacher is using descriptive statistics.
2. Inferential Statistics: The branch of statistics concerned with using sample data to make an
inference about a larger group of data is called inferential statistics.
// Need 5 examples home work.
1. Statistical data are now widely used in taking all administrative decisions.
a. The authorities in education departments are considering the question of opening new
schools and colleges. Obviously, the decision will be based on knowledge of the school going
populations at different levels.
b. The authorities want to revise the pay scales of employees in view of an increase in the cost
of living. Statistical techniques may help to determine the raise of living.
2. Statistics plays an important role in business because it provides a quantitative basis for arriving
at decisions in all matters connected with operation of business. For example, a successful
business man must know the demand of his consumers, statistics would help to plan production
according to the demands of the consumers.
3. Statistics has proved to be of immense use in astronomy, biology, zoology, physics, chemistry,
agricultural, meteorology, economics, education and sociology etc.
Page 2 of 51
http://www.wsk.host56.com
4. Tools of statistics are indispensable for transport authorities. The transport authorities before
launching any new items first undertake a survey to see whether it would be feasible for them
or not.
5. Actuarial science is the discipline that applies the statistical and mathematical methods to assess
risks in the insurance and finance industries.
6. Biostatistics is a branch of biology that studies biological phenomena and observation by means
of statistical analysis and include medical statistics.
7. Econometrics is a branch of economics that implies statistical methods to the empirical study of
economic theories and relationship.
Lecture 3
Variable: A characteristic that can take on different possible values is called a variable. For
example heights of students, ran fall at a place, price of commodity.
Quantitative variable: If the values are expressed numerically the variable is said to be
quantitative, such as age, weights, income, number of children etc.
Qualitative Variable: If the values refers to non-numerical qualities, the variable is said to be
qualitative such as poverty, eye color, smoking, intelligence level.
The variables are denoted by capital letters such as X, Y, Z while small letters x, y, z denote their
values.
Discrete Variable: A discrete variable can assume only a finite number of values between any
two points. For example the number of children in a family, the number of deaths in an
accidents etc.
Continuous variable: A continuous variable may take an infinite numbers of values between
any two points. Such as heights of students. The temperature at place, the distance covered by
a tourist.
Data: Data is a plural of datum so data means information. Without data there is no ground for
statistics. Data are the core (hard middle part) of the science of statistics. In order to apply
statistical method to any type of inquiry it is necessary that statistical data be collected,
because no statistical analysis is possible in the absence of quantitative data.
Page 3 of 51
http://www.wsk.host56.com
1. Primary data: primary data is also called as original data but raw in the form, sense it is
collected for the first time by person who is going to use and they are to be arranged
and organized later on, to be fit for use. If primary data are collected directly from the
population then it is also called first-hand information.
2. Secondary data: secondary data is indirectly collected from the population or when
statistically method applied on primary data they lose their shape and become
secondary data.
Lecture 4
Classifications: Classification is the process of arranging the items in groups or classes according to their
resemblance and gives expression to the unity of attributes.
The main objects of classification is to bring out points of similarities and dissimilarities by condensing
details it saves one from mental strength. This enables one to make comparisons and draw inferences
with more ease. It prepares the ground for the proper presentation of statistical facts. It may be noted
that in ideal classification should possess the following merits.
1. Descriptive classification: When the data are classified on the basis of qualities or attributes,
which are incapable of quantitative measurement, the classification is said to be descriptive or
according to the attributes.
2. Numerical classification: This type of classification is applicable to quantitative data only. Data
relating to height, weight, income, production their value are expressed in terms of numbers
with appropriate units.
Tabulation: The process of arranging the data in tabular form are called tabulation.
Frequency distribution: A large mass of data possess different characteristics, it grouped into different
classes then observations are determined in each class. The arrangement of these classes into tabular
form makes of frequency distribution.
Page 4 of 51
http://www.wsk.host56.com
Class interval: The class interval in term of class width/length is the difference between the upper class
boundary and lower class boundary.
Note: Number of classes should be more than 6 and less than 25.
Class boundaries: If one have grouped frequency distributions with class interval having a gap between
upper class limit of one class and the lower class limit of the next class, we can make it continuous by
subtracting half the length of the gap from the lower class limit, and adding the half the length of the
gap to the upper class limit.
Lecture 5
Examples:
Q: Prepare a frequency distribution from the following figures related to bonus paid of factory workers.
61, 70, 81, 90, 105, 43, 50, 53, 62, 63, 77, 79, 74, 65, 80, 89, 95, 79, 85, 57 using 10 as class interval and
then find class marks class boundary and cumulative frequency.
Answer:
𝑟𝑎𝑛𝑔𝑒 62
Number of classes = = 6.2 ≅ 7
𝐶𝑙𝑎𝑠𝑠 𝑖𝑛𝑡𝑒𝑟𝑣𝑎𝑙 10
Page 5 of 51
http://www.wsk.host56.com
F= Frequency
C.B =Class Boundary = (lower limit of Class B minus upper limit of Class A divide by 2)
e-g (50 – 49)/2
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
H/W
Q: 133 183 178 176 130 135 118 123 194 160 163 185 133 100 109 106 145 149 139 151 157 178 188
198 144 121 168 162 193 140
Using C.I = 10
100 106 109 118 121 123 130 133 133 135 139 140 144 145 149 151 157 160 162 163 168 176 178 178
183 185 188 193 194 198
𝑅𝑎𝑛𝑔𝑒⁄ 98
No. of class = 𝐶𝑙𝑎𝑠𝑠 𝑖𝑛𝑡𝑒𝑟𝑣𝑎𝑙 = 10
= 9.8 ≅ 10
Class Interval Tally bar Frequency Class Marks Class boundary C. freq.
Page 6 of 51
http://www.wsk.host56.com
150-159 Il 2 154.5 149.5 – 159.5 15 + 2 = 17
HOME WORK
Q: Example construct a frequency distribution of the data, using class interval of size 2. Also find
cumulative frequency class marks and class boundary.
1345624635482424535451023453432553546563461244345374634577
43
Ascending order:
0111222222333333333333444444444444444455555555555566666677
78
Range = 8 – 0 = 8
8
No of Class= 2 = 4 but this value is not acceptable as no of class should not be less than 6
Page 7 of 51
http://www.wsk.host56.com
3-4 IIII IIII IIII I 16 3.5 22 + 16 = 38
7-8 I 1 7.5 59 + 1 = 60
Total 60
Lecture 6
Home work:
Q: Construct a frequency distribution table. Also find cumulative frequency and class marks.
0.27 0.23 0.21 0.24 0.30 0.26 0.25 0.25 0.35 0.37 0.31 0.29 0.34 0.32 0.29 0.30 0.29 0.30 0.31 0.27 0.28
But I solved it. I solved this question using Virtual University Lecture notes.
Answer:
As
𝑅𝑎𝑛𝑔𝑒 𝑅𝑎𝑛𝑔𝑒
No of Classes = 𝐶𝑙𝑎𝑠𝑠 𝑖𝑛𝑡𝑒𝑟𝑣𝑎𝑙
=> Class interval = ⁄𝑛𝑜 𝑜𝑓 𝐶𝑙𝑎𝑠𝑠𝑒𝑠
Page 8 of 51
http://www.wsk.host56.com
0.24 - 0.26 IIII 4 0.25 6
Lecture 7
Q: Construct a frequency distribution of forty balls diameters using appropriate class interval. Also
construct cumulative frequency, class boundaries and class marks
6.3 7.0 6.8 6.9 6.8 7.1 6.6 7.0 7.2 7.6 6.7 6.5 6.9 7.4 6.6 7.4 6.9 7.2 6.7 7.4 6.5 7.2 7.2 7.1 7.3 7.4 7.6 7.5
7.5 7.8 6.7 6.9 7.0 7.2 6.5 6.8 7.0 7.3 6.5 6.8
Answer:
1.5
No of classes = = 7.5 ≅ 8
0.2
Class Boundaries = (lower Class limit of 2nd Class – upper Class limit of 1st Class) /2
6.3 – 6.4 1 6.25 - 6.45 1=1 6.3 – Anx = (Lower Class Limit -
anx)
6.5 - 6.6 6 6.45 - 6.65 1+6=7
6.4 + Anx = Upper Class Limit +
6.7 - 6.8 7 6.65 – 6.85 7+7=14 Anx
Page 9 of 51
http://www.wsk.host56.com
6.9 - 7.0 8 6.85 – 7.05 14+8=22
Each frequency of class divided by the total frequency of all class is called relative frequency
And
Percentage of each frequency of a class divided by the total frequency is called percentage relative
frequency.
frequency
Relative Frequency =
total frequency
Frequency
Page 10 of 51
http://www.wsk.host56.com
7.1 - 7.2 7 7.045 – 7.25 22+7=29 7/40 =0.175 (7/40)*100
= 17.5
Cumulative frequency distribution: The cumulative form of a frequency distribution is called cumulative
frequency distribution.
When one wishes to know that what proportion of the elements of a data set have values above or
below certain level. The cumulative form of a frequency distribution is a very effective device for
summarization the information. If the cumulative process is from the lowest value to the highest it is
referred to “a less than type” cumulative frequency distribution. When the frequencies are cumulated
form the highest values to the lowest values, it is called a more than type cumulative frequency
distribution the graph of a cumulative frequency distribution is called ogive.
Page 11 of 51
http://www.wsk.host56.com
7.7 7.8 1 7.65 – 7.85 40 0.025 2.5 40 1
H/W
Find Relative cumulative frequency and Percentage relative cumulative frequency of the above example.
Page 12 of 51
http://www.wsk.host56.com
7.5- 7.6 4 0.1 10 39 39/40=0.975 (39/40) *100
=97.5
Lecture 8
OR
An average is the stage at which the largest number of items tends to concentrate (majmoa) and so it is
called central tendency. Average are also sometimes referred to as measures of location because they
locate the center of distribution.
Measure of Central tendency: The method we use to find the representative value of data is called
measure of central tendency.
1. The most important object of measuring location or average is to find out the value that can
represents the whole data.
2. One of the object of measuring average is very much useful for comparative study of different
distribution.
3. Average are very useful for computing various other statistical measure such as dispersion,
skewness, kurtosis and various other basic characteristics of a mass of data.
−
𝑥 = sample mean ℳ = population mean
1. Mathematical averages:
a. Arithmetic mean or simple mean
b. Geometric mean
c. Harmonic mean
Page 13 of 51
http://www.wsk.host56.com
2. Averages of position:
a. Median or quintiles or partition values
b. Mode
Arithmetic mean: One of the most common use measures of location is the arithmetic average or mean.
Arithmetic mean is mid value of the items in data. Mean is called as the central value of distribution.
Arithmetic mean from ungroup data: Mean is obtained by adding all the data set values and dividing it
by the number of values involved.
− x1+ x2+..+ xn Σ𝑥
A.M or 𝑥 = 𝑛
=
𝑛
Calculate the arithmetic mean of the values the following values 10, 20,30,40,50
10+20+30+40+50
A.M = 30
= 30
Example No 2:
Σx
If population then A.M = Μ = N
Arithmetic mean from frequency data:
Page 14 of 51
http://www.wsk.host56.com
− f1x1 + f2x2 + … + fn Σ fx
The formula used for frequency data is 𝑥 = =
f1+f2+f3+...+fn Σ𝑓
X f fx
10 1 10
20 4 80
30 10 300
40 4 160
50 1 50
Σ fx 600
= 30
Σ𝑓 20
Lecture 9
Example # 2 H/W
F 3 6 9 13 8 5 3 2
Class marks 6 7 8 9 10 11 12 5
(x)
Answer:
X f fx
6 3 18
7 6 42
8 9 72
9 13 117
10 8 80
Page 15 of 51
http://www.wsk.host56.com
11 5 55
12 3 36
5 2 10
∑ 𝑓𝑥
A.M = 𝑥̅ = ∑𝑓
Example 3: H/W
X 4 5 6 7 8 9 10 11 12 13 14
F 3 18 23 42 62 78 18 200 198 82 14
Example #1 in Classes
15 – 25 4 (15 + 25) /2 = 20 80
25 35 10 30 30
35 – 45 4 40 160
45 – 55 1 50 50
20 600
∑ 𝑓𝑥
A.M = 𝑥̅ = ∑𝑓
= 600/20 = 30
Page 16 of 51
http://www.wsk.host56.com
F 4 6 8 11 15 9
∑ 𝑓𝑥
A.M = 𝑥̅ = ∑𝑓
Use this formula
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Find the arithmetic mean of all problem we studied in previous classes. H/W
Lecture 10
Weighted Athematic Mean: In calculating arithmetic mean or average, we give equal importance to all
items of the series but it is not necessary that all the appointing of the total number of items have the
same relative degree of importance. Then the relative importance of the various items is indicated by
assigning, “weights” to the various items. The average thus computed by making use of the weights is
called weighted arithmetic average.
Example:
Let us consider a table, showing the average income of four families having earner men as below
No of earners /man 2 4 3 6
Page 17 of 51
http://www.wsk.host56.com
We find the mean for four income values. Table shows different number of earner men in each family.
The computation of the above mean takes into count on equal number of men. Therefore it will miss
lead us.
In order to find more reliable and accurate measure we should multiply each income values by the
respective number of earner men in each family. By doing this we assign weights due importance to
each income value. This Mean/average which we obtain is now called weighted arithmetic mean
∑ 𝑤𝑥 𝑥1 𝑤1 + 𝑥2 𝑤2 + 𝑥3 𝑤3 + 𝑥4 𝑤4 2(1000)+4(1300)+3 (1500)+6(2000)
𝑥̅𝑤 = ∑𝑤
= = = 1580
𝑤1 +𝑤2 + 𝑤3 + 𝑤4 2+4+3+6
Example 2:
The marks obtain by a student in statistics, mathematics and computer science were 60, 75 and 63
respectively.
Find the appropriate average if weights of one, two and one are assigned to these marks.
H/w
Example no 3: H/W
Q: A candidate obtains the following percentage in an examination English = 50, Urdu= 85, Math= 73,
computer= 69, and statistics= 55.
Find the weight mean, if the weight of 4, 3, 2, 2, and 1 respective are allotted to the subjects.
Answer: ?
Geometric mean:
Let 𝑥1 , 𝑥2 , 𝑥3 , . . , 𝑥𝑛 be the n values of a data set, thus mathematically geometric mean is given by
1
G.M = 𝑛√𝑥1 . 𝑥2 . 𝑥3 . . . . 𝑥𝑛 = (𝑥1 . 𝑥2 . 𝑥3 . . . . 𝑥𝑛 )1/𝑛 = 𝑛 log(𝑥1 . 𝑥2 . 𝑥3 . . . . 𝑥𝑛 )
1
(log 𝑥1 +𝑙𝑜𝑔𝑥2 +𝑙𝑜𝑔𝑥3 +. . +𝑙𝑜𝑔𝑥𝑛 )
𝑛
Page 18 of 51
http://www.wsk.host56.com
1
log of G.M = 𝑛 ∑ log 𝑥
∑ log 𝑥
log 𝑜𝑓 𝐺. 𝑀 =
𝑛
Lecture 11
Q: Calculate the geometric mean of the marks obtained by the class of seven students
Solution:
X log 𝑥
70 1.85
80 1.90
85 1.93
90 1.95
95 1.98
60 1.78
50 1.70
Σ log x = 13.09
∑ logx 13.09
G.M = 𝑎𝑛𝑡𝑖 𝑛
= antilog ( 7
) = antilog 1.87
G.M = 74.13
Answer
Example No 1: H/W
Page 19 of 51
http://www.wsk.host56.com
5.20, 3.50, 6, 9.75, 6.5, 3.65, 9.5, 1.25
Solution:
X log 𝑥
5.20 0.72
3.50 0.54
6.0 0.78
9.75 0.99
6.5 0.81
3.65 0.56
9.5 0.98
1.25 0.1
Σ logx = 5.48
∑ logx ∑ 5.48
G.M = 𝑎𝑛𝑡𝑖𝑙𝑜𝑔 𝑛
= antilog ( 8
) = antilog (0.69)
G.M = 4.84
Example 2: H/w
Solution
X log 𝑥
Page 20 of 51
http://www.wsk.host56.com
1 0
7 0.85
18 1.25
65 1.81
91 1.96
103 2.01
∑ log x = 7.88
∑ logx 7.88
G.M = 𝑎𝑛𝑡𝑖 ( ) = antilog ( ) = antilog (1.31)
𝑛 6
G.M = 20.57
∑ 𝑓 log 𝑥
G.M = antilog
∑𝑓
Example 1: H/W
F 1 2 4 5 3 1 0
Solution:
X F log x f log x
70 1 1.85 1.85
80 2 1.90 3.8
Page 21 of 51
http://www.wsk.host56.com
85 4 1.93 7.74
90 5 1.95 9.75
95 3 1.98 5.94
60 1 1.78 1.78
50 0 1.70 0
∑ f log x= 30.86
∑ 𝑓 log 𝑥 30.86
G.M = antilog
∑𝑓
𝐴𝑛𝑡𝑖𝑙𝑜𝑔( ) = Antilog (1.93)
16
G.M =84.87
Example 3:
0 - 10 8 5 0.70 5.6
10 - 20 12 15 1.18 14.16
20 - 30 20 25 1.40 28
30 - 40 4 35 1.54 6.16
Page 22 of 51
http://www.wsk.host56.com
40 - 50 5 45 1.65 8.25
50 - 60 3 55 1.74 5.22
∑f = 52 ∑ f log x = 67.39
∑ 𝑓 log 𝑥 67.39
G.M = antilog
∑𝑓
=Antilog ( ) = Antilog(1.30) = 19.76
52
G.M = 19.76
Given Data:
Marks 0-1 1-2 2-3 3-4 4-5 5-6 6-7 7-8
No of Students 10 40 20 10 10 20 40 10
Solution:
no of Students Class Marks
Marks (f) (x) log x f logx
-
0- 1 10 0.5 0.3010 -3.0103
1- 2 40 1.5 0.1761 7.0437
2- 3 20 2.5 0.3979 7.9588
3- 4 10 3.5 0.5441 5.4407
4- 5 10 4.5 0.6532 6.5321
5- 6 20 5.5 0.7404 14.8073
6- 7 40 6.5 0.8129 32.5165
7- 8 10 7.5 0.8751 8.7506
Page 23 of 51
http://www.wsk.host56.com
∑ 𝑓 log 𝑥 80.04
G.M = antilog
∑𝑓
= 𝑎𝑛𝑡𝑖𝑙𝑜𝑔( ) = 𝑎𝑛𝑡𝑖𝑙𝑜𝑔 (0.50)
160
G.M =3.16
Answer
&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&
Example 5:
Calculated geometric mean from group data. Take Example one From Lecture 5. H/W
Frequency 1 3 4 5 4 2 1
Answer:
Class Class
Interval Frequency marks logx F logx
40 - 49 1 44.5 1.6484 1.6484
50 - 59 3 54.5 1.7364 5.2092
60 - 69 4 64.5 1.8096 7.2382
70 - 79 5 74.5 1.8722 9.3608
80 - 89 4 84.5 1.9269 7.7074
90 - 99 2 94.5 1.9754 3.9509
100 - 109 1 104.5 2.0191 2.0191
Total 20 37.1340
Page 24 of 51
http://www.wsk.host56.com
∑ 𝑓 log 𝑥 37.1340
G.M = antilog
∑𝑓
antilog ( ) = antilog (1.86) = 72.44
20
Example: H/W
Ages 24 27 29 31 37 39 40 43 47 49 50
Frequency 1 1 2 4 7 6 3 3 1 1 1
Frequency
Ages (x) (f) logx f logx
24 1 1.3802 1.3802
27 1 1.4314 1.4314
29 2 1.4624 2.9248
31 4 1.4914 5.9654
37 7 1.5682 10.9774
39 6 1.5911 9.5464
40 3 1.6021 4.8062
43 3 1.6335 4.9004
49 1 1.6902 1.6902
49 1 1.6902 1.6902
50 1 1.6990 1.6990
∑ 30 47.0116
∑ 𝑓 log 𝑥
antilog( ∑𝑓
) = 𝑎𝑛𝑡𝑖𝑙𝑜𝑔 (47.01⁄30) = 36.90
Answer
Harmonic mean: Of a series of values is the reciprocal of A.M and the reciprocal of values of items.
Page 25 of 51
http://www.wsk.host56.com
𝑛
H.M = 1
∑
𝑥
Lecture 12
Find out the harmonic mean of the following discrete data.
H.M = 𝑛⁄ 1
∑
𝑥
X 1/x
1 1 H.M = 𝑛⁄ 1
∑
𝑥
0.5 2
Putting values in above
10 0.1
formula
45 0.02
H.M = 7⁄103.38 = 0.07
175 0.005
0.01 100
4 0.25
1
∑𝑥 = 103.38
&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&&
3.5 0.29
6.0 0.17
9.75 0.10
6.5 0.15
3.65 0.27
1.25 0.8
9.5 0.11
1
∑ 𝑥 = 2.09
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
Example 1:
F 1 6 12 16 20 9 5
Solution:
[ ] = Given
X F f/x
0.01 1 100
0.5 6 12
∑𝑓
1
4
12
16
12
4
H.M= 𝑓
10 20 2
∑𝑥
45 9 0.20
69
175 5 0.03 = 0.53
130
∑ 69 130.23
Page 27 of 51
http://www.wsk.host56.com
Example 2:
X 6 7 8 9 10 11 12
F 3 6 9 13 8 5 3
Solution:
X F f/x
6 3 0.50
7 6 0.86
8 9 1.13
9 13 1.44
10 8 0.80
11 5 0.45
12 3 0.25
∑ 47 5.43
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
Example 1:
0-10 8 5
10-20 12 15
20-30 20 25
30-40 4 35
Page 28 of 51
http://www.wsk.host56.com
∑𝑓
H.M= 𝑓
∑
𝑥
H/W Use the above formula
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
Relationship of average:
Relationship between A.M G.M and H.M:
If 2, 4, 3, 7
½ = 0.5
Σ 𝑥 2+4+3+7 16
A.M = 4
= 4 =4 ¼ = 0.25
𝑛
∑ logx 1/3 = 0.33
G.M = 𝑎𝑛𝑡𝑖 ( ) = antilog [log (2) + log (4) +log (3) + log (7)] /4
𝑛
1/7 = 0.14
G.M = antilog (2.22/4) = antilog (0.56) = 3.6
G.M = 3.6
H.M = 𝑛⁄ 1
∑
𝑥
Putting in formula
4/1.22 = 3.3
H.M = 3.3
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
Page 29 of 51
http://www.wsk.host56.com
A.M or G.M or H.M
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
If x1, x2 are the two values of a variant then their G.M is equal to the square root of their A.M and H.M
G = √𝐴𝑀 ∗ 𝐻𝑀
2 2
2
G = A*H => 𝐴=
G => 𝐻=
G
𝐻 𝐴
Q: Prove that AM > GM > HM for the following group data // Home Work
Given = [ ]
A.M = 180/10 = 18
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
Page 30 of 51
http://www.wsk.host56.com
Lecture 13
I skipped this Lecture there were only Examples in this Lecture.
Lecture 14
Averages of position
Median: Median is the middle item of a series when it is arranged in ascending or descending order of
magnitude. Median divides the series in two equal parts.
First arrange the series in ascending order and then apply the formula
𝑛+1
Median = size of 2
th item (for odd number)
Example 1:
2, 3, 5, 7, 10
Page 31 of 51
http://www.wsk.host56.com
𝑛+1 6
2
= size if 2 th item
Which is 5
Example 2:
2,3,5,7,10,13
N=6
(5+7)/2 = 6 Ans
Example 3:
In case of frequency distribution median is the size of (n+1)/2 the median is the value corresponding to
the cumulative frequency distribution in which (n+1)/2 or n/2 lies.
Exam 1:
The distribution given below gives the marks in CS test by a class of 45 Ist year Students.
Find Median:
X F Cumulative
frequency
0 2 2
1 2 4
Page 32 of 51
http://www.wsk.host56.com
2 4 8
3 7 15
4 12 27
5 6 33
6 4 37
7 3 40
8 2 42
9 2 44
10 1 45
(N+1)/2
(45+1)/2 = 23
23 Fall in 4
ℎ 𝑛+1
Median = L + 𝑓 ( 2
- c.f) When f (odd)
Lecture 15
Find Median
Solution:
Page 33 of 51
http://www.wsk.host56.com
F 3 6 9 8 5 3 4
X 3 7 9 10 12 11 5
X F c.f
3 3 3
5 4 7
7 6 13
8 10 23
9 9 32
11 3 35
12 5 40
∑ f = 40
Here n= 40 as it is even
𝑛
Formula for odd =2 => 40⁄2 = 20
Example 2:
No of std 10 40 20 10 10 20 40 10
Page 34 of 51
http://www.wsk.host56.com
ℎ 𝑛
Median = 𝐿+ ( − 𝑐)
𝑓 2
In case of a frequency distribution, the median is given by the formula
Where
L =lower class boundary of the median class (i.e. that class for which the cumulative frequency is just in
excess of n/2).
Note:
This formula is based on the assumption that the observations in each class are evenly distributed
between the two class limits.
Given: [ ]
0-1 10 10
1-2 40 50
2-3 20 70
3-4 10 80
4-5 10 90
5-6 20 110
6-7 40 150
7-8 10 160
Page 35 of 51
http://www.wsk.host56.com
𝑛 160
For even 2 = 2
= 80
80 lies in Marks(3-4)
ℎ 𝑛
Median = 𝐿+ ( − 𝑐)
𝑓 2
1 160
Median = 3+ ( − 70)
10 2
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
As discussed in the previous lecture median divides the data into two equal parts. Other very useful
divisions are also possible called Quartiles, Deciles and Percentiles respectively.
Calculation of Quantiles:
i) Quartiles: A quartile is the value of item divided into 4 equal parts as the median is the
value of middle item of a series of data. There are 2 quartiles Q1 is the first quartile and
Q3 is the third quartile. Q1 and Q3 are called lower and upper quartile respectively. The
second quartile Q2 is the Median. The quartiles are given by the formula
For odd
(𝑛+1)
Q1 = size of 4
th item
(𝑛+1)
Q2 = size of 2 th item
4
(𝑛+1)
Q3 = size of 3 4
th item
(𝑛+1)
Q4 = size of 4 4
th item
For even
Page 36 of 51
http://www.wsk.host56.com
(𝑛)
Q1 = size of 4
th item
(𝑛)
Q2 = size of 2 4
th item
(𝑛)
Q3 = size of 3 4
th item
(𝑛)
Q4 = size of 4 4
th item
Deciles: They divide the data into 10 equal parts d1, d2, d3 … d9
The 5th Decile D5 consider the median value
For odd
(𝑛+1)
Q1 = size of th item
10
(𝑛+1)
Q2 = size of 2 10
th item
(𝑛+1)
Q3 = size of 3 10
th item
(𝑛+1)
Q4 = size of 4 10
th item
For even
(𝑛)
Q1 = size of 10
th item
(𝑛)
Q2 = size of 2 th item
10
(𝑛)
Q3 = size of 3 10 th item
(𝑛)
Q4 = size of 4 10 th item
Percentiles: They divide the data into 100 equal parts. The First, Second, Third up to 99th percentiles are
denoted by P1, P2, and P3 …. P99
For odd
Page 37 of 51
http://www.wsk.host56.com
(𝑛+1)
Q1 = size of 100
th item
(𝑛+1)
Q2 = size of 2 100
th item
(𝑛+1)
Q3 = size of 3 100
th item
(𝑛+1)
Q4 = size of 4 th item
100
For even
(𝑛)
Q1 = size of th item
100
(𝑛)
Q2 = size of 2 th item
100
(𝑛)
Q3 = size of 3 100 th item
(𝑛)
Q4 = size of 4 th item
100
…
…
(𝑛)
Q99 = size of 99 100 th item
Lecture 16
Example 1:
Find the upper and lower Quartile from the following observations
20 25 15 25 30 25 35 30 35 45 40
Find Q1,Q3
Solution:
15 20 25 25 25 30 30 35 35 40 45
n=11
𝑛+1
Q1 = Size of (4
)th item
Page 38 of 51
http://www.wsk.host56.com
11+1
Q1 = Size of ( 4
)th item => Third Item
Which is 25
3(𝑛+1) 3(12)
Q3 = Size of ( )th item => )th => 9th item.
4 4
Which is 35
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
So
[ ] = Third Value;
[ ] = Fourth Value
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
15 20 25 25 25 30 30 35 35 40 // HOME WORK //
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Example 2: Find all quartiles, 3rd Deciles from the following data. The marks obtained by 7 Students
In Statistics
82 53 54 60 62 63 46 // Home Work
Lecture 17
Example 1
Q: Find the lower and upper quartile by using the following observations.
Page 39 of 51
http://www.wsk.host56.com
20 25 15 25 30 25 35 30 35 45 40
Solution:
15 20 25 25 25 30 30 35 35 40 45
Here n = 11
Q3 = 9th item. Which is 35
Q1 = 3rd item. Which is 25
Example 2:
Finding D3
𝑛+1
D3 = size of 3( 10 ) th item
Here n = 11
12
D3 = size of 3(10) th item
Finding D7
𝑛+1
D7 = size of 7( 10 ) th item
Here n = 11
12
D7 = size of 7(10) th item
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
Finding P75
𝑛+1
P75 = size of 75( 100 ) th item
Here n = 11
12
P75 = size of 75( ) th item
100
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
Finding P99
Page 41 of 51
http://www.wsk.host56.com
𝑛+1
P99 = size of 99( 100 ) th item
Here n = 11
12
P99 = size of 99(100) th item
//Confuse
45 + 0.88(45 - 45)
= 45
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
Example 3:
Find all Quartiles, and 3rd Decile from the following data
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
Find Q1, Q3, D5, P25 and P75 by using the following observation
X 15 20 25 30 35 40 45
F 1 2 4 7 9 10 11
Solution:
Given [ ]
x F c.f
15 1 1
Page 42 of 51
http://www.wsk.host56.com
20 2 3
25 4 7
30 7 14
35 9 23
40 10 33
45 11 44
Here n = ∑f = 44
𝑛
Q1 = size of 4 𝑡ℎ item
44
Q1 = = 11
4
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
Page 43 of 51
http://www.wsk.host56.com
Lecture 18
Page 44 of 51
http://www.wsk.host56.com
Calculate:
𝑛
Q1 = size of 4 th item
5
Q1 = 22.5 + 2 (2.5 - 2)
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
Page 45 of 51
http://www.wsk.host56.com
3𝑛
Q3 = size of 4
th item
ℎ 3𝑛
L+ ( − 𝑐. 𝑓)
𝑓 4
5
32.5 + (7.5 - 7)
2
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
5𝑛
D5 = size of 10 th item
@@@@@@@@@@@@@@@@@
25𝑛
P25 = size of 100 th item
ℎ 25𝑛
P25 = L+ ( - c.f)
𝑓 100
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
P75 =
75𝑛
Size of 100 th = 7.5 item
ℎ 75𝑛
P75 = L+ 𝑓 (100 - c.f)
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
Find Q1, Q3 D5, D7, p20 p70 p99 from the following group data
Page 46 of 51
http://www.wsk.host56.com
F 1 4 8 11 15 9 1
Example 3:
classes 2.5 – 2.9 3.0 3.4 3.5 3.9 4.0 4.4 4.5 – 4.9 5.0 5.4 5.5 5.9 6.0 – 6.4
F 2 7 17 25 20 12 9 8
Lecture 19
Mod: Mod is the most frequent value in a data set. In other words the maximum point of the frequency
distribution.
Example 1:
From the following sizes of shoes find out the model size. 11, 7, 9, 8, 6, 7, 8, 9, 10, 6, 8, 7, and 8
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
Page 47 of 51
http://www.wsk.host56.com
X 10 61 62 63 64 65 66 67 68 69 70
f 2 4 6 7 12 18 26 7 5 3 2
Mode = 66
Ans
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
𝒇 − 𝒇𝟎
Mode = L + 𝟐𝒇𝒎𝒎−𝒇 (𝒉)
𝟎 − 𝒇𝟏
Where
Given [ ]
13 – 17 1
18 – 22 1
23 -27 2
28 – 32 3 27.5 – 32.5
33 – 37 2
38 – 42 1
L =27.5
Page 48 of 51
http://www.wsk.host56.com
H=5
fm = 3
f0 = 2
f1 = 2
1
Mode = 27.5 + 2(5) => 27.5 + 2.5 => 30 Ans
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
Given: [ ]
9.3-9.7 2
9.8 – 10.2 5
10.3 – 10.7 12
11.3 – 11.7 10
𝒇 − 𝒇𝟎
Mode = L + 𝟐𝒇𝒎𝒎−𝒇 (𝒉)
𝟎 − 𝒇𝟏
L = 10.75
𝒇𝒎 = 17
𝒇𝟎 = 12
𝒇𝟏 = 10
Page 49 of 51
http://www.wsk.host56.com
𝟏𝟕− 𝟏𝟐
Mode = 10.75 + 𝟐(𝟏𝟕) −𝟏𝟐− 𝟏𝟎 (𝟎. 𝟓)
𝟓 𝟐.𝟓
Mode =𝟏𝟎. 𝟕𝟓 + 𝟑𝟒−𝟐𝟐 (𝟎. 𝟓) => 10.75 + 𝟏𝟐 => 10.75 + 0.23 => 10.98
Ans
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
Marks F
0-1 10
1-2 40
2-3 20
3-4 10
4-5 10
5-6 20
6-7 30
7-8 10
𝒇 − 𝒇𝟎
Mode = L + 𝟐𝒇𝒎𝒎−𝒇 (𝒉)
𝟎 − 𝒇𝟏
L=1
𝒇𝒎 = 40
𝒇𝟎 = 10
Page 50 of 51
http://www.wsk.host56.com
𝒇𝟏 = 20
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@
Or
Where
= Lower class boundary of the model class
= Frequency of the model class (maximum frequency)
= Frequency preceding the model class frequency
= Frequency following the model class frequency
= Class interval size of the model class
Page 51 of 51