Académique Documents
Professionnel Documents
Culture Documents
Classification of Data
The process of arranging data into homogenous group or classes according to some common characteristics
present in the data is called classification.
For Example: The process of sorting letters in a post office, the letters are classified according to the cities
and further arranged according to streets.
Bases of Classification:
There are four important bases of classification:
(1) Qualitative Base (2) Quantitative Base (3) Geographical Base (4) Chronological or Temporal Base
(1) Qualitative Base:
When the data are classified according to some quality or attributes such as sex, religion, literacy,
intelligence etc
(2) Quantitative Base:
When the data are classified by quantitative characteristics like heights, weights, ages, income etc
Construct a frequency distribution with suitable class interval size of marks obtained by
students of a class are given below:
23, 50, 38, 42, 63, 75, 12, 33, 26, 39, 35, 47, 43, 52, 56, 59, 64, 77, 15, 21, 51, 54, 72, 68, 36, 65,
52, 60, 27, 34, 47, 48, 55, 58, 59, 62, 51, 48, 50, 41, 57, 65, 54, 43, 56, 44, 30, 46, 67, 53
Solution:
Arrange the marks in ascending order as
12, 15, 21, 23, 26, 27, 30, 33, 34, 35, 36, 38, 39, 41, 42, 43, 43, 44, 46, 47, 47, 48, 48, 50, 50, 51,
51, 52, 52, 53, 54, 54, 55, 56, 56, 57, 58, 59, 59, 60, 62, 63, 64, 65, 65, 67, 68, 72, 75, 77
Minimum Value =
Maximum =
Range = Maximum Value Minimum Value =
=
Number of Classes =
=
=
=
Class Interval Size ( ) =
Marks
Class Limits
C.L
Tally
Marks
or
approximate
=
Number of
Students
or
Class
Boundary
C.B
Class
Marks
Note: For finding the class boundaries, we take half of the difference between lower class limit
of the 2nd class and upper class limit of the 1st class
. This value is
subtracted from lower class limit and added in upper class limit to get the required class
boundaries.
Class
Boundary
C.B
Tally
Marks
Frequency
Geometric Mean
It is another measure of central tendency based on mathematical footing like arithmetic mean.
Geometric mean can be defined in the following terms:
Geometric mean is the nth positive root of the product of n positive given values
Hence, geometric mean for a value
of
and given as under:
containing
values such as
is denoted by
Example:
Find the Geometric Mean of the following Data
and
are repeated
Solution:
We may write it as given below:
Here
,
,
Using the formula of geometric mean for grouped data, geometric mean in this case will become:
The method explained above for the calculation of geometric mean is useful when the numbers of
values in given data are small in number and the facility of electronic calculator is available. When a set of
data contains large number of values then we need an alternative way for computing geometric mean. The
modified or alternative way of computing geometric mean is given as under:
Example:
Find the Geometric Mean of the values 10, 5, 15, 8, 12
Total
Example:
Find the Geometric Mean for the following distribution of students marks:
Marks
No. of Students
Solution:
Marks
Total
No. of Students
Mid Points
Harmonic Mean
Harmonic mean is another measure of central tendency and also based on mathematic footing like
arithmetic mean and geometric mean. Like arithmetic mean and geometric mean, harmonic mean is also
useful for quantitative data. Harmonic mean is defined in following terms:
Harmonic mean is quotient of number of the given values and sum of the reciprocals of the given
values.
Harmonic mean in mathematical terms is defined as follows:
For Ungrouped Data
Example:
Calculate the harmonic mean of the numbers: 13.5, 14.5, 14.8, 15.2 and 16.1
Solution:
The harmonic mean is calculated as below:
Example:
Given the following frequency distribution of first year
students of a particular college. Calculate the Harmonic Mean.
Age (Years)
Number of Students
Total
Solution:
The given distribution belongs to a grouped data and the variable involved is ages of first year
students. While the number of students Represent frequencies.
Ages (Years)
Number of Students
Total
years.
Example:
Calculate the harmonic mean for the given below:
Marks
Solution:
The necessary calculations are given below:
Marks
Total
Example (4):
The following data shows distance covered by
Distance (Km)
Number of Persons
Calculate Arithmetic Mean by Step-Deviation Method; also explain why it is better than direct
method in this particular case.
Solution:
The given distribution belongs to a grouped data and the variable involved is ages of
distance covered. While the number of persons Represent frequencies.
Distance
Covered in (Km)
Number of
Persons
Mid Points
Total
and
Km
Explanation:
Here from the mid points ( ) it is very much clear that each mid point is multiple of
and there is also a gap of
from mid point to mid point i.e. class size or interval ( ). Keeping
in view this, we should prefer to take method of Step-Deviation instead of Direct Method.
Example (5):
The following frequency distribution showing the marks obtained by
students in
statistics at a certain college. Find the arithmetic mean using (1) Direct Method (2) Short-Cut
Method (3) Step-Deviation.
Marks
Frequency
Solution:
Direct
Method
Marks
Short-Cut
Method
Step-Deviation
Method
Total
or
Marks
Where
Marks
(3) Step-Deviation Method:
Where
Marks
Merits and Demerits of Arithmetic Mean
Merits:
It is rigidly defined.
Demerits:
The mean sometimes does not coincide with any of the abserved value.
Merits:
Demerits:
Demerits: