Académique Documents
Professionnel Documents
Culture Documents
Organization of Data
70 64 99 55 64 89 87 65
62 38 67 70 60 69 78 39
75 36 71 51 99 68 95 86
57 53 47 50 55 81 80 89
51 36 63 66 85 79 83 70
Isabela
N Vizcaya
424,580
82,895
(Eastern
Visayas)
Biliran
E Samar
58,135
202,680
Pages 123 -124
Quirino 59,555 Leyte 680,536
Region 4a Region 10
(CALABAR
ZON) Batangas 440,603 (Northern Bukidnon 449,647
Cavite 244,712 Mindanao) Camiguin 41,017
Region 4b Region 11
(Davao
(MIMAROPA) Marinduque 113,553 Region) Davao dN 637,298
Romblon 170,917
Example
Create an Excel file with 3 variables:
Provincial Code, Province and Magnitude
Steps:
1. Highlight cells containing data (including column
labels).
2. Click Data/Sort.
3. Fill up dialogue box. Identify sorting variable in
Sort by box. Click the desired arrangement
(ascending or descending). Click Header Row
since highlighted cells include column labels.
0 0 1 2 2 2 3 3 4 4
0 0 1 2 2 3 3 3 4 4
0 1 1 2 2 3 3 3 4 4
0 1 1 2 2 3 3 3 4 5
0 1 1 2 2 3 3 3 4 5
Single Value Grouping of Number of Children of Currently Married Women Using Any
Modern Method of Contraceptive:
1,202,500- 1,352,499 1
Total 82
Example
Exercise no. 4 (page 135)
n=30
Sturges’ formula: K = 1 + (3.322)(log(30))=5.9.
We consider using 6 classes.
Important Pointers:
1. The number of significant digits of LCL must be the same as the values in the
data.
2. The LCL of first class should be less than or equal to the minimum value of the
data set.
3. Often times, the LCL is selected so that we end up with numbers that are easy
to work with and read such as multiples of 5’s, 10’s and so on.
4. If the observations tend to be concentrated at specific values throughout the
range of data – for example, prices are often multiples of 50 or 100 or 1000 –
one may have to experiment on choosing the LCL so that the midpoint of the
class interval will be at these values.
5. Whenever appropriate, choose the LCL so that the groupings will be
meaningful. For example, when the values are grades then choose the LCL of
the first class so that the passing grade will be the LCL of one of the classes.
Example: We have selected C=5 and the LCL of the first class=35.00. The largest
observation is 67.29. Thus, the lower class limits are:
LCL
35.00
35.00 + 5 = 40.00 40.00
40.00 + 5 = 45.00 45.00
45.00 + 5 = 50.00 50.00
50.00 + 5 = 55.00 55.00
55.00 + 5 = 60.00 60.00
60.00 + 5 = 65.00 65.00
65.00 + 5 = 70.00 STOP
Example:
LCL UCL
35.00 39.99
40.00 44.99
45.00 49.99
50.00 54.99
55.00 59.99
60.00 64.99
65.00 69.99
Example:
LCL UCL f
35.00 39.99 7
40.00 44.99 6
45.00 49.99 5
50.00 54.99 1
55.00 59.99 3
60.00 64.99 6
65.00 69.99 2
30
35.00 - 39.99…………………. 7
40.00 - 44.99…………………. 6
45.00 - 49.99…………………. 5
50.00 - 54.99…………………. 1
55.00 - 59.99…………………..3
60.00 - 64.99…………………..6
65.00 - 69.99…………………..2
Total…………………30
Additional Pointers
Whenever possible, all classes should be of the same size. If the classes sizes
are not equal, it becomes difficult to tell whether the differences in class
frequencies result mainly from differences in the concentration of items or from
differences in the class sizes. However, there are certain instances when it is
not practical to use equal class sizes. One such case is when the distribution is
badly skewed.
Example: Suppose the salaries range from 20,000 to 1,000,000 but 95% of
observations are less than 80,000. If all class sizes are equal to 100,000 then
the fdt is almost useless. There will be around 10 classes but 95% belong in
the first class. No information would be provided about the distribution of
salaries of the 95% with salaries from 1 – 100,000.
For such distributions, unequal class intervals are generally used. For instance,
equal class sizes of say 10,000 might be used for the range wherein most of the
salaries fall, after which the size might increase to say 100,000. Another option
is to use an open class interval to account for the remainder of salaries that are
not included in the classification.
Example:
Frequency Distribution of Magnitude of Poor Population
of the Provinces in the Philippines
0 – 99,999 …………………………… 13
100,000 – 199,999...………………………… 18
200,000 – 299,999…………………………… 16
300,000 – 399,999…………………………… 10
400,000 – 499,999 ………………………….. 11
500,000 – 599,999 ………………………….. 5
600,000 – 699,999 ………………………….. 4
700,000 and above…………………………… 5
Note: For rounded figures, there are no gaps in the class boundaries. The
number of decimal place is one more than the number of decimal place of
the class limits.
LCL UCL LCB UCB
LCL UCL CM
LCB UCB f 7
34.995 39.995 7 6
No. of Children
39.995 44.995 6 5
44.995 49.995 5 4
49.995 54.995 1 3
54.995 59.995 3 2
59.995 64.995 6
1
64.995 69.995 2
0
134.995 239.995 3 44.9954 49.9955 54.9956 59.9957 64.9958 69.9959
Weight (in lbs)
10 51 - 60 6 1 6
10 61 - 70 102 1 102
10 71 - 80 134 1 134
20 81 - 100 293 2 146.5
50 101 - 150 364 5 72.8
50 151 - 200 101 5 20.2
Frequency Histogram of Annual Income of Taxpayers
160
140
Number of Taxpayers
120
100
80
60
40
20
0
45.5 65.5 85.5 105.5 125.5 145.5 165.5 185.5 205.5
ChapterAnnual
6. Organization
Income (in thousands) of Data
CM No. of children
Frequency Polygon of Weight of Children
32.495 0
8
37.495 7 7
42.495 6 6
No. of Children
5
47.495 5 4
52.495 1 3
57.495 3 2
1
62.495 6 0
72.495 0
We find the relative frequency for each class interval by dividing the class frequency of a
class interval to the number of observations. The sum of the relative frequency column is
one. On the other hand, we derive the relative frequency percentage from the relative
frequency. We simply multiply the relative frequency by 100% to get the relative frequency
percentage. The sum of the relative frequency percentage column is one hundred percent.
TABLE 7. Frequency Distribution with Relative Frequency
and Relative Frequency Percentage Columns
Relative
Class Limits Relative Frequency
LCL UCL f Frequency Percentage
2,500 - 192,499 30 0.366 36.6
Variations of FDT:
Cumulative Frequency Distribution
(page 130)
Example
Example
35
Class Boundaries <CFD
Less than Cumulative Frequency
34.995 - 39.995 7 30
39.995 - 44.995 13 25
44.995 - 49.995 18 20
49.995 - 54.995 19 15
54.995 - 59.995 22 10
59.995 - 64.995 28 5
64.995 - 69.995 30 0
39.995 44.995 49.995 54.995 59.995 64.995 69.995
35
Class Boundaries >CFD
44.995 - 49.995 17 20
49.995 - 54.995 12 15
54.995 - 59.995 11 10
59.995 - 64.995 8
5
64.995 - 69.995 2
0
34.995 39.995 44.995 49.995 54.995 59.995 64.995
Weight (in lbs)
Assignment
Use data in Exercise no. 5, page 135.
1. Construct a frequency distribution and compute
for the relative frequency percentages. Present
the frequency distribution and the relative
frequency percentage distribution in a formal
statistical table.
2. Present the frequency histogram of fdt in no. 1.
3. Present the frequency polygon of fdt in no. 1.
4. Construct the <CF and >CF.
5. Present the ogives of the cumulative frequencies
in no. 4.