Académique Documents
Professionnel Documents
Culture Documents
When the researchers gather the needed data, the next task is to organize and present them
with the use of appropriate tables and graphs. This section explains how to organize and display
data in a presentable and understandable manner. The students will learn on how to prepare
frequency distribution tables for quantitative and qualitative data. Furthermore, they will know on
how to construct graphs such as: bar graph, pie graph, and line graph and frequency polygon.
This is also called categorical frequency distribution. This is used for data that can be place
in categories. In other words, the qualitative data or categorical data is the appropriate in this type of
tabular presentation.
Consider the following data give the results of a sample survey. The letters A, B, and C
represent three categories. Construct a frequency distribution table from the given data below;
A B A A C C A C C C
C B C B B C B B B C
B C C A C C C B C A
Solutions: the categories are the three letters A, B, and C, so the simple frequency distribution table
is
This is one technique used to facilitate the description of important features of the data.
Grouped frequency Distribution is utilized in presenting data sets by sorting them into classes and
showing the frequency of occurrences in each class.
Basic Terminology
Before constructing grouped frequency distribution table let us familiarized all the features
on it.
This refers to the grouping defined by lower class limit and upper class limit. Sometimes,
the class limits is not defined.
Class boundaries
These are used to separate the classes so that these are no gaps in the frequency distribution
table. In continuous like test scores, weights, heights, etc. is supposed to starts and ends with an
accurate class limits by at least 0.5.
This is the value of a class interval or classes. This is obtained by finding the average of the
lower class limit and upper class limit.
Class size or class width is the difference between the upper class limit/boundary and lower
class limit/boundary. The ratio of two values that is, the range divided by desired number of classes.
Class frequency
2 8 1 5 9 5 14 10 31 20
15 4 10 6 5 5 1 8 12 10
25 40 31 24 20 20 3 9 15 15
25 8 1 1 16 23 18 25 21 12
5. Determine the class boundaries. This will be obtained by adding 0.5 for each upper class limit and
subtract 0.5 for each lower class limit.
6. Tally the scores.
7. Determine the class midpoint for each class interval.
8. Construct cumulative frequency (less than).
Graphical Presentation
After the data have been collected, they can be consolidated and summarized to show the
following information:
The type of graph you choose depends on the type of variable you have measured. When the
variable of interest is qualitative, you can use either pie chart or bar graph. Pie Chart is the familiar
circular graph that shows how the measurements are distributed among the categories. While Bar
chart shows the same distribution of measurements in categories, with the height of the bar
measuring how often a particular category was observed. Here is an example of a pie chart below;
Example: In a survey concerning public education, 400 school administrators were asked to rate the
quality of education in the Philippines. Their responses are summarized in the Table 2. Construct a
pie chart and a bar chart for this set of data.
Rating Frequency
A 35
B 260
C 93
D 12
Total 400
Solutions: Assign one sector of a circle to each category. The angle of each sector should be
proportional to the proportional of measurements in that category. The equation to find the angle is
Thus the pie chart of the ratings of 400 educators is shown in figure 1
23.3%
65.0%
Bar chart usually plot frequency against the categories. A bar chart for the previous data is
shown in Figure 2.
Chart of frequency
250
200
frequency
150
100
50
0
A B C D
rating
Line Graphs
When a quantitative variable is recorded over time at equally intervals (such as daily,
weekly, monthly, quarterly, or yearly), the data set forms a time series. Time series data are most
effectively presented on a line chart with time as the horizontal axis. The idea is to try to discern a
pattern or trend that will likely continue into the future, and then to use that pattern to make accurate
predictions for the immediate future.
Example: In the year 2025, the oldest “baby boomers” will be 79 years old, and the oldest “Gen-
Xers” will be two years from Social Security eligibility. The United States Bureau of the Census
gives projections for the portion of the U.S. population that will be 85 and over in the coming years,
as shown below. Construct a line chart to illustrate the data.
20.0
85 and over (millions)
17.5
15.0
12.5
10.0
7.5
5.0
2010 2020 2030 2040 2050
Year
Histogram
Example: Consider the frequency distribution table for a distance of 40 People drove
Histogram of distance
12
10
8
Frequency
0
4 11 18 25 32 39
distance
Figure 3 Distance of People Drove to the Mall
Another simple way to display the distribution of a quantitative data set is the stem and leaf
plot. This plot presents a graphical display of the data using the actual numerical values of each data
point.
Example: The table below lists the prices (in dollars) of19 different brands of walking shoes.
Construct a stem and leaf plot to display the distribution of the data.
90 70 70 70 75 70
65 68 60 74 70 95
75 70 68 65 40 65
70
Solutions: To create the stem and leaf, you could divide each observation between the ones and the
tens place. The number to the left is the stem; the number to the right is the leaf.
4 0 1
5 0
6 580855 6
7 0005040500 10
8 0
9 05 2
Sometimes the available stem choices result in a plot that contains too few stems and a large
number of leaves within each stem. In this situation, you can stretch the stems by dividing each one
into several lines, depending on the leaf values assigned to them. Stems are usually divided into two
lines, with leaves 0 – 4 in the first line and leaves 5 – 9 in the second line.