Académique Documents
Professionnel Documents
Culture Documents
STATISTICS?
X1 − X2
t =
(n1 − 1)s12+ (n2 − 1)s22 1
+
1
n1 + n2 − 2 n n
1 2
k Nj k Tj2
2
ssb = X ij −
j =1 i =1 j = 1 Nj
science which deals with methods
of collecting,
organizing,
presentation, analysis,
and interpretation
of data;
drawing conclusion about
what the data means.
It can be random or
non-random.
A study that includes every member if
the population is called a census.
Advantages:
▪ Increase confidence interval
▪ Enough respondents have a high degree
of statistical confidence in survey
results
▪ Everyone is invited
Opportunity for everybody to provide
feedback
▪ Higher degree of accuracy
▪ Suitable for heterogeneous units (with
difference)
A study that includes every member if
the population is called a census.
Disadvantages:
▪ Limits other possibility survey
opportunities
▪ A census isn’t worth having to put a
hold on other survey projects
▪ The more you survey your entire list,
the lower the response rate will be each
time (time and labor consuming)
▪ Inconvenient, it takes much time,
money, and set of industrial
organizations
A variable is a characteristic under
study that assumes different values
for different elements.
Ex:
Age, IQ scores, height,
weight, income
Qualitative Data refers to
descriptive attributes that cannot
be subjected to mathematical
operations.
Ex:
Gender, citizenship,
educational attainment,
religion
A data whose values are countable is
called discrete data.
Results from counting and it take certain
values (whole numbers)
Ex:
• Number of cell phones
users in a company
• The number of computers
in the laboratory
A data that can assume a numerical
value over a certain interval/s is called
continuous data.
Ex:
• Weight of students
• The temperature in a city
over a period of time
• Area of classrooms
Number of non-earning
members in a family.
Fractional Part of a
family’s income spent
for food.
a parameter is a value or measure
obtained from a population.
Ex:
In a given popularity survey of a
certain program, if 10% of it
are senior citizens with an
average of 65, then the average
age is statistic.
A study conducted at a Cruise Line College
revealed that the trainees who attended
class 95%-100% of the time usually
receive a 1.0 in the class.
Ex:
Percentile rank of
students in NSAT and
Top 10 students.
Interval
Specifies the precise
difference between or among
the values or ranks.
Ex:
Number of students
present and absent.
classify whether nominal,
ordinal, interval or ratio…
Color of a persons hair
= (x1+x2+x3+x4)2 = 100
= (1+2+3+4)2
= (10)2
x1 = 1 y1 = 4
X2 = 2 y2 = 3
X3 = 3 y3 = 2
X4 = 4 y4 = 1
= x1y1+x2y2+x3y3+x4y4
= (1)(4)+(2)(3)+(3)(2)+(4)(1)
= 4+ 6+ 6+ 4 = 20
1 2 3 4
= + + +
4 3 2 1
3 8 18 48 77
= + + + =
12 12 12 12 12
x1 = 1 y1 = 4
X2 = 2 y2 = 3
X3 = 3 y3 = 2
X4 = 4 y4 = 1
Express each in summation
notation, then find the sum.
Ex:
Age, IQ scores, height,
weight, income
Qualitative Data refers to descriptive
attributes that cannot be subjected to
mathematical operations.
Ex:
Gender, citizenship,
educational attainment,
religion
Population consist of all elements-
individuals items or object- whose
characteristics are being studied.
The technique of
collecting information
from a portion of the
population is called
sampling.
the portion of the population selected for
study is referred to as a sample.
20000
n=
1 + (20000)( 0.05) 2
20000
n=
1 + (20000)( 0.10) 2
Determine the sample size to be
considered for a research work that will
involve a population of 6000 students if
the desired margin of error is:
a. 10%
b. 5%
c. 1%
d. 3%
e. 7%
Interview:
If necessary, highlight
sources and footnotes.
12
10
6 Series1
0
31 – 37 38 – 44 45 – 51 52 – 58 59 – 65 66 – 72 73 – 79 80 – 86
10
6
Series1
0
31 – 37 38 – 44 45 – 51 52 – 58 59 – 65 66 – 72 73 – 79 80 – 86
66 – 72 , 6
45 – 51 , 9
59 – 65 , 8
52 – 58 , 11
66 – 72
12%
45 – 51
18%
59 – 65
16%
52 – 58
22%
X = 86
X = 34400
Median is .
Determine the median of the set
of measures:
12.6, 15.0, 19.8, 17.9,
11.7, 18.6, 14.1, 13.4
11.7,12.6,13.4,14.1,15.0,17.9,18.6,19.8
14.1 + 15.0
2
= 14.55
Median is .
Quantity with the
most number of
frequency.
Determine the mode for the set
of scores:
11, 15 ,13, 15, 14, 13, 15
15 appeared 3 times.
Unimodal.
Mode is .
Determine the mode for the set
of scores:
88, 89, 82, 82,
82, 89, 88, 89
Mode is and .
Determine the mode for the set
of scores:
21, 18, 16, 21, 18, 16
Mode is .
1. 12,31,29,6,10,8,13,27,24,16
2. 52,52,52,50,55,50,57,49,60,49,62
3. 35,42,36,40,49,33,39,37,44,28,43
4. 13.2,11.5,14.8,12.6,10.2,11.2,11.1,
10.1,10.4
5. 140,142,138,107,121,115,110,121,
130,126,128,130,121
1. Mean: 17.6
Median: 14.5
Mode: none
2. Mean: 53.45
Median: 52
Mode: 52
3. Mean: 38.73
Median: 39
Mode: none
4. Mean: 11.68
Median: 11.2
Mode: none
5. Mean: 125.31
Median: 126
Mode: 121
Mean – computational or
calculated average.
Median – rank or positional
average.
Mode – inspectional or commercial
average.
Mean – easily affected by an
increase o decrease in the number
of data.
Median – May or may not be
affected by extreme values.
Mode – May or may not be
affected by an introduction of
other data.
Mean – most widely used average
and subject to further math
computation.
Median – less widely used but can
be subjected to math computation.
Mode – rarely used and cannot be
mathematically manipulated.
Mean – measure for interval or
ratio scales such as scores/grades,
temperature, and population.
Median – measure for ordinal
scales such as test scores, salary.
Mode – measure for nominal scales
such as number of certain brand
of commodities.
a descriptive measures which
indicates the degree of
asymmetry of a distribution.
The test is
of average
difficulty
Frequency
Scores
10
6
Series1
0
31 – 37 38 – 44 45 – 51 52 – 58 59 – 65 66 – 72 73 – 79 80 – 86
10
6
Series1
0
31 – 37 38 – 44 45 – 51 52 – 58 59 – 65 66 – 72 73 – 79 80 – 86
“Tie”
For population data
=
2
N x −
2
( x )
2
2
N
For sample data
S = 2
n x −
2
( x ) 2
n( n − 1)
We will use n-1 instead of
n as divisor in computing the
variance of a sample data
although the variance is
defined as the average of
the squared deviations
about the mean.
The reason for this is to avoid
the likely existence of biases
that are normally associated
with the use of the variance
computed from different
random samples, especially
when the sample sizes n are
small.
The n different sizes
selected from the same
population generally yield
different values for the
variance. But, the average of
these values computed from
several samples of the
population tends to be closer
to the actual variance-the
population variance.
x x2 x x2 x x2
97 9409 94 8836 95 9025
92 8464 94 8836 94 8836
96 9216 92 8464 93 8649
95 9025 94 8836 96 9216
90 8100 96 9216 92 8464
470 44214 470 44188 470 44190
2 =
N x2 − ( x ) 2
N2
5(44214) − (470) 2 5(44188) − (470) 2 5(44190) − (470) 2
=
2
=
2 =
2
52 52 52
= 6.8 = 1.6 =2
Differentiates sets of scores
with equal averages.
“Nathaniel” 2 = 1.4
“Juan”
“Honesto”
“Nathaniel”
Graphically, a standard deviation
of 2.6 means most of the scores
are within 2.6 units from the
mean.
A standard deviation of 1.3
and 1.4 means most of the
scores are within 1.3 and 1.4
units, respectively, from the
mean.
Unlike the range, variance
and standard deviation are
affected by the value of
each observation.
If a constant c is added to
each observation, the
variance will not change.
CV is a measure of relative
variation expressed in percent (%).
It is used to compare the
variability of two or more set of
data especially when the
observations are expressed in
different units of measurement.
2.6
SD CVJuan = • 100 = 2.77%
CV = • 100 94
X
1.3
CVHonesto = • 100
94
= 1.38%
1.4
CVNathaniel = • 100
94
= 1.49%
x = 424 x 2 = 11576
X =
x
S2 =
n x2 − ( x) 2 SD
CV = • 100
n n(n − 1) X
424 16(11576) − (424) 4.76
2
= = = • 100
16 16(16 − 1) 26.5
= 26.5 = 22.67 = 17.97%
R = HS − LS S = 22.67
= 34 − 19 = 4.76
= 15