Académique Documents
Professionnel Documents
Culture Documents
Biostatistics
Branch of basic sciences which
deals with the collection,
analysis and interpretation of
data in relation to vital events.
Research
Research is the systematic collection
analysis and interpretation of data to
answer a certain question or solve a
problem.
Vital Events
These are live births, deaths,
foetal deaths, marriages,
divorces, adoptions,
legitimations, recognitions,
annulments, and legal
separations.
Data
Consist of discrete observations
or attributes or events that carry
little meaning when considered
alone.
Data
Information
Variable
A variable is the characteristic of a
person, object or phenomenon that
can take on different values.
Variables can be of two types:
Numerical variables
Categorical variables
Variables
Numerical variable
If the values of the variables are expressed in
numbers they are called numerical variables e.g.
persons age, weight, or income.
Categorical variable
Values of these variables are expressed in categories
e.g. the colour ie red, green,blue or types of food
such as maize or wheat.
Types of variable
Variable
Categorical
(Qualitative)
Ordered
Unordered
categorical
categorical
e.g. Gender
Male, female
Blood groups
Ethnic groups
Food types
(Nominal)
Age group
Nutritional
Status
Social class
e.g. poor,
Good, excellent
(Ordinal)
Numerical
(Quantitative)
Continuous
Discrete
(1.2,1.35..)
(1,2,3)
Number of
people in
household
(whole Nos.)
Income
weight
height
distance
(in decimals)
Presentation of Data
(Tables Graphs Charts)
1.
Table
2.
Bar
3.
4.
5.
6.
7.
Simple
Frequency distribution table
Simple
Multiple (Clustered)
Component (Staked)
Histogram
Frequency polygon
Line diagram
Pie chart
Scattered diagram
Tables
General principles.
Tables should be simple, 2 or 3 tables are preferred
over a single large table
Table should be self explanatory
Male
Female
Total
1st Year
60
125
185
2nd Year
3rd Year
70
85
110
90
180
175
4th Year
121
122
241
Final Year
110
90
200
Total
446
537
981
40
35
30
32
27
25
20
20
PG Trainees
18
15
9
10
3
0
0
Surgery
Med
Paeds
Gyne
Eye
ENT
4
1 1
Derma
0
Research
Histogram
A histogram is a graph used only for
presenting frequency distribution of
quantitative data. There is no space
between the cell, and equal class
interval
Frequency Polygon
If it is desired to present more than one set
of data in term of a frequency distribution,
the data should be presented in the form of
a frequency polygon.
A frequency polygon is constructed from a
histogram, I.e. the midpoint of the class
interval are connected by a straight line.
18
16
14
12
FREQUENCY POLYGON
10
8
6
4
2
0
147.32-- 150.59-- 153.86-- 157.13-- 160.4-150.59 153.86 157.13
160.4 163.67
FREQUENCY
Chart-I
126-130
10%
121-125
10%
131-135
3%
90-95
7%
96-100
7%
101-105
13%
116-120
13%
111-115
13%
106-110
24%
Sex Ratio
Female versus male percentage is shown by the
graphical representation:
Sex Ratio
Males 58%
Females
42%
Surgery
Med
Paeds
Gyne
Eye
Derma
Res
Not Sure
1%
9%
3%
36%
7%
18%
22%
Scattered Diagram
A scattered diagram represent the relationship
between two numerical variables measured on
the same subject, one variable is measured
along each axis.
SPOT MAPS
Statistics
Descriptive statistics
Inferential statistics
(Population, Sample, Element.)
(N
X)
Samples
Probability Sample (Permit use of inferential statistics)
1.
2.
Convenience samples
Quota samples
Probability
The probability of an event is a quantitative
measure of the proportion of all possible,
equally likely outcomes that are favourable to
the event.
It is denoted by p
Probability is usually expressed by decimal fraction,
not as percentage, and must lie between 0 (zero
probability) and 1 (absolutely certainty)
Mean
Median
Mode
Mean
Sum of all values (x) divided by number
of observations. It is denoted by X
Easy to calculate
Gives an idea of central tendency
Influenced by extreme values
May not convey proper sense e.g. Mean
No. of children may turn out to be 5.77
Example
Following values
19 20 22 20 21 23
Total = 125
Mean = 125/6 = 20.8
If the values are
19 20 22 20 21 50
Mean = 152/6 = 25
Median
When the data is arranged in ascending or descending
order, the median is the value that divides the data into
two parts.
Advantages
It is not influenced by extremes value.
Disadvantages
Not a very precise measure.
Not amenable to further statistical evaluation.
Example
2 5 7 9 12 13 14
Median = 9
2 5 8 9 10 12
Median = (8+9)/2 = 8.5
Mode
The mode is the most frequently occurring
value among all the observations in a sample.
For example: (22 22 21 22 23)
Data may be :Amodal
Unimodal
Bimodal
Trimodal
Multimodal
Range
The range is the difference between
the highest and the lowest figure in a
given sample
Example
4 5 6 9 10 12
Range = 8
Standard Deviation
It is a measure, which describe how much
individual measures differ, on average, from
the mean
It is denoted by SD
It is the most important measure of
dispersion around the mean and forms the
basis of most statistical analysis.
Thanx