Académique Documents
Professionnel Documents
Culture Documents
3 September, 2013
Rohit Patra
What is Statistics? Statistics is concerned with methods for collecting, summarizing and analyzing data for drawing conclusions based on the information contained in the data Applications: Economics: Data on unemployment, interest rates, stocks Clinical trials: Is drug A better than drug B? Weather data: Is global warming real? Agriculture: Does variety C have higher yields than variety D? In short, Statistics is the study of any phenomenon exhibiting uncertainty and variation.
Rohit Patra
Rohit Patra
Rohit Patra
Rohit Patra
Population: All objects/individuals of a particular type. Sample: A (representative) subset of the population Variable: Any characteristic of an unit/individual, e.g., height, weight, gender ... Two types of variables: Categorical (qualitative) and Quantitative Categorical: gender, eye color, blood type ... Quantitative: temperature, age, height, income ...
Rohit Patra
Easy to calculate; affected by outliers" = middle observation (if n is odd, then Sample median: X the (n + 1)/2-th obs.; if n is even, then average of n/2 and n/2 + 1 obs.)
Pn
i =1 Xi
Rohit Patra
Measures of variability
min1i n Xi
1 1(
Standard deviation: S =
S2
Pn
2 i =1 Xi
2) nX
2 = s2 , Dene Yi = Xi + b. Then sY X
Rohit Patra
Percentiles
The p-th percentile is the data value which has p% of the observations falling at or below it Median is the 50th percentile Q1 = 1st quartile = 25th percentile (median of the lower half of the ranked data) Q3 = 3rd quartile = 75th percentile (median of the upper half of the ranked data) Inter-quartile range: IQR = Q3 extreme obs.) Five-number summary! A Box plot is a visual representation of the ve number summary. Outliers: Bigger than Q3 + 1.5IQR and smaller than Q1 1.5IQR .
Rohit Patra Stat 1211 Fall 2013 9
Q1 (not affected by
Overview of course Descriptive statistics Graphical statistics Sample mean, median, variance, standard deviation
Rohit Patra
10
Probability Probability provides methods for quantifying the chances, or likelihood, associated with various outcomes. Example: Will it rain tomorrow? Will the Republicans win? Will I get 3 Heads (H) in 3 coin tosses? Experiment Experiment: Action or process that generates data Outcome Examples: (a) Flip a coin and record H or T; (b) Toss a coin three times; (c) roll a pair of dice; (d) ip coin until rst H.
Rohit Patra
11
Sample space S : set of all possible outcomes of experiment Examples: (a) S = {H , T }; (b) S = {HHH , HHT , HTH , HTT , THH , THT , TTH , THH }; (c) S = {(1, 1), (1, 2), . . . , (1, 6), . . . , (6, 6)}; (d) S = {H , TH , TTH , TTTH , . . .}.
(b) Let B be the event that we get exactly two tails; B = {HTT , THT , TTH } (d) Let D be the event that # coin ips needed is 2; D = {TH } (c) Let C be the event that the rst dice shows 1; C = {(1, 1), (1, 2), . . . , (1, 6)}
Rohit Patra
12
Set relations Ac : The complement of A is the event that A does not occur (Union) A [ B : Set of all outcomes in A, or in B , or in both (Intersection) A \ B : set of all outcomes that are in A and in B Null set: the event containing no outcomes, denoted by Mutually exclusive/disjoint events: A and B are disjoint if they have no outcomes in common, i.e., A \ B =
Rohit Patra
13
(2) P (S ) = 1 (3) If A1 , A2 , . . . are disjoint, then P (A1 [ A2 [ . . .) = P (A1 ) + P (A2 ) + . . . Properties P( ) = 0 P (A) + P (Ac ) = 1, for any event A P (A [ B ) = P (A) + P (B ) P (A \ B )
Rohit Patra
14