Académique Documents
Professionnel Documents
Culture Documents
Techniques
Part I: Probability and statistics
hugo.ruiz@ub.edu
STATISTICS
2
Statistics
Two classes of problems:
Hypotheses
Simple hypothesis: determines uniquely
Composite hypothesis: the prediction depends on
certain parameters : ;
4
Test statistic
Test statistic: a function of the measured sample of
values of the random variables that will be used to
estimate the agreement with the hypotheses
If we perform measurements, we denote
T , ,
Example: LHCb
6
Cerenkov radiation
cos
http://skullsinthestars.com/2009/11/20/reversing-optical-shockwaves-using-metamaterials/
8
9
10
Taking decisions
Well decide that a particle is a kaon if critical
region (here: )
Significance level: prob. of rejecting if it is true
$
!" # -./
%&'(
11
Efficiencies, errors
The efficiencies for selecting kaons and pions are
%&'(
01 " 2 # 1 ,
*$
%&'(
04 " # )
*$
12
But
Let as assume that, for a
given particle, we obtain
2
13
Back to Bayes!
The problem is the same as in the disease example
Exercise: relate ! and ) with the rates of fake positives
64 2
61 2 + 64
14
Purity
If we select kaons based on a cut , what is the
fraction of true kaons we are getting?
number of kaons with
91
number of all particles with
91 %
J*$ 61 2 + 1 , 61 #
&'(
15
P-value
Let us forget about pions
17
Samples, statistic
Let us go back to composite hypotheses: the predicted
pdf depends on parameters , ,
That is, we have ;
19
Bias of an estimator
If we repeat our full experiment (with its
measurement values) many times, we will obtain
different values of the statistic W
These values will distribute differently according to
the true values of the parameters
Sampling distribution: the pdf of W, W,
Bias of an estimator: [ \ ] ^ , ], with
L W; J W W; #W
J J W , O ; O ; # #O
20
Comments on the bias
It depends on:
The sample size
The functional form of the estimator
The true pdf of , (), including the value of
An estimator is said to be unbiased if the bias is zero for
any
21
d
Not to be confused with the population mean a L[], for
which is an estimator
We have
O O O
1 1 1
L L c
c L[
] c a a
d
d
d
This is an unbiased and consistent estimator
22
Estimators for variance and covariance
Sample variance: g V O
d
, V
O*
It is an unbiased estimator for the population variance i V
The estimator j V O
d
, k V is also unbiased
O
For the covariance matrix and the correl. coefficients:
O
1
lYmn c(
, )(o
, op)
,1
d
Y
lmn O
d
, o
, op
qmn
gm gn
sd
, V td ot , op V V
O O
23
Variance of an estimator
V
l W L W V , L W
Remember: iu is a measure of the expected
dispersion of W about its mean in a large number of
similar experiments each with sample size
sometimes quoted as statistical uncertainty of ]^
Note: what we are saying in fact is
l W; WRUQP.wUx
L W V ; WRUQP.wUx
V
W W
, L ; RUQP.wUx
(see later example on exponential distribution)
24
Ex: variance of the sample mean
For example, for the case of the mean:
O O
1 1
l L V , L V
L c
c s , aV
d sd
O
1
V c L
s , aV
,sd
1 iV
V V , a V + a V + i V , aV
IMPORTANT!
25
26
Example: Poison distribution
V
What is l W L W V , L W in this case?
Confidence intervals
Consider , W, and (W|) its pdf
Let us define W (), WV () by:
^~
u $
1,)
" W #W " W #W
*$ ^
u 2
W For a given ]
Area )
^
Y ()
V
For M or V , the probability of having a value as
*
eccentric as Y RUQP.wUx is
V
V
30
Back to the Poisson example
Remember: we got 0 events
our measurement is ] ^}
Clearly we cannot set a
symmetric interval here
(Note: discrete: J )
u *u
We have 0 | | *u
!
If we want 0 to be excluded with a CL ):
| u~ 1 , ) ,ln (1 , ))
For a 95% CL: ln 0.05 3 ] at % C. L.
31
Interpretation
As we defined them, the confidence interval represents
values for the parameter for which the difference
between the parameter and the observed estimate is
not statistically significant at the (1 , ))% level
32
Back to Bayes again!!
Beware of the interpretation of confidence intervals
Assigning a probability to a given value of would
require more information that just :
g T
(): priori probability of
: priori probability of (takes into account the
distribution of probabilities of !)
All this is normally not available in Physics
33