Académique Documents
Professionnel Documents
Culture Documents
Hypothesis Testing
2
Hypotheses
Null hypothesis:
Considered true by default
The boring stuf we want to reject
( e.g., Standard Model )
3
Hypotheses
Alternative hypothesis:
Complementary to null hypothesis
The special case we wish to see
( e.g., some new physics models )
4
Examples of Hypotheses
5
Frequently Used Hypotheses in HEP
Null hypothesis
“Background-only hypothesis”: Only Standard Model
contributes to the observation.
Alternative hypothesis
“Signal-plus-background hypothesis”: Additional
new physics processes contribute.
6
More about Hypotheses
Simple hypothesis:
The expected distribution of data can be entirely
determined. (No free parameters)
Composite hypothesis:
Based on an ensemble of simple hypotheses,
which may be related by a continuous parameter.
7
Test Statistic
Test statistic :
A variable for testing how well the
hypothesis agree with the observation.
It could be either a scalar function or a
vector function.
8
Test Statistic
9
Test Statistic
Example:
Let test statistic be the number of events.
10
Critical Region
11
Critical Region
One-tailed test
(examined later)
Defne as rejection
region, and as
acceptance region.
If lies in the rejection
region, then the null
hypothesis is rejected.
If lies in the acceptance
region, then we fail to reject the
null hypothesis.
12
Critical Region
13
One- and two- tailed test
A two-tailed test
14
Type I and Type II Errors
15
Type I and Type II Errors
16
Type I and Type II Errors
17
The Testing Process
18
The Testing Process
19
Choosing the Test Statistic
A good test statistic results in a clear separation of
the distribution of t for the diferent hypotheses.
A test statistic is called “sufcient” if there exists no
other test statistic that provides additional relevant
information on the hypothesis model.
An ideal test statistic obeys these and provide the
best power for a given size .
20
Choosing the Test Statistic
Example of test statistics: reconstructed mass,
transverse momentum, ( for goodness-of-ft
tests) , etc.
21
Choosing the Test Statistic
Example of test statistics: reconstructed mass,
transverse momentum, ( for goodness-of-ft
tests) , etc.
22
Choosing the Test Statistic
Neyman-Pearson lemma:
If all of the hypotheses are simple, that is, we could
entirely determine the model, the likelihood ratio
23
Choosing the Test Statistic
24
Choosing the Test Statistic
25
Choosing the Critical Region
A trade-of between type I and type II errors has
to be made.
In high-energy physics, is commonly taken to
be very small for a discovery.
26
Determining the Test Statistic Distributions
If there are many data samples, the distribution
will be like a Gaussian and is easy to determine.
If not, then the distribution is non-trivial.
Usually, numerical methods are used to
determine it.
27