Tutorial Trygg Wold

Homepage of Chemometrics
© 1996-2002
http://www.acc.umu.se/~tnkjtg/Chemometrics/Editorial
OUTLINE
1.) Introduction Introduction to Statistical Experimental Design
2.) Stages in the DoE - What is it? Why and Where is it Useful?
process
3.) Analysing the resulting
experimental data Johan Trygg & Svante Wold
4.) Applications of DoE University of Queensland, Australia & Umeå University, Sweden
5.) References
6.) Appendix
7.) DoE Exercises
Definition: Statistical experimental design, a.k.a. design of experiments (DoE) is the

methodology of how to conduct and plan experiments in order to extract the maximum
amount of information in the fewest number of runs.
1. Introduction 1.1 The traditional way - the COST ap-
List of previous
…We have a large reservoir of engineers (and proach [COST: Change One Separate
HoC Editorials are
scientists) with a vast background of engineering
given on last page. factor at a Time]
know- how. They need to learn statistical methods
that can tap into the knowledge. Statistics used as a Most experimentation today is done by changing
catalyst to engineering creation will, I believe, always levels of one factor (variable) at a time in an
result in the fastest and most economical pro- unsystematic way in order to try and find the
gress… George Box, 1992 optimum conditions of a complex system. Is this
a good (efficient, rational, economic) strategy ?
No!!
How shall I find the optimum? This is a com-
As shown by Fisher around 1925, changing one
mon question everywhere in business. In re-
separate factor at a time (COST) does not give
search and development, often half of the re-
any information about the position of the opti-
sources are spent on solving optimization prob-
mum in the common case where there are in-
lems. With the rapidly rising costs of making
teractions between factors. Then the COST
experiments, it is essential that the optimization
approach gets stuck, usually far from the real
is done with as few experiments as possible.
optimum. However, the experimenter perceives
This is one important reason why statistical
that the optimum has been reached because
experimental design is needed.
changing one factor at a time does not lead to
any further improvement. The COST approach
DoE originated in the 1920's by a British scien-
is said to be pseudo-convergent.
tist, Sir R. A. Fisher, as a method to maximize
the knowledge gained from experimental data
What are the problems associated with
and it has evolved over the last 70 years. Most
the COST approach:
experimentation involves several factors and
are conducted in order to optimize processes 1. Does not lead to real optimum
and or investigate and understand the relation- 2. Inefficient, unnecessarily many runs
ships between the factors and the characteris- 3. Provides no information about what happens
tics of the process (reponses) of interest. when factors are varied simultaneously (ignores
interactions)
4. Provides less information about the variability
http://www.acc.umu.se/~tnkjtg/Chemometrics/Editorial Page 2
Typical examples when design of experi-

ments (DoE) is useful involve the devel-
opment of new products and processes,
e.g. optimizing the quality and perform-
ance of an existing product or optimizing
existing manufacturing processes of
chemicals, polymers, materials, drugs
and pharmaceuticals, foods and bever-
ages, cosmetics, paints and so on.
2. Stages in the DoE proc-

ess
2.1 Familiarization, fiddle around
Fiddling around a little. Problem formula-
tion is extremely important during the
whole process. Formulate question(s)
stating the objectives and goals of the
investigation. What do I want?
• What is the purpose?

• What are the objectives?
• Identify factors, factor ranges and
types of factors (quantitative or
qualitative)
• What is possible, experimentally,
financially, environmentally?
of the response of methods of statistical experimental If these objectives can not be put into
5. Isolated, unconnected experiments design. These methods have been fur- words, there is no reason to continue
6. Slow growth of knowledge, no map- ther refined by Yule, Box, Stu and Bill because it shows that the investigators
ping of experimental space. Hunter, Scheffe, Cox, Taguchi, and oth- don't know what to do.
ers, so that today they comprise a tool
Any measurement and experiment is box for virtually any optimization prob- 2.2 Screening (many factors)
influenced by noise. Under stable condi- lem. The basic idea is to devise a small Finding out a little about many factors.
tions, any process varies around its set of experiments, in which all pertinent Which factors are the dominating ones.
mean +/- 3 std dev. Two COST experi- factors are varied systematically. This set To assure that uncontrolled factors
ments may give two different results, but usually does not include more than ten (humidity, etc..) do not bias the results,
with no estimate (poor) of noise level. to twenty experiments. The subsequent perform the runs in random order
BUT by making a set of well planned analysis of the resulting experimental Screening experiments give information
experiments with correct analysis (DoE), data will identify the optimal conditions, about
we can separate "real effects" from noise the factors that most influence the re- • What are the important factors
and draw correct conclusions and act sults and those that do not, the presence • If we are in the correct region,
correctly. This means decreased variabil- of interactions and synergisms, and so
(ranges)
ity and quality improvement. So to inves- on. The most important aspect of statis-
tical experimental designs is that they • If there is curvature and if it
tigate systems involving several factors in
provide a strict mathematical framework masks the effects
presence of variability or noise one
for changing all pertinent factors simulta- • What to do next
needs a better strategy than that based
on changing one separate factor at a neously, and achieve this in a small num-
time and that strategy is given by experi- ber of experimental runs. Most of us can Pareto's principle states that 20 % of the
mental design (DoE). only grasp the effect of one factor at a data (factors) account for 80 % of the
time in our minds, and that leads to the information. Screening designs provide
1.2 What to do instead - Design inefficient COST approach. We need the simple models with information about
mathematics (and the computer) to keep dominating variables, and information
of Experiments (DoE)
track of the factors and their combina- about ranges. In addition, they provide
• Which factors have a real influ- tions. few experiments / factor which means
ence on the response? • All factors are varied together that relevant information is gained in
• What are the best settings of the over a set of experimental runs only a few experiments.
factors to achieve optimal condi- • Noise is decreased by means of Linear models and interaction models
tions for best performance on a are sufficient, since we are only inter-
averaging
system? ested in the effects. We merely ask, if a
• The functional space is efficiently
• What are the predicted values of factor does influence the response, not
mapped, interactions and syner- how.
the responses for given settings
gisms are seen
of the factors in a model?
In 1925 Fisher started the development
degree interaction effects. This loss of

information is the prize we need to pay
2.2.1 Screening designs: 5. Noise can be estimated from

-Replicated center points
Different types of screening designs
-Other replication for the reduction of the number of ex-
exist, which one to choose depends on
-Residuals (higher level interac- periments.
the problem. The most common one is
tions) • Fraction (k-p) selected by choice
the fractional factorials design.
-Previous knowledge of generator
Factorial designs form the basis for all • Degree of confounding of coeffi-
classical experimental designs, both cients can be derived from the
screening and RSM. For screening, we defining relation The shortest
will concentrate on two-level designs. word in the defining relation is
They are sufficient to estimate linear, the resolution of that fractional
and interaction models, and they require factorial design.
a very low number of experimental runs.
There are some clear obvious advan- 2.2.4 Placket-Burman
tages of using a factorial design. See http://www.itl.nist.gov/div898/
• Averages are more stable than handbook/pri/section3/pri335.htm for
single observations. more info.
• The more data one averages, the Figure. Example of a full factorial
more reliable is the result. design in three variables, 2.2.5 Special designs
23= 8 experiments In addition, special designs are needed in
The "equal opportunity" strategy constrained regions or mixture problems.
(screening) Mixture problems are common in the
1. The factors are selected that a chemical, food and beverage, cosmetics,
Notation
priori are believed to be the and drug industries. One reason is be-
The low level of a factor will be denoted cause the factors add up to 100 percent.
most influential on the response [-1] = [-]
2. A range is chosen for each factor This introduces a constraint on the de-
The high level of a factor will be denoted sign and must be handled with special
3. A (full factorial) or fractional [+1] =[+]
factorial design (plus 3 ctr. tools and models. D-optimal designs or
The center level of a factor will be de- other special designs are used when
points) is selected. This provide noted [ 0] = [0]
orthogonal, balanced estimates there are constraints put on the factors
of the effects (and possibly inter- for economical / synthetical /
2.2.3 Fractional factorial design environmental or other reasons. The
actions) with equal variance.
Investigating more than 5 factors with simpler factorial designs can not be used
4. The experimental runs are per-
the full factorial design becomes time in such cases.
formed in random order. This
consuming, 25=32, 26=64,27=128, etc. • D-optimal (and other "optimal"
results in a ranking of the factors
Instead, performing a fractional factorial
(and possibly interactions) with designs)
design reduces that number quickly
significance estimates. - See http://www.itl.nist.gov/
without the loss of too much informa-
Let us first describe the full factorial div898/handbook/pri/section5/
tion regarding the estimation of factors
design, and later explain the fractional pri521.htm for more info.
involved. Fractional factorial design takes
factorial design. • Mixture designs
advantage of the fact that 3-way and
higher interactions are seldom signifi- - See http://www.itl.nist.gov/
2.2.2 Full factorial design cant. The downside, of course, for not div898/handbook/pri/section5/
The full factorial design is a set of ex- performing all experiments is that con- pri54.htm for more info
perimental runs where every level of a founding patterns are present. In other
factor is investigated at both levels of all words, the estimated effects are not 2.3 Finding optimal region of op-
the other factors. It is a balanced "pure" but instead mixed with higher erability
(orthogonal) design. A balanced design Simplex designs (see http://
allows the estimation of a factor effect www.multisimplex.com/algorithm.htm
independently of all the other effects for more info) and steepest ascent
1. N=2k (+3 centerpoints) number approaches (see http://www.itl.nist.gov/
of runs for k factors div898/handbook/pri/section5/
2. Effects (main effects, interac- pri5311.htm for more info.) are used to
tions) are calculated using least achieve optimal conditions in problems
squares ( or Yates algorithm) where experiments can only be done
3. Estimate of noise is used for one at a time in a sequence or after the
confidence interval for the ef- screening stage where one is often inter-
fects, or hypothesis testing ested into moving the experimental
4. When no estimate of noise is region to its optimum.
available, normal probability plot Figure. Example of a fractional
of the effects is used. factorial design in three variables,
2.3 Response surface modeling
23-1= 4 experiments
and optimization (few factors) Usually a fractional factorial (see section tions to Latent Structures (PLS)
After screening, the goal of the investiga- 2.2.3 in this editorial) or Plackett- PLS is one of the most common meth-
tion is usually to create a valid map of Burman design is used. ods for analyzing multivariate data where
the experimental domain (local space) a quantitative relationship between a
given by the significant factors and their 3. Analysing the resulting descriptor matrix X and a response
ranges. This is done with a quadratic matrix Y is sought. The PLS model can
polynomial model. The higher order
experimental data be expressed by:
models has an increased complexity and After the planning stage, when the set of
therefore also requires more experi- experiments are laid out according to a Model of X: X = TPT+E
ments / factor than screening designs. statistical design, the planned experi- Model of Y: Y = TCt +F
Different types of RSM designs ments are made, either in parallel, or
• Three level factorial designs one after another. Each experiment gives PLS contains MLR as a special case, then
- See http://www.itl.nist.gov/ results, i.e. values of the response vari- the PLS regression coefficients and the
div898/handbook/pri/section3/ ables. Thereafter, these data are ana- MLR coefficients are identical.
pri339.htm for more info. lysed by means of multiple regression, or
• Central composite designs generalisations thereof such as the PLS 3.3 Orthogonal projections to
(CCD) and O-PLS methods (see earlier Editori- latent structures (O-PLS)
- See http://www.itl.nist.gov/ als 2002). This gives a model relating the The recent O-PLS methods [O-PLS and
div898/handbook/pri/section3/ factors to the results, showing which O2-PLS] (see Editorial from April 2002
pri3361.htm for more info. factors are important, and how they on O-PLS), are improved modifications
combine in influencing the results. The of the NIPALS PLS algorithm. The devel-
• Box Behnken designs
model is then used to make predictions, opment of O-PLS has, like the orthogo-
- See http://www.itl.nist.gov/ e.g. how to set the factors to achieve
div898/handbook/pri/section3/ nal signal correction (OSC) filters
desired (optimal) results.The fitted (March 2002 Editorial), been driven by
pri3362.htm for more info. model is reviewed by…
• D-optimal designs the large amount of non-correlated
• Examining the coefficients and variation present in the data sets today,
- See http://www.itl.nist.gov/ their 95% confidence interval, or especially in a multivariate calibration
div898/handbook/pri/section5/ normal probability plots of ef- situation. The interpretational ability of
pri521.htm for more info. fects and interactions. the other inverse regression models
• Examining the ANOVA table, (PLS, PCR, MLR) largely depends on the
2.5 Robustness testing [read more
check for curvature degree of systematic orthogonal varia-
in Ref 8]
• Plotting residuals, normal prob- tion in X with regards to Y.
In robustness testing of, for instance, an
ability plot of residuals, and run
analytical method, the aim is to explore The basic idea of O2-PLS is to divide the
order residuals
how sensitive the responses are to small systematic part in X and Y into two
changes in the factor settings. Ideally, a • Checking for the optimal trans-
formation of the response parts, one which is related to X and Y,
robustness test should show that the and one that is not. For each matrix, the
responses are not sensitive to small through the use of the Box Cox
plot. latter is computed in a way that makes it
fluctuations in the factors, that is, the orthogonal to the other matrix, i.e. com-
results are the same for all experiments. • Conclusions:
pletely independent. If X or Y contains
Robustness testing is usually applied as 1. Select dominating factors
strong but irrelevant variation, O2-PLS
the last test just before the release of a 2. Check and modify ranges
improves the interpretational ability of
product or a method. When performing 3. Look for curvature
the parameters in the model, e.g. score
a robustness test of a method, the ob- plots, loading plots compared to the
jective is 3.1 Multiple Linear Regression MLR and PLS methods (and other meth-
• to ascertain that the method is (MLR) ods with similar properties such as ridge
robust to small fluctuations in Traditionally, the most frequently used regression).
the factor levels, method for finding the regression coeffi- Thus the O2-PLS model can be written
• and, if non-robustness is de- cients b is the ordinary least squares as a factor analysis model, where some
tected… method where: factors (T) are common to both X and
• to understand how to alter the y=Xb+f Y;
bounds of the factors so that b=(XTX)-1XTy
robustness may still be claimed. This minimizes the residuals (fTf), which X model: X = TWT + TY-orthoPY-orthoT +E
is equivalent to maximizing the fit to y. Y model: Y = UCT + UX-orthoPX-orthoT + F
Robustness is achieved when the de- With MLR, the coefficients of the model Prediction of Y: Yhat = TCT
signer understands these potential are computed to minimize the sum of
sources of variation and takes steps to the squares of the residuals. In order to 3.4 ANOVA - Analysis of Vari-
desensitize the product to them. G. estimate b, MLR requires that the X- ance
Taguchi, a Japanese engineer, had a big variables must be linearly independent
ANOVA breaks up sums of squares in
effect on quality control and experimen- ((XTX) of full rank). It is important to
components and compares their size
tal design in the 1980s and 1990s. The also note that MLR fits one response at
with F-test.
Taguchi Methods (see http:// a time and hence assumes them to be
www.stat.rutgers.edu/~buyske/591/ independent.
SS_Y = SS_Regression + SS_Residual
lect10.pdf for more info.) is a well SS_Residual =
known strategy in robustness testing. 3.2 Partial Least Squares Projec-
SS_lack_of_fit + SS_pure_error
If center points exist, lack of fit indicates 5. References Further Reading

curvature. Lack of fit compares pure
error (from replicated experiments) 1. G.E.P Box, W.G. Hunter and J.S. • Introduction to Design of Ex-
with residual error (model error). Resid- Hunter "Statistics for experimenters", periments
ual plots indicate John Wiley and Sons, Inc,., New York -See http://www.umetrics.com/
(1978) pdfs/books/DOEBook.pdf
• Outliers
2. G.E.P Box, N.R. Draper "Empirical • Introduction to DoE
• Curvature model-building and Response surfaces", -See http://
• Need for transformations John Wiley and Sons, Inc,., New York kingkong.me.berkeley.edu/
(1987) html/pres_assets/pdfs/
4. Applications of DoE 3. C.K. Bayne and I.B. Rubin "Practical fracfact2.pdf
Experimental Designs and optimizations • DoE overview Engineering Statis-
Chemical synthesis methods for chemists", VCH Publishers,
1. Synthetic steps tics Handbook (NIST)
Inc., Deerfield Beach, Florida (1986)
2. Work up and separation -See http://www.itl.nist.gov/
4. Morgan E. - Chemometrics: Experi-
3. Reagents, solvents, catalysts div898/handbook/pri/pri_d.htm
mental Design. John Wiley & Sons, Inc.,
4. Structure ' reactivity and proper- New York • Optimization designs, NIST
ties. 5. Engineering Statistics Handbook http://www.itl.nist.gov/div898/
(NIST) [http://www.itl.nist.gov/div898/ education/dex/optdesgn/
Biotech industry handbook/index.htm] July 2002. optdesgn.pdf
1. Pharmaceutics, formulation for 6. Carlson R. - Design and Optimization • How to Select Design of Experi-
drug delivery in Organic Synthesis. Elsevier science ments Software -See http://
2. Media development & optimiza- publishers, Amsterdam www.qualitydigest.com/nov98/
tion 7. Slide notes from Svante Wold html/doe.html
3. Biochemistry, drug design (personal communication) • Robust Design and Taguchi
4. Analytical biochemistry, separa- 8. Design of Experiments: Principles and Method -See http://
tion (HPLC,…), assay develop- Applications, Umetrics Academy - L. www.stat.rutgers.edu/
ment and optimization Eriksson, E. Johansson, N. Kettaneh- ~buyske/591/lect10.pdf
5. Pharmacology Wold, C. Wikström, and S. Wold, ISBN
6. Process optimization and con- 91-973730-0-1
trol, fermentation, separation, 9. Response Surface Methodology: Proc-
purification ess and Product Optimization Using
Designed Experiments, 2nd Edition Ray-
Cosmetic industry mond H. Myers, Douglas C. Montgom-
1. Processes, production, separa- ery ISBN: 0-471-41255-
tion, cleaning,…
2. Formulations, shampoos, nail
polish, creams, perfumes, soaps,
powders, …
3. Molecular structure, high po-
tency, low toxicity, allergenicity
Drug industry
1. Pharmaceutics, formulation for
drug release, hardness of pills,…
2. Organic chemistry, synthesis,
drug design, …
3. Analytical chemistry, Separation
[HPLC, …], resolution, speed.
4. Pharmacology
5. Process optimization and con-
trol, synthesis, fermentation,
separations, …
Process industry
1. Process optimization and control
(yield, purity, through put time,
pollution, energy consumption)
2. Product quality and performance
(material strength, warp, color,
taste, odour)
3. Product stability versus process
variation
Appendix
Model Parameters
Goodness of fit statistics, information
about model adequacy
PRESS, Predicted Residual Sum of
Squares:Sum of squared differences
between predicted and observed y-
values (over all rounds)
Dealing with and quantifying

variability
Check them out at HoC’s website

http:/www.acc.umu.se/~tnkjtg/Chemometrics/Editorial
Stay alert for upcoming

editorials on HoC
No Editorial this month

July 2002
A statistician's view of the single-y PLS problem

June 2002 Dick Kleinknecht
Wavelets in Chemometrics
May 2002 - compression, denoising and feature extraction
Johan Trygg
How to create an OSC filter for PLS and end up with a new generic modelling
April 2002 method, O-PLS
Johan Trygg
Everything you need to know about Orthogonal Signal Correction (OSC)

filters
March 2002
- and how they can improve interpretation of your data
Johan Trygg
Have you ever wondered why PLS sometimes needs
Feb 2002 more than one component for a single-y vector?
Johan Trygg

Tutorial Trygg Wold

Transféré par

Informations du document

Titre original

Copyright

Formats disponibles

Partager ce document

Partager ou intégrer le document

Options de partage

Avez-vous trouvé ce document utile ?

Ce contenu est-il inapproprié ?

Droits d'auteur :

Formats disponibles

Tutorial Trygg Wold

Transféré par

Droits d'auteur :

Formats disponibles

Homepage of Chemometrics

Definition: Statistical experimental design, a.k.a. design of experiments (DoE) is the

Typical examples when design of experi-

2. Stages in the DoE proc-

• What is the purpose?

degree interaction effects. This loss of

2.2.1 Screening designs: 5. Noise can be estimated from

If center points exist, lack of fit indicates 5. References Further Reading

Dealing with and quantifying

Check them out at HoC’s website

Stay alert for upcoming

No Editorial this month

A statistician's view of the single-y PLS problem

Everything you need to know about Orthogonal Signal Correction (OSC)

Vous aimerez peut-être aussi