Académique Documents
Professionnel Documents
Culture Documents
in Applied Mathematics
Mark E. Johnson
SportMetrika, Inc.
maejohns@yahoo.com
4 October 2005
Outline
1. Why a mathematician in sports?
2. Quantitative Decision Making
3. What is “statistics” ?
4. Related problems in sports statistics
5. Some other examples of mathematics in
industry
6. Getting here
7. Future directions
Outline
1. Why a mathematician in sports?
2. Quantitative Decision Making
3. What is “statistics” ?
4. Related problems in sports statistics
5. Some other examples of mathematics in
industry
6. Getting here
7. Future directions
A mathematician in sports?
• Sports is just behind the
times; businesses weren’t
always analytical either
• Off-season decision making
• Moneyball
– Finding market inefficiencies
– Convincing others of a new way
to look at things
From Michael Lewis’ Moneyball
"... OPS was the simple addition of on-base and slugging percentages.
Crude as it was, it was a much better indicator than any other
offensive statistic of the number of runs a team would score. Simply
adding the two statistics together, however, implied that they were
of equal importance. If the goal was to raise a team's OPS, an extra
percentage point of on-base was as good as an extra percentage
point of slugging.
Before his thought experiment Paul (DePodesta) had felt uneasy
with this crude assumption; now he saw that the assumption was
absurd. An extra point of on-base percentage was clearly more
valuable than an extra point of slugging percentage -- but by how
much? … In his model an extra point of on-base percentage was
worth three times an extra point of slugging percentage."
A little history of statistics and
baseball
• Bill James, the pioneer.
• SABR – Society for American Baseball Research
– http://www.sabr.org
• Some websites and information:
– Books
• Bill James Historical Abstracts
• “Baseball Hacks”, by Joe Adler
– Some sites
• http://www.baseballprospectus.com/
• http://www.Baseball-Reference.com
• http://www.Retrosheet.org
– Other resources
• http://www.sportmetrika.com/resources.php
Video
• [insert video here]
Outline
1. Why a mathematician in sports?
2. Quantitative Decision Making
3. What is “statistics” ?
4. Related problems in sports statistics
5. Some other examples of mathematics in
industry
6. Getting here
7. Future directions
Sports Decision Making
• League-level decisions
– Commissioner
• Team-level decisions
– General Manager
1100
http://www.baseball1.com 900
700
600
500
40 50 60 70 80 90 100 110 120
1800 1100
1700 1000
1600 900
1500 H 800 RA
1400 700
1300 600
1200 500
40 50 60 70 80 90 100 110 120 40 50 60 70 80 90 100 110 120
Common Statistics relationship
with each other
0.300 0.900
0.290
0.850
0.280
0.270
0.800
0.260
0.250 AVG 0.750 OPS
0.240
0.700
0.230
0.220 0.650
0.210
0.600
0.200 450 550 650 750 850 950 1050
450 550 650 750 850 950 1050
OPS2 OPS3
1.300 1.600
1.250 1.550
1.200 1.500
1.150 1.450
1.050 1.350
1.000 1.300
0.950 1.250
0.900 1.200
450 550 650 750 850 950 1050 450 550 650 750 850 950 1050
Return to Moneyball for a
moment…
• Why a 3? Why not a 2?
• Linear regression can be used to
determine a best fit of A, B, and C in the
linear model:
• What is A / B ?
It depends on how you try to
answer the question…
• This most likely is not the right
way to answer this question….
– Consider outliers due to “odd”
seasons, such as the
numbers that may be
generated because Barry
Bonds is on your team, or if
you play in Coors Field.
– Or, perhaps a least-squares
distant metric is not best
choice.
• Or, this may not be the best
question to ask ….
• Pending editorial review, see
my Hack in Joe Adler’s
“Baseball Hacks” for more on
this subject.
Defining States in baseball
• Baseball is discrete
• Events can be recorded, as can the state
of affairs before and after the event.
• For the time being, consider a state as
being the (number of outs, base-runners)
pair.
State transitions and Run
Expectancy
RE 99-02 0 outs 1 out 2 outs
• The number of runs
expected in the Empty .56 .30 .12