Académique Documents
Professionnel Documents
Culture Documents
Basic Analysis
Therealvoyageofdiscoveryconsistsnotinseekingnew
landscapes,butinhavingneweyes.
MarcelProust
JMP,ABusinessUnitofSAS
SASCampusDrive
Cary,NC27513
11.1
Thecorrectbibliographiccitationforthismanualisasfollows:SASInstituteInc.2014.
JMP11BasicAnalysis.Cary,NC:SASInstituteInc.
JMP11BasicAnalysis
Copyright2014,SASInstituteInc.,Cary,NC,USA
ISBN9781612906706
Allrightsreserved.ProducedintheUnitedStatesofAmerica.
For a hard-copy book:Nopartofthispublicationmaybereproduced,storedinaretrieval
system,ortransmitted,inanyformorbyanymeans,electronic,mechanical,photocopying,
orotherwise,withoutthepriorwrittenpermissionofthepublisher,SASInstituteInc.
For a Web download or e-book:Youruseofthispublicationshallbegovernedbytheterms
establishedbythevendoratthetimeyouacquirethispublication.
Thescanning,uploading,anddistributionofthisbookviatheInternetoranyothermeans
withoutthepermissionofthepublisherisillegalandpunishablebylaw.Pleasepurchase
onlyauthorizedelectroniceditionsanddonotparticipateinorencourageelectronicpiracy
ofcopyrightedmaterials.Yoursupportofothersrightsisappreciated.
U.S. Government Restricted Rights Notice:Use,duplication,ordisclosureofthissoftware
andrelateddocumentationbytheU.S.governmentissubjecttotheAgreementwithSAS
InstituteandtherestrictionssetforthinFAR52.22719,CommercialComputer
SoftwareRestrictedRights(June1987).
SASInstituteInc.,SASCampusDrive,Cary,NorthCarolina27513.
1stprinting,September2013
2ndprinting,February2014
SASPublishingprovidesacompleteselectionofbooksandelectronicproductstohelp
customersuseSASsoftwaretoitsfullestpotential.Formoreinformationaboutour
ebooks,elearningproducts,CDs,andhardcopybooks,visittheSASPublishingWebsite
atsupport.sas.com/publishingorcall18007273228.
SASandallotherSASInstituteInc.productorservicenamesareregisteredtrademarksor
trademarksofSASInstituteInc.intheUSAandothercountries.indicatesUSA
registration.
Otherbrandandproductnamesareregisteredtrademarksortrademarksoftheir
respectivecompanies.
ScintillaCopyright19982012byNeilHodgson<neilh@scintilla.org>.
AllRightsReserved.
Permissiontouse,copy,modify,anddistributethissoftwareanditsdocumentationfor
anypurposeandwithoutfeeisherebygranted,providedthattheabovecopyright
noticeappearinallcopiesandthatboththatcopyrightnoticeandthispermission
noticeappearinsupportingdocumentation.
NEILHODGSONDISCLAIMSALLWARRANTIESWITHREGARDTOTHISSOFTWARE,INCLUDING
ALLIMPLIEDWARRANTIESOFMERCHANTABILITYANDFITNESS,INNOEVENTSHALLNEIL
HODGSONBELIABLEFORANYSPECIAL,INDIRECTORCONSEQUENTIALDAMAGESORANY
DAMAGESWHATSOEVERRESULTINGFROMLOSSOFUSE,DATAORPROFITS,WHETHERINAN
ACTIONOFCONTRACT,NEGLIGENCEOROTHERTORTIOUSACTION,ARISINGOUTOFORIN
CONNECTIONWITHTHEUSEORPERFORMANCEOFTHISSOFTWARE.
TelerikRadControls:Copyright20022012,Telerik.UsageoftheincludedTelerik
RadControlsoutsideofJMPisnotpermitted.
ZLIBCompressionLibraryCopyright19952005,JeanLoupGaillyandMarkAdler.
MadewithNaturalEarth.Freevectorandrastermapdata@naturalearthdata.com.
PackagesCopyright20092010,StphaneSudre(s.sudre.free.fr).Allrightsreserved.
Redistributionanduseinsourceandbinaryforms,withorwithoutmodification,are
permittedprovidedthatthefollowingconditionsaremet:
Redistributionsofsourcecodemustretaintheabovecopyrightnotice,thislistof
conditionsandthefollowingdisclaimer.
Redistributionsinbinaryformmustreproducetheabovecopyrightnotice,thislistof
conditionsandthefollowingdisclaimerinthedocumentationand/orothermaterials
providedwiththedistribution.
NeitherthenameoftheWhiteBoxnorthenamesofitscontributorsmaybeusedto
endorseorpromoteproductsderivedfromthissoftwarewithoutspecificpriorwritten
permission.
THISSOFTWAREISPROVIDEDBYTHECOPYRIGHTHOLDERSANDCONTRIBUTORSASISAND
ANYEXPRESSORIMPLIEDWARRANTIES,INCLUDING,BUTNOTLIMITEDTO,THEIMPLIED
WARRANTIESOFMERCHANTABILITYANDFITNESSFORAPARTICULARPURPOSEARE
DISCLAIMED.INNOEVENTSHALLTHECOPYRIGHTOWNERORCONTRIBUTORSBELIABLEFOR
ANYDIRECT,INDIRECT,INCIDENTAL,SPECIAL,EXEMPLARY,ORCONSEQUENTIALDAMAGES
(INCLUDING,BUTNOTLIMITEDTO,PROCUREMENTOFSUBSTITUTEGOODSORSERVICES;LOSS
OFUSE,DATA,ORPROFITS;ORBUSINESSINTERRUPTION)HOWEVERCAUSEDANDONANY
THEORYOFLIABILITY,WHETHERINCONTRACT,STRICTLIABILITY,ORTORT(INCLUDING
NEGLIGENCEOROTHERWISE)ARISINGINANYWAYOUTOFTHEUSEOFTHISSOFTWARE,
EVENIFADVISEDOFTHEPOSSIBILITYOFSUCHDAMAGE.
iODBCsoftwareCopyright19952006,OpenLinkSoftwareIncandKeJin
(www.iodbc.org).Allrightsreserved.
Redistributionanduseinsourceandbinaryforms,withorwithoutmodification,are
permittedprovidedthatthefollowingconditionsaremet:
Redistributionsofsourcecodemustretaintheabovecopyrightnotice,thislistof
conditionsandthefollowingdisclaimer.
Redistributionsinbinaryformmustreproducetheabovecopyrightnotice,thislist
ofconditionsandthefollowingdisclaimerinthedocumentationand/orother
materialsprovidedwiththedistribution.
NeitherthenameofOpenLinkSoftwareInc.northenamesofitscontributorsmay
beusedtoendorseorpromoteproductsderivedfromthissoftwarewithoutspecific
priorwrittenpermission.
THISSOFTWAREISPROVIDEDBYTHECOPYRIGHTHOLDERSANDCONTRIBUTORSASISAND
ANYEXPRESSORIMPLIEDWARRANTIES,INCLUDING,BUTNOTLIMITEDTO,THEIMPLIED
WARRANTIESOFMERCHANTABILITYANDFITNESSFORAPARTICULARPURPOSEARE
DISCLAIMED.INNOEVENTSHALLOPENLINKORCONTRIBUTORSBELIABLEFORANYDIRECT,
INDIRECT,INCIDENTAL,SPECIAL,EXEMPLARY,ORCONSEQUENTIALDAMAGES(INCLUDING,
BUTNOTLIMITEDTO,PROCUREMENTOFSUBSTITUTEGOODSORSERVICES;LOSSOFUSE,
DATA,ORPROFITS;ORBUSINESSINTERRUPTION)HOWEVERCAUSEDANDONANYTHEORYOF
LIABILITY,WHETHERINCONTRACT,STRICTLIABILITY,ORTORT(INCLUDINGNEGLIGENCEOR
OTHERWISE)ARISINGINANYWAYOUTOFTHEUSEOFTHISSOFTWARE,EVENIFADVISEDOF
THEPOSSIBILITYOFSUCHDAMAGE.
bzip2,theassociatedlibrarylibbzip2,andalldocumentation,areCopyright
19962010,JulianRSeward.Allrightsreserved.
Redistributionanduseinsourceandbinaryforms,withorwithoutmodification,are
permittedprovidedthatthefollowingconditionsaremet:
Redistributionsofsourcecodemustretaintheabovecopyrightnotice,thislistof
conditionsandthefollowingdisclaimer.
Theoriginofthissoftwaremustnotbemisrepresented;youmustnotclaimthatyou
wrotetheoriginalsoftware.Ifyouusethissoftwareinaproduct,anacknowledgment
intheproductdocumentationwouldbeappreciatedbutisnotrequired.
Alteredsourceversionsmustbeplainlymarkedassuch,andmustnotbe
misrepresentedasbeingtheoriginalsoftware.
Thenameoftheauthormaynotbeusedtoendorseorpromoteproductsderivedfrom
thissoftwarewithoutspecificpriorwrittenpermission.
THISSOFTWAREISPROVIDEDBYTHEAUTHORASISANDANYEXPRESSORIMPLIED
WARRANTIES,INCLUDING,BUTNOTLIMITEDTO,THEIMPLIEDWARRANTIESOF
MERCHANTABILITYANDFITNESSFORAPARTICULARPURPOSEAREDISCLAIMED.INNOEVENT
SHALLTHEAUTHORBELIABLEFORANYDIRECT,INDIRECT,INCIDENTAL,SPECIAL,
EXEMPLARY,ORCONSEQUENTIALDAMAGES(INCLUDING,BUTNOTLIMITEDTO,
PROCUREMENTOFSUBSTITUTEGOODSORSERVICES;LOSSOFUSE,DATA,ORPROFITS;OR
BUSINESSINTERRUPTION)HOWEVERCAUSEDANDONANYTHEORYOFLIABILITY,WHETHER
INCONTRACT,STRICTLIABILITY,ORTORT(INCLUDINGNEGLIGENCEOROTHERWISE)ARISING
INANYWAYOUTOFTHEUSEOFTHISSOFTWARE,EVENIFADVISEDOFTHEPOSSIBILITYOF
SUCHDAMAGE.
RsoftwareisCopyright19992012,RFoundationforStatisticalComputing.
MATLABsoftwareisCopyright19842012,TheMathWorks,Inc.ProtectedbyU.S.
andinternationalpatents.Seewww.mathworks.com/patents.MATLABandSimulink
areregisteredtrademarksofTheMathWorks,Inc.Seewww.mathworks.com/
trademarksforalistofadditionaltrademarks.Otherproductorbrandnamesmaybe
trademarksorregisteredtrademarksoftheirrespectiveholders.
GettheMostfromJMP
Whetheryouareafirsttimeoralongtimeuser,thereisalwayssomethingtolearn
aboutJMP.
VisitJMP.comtofindthefollowing:
liveandrecordedwebcastsabouthowtogetstartedwithJMP
videodemosandwebcastsofnewfeaturesandadvancedtechniques
detailsonregisteringforJMPtraining
schedulesforseminarsbeingheldinyourarea
successstoriesshowinghowothersuseJMP
ablogwithtips,tricks,andstoriesfromJMPstaff
aforumtodiscussJMPwithotherusers
http://www.jmp.com/getstarted/
Contents
Basic Analysis
1
Distributions
Using the Distribution Platform ................................................... 31
OverviewoftheDistributionPlatform ............................................... 33
CategoricalVariables ............................................................ 33
ContinuousVariables ........................................................... 33
ExampleoftheDistributionPlatform ................................................ 33
LaunchtheDistributionPlatform ................................................... 35
TheDistributionReport ............................................................ 36
Histograms .................................................................... 38
Basic Analysis
TheFrequenciesReport ......................................................... 41
TheQuantilesReport ........................................................... 41
TheSummaryStatisticsReport .................................................. 41
DistributionPlatformOptions ...................................................... 44
OptionsforCategoricalVariables ................................................... 45
OptionsforContinuousVariables ................................................... 47
NormalQuantilePlot ........................................................... 50
OutlierBoxPlot ................................................................ 51
QuantileBoxPlot ............................................................... 52
StemandLeaf .................................................................. 53
CDFPlot ....................................................................... 53
TestMean ...................................................................... 55
TestStdDev ................................................................... 56
ConfidenceIntervalsforContinuousVariables .................................... 57
SaveCommandsforContinuousVariables ........................................ 57
PredictionIntervals ............................................................. 58
ToleranceIntervals .............................................................. 59
CapabilityAnalysis ............................................................. 59
FitDistributions ................................................................... 62
ContinuousFit ................................................................. 62
DiscreteFit ..................................................................... 63
FitDistributionOptions ......................................................... 64
AdditionalExamplesoftheDistributionPlatform .................................... 67
ExampleofSelectingDatainMultipleHistograms ................................ 67
ExamplesoftheTestProbabilitiesOption ......................................... 68
ExampleofPredictionIntervals .................................................. 70
ExampleofToleranceIntervals .................................................. 71
ExampleofCapabilityAnalysis .................................................. 73
StatisticalDetailsfortheDistributionPlatform ....................................... 74
StatisticalDetailsforStandardErrorBars ......................................... 74
StatisticalDetailsforQuantiles .................................................. 74
StatisticalDetailsforSummaryStatistics .......................................... 75
StatisticalDetailsfortheNormalQuantilePlot .................................... 76
StatisticalDetailsfortheWilcoxonSignedRankTest ............................... 76
Basic Analysis
StatisticalDetailsfortheStandardDeviationTest .................................. 76
StatisticalDetailsforNormalQuantiles ........................................... 77
StatisticalDetailsforSavingStandardizedData ................................... 77
StatisticalDetailsforPredictionIntervals ......................................... 77
StatisticalDetailsforToleranceIntervals .......................................... 78
StatisticalDetailsforCapabilityAnalysis ......................................... 79
StatisticalDetailsforContinuousFitDistributions ................................. 82
StatisticalDetailsforDiscreteFitDistributions .................................... 88
StatisticalDetailsforFittedQuantiles ............................................. 91
StatisticalDetailsforFitDistributionOptions ..................................... 91
Introduction to Fit Y by X
Examine Relationships Between Two Variables .................................... 95
OverviewoftheFitYbyXPlatform ................................................. 97
LaunchtheFitYbyXPlatform ...................................................... 97
LaunchSpecificAnalysesfromtheJMPStarterWindow ........................... 98
Bivariate Analysis
Examine Relationships between Two Continuous Variables ....................... 101
ExampleofBivariateAnalysis ...................................................... 103
LaunchtheBivariatePlatform ...................................................... 103
TheBivariatePlot ................................................................. 104
FittingCommandsandGeneralOptions ............................................ 105
FittingCommandCategories ................................................... 108
FittheSameCommandMultipleTimes .......................................... 108
FitMean ......................................................................... 108
FitMeanReport ............................................................... 109
FitLineandFitPolynomial ........................................................ 110
LinearFitandPolynomialFitReports ........................................... 110
FitSpecial ........................................................................ 117
FitSpecialReportsandMenus .................................................. 118
FitSpline ......................................................................... 119
SmoothingSplineFitReport .................................................... 119
KernelSmoother .................................................................. 120
LocalSmootherReport ......................................................... 120
10
Basic Analysis
Oneway Analysis
Examine Relationships between a Continuous Y and a Categorical X Variable .... 143
OverviewofOnewayAnalysis ..................................................... 145
Basic Analysis
11
12
Basic Analysis
Contingency Analysis
Examine Relationships between Two Categorical Variables ....................... 207
ExampleofContingencyAnalysis .................................................. 209
LaunchtheContingencyPlatform .................................................. 210
TheContingencyReport .......................................................... 210
ContingencyPlatformOptions ..................................................... 212
Basic Analysis
13
14
Basic Analysis
Logistic Analysis
Examine Relationships between a Categorical Y and a Continuous X Variable .... 243
OverviewofLogisticRegression ................................................... 245
NominalLogisticRegression ................................................... 245
OrdinalLogisticRegression .................................................... 245
ExampleofNominalLogisticRegression ........................................... 246
LaunchtheLogisticPlatform ...................................................... 248
TheLogisticReport ............................................................... 248
LogisticPlot ................................................................... 249
Iterations ..................................................................... 250
WholeModelTest ............................................................. 250
ParameterEstimates ........................................................... 252
LogisticPlatformOptions ......................................................... 253
ROCCurves .................................................................. 254
SaveProbabilityFormula ....................................................... 255
InversePrediction ............................................................. 256
AdditionalExamplesofLogisticRegression ......................................... 256
ExampleofOrdinalLogisticRegression ......................................... 256
AdditionalExampleofaLogisticPlot ........................................... 258
ExampleofROCCurves ....................................................... 260
ExampleofInversePredictionUsingtheCrosshairTool .......................... 261
ExampleofInversePredictionUsingtheInversePredictionOption ................ 262
StatisticalDetailsfortheLogisticPlatform .......................................... 264
StatisticalDetailsfortheWholeModelTestReport ............................... 264
Basic Analysis
15
10 Bootstrapping
Approximate the Distribution of a Statistic through Resampling ................... 277
ExampleofBootstrapping ......................................................... 279
PerformaBootstrapAnalysis ...................................................... 280
BootstrapWindowOptions ........................................................ 280
StackedResultsTable .............................................................. 281
UnstackedResultsTable ........................................................... 282
AnalysisofBootstrapResults ...................................................... 282
11 Tabulate
Create Summary Tables Interactively ............................................. 285
ExampleoftheTabulatePlatform .................................................. 287
LaunchtheTabulatePlatform ...................................................... 290
UsetheDialog ................................................................. 292
AddStatistics ................................................................. 294
TheTabulateOutput .............................................................. 296
AnalysisColumns ............................................................. 298
GroupingColumns ............................................................ 298
ColumnandRowTables ....................................................... 299
EditTables .................................................................... 300
TabulatePlatformOptions ......................................................... 300
ShowTestBuildPanel .......................................................... 301
RightClickMenuforColumns ................................................. 302
AdditionalExamplesoftheTabulatePlatform ....................................... 302
ExampleofCombiningColumnsintoaSingleTable .............................. 306
ExampleUsingaPageColumn ................................................. 308
16
Basic Analysis
References
Index
Basic Analysis ................................................................... 315
Chapter 1
Learn about JMP
Documentation and Additional Resources
Thischapterincludesthefollowinginformation:
bookconventions
JMPdocumentation
JMPHelp
additionalresources,suchasthefollowing:
otherJMPdocumentation
tutorials
indexes
Webresources
Figure 1.1TheJMPHelpHomeWindowonWindows
Contents
FormattingConventions .......................................................... 19
JMPDocumentation.............................................................. 19
JMPDocumentationLibrary .................................................... 20
JMPHelp .................................................................... 24
AdditionalResourcesforLearningJMP ............................................. 24
Tutorials ..................................................................... 25
SampleDataTables ............................................................ 25
LearnaboutStatisticalandJSLTerms............................................ 25
LearnJMPTipsandTricks...................................................... 26
Tooltips ...................................................................... 26
JMPUserCommunity ......................................................... 26
JMPerCable.................................................................. 26
JMPBooksbyUsers ........................................................... 27
TheJMPStarterWindow ....................................................... 27
Chapter 1
Basic Analysis
19
Formatting Conventions
Thefollowingconventionshelpyourelatewrittenmaterialtoinformationthatyouseeon
yourscreen.
Sampledatatablenames,columnnames,pathnames,filenames,fileextensions,and
foldersappearinHelveticafont.
Helvetica boldformattingindicatesitemsthatyouselecttocompleteatask:
buttons
checkboxes
commands
listnamesthatareselectable
menus
options
tabnames
textboxes
Thefollowingitemsappearinitalics:
wordsorphrasesthatareimportantorhavedefinitionsspecifictoJMP
booktitles
variables
FeaturesthatareforJMPProonlyarenotedwiththeJMPProicon
ofJMPProfeatures,visithttp://www.jmp.com/software/pro/.
.Foranoverview
Note: SpecialinformationandlimitationsappearwithinaNote.
Tip: HelpfulinformationappearswithinaTip.
JMP Documentation
JMPoffersdocumentationinvariousformats,fromprintbooksandPortableDocument
Format(PDF)toelectronicbooks(ebooks).
20
Chapter 1
Basic Analysis
AllbooksarealsocombinedintoonePDFfile,calledJMPDocumentationLibrary,for
convenientsearching.OpentheJMPDocumentationLibraryPDFfilefromtheHelp > Books
menu.
ebooksareavailableatAmazon,SafariBooksOnline,andintheAppleiBookstore.
YoucanalsopurchaseprinteddocumentationontheSASwebsite:
http://support.sas.com/documentation/onlinedoc/jmp/index.html
Document Purpose
Document Content
DiscoveringJMP
Ifyouarenotfamiliar
withJMP,starthere.
IntroducesyoutoJMPandgetsyou
startedcreatingandanalyzingdata.
UsingJMP
LearnaboutJMPdata
tablesandhowto
performbasic
operations.
CoversgeneralJMPconceptsand
featuresthatspanacrossallofJMP,
includingimportingdata,modifying
columnsproperties,sortingdata,and
connectingtoSAS.
BasicAnalysis
Performbasicanalysis
usingthisdocument.
DescribestheseAnalyzemenuplatforms:
Distribution
FitYbyX
MatchedPairs
Tabulate
Howtoapproximatesampling
distributionsusingbootstrappingisalso
included.
Chapter 1
Basic Analysis
Document Title
Document Purpose
Document Content
EssentialGraphing
Findtheidealgraph
foryourdata.
DescribestheseGraphmenuplatforms:
GraphBuilder
OverlayPlot
Scatterplot3D
ContourPlot
BubblePlot
ParallelPlot
CellPlot
Treemap
ScatterplotMatrix
TernaryPlot
Chart
21
Alsocovershowtocreatebackground
andcustommaps.
Profilers
Learnhowtouse
interactiveprofiling
tools,whichenableyou
toviewcrosssections
ofanyresponse
surface.
CoversallprofilerslistedintheGraph
menu.Analyzingnoisefactorsis
includedalongwithrunningsimulations
usingrandominputs.
Designof
ExperimentsGuide
Learnhowtodesign
experimentsand
determineappropriate
samplesizes.
CoversalltopicsintheDOEmenu.
22
Chapter 1
Basic Analysis
Document Title
Document Purpose
Document Content
FittingLinearModels
LearnaboutFitModel
platformandmanyof
itspersonalities.
Describesthesepersonalities,all
availablewithintheAnalyzemenuFit
Modelplatform:
SpecializedModels
Learnaboutadditional
modelingtechniques.
StandardLeastSquares
Stepwise
GeneralizedRegression
MixedModel
MANOVA
LoglinearVariance
NominalLogistic
OrdinalLogistic
GeneralizedLinearModel
DescribestheseAnalyze>Modeling
menuplatforms:
Partition
Neural
ModelComparison
Nonlinear
GaussianProcess
TimeSeries
ResponseScreening
TheScreeningplatformintheAnalyze>
ModelingmenuisdescribedinDesignof
ExperimentsGuide.
Multivariate
Methods
Readabouttechniques
foranalyzingseveral
variables
simultaneously.
DescribestheseAnalyze>Multivariate
Methodsmenuplatforms:
Multivariate
Cluster
PrincipalComponents
Discriminant
PartialLeastSquares
Chapter 1
Basic Analysis
Document Title
Document Purpose
Document Content
QualityandProcess
Methods
Readabouttoolsfor
evaluatingand
improvingprocesses.
DescribestheseAnalyze>Qualityand
Processmenuplatforms:
Reliabilityand
SurvivalMethods
ConsumerResearch
Learntoevaluateand
improvereliabilityina
productorsystemand
analyzesurvivaldata
forpeopleand
products.
Learnaboutmethods
forstudyingconsumer
preferencesandusing
thatinsighttocreate
betterproductsand
services.
23
ControlChartBuilderandindividual
controlcharts
MeasurementSystemsAnalysis
Variability/AttributeGaugeCharts
Capability
ParetoPlot
Diagram
DescribestheseAnalyze>Reliabilityand
Survivalmenuplatforms:
LifeDistribution
FitLifebyX
RecurrenceAnalysis
Degradation
ReliabilityForecast
ReliabilityGrowth
ReliabilityBlockDiagram
Survival
FitParametricSurvival
FitProportionalHazards
DescribestheseAnalyze>Consumer
Researchmenuplatforms:
Categorical
FactorAnalysis
Choice
Uplift
ItemAnalysis
24
Chapter 1
Basic Analysis
Document Title
Document Purpose
Document Content
ScriptingGuide
Learnabouttaking
advantageofthe
powerfulJMP
ScriptingLanguage
(JSL).
Coversavarietyoftopics,suchaswriting
anddebuggingscripts,manipulating
datatables,constructingdisplayboxes,
andcreatingJMPapplications.
JSLSyntaxReference
ReadaboutmanyJSL
functionsonfunctions
andtheirarguments,
andmessagesthatyou
sendtoobjectsand
displayboxes.
Includessyntax,examples,andnotesfor
JSLcommands.
Note: TheBooksmenualsocontainstworeferencecardsthatcanbeprinted:TheMenuCard
describesJMPmenus,andtheQuickReferencedescribesJMPkeyboardshortcuts.
JMP Help
JMPHelpisanabbreviatedversionofthedocumentationlibrarythatprovidestargeted
information.YoucanopenJMPHelpinseveralways:
OnWindows,presstheF1keytoopentheHelpsystemwindow.
Gethelponaspecificpartofadatatableorreportwindow.SelecttheHelptool from
theToolsmenuandthenclickanywhereinadatatableorreportwindowtoseetheHelp
forthatarea.
WithinaJMPwindow,clicktheHelpbutton.
SearchtheHelpathttp://jmp.com/support/help/(Englishonly).
Tutorials(seeTutorialsonpage 25)
Sampledata(seeSampleDataTablesonpage 25)
Indexes(seeLearnaboutStatisticalandJSLTermsonpage 25)
Chapter 1
Basic Analysis
TipoftheDay(seeLearnJMPTipsandTricksonpage 26)
Webresources(seeJMPUserCommunityonpage 26)
JMPerCabletechnicalpublication(seeJMPerCableonpage 26)
BooksaboutJMP(seeJMPBooksbyUsersonpage 27)
JMPStarter(seeTheJMPStarterWindowonpage 27)
25
Tutorials
YoucanaccessJMPtutorialsbyselectingHelp > Tutorials.ThefirstitemontheTutorialsmenu
isTutorials Directory.Thisopensanewwindowwithallthetutorialsgroupedbycategory.
IfyouarenotfamiliarwithJMP,thenstartwiththeBeginners Tutorial.Itstepsyouthroughthe
JMPinterfaceandexplainsthebasicsofusingJMP.
TherestofthetutorialshelpyouwithspecificaspectsofJMP,suchascreatingapiechart,
usingGraphBuilder,andsoon.
Openthesampledatadirectory.
Openanalphabetizedlistofallsampledatatables.
Findasampledatatablewithinacategory.
Sampledatatablesareinstalledinthefollowingdirectory:
OnWindows:C:\Program Files\SAS\JMP\<version_number>\Samples\Data
OnMacintosh:\Library\Application Support\JMP\<version_number>\Samples\Data
InJMPPro,sampledataisinstalledintheJMPPRO(ratherthanJMP)directory.
LetsyousearchforinformationaboutJSLfunctions,objects,anddisplay
boxes.YoucanalsoeditandrunsamplescriptsfromtheScriptingIndex.
Scripting Index
26
Chapter 1
Basic Analysis
Tooltips
JMPprovidesdescriptivetooltipswhenyouplaceyourcursoroveritems,suchasthe
following:
Menuortoolbaroptions
Labelsingraphs
Textresultsinthereportwindow(moveyourcursorinacircletoreveal)
FilesorwindowsintheHomeWindow
CodeintheScriptEditor
JMPer Cable
TheJMPerCableisayearlytechnicalpublicationtargetedtousersofJMP.TheJMPerCableis
availableontheJMPwebsite:
http://www.jmp.com/about/newsletters/jmpercable/
Chapter 1
Basic Analysis
27
TodisplaytheJMPStarterautomaticallywhenyouopenJMPonWindows,selectFile >
Preferences > General,andthenselectJMP Starter fromtheInitialJMPWindowlist.On
Macintosh,selectJMP > Preferences > Initial JMP Starter Window.
28
Chapter 1
Basic Analysis
Chapter 2
Introduction to Basic Analysis
Overview of Fundamental Analysis Methods
ThisbookdescribestheinitialtypesofanalysesthatyouoftenperforminJMP:
TheDistributionplatformillustratesthedistributionofasinglevariableusinghistograms,
additionalgraphs,andreports.Onceyouknowhowyourdataisdistributed,youcanplan
theappropriatetypeofanalysisgoingforward.SeeChapter3,Distributions.
TheFitYbyXplatformanalyzesthepairofXandYvariablesthatyouspecify,bycontext,
basedonmodelingtype.SeeChapter4,IntroductiontoFitYbyX.Thefourtypesof
analysesinclude:
TheBivariateplatform,whichanalyzestherelationshipbetweentwocontinuousX
variables.SeeChapter5,BivariateAnalysis.
TheOnewayplatform,whichanalyzeshowthedistributionofacontinuousYvariable
differsacrossgroupsdefinedbyacategoricalXvariable.SeeChapter6,Oneway
Analysis.
TheContingencyplatform,whichanalyzesthedistributionofacategoricalresponse
variableYasconditionedbythevaluesofacategoricalXfactor.SeeChapter7,
ContingencyAnalysis.
TheLogisticplatform,whichfitstheprobabilitiesforresponsecategories(Y)toa
continuousXpredictor.SeeChapter8,LogisticAnalysis.
TheMatchedPairsplatformcomparesthemeansbetweentwoormorecorrelated
variablesandassessesthedifferences.SeeChapter9,MatchedPairsAnalysis.
Bootstrapanalysisapproximatesthesamplingdistributionofastatistic.Thedatais
resampledwithreplacementandthestatisticiscomputed.Thisprocessisrepeatedto
produceadistributionofvaluesforthestatistic.SeeChapter10,Bootstrapping.
TheTabulateplatforminteractivelyconstructstablesofdescriptivestatistics.SeeChapter
11,Tabulate.
30
Chapter 2
Basic Analysis
Chapter 3
Distributions
Using the Distribution Platform
TheDistributionplatformillustratesthedistributionofasinglevariableusinghistograms,
additionalgraphs,andreports.Thewordunivariatesimplymeansinvolvingonevariable
insteadoftwo(bivariate)ormany(multivariate).However,youcanexaminethedistribution
ofseveralindividualvariableswithinareport.Thereportcontentforeachvariablechanges
dependingonwhetherthevariableiscategorical(nominalorordinal)orcontinuous.
Onceyouknowhowyourdataisdistributed,youcanplantheappropriatetypeofanalysis
goingforward.
TheDistributionreportwindowisinteractive.Clickingonahistogrambarhighlightsthe
correspondingdatainanyotherhistogramsandinthedatatable.SeeFigure 3.1.
Figure 3.1ExampleoftheDistributionPlatform
Contents
OverviewoftheDistributionPlatform .............................................. 33
ExampleoftheDistributionPlatform ............................................... 33
LaunchtheDistributionPlatform................................................... 35
TheDistributionReport ........................................................... 36
Histograms ................................................................... 38
TheFrequenciesReport........................................................ 41
TheQuantilesReport.......................................................... 41
TheSummaryStatisticsReport .................................................. 41
DistributionPlatformOptions ..................................................... 44
OptionsforCategoricalVariables ................................................... 45
OptionsforContinuousVariables .................................................. 47
NormalQuantilePlot .......................................................... 50
OutlierBoxPlot ............................................................... 51
QuantileBoxPlot ............................................................. 52
StemandLeaf ................................................................ 53
CDFPlot ..................................................................... 53
TestMean.................................................................... 55
TestStdDev .................................................................. 56
ConfidenceIntervalsforContinuousVariables.................................... 57
SaveCommandsforContinuousVariables........................................ 57
PredictionIntervals ............................................................ 58
ToleranceIntervals ............................................................ 59
CapabilityAnalysis............................................................ 59
FitDistributions.................................................................. 62
AdditionalExamplesoftheDistributionPlatform .................................... 67
StatisticalDetailsfortheDistributionPlatform....................................... 74
Chapter 3
Basic Analysis
Distributions
Overview of the Distribution Platform
33
Categorical Variables
Forcategoricalvariables,theinitialgraphthatappearsisahistogram.Thehistogramshowsa
barforeachleveloftheordinalornominalvariable.Youcanalsoaddadivided(mosaic)bar
chart.
Thereportsshowcountsandproportions.Youcanaddconfidenceintervalsandtestthe
probabilities.
Continuous Variables
Fornumericcontinuousvariables,theinitialgraphsshowahistogramandanoutlierboxplot.
Thehistogramshowsabarforgroupedvaluesofthecontinuousvariable.Thefollowing
optionsarealsoavailable:
quantileboxplot
normalquantileplot
stemandleafplot
CDFplot
Thereportsshowselectedquantilesandsummarystatistics.Reportoptionsareavailablefor
thefollowing:
savingranks,probabilityscores,normalquantilevalues,andsoon,asnewcolumnsinthe
datatable
testingthemeanandstandarddeviationofthecolumnagainstaconstantyouspecify
fittingvariousdistributionsandnonparametricsmoothingcurves
performingacapabilityanalysisforaqualitycontrolapplication
confidenceintervals,predictionintervals,andtoleranceintervals
34
Distributions
Example of the Distribution Platform
Chapter 3
Basic Analysis
Fromthehistograms,younoticethefollowing:
Theagesarenotuniformlydistributed.
Forheight,therearetwopointswithextremevalues(thatmightbeoutliers).
Clickonthebarfor50intheheighthistogramtotakeacloserlookatthepotentialoutliers.
Thecorrespondingagesarehighlightedintheagehistogram.Thepotentialoutliersare
age12.
Thecorrespondingrowsarehighlightedinthedatatable.Thenamesofthepotential
outliersareLillieandRobert.
Addlabelstothepotentialoutliersintheheighthistogram.
1. Selectbothoutliers.
Chapter 3
Basic Analysis
Distributions
Example of the Distribution Platform
35
2. RightclickononeoftheoutliersandselectRow Label.
Labeliconsareaddedtotheserowsinthedatatable.
3. (Optional)Resizetheboxplotwidertoseethefulllabels.
Figure 3.3PotentialOutliersLabeled
Table 3.1DescriptionoftheDistributionLaunchWindow
Y, Columns
Assignsthevariablesthatyouwanttoanalyze.Ahistogramand
associatedreportsappearforeachvariable.
Weight
Assignsavariabletogivetheobservationsdifferentweights.Any
momentthatisbasedontheSumWgtsisaffectedbyweights.
36
Distributions
The Distribution Report
Chapter 3
Basic Analysis
Assignsafrequencyvariabletothisrole.Thisisusefulifyouhave
summarizeddata.Inthisinstance,youhaveonecolumnfortheY
valuesandanothercolumnforthefrequencyofoccurrenceofthe
Yvalues.Thesumofthisvariableisincludedintheoverallcount
appearingintheSummaryStatisticsreport(representedbyN).All
othermomentstatistics(mean,standarddeviation,andsoon)are
alsoaffectedbytheFreqvariable.
By
ProducesaseparatereportforeachleveloftheByvariable.Ifmore
thanoneByvariableisassigned,aseparatereportisproducedfor
eachpossiblecombinationofthelevelsoftheByvariables.
Histograms Only
Removeseverythingexceptthehistogramsfromthereport
window.
Forgeneralinformationaboutlaunchwindows,seeUsingJMP.
Chapter 3
Basic Analysis
Distributions
The Distribution Report
37
Figure 3.5TheInitialDistributionReportWindow
Note: Anyrowsthatareexcludedinthedatatablearealsohiddeninthehistogram.
TheinitialDistributionreportcontainsahistogramandreportsforeachvariable.Notethe
following:
Toreplaceavariableinareport,fromtheColumnspaneloftheassociateddatatable,drag
anddropthevariableintotheaxisofthehistogram.
Toinsertanewvariableintoareport,creatinganewhistogram,draganddropthe
variableoutsideofanexistinghistogram.Thenewvariablecanbeplacedbefore,between,
oraftertheexistinghistograms.
Note: Toremoveavariable,selectRemovefromtheredtrianglemenu.
TheredtrianglemenunexttoDistributionscontainsoptionsthataffectallofthevariables.
SeeDistributionPlatformOptionsonpage 44.
38
Distributions
The Distribution Report
Chapter 3
Basic Analysis
Theredtrianglemenunexttoeachvariablecontainsoptionsthataffectonlythatvariable.
SeeOptionsforCategoricalVariablesonpage 45orOptionsforContinuousVariables
onpage 47.IfyouholddowntheCTRLkeyandselectavariableoption,theoptionapplies
toallofthevariablesthathavethesamemodelingtype.
Histogramsvisuallydisplayyourdata.SeeHistogramsonpage 38.
TheinitialreportforacategoricalvariablecontainsaFrequenciesreport.SeeThe
FrequenciesReportonpage 41.
TheinitialreportforacontinuousvariablecontainsaQuantilesandaSummaryStatistics
report.SeeTheQuantilesReportonpage 41andTheSummaryStatisticsReporton
page 41.
Histograms
Histogramsvisuallydisplayyourdata.Forcategorical(nominalorordinal)variables,the
histogramshowsabarforeachleveloftheordinalornominalvariable.Forcontinuous
variables,thehistogramshowsabarforgroupedvaluesofthecontinuousvariable.
Table 3.2HistogramActions
Highlightingdata
Clickonahistogrambaroranoutlyingpointinthegraph.
Thecorrespondingrowsarehighlightedinthedatatable,and
correspondingsectionsofotherhistogramsarealso
highlighted,ifapplicable.SeeHighlightBarsandSelect
Rowsonpage 40.
Creatingasubset
Doubleclickonahistogrambar,orrightclickonahistogram
barandselectSubset.Anewdatatableiscreatedthat
containsonlytheselecteddata.
Resizingtheentire
histogram
Hoveroverthehistogrambordersuntilyouseea
doublesidedarrow.Thenclickanddragtheborders.For
moredetails,seetheUsingJMPbook.
Rescalingtheaxis
(Continuousvariablesonly)Clickanddragonanaxisto
rescaleit.
Alternatively,hoverovertheaxisuntilyouseeahand.Then,
doubleclickontheaxisandsettheparametersintheAxis
Specificationwindow.
Resizinghistogrambars
(Continuousvariablesonly)Therearemultipleoptionsto
resizehistogrambars.SeeResizeHistogramBarsfor
ContinuousVariablesonpage 39.
Chapter 3
Basic Analysis
Distributions
The Distribution Report
39
Specifythedatathatyouselectinmultiplehistograms.See
SpecifyYourSelectioninMultipleHistogramsonpage 40.
Toseeadditionaloptionsforthehistogramortheassociateddatatable:
Rightclickonahistogram.SeetheUsingJMPbook.
Clickontheredtrianglenexttothevariable,andselectHistogram Options.Optionsare
slightlydifferentdependingonthevariablemodelingtype.SeeOptionsforCategorical
Variablesonpage 45orOptionsforContinuousVariablesonpage 47.
theGrabber(hand)tool
theIncrementoption
UsetheGrabberTool
TheGrabbertoolisaquickwaytoexploreyourdata.
1. SelectTools > Grabber.
Note: (Windowsonly)Toseethemenubar,youmightneedtohoveroverthebarbelowthe
windowtitle.YoucanalsochangethissettinginFile > Preferences > Windows Specific.
2. Placethegrabbertoolanywhereinthehistogram.
3. Clickanddragthehistogrambars.
Thinkofeachbarasabinthatholdsanumberofobservations:
Movingthehandtotheleftincreasesthebinwidthandcombinesintervals.Thenumberof
barsdecreasesasthebarsizeincreases.
Movingthehandtotherightdecreasesthebinwidth,producingmorebars.
Movingthehandupordownshiftsthebinlocationsontheaxis,whichchangesthe
contentsandsizeofeachbin.
UsetheSetBinWidthOption
TheSet Bin Widthoptionisamoreprecisewaytosetthewidthforallbarsinahistogram.To
usetheSetBinWidthoption,fromtheredtrianglemenuforthevariable,selectHistogram
Options > Set Bin Width.Changethebinwidthvalue.
40
Distributions
The Distribution Report
Chapter 3
Basic Analysis
UsetheIncrementOption
TheIncrementoptionisanotherprecisewaytosetthebarwidth.TousetheIncrementoption,
doubleclickontheaxis,andchangetheIncrementvalue.
Highlight Bars and Select Rows
Clickingonahistogrambarhighlightsthebarandselectsthecorrespondingrowsinthedata
table.Theappropriateportionsofallothergraphicaldisplaysalsohighlighttheselection.
Figure 3.6showstheresultsofhighlightingabarintheheighthistogram.Thecorresponding
rowsareselectedinthedatatable.
Tip: Todeselecthistogrambars,presstheCTRLkeyandclickonthehighlightedbars.
Figure 3.6HighlightingBarsandRows
Toextendyourselection,holddowntheSHIFTkeyandselectanotherbar.Thisisthe
equivalentofusinganoroperator.
Tonarrowyourselection,holddowntheALTkeyandselectanotherbar.Thisisthe
equivalentofusinganandoperator.
RelatedInformation
ExampleofSelectingDatainMultipleHistogramsonpage 67
Chapter 3
Basic Analysis
Distributions
The Distribution Report
41
Listseachvaluefoundforaresponsevariable.
Count
Liststhenumberofrowsfoundforeachlevelofaresponse
variable.IfyouuseaFreqvariable,theCountisthesumofthe
Freqvariablesforeachleveloftheresponsevariable.
Prob
Liststheprobability(orproportion)ofoccurrenceforeachlevelof
aresponsevariable.Theprobabilityiscomputedasthecount
dividedbythetotalfrequencyofthevariable,shownatthebottom
ofthetable.
StdErrProb
Liststhestandarderroroftheprobabilities.Thiscolumnmightbe
hidden.Toshowthecolumn,rightclickinthetableandselect
Columns > StdErr Prob.
CumProb
Containsthecumulativesumofthecolumnofprobabilities.This
columnmightbehidden.Toshowthecolumn,rightclickinthe
tableandselectColumns > Cum Prob.
StatisticalDetailsforQuantilesonpage 74
42
Distributions
The Distribution Report
Chapter 3
Basic Analysis
Table 3.4describesthestatisticsthatappearbydefault.
Table 3.5describesadditionalstatisticsthatyoucanaddtothereportusingtheCustomize
Summary Statisticswindow.
Table 3.4DescriptionoftheSummaryStatisticsReport
Mean
Estimatestheexpectedvalueoftheunderlyingdistributionforthe
responsevariable,whichisthearithmeticaverageofthecolumns
values.Itisthesumofthenonmissingvaluesdividedbythe
numberofnonmissingvalues.
StdDev
Thenormaldistributionismainlydefinedbythemeanand
standarddeviation.Theseparametersprovideaneasywayto
summarizedataasthesamplebecomeslarge:
68%ofthevaluesarewithinonestandarddeviationofthe
mean
95%ofthevaluesarewithintwostandarddeviationsofthe
mean
99.7%ofthevaluesarewithinthreestandarddeviationsofthe
mean
StdErrMean
Thestandarderrorofthemean,whichestimatesthestandard
deviationofthedistributionofthemean.
Upper95%Meanand
Lower95%Mean
Are95%confidencelimitsaboutthemean.Theydefineaninterval
thatisverylikelytocontainthetruepopulationmean.
Isthetotalnumberofnonmissingvalues.
Table 3.5AdditionalSummaryStatistics
SumWeight
ThesumofacolumnassignedtotheroleofWeight(in
thelaunchwindow).SumWgtisusedinthe
denominatorforcomputationsofthemeaninsteadof
N.
Sum
Thesumoftheresponsevalues.
Variance
Thesamplevariance,andthesquareofthesample
standarddeviation.
Skewness
Measuressidednessorsymmetry.
Kurtosis
Measurespeakednessorheavinessoftails.
Chapter 3
Basic Analysis
Distributions
The Distribution Report
43
Thepercentcoefficientofvariation.Itiscomputedas
thestandarddeviationdividedbythemeanand
multipliedby100.Thecoefficientofvariationcanbe
usedtoassessrelativevariation,forexamplewhen
comparingthevariationindatameasuredindifferent
unitsorwithdifferentmagnitudes.
NMissing
Thenumberofmissingobservations.
NZero
Thenumberofzerovalues.
NUnique
Thenumberofuniquevalues.
UncorrectedSS
Theuncorrectedsumofsquaresorsumofvalues
squared.
CorrectedSS
Thecorrectedsumofsquaresorsumofsquaresof
deviationsfromthemean.
Autocorrelation
(AppearsonlyifyouhavenotspecifiedaFrequency
variable.)Firstautocorrelationthattestsiftheresiduals
arecorrelatedacrosstherows.Thistesthelpsdetect
nonrandomnessinthedata.
Minimum
Representsthe0percentileofthedata.
Maximum
Representsthe100percentileofthedata.
Median
Representsthe50thpercentileofthedata.
Mode
Thevaluethatoccursmostofteninthedata.Ifthereare
multiplemodes,thesmallestmodeappears.
TrimmedMean
Themeancalculatedafterremovingthesmallestp%
andthelargestp%ofthedata.Thevalueofpisentered
intheEnter trimmed mean percenttextboxatthe
bottomofthewindow.TheTrimmedMeanoptionis
notavailableifyouhavespecifiedaWeightvariable.
GeometricMean
Thenthrootoftheproductofthedata.
Range
Thedifferencebetweenthemaximumandminimumof
thedata.
InterquartileRange
Thedifferencebetweenthe3rdand1stquartiles.
44
Distributions
Distribution Platform Options
Chapter 3
Basic Analysis
(DoesnotappearifyouhavespecifiedaWeight
variable.)Themedianoftheabsolutedeviationsfrom
themedian.
RobustMean
Therobustmean,calculatedinawaythatisresistantto
outliers,usingHubersMestimation.SeeHuberand
Ronchetti,2009.
RobustStdDev
Therobuststandarddeviation,calculatedinawaythat
isresistanttooutliers,usingHubersMestimation.See
HuberandRonchetti,2009.
Enter(1alpha)formean
confidenceinterval
Specifythealphalevelforthemeanconfidenceinterval.
Entertrimmedmeanpercent
Specifythetrimmedmeanpercentage.Thepercentage
istrimmedoffeachsideofthedata.
Showsallofthemodesiftherearemultiplemodes.
RelatedInformation
StatisticalDetailsforSummaryStatisticsonpage 75
Scalesallaxeswiththesameminimum,maximum,andintervals
sothatthedistributionscanbeeasilycompared.
Chapter 3
Basic Analysis
Distributions
Options for Categorical Variables
45
Changestheorientationofthehistogramandthereportsto
horizontalandstackstheindividualdistributionreportsvertically.
Deselectthisoptiontoreturnthereportwindowtoitsoriginal
layout.
Arrange in Rows
Enterthenumberofplotsthatappearinarow.Thisoptionhelps
youviewplotsverticallyratherthaninonewiderow.
Savesthehistogramsas.swffilesthatareAdobeFlashplayer
compatible.UsethesefilesinpresentationsandinWebpages.An
HTMLpageisalsosavedthatshowsyouthecorrectcodeforusing
theresulting.swffile.
Formoreinformationaboutthisoption,gotohttp://
www.jmp.com/support/swfhelp/en.
Script
Thismenucontainsoptionsthatareavailabletoallplatforms.
TheyenableyoutoredotheanalysisorsavetheJSLcommandsfor
theanalysistoawindoworafile.Formoreinformation,seeUsing
JMP.
ShowsorhidestheFrequenciesreport.SeeTheFrequencies
Reportonpage 41.
Horizontal Layout
Changestheorientationofthehistogramandthereportsto
verticalorhorizontal.
Axes on Left
MovestheCount,Prob,andDensityaxestotheleftinsteadofthe
right.
ThisoptionisapplicableonlyifHorizontal Layoutisselected.
46
Distributions
Options for Categorical Variables
Chapter 3
Basic Analysis
Showsorhidesthehistogram.SeeHistogramsonpage 38.
Vertical
Changestheorientationofthehistogramfromaverticaltoa
horizontalorientation.
Drawsthestandarderrorbaroneachlevelofthehistogram.
Separate Bars
Separatesthehistogrambars.
Histogram Color
Changesthecolorofthehistogrambars.
Count Axis
Addsanaxisthatshowsthefrequencyofcolumnvalues
representedbythehistogrambars.
Prob Axis
Addsanaxisthatshowstheproportionofcolumnvalues
representedbyhistogrambars.
Density Axis
Addsanaxisthatshowsthelengthofthebarsinthehistogram.
Thecountandprobabilityaxesarebasedonthefollowing
calculations:
prob=(barwidth)*density
count=(barwidth)*density*(totalcount)
Show Percents
Labelsthepercentofcolumnvaluesrepresentedbyeach
histogrambar.
Show Counts
Labelsthefrequencyofcolumnvaluesrepresentedbyeach
histogrambar.
Mosaic Plot
Displaysamosaicbarchartforeachnominalorordinalresponse
variable.Amosaicplotisastackedbarchartwhereeachsegment
isproportionaltoitsgroupsfrequencycount.
Order By
Reordersthehistogram,mosaicplot,andFrequenciesreportin
ascendingordescendingorder,bycount.Tosavetheneworderas
acolumnproperty,usetheSave > Value Orderingoption.
Test Probabilities
Displaysareportthattestshypothesizedprobabilities.See
ExamplesoftheTestProbabilitiesOptiononpage 68formore
details.
Chapter 3
Basic Analysis
Distributions
Options for Continuous Variables
47
Thismenucontainsconfidencelevels.Selectavaluethatislisted,
orselectOthertoenteryourown.JMPcomputesscoreconfidence
intervals.
TheSavesubmenucontainsthefollowingoptions:
Level Numbers
CreatesanewcolumninthedatatablecalledLevel <colname>.
Thelevelnumberofeachobservationcorrespondstothe
histogrambarthatcontainstheobservation.
Value Ordering
(UsewiththeOrder Byoption)Createsanewvalueordering
columnpropertyinthedatatable,reflectingtheneworder.
Script to log
Displaysthescriptcommandstogeneratethecurrentreportinthe
logwindow.SelectView > Logtoseethelogwindow.
Remove
Permanentlyremovesthevariableandallitsreportsfromthe
Distributionreport.
ShowsorhidestheQuantilesreport.SeeTheQuantilesReport
onpage 41.
Changesthequantileincrementorrevertbacktothedefault
quantileincrement.
48
Distributions
Options for Continuous Variables
Chapter 3
Basic Analysis
Setscustomquantilesbyvaluesorbyincrements.Youcanalso
specifytheconfidencelevel.Smoothedempiricallikelihood
quantileestimates,basedonakerneldensityestimate,areadded
tothereport.Theconfidenceintervalsforthesequantileestimates
tendtocontainthetruequantilewiththepromisedconfidence
level.
Summary Statistics
ShowsorhidestheSummaryStatisticsreport.SeeTheSummary
StatisticsReportonpage 41.
Customize Summary
Statistics
AddsorremovesstatisticsfromtheSummaryStatisticsreport.See
TheSummaryStatisticsReportonpage 41.
Horizontal Layout
Changestheorientationofthehistogramandthereportsto
verticalorhorizontal.
Axes on Left
TheHistogram Optionssubmenucontainsthefollowingoptions:
Histogram
Showsorhidesthehistogram.SeeHistogramsonpage 38.
Shadowgram
Replacesthehistogramwithashadowgram.Tounderstanda
shadowgram,considerthatifthebinwidthofahistogramis
changed,theappearanceofthehistogramchanges.A
shadowgramoverlayshistogramswithdifferentbinwidths.
Dominantfeaturesofadistributionarelesstransparentonthe
shadowgram.
Notethatthefollowingoptionsarenotavailablefor
shadowgrams:
Vertical
Show Counts
Show Percents
Changestheorientationofthehistogramfromaverticaltoa
horizontalorientation.
Chapter 3
Basic Analysis
Distributions
Options for Continuous Variables
49
Drawsthestandarderrorbaroneachlevelofthehistogramusing
thestandarderror.Thestandarderrorbaradjustsautomatically
whenyouadjustthenumberofbarswiththehandtool.See
ResizeHistogramBarsforContinuousVariablesonpage 39,and
StatisticalDetailsforStandardErrorBarsonpage 74.
Changesthebinwidthofthehistogrambars.SeeResize
HistogramBarsforContinuousVariablesonpage 39.
Histogram Color
Changesthecolorofthehistogrambars.
Count Axis
Addsanaxisthatshowsthefrequencyofcolumnvalues
representedbythehistogrambars.
Note: Ifyouresizethehistogrambars,thecountaxisalsoresizes.
Prob Axis
Addsanaxisthatshowstheproportionofcolumnvalues
representedbyhistogrambars.
Note: Ifyouresizethehistogrambars,theprobabilityaxisalso
resizes.
Density Axis
Thedensityisthelengthofthebarsinthehistogram.Boththe
countandprobabilityarebasedonthefollowingcalculations:
prob=(barwidth)*density
count=(barwidth)*density*(totalcount)
WhenlookingatdensitycurvesthatareaddedbytheFit
Distributionoption,thedensityaxisshowsthepointestimatesof
thecurves.
Note: Ifyouresizethehistogrambars,thedensityaxisalsoresizes.
Show Percents
Labelstheproportionofcolumnvaluesrepresentedbyeach
histogrambar.
Show Counts
Labelsthefrequencyofcolumnvaluesrepresentedbyeach
histogrambar.
Addsanormalquantileplotthatshowstheextenttowhichthe
variableisnormallydistributed.SeeNormalQuantilePloton
page 50.
Addsanoutlierboxplotthatshowstheoutliersinyourdata.See
OutlierBoxPlotonpage 51.
50
Distributions
Options for Continuous Variables
Chapter 3
Basic Analysis
Addsaquantileboxplotthatshowsspecificquantilesfromthe
Quantilesreport.SeeQuantileBoxPlotonpage 52.
Addsastemandleafreport,whichisavariationofthehistogram.
SeeStemandLeafonpage 53.
CDF Plot
Addsaplotoftheempiricalcumulativedistributionfunction.See
CDFPlotonpage 53.
Test Mean
Performsaonesampletestforthemean.SeeTestMeanon
page 55.
Performsaonesampletestforthestandarddeviation.SeeTest
StdDevonpage 56.
Confidence Interval
Chooseconfidenceintervalsforthemeanandstandarddeviation.
SeeConfidenceIntervalsforContinuousVariablesonpage 57.
Prediction Interval
Choosepredictionintervalsforasingleobservation,orforthe
meanandstandarddeviationofthenextrandomlyselected
sample.SeePredictionIntervalsonpage 58.
Tolerance Interval
Computesanintervaltocontainatleastaspecifiedproportionof
thepopulation.SeeToleranceIntervalsonpage 59.
Capability Analysis
Measurestheconformanceofaprocesstogivenspecification
limits.SeeCapabilityAnalysisonpage 59.
Continuous Fit
Fitsdistributionstocontinuousvariables.SeeFitDistributions
onpage 62.
Discrete Fit
Fitsdistributionstodiscretevariables.SeeFitDistributionson
page 62.
Save
Savesinformationaboutcontinuousorcategoricalvariables.See
SaveCommandsforContinuousVariablesonpage 57.
Remove
Permanentlyremovesthevariableandallitsreportsfromthe
Distributionreport.
Chapter 3
Basic Analysis
Distributions
Options for Continuous Variables
ThenormalquantileplotalsoshowsLillieforsconfidencebounds(Conover1980)and
probabilityandnormalquantilescales.
Figure 3.7NormalQuantilePlot
Notethefollowinginformation:
Theyaxisshowsthecolumnvalues.
Thexaxisshowstheempiricalcumulativeprobabilityforeachvalue.
RelatedInformation
StatisticalDetailsfortheNormalQuantilePlotonpage 76
Notethefollowingaspectsaboutoutlierboxplots:
Thehorizontallinewithintheboxrepresentsthemediansamplevalue.
51
52
Distributions
Options for Continuous Variables
Chapter 3
Basic Analysis
Theconfidencediamondcontainsthemeanandtheupperandlower95%ofthemean.If
youdrewalinethroughthemiddleofthediamond,youwouldhavethemean.Thetop
andbottompointsofthediamondrepresenttheupperandlower95%ofthemean.
Theendsoftheboxrepresentthe25thand75thquantiles,alsoexpressedasthe1stand3rd
quartile,respectively.
Thedifferencebetweenthe1stand3rdquartilesiscalledtheinterquartilerange.
Theboxhaslinesthatextendfromeachend,sometimescalledwhiskers.Thewhiskers
extendfromtheendsoftheboxtotheoutermostdatapointthatfallswithinthedistances
computedasfollows:
1stquartile1.5*(interquartilerange)
3rdquartile+1.5*(interquartilerange)
Ifthedatapointsdonotreachthecomputedranges,thenthewhiskersaredeterminedby
theupperandlowerdatapointvalues(notincludingoutliers).
Thebracketoutsideoftheboxidentifiestheshortesthalf,whichisthemostdense50%of
theobservations(RousseuwandLeroy1987).
RemoveObjectsfromtheOutlierBoxPlot
Toremovetheconfidencediamondortheshortesthalf,proceedasfollows:
1. RightclickontheoutlierboxplotandselectCustomize.
2. ClickBox Plot.
3. DeselectthecheckboxnexttoConfidence DiamondorShortest Half.
FormoredetailsabouttheCustomizeGraphwindow,seetheUsingJMPbook.
Chapter 3
Basic Analysis
Distributions
Options for Continuous Variables
53
Figure 3.9QuantileBoxPlot
90% quantile
10% quantile
Quantilesarevalues.wherethepthquantileislargerthanp%ofthevalues.Forexample,10%
ofthedataliesbelowthe10thquantile,and90%ofthedataliesbelowthe90thquantile.
CDF Plot
TheCDFplotcreatesaplotoftheempiricalcumulativedistributionfunction.UsetheCDF
plottodeterminethepercentofdatathatisatorbelowagivenvalueonthexaxis.
54
Distributions
Options for Continuous Variables
Chapter 3
Basic Analysis
Figure 3.10CDFPlot
Forexample,inthisCDFplot,approximately30%ofthedataislessthanatotalfatvalueof10
grams.
Chapter 3
Basic Analysis
Distributions
Options for Continuous Variables
55
Test Mean
UsetheTest Meanwindowtospecifyoptionsforandperformaonesampletestforthemean.
Ifyouspecifyavalueforthestandarddeviation,aztestisperformed.Otherwise,thesample
standarddeviationisusedtoperformattest.Youcanalsorequestthenonparametric
WilcoxonSignedRanktest.
UsetheTest Mean optionrepeatedlytotestdifferentvalues.Eachtimeyoutestthemean,a
newTestMeanreportappears.
Table 3.9DescriptionoftheTestMeanReport
Statistics that are calculated for Test Mean:
tTest(orzTest)
Liststhevalueoftheteststatisticandthepvaluesforthetwosided
andonesidedalternatives.
SignedRank
(OnlyappearsfortheWilcoxonSignedRanktest)Liststhevalueof
theWilcoxonsignedrankstatisticfollowedbythepvaluesforthe
twosidedandonesidedalternatives.Thetestassumesonlythatthe
distributionissymmetric.SeeStatisticalDetailsfortheWilcoxon
SignedRankTestonpage 76.
Probability values:
Prob>|t|
Theprobabilityofobtaininganabsolutetvaluebychancealonethat
isgreaterthantheobservedtvaluewhenthepopulationmeanis
equaltothehypothesizedvalue.Thisisthepvalueforobserved
significanceofthetwotailedttest.
Prob>t
Theprobabilityofobtainingatvaluegreaterthanthecomputed
samplet ratiobychancealonewhenthepopulationmeanisnot
differentfromthehypothesizedvalue.Thisisthepvalueforan
uppertailedtest.
Prob<t
Theprobabilityofobtainingatvaluelessthanthecomputedsample
t ratiobychancealonewhenthepopulationmeanisnotdifferent
fromthehypothesizedvalue.Thisisthepvalueforalowertailed
test.
Table 3.10DescriptionsoftheTestMeanOptions
PValue animation
Startsaninteractivevisualrepresentationofthe
pvalue.Enablesyoutochangethehypothesizedmean
valuewhilewatchinghowthechangeaffectsthe
pvalue.
56
Distributions
Options for Continuous Variables
Chapter 3
Basic Analysis
Startsaninteractivevisualrepresentationofpowerand
beta.Youcanchangethehypothesizedmeanand
samplemeanwhilewatchinghowthechangesaffect
powerandbeta.
Remove Test
Removesthemeantest.
ProvidesthevalueoftheChisquareteststatistic.SeeStatistical
DetailsfortheStandardDeviationTestonpage 76.
MinPValue
TheprobabilityofobtainingagreaterChisquarevalueby
chancealonewhenthepopulationstandarddeviationisnot
differentfromthehypothesizedvalue.SeeStatisticalDetails
fortheStandardDeviationTestonpage 76.
Prob>ChiSq
TheprobabilityofobtainingaChisquarevaluegreaterthanthe
computedsampleChisquarebychancealonewhenthe
populationstandarddeviationisnotdifferentfromthe
hypothesizedvalue.Thisisthepvalueforobservedsignificance
ofaonetailedttest.
Prob<ChiSq
TheprobabilityofobtainingaChisquarevaluelessthanthe
computedsampleChisquarebychancealonewhenthe
populationstandarddeviationisnotdifferentfromthe
hypothesizedvalue.Thisisthepvalueforobservedsignificance
ofaonetailedttest.
Chapter 3
Basic Analysis
Distributions
Options for Continuous Variables
57
Column Added
to Data Table
Description
Level Numbers
Level
<colname>
Thelevelnumberofeachobservationcorrespondsto
thehistogrambarthatcontainstheobservation.The
histogrambarsarenumberedfromlowtohigh,
beginningwith1.
Level Midpoints
Midpoint
<colname>
Themidpointvalueforeachobservationis
computedbyaddinghalfthelevelwidthtothe
lowerlevelbound.
Ranks
Ranked
<colname>
Providesarankingforeachofthecorresponding
columnsvaluesstartingat1.Duplicateresponse
valuesareassignedconsecutiveranksinorderof
theiroccurrenceinthedatatable.
58
Distributions
Options for Continuous Variables
Chapter 3
Basic Analysis
Column Added
to Data Table
Description
Ranks Averaged
RankAvgd
<colname>
Ifavalueisunique,thentheaveragedrankisthe
sameastherank.Ifavalueoccursktimes,the
averagerankiscomputedasthesumofthevalues
ranksdividedbyk.
Prob Scores
Prob
<colname>
ForNnonmissingscores,theprobabilityscoreofa
valueiscomputedastheaveragedrankofthatvalue
dividedbyN + 1.Thiscolumnissimilartothe
empiricalcumulativedistributionfunction.
Normal Quantiles
N-Quantile
<colname>
SavestheNormalquantilestothedatatable.See
StatisticalDetailsfortheNormalQuantilePloton
page 76.
Standardized
Std<colname>
Savesstandardizedvaluestothedatatable.See
StatisticalDetailsforSavingStandardizedDataon
page 77.
Centered
Centered
<colname>
Savesvaluesforcenteringonzero.
Spec Limits
(none)
Storesthespecificationlimitsappliedinacapability
analysisasacolumnpropertyofthecorresponding
columninthecurrentdatatable.Automatically
retrievesanddisplaysthespecificationlimitswhen
yourepeatthecapabilityanalysis.
Script to Log
(none)
Printsthescripttothelogwindow.Runthescriptto
recreatetheanalysis.
Prediction Intervals
Predictionintervalsconcernasingleobservation,orthemeanandstandarddeviationofthe
nextrandomlyselectedsample.Thecalculationsassumethatthegivensampleisselected
randomlyfromanormaldistribution.Selectonesidedortwosidedpredictionintervals.
WhenyouselectthePrediction Interval optionforavariable,thePredictionIntervalswindow
appears.Usethewindowtospecifytheconfidencelevel,thenumberoffuturesamples,and
eitheraonesidedortwosidedlimit.
Chapter 3
Basic Analysis
Distributions
Options for Continuous Variables
59
RelatedInformation
StatisticalDetailsforPredictionIntervalsonpage 77
ExampleofPredictionIntervalsonpage 70
Tolerance Intervals
Atoleranceintervalcontainsatleastaspecifiedproportionofthepopulation.Itisa
confidenceintervalforaspecifiedproportionofthepopulation,notthemean,orstandard
deviation.CompletediscussionsoftoleranceintervalsarefoundinHahnandMeeker(1991)
andinTamhaneandDunlop(2000).
WhenyouselecttheTolerance Intervaloptionforavariable,theToleranceIntervalswindow
appears.Usethewindowtospecifytheconfidencelevel,theproportiontocover,andeithera
onesidedortwosidedlimit.Thecalculationsarebasedontheassumptionthatthegiven
sampleisselectedrandomlyfromanormaldistribution.
RelatedInformation
StatisticalDetailsforToleranceIntervalsonpage 78
ExampleofToleranceIntervalsonpage 71
Capability Analysis
TheCapability Analysisoptionmeasurestheconformanceofaprocesstogivenspecification
limits.WhenyouselecttheCapability Analysisoptionforavariable,theCapabilityAnalysis
windowappears.Usethewindowtoenterspecificationlimits,distributiontype,and
informationaboutsigma.
Note: Tosavethespecificationlimitstothedatatableasacolumnproperty,selectSave >
Spec Limits.Whenyourepeatthecapabilityanalysis,thesavedspecificationlimitsare
automaticallyretrieved.
TheCapabilityAnalysisreportisorganizedintotwosections:CapabilityAnalysisandthe
distributiontype(LongTermSigma,SpecifiedSigma,andsoon).
Capability Analysis Descriptions
TheCapabilityAnalysiswindow,report,andoptionsaredescribedinthefollowingtables.
60
Distributions
Options for Continuous Variables
Chapter 3
Basic Analysis
Table 3.13DescriptionoftheCapabilityAnalysisWindow
<Distributiontype>
Bydefault,thenormaldistributionisassumedwhen
calculatingthecapabilitystatisticsandthepercentoutofthe
specificationlimits.Toperformacapabilityanalysison
nonnormaldistributions,seethedescriptionofSpecLimits
underFitDistributionOptionsonpage 64.
<Sigmatype>
Estimatessigma(usingtheselectedmethods.See
StatisticalDetailsforCapabilityAnalysisonpage 79.
Table 3.14DescriptionoftheCapabilityAnalysisReport
Specification
Liststhespecificationlimits.
Value
Liststhevaluesthatyouspecifiedforeachspecificationlimitand
thetarget.
Portionand%Actual
Portionlabelsdescribethenumbersinthe %Actualcolumn,as
follows:
Capability
BelowLSLgivesthepercentageofthedatathatisbelowthe
lowerspecificationlimit.
AboveUSLgivesthepercentageofthedatathatisabovethe
upperspecificationlimit.
TotalOutsidegivesthetotalpercentageofthedatathatis
eitherbelowLSLoraboveUSL.
Typeofprocesscapabilityindices.SeeTable 3.19.
Note: ThereisapreferenceforCapabilitycalledPpk Capability
LabelingthatlabelsthelongtermcapabilityoutputwithPpk
labels.OpenthePreferencewindow(File > Preferences),then
selectPlatforms > Distributiontoseethispreference.
Index
Processcapabilityindexvalues.
UpperCI
Upperconfidenceinterval.
LowerCI
Lowerconfidenceinterval.
Chapter 3
Basic Analysis
Distributions
Options for Continuous Variables
61
PortionlabelsdescribethenumbersinthePercentcolumn,as
follows:
BelowLSLgivesthepercentageofthefitteddistributionthat
isbelowthelowerspecificationlimit.
AboveUSLgivesthepercentageofthefitteddistribution
thatisabovetheupperspecificationlimit.
TotalOutsidegivesthetotalpercentageofthefitted
distributionthatiseitherbelowLSLoraboveUSL.
PPM(partsper
million)
ThePPMvalueisthePercentcolumnmultipliedby10,000.
SigmaQuality
SigmaQualityisfrequentlyusedinSixSigmamethods,andis
alsoreferredtoastheprocesssigma.SeeStatisticalDetailsfor
CapabilityAnalysisonpage 79.
Table 3.15DescriptionoftheCapabilityAnalysisOptions
Z Bench
Showsthevalues(representedbyIndex)oftheBenchmarkZ
statistics.AccordingtotheAIAGStatisticalProcessControl
manual,Zrepresentsthenumberofstandarddeviationunitsfrom
theprocessaveragetoavalueofinterestsuchasanengineering
specification.Whenusedincapabilityassessment,ZUSListhe
distancetotheupperspecificationlimitandZLSListhedistance
tothelowerspecificationlimit.SeeStatisticalDetailsfor
CapabilityAnalysisonpage 79.
Capability Animation
Interactivelychangethespecificationlimitsandtheprocessmean
toseetheeffectsonthecapabilitystatistics.Thisoptionisavailable
onlyforcapabilityanalysesbasedontheNormaldistribution.
RelatedInformation
StatisticalDetailsforCapabilityAnalysisonpage 79
ExampleofCapabilityAnalysisonpage 73
62
Distributions
Fit Distributions
Chapter 3
Basic Analysis
Fit Distributions
UsetheContinuousorDiscrete Fitoptionstofitadistributiontoacontinuousordiscrete
variable.
Acurveisoverlaidonthehistogram,andaParameterEstimatesreportisaddedtothereport
window.Aredtrianglemenucontainsadditionaloptions.SeeFitDistributionOptionson
page 64.
Note: TheLifeDistributionplatformalsocontainsoptionsfordistributionfittingthatmight
usedifferentparameterizationsandallowforcensoring.SeetheQualityandProcessMethods
book.
Continuous Fit
UsetheContinuous Fitoptionstofitthefollowingdistributionstoacontinuousvariable.
TheNormaldistributionisoftenusedtomodelmeasuresthataresymmetricwithmostof
thevaluesfallinginthemiddleofthecurve.JMPusestheunbiasedestimatewhen
determiningtheparametersfortheNormaldistribution.
TheLogNormaldistributionisoftenusedtomodelvaluesthatareconstrainedbyzerobut
haveafewverylargevalues.TheLogNormaldistributioncanbeobtainedby
exponentiatingtheNormaldistribution.JMPusesthemaximumlikelihoodestimation
whendeterminingtheparametersfortheLogNormaldistribution.
TheWeibull,Weibullwiththreshold,andExtremeValuedistributionsoftenprovidea
goodmodelforestimatingthelengthoflife,especiallyformechanicaldevicesandin
biology.
TheExponentialdistributionisespeciallyusefulfordescribingeventsthatrandomlyoccur
overtime,suchassurvivaldata.Theexponentialdistributionmightalsobeusefulfor
modelingelapsedtimebetweentheoccurrenceofnonoverlappingevents,suchasthe
timebetweenauserscomputerqueryandresponseoftheserver,thearrivalofcustomers
ataservicedesk,orcallscominginataswitchboard.
TheGammadistributionisboundbyzeroandhasaflexibleshape.
TheBetadistributionisusefulformodelingthebehaviorofrandomvariablesthatare
constrainedtofallintheinterval0,1.Forexample,proportionsalwaysfallbetween0and
1.
TheNormalMixturesdistributionfitsamixtureofnormaldistributions.Thisflexible
distributioniscapableoffittingmultimodaldata.Youcanalsofittwoormore
distributionsbyselectingtheNormal 2 Mixture,Normal 3 Mixture,orOtheroptions.
Chapter 3
Basic Analysis
Distributions
Fit Distributions
63
TheSmoothCurvedistribution...Asmoothcurveisfitusingnonparametricdensity
estimation(kerneldensityestimation).Thesmoothcurveisoverlaidonthehistogramand
asliderappearsbeneaththeplot.Controltheamountofsmoothingbychangingthekernel
standarddeviationwiththeslider.TheinitialKernel Stdestimateisformedbysumming
thenormaldensitiesofthekernelstandarddeviationlocatedateachdatapoint.
TheJohnsonSu,JohnsonSb,andJohnsonSlDistributionsareusefulforitsdatafitting
capabilitiesbecauseitsupportseverypossiblecombinationofskewnessandkurtosis.
TheGeneralizedLog(Glog)distributionisusefulforfittingdatathatarerarelynormally
distributedandoftenhavenonconstantvariance,likebiologicalassaydata.
StatisticalDetailsforContinuousFitDistributionsonpage 82
StatisticalDetailsforFittedQuantilesonpage 91
StatisticalDetailsforFitDistributionOptionsonpage 91
Discrete Fit
UsetheDiscreteFitoptionstofitadistribution(suchasPoissonorBinomial)toadiscrete
variable.Theavailabledistributionsareasfollows:
Poisson
Binomial
GammaPoisson
BetaBinomial
64
Distributions
Fit Distributions
Chapter 3
Basic Analysis
RelatedInformation
StatisticalDetailsforDiscreteFitDistributionsonpage 88
StatisticalDetailsforFittedQuantilesonpage 91
StatisticalDetailsforFitDistributionOptionsonpage 91
Createsaquantileoraprobabilityplot.SeeDiagnosticPlot
onpage 65.
Density Curve
Usestheestimatedparametersofthedistributiontooverlay
adensitycurveonthehistogram.
Goodness of Fit
Computesthegoodnessoffittestforthefitteddistribution.
SeeGoodnessofFitonpage 66.
Fix Parameters
Enablesyoutofixparametersandreestimatethenonfixed
parameters.AnAdequacyLR(likelihoodratio)Testreport
alsoappears,whichtestsyournewparameterstodetermine
whethertheyfitthedata.
Quantiles
Returnstheunscaledanduncenteredquantilesforthe
specificlowerprobabilityvaluesthatyouspecify.
Usethisoptionwhenyoudonotknowthespecification
limitsforaprocessandyouwanttouseitsdistributionasa
guidelineforsettingspecificationlimits.
Usuallyspecificationlimitsarederivedusingengineering
considerations.Iftherearenoengineeringconsiderations,
andifthedatarepresentsatrustedbenchmark(wellbehaved
process),thenquantilesfromafitteddistributionareoften
usedtohelpsetspecificationlimits.SeeStatisticalDetails
forFitDistributionOptionsonpage 91.
Spec Limits
Computesgeneralizationsofthestandardcapabilityindices,
basedonthespecificationlimitsandtargetyouspecify.See
SpecLimitsonpage 67.
Chapter 3
Basic Analysis
Distributions
Fit Distributions
65
Savesthefittedquantilevaluesasanewcolumninthe
currentdatatable.SeeStatisticalDetailsforFitted
Quantilesonpage 91.
Createsanewcolumninthecurrentdatatablethatcontains
fittedvaluesthathavebeencomputedbythedensity
formula.Thedensityformulausestheestimatedparameter
values.
Savesthespecificationlimitsasacolumnproperty.See
StatisticalDetailsforFitDistributionOptionsonpage 91.
Save Transformed
Createsanewcolumnandsavesaformula.Theformulacan
transformthecolumntonormalityusingthefitted
distribution.Thisoptionisavailableonlywhenoneofthe
JohnsondistributionsortheGlogdistributionisfit.
Remove Fit
Removesthedistributionfitfromthereportwindow.
Diagnostic Plot
TheDiagnostic Plotoptioncreatesaquantileoraprobabilityplot.Dependingonthefitted
distribution,theplotisoneoffourformats.
Table 3.17DescriptionsofPlotFormats
Plot Format
Applicable Distributions
Thefittedquantilesversusthedata
Weibullwiththreshold
Gamma
Beta
Poisson
GammaPoisson
Binomial
BetaBinomial
Normal
NormalMixtures
Exponential
Thefittedprobabilityversusthedata
66
Distributions
Fit Distributions
Chapter 3
Basic Analysis
Applicable Distributions
Thefittedprobabilityversusthedataonlogscale
Weibull
LogNormal
ExtremeValue
JohnsonSl
JohnsonSb
JohnsonSu
Glog
Thefittedprobabilityversusthestandardnormal
quantile
Table 3.18describestheoptionsintheredtrianglemenunexttoDiagnosticPlot.
Table 3.18DescriptionsoftheDiagnosticPlotOptions
Rotate
Reversesthexandyaxes.
Confidence
Limits
DrawsLilliefors95%confidencelimitsfortheNormalQuantileplot,and
95%equalprecisionbandswitha = 0.001andb = 0.99forallotherquantile
plots(MeekerandEscobar(1998)).
Line of Fit
Drawsthestraightdiagonalreferenceline.Ifavariablefitstheselected
distribution,thevaluesfallapproximatelyonthereferenceline.
Median
Reference
Line
Drawsahorizontallineatthemedianoftheresponse.
Goodness of Fit
TheGoodness of Fitoptioncomputesthegoodnessoffittestforthefitteddistribution.The
goodnessoffittestsarenotChisquaretests,butareEDF(EmpiricalDistributionFunction)
tests.EDFtestsofferadvantagesovertheChisquaretests,includingimprovedpowerand
invariancewithrespecttohistogrammidpoints.
ForNormaldistributions,theShapiroWilktestfornormalityisreportedwhenthesample
sizeislessthanorequalto2000,andtheKSLtestiscomputedforsamplesthataregreater
than2000.
Fordiscretedistributions(suchasPoissondistributions)thathavesamplesizeslessthan
orequalto30,theGoodnessofFittestisformedusingtwoonesidedexactKolmogorov
testscombinedtoformanearexacttest.Fordetails,seeConover1972.Forsamplesizes
greaterthan30,aPearsonChisquaredgoodnessoffittestisperformed.
Chapter 3
Basic Analysis
Distributions
Additional Examples of the Distribution Platform
67
RelatedInformation
StatisticalDetailsforFitDistributionOptionsonpage 91
Spec Limits
TheSpec Limitsoptionlaunchesawindowrequestingspecificationlimitsandtarget,and
thencomputesgeneralizationsofthestandardcapabilityindices.Thisisdoneusingthefact
thatforthenormaldistribution,3isboththedistancefromthelower0.135percentileto
median(ormean)andthedistancefromthemedian(ormean)totheupper99.865percentile.
Thesepercentilesareestimatedfromthefitteddistribution,andtheappropriate
percentiletomediandistancesaresubstitutedfor3inthestandardformulas.
RelatedInformation
StatisticalDetailsforFitDistributionOptionsonpage 91
68
Distributions
Additional Examples of the Distribution Platform
Chapter 3
Basic Analysis
Figure 3.11SelectingDatainMultipleHistograms
Chapter 3
Basic Analysis
Distributions
Additional Examples of the Distribution Platform
Figure 3.12ExamplesofTestProbabilitiesOptions
ExampleofGeneratingtheTestProbabilitiesReport
Togenerateatestprobabilitiesreportforavariablewithmorethantwolevels:
1. RefertoFigure 3.12atleft.Type0.25inallfourHypothProbfields.
2. ClicktheFix hypothesized values, rescale omittedbutton.
3. ClickDone.
LikelihoodRatioandPearsonChisquaretestsarecalculated.SeeFigure 3.13atleft.
Togenerateatestprobabilitiesreportforavariablewithexactlytwolevels:
1. RefertoFigure 3.12atright.Type0.5inbothHypothProbfields.
2. Clicktheprobability less than hypothesized valuebutton.
3. ClickDone.
Exactprobabilitiesarecalculatedforthebinomialtest.SeeFigure 3.13atright.
69
70
Distributions
Additional Examples of the Distribution Platform
Chapter 3
Basic Analysis
Figure 3.13ExamplesofTestProbabilitiesReports
Chapter 3
Basic Analysis
Distributions
Additional Examples of the Distribution Platform
71
Figure 3.14ThePredictionIntervalsWindow
Inthisexample,youcanbe95%confidentaboutthefollowing:
Eachofthenext10observationswillbebetween0.013755and0.279995.
Themeanofthenext10observationswillbebetween0.115596and0.178154.
Thestandarddeviationofthenext10observationswillbebetween0.023975and0.069276.
72
Distributions
Additional Examples of the Distribution Platform
Chapter 3
Basic Analysis
6. Keepthedefaultselections,andclickOK.
Figure 3.17ExampleofaToleranceIntervalReport
Inthisexample,youcanbe95%confidentthatatleast90%ofthepopulationliebetween
0.057035and0.236715,basedontheLowerTI(toleranceinterval)andUpperTIvalues.
Chapter 3
Basic Analysis
Distributions
Additional Examples of the Distribution Platform
73
74
Distributions
Statistical Details for the Distribution Platform
Chapter 3
Basic Analysis
Thespeclimitsareaddedtothehistogramsothatthedatacanbevisuallycomparedtothe
limits.Asyoucansee,someoftheabrasionlevelsarebelowthelowerspeclimit,andsome
areveryclosetotheupperspeclimit.TheCapabilityAnalysisresultsareaddedtothereport.
TheCpkvalueis0.453,indicatingaprocessthatisnotcapable,relativetothegiven
specificationlimits.
Iftheresultisaninteger,thepthquantileisthatrankscorrespondingvalue.
Iftheresultisnotaninteger,thepthquantileisfoundbyinterpolation.Thepthquantile,
denotedqp,iscomputedasfollows:
q p = 1 f y i + f y i + 1
where:
nisthenumberofnonmissingvaluesforavariable
y1,y2,...,ynrepresentstheorderedvaluesofthevariable
yn+1 istakentobe yn
iistheintegerpartandfisthefractionalpartof(n+1)p.
(n+1)p=i+f
Forexample,supposeadatatablehas15rowsandyouwanttofindthe75thand90thquantile
valuesofacontinuouscolumn.Afterthecolumnisarrangedinascendingorder,theranks
thatcontainthesequantilesarecomputedasfollows:
7590
------- 15 + 1 = 12 and --------- 15 + 1 = 14.4
100
100
Chapter 3
Basic Analysis
Distributions
Statistical Details for the Distribution Platform
75
Thevaluey12isthe75thquantile.The90thquantileisinterpolatedbycomputingaweighted
averageofthe14thand15thrankedvaluesasy90 = 0.6y14 + 0.4y15.
s =
2
s where
N
s =
i=1
wi yi yw
------------------------------N1
y w = weightedmean
andwiisaweightterm(=1forequallyweighteditems)
76
Distributions
Statistical Details for the Distribution Platform
Chapter 3
Basic Analysis
Kurtosis
Kurtosisisbasedonthefourthmomentaboutthemeanandiscomputedasfollows:
nn + 1
-------------------------------------------------- n 1 n 2 n 3
i=1
4
2
2 x i x
3 n 1
w i ------------ ---------------------------------n 2n 3
s
wherewiisaweightterm(=1forequallyweighteditems).Usingthisformula,the
Normaldistributionhasakurtosisof0.
whereriistherankoftheithobservation,andNisthenumberofnonmissing(and
nonexcluded)observations.
Thenormalquantilevaluesarecomputedasfollows:
ri
1 -------------
N + 1
whereisthecumulativeprobabilitydistributionfunctionforthenormaldistribution.
ThesenormalquantilevaluesareVanDerWaerdenapproximationstotheorderstatisticsthat
areexpectedforthenormaldistribution.
---------------------n 1 s
2
TheTestStatisticisdistributedasaChisquarevariablewithn1degreesoffreedomwhen
thepopulationisnormal.
Chapter 3
Basic Analysis
Distributions
Statistical Details for the Distribution Platform
TheMinPValueisthepvalueofthetwotailedtest,andiscalculatedasfollows:
2*min(p1,p2)
wherep1istheloweronetailpvalueandp2istheupperonetailpvalue.
ri
1 -------------
N + 1
where:
isthecumulativeprobabilitydistributionfunctionforthenormaldistribution
riistherankoftheithobservation
Nisthenumberofnonmissingobservations
Xistheoriginalcolumn
X isthemeanofcolumnX
SXisthestandarddeviationofcolumnX
Formfutureobservations:
1
y ym = X t 1 2m ;n 1 1 + --- s for m 1
m
n
Forthemeanofmfutureobservations:
1- + --1- s
Y l Y u = X t 1 2 n 1 --for m 1 .
m n
Forthestandarddeviationofmfutureobservations:
1
s l s u = s -------------------------------------------------------- s F 1 2 ; m 1 n 1 for m 2
F 1 2 ; n 1 m 1
77
78
Distributions
Statistical Details for the Distribution Platform
Chapter 3
Basic Analysis
wherem=numberoffutureobservations,andn=numberofpointsincurrentanalysis
sample.
Theonesidedintervalsareformedbyusing1inthequantilefunctions.
Forreferences,seeHahnandMeeker(1991),pages6164.
p n n fromTable1ofOdehandOwen(1980).
tisthequantilefromthenoncentraltdistribution,and
quantile.
isthestandardnormal
Two-Sided Interval
Thetwosidedintervaliscomputedasfollows:
T p ,T p = x g 1 2 ;p ,n s ,x + g 1 2 ;p ,n s
L
U
L
U
where
s=standarddeviationand g 1 2 ;p,n isaconstantthatcanbefoundinTable4ofOdeh
andOwen1980).
Todetermineg,considerthefractionofthepopulationcapturedbythetoleranceinterval.
TamhaneandDunlop(2000)givethisfractionasfollows:
x + gs
x gs
------------------------ ------------------------
wheredenotesthestandardnormalc.d.f.(cumulativedistributionfunction).Therefore,
gsolvesthefollowingequation:
X + gs
X gs
P ------------------------- ------------------------- 1 = 1
where1isthefractionofallfutureobservationscontainedinthetoleranceinterval.
Chapter 3
Basic Analysis
Distributions
Statistical Details for the Distribution Platform
79
MoreinformationisgiveninTablesA.1a,A.1b,A.11a,andA.11bofHahnandMeeker(1991).
Long-termusestheoverallsigma.ThisoptionisusedforPpkstatistics,andcomputes
sigmaasfollows:
n
i=1
xi x
-------------------n1
Specified Sigmaenablesyoutotypeaspecific,knownsigmausedforcomputing
capabilityanalyses.Sigmaisuserspecified,andisthereforenotcomputed.
Moving Rangeenablesyoutoenterarangespan,whichcomputessigmaasfollows:
R - where
= ------------d2 n
R istheaverageofthemovingranges
d2(n)istheexpectedvalueoftherangeofnindependentnormallydistributedvariables
withunitstandarddeviation.
andeachithsubgroupisdefinedbytheorderofthedata,sigmaiscomputedasfollows:
nj
nj
X ij X i
1---X
=
----------------------- n 1 where i n X ij
j
j
j=1
i = 1j = 1
ThisformulaiscommonlyreferredtoastheRootMeanSquareError,orRMSE.
1--r
Table 3.19DescriptionsofCapabilityIndicesandComputationalFormulas
Index
Index Name
Formula
CP
processcapability
ratio,Cp
(USLLSL)/6swhere:
USListheupperspeclimit
LSListhelowerspeclimit
80
Distributions
Statistical Details for the Distribution Platform
Chapter 3
Basic Analysis
Index Name
CIsforCP
LowerCIonCP
UpperCIonCP
CPK(PPKfor
AIAG)
processcapability
index,Cpk
CIsforCPK
LowerCI
SeeBissell(1990)
UpperCI
CPM
processcapability
index,Cpm
Formula
2
2 n 1
CP -------------------------n1
2
1 2 n 1
CP ---------------------------------n1
min(CPL,CPU)
1
1
C pk 1 1 1 2 ----------------- + -------------------2
n
1
9n C pk
1 - + ------------------1 C pk 1 + 1 1 2 ---------------2
n
1
9n C pk
min
target LSL USL target ---------------------------------------------------------------------------------2
3 s + mean target
Note: CPMconfidenceintervalsarenot
reportedwhenthetargetisnotwithinthe
LowerandUpperSpecLimitsrange.CPM
intervalsareonlyreportedwhenthetargetis
withinthisrange.JMPwritesamessagetothe
logtonotewhytheCPMconfidenceintervals
aremissing.
CIsforCPM
LowerCIonCPM
x Target 2 2
n 1 + --------------------------
s
= -------------------------------------------------------x Target 2
1 + 2 --------------------------
s
UpperCIonCPM
1 2
CPM -------------------------
where=sameasabove.
Chapter 3
Basic Analysis
Distributions
Statistical Details for the Distribution Platform
81
Index Name
Formula
CPL
processcapability
ratioofonesided
lowerspec
(meanLSL)/3s
CPU
processcapability
ratioofonesided
upperspec
(USLmean)/3s
Acapabilityindexof1.33isconsideredtobetheminimumacceptable.Foranormal
distribution,thisgivesanexpectednumberofnonconformingunitsofabout6per100,000.
Exact100(1)%lowerandupperconfidencelimitsforCPLarecomputedusinga
generalizationofthemethodofChouetal.(1990),whopointoutthatthe100(1)lower
confidencelimitforCPL(denotedbyCPLLCL)satisfiesthefollowingequation:
Pr T n 1 = 3 n CPLLCL 3CPL n = 1
whereTn1()hasanoncentraltdistributionwithn1degreesoffreedomand
noncentralityparameter.
Exact100(1)%lowerandupperconfidencelimitsforCPUarealsocomputedusinga
generalizationofthemethodofChouetal.(1990),whopointoutthatthe100(1)lower
confidencelimitforCPU(denotedCPULCL)satisfiesthefollowingequation:
Pr T n 1 = 3 n CPULCL 3CPU n = 1
whereTn1()hasanoncentraltdistributionwithn1degreesoffreedomand
noncentralityparameter.
Note: Becauseofalackofsupportingresearchatthetimeofthiswriting,computing
confidenceintervalsforcapabilityindicesisnotrecommended,exceptforcaseswhenthe
capabilityindicesarebasedonthestandarddeviation.
SigmaQualityisdefinedasthefollowing
SigmaQuality = NormalQuantile 1 %outside
-------------------------- + 1.5
100
SigmaQualityAbove = NormalQuantile 1 %above
---------------------- + 1.5
100
SigmaQualityBelow = NormalQuantile 1 %below
---------------------- + 1.5
100
82
Distributions
Statistical Details for the Distribution Platform
Chapter 3
Basic Analysis
Forexample,ifthereare3defectsinn=1,000,000observations,theformulayields6.03,ora
6.03sigmaprocess.TheresultsofthecomputationsoftheSigmaQualityAboveUSLand
SigmaQualityBelowLSLcolumnvaluesdonotsumtotheSigmaQualityTotalOutside
columnvaluebecausecalculatingSigmaQualityinvolvesfindingnormaldistribution
quantiles,andisthereforenotadditive.
HerearetheBenchmarkZformulas:
ZUSL=(USLXbar)/sigma=3*CPU
ZLSL=(XbarLSL)/sigma=3*CPL
ZBench=InverseCumulativeProb(1P(LSL)P(USL))
where:
P(LSL)=Prob(X<LSL)=1CumProb(ZLSL)
P(USL)=Prob(X>USL)=1CumProb(ZUSL).
(themean)definesthelocationofthedistributiononthexaxis
(standarddeviation)definesthedispersionorspreadofthedistribution
1
x 2
----------------exp -------------------2 2
2 2
for x ; ;
E(x)=
Var(x)=2
LogNormal
TheLogNormalfittingoptionestimatestheparameters(scale)and(shape)forthe
twoparameterlognormaldistribution.AvariableYislognormalifandonlyif X = ln Y is
normal.Thedatamustbegreaterthanzero.
Chapter 3
Basic Analysis
pdf:
Distributions
Statistical Details for the Distribution Platform
log x 2
exp ------------------------------------1
2 2
-------------- ----------------------------------------------------x
2
83
for 0 x ; ; 0<
E(x)= exp + 2 2
Var(x)= exp 2 + 2 exp 2 + 2
Weibull, Weibull with Threshold, and Extreme Value
TheWeibulldistributionhasdifferentshapesdependingonthevaluesof(scale)and
(shape).Itoftenprovidesagoodmodelforestimatingthelengthoflife,especiallyfor
mechanicaldevicesandinbiology.TheWeibulloptionisthesameastheWeibullwith
thresholdoption,withathreshold()parameterofzero.FortheWeibullwiththreshold
option,JMPestimatesthethresholdastheminimumvalue.Ifyouknowwhatthethreshold
shouldbe,setitbyusingtheFix Parametersoption.SeeFitDistributionOptionsonpage 64.
ThepdffortheWeibullwiththresholdisasfollows:
pdf:
x
----- x 1 exp ------------
for ,>0; x
1
E(x)= + 1 + ---
where . istheGammafunction.
TheExtremeValuedistributionisatwoparameterWeibull(,)distributionwiththe
transformedparameters = 1 / and = ln().
Exponential
Theexponentialdistributionisespeciallyusefulfordescribingeventsthatrandomlyoccur
overtime,suchassurvivaldata.Theexponentialdistributionmightalsobeusefulfor
modelingelapsedtimebetweentheoccurrenceofnonoverlappingevents,suchasthetime
betweenauserscomputerqueryandresponseoftheserver,thearrivalofcustomersata
servicedesk,orcallscominginataswitchboard.
TheExponentialdistributionisaspecialcaseofthetwoparameterWeibullwhen=1and
,andalsoaspecialcaseoftheGammadistributionwhen=1.
pdf:
1
--- exp x
for 0<; 0 x
84
Distributions
Statistical Details for the Distribution Platform
Chapter 3
Basic Analysis
E(x)=
Var(x)=
Devore(1995)notesthatanexponentialdistributionismemoryless.Memorylessmeansthatif
youcheckacomponentafterthoursanditisstillworking,thedistributionofadditional
lifetime(theconditionalprobabilityofadditionallifegiventhatthecomponenthasliveduntil
t)isthesameastheoriginaldistribution.
Gamma
TheGammafittingoptionestimatesthegammadistributionparameters, > 0and > 0.The
parameter,calledalphainthefittedgammareport,describesshapeorcurvature.The
parameter,calledsigma,isthescaleparameterofthedistribution.Athirdparameter,,
calledtheThreshold,isthelowerendpointparameter.Itissettozerobydefault,unlessthere
arenegativevalues.YoucanalsosetitsvaluebyusingtheFix Parametersoption.SeeFit
DistributionOptionsonpage 64.
pdf:
1 ------------------ x 1 exp x
for x ; 0<,
E(x)=+
Var(x)=
Thestandardgammadistributionhas = 1.Sigmaiscalledthescaleparameterbecause
valuesotherthan1stretchorcompressthedistributionalongthexaxis.
Theexponentialdistributionisthefamilyofgammacurvesthatoccurwhen = 1and
= 0.
Chapter 3
Basic Analysis
Distributions
Statistical Details for the Distribution Platform
85
SetparameterstofixedvaluesbyusingtheFix Parametersoption.Theupperthresholdmust
begreaterthanorequaltothemaximumdatavalue,andthelowerthresholdmustbeless
thanorequaltotheminimumdatavalue.FordetailsabouttheFixParametersoption,seeFit
DistributionOptionsonpage 64.
1
------------------------------------------- x 1 + x 1
B + 1
pdf:
for x + ; 0<,,
E(x)= + ------------+
2
Var(x)= ------------------------------------------------ + 2 + + 1
where B . istheBetafunction.
Normal Mixtures
TheNormal Mixturesoptionfitsamixtureofnormaldistributions.Thisflexibledistributionis
capableoffittingmultimodaldata.
FitamixtureoftwoorthreenormaldistributionsbyselectingtheNormal 2 MixtureorNormal
3 Mixtureoptions.Alternatively,youcanfitamixtureofknormaldistributionsbyselecting
theOtheroption.Aseparatemean,standarddeviation,andproportionofthewholeis
estimatedforeachgroup.
i=1
k
E(x)=
x
----- --------------i
i i
k
i
pdf:
i =1
i i
k
Var(x)=
i=1
2
i i
i =1
i i2 + i2
wherei,i,andiaretherespectivemean,standarddeviation,andproportionfortheith
group,and . isthestandardnormalpdf.
Smooth Curve
TheSmooth Curveoptionfitsasmoothcurveusingnonparametricdensityestimation(kernel
densityestimation).Thesmoothcurveisoverlaidonthehistogramandasliderappears
beneaththeplot.Controltheamountofsmoothingbychangingthekernelstandarddeviation
withtheslider.TheinitialKernel Stdestimateisformedbysummingthenormaldensitiesof
thekernelstandarddeviationlocatedateachdatapoint.
86
Distributions
Statistical Details for the Distribution Platform
Chapter 3
Basic Analysis
where,fortheJohnsonSu:
2
1
f y = ln Y + 1 + Y = sinh Y
Y = X
-----------
where,fortheJohnsonSb:
Y
f Y = ln -------------
1Y
Y = X
-----------
X+
andfortheJohnsonSl,where = 1 .
f Y = ln Y
Y = X
-----------
X if = 1
X if = 1
JohnsonSu
pdf:
x 2 1 2
--- 1 + ------------
+ sinh 1 ------------
for x ; 0<,
JohnsonSb
pdf:
+ ln -------------------------- ------------------------------------------------
x x x
for <x<+
Chapter 3
Basic Analysis
Distributions
Statistical Details for the Distribution Platform
87
JohnsonSl
pdf:
x
--------------- + ln ------------
x
where . isthestandardnormalpdf.
Note: Theparameterconfidenceintervalsarehiddeninthedefaultreport.Parameter
confidenceintervalsarenotverymeaningfulforJohnsondistributions,becausetheyare
transformationstonormality.Toshowparameterconfidenceintervals,rightclickinthereport
andselectColumns > Lower 95%andUpper 95%.
Generalized Log (Glog)
Thisdistributionisusefulforfittingdatathatarerarelynormallydistributedandoftenhave
nonconstantvariance,likebiologicalassaydata.TheGlogdistributionisdescribedwiththe
parameters(location),(scale),and(shape).
pdf:
x + x2 + 2
x + x2 + 2
--- log -------------------------------- -----------------------------------------------------------
x 2 + 2 + x x 2 + 2
for 0 ; 0<;
TheGlogdistributionisatransformationtonormality,andcomesfromthefollowing
relationship:
1
x + x2 + 2
Ifz= --- log -------------------------------- ~N(0,1),thenx~Glog(,,).
2
When=0,theGlogreducestotheLogNormal(,).
Note: Theparameterconfidenceintervalsarehiddeninthedefaultreport.Parameter
confidenceintervalsarenotverymeaningfulfortheGLogdistribution,becauseitisa
transformationtonormality.Toshowparameterconfidenceintervals,rightclickinthereport
andselectColumns > Lower 95%andUpper 95%.
All
IntheCompareDistributionsreport,theShowDistributionlistissortedbyAICcinascending
order.
TheformulaforAICcisasfollows:
2 + 1
AICc= 2logL + 2 + -------------------------n + 1
88
Distributions
Statistical Details for the Distribution Platform
Chapter 3
Basic Analysis
where:
logListhelogLikelihood
nisthesamplesize
isthenumberofparameters
Ifyourdatahasnegativevalues,theShowDistributionlistdoesnotincludethosedistributions
thatrequiredatawithpositivevalues.Ifyourdatahasnonintegervalues,thelistof
distributionsdoesnotincludediscretedistributions.Distributionswiththresholdparameters,
likeBetaandJohnsonSb,arenotincludedinthelistofpossibledistributions.
e x
--------------x!
for 0 ; x=0,1,2,...
E(x)=
Var(x)=
SincethePoissondistributionisadiscretedistribution,theoverlaidcurveisastepfunction,
withjumpsoccurringateveryinteger.
Gamma Poisson
ThisdistributionisusefulwhenthedataisacombinationofseveralPoisson()distributions,
eachwithadifferent.Oneexampleistheoverallnumberofaccidentscombinedfrom
multipleintersections,whenthemeannumberofaccidents()variesbetweenthe
intersections.
TheGammaPoissondistributionresultsfromassumingthatx|followsaPoisson
distributionandfollowsaGamma(,).TheGammaPoissonhasparameters and
.Theisadispersionparameter.If>1,thereisoverdispersion,meaningthereis
morevariationinxthanexplainedbythePoissonalone.If=1,xreducestoPoisson().
pmf:
E(x)=
-
x + ---------- ----------
1 1 x 1
------------------------------------------- ------------
x + 1 ------------
1
for 0 ; 1 ; x=0,1,2,...
Chapter 3
Basic Analysis
Distributions
Statistical Details for the Distribution Platform
89
Var(x)=
where . istheGammafunction.
Rememberthatx|~Poisson(),while~Gamma(,).Theplatformestimates and
Toobtainestimatesforand,usethefollowingformulas:
= 1
= --
Iftheestimateofis1,theformulasdonotwork.Inthatcase,theGammaPoissonhas
reducedtothePoisson(),and istheestimateof
Iftheestimateforisaninteger,theGammaPoissonisequivalenttoaNegativeBinomial
withthefollowingpmf:
y + r 1 r
py =
p 1 p y
for 0 y
withr=and(1p)/p=.
Binomial
TheBinomialoptionacceptsdataintwoformats:aconstantsamplesize,oracolumn
containingsamplesizes.
pmf:
n p x 1 p n x
x
for 0 p 1 ; x=0,1,2,...,n
E(x)=np
Var(x)=np(1p)
wherenisthenumberofindependenttrials.
Note: TheconfidenceintervalforthebinomialparameterisaScoreinterval.SeeAgresti
(1998).
Beta Binomial
ThisdistributionisusefulwhenthedataisacombinationofseveralBinomial(p)distributions,
eachwithadifferentp.Oneexampleistheoverallnumberofdefectscombinedfrommultiple
manufacturinglines,whenthemeannumberofdefects(p)variesbetweenthelines.
90
Distributions
Statistical Details for the Distribution Platform
Chapter 3
Basic Analysis
TheBetaBinomialdistributionresultsfromassumingthatx|followsaBinomial(n,)
distributionandfollowsaBeta(,).TheBetaBinomialhasparametersp = /(+)and = 1/
(++1).Theisadispersionparameter.When>0,thereisoverdispersion,meaningthereis
morevariationinxthanexplainedbytheBinomialalone.When<0,thereisunder
dispersion.When = 0,xisdistributedasBinomial(n,p).TheBetaBinomialonlyexistswhen
n2.
pmf:
1
1
1
--- 1 x + p --- 1 n x + 1 p --- 1
n --------------------------------------------------------------------------------------------------------------------------- x
p 1--- 1 1 p 1--- 1 n + 1
--- 1
p
1p
for 0 p 1 ; max ( ---------------------, ---------------------) 1 ; x=0,1,2,...,n
np1 n2+p
E(x)=np
Var(x)=np(1p)[1+(n1)]
where . istheGammafunction.
Rememberthatx|~Binomial(n,),while~Beta(,).Theparametersp = /(+)and = 1/
(++1)areestimatedbytheplatform.Toobtainestimatesofand,usethefollowing
formulas:
1
= p ----------
1
= 1 p ----------
Iftheestimateofis0,theformulasdonotwork.Inthatcase,theBetaBinomialhasreduced
totheBinomial(n,p),and p istheestimateofp.
TheconfidenceintervalsfortheBetaBinomialparametersareprofilelikelihoodintervals.
Comparing All Distributions
TheShowDistributionlistissortedbyAICcinascendingorder.
TheformulaforAICcisasfollows:
2 + 1 AICc= 2logL + 2 + ------------------------n + 1
where:
logListhelogLikelihood
nisthesamplesize
isthenumberofparameters
Chapter 3
Basic Analysis
Distributions
Statistical Details for the Distribution Platform
91
Parameters
Normala
andareunknown
ShapiroWilk(forn2000)
KolmogorovSmirnovLillefors
(forn>2000)
andarebothknown
KolmogorovSmirnovLillefors
eitherorisknown
(none)
LogNormal
andareknownor
unknown
KolmogorovsD
Weibull
andknownor
unknown
CramrvonMisesW2
Weibullwiththreshold
,andknownor
unknown
CramrvonMisesW2
ExtremeValue
andknownor
unknown
CramrvonMisesW2
Exponential
isknownorunknown
KolmogorovsD
Gamma
andareknown
CramrvonMisesW2
eitherorisunknown
(none)
92
Distributions
Statistical Details for the Distribution Platform
Chapter 3
Basic Analysis
Parameters
Beta
andareknown
KolmogorovsD
eitherorisunknown
(none)
Binomial
isknownorunknown
andnisknown
KolmogorovsD(forn30)
Pearson(forn>30)
BetaBinomial
andknownor
unknown
KolmogorovsD(forn30)
Pearson(forn>30)
Poisson
knownorunknown
KolmogorovsD(forn30)
Pearson(forn>30)
GammaPoisson
orknownorunknown
KolmogorovsD(forn30)
Pearson(forn>30)
a. For the three Johnson distributions and the Glog distribution, the data are transformed to
Normal,thentheappropriatetestofnormalityisperformed.
Spec Limits
WritingTforthetarget,LSL,andUSLforthelowerandupperspecificationlimits,andPfor
the*100thpercentile,thegeneralizedcapabilityindicesareasfollows:
P 0.5 LSL
C pl = ------------------------------------P 0.5 P 0.00135
USL P 0.5
C pu = ------------------------------------P 0.99865 P 0.5
USL LSL C p = ----------------------------------------------P 0.99865 P 0.00135
USL P0.5
P 0.5 LSL
C pk = min ------------------------------------- ,------------------------------------
P 0.5 P 0.00135 P 0.99865 P 0.5
1-- USL + LSL P 0.5
2
K = 2 --------------------------------------------------------USL LSL
T LSL
USL T
min ------------------------------------- ,------------------------------------P P
0.5
0.00135 P 0.99865 P 0.5
C pm = ---------------------------------------------------------------------------------------------T 2
1 + -------------
Chapter 3
Basic Analysis
Distributions
Statistical Details for the Distribution Platform
93
Ifthedataarenormallydistributed,theseformulasreducetotheformulasforstandard
capabilityindices.SeeTable 3.19.
Set Spec Limits for K Sigma
TypeaKvalueandselectonesidedortwosidedforyourcapabilityanalysis.Tail
probabilitiescorrespondingtoKstandarddeviationsarecomputedfromtheNormal
distribution.Theprobabilitiesareconvertedtoquantilesforthespecificdistributionthatyou
havefitted.Theresultingquantilesareusedforspecificationlimitsinthecapabilityanalysis.
ThisoptionissimilartotheQuantilesoption,butyouprovideKinsteadofprobabilities.K
correspondstothenumberofstandarddeviationsthatthespecificationlimitsareawayfrom
themean.
Forexample,foraNormaldistribution,whereK=3,the3standarddeviationsbelowand
abovethemeancorrespondtothe0.00135thquantileand0.99865thquantile,respectively.The
lowerspecificationlimitissetatthe0.00135thquantile,andtheupperspecificationlimitisset
atthe0.99865thquantileofthefitteddistribution.Acapabilityanalysisisreturnedbasedon
thosespecificationlimits.
94
Distributions
Statistical Details for the Distribution Platform
Chapter 3
Basic Analysis
Chapter 4
Introduction to Fit Y by X
Examine Relationships Between Two Variables
TheFitYbyXplatformanalyzesthepairofXandYvariablesthatyouspecify,bycontext,
basedonmodelingtype.
Herearethefourtypesofanalyses:
Bivariatefitting
Onewayanalysisofvariance
Logisticregression
Contingencytableanalysis
Figure 4.1ExamplesofFourTypesofAnalyses
Oneway
Logistic
Contingency
Nominal or
Ordinal Y
Continuous Y
Bivariate
Continuous X
Nominal or Ordinal X
Contents
OverviewoftheFitYbyXPlatform ................................................ 97
LaunchtheFitYbyXPlatform ..................................................... 97
LaunchSpecificAnalysesfromtheJMPStarterWindow............................ 98
Chapter 4
Basic Analysis
Introduction to Fit Y by X
Overview of the Fit Y by X Platform
Modeling Types
Description
Reference
Bivariate
ContinuousY
bycontinuousX
Analyzestherelationship
betweentwocontinuous
variables
SeeBivariate
Analysis.
Oneway
ContinuousY
bynominal
or ordinalX
Analyzeshowthedistribution
ofacontinuousYvariable
differsacrossgroupsdefined
byacategoricalXvariable
SeeOneway
Analysis.
Logistic
Nominalor
ordinalYby
continuousX
Fitstheprobabilitiesfor
responsecategoriestoa
continuousXpredictor
SeeLogistic
Analysis.
Contingency
Nominalor
ordinalYby
nominalor
ordinalX
Analyzesthedistributionofa
categoricalresponsevariableY
asconditionedbythevaluesof
acategoricalXfactor
SeeContingency
Analysis.
97
98
Introduction to Fit Y by X
Overview of the Fit Y by X Platform
Chapter 4
Basic Analysis
Table 4.1DescriptionoftheFitYbyXLaunchWindow
Bivariate, Oneway,
Logistic, Contingency
Thisgridshowswhichanalysisresultsfromthedifferent
combinationsofdatatypes.Onceyouhaveassignedyour
columns,theapplicableplatformappearsasalabelabovethe
grid.
Block
(Optional,forOnewayandContingencyonly):
FortheOnewayplatform,identifiesasecondfactor,
whichformsatwowayanalysiswithoutinteraction.The
datashouldbebalancedandhaveequalcountsineach
blockbygroupcell.IfyouspecifyaBlockvariable,the
datashouldbebalancedandhaveequalcountsineach
blockbygroupcell.Intheplot,thevaluesoftheY
variablearecenteredbytheBlockvariable.
FortheContingencyplatform,identifiesasecondfactor
andperformsaCochranMantelHaenszeltest.
Formoreinformationaboutlaunchwindows,seeUsingJMP.
Y Button
X Button
FitYbyX
Y,Response
X,Factor
Bivariate
Y,Response
X,Regressor
Oneway
Y,Response
X,Grouping
Logistic
Y,Categorical
Response
X,Continuous
Regressor
Chapter 4
Basic Analysis
Introduction to Fit Y by X
Overview of the Fit Y by X Platform
Y Button
X Button
Contingency
Y,Response
Category
X,Grouping
Category
99
100
Introduction to Fit Y by X
Overview of the Fit Y by X Platform
Chapter 4
Basic Analysis
Chapter 5
Bivariate Analysis
Examine Relationships between Two Continuous Variables
TheBivariateplatformshowstherelationshipbetweentwocontinuousvariables.Itisthe
continuousbycontinuouspersonalityoftheFitYbyXplatform.Thewordbivariatesimply
meansinvolvingtwovariablesinsteadofone(univariate)ormany(multivariate).
TheBivariateanalysisresultsappearinascatterplot.EachpointontheplotrepresentstheX
andYscoresforasinglesubject;inotherwords,eachpointrepresentstwovariables.Using
thescatterplot,youcanseeataglancethedegreeandpatternoftherelationshipbetweenthe
twovariables.Youcaninteractivelyaddothertypesoffits,suchassimplelinearregression,
polynomialregression,andsoon.
Figure 5.1ExampleofBivariateAnalysis
Contents
ExampleofBivariateAnalysis..................................................... 103
LaunchtheBivariatePlatform..................................................... 103
TheBivariatePlot ............................................................... 104
FittingCommandsandGeneralOptions ........................................... 105
FitMean ....................................................................... 108
FitLineandFitPolynomial ....................................................... 110
FitSpecial ...................................................................... 117
FitSpline ....................................................................... 119
KernelSmoother................................................................ 120
FitEachValue .................................................................. 121
FitOrthogonal.................................................................. 122
DensityEllipse .................................................................. 123
NonparDensity ................................................................. 125
FitRobust...................................................................... 126
HistogramBorders.............................................................. 126
GroupBy ...................................................................... 127
FittingMenus................................................................... 127
AdditionalExamplesoftheBivariatePlatform...................................... 131
StatisticalDetailsfortheBivariatePlatform ......................................... 139
Chapter 5
Basic Analysis
Bivariate Analysis
Example of Bivariate Analysis
103
Youcanseethattheverbalscoreswerehigherwhenasmallerpercentageofthepopulation
tookthetest.
TolaunchtheBivariateplatform,fromtheJMPStarterwindow,clickontheBasiccategory
andclickBivariate.
104
Bivariate Analysis
The Bivariate Plot
Chapter 5
Basic Analysis
Figure 5.3TheBivariateLaunchWindow
Forinformationaboutthislaunchwindow,seeIntroductiontoFitYbyXchapteron
page 95.
Note: AnyrowsthatareexcludedinthedatatablearealsohiddenintheBivariateplot.
Chapter 5
Basic Analysis
Bivariate Analysis
Fitting Commands and General Options
105
TheBivariatereportbeginswithaplotforeachpairofXandYvariables.Replacevariablesin
theplotbydragginganddroppingavariable,inoneoftwoways:swapexistingvariablesby
dragginganddroppingavariablefromoneaxistotheotheraxis;or,clickonavariableinthe
Columnspaneloftheassociateddatatableanddragitontoanaxis.
YoucaninteractwiththisplotjustasyoucanwithotherJMPplots(forexample,resizingthe
plot,highlightingpointswiththearroworbrushtool,andlabelingpoints).Fordetailsabout
thesefeatures,seetheUsingJMPbook.
Youcanfitcurvesontheplotandviewstatisticalreportsandadditionalmenususingthe
fittingcommandsthatarelocatedwithintheredtrianglemenu.SeeFittingCommandsand
GeneralOptionsonpage 105.
fitting
commands
106
Bivariate Analysis
Fitting Commands and General Options
Chapter 5
Basic Analysis
Table 5.1DescriptionsoftheGeneralOptions
Show Points
Hidesorshowsthepointsinthescatterplot.Acheckmarkindicates
thatpointsareshown.
Histogram Borders
Attacheshistogramstothexandyaxesofthescatterplot.Acheck
markindicatesthathistogrambordersareturnedon.See
HistogramBordersonpage 126.
Group By
Letsyouselectaclassification(orgrouping)variable.Aseparate
analysisiscomputedforeachlevelofthegroupingvariable,and
regressioncurvesorellipsesareoverlaidonthescatterplot.See
GroupByonpage 127.
Script
Containsoptionsthatareavailabletoallplatforms.Theseoptions
enableyoutoredotheanalysisorsavetheJSLcommandsforthe
analysistoawindoworafile.Formoreinformation,seeUsingJMP.
Eachfittingcommandaddsthefollowing:
aline,curve,ordistributiontothescatterplot
aredtrianglemenutothereportwindow
aspecificreporttothereportwindow
Figure 5.6ExampleoftheFitMeanFittingCommand
Chapter 5
Basic Analysis
Bivariate Analysis
Fitting Commands and General Options
107
Table 5.2DescriptionsoftheFittingCommands
Fit Mean
Addsahorizontallinetothescatterplotthat
representsthemeanoftheYresponse
variable.
SeeFitMeanon
page 108.
Fit Line
Addsstraightlinefitstoyourscatterplot
usingleastsquaresregression.
SeeFitLineandFit
Polynomialon
page 110.
Fit Polynomial
Fitspolynomialcurvesofacertaindegree
usingleastsquaresregression.
SeeFitLineandFit
Polynomialon
page 110.
Fit Special
TransformsYandX.Transformations
include:log,square root,square,reciprocal,
andexponential.Youcanalsoturnoff
centerpolynomials,constraintheintercept
andtheslope,andfitpolynomialmodels.
SeeFitSpecialon
page 117.
Fit Spline
Fitsasmoothingsplinethatvariesin
smoothness(orflexibility)accordingtothe
lambda()value.Thevalueisatuning
parameterinthesplineformula.
SeeFitSplineon
page 119.
Kernel Smoother
Producesacurveformedbyrepeatedly
findingalocallyweightedfitofasimple
curve(alineoraquadratic)atsampled
pointsinthedomain.
SeeKernelSmoother
onpage 120.
FitsavaluetoeachuniqueXvalue,which
canbecomparedtootherfittedlines,
showingtheconceptoflackoffit.
SeeFitEachValueon
page 121.
Fit Orthogonal
FitslinesthatadjustforvariabilityinXas
wellasY.
SeeFitOrthogonalon
page 122.
Density Ellipse
Drawsanellipsethatcontainsaspecified
massofpoints.
SeeDensityEllipseon
page 123.
Nonpar Density
Showspatternsinthepointdensity,which
isusefulwhenthescatterplotisso
darkenedbypointsthatitisdifficultto
distinguishpatterns.
SeeNonparDensityon
page 125.
Fit Robust
Attemptstoreducetheinfluenceofoutliers
inyourdataset.
SeeFitRobuston
page 126.
108
Bivariate Analysis
Fit Mean
Chapter 5
Basic Analysis
Description
Fitting Commands
RegressionFits
Regressionmethodsfitacurvethroughthepoints.
Thecurveisanequation(amodel)thatis
estimatedusingleastsquares,whichminimizes
thesumofsquareddifferencesfromeachpointto
theline(orcurve).Regressionfitsassumethatthe
Yvariableisdistributedasarandomscatterabove
andbelowalineoffit.
Fit Mean
Fit Line
Fit Polynomial
Fit Special
Fit Spline
Fit Each Value
Fit Orthogonal
Density
Estimation
Densityestimationfitsabivariatedistributionto
thepoints.Youcaneitherselectabivariatenormal
density,characterizedbyellipticalcontours,ora
generalnonparametricdensity.
Fit Mean
UsingtheFit Meancommand,youcanaddahorizontallinetothescatterplotthatrepresents
themeanoftheYresponsevariable.Youcanstartbyfittingthemeanandthenusethemean
lineasareferenceforotherfits(suchasstraightlines,confidencecurves,polynomialcurves,
andsoon).
Chapter 5
Basic Analysis
Bivariate Analysis
Fit Mean
109
Figure 5.7ExampleofFitMean
Fit Mean
Meanoftheresponsevariable.Thepredictedresponsewhenthereare
nospecifiedeffectsinthemodel.
StdDev
[RMSE]
Standarddeviationoftheresponsevariable.Squarerootofthemean
squareerror,alsocalledtherootmeansquareerror(orRMSE).
StdError
Standarddeviationoftheresponsemean.Calculatedbydividingthe
RMSEbythesquarerootofthenumberofvalues.
SSE
Errorsumofsquaresforthesimplemeanmodel.Appearsasthesumof
squaresforErrorintheanalysisofvariancetablesforeachmodelfit.
RelatedInformation
FittingMenusonpage 127
110
Bivariate Analysis
Fit Line and Fit Polynomial
Chapter 5
Basic Analysis
Figure 5.8showsanexamplethatcomparesalinearfittothemeanlineandtoadegree2
polynomialfit.
Notethefollowinginformation:
TheFit Lineoutputisequivalenttoapolynomialfitofdegree1.
TheFit Meanoutputisequivalenttoapolynomialfitofdegree0.
Chapter 5
Basic Analysis
Bivariate Analysis
Fit Line and Fit Polynomial
111
Figure 5.9ExampleofEquationsofFit
Note: Youcanedittheequationbyclickingonit.
EachLinearandPolynomialFitDegreereportcontainsatleastthreereports.Afourthreport,
LackofFit,appearsonlyifthereareXreplicatesinyourdata.
Summary of Fit Report
TheSummaryofFitreportsshowthenumericsummariesoftheresponseforthelinearfitand
polynomialfitofdegree2forthesamedata.YoucancomparemultipleSummaryofFit
reportstoseetheimprovementofonemodeloveranother,indicatedbyalargerRsquarevalue
andsmallerRoot Mean Square Error.
Figure 5.10SummaryofFitReportsforLinearandPolynomialFits
Table 5.4DescriptionoftheSummaryofFitReport
RSquare
Measurestheproportionofthevariationexplainedbythemodel.
Theremainingvariationisnotexplainedbythemodeland
attributedtorandomerror.TheRsquareis1ifthemodelfits
perfectly.
TheRsquarevaluesinFigure 5.10indicatethatthepolynomialfit
ofdegree2givesasmallimprovementoverthelinearfit.
SeeStatisticalDetailsfortheSummaryofFitReportonpage 140.
112
Bivariate Analysis
Fit Line and Fit Polynomial
Chapter 5
Basic Analysis
AdjuststheRsquare valuetomakeitmorecomparableover
modelswithdifferentnumbersofparametersbyusingthedegrees
offreedominitscomputation.
SeeStatisticalDetailsfortheSummaryofFitReportonpage 140.
RootMeanSquare
Error
Estimatesthestandarddeviationoftherandomerror.Itisthe
squarerootofthemeansquareforErrorintheAnalysisofVariance
report.SeeFigure 5.12.
MeanofResponse
Providesthesamplemean(arithmeticaverage)oftheresponse
variable.Thisisthepredictedresponsewhennomodeleffectsare
specified.
Observations
Providesthenumberofobservationsusedtoestimatethefit.If
thereisaweightvariable,thisisthesumoftheweights.
Thedifferencebetweentheresidualerrorfromthemodelandthepureerroriscalledthelack
offiterror.Thelackoffiterrorcanbesignificantlygreaterthanthepureerrorifyouhavethe
wrongfunctionalformoftheregressor.Inthatcase,youshouldtryadifferenttypeofmodel
fit.TheLackofFitreporttestswhetherthelackoffiterroriszero.
Chapter 5
Basic Analysis
Bivariate Analysis
Fit Line and Fit Polynomial
113
Table 5.5DescriptionoftheLackofFitReport
Source
DF
Thedegreesoffreedom(DF)foreachsourceoferror.
Sumof
Squares
TheTotal ErrorDFisthedegreesoffreedomfoundontheErrorlineof
theAnalysisofVariancetable(shownundertheAnalysisofVariance
Reportonpage 114).ItisthedifferencebetweentheTotalDFandthe
ModelDFfoundinthattable.TheErrorDFispartitionedintodegrees
offreedomforlackoffitandforpureerror.
ThePure ErrorDFispooledfromeachgroupwheretherearemultiple
rowswiththesamevaluesforeacheffect.SeeStatisticalDetailsfor
theLackofFitReportonpage 141.
Thesumofsquares(SSforshort)foreachsourceoferror.
TheTotal ErrorSSisthesumofsquaresfoundontheErrorlineofthe
correspondingAnalysisofVariancetable,shownunderAnalysisof
VarianceReportonpage 114.
ThePure ErrorSSispooledfromeachgroupwheretherearemultiple
rowswiththesamevalueforthexvariable.Thisestimatestheportion
ofthetruerandomerrorthatisnotexplainedbymodelxeffect.See
StatisticalDetailsfortheLackofFitReportonpage 141.
MeanSquare
Thesumofsquaresdividedbyitsassociateddegreesoffreedom.This
computationconvertsthesumofsquarestoanaverage(meansquare).
Fratiosforstatisticaltestsaretheratiosofmeansquares.
FRatio
TheratioofmeansquareforlackoffittomeansquareforPureError.It
teststhehypothesisthatthelackoffiterroriszero.
Prob>F
TheprobabilityofobtainingagreaterFvaluebychancealoneifthe
variationduetolackoffitvarianceandthepureerrorvariancearethe
same.Ahighpvaluemeansthatthereisnotasignificantlackoffit.
114
Bivariate Analysis
Fit Line and Fit Polynomial
Chapter 5
Basic Analysis
ThemaximumR2thatcanbeachievedbyamodelusingonlythe
variablesinthemodel.
SeeStatisticalDetailsfortheLackofFitReportonpage 141.
Table 5.6DescriptionoftheAnalysisofVarianceReport
Source
Thethreesourcesofvariation:Model,Error,andC. Total.
Chapter 5
Basic Analysis
Bivariate Analysis
Fit Line and Fit Polynomial
115
Sumof
Squares
MeanSquare
Thedegreesoffreedom(DF)foreachsourceofvariation:
Adegreeoffreedomissubtractedfromthetotalnumberofnon
missingvalues(N)foreachparameterestimateusedinthe
computation.Thecomputationofthetotalsamplevariationusesan
estimateofthemean.Therefore,onedegreeoffreedomissubtracted
fromthetotal,leaving49.Thetotalcorrecteddegreesoffreedomare
partitionedintotheModelandErrorterms.
Onedegreeoffreedomfromthetotal(shownontheModelline)is
usedtoestimateasingleregressionparameter(theslope)forthe
linearfit.Twodegreesoffreedomareusedtoestimatetheparameters
( 1 and 2 )forapolynomialfitofdegree2.
TheErrordegreesoffreedomisthedifferencebetweenC. Totaldfand
Modeldf.
Thesumofsquares(SSforshort)foreachsourceofvariation:
Inthisexample,thetotal(C. Total)sumofsquareddistancesofeach
responsefromthesamplemeanis57,278.157,asshowninFigure 5.12.
Thatisthesumofsquaresforthebasemodel(orsimplemeanmodel)
usedforcomparisonwithallothermodels.
Forthelinearregression,thesumofsquareddistancesfromeach
pointtothelineoffitreducesfrom12,012.733.Thisistheresidualor
unexplained(Error) SSafterfittingthemodel.TheresidualSSfora
seconddegreepolynomialfitis6,906.997,accountingforslightly
morevariationthanthelinearfit.Thatis,themodelaccountsformore
variationbecausethemodelSSarehigherfortheseconddegree
polynomialthanthelinearfit.TheC. totalSSlesstheErrorSSgives
thesumofsquaresattributedtothemodel.
Thesumofsquaresdividedbyitsassociateddegreesoffreedom.The
Fratioforastatisticaltestistheratioofthefollowingmeansquares:
TheModelmeansquareforthelinearfitis45,265.4.Thisvalue
estimatestheerrorvariance,butonlyunderthehypothesisthatthe
modelparametersarezero.
TheErrormeansquareis245.2.Thisvalueestimatestheerror
variance.
116
Bivariate Analysis
Fit Line and Fit Polynomial
Chapter 5
Basic Analysis
Themodelmeansquaredividedbytheerrormeansquare.The
underlyinghypothesisofthefitisthatalltheregressionparameters
(excepttheintercept)arezero.Ifthishypothesisistrue,thenboththe
meansquareforerrorandthemeansquareformodelestimatetheerror
variance,andtheirratiohasanFdistribution.Ifaparameterisa
significantmodeleffect,theFratioisusuallyhigherthanexpectedby
chancealone.
Prob>F
Theobservedsignificanceprobability(pvalue)ofobtainingagreater
Fvaluebychancealoneifthespecifiedmodelfitsnobetterthanthe
overallresponsemean.Observedsignificanceprobabilitiesof0.05orless
areoftenconsideredevidenceofaregressioneffect.
Table 5.7DescriptionoftheParameterEstimatesReport
Term
Liststhenameofeachparameterintherequestedmodel.Theinterceptisa
constantterminallmodels.
Estimate
Liststheparameterestimatesofthelinearmodel.Thepredictionformulaisthe
linearcombinationoftheseestimateswiththevaluesoftheircorresponding
variables.
StdError
Liststheestimatesofthestandarderrorsoftheparameterestimates.Theyare
usedinconstructingtestsandconfidenceintervals.
Chapter 5
Basic Analysis
Bivariate Analysis
Fit Special
117
Liststheteststatisticsforthehypothesisthateachparameteriszero.Itisthe
ratiooftheparameterestimatetoitsstandarderror.Ifthehypothesisistrue,
thenthisstatistichasaStudentstdistribution.
Prob>|t|
Liststheobservedsignificanceprobabilitycalculatedfromeachtratio.Itisthe
probabilityofgetting,bychancealone,atratiogreater(inabsolutevalue)than
thecomputedvalue,givenatruenullhypothesis.Often,avaluebelow0.05(or
sometimes0.01)isinterpretedasevidencethattheparameterissignificantly
differentfromzero.
Torevealadditionalstatistics,rightclickinthereportandselecttheColumns menu.Statistics
notshownbydefaultareasfollows:
Lower 95% Thelowerendpointofthe95%confidenceintervalfortheparameterestimate.
Upper 95%
Theupperendpointofthe95%confidenceintervalfortheparameterestimate.
variablesthataremeasuredondifferentscales.SeeStatisticalDetailsfortheParameter
EstimatesReportonpage 141.
VIF
Thevarianceinflationfactor.
Thedesignstandarderrorfortheparameterestimate.SeeStatistical
DetailsfortheParameterEstimatesReportonpage 141.
RelatedInformation
StatisticalDetailsforFitLineonpage 139
FittingMenusonpage 127
Fit Special
UsingtheFit Specialcommand,youcantransformYandX.Transformationsincludethe
following:log,square root,square,reciprocal,andexponential.Youcanalsoconstrainthe
slopeandintercept,fitapolynomialofspecificdegree,andcenterthepolynomial.
Table 5.8DescriptionoftheSpecifyTransformationorConstraintWindow
Y Transformation
UsetheseoptionstotransformtheYvariable.
X Transformation
UsetheseoptionstotransformtheXvariable.
Degree
Usethisoptiontofitapolynomialofthespecifieddegree.
118
Bivariate Analysis
Fit Special
Chapter 5
Basic Analysis
Centered Polynomial
transformationsoftheXvariable,polynomialcenteringisnot
performed.Centeringpolynomialsstabilizestheregression
coefficientsandreducesmulticollinearity.
Constrain Intercept to
Selectthischeckboxtoconstrainthemodelintercepttobethe
specifiedvalue.
Constrain Slope to
Selectthischeckboxtoconstrainthemodelslopetobethe
specifiedvalue.
Yes
No
Transformed
Fit Report
and menu
Degree?
2-5
Linear Fit
Report and
menu
Polynomial
Fit Report
and menu
Chapter 5
Basic Analysis
Bivariate Analysis
Fit Spline
119
RelatedInformation
ExampleoftheFitSpecialCommandonpage 132
LinearFitandPolynomialFitReportsonpage 110
FittingMenusonpage 127
Fit Spline
UsingtheFit Splinecommand,youcanfitasmoothingsplinethatvariesinsmoothness(or
flexibility)accordingtothelambda()value.Thelambdavalueisatuningparameterinthe
splineformula.Asthevalueofdecreases,theerrortermofthesplinemodelhasmore
weightandthefitbecomesmoreflexibleandcurved.Asthevalueofincreases,thefit
becomesstiff(lesscurved),approachingastraightline.
Notethefollowinginformation:
ThesmoothingsplinecanhelpyouseetheexpectedvalueofthedistributionofYacrossX.
Thepointsclosesttoeachpieceofthefittedcurvehavethemostinfluenceonit.The
influenceincreasesasyoulowerthevalueof,producingahighlyflexiblecurve.
Youmightfindithelpfultotryseveralvalues.YoucanusetheLambdasliderbeneath
theSmoothingSplinereporttoexperimentwithdifferentvalues.However,isnot
invarianttothescalingofthedata.Forexample,thevalueforanXmeasuredininches,is
notthesameasthevalueforanXmeasuredincentimeters.
Measurestheproportionofvariationaccountedforbythe
smoothingsplinemodel.Formoreinformation,seeStatistical
DetailsfortheSmoothingFitReportsonpage 142.
SumofSquares
Error
Sumofsquareddistancesfromeachpointtothefittedspline.Itis
theunexplainederror(residual)afterfittingthesplinemodel.
120
Bivariate Analysis
Kernel Smoother
Chapter 5
Basic Analysis
Enablesyoutochangethevalue,eitherbyenteringanumber,or
bymovingtheslider.
RelatedInformation
FittingMenusonpage 127
StatisticalDetailsforFitSplineonpage 139
Kernel Smoother
TheKernel Smoothercommandproducesacurveformedbyrepeatedlyfindingalocally
weightedfitofasimplecurve(alineoraquadratic)atsampledpointsinthedomain.The
manylocalfits(128intotal)arecombinedtoproducethesmoothcurveovertheentire
domain.ThismethodisalsocalledLoessorLowess,whichwasoriginallyanacronymfor
LocallyWeightedScatterplotSmoother.SeeCleveland(1979).
Usethismethodtoquicklyseetherelationshipbetweenvariablesandtohelpyoudetermine
thetypeofanalysisorfittoperform.
Measurestheproportionofvariationaccountedforbythekernel
smoothermodel.Formoreinformation,seeStatisticalDetailsfor
theSmoothingFitReportsonpage 142.
SumofSquares
Error
Sumofsquareddistancesfromeachpointtothefittedkernel
smoother.Itistheunexplainederror(residual)afterfittingthekernel
smoothermodel.
LocalFit(lambda)
Selectthepolynomialdegreeforeachlocalfit.Quadratic
polynomialscantracklocalbumpinessmoresmoothly.Lambdais
thedegreeofcertainpolynomialsthatarefittedbythemethod.
Lambdacanbe1or2.
Chapter 5
Basic Analysis
Bivariate Analysis
Fit Each Value
121
Specifyhowtoweightthedataintheneighborhoodofeachlocalfit.
Loessusestricube.Theweightfunctiondeterminestheinfluence
thateachxiandyihasonthefittingoftheline.Theinfluence
decreasesasxiincreasesindistancefromxandfinallybecomes
zero.
Smoothness(alpha)
Controlshowmanypointsarepartofeachlocalfit.Usetheslideror
typeinavaluedirectly.Alphaisasmoothingparameter.Itcanbe
anypositivenumber,buttypicalvaluesare1/4to1.Asalpha
increases,thecurvebecomessmoother.
Robustness
Reweightsthepointstodeemphasizepointsthatarefartherfrom
thefittedcurve.Specifythenumberoftimestorepeattheprocess
(numberofpasses).Thegoalistoconvergethecurveand
automaticallyfilteroutoutliersbygivingthemsmallweights.
RelatedInformation
FittingMenusonpage 127
Givesthetotalnumberofobservations.
NumberofUniqueValues
GivesthenumberofuniqueXvalues.
DegreesofFreedom
Givesthepureerrordegreesoffreedom.
SumofSquares
Givesthepureerrorsumofsquares.
MeanSquare
Givesthepureerrormeansquare.
RelatedInformation
FittingMenusonpage 127
122
Bivariate Analysis
Fit Orthogonal
Chapter 5
Basic Analysis
Fit Orthogonal
TheFit OrthogonalcommandfitslinesthatadjustforvariabilityinXaswellasY.
Usestheunivariatevarianceestimatescomputedfromthe
samplesofXandY.Thisturnsouttobethestandardizedfirst
principalcomponent.Thisoptionisnotagoodchoiceina
measurementsystemsapplicationsincetheerrorvariancesare
notlikelytobeproportionaltothepopulationvariances.
Equal Variances
Uses1asthevarianceratio,whichassumesthattheerror
variancesarethesame.Usingequalvariancesisequivalenttothe
nonstandardizedfirstprincipalcomponentline.Supposethat
thescatterplotisscaledthesameintheXandYdirections.When
youshowanormaldensityellipse,youseethatthislineisthe
longestaxisoftheellipse.
Fit X to Y
Usesavarianceratioofzero,whichindicatesthatYeffectively
hasnovariance.
Letsyouenteranyratiothatyouwant,givingyoutheabilityto
makeuseofknowninformationaboutthemeasurementerrorin
XandresponseerrorinY.
Givesthenamesofthevariablesusedtofittheline.
Mean
Givesthemeanofeachvariable.
StdDev
Givesthestandarddeviationofeachvariable.
VarianceRatio
Givesthevarianceratiousedtofittheline.
Chapter 5
Basic Analysis
Bivariate Analysis
Density Ellipse
123
Givesthecorrelationbetweenthetwovariables.
Intercept
Givestheinterceptofthefittedline.
Slope
Givestheslopeofthefittedline.
LowerCL
Givesthelowerconfidencelimitfortheslope.
UpperCL
Givestheupperconfidencelimitfortheslope.
Alpha
Enterthealphalevelusedincomputingtheconfidenceinterval.
RelatedInformation
FittingMenusonpage 127
StatisticalDetailsforFitOrthogonalonpage 139
ExampleUsingtheFitOrthogonalCommandonpage 134
Density Ellipse
UsingtheDensity Ellipseoption,youcandrawanellipse(orellipses)thatcontainsthe
specifiedmassofpoints.Thenumberofpointsisdeterminedbytheprobabilitythatyouselect
fromtheDensity Ellipsemenu).
Figure 5.15ExampleofDensityEllipses
ThedensityellipsoidiscomputedfromthebivariatenormaldistributionfittotheXandY
variables.Thebivariatenormaldensityisafunctionofthemeansandstandarddeviationsof
theXandYvariablesandthecorrelationbetweenthem.TheOtherselectionletsyouspecify
anyprobabilitygreaterthanzeroandlessthanorequaltoone.
124
Bivariate Analysis
Density Ellipse
Chapter 5
Basic Analysis
Theseellipsesarebothdensitycontoursandconfidencecurves.Asconfidencecurves,they
showwhereagivenpercentageofthedataisexpectedtolie,assumingthebivariatenormal
distribution.
Thedensityellipsoidisagoodgraphicalindicatorofthecorrelationbetweentwovariables.
Theellipsoidcollapsesdiagonallyasthecorrelationbetweenthetwovariablesapproaches
either1or1.Theellipsoidismorecircular(lessdiagonallyoriented)ifthetwovariablesare
lesscorrelated.
Correlation Report
TheCorrelationreportthataccompanieseachDensity Ellipsefitshowsthecorrelation
coefficientfortheXandYvariables.
Note: Toseeamatrixofellipsesandcorrelationsformanypairsofvariables,usethe
MultivariatecommandintheAnalyze > Multivariate Methods menu.
Table 5.13DescriptionoftheCorrelationReport
Variable
Givesthenamesofthevariablesusedincreatingtheellipse
Mean
GivestheaverageofboththeXandYvariable.
StdDev
GivesthestandarddeviationofboththeXandYvariable.
AdiscussionofthemeanandstandarddeviationareinthesectionThe
SummaryStatisticsReportonpage 41intheDistributionschapter.
Correlation
ThePearsoncorrelationcoefficient.Ifthereisanexactlinearrelationship
betweentwovariables,thecorrelationis1or1dependingonwhetherthe
variablesarepositivelyornegativelyrelated.Ifthereisnorelationship,the
correlationtendstowardzero.
Formoreinformation,seeStatisticalDetailsfortheCorrelationReporton
page 142.
Signif.Prob
Probabilityofobtaining,bychancealone,acorrelationwithgreaterabsolute
valuethanthecomputedvalueifnolinearrelationshipexistsbetweentheX
andYvariables.
Number
Givesthenumberofobservationsusedinthecalculations.
RelatedInformation
FittingMenusonpage 127
ExampleofGroupByUsingDensityEllipsesonpage 137
Chapter 5
Basic Analysis
Bivariate Analysis
Nonpar Density
125
Nonpar Density
Whenaplotshowsthousandsofpoints,themassofpointscanbetoodarktoshowpatternsin
density.UsingtheNonpar Density(nonparametricdensity)optionmakesiteasiertoseethe
patterns.
Bivariatedensityestimationmodelsasmoothsurfacethatdescribeshowdensethedata
pointsareateachpointinthatsurface.Theplotaddsasetofcontourlinesshowingthe
density(Figure 5.16).Thecontourlinesarequantilecontoursin5%intervals.Thismeansthat
about5%ofthepointsarebelowthelowestcontour,10%arebelowthenextcontour,andso
on.Thehighestcontourhasabout95%ofthepointsbelowit.
Figure 5.16ExampleofNonparDensity
Youcanchangethesizeofanonparametricdensitycontourgridtocreatesmoothercontours.
Thedefaultvalueis51points,whichcancreatejaggedcontoursarounddensepoints.
PressShiftandselectNonpar DensityfromtheBivariateredtrianglemenu.Enteralargervalue
thanthedefault51points.
FittingMenusonpage 127
126
Bivariate Analysis
Fit Robust
Chapter 5
Basic Analysis
Fit Robust
Note: Formoredetailsaboutrobustfitting,seeHuber,1973.
TheFit Robust optionattemptstoreducetheinfluenceofoutliersinyourdataset.Inthis
instance,outliersareanobservationthatdoesnotcomefromthetrueunderlying
distributionofthedata.Forexample,ifweightmeasurementswerebeingtakeninpoundsfor
asampleofindividuals,butoneoftheindividualsaccidentallyrecordedtheirweightin
kilogramsinsteadofpounds,thiswouldbeadeviationfromthetruedistributionofthedata.
Outlierssuchasthiscouldleadyouintomakingincorrectdecisionsbecauseoftheirinfluence
onthedata.TheFitRobust optionreducestheinfluenceofthesetypesofoutliers.
RelatedInformation
FittingMenusonpage 127
ExampleUsingtheFitRobustCommandonpage 135
Histogram Borders
TheHistogram Bordersoptionappendshistogramstothexandyaxesofthescatterplot.You
canusethehistogramstovisualizethemarginaldistributionsoftheXandYvariables.
Figure 5.17ExampleofHistogramBorders
Chapter 5
Basic Analysis
Bivariate Analysis
Group By
127
Group By
UsingtheGroup Byoption,youcanselectaclassification(grouping)variable.Whena
groupingvariableisineffect,theBivariateplatformcomputesaseparateanalysisforeach
levelofthegroupingvariable.Regressioncurvesorellipsesthenappearonthescatterplot.
Thefitforeachlevelofthegroupingvariableisidentifiedbeneaththescatterplot,with
individualpopupmenustosaveorremovefittinginformation.
TheGroup ByoptionischeckedintheFittingmenuwhenagroupingvariableisineffect.You
canchangethegroupingvariablebyfirstselectingtheGroup Byoptiontoremove(uncheck)
theexistingvariable.Then,selecttheGroup Byoptionagainandrespondtoitswindowas
before.
YoumightusetheGroupByoptioninthesedifferentways:
Anoverlayoflinearregressionlinesletsyoucompareslopesvisually.
Anoverlayofdensityellipsescanshowclustersofpointsbylevelsofagroupingvariable.
RelatedInformation
ExampleofGroupByUsingDensityEllipsesonpage 137
ExampleofGroupByUsingRegressionLinesonpage 138
Fitting Menus
Inadditiontoareport,eachfittingcommandaddsafittingmenutothereportwindow.The
followingtableshowsthefittingmenusthatcorrespondtoeachfittingcommand.
Fitting Command
Fitting Menu
Fit Mean
FitMean
Fit Line
LinearFit
Fit Polynomial
PolynomialFitDegree=X*
Fit Special
LinearFit
PolynomialFitDegree=X*
TransformedFitX*
ConstrainedFits
Fit Spline
SmoothingSplineFit,lambda=X*
128
Bivariate Analysis
Fitting Menus
Chapter 5
Basic Analysis
Fitting Command
Fitting Menu
Kernel Smoother
LocalSmoother
FitEachValue
Fit Orthogonal
OrthogonalFitRatio=X*
Density Ellipse
BivariateNormalEllipseP=X*
Nonpar Density
QuantileDensityColors
Fit Robust
RobustFit
*X=variablecharacterornumber
Displaysorhidestheconfidencelimitsfortheexpectedvalue
(mean).ThisoptionisnotavailablefortheFitSpline,Density
Ellipse,FitEachValue,andFitOrthogonalfitsandisdimmed
onthosemenus.
Displaysorhidestheconfidencelimitsforanindividual
predictedvalue.Theconfidencelimitsreflectvariationinthe
errorandvariationintheparameterestimates.Thisoptionis
notavailablefortheFitMean,FitSpline,DensityEllipse,Fit
EachValue,andFitOrthogonalfitsandisdimmedonthose
menus.
Line Color
Letsyouselectfromapaletteofcolorsforassigningacolorto
eachfit.
Line of Fit
Displaysorhidesthelineoffit.
Line Style
Letsyouselectfromthepaletteoflinestylesforeachfit.
Line Width
Givesthreelinewidthsforthelineoffit.Thedefaultline
widthisthethinnestline.
Report
Turnsthefitstextreportonandoff.
Chapter 5
Basic Analysis
Save Predicteds
Bivariate Analysis
Fitting Menus
129
Createsanewcolumninthecurrentdatatablecalled
Predicted colnamewherecolnameisthenameoftheY
variable.Thiscolumnincludesthepredictionformulaandthe
computedsamplepredictedvalues.Thepredictionformula
computesvaluesautomaticallyforrowsthatyouaddtothe
table.ThisoptionisnotavailablefortheFitEachValueand
DensityEllipsefitsandisdimmedonthosemenus.
Note: YoucanusetheSave PredictedsandSave Residuals
commandsforeachfit.Ifyouusethesecommandsmultiple
timesorwithagroupingvariable,itisbesttorenamethe
resultingcolumnsinthedatatabletoreflecteachfit.
Save Residuals
Createsanewcolumninthecurrentdatatablecalled
Residuals colnamewherecolnameisthenameoftheY
variable.Eachvalueisthedifferencebetweentheactual
(observed)valueanditspredictedvalue.UnliketheSave
Predictedscommand,thiscommanddoesnotcreatea
formulainthenewcolumn.Thisoptionisnotavailableforthe
FitEachValueandDensityEllipsefitsandisdimmedon
thosemenus.
Note: YoucanusetheSave PredictedsandSave Residuals
commandsforeachfit.Ifyouusethesecommandsmultiple
timesorwithagroupingvariable,itisbesttorenamethe
resultingcolumnsinthedatatabletoreflecteachfit.
Remove Fit
Removesthefitfromthegraphandremovesitstextreport.
Linear Fits, Polynomial Fits, and Fit Special, and Fit Robust Only:
Mean Confidence Limit
Formula
Createsanewcolumninthedatatablecontainingaformula
forthemeanconfidenceintervals.
Createsanewcolumninthedatatablecontainingaformula
fortheindividualconfidenceintervals.
Plot Residuals
Producesfourdiagnosticplots:residualbypredicted,actual
bypredicted,residualbyrow,andanormalquantileplotof
theresiduals.SeeDiagnosticsPlotsonpage 131.
130
Bivariate Analysis
Fitting Menus
Chapter 5
Basic Analysis
Promptsyoutoenterthealphaleveltocomputeanddisplay
confidencelevelsforlinefits,polynomialfits,andspecialfits.
Save Coefficients
Savesthesplinecoefficientsasanewdatatable,withcolumns
calledX,A,B,C,andD.TheXcolumngivestheknotpoints.A,
B,C,andDaretheintercept,linear,quadratic,andcubic
coefficientsofthethirddegreepolynomial.Thesecoefficients
spanfromthecorrespondingvalueintheXcolumntothenext
highestvalue.
Shaded Contour
Shadestheareainsidethedensityellipse.
Selectsthepointsinsidetheellipse.
Selectsthepointsoutsidetheellipse.
Kernel Control
Displaysasliderforeachvariable,whereyoucanchangethe
kernelstandarddeviationthatdefinestherangeofXandY
valuesfordeterminingthedensityofcontourlines.
5% Contours
Showsorhidesthe5%contourlines.
Contour Lines
Showsorhidesthecontourlines.
Contour Fill
Fillstheareasbetweenthecontourlines.
Selectspointsthatfallinauserspecifiedquantilerange.
Colorsthepointsaccordingtodensity.
Createsanewcolumncontainingthedensityquantileeach
pointisin.
Mesh Plot
Isathreedimensionalplotofthedensityoveragridofthe
twoanalysisvariables.SeeFigure 5.18.
Model Clustering
Createsanewcolumninthecurrentdatatableandfillsitwith
clustervalues.
Note: Ifyousavethemodalclusteringvaluesfirstandthen
savethedensitygrid,thegridtablealsocontainsthecluster
values.Theclustervaluesareusefulforcoloringandmarking
pointsinplots.
Chapter 5
Basic Analysis
Bivariate Analysis
Additional Examples of the Bivariate Platform
131
Savesthedensityestimatesandthequantilesassociatedwith
theminanewdatatable.Thegriddatacanbeusedto
visualizethedensityinotherways,suchaswiththe
Scatterplot 3DortheContourPlotplatforms.
Figure 5.18ExampleofaMeshPlot
Diagnostics Plots
ThePlot Residualsoptioncreatesresidualplotsandotherplotstodiagnosethemodelfit.The
followingplotsareavailable:
isaplotoftheresidualsvs.thepredictedvalues.Ahistogramof
theresidualsisalsocreated.
isaplotoftheresidualvaluesvs.theXvariable.
isaNormalquantileplotoftheresiduals.
132
Bivariate Analysis
Additional Examples of the Bivariate Platform
Chapter 5
Basic Analysis
6. FromtheredtrianglemenuforBivariateFit,selectFit Special.TheSpecifyTransformation
orConstraintwindowappears.Foradescriptionofthiswindow,seeTable 5.8.
Figure 5.20TheSpecifyTransformationorConstraintWindow
Chapter 5
Basic Analysis
Bivariate Analysis
Additional Examples of the Bivariate Platform
133
Figure 5.21ExampleofFitSpecialReport
Figure 5.21showsthefittedlineplottedontheoriginalscale.Themodelappearstofitthedata
well,astheplottedlinegoesthroughthecloudofpoints.
134
Bivariate Analysis
Additional Examples of the Bivariate Platform
Chapter 5
Basic Analysis
Equal Variances
Fit X to Y
Chapter 5
Basic Analysis
Bivariate Analysis
Additional Examples of the Bivariate Platform
135
Figure 5.22ExampleofOrthogonalFittingOptions
Fit X to Y
Fit Line
ThescatterplotinFigure 5.22showsthestandardizedheightandweightvalueswithvarious
linefitsthatillustratethebehavioroftheorthogonalvarianceratioselections.Thestandard
linearregression(Fit Line)occurswhenthevarianceoftheXvariableisconsideredtobevery
small.Fit X to Yistheoppositeextreme,whenthevariationoftheYvariableisignored.All
otherlinesfallbetweenthesetwoextremesandshiftasthevarianceratiochanges.Asthe
varianceratioincreases,thevariationintheYresponsedominatesandtheslopeofthefitted
lineshiftsclosertotheYbyXfit.Likewise,whenyoudecreasetheratio,theslopeoftheline
shiftsclosertotheXbyYfit.
136
Bivariate Analysis
Additional Examples of the Bivariate Platform
Chapter 5
Basic Analysis
Figure 5.23ExampleofRobustFit
IfyoulookatthestandardAnalysisofVariancereport,youmightwronglyconcludethat
heightandweightdonothavealinearrelationship,sincethepvalueis0.1203.However,
whenyoulookattheRobustFitreport,youwouldprobablyconcludethattheydohavea
linearrelationship,becausethepvaluethereis0.0489.Itappearsthatsomeofthe
measurementsareunusuallylow,perhapsduetoincorrectuserinput.Thesemeasurements
wereundulyinfluencingtheanalysis.
Chapter 5
Basic Analysis
Bivariate Analysis
Additional Examples of the Bivariate Platform
137
TheellipsesinFigure 5.24showclearlyhowthedifferenttypesofhotdogsclusterwith
respecttothecostvariables.
138
Bivariate Analysis
Additional Examples of the Bivariate Platform
Chapter 5
Basic Analysis
ThescatterplottotheleftinFigure 5.25hasasingleregressionlinethatrelatesweightto
height.Thescatterplottotherightshowsseparateregressionlinesformalesandfemales.
Chapter 5
Basic Analysis
Bivariate Analysis
Statistical Details for the Bivariate Platform
139
yi =
j xi + i
j=0
y distance
orthogonal
distance
x distance
140
Bivariate Analysis
Statistical Details for the Bivariate Platform
Chapter 5
Basic Analysis
ThefitrequiresthatyouspecifytheratioofthevarianceoftheerrorinYtotheerrorinX.This
isthevarianceoftheerror,notthevarianceofthesamplepoints,soyoumustchoosecarefully.
Theratio 2y 2x isinfiniteinstandardleastsquaresbecause 2x iszero.Ifyoudoan
orthogonalfitwithalargeerrorratio,thefittedlineapproachesthestandardleastsquaresline
offit.Ifyouspecifyaratioofzero,thefitisequivalenttotheregressionofXonY,insteadofY
onX.
Themostcommonuseofthistechniqueisincomparingtwomeasurementsystemsthatboth
haveerrorsinmeasuringthesamevalue.Thus,theYresponseerrorandtheXmeasurement
errorareboththesametypeofmeasurementerror.Wheredoyougetthemeasurementerror
variances?Youcannotgetthemfrombivariatedatabecauseyoucannottellwhich
measurementsystemproduceswhatproportionoftheerror.So,youeithermustblindly
assumesomeratiolike1,oryoumustrelyonseparaterepeatedmeasurementsofthesame
unitbythetwomeasurementsystems.
AnadvantagetothisapproachisthatthecomputationsgiveyoupredictedvaluesforbothY
andX;thepredictedvaluesarethepointonthelinethatisclosesttothedatapoint,where
closenessisrelativetothevarianceratio.
ConfidencelimitsarecalculatedasdescribedinTanandIglewicz(1999).
Chapter 5
Basic Analysis
Bivariate Analysis
Statistical Details for the Bivariate Platform
141
DF p =
ni 1
i=1
niisthenumberofsubjectsintheithgroup.
PureErrorSS
ForthePureErrorSS,ingeneral,ifthereareggroupshavingmultiplerowswiththesamex
value,thepooledSS,denotedSSp,iswrittenasfollows:
g
SS p =
SS i
i=1
whereSSiisthesumofsquaresfortheithgroupcorrectedforitsmean.
MaxRSq
BecausePure Errorisinvarianttotheformofthemodelandistheminimumpossiblevariance,
Max RSqiscalculatedasfollows:
SS Pureerror
1 ---------------------------------------------------------------------SS Totalforwholemodel
s x s y
where istheestimatedparameter,sxandsyarethestandarddeviationsoftheXandY
variables.
DesignStdError
DesignStdErroriscalculatedasthestandarderroroftheparameterestimatedividedbythe
RMSE.
142
Bivariate Analysis
Statistical Details for the Bivariate Platform
Chapter 5
Basic Analysis
s xy
2
wi xi xi yi y i r xy = -------------- where s xy = ---------------------------------------------------df
2 2
sx sy
Where w i iseithertheweightoftheithobservationifaweightcolumnisspecified,or1ifno
weightcolumnisassigned.
Chapter 6
Oneway Analysis
Examine Relationships between a Continuous Y and a Categorical X
Variable
UsingtheOnewayorFitYbyXplatform,youcanexplorehowthedistributionofa
continuousYvariablediffersacrossgroupsdefinedbyasinglecategoricalXvariable.For
example,youmightwanttofindouthowdifferentcategoriesofthesametypeofdrug(X)
affectpatientpainlevelsonanumberedscale(Y).
TheOnewayplatformisthecontinuousbynominalorordinalpersonalityoftheFitYbyX
platform.Theanalysisresultsappearinaplot,andyoucaninteractivelyaddadditional
analyses,suchasthefollowing:
aonewayanalysisofvariancetofitmeansandtotestthattheyareequal
nonparametrictests
atestforhomogeneityofvariance
multiplecomparisontestsonmeans,withmeanscomparisoncircles
outlierboxplotsoverlaidoneachgroup
powerdetailsfortheonewaylayout
Figure 6.1OnewayAnalysis
Contents
OverviewofOnewayAnalysis .................................................... 145
ExampleofOnewayAnalysis ..................................................... 145
LaunchtheOnewayPlatform ..................................................... 147
TheOnewayPlot ................................................................ 147
OnewayPlatformOptions ........................................................ 148
Quantiles....................................................................... 155
Means/AnovaandMeans/Anova/Pooledt.......................................... 156
AnalysisofMeansMethods ...................................................... 163
CompareMeans................................................................. 166
Nonparametric.................................................................. 171
UnequalVariances .............................................................. 176
EquivalenceTest ................................................................ 179
RobustFit...................................................................... 180
Power ......................................................................... 180
NormalQuantilePlot ............................................................ 182
CDFPlot ....................................................................... 182
Densities ....................................................................... 183
MatchingColumn ............................................................... 183
AdditionalExamplesoftheOnewayPlatform....................................... 184
StatisticalDetailsfortheOnewayPlatform ......................................... 202
Chapter 6
Basic Analysis
Oneway Analysis
Overview of Oneway Analysis
145
Younoticethatonedrug(A)hasconsistentlylowerscoresthantheotherdrugs.Youalso
noticethatthexaxisticksareunequallyspaced.Thelengthbetweentheticksisproportional
tothenumberofscores(observations)foreachdrug.
146
Oneway Analysis
Example of Oneway Analysis
Chapter 6
Basic Analysis
Performananalysisofvarianceonthedata.
6. FromtheredtrianglemenuforOnewayAnalysis,selectMeans/Anova.
Note: IftheXfactorhasonlytwolevels,theMeans/AnovaoptionappearsasMeans/Anova/
Pooled t, andaddsapooledttestreporttothereportwindow.
Figure 6.3ExampleoftheMeans/AnovaOption
Notethefollowingobservations:
Meandiamondsrepresentingconfidenceintervalsappear.
Thelinenearthecenterofeachdiamondrepresentsthegroupmean.Ataglance,you
canseethatthemeanforeachdruglookssignificantlydifferent.
Theverticalspanofeachdiamondrepresentsthe95%confidenceintervalforthemean
ofeachgroup.
SeeMeanDiamondsandXAxisProportionalonpage 161.
TheSummaryofFittableprovidesoverallsummaryinformationabouttheanalysis.
Chapter 6
Basic Analysis
Oneway Analysis
The Oneway Plot
147
TheAnalysisofVariancereportshowsthestandardANOVAinformation.Younoticethat
theProb > F(thepvalue)is0.0053,whichsupportsyourvisualconclusionthatthereare
significantdifferencesbetweenthedrugs.
TheMeansforOnewayAnovareportshowsthemean,samplesize,andstandarderrorfor
eachlevelofthecategoricalfactor.
TolaunchtheOnewayplatform,fromtheJMPStarterwindow,clickontheBasiccategory
andclickOneway.
Figure 6.4TheOnewayLaunchWindow
Formoreinformationaboutthislaunchwindow,seeIntroductiontoFitYbyXchapteron
page 95.
148
Oneway Analysis
Oneway Platform Options
Chapter 6
Basic Analysis
Youcanaddreports,additionalplots,andteststothereportwindowusingtheoptionsinthe
redtrianglemenuforOnewayAnalysis.SeeOnewayPlatformOptionsonpage 148.
ToproducetheplotshowninFigure 6.5,followtheinstructionsinExampleofOneway
Analysisonpage 145.
Figure 6.5TheOnewayPlot
Chapter 6
Basic Analysis
Oneway Analysis
Oneway Platform Options
149
Figure 6.6ExampleofOnewayPlatformOptions
Whenyouselectaplatformoption,objectsmightbeaddedtotheplot,andareportisadded
tothereportwindow.
Table 6.1ExamplesofOptionsandElements
Platform Option
Quantiles
Boxplots
Quantilesreport
Means/Anova
Meandiamonds
OnewayANOVAreports
Meanlines,errorbars,and
standarddeviationlines
MeansandStdDeviations
report
Compare Means
Comparisoncircles
MeansComparisonreports
(exceptNonparametric
MultipleComparisonsoption)
ThefollowingtabledescribesalloftheplatformoptionsintheredtrianglemenuforOneway
Analysis.Someoptionsmightnotappearunlessspecificconditionsaremet.
150
Oneway Analysis
Oneway Platform Options
Chapter 6
Basic Analysis
Table 6.2DescriptionsofOnewayAnalysisPlatformOptions
Quantiles
Liststhefollowingquantilesforeachgroup:
0%(Minimum)
10%
25%
50%(Median)
75%
90%
100%(Maximum)
or
Means/Anova/Pooled t
Fitsmeansforeachgroupandperformsaoneway
analysisofvariancetotestiftherearedifferences
amongthemeans.SeeMeans/AnovaandMeans/
Anova/Pooledtonpage 156.
TheMeans/Anova/Pooled toptionappearsonlyiftheX
factorhastwolevels.
Givessummarystatisticsforeachgroup.Thestandard
errorsforthemeansuseindividualgroupstandard
deviationsratherthanthepooledestimateofthe
standarddeviation.
Theplotnowcontainsmeanlines,errorbars,and
standarddeviationlines.Forabriefdescriptionofthese
elements,seeDisplayOptionsonpage 153.Formore
detailsabouttheseelements,seeMeanLines,Error
Bars,andStandardDeviationLinesonpage 162.
t test
Producesattestreportassumingthatthevariancesare
notequal.SeeThettestReportonpage 158.
ThisoptionappearsonlyiftheXfactorhastwolevels.
ProvidesfourcommandsforperformingAnalysisof
Means(ANOM)procedures.Therearecommandsfor
comparingbothmeansandvariances.SeeAnalysisof
MeansMethodsonpage 163.
Chapter 6
Basic Analysis
Oneway Analysis
Oneway Platform Options
151
Providesmultiplecomparisonmethodsforcomparing
setsofgroupmeans.SeeCompareMeanson
page 166.
Nonparametric
Providesnonparametriccomparisonsofgroupmeans.
SeeNonparametriconpage 171.
Unequal Variances
Performsfourtestsforequalityofgroupvariances.
AlsogivestheWelchtest,whichisanANOVAtestfor
comparingmeanswhenthevarianceswithingroups
arenotequal.SeeUnequalVariancesonpage 176.
Equivalence Test
Teststhatadifferenceislessthanathresholdvalue.See
EquivalenceTestonpage 179.
Robust Fit
Attemptstoreducetheinfluenceofoutliersonyour
data.SeeRobustFitonpage 180.
Power
Providescalculationsofstatisticalpowerandother
detailsaboutagivenhypothesistest.SeePoweron
page 180.
ThePowerDetailswindowandreportsalsoappear
withintheFitModelplatform.Forfurtherdiscussion
andexamplesofpowercalculations,seetheFitting
LinearModelsbook.
Set Level
Youcanselectanoptionfromthemostcommonalpha
levelsorspecifyanylevelwiththeOtherselection.
Changingthealphalevelresultsinthefollowing
actions:
recalculatesconfidencelimits
adjuststhemeandiamondsontheplot(iftheyare
showing)
modifiestheupperandlowerconfidencelevel
valuesinreports
changesthecriticalnumberandcomparisoncircles
forallCompareMeansreports
changesthecriticalnumberforallNonparametric
MultipleComparisonreports
152
Oneway Analysis
Oneway Platform Options
Chapter 6
Basic Analysis
Providesthefollowingoptionsforplottingthe
quantilesofthedataineachgroup:
Line of Fitdrawsstraightdiagonalreferencelineson
theplotforeachleveloftheXvariable.Thisoption
isavailableonlyonceyouhavecreatedaplot
(ActualbyQuantileorQuantilebyActual).
CDF Plot
Plotsthecumulativedistributionfunctionforallofthe
groupsintheOnewayreport.SeeCDFPloton
page 182.
Densities
Comparesdensitiesacrossgroups.SeeDensitieson
page 183.
Matching Column
Specifyamatchingvariabletoperformamatching
modelanalysis.Usethisoptionwhenthedatainyour
Onewayanalysiscomesfrommatched(paired)data,
suchaswhenobservationsindifferentgroupscome
fromthesamesubject.
Theplotnowcontainsmatchinglinesthatconnectthe
matchingpoints.
SeeMatchingColumnonpage 183.
Chapter 6
Basic Analysis
Oneway Analysis
Oneway Platform Options
153
Savesthefollowingquantitiesasnewcolumnsinthe
currentdatatable:
Save Residualssavesvaluescomputedasthe
responsevariableminusthemeanoftheresponse
variablewithineachlevelofthefactorvariable.
Save Standardizedsavesstandardizedvaluesofthe
responsevariablecomputedwithineachlevelofthe
factorvariable.Thisisthecenteredresponse
dividedbythestandarddeviationwithineachlevel.
computedwithineachlevelofthecategoricalfactor
variable.
Save Predictedsavesthepredictedmeanofthe
responsevariableforeachlevelofthefactor
variable.
Display Options
Addsorremoveselementsfromtheplot.SeeDisplay
Optionsonpage 153.
Script
Thismenucontainsoptionsthatareavailabletoall
platforms.Theyenableyoutoredotheanalysisorsave
theJSLcommandsfortheanalysistoawindowora
file.Formoreinformation,seeUsingJMP.
Display Options
UsingDisplayOptions,youcanaddorremoveelementsfromaplot.Someoptionsmightnot
appearunlesstheyarerelevant.
Table 6.3DescriptionsofDisplayOptions
All Graphs
Showsorhidesallgraphs.
Points
Showsorhidesdatapointsontheplot.
Box Plots
Showsorhidesoutlierboxplotsforeachgroup.
Mean Diamonds
Drawsahorizontallinethroughthemeanofeach
groupproportionaltoitsxaxis.Thetopandbottom
pointsofthemeandiamondshowtheupperandlower
95%confidencepointsforeachgroup.SeeMean
DiamondsandXAxisProportionalonpage 161.
154
Oneway Analysis
Oneway Platform Options
Chapter 6
Basic Analysis
Drawsalineatthemeanofeachgroup.SeeMean
Lines,ErrorBars,andStandardDeviationLineson
page 162.
Mean CI Lines
Drawslinesattheupperandlower95%confidence
levelsforeachgroup.
Identifiesthemeanofeachgroupandshowserrorbars
onestandarderroraboveandbelowthemean.See
MeanLines,ErrorBars,andStandardDeviation
Linesonpage 162.
Grand Mean
DrawstheoverallmeanoftheYvariableontheplot.
Showslinesonestandarddeviationaboveandbelow
themeanofeachgroup.SeeMeanLines,ErrorBars,
andStandardDeviationLinesonpage 162.
Comparison Circles
Showsorhidescomparisoncircles.Thisoptionis
availableonlywhenoneoftheCompare Meansoptions
isselected.SeeStatisticalDetailsforComparison
Circlesonpage 202.
Connect Means
Connectsthegroupmeanswithastraightline.
Mean of Means
Drawsalineatthemeanofthegroupmeans.
X-Axis proportional
Makesthespacingonthexaxisproportionaltothe
samplesizeofeachlevel.SeeMeanDiamondsand
XAxisProportionalonpage 161.
Points Spread
Spreadspointsoverthewidthoftheinterval
Points Jittered
Addssmallspacesbetweenpointsthatoverlayonthe
sameyvalue.Thehorizontaladjustmentofpoints
variesfrom0.375to0.625witha4*(Uniform0.5)5
distribution.
Matching Lines
(OnlyappearswhentheMatching Columnoptionis
selected.)Connectsmatchingpoints.
OnlyappearswhentheMatching Columnoptionis
selected.)Drawsdottedlinestoconnectcellmeans
frommissingcellsinthetable.Thevaluesusedasthe
endpointsofthelinesareobtainedusingatwoway
ANOVAmodel.
Chapter 6
Basic Analysis
Oneway Analysis
Quantiles
155
Drawssidebysidehistogramstotherightofthe
originalplot.
(AppearsonlywhenRobustFitisselected.)Drawsa
lineattherobustmeanofeachgroup.
Quantiles
TheQuantilesreportlistsselectedpercentilesforeachleveloftheXfactorvariable.The
medianisthe50thpercentile,andthe25thand75thpercentilesarecalledthequartiles.
TheQuantilesoptionaddsthefollowingelementstotheplot:
thegrandmeanrepresentingtheoverallmeanoftheYvariable
outlierboxplotssummarizingthedistributionofpointsateachfactorlevel
Figure 6.7OutlierBoxPlotandGrandMean
outlier box plot
grand mean
Note: Tohidetheseelements,clicktheredtrianglenexttoOnewayAnalysisandselect
Display Options > Box PlotsorGrand Mean.
Theverticallinewithintheboxrepresentsthemediansamplevalue.
Theendsoftheboxrepresentthe75thand25thquantiles,alsoexpressedasthe3rdand1st
quartile,respectively.
Thedifferencebetweenthe1stand3rdquartilesiscalledtheinterquartilerange.
156
Oneway Analysis
Means/Anova and Means/Anova/Pooled t
Chapter 6
Basic Analysis
Eachboxhaslines,sometimescalledwhiskers,thatextendfromeachend.Thewhiskers
extendfromtheendsoftheboxtotheoutermostdatapointthatfallswithinthedistances
computedasfollows:
3rd quartile + 1.5*(interquartile range)
1st quartile - 1.5*(interquartile range)
Ifthedatapointsdonotreachthecomputedranges,thenthewhiskersaredeterminedby
theupperandlowerdatapointvalues(notincludingoutliers).
Figure 6.8ExamplesofOutlierBoxPlots
whisker
median
sample
value
75% quantile
or 3rd quartile
25% quantile
or 1st quartile
Reference
Meandiamondsareaddedto
theOnewayplot
SeeDisplayOptionsonpage 153andMean
DiamondsandXAxisProportionalonpage 161.
Chapter 6
Basic Analysis
Oneway Analysis
Means/Anova and Means/Anova/Pooled t
157
Reference
Reports
SeeTheSummaryofFitReportonpage 157.
SeeTheAnalysisofVarianceReportonpage 159.
SeeTheMeansforOnewayAnovaReporton
page 161.
SeeThettestReportonpage 158.
Note: ThisreportappearsonlyiftheMeans/Anova/
Pooled toptionisselected.
SeeTheBlockMeansReportonpage 161.
Note: Thisreportappearsonlyifyouhavespecifieda
Blockvariableinthelaunchwindow.
Measurestheproportionofthevariationaccountedfor
byfittingmeanstoeachfactorlevel.Theremaining
variationisattributedtorandomerror.TheR2valueis1
iffittingthegroupmeansaccountsforallthevariation
withnoerror.AnR2of0indicatesthatthefitservesno
betterasapredictionmodelthantheoverallresponse
mean.Formoreinformation,seeStatisticalDetailsfor
theSummaryofFitReportonpage 204.
R2isalsocalledthecoefficientofdetermination.
AdjRsquare
AdjustsR2tomakeitmorecomparableovermodels
withdifferentnumbersofparametersbyusingthe
degreesoffreedominitscomputation.Formore
information,seeStatisticalDetailsfortheSummaryof
FitReportonpage 204.
158
Oneway Analysis
Means/Anova and Means/Anova/Pooled t
Chapter 6
Basic Analysis
Estimatesthestandarddeviationoftherandomerror.It
isthesquarerootofthemeansquareforErrorfoundin
theAnalysisofVariancereport.
MeanofResponse
Overallmean(arithmeticaverage)oftheresponse
variable.
Observations(orSumWgts)
Numberofobservationsusedinestimatingthefit.If
weightsareused,thisisthesumoftheweights.
SeeStatisticalDetailsfortheSummaryofFitReport
onpage 204.
Equalvariances.IfyouselecttheMeans/Anova/Pooled toption,atTestreportappears.
ThistTestassumesequalvariances.
Unequalvariances.Ifyouselectthet-Testoptionfromtheredtrianglemenu,atTest
reportappears.ThistTestassumesunequalvariances.
Thereportshowsthefollowinginformation:
Table 6.6DescriptionofthetTestReport
tTestplot
Showsthesamplingdistributionofthedifferenceinthe
means,assumingthenullhypothesisistrue.The
verticalredlineistheactualdifferenceinthemeans.
Theshadedareascorrespondtothepvalues.
Difference
ShowstheestimateddifferencebetweenthetwoX
levels.Intheplots,theDifferencevalueappearsasared
linethatcomparesthetwolevels.
StdErrDif
Showsthestandarderrorofthedifference.
UpperCLDif
Showstheupperconfidencelimitforthedifference.
LowerCLDif
Showsthelowerconfidencelimitforthedifference.
Chapter 6
Basic Analysis
Oneway Analysis
Means/Anova and Means/Anova/Pooled t
159
Showsthelevelofconfidence(1alpha).Tochangethe
levelofconfidence,selectanewalphalevelfromthe
Set Levelcommandfromtheplatformredtriangle
menu.
tRatio
Valueofthetstatistic.
DF
Thedegreesoffreedomusedinthettest.
Prob>|t|
Thepvalueassociatedwithatwotailedtest.
Prob>t
Thepvalueassociatedwithalowertailedtest.
Prob<t
Thepvalueassociatedwithanuppertailedtest.
Liststhethreesourcesofvariation,whicharethemodel
source,Error,andC. Total(correctedtotal).
DF
Recordsanassociateddegreesoffreedom(DFforshort)
foreachsourceofvariation:
IftheXfactorhasklevels,thenthemodelhask 1
degreesoffreedom.
TheErrordegreesoffreedomisthedifferencebetween
theC. TotaldegreesoffreedomandtheModeldegrees
offreedom(inotherwords,N k).
160
Oneway Analysis
Means/Anova and Means/Anova/Pooled t
Chapter 6
Basic Analysis
Recordsasumofsquares(SSforshort)foreachsource
ofvariation:
Thetotal(C. Total)sumofsquaresofeachresponse
fromtheoverallresponsemean.TheC. Totalsumof
squaresisthebasemodelusedforcomparisonwith
allothermodels.
Thesumofsquareddistancesfromeachpointtoits
respectivegroupmean.Thisistheremaining
unexplainedError(residual)SSafterfittingthe
analysisofvariancemodel.
ThetotalSSminustheerrorSSgivesthesumofsquares
attributedtothemodel.Thistellsyouhowmuchofthe
totalvariationisexplainedbythemodel.
MeanSquare
Isasumofsquaresdividedbyitsassociateddegreesof
freedom:
TheModelmeansquareestimatesthevarianceof
theerror,butonlyunderthehypothesisthatthe
groupmeansareequal.
TheErrormeansquareestimatesthevarianceofthe
errortermindependentlyofthemodelmeansquare
andisunconditionedbyanymodelhypothesis.
FRatio
Modelmeansquaredividedbytheerrormeansquare.
Ifthehypothesisthatthegroupmeansareequal(there
isnorealdifferencebetweenthem)istrue,thenboth
themeansquareforerrorandthemeansquarefor
modelestimatetheerrorvariance.TheirratiohasanF
distribution.Iftheanalysisofvariancemodelresultsin
asignificantreductionofvariationfromthetotal,theF
ratioishigherthanexpected.
Prob>F
Probabilityofobtaining(bychancealone)anFvalue
greaterthantheonecalculatedif,inreality,thereisno
differenceinthepopulationgroupmeans.Observed
significanceprobabilitiesof0.05orlessareoften
consideredevidencethattherearedifferencesinthe
groupmeans.
Chapter 6
Basic Analysis
Oneway Analysis
Means/Anova and Means/Anova/Pooled t
161
ListsthelevelsoftheXvariable.
Number
Liststhenumberofobservationsineachgroup.
Mean
Liststhemeanofeachgroup.
StdError
Liststheestimatesofthestandarddeviationsforthe
groupmeans.Thisstandarderrorisestimated
assumingthatthevarianceoftheresponseisthesame
ineachlevel.Itistherootmeansquareerrorfoundin
theSummaryofFitreportdividedbythesquarerootof
thenumberofvaluesusedtocomputethegroupmean.
Lower95%andUpper95%
Liststhelowerandupper95%confidenceintervalfor
thegroupmeans.
162
Oneway Analysis
Means/Anova and Means/Anova/Pooled t
Chapter 6
Basic Analysis
Figure 6.9ExamplesofMeanDiamondsandXAxisProportionalOptions
overlap marks
95%
confidence
interval
group mean
x-axis proportional
Notethefollowingobservations:
Thetopandbottomofeachdiamondrepresentthe(1alpha)x100confidenceintervalfor
eachgroup.Theconfidenceintervalcomputationassumesthatthevariancesareequal
acrossobservations.Therefore,theheightofthediamondisproportionaltothereciprocal
ofthesquarerootofthenumberofobservationsinthegroup.
IftheX-Axis proportionaloptionisselected,thehorizontalextentofeachgroupalongthe
xaxis(thehorizontalsizeofthediamond)isproportionaltothesamplesizeforeachlevel
oftheXvariable.Therefore,thenarrowerdiamondsareusuallytaller,becausefewerdata
pointsresultsinawiderconfidenceinterval.
Themeanlineacrossthemiddleofeachdiamondrepresentsthegroupmean.
Overlapmarksappearaslinesaboveandbelowthegroupmean.Forgroupswithequal
samplesizes,overlappingmarksindicatethatthetwogroupmeansarenotsignificantly
differentatthegivenconfidencelevel.Overlapmarksarecomputedasgroup
mean 2 2 CI 2 .Overlapmarksinonediamondthatareclosertothemeanof
anotherdiamondthanthatdiamondsoverlapmarksindicatethatthosetwogroupsare
notdifferentatthegivenconfidencelevel.
ThemeandiamondsautomaticallyappearwhenyouselecttheMeans/Anova/Pooled t or
Means/Anovaoptionfromtheplatformmenu.However,youcanshoworhidethemat
anytimebyselectingDisplay Options > Mean Diamondsfromtheredtrianglemenu.
Chapter 6
Basic Analysis
Oneway Analysis
Analysis of Means Methods
163
Figure 6.10MeanLines,MeanErrorBars,andStdDevLines
ifyouwanttotestwhetheranyofthegroupmeansarestatisticallydifferentfromthe
overallmean
ifyouwanttotestwhetheranyofthegroupstandarddeviationsarestatisticallydifferent
fromtherootmeansquareerror(RMSE)
Compare Means
UsetheANOMandANOM with Transformed Ranksoptionstocomparegroupmeanstothe
overallmean.
Table 6.9DescriptionsofMethodsforComparingMeans
ANOM
Comparesgroupmeanstotheoverallmean.This
methodassumesthatyourdataisapproximately
normallydistributed.SeeExampleofanAnalysisof
MeansChartonpage 184.
164
Oneway Analysis
Analysis of Means Methods
Chapter 6
Basic Analysis
ThisisthenonparametricversionoftheANOManalysis.
Usethismethodifyourdataisclearlynonnormaland
cannotbetransformedtonormality.Compareseach
groupmeantransformedranktotheoverallmean
transformedrank.
Comparesgroupstandarddeviationstotherootmean
squareerror.Thismethodassumesthatyourdatais
approximatelynormallydistributed.Tousethis
method,eachgroupmusthaveatleastfour
observations.SeeExampleofanAnalysisofMeansfor
VariancesChartonpage 185.
ThisisthenonparametricversionoftheANOM for
Variancesanalysis.Usethismethodifyoususpectyour
dataisnonnormalandcannotbetransformedto
normality.Comparesthegroupmeansoftheabsolute
deviationfromthemedian(ADM)totheoverallmean
ADM.
acenterlineindicatingtheoverallmeanorrootmeansquareerror(orMSEwhenin
variancescale)
upperdecisionlimits(UDL)
lowerdecisionlimits(LDL)
Ifagroupmeanfallsoutsideofthedecisionlimits,thenthatmeanissignificantlydifferent
fromtheoverallmean.Ifagroupstandarddeviationfallsoutsideofthedecisionlimits,then
thatstandarddeviationissignificantlydifferentfromtherootmeansquareerror.
Chapter 6
Basic Analysis
Oneway Analysis
Analysis of Means Methods
165
Selectanoptionfromthemostcommonalphalevelsor
specifyanylevelwiththeOtherselection.Changingthe
alphalevelmodifiestheupperandlowerdecision
limits.
ForANOM,createsareportshowinggroupmeansand
decisionlimits.
Display Options
Displayoptionsincludethefollowing:
decisionlimitshading.
statistic.
Point Options
Show Needlesshowstheneedles.Thisisthe
defaultoption.
Show Connected Pointsshowsalineconnecting
themeansforeachgroup.
Show Only Pointsshowsonlythepoints
representingthemeansforeachgroup.
166
Oneway Analysis
Compare Means
Chapter 6
Basic Analysis
Compare Means
Note: AnothermethodforcomparingmeansisANOM.SeeAnalysisofMeansMethodson
page 163.
UsetheCompareMeansoptionstoperformmultiplecomparisonsofgroupmeans.Allof
thesemethodsusepooledvarianceestimatesforthemeans.EachCompareMeansoption
addscomparisoncirclesnexttotheplotandspecificreportstothereportwindow.Fordetails
aboutcomparisoncircles,seeUsingComparisonCirclesonpage 167.
Option
Description
Reference
Nonparametric
Menu Option
Each Pair,
Students t
Computesindividual
pairwisecomparisonsusing
Studentsttests.Ifyou
makemanypairwisetests,
thereisnoprotectionacross
theinferences.Therefore,
thealphasize(TypeIerror
rate)acrossthehypothesis
testsishigherthanthatfor
individualtests.
SeeEachPair,
Studentston
page 168.
Nonparametric >
Nonparametric
Multiple
Comparisons >
Wilcoxon Each
Pair
Showsatestthatissizedfor
alldifferencesamongthe
means.ThisistheTukeyor
TukeyKramerHSD
(honestlysignificant
difference)test.(Tukey
1953,Kramer1956).This
testisanexactalphalevel
testifthesamplesizesare
thesame,andconservative
ifthesamplesizesare
different(Hayter1984).
SeeAllPairs,
TukeyHSDon
page 169.
Nonparametric >
Nonparametric
Multiple
Comparisons >
Steel-Dwass All
Pairs
Testswhetherthemeansare
lessthantheunknown
maximumorgreaterthan
theunknownminimum.
ThisistheHsuMCBtest
(Hsu1981).
SeeWithBest,
HsuMCBon
page 169.
none
Chapter 6
Basic Analysis
Oneway Analysis
Compare Means
167
Option
Description
Reference
Nonparametric
Menu Option
With Control,
Dunnetts
Testswhetherthemeansare
differentfromthemeanofa
controlgroup.Thisis
Dunnettstest(Dunnett
1955).
SeeWithControl,
Dunnettson
page 170.
Nonparametric >
Nonparametric
Multiple
Comparisons >
Steel With Control
Note: IfyouhavespecifiedaBlockcolumn,thenthemultiplecomparisonmethodsare
performedondatathathasbeenadjustedfortheBlockmeans.
RelatedInformation
ExampleContrastingAlloftheCompareMeansTestsonpage 191
Compareeachpairofgroupmeansvisuallybyexaminingtheintersectionofthecomparison
circles.Theoutsideangleofintersectiontellsyouwhetherthegroupmeansaresignificantly
different.SeeFigure 6.12.
Circlesformeansthataresignificantlydifferenteitherdonotintersect,orintersect
slightly,sothattheoutsideangleofintersectionislessthan90degrees.
168
Oneway Analysis
Compare Means
Chapter 6
Basic Analysis
Ifthecirclesintersectbyanangleofmorethan90degrees,oriftheyarenested,themeans
arenotsignificantlydifferent.
Figure 6.12AnglesofIntersectionandSignificance
angle greater
than 90 degrees
angle equal to
90 degrees
not significantly
different
borderline
significantly
different
significantly
different
Iftheintersectionangleiscloseto90degrees,youcanverifywhetherthemeansare
significantlydifferentbyclickingonthecomparisoncircletoselectit.SeeFigure 6.13.To
deselectcircles,clickinthewhitespaceoutsidethecircles.
Figure 6.13HighlightingComparisonCircles
Groups that are different
from the selected group
appear as thick gray
circles.
Groups that are not
different from the
selected group appear
as thin red circles.
The selected group
appears as a thick red
circle.
RelatedInformation
StatisticalDetailsforComparisonCirclesonpage 202
ExampleoftheEachPair,StudentstTestonpage 186
Chapter 6
Basic Analysis
Oneway Analysis
Compare Means
169
ExampleoftheAllPairs,TukeyHSDTestonpage 187
Ifameanhasmeanssignificantlyseparatedaboveit,itisnotregardedasthemaximum.
Ifameanhasmeanssignificantlyseparatedbelowit,itisnotregardedastheminimum.
Ifameanissignificantlyseparatedaboveallothermeans,itisregardedasthemaximum.
Ifameanissignificantlyseparatedbelowallothermeans,itisregardedastheminimum.
Note: MeansthatarenotregardedasthemaximumortheminimumbyMCBarealsothe
meansthatarenotcontainedintheselectedsubsetofGupta(1965)ofpotentialmaximumsor
minimummeans.
Forthemaximumreport,acolumnshowstherowmeanminusthecolumnmeanminusthe
LSD.Ifavalueispositive,therowmeanissignificantlyhigherthanthemeanforthecolumn,
andthemeanforthecolumnisnotthemaximum.
Fortheminimumreport,acolumnshowstherowmeanminusthecolumnmeanplustheLSD.
Ifavalueisnegative,therowmeanissignificantlylessthanthemeanforthecolumn,andthe
meanforthecolumnisnottheminimum.
170
Oneway Analysis
Compare Means
Chapter 6
Basic Analysis
RelatedInformation
ExampleoftheWithBest,HsuMCBTestonpage 189
ExampleoftheWithControl,DunnettsTestonpage 190
Showsatableofalldifferencesofmeans.
Confidence Quantile
Showsthetvalueorothercorrespondingquantiles
usedforconfidenceintervals.
Showsamatrixshowingifadifferenceexceedstheleast
significantdifferenceforallcomparisons.
Note: ForHsusMCBandDunnettstest,only
DifferenceMatrix,ConfidenceQuantile,andLSD
ThresholdMatrixareapplicable.
Showsthetraditionallettercodedreportwheremeans
thatarenotsharingaletteraresignificantlydifferent.
Chapter 6
Basic Analysis
Oneway Analysis
Nonparametric
171
Thisreportshowstherankeddifferences,fromhighest
tolowest,withaconfidenceintervalbandoverlaidon
theplot.Confidenceintervalsthatdonotfullycontain
theircorrespondingbararesignificantlydifferentfrom
eachother.
Detailed Comparisons Report
Showsadetailedreportforeachcomparison.Each
sectionshowsthedifferencebetweenthelevels,
standarderrorandconfidenceintervals,tratios,
pvalues,anddegreesoffreedom.Aplotillustratingthe
comparisonappearsontherightofeachreport.
ThisoptionisnotavailableforAllPairs,TukeysHSD,
andNonparametricMultipleComparisons.
Nonparametric
Nonparametrictestsareusefulfortestingwhethergroupmeansormediansarelocatedthe
sameacrossgroups.However,theusualanalysisofvarianceassumptionofnormalityisnot
made.Nonparametrictestsusefunctionsoftheresponseranks,calledrankscores(Hajek
1969).
Note: IfyouhavespecifiedaBlockcolumn,thenthenonparametrictestsareperformedon
datathathasbeenadjustedfortheBlockmeans.
Table 6.13DescriptionsofNonparametricTests
WilcoxonTesta
PerformsthetestbasedonWilcoxonrankscores.TheWilcoxon
rankscoresarethesimpleranksofthedata.TheWilcoxontest
isthemostpowerfulranktestforerrorswithlogistic
distributions.Ifthefactorhastwoormorelevels,the
KruskalWallistestisperformed.
TheWilcoxontestisalsocalledtheMannWhitneytest.
172
Oneway Analysis
Nonparametric
Chapter 6
Basic Analysis
PerformsthetestbasedonMedianrankscores.TheMedian
rankscoresareeither1or0,dependingonwhetherarankis
aboveorbelowthemedianrank.TheMediantestisthemost
powerfulranktestforerrorswithdoubleexponential
distributions.
vanderWaerdenTest
PerformsthetestbasedonVanderWaerdenrankscores.The
VanderWaerdenrankscoresaretheranksofthedatadivided
byoneplusthenumberofobservationstransformedtoa
normalscorebyapplyingtheinverseofthenormaldistribution
function.TheVanderWaerdentestisthemostpowerfulrank
testforerrorswithnormaldistributions.
KolmogorovSmirnov
Test
Performsthetestbasedontheempiricaldistributionfunction,
whichtestswhetherthedistributionoftheresponseisthesame
acrossthegroups.Bothanapproximateandanexacttestare
given.ThistestisavailableonlywhentheXfactorhastwo
levels.
ExactTest
Providesoptionsforperformingexactversionsofthe
Wilcoxon,Median,vanderWaerden,and
KolmogorovSmirnovtests.Theseoptionsareavailableonly
whentheXfactorhastwolevels,andaftertheapproximatetest
isrequested.
Chapter 6
Basic Analysis
Oneway Analysis
Nonparametric
173
Providesseveraloptionsforperformingnonparametric
multiplecomparisons.Thesetestsarebasedonranks,and
controlfortheoverallalphalevel,exceptfortheWilcoxonEach
Pairtest.Thefollowingtestsareavailable:
Wilcoxon Each Pair performstheWilcoxontestoneachpair,
anddoesnotcontrolfortheoverallalphalevel.Thisisthe
nonparametricversionoftheEach Pair, Students toption
foundontheCompareMeansmenu.
Steel-Dwass All Pairs performstheSteelDwasstestoneach
controllevel,similartotheSteelWithControloption.The
Dunnmethodisdifferentinthatitcomputesranksonall
thedata,notjustthepairbeingcompared.
performsacomparisonofeach
pair,similartotheSteelDwassAllPairsoption.TheDunn
methodisdifferentinthatitcomputesranksonallthedata,
notjustthepairbeingcompared.
SeeDunn(1964)andHsu(1996).
a.FortheWilcoxon,Median,andVan derWaerden tests,iftheX factorhas morethantwo
levels, a chisquare approximation to the oneway test is performed. If the X factor has two
levels, a normal approximation to the twosample test is performed, in addition to the
chisquareapproximationtotheonewaytest.
174
Oneway Analysis
Nonparametric
Chapter 6
Basic Analysis
Table 6.14DescriptionsoftheWilcoxon,Median,andVanderWaerdenTests
Level
Liststhefactorlevelsoccurringinthedata.
Count
Recordsthefrequenciesofeachlevel.
ScoreSum
Recordsthesumoftherankscoreforeachlevel.
ExpectedScore
Recordstheexpectedscoreunderthenullhypothesisthatthereis
nodifferenceamongclasslevels.
ScoreMean
Recordsthemeanrankscoreforeachlevel.
(MeanMean0)/Std0
Recordsthestandardizedscore.Mean0isthemeanscoreexpected
underthenullhypothesis.Std0isthestandarddeviationofthe
scoresumexpectedunderthenullhypothesis.Thenullhypothesis
isthatthegroupmeansormediansareinthesamelocationacross
groups.
ChiSquare
Givesthevaluesofthechisquareteststatistic.
DF
Givesthedegreesoffreedomforthetest.
Prob>ChiSq
Givesthepvalueforthetest.
Givesthesumoftherankscores.ThisisreportedonlywhentheX
factorhastwolevels.
Givestheteststatisticforthenormalapproximationtest.Thisis
reportedonlywhentheXfactorhastwolevels.
Prob|Z|
Givesthepvalueforthenormalapproximationtest.Thisis
reportedonlywhentheXfactorhastwolevels.
ProbS
Givesaonesidedpvalueforthetest.Thisisreportedonlywhen
theXfactorhastwolevels,andtheexactversionofthetestis
requested.
ExacttestsareavailableonlyinJMPPro.
Prob|SMean|
Givesatwosidedpvalueforthetest.Thisisreportedonlywhen
theXfactorhastwolevels,andtheexactversionofthetestis
requested.
ExacttestsareavailableonlyinJMPPro.
Chapter 6
Basic Analysis
Oneway Analysis
Nonparametric
175
Table 6.15DescriptionoftheKolmogorovSmirnovTest
Level
Liststhefactorlevelsoccurringinthedata.
Count
Recordsthefrequenciesofeachlevel.
EDFatMaximum
Liststhevalueatwhichthemaximumdeviationfromthe
empiricaldistributionfunction(EDF)ofeachlevelandtheoverall
EDFoccurs.
DeviationfromMean
atMaximum
ListsthevalueoftheEDFofasampleatthemaximumdeviation
fromthemeanoftheEDFfortheoverallsample.
KS
AKolmogorovSmirnovstatistic.
KSa
AnasymptoticKolmogorovSmirnovstatistic.
D=max|F1F2|
ListsthemaximumabsolutedeviationbetweentheEDFoftwo
classlevels.
Prob>D
Liststhepvalueforthetest.Inotherwords,theprobabilitythatD
isgreaterthantheobservedvalued,underthenullhypothesisof
nodifferencebetweenclasslevelsorsamples.
D+=max(F1F2)
ListsaonesidedteststatisticthatmaxdeviationbetweentheEDF
oftwoclasslevelsispositive.
Prob>D+
ListstheprobabilitythatD+isgreaterthantheobservedvalued+,
underthenullhypothesisofnodifferencebetweenthetwoclass
levels.
D=max(F2F1)
ListsaonesidedteststatisticthatmaxdeviationbetweentheEDF
oftwoclasslevelsisnegative.
Prob>D
ListstheprobabilitythatDisgreaterthantheobservedvaluefor
d.
Table 6.16DescriptionsoftheNonparametricMultipleComparisonsTests
q*
Givesthequantilevalueusedintheconfidenceintervals.
Alpha
Givesthealphalevelusedintheconfidenceintervals
Level
Givesthepairusedinthecurrentcomparison
ScoreMeanDiff
Givesthedifferenceofthescoremeans.
StdErrDif
Givesthestandarderrorofthedifferencebetweenthescore
means.
176
Oneway Analysis
Unequal Variances
Chapter 6
Basic Analysis
Givesthestandardizedteststatistic,whichhasanasymptotic
standardnormaldeviationunderthenullhypothesis.
pValue
GivestheasymptotictwosidedpvalueforZ.
HodgesLehmann
GivestheHodgesLehmannestimatoroflocationshift.Itisthe
medianofallpaireddifferencesbetweenobservationsinthetwo
samples.
LowerCL
GivesthelowerconfidencelimitfortheHodgesLehmannstatistic.
UpperCL
GivestheupperconfidencelimitfortheHodgesLehmann
statistic.
Unequal Variances
Whenthevariancesacrossgroupsarenotequal,theusualanalysisofvarianceassumptions
arenotsatisfiedandtheANOVAFtestisnotvalid.JMPprovidesfourtestsforequalityof
groupvariancesandanANOVAthatisvalidwhenthegroupsamplevariancesareunequal.
Theconceptbehindthefirstthreetestsofequalvariancesistoperformananalysisofvariance
onanewresponsevariableconstructedtomeasurethespreadineachgroup.Thefourthtestis
Bartlettstest,whichissimilartothelikelihoodratiotestundernormaldistributions.
Note: AnothermethodtotestforunequalvariancesisANOMV.SeeAnalysisofMeans
Methodsonpage 163.
Table 6.17DescriptionsofTestsforEqualVariances
OBrien
Constructsadependentvariablesothatthegroupmeansofthenew
variableequalthegroupsamplevariancesoftheoriginalresponse.An
ANOVAontheOBrienvariableisactuallyanANOVAonthegroup
samplevariances(OBrien1979,Olejnik,andAlgina1987).
BrownForsythe
ShowstheFtestfromanANOVAwheretheresponseistheabsolute
valueofthedifferenceofeachobservationandthegroupmedian
(BrownandForsythe1974).
Levene
ShowstheFtestfromanANOVAwheretheresponseistheabsolute
valueofthedifferenceofeachobservationandthegroupmean
(Levene1960).Thespreadismeasuredas z ij = y ij y i (asopposedto
2
theSASdefault zij2 = y ij y i ).
Chapter 6
Basic Analysis
Oneway Analysis
Unequal Variances
177
Comparestheweightedarithmeticaverageofthesamplevariancesto
theweightedgeometricaverageofthesamplevariances.Thegeometric
averageisalwayslessthanorequaltothearithmeticaveragewith
equalityholdingonlywhenallsamplevariancesareequal.Themore
variationthereisamongthegroupvariances,themorethesetwo
averagesdiffer.Afunctionofthesetwoaveragesiscreated,which
approximatesa2distribution(or,infact,anFdistributionundera
certainformulation).Largevaluescorrespondtolargevaluesofthe
arithmeticorgeometricratio,andthereforetowidelyvaryinggroup
variances.DividingtheBartlettChisquareteststatisticbythedegrees
offreedomgivestheFvalueshowninthetable.Bartlettstestisnot
veryrobusttoviolationsofthenormalityassumption(Bartlettand
Kendall1946).
Ifthereareonlytwogroupstested,thenastandardFtestforunequalvariancesisalso
performed.TheFtestistheratioofthelargertothesmallervarianceestimate.Thepvalue
fromtheFdistributionisdoubledtomakeitatwosidedtest.
Note: IfyouhavespecifiedaBlockcolumn,thenthevariancetestsareperformedondataafter
ithasbeenadjustedfortheBlockmeans.
Liststhefactorlevelsoccurringinthedata.
Count
Recordsthefrequenciesofeachlevel.
StdDev
Recordsthestandarddeviationsoftheresponseforeachfactor
level.Thestandarddeviationsareequaltothemeansofthe
OBrienvariable.Ifaleveloccursonlyonceinthedata,no
standarddeviationiscalculated.
178
Oneway Analysis
Unequal Variances
Chapter 6
Basic Analysis
Recordsthemeanabsolutedifferenceoftheresponseandgroup
mean.Themeanabsolutedifferencesareequaltothegroupmeans
oftheLevenevariable.
MeanAbsDifto
Median
Recordstheabsolutedifferenceoftheresponseandgroupmedian.
Themeanabsolutedifferencesareequaltothegroupmeansofthe
BrownForsythevariable.
Test
Liststhenamesofthetestsperformed.
FRatio
RecordsacalculatedFstatisticforeachtest.
SeeStatisticalDetailsfortheTestsThattheVariancesAreEqual
Reportonpage 205.
DFNum
Recordsthedegreesoffreedominthenumeratorforeachtest.Ifa
factorhasklevels,thenumeratorhask 1degreesoffreedom.
Levelsoccurringonlyonceinthedataarenotusedincalculating
teststatisticsforOBrien,BrownForsythe,orLevene.The
numeratordegreesoffreedominthissituationisthenumberof
levelsusedincalculationsminusone.
DFDen
Recordsthedegreesoffreedomusedinthedenominatorforeach
test.ForOBrien,BrownForsythe,andLevene,adegreeof
freedomissubtractedforeachfactorlevelusedincalculatingthe
teststatistic.Onemoredegreeoffreedomissubtractedforthe
overallmean.Ifafactorhasklevels,thedenominatordegreesof
freedomisn k 1.
pValue
Probabilityofobtaining,bychancealone,anFvaluelargerthan
theonecalculatedifinrealitythevariancesareequalacrossall
levels.
Table 6.19DescriptionoftheWelchsTestReport
FRatio
ShowstheFteststatisticfortheequalvariancetest.
SeeStatisticalDetailsfortheTestsThattheVariancesAreEqual
Reportonpage 205.
DFNum
Recordsthedegreesoffreedominthenumeratorofthetest.Ifa
factorhasklevels,thenumeratorhask 1degreesoffreedom.
Levelsoccurringonlyonceinthedataarenotusedincalculating
theWelchANOVA.Thenumeratordegreesoffreedominthis
situationisthenumberoflevelsusedincalculationsminusone.
Chapter 6
Basic Analysis
Oneway Analysis
Equivalence Test
179
Recordsthedegreesoffreedominthedenominatorofthetest.
SeeStatisticalDetailsfortheTestsThattheVariancesAreEqual
Reportonpage 205.
Prob>F
Probabilityofobtaining,bychancealone,anFvaluelargerthan
theonecalculatedifinrealitythemeansareequalacrossalllevels.
Observedsignificanceprobabilitiesof0.05orlessareconsidered
evidenceofunequalmeansacrossthelevels.
tTest
ShowstherelationshipbetweentheFratioandthetTest.
CalculatedasthesquarerootoftheFratio.AppearsonlyiftheX
factorhastwolevels.
RelatedInformation
ExampleoftheUnequalVariancesOptiononpage 192
StatisticalDetailsfortheTestsThattheVariancesAreEqualReportonpage 205
Equivalence Test
Equivalencetestsassesswhetherthereisapracticaldifferenceinmeans.Youmustpicka
thresholddifferenceforwhichsmallerdifferencesareconsideredpracticallyequivalent.The
moststraightforwardtesttoconstructusestwoonesidedttestsfrombothsidesofthe
differenceinterval.Ifbothtestsreject(orconcludethatthedifferenceinthemeansdiffers
significantlyfromthethreshold),thenthegroupsarepracticallyequivalent.TheEquivalence
TestoptionusestheTwoOneSidedTests(TOST)approach.
RelatedInformation
ExampleofanEquivalenceTestonpage 193
180
Oneway Analysis
Robust Fit
Chapter 6
Basic Analysis
Robust Fit
Note: Formoredetailsaboutrobustfitting,seeHuber,1973.
TheRobust Fitoptionattemptstoreducetheinfluenceofoutliersinyourdataset.Inthis
instance,outliersareanobservationthatdoesnotcomefromthetrueunderlying
distributionofthedata.Forexample,ifweightmeasurementswerebeingtakeninpoundsfor
asampleofindividuals,butoneoftheindividualsaccidentallyrecordedtheirweightin
kilogramsinsteadofpounds,thiswouldbeadeviationfromthetruedistributionofthedata.
Outlierssuchasthiscouldleadyouintomakingincorrectdecisionsbecauseoftheirinfluence
onthedata.TheRobust Fitoptionreducestheinfluenceofthesetypesofoutliers.
RelatedInformation
ExampleoftheRobustFitOptiononpage 194
Power
ThePoweroptioncalculatesstatisticalpowerandotherdetailsaboutagivenhypothesistest.
LSV(theLeastSignificantValue)isthevalueofsomeparameterorfunctionofparameters
thatwouldproduceacertainpvaluealpha.Saidanotherway,youwanttoknowhow
smallaneffectwouldbedeclaredsignificantatsomepvaluealpha.TheLSVprovidesa
measuringstickforsignificanceonthescaleoftheparameter,ratherthanonaprobability
scale.Itshowshowsensitivethedesignanddataare.
LSN(theLeastSignificantNumber)isthetotalnumberofobservationsthatwould
produceaspecifiedpvaluealphagiventhatthedatahasthesameform.TheLSNis
definedasthenumberofobservationsneededtoreducethevarianceoftheestimates
enoughtoachieveasignificantresultwiththegivenvaluesofalpha,sigma,anddelta(the
significancelevel,thestandarddeviationoftheerror,andtheeffectsize).Ifyouneedmore
datatoachievesignificance,theLSNhelpstellyouhowmanymore.TheLSNisthetotal
numberofobservationsthatyieldsapproximately50%power.
Poweristheprobabilityofgettingsignificance(pvalue<alpha)whenarealdifference
existsbetweengroups.Itisafunctionofthesamplesize,theeffectsize,thestandard
deviationoftheerror,andthesignificancelevel.Thepowertellsyouhowlikelyyour
experimentistodetectadifference(effectsize),atagivenalphalevel.
Note: Whenthereareonlytwogroupsinaonewaylayout,theLSVcomputedbythepower
facilityisthesameastheleastsignificantdifference(LSD)showninthemultiplecomparison
tables.
Chapter 6
Basic Analysis
Oneway Analysis
Power
181
Significancelevel,between0and1(usually0.05,0.01,or0.10).
Initially,avalueof0.05shows.
Sigma()
Standarderroroftheresidualerrorinthemodel.Initially,RMSE,
theestimatefromthesquarerootofthemeansquareerroris
suppliedhere.
Delta()
Raweffectsize.Fordetailsabouteffectsizecomputations,seethe
FittingLinearModelsbook.Thefirstpositionisinitiallysettothe
squarerootofthesumsofsquaresforthehypothesisdividedbyn;
thatis, = SS n .
Number(n)
Totalsamplesizeacrossallgroups.Initially,theactualsamplesize
isputinthefirstposition.
SolveforPower
Solvesforthepower(theprobabilityofasignificantresult)asa
functionofallfourvalues:,,,andn.
SolveforLeast
SignificantNumber
Solvesforthenumberofobservationsneededtoachieve
approximately50%powergiven,,and.
SolveforLeast
SignificantValue
Solvesforthevalueoftheparameterorlineartestthatproducesa
pvalueof.Thisisafunctionof,,n,andthestandarderrorof
theestimate.ThisfeatureisavailableonlywhentheXfactorhas
twolevelsandisusuallyusedforindividualparameters.
182
Oneway Analysis
Normal Quantile Plot
Chapter 6
Basic Analysis
Whenyoulookatpowerretrospectively,youuseestimatesofthe
standarderrorandthetestparameters.
Adjustedpoweristhepowercalculatedfromamoreunbiased
estimateofthenoncentralityparameter.
Theconfidenceintervalfortheadjustedpowerisbasedonthe
confidenceintervalforthenoncentralityestimate.
Adjustedpowerandconfidencelimitsarecomputedonlyforthe
originalDelta,becausethatiswheretherandomvariationis.
RelatedInformation
ExampleofthePowerOptiononpage 196
StatisticalDetailsforPoweronpage 204
values.ThequantilesarecomputedandplottedseparatelyforeachleveloftheXvariable.
TheLine of Fitoptionshowsorhidesthelinesoffitonthequantileplots.
RelatedInformation
ExampleofaNormalQuantilePlotonpage 197
CDF Plot
ACDFplotshowsthecumulativedistributionfunctionforallofthegroupsintheOneway
report.CDFplotsareusefulifyouwanttocomparethedistributionsoftheresponseacross
levelsoftheXfactor.
RelatedInformation
ExampleofaCDFPlotonpage 198
Chapter 6
Basic Analysis
Oneway Analysis
Densities
183
Densities
TheDensitiesoptionsprovideseveralwaystocomparethedistributionandcompositionof
theresponseacrossthelevelsoftheXfactor.Therearethreedensityoptions:
smoothcurveisthekerneldensityestimateforeachgroup.
AteachXvalue,theCompositionofDensitiesplotshowshoweachgroupcontributesto
thetotal.
RelatedInformation
ExampleoftheDensitiesOptionsonpage 199
Matching Column
UsetheMatching Columnoptiontospecifyamatching(ID)variableforamatchingmodel
analysis.TheMatching Columnoptionaddressesthecasewhenthedatainaonewayanalysis
comefrommatched(paired)data,suchaswhenobservationsindifferentgroupscomefrom
thesamesubject.
Note: Aspecialcaseofmatchingleadstothepairedttest.TheMatched Pairsplatform
handlesthistypeofdata,butthedatamustbeorganizedwiththepairsindifferentcolumns,
notindifferentrows.
TheMatching Columnoptionperformstwoprimaryactions:
Itfitsanadditivemodel(usinganiterativeproportionalfittingalgorithm)thatincludes
boththegroupingvariable(theXvariableintheFitYbyXanalysis)andthematching
variablethatyouselect.Theiterativeproportionalfittingalgorithmmakesadifferenceif
therearehundredsofsubjects,becausetheequivalentlinearmodelwouldbeveryslow
andwouldrequirehugememoryresources.
Itdrawslinesbetweenthepointsthatmatchacrossthegroups.Iftherearemultiple
observationswiththesamematchingIDvalue,linesaredrawnfromthemeanofthe
groupofobservations.
184
Oneway Analysis
Additional Examples of the Oneway Platform
Chapter 6
Basic Analysis
TheMatchingFitreportshowstheeffectswithFtests.Theseareequivalenttotheteststhat
yougetwiththeFitModelplatformifyouruntwomodels,onewiththeinteractiontermand
onewithout.Ifthereareonlytwolevels,thentheFtestisequivalenttothepairedttest.
Note: FordetailsabouttheFitModelplatform,seetheFittingLinearModelsbook.
RelatedInformation
ExampleoftheMatchingColumnOptiononpage 200
FortheexampleinFigure 6.14,themeansfordrugAandCarestatisticallydifferentfromthe
overallmean.ThedrugAmeanislowerandthedrugCmeanishigher.Notethedecision
limitsforthedrugtypesarenotthesame,duetodifferentsamplesizes.
Chapter 6
Basic Analysis
Oneway Analysis
Additional Examples of the Oneway Platform
185
FromFigure 6.15,noticethatthestandarddeviationforBrand2exceedsthelowerdecision
limit.Therefore,Brand2hassignificantlylowervariancethantheotherbrands.
186
Oneway Analysis
Additional Examples of the Oneway Platform
Chapter 6
Basic Analysis
Themeanscomparisonmethodcanbethoughtofasseeingiftheactualdifferenceinthe
meansisgreaterthanthedifferencethatwouldbesignificant.ThisdifferenceiscalledtheLSD
(leastsignificantdifference).TheLSDtermisusedforStudentstintervalsandincontextwith
intervalsforothertests.Inthecomparisoncirclesgraph,thedistancebetweenthecircles
centersrepresenttheactualdifference.TheLSDiswhatthedistancewouldbeifthecircles
intersectedatrightangles.
Chapter 6
Basic Analysis
Oneway Analysis
Additional Examples of the Oneway Platform
187
Figure 6.17ExampleofMeansComparisonsReportforEachPair,Studentst
InFigure 6.17,theLSDthresholdtableshowsthedifferencebetweentheabsolutedifferencein
themeansandtheLSD(leastsignificantdifference).Ifthevaluesarepositive,thedifferencein
thetwomeansislargerthantheLSD,andthetwogroupsaresignificantlydifferent.
188
Oneway Analysis
Additional Examples of the Oneway Platform
Figure 6.18ExampleofAllPairs,TukeyHSDComparisonCircles
Figure 6.19ExampleofMeansComparisonsReportforAllPairs,TukeyHSD
Chapter 6
Basic Analysis
Chapter 6
Basic Analysis
Oneway Analysis
Additional Examples of the Oneway Platform
189
InFigure 6.19,theTukeyKramerHSDThresholdmatrixshowstheactualabsolutedifference
inthemeansminustheHSD,whichisthedifferencethatwouldbesignificant.Pairswitha
positivevaluearesignificantlydifferent.Theq*(appearingabovetheHSDThresholdMatrix
table)isthequantilethatisusedtoscaletheHSDs.Ithasacomputationalrolecomparableto
aStudentst.
190
Oneway Analysis
Additional Examples of the Oneway Platform
Chapter 6
Basic Analysis
Figure 6.21ExampleofMeansComparisonsReportforWithBest,HsuMCB
TheComparisonvs.Min/Maxreportcompareseachleveltotheminimumandthemaximum
level.Forexample,level17istheonlylevelthatissignificantlydifferentfromtheminimum
level.
Chapter 6
Basic Analysis
Oneway Analysis
Additional Examples of the Oneway Platform
191
Alternatively,clickonarowtohighlightitinthescatterplotbeforeselectingtheCompare
Means > With Control, Dunnettsoption.Thetestusestheselectedrowasthecontrol
group.
8. ClickOK.
Figure 6.22ExampleofWithControl,DunnettsComparisonCircles
UsingthecomparisoncirclesinFigure 6.22,youcanconcludethatlevel17istheonlylevel
thatissignificantlydifferentfromthecontrollevelof12.
192
Oneway Analysis
Additional Examples of the Oneway Platform
Chapter 6
Basic Analysis
Althoughthefourmethodsalltestdifferencesbetweengroupmeans,differentresultscan
occur.Figure 6.23showsthecomparisoncirclesforallfourtests,withtheage17groupasthe
controlgroup.
Figure 6.23ComparisonCirclesforFourMultipleComparisonTests
FromFigure 6.23,noticethatfortheStudentstandHsumethods,agegroup15(thethird
circlefromthetop)issignificantlydifferentfromthecontrolgroupandappearsgray.But,for
theTukeyandDunnettmethod,agegroup15isnotsignificantlydifferent,andappearsred.
Chapter 6
Basic Analysis
Oneway Analysis
Additional Examples of the Oneway Platform
Figure 6.24ExampleoftheUnequalVariancesReport
Sincethepvaluefromthe2sidedFTestislarge,youcanconcludethatthevariancesare
equal.
193
194
Oneway Analysis
Additional Examples of the Oneway Platform
Chapter 6
Basic Analysis
8. ClickOK.
Figure 6.25ExampleofanEquivalenceTest
FromFigure 6.25,noticethefollowing:
TheUpperThresholdtestcomparestheactualdifferenceto6.
TheLowerThresholdtestcomparestheactualdifferenceto6.
Forbothtests,thepvalueissmall.Therefore,youcanconcludethattheactualdifference
inmeans(3.02)issignificantlydifferentfrom6and6.Foryourpurposes,youcandeclare
themeanstobepracticallyequivalent.
Chapter 6
Basic Analysis
Oneway Analysis
Additional Examples of the Oneway Platform
195
Figure 6.26ExampleofRobustFit
IfyoulookatthestandardAnalysisofVariancereport,youmightwronglyconcludethat
thereisadifferencebetweenthethreeformulations,sincethepvalueis.0319.However,when
youlookattheRobustFitreport,youwouldnotconcludethatthethreeformulationsare
significantlydifferent,becausethepvaluethereis.21755.Itappearsthatthetoxicityforafew
oftheobservationsisunusuallyhigh,creatingtheundueinfluenceonthedata.
196
Oneway Analysis
Additional Examples of the Oneway Platform
Chapter 6
Basic Analysis
11. ClickDone.
PoweriscomputedforeachcombinationofDeltaandNumber,andappearsinthePower
report.
Chapter 6
Basic Analysis
Oneway Analysis
Additional Examples of the Oneway Platform
197
ToplotthePowervalues:
12. Fromtheredtrianglemenuatthebottomofthereport,selectPower Plot.
Figure 6.28ExampleofthePowerReport
13. YoumightneedtoclickanddragverticallyonthePoweraxistoseeallofthedatainthe
plot.
PowerisplottedforeachcombinationofDeltaandNumber.Asyoumightexpect,thepower
risesforlargerNumber(samplesizes)valuesandforlargerDeltavalues(differencein
means).
198
Oneway Analysis
Additional Examples of the Oneway Platform
Chapter 6
Basic Analysis
Figure 6.29ExampleofaNormalQuantilePlot
FromFigure 6.29,noticethefollowing:
TheLineofFitappearsbydefault.
Thedatapointstrackverycloselytothelineoffit,indicatinganormaldistribution.
Chapter 6
Basic Analysis
Oneway Analysis
Additional Examples of the Oneway Platform
199
Figure 6.30ExampleofaCDFPlot
ThelevelsoftheXvariablesintheinitialOnewayanalysisappearintheCDFplotasdifferent
curves.ThehorizontalaxisoftheCDFplotusestheyvalueintheinitialOnewayanalysis.
200
Oneway Analysis
Additional Examples of the Oneway Platform
Chapter 6
Basic Analysis
Figure 6.31ExampleoftheDensitiesOptions
contribution of
males to height
density
contribution of
females to
height density
females contributed
about 61% to height
density at this X level
Chapter 6
Basic Analysis
Oneway Analysis
Additional Examples of the Oneway Platform
201
Figure 6.32ExampleoftheMatchingColumnReport
Theplotgraphsthemilestraveledbyseason,withsubjectasthematchingvariable.Thelabels
nexttothefirstmeasurementforeachsubjectonthegrapharedeterminedbythespeciesand
subjectvariables.
TheMatchingFitreportshowstheseasonand subjecteffectswithFtests.Theseare
equivalenttotheteststhatyougetwiththeFitModelplatformifyouruntwomodels,one
withtheinteractiontermandonewithout.Ifthereareonlytwolevels,thentheFtestis
equivalenttothepairedttest.
Note: FordetailsabouttheFitModelplatform,seetheFittingLinearModelsbook.
202
Oneway Analysis
Statistical Details for the Oneway Platform
Chapter 6
Basic Analysis
LSD = t 2 std 1 2
Thestandarderrorofthedifferenceoftwoindependentmeansiscalculatedfromthe
followingrelationship:
2
2
2
std 1 2 = std 1 + std 2
Whenthemeansareuncorrelated,thesequantitieshavethefollowingrelationship:
2
2
2
2
LSD = t 2 std 1 2 = t 2 std 1 + t 2 std 2
ThesesquaredvaluesformaPythagoreanrelationship,illustratedgraphicallybytheright
triangleshowninFigure 6.33.
Figure 6.33RelationshipoftheDifferencebetweenTwoMeans
t std 1
t std 1 2
--2
--2
t std 2
--2
Thehypotenuseofthistriangleisameasuringstickforcomparingmeans.Themeansare
significantlydifferentifandonlyiftheactualdifferenceisgreaterthanthehypotenuse(LSD).
Supposethatyouhavetwomeansthatareexactlyontheborderline,wheretheactual
differenceisthesameastheleastsignificantdifference.Drawthetrianglewithverticesatthe
meansmeasuredonaverticalscale.Also,drawcirclesaroundeachmeansothatthediameter
ofeachisequaltotheconfidenceintervalforthatmean.
Chapter 6
Basic Analysis
Oneway Analysis
Statistical Details for the Oneway Platform
203
Figure 6.34GeometricRelationshipofttestStatistics
t std 1
t std 1 2
--2
--2
t std 2
--2
Theradiusofeachcircleisthelengthofthecorrespondinglegofthetriangle,whichis
t 2 std i .
Thecirclesmustintersectatthesamerightangleasthetrianglelegs,givingthefollowing
relationship:
Ifthemeansdifferexactlybytheirleastsignificantdifference,thentheconfidenceinterval
circlesaroundeachmeanintersectatarightangle.Thatis,theangleofthetangentsisa
rightangle.
Now,considerthewaythatthesecirclesmustintersectifthemeansaredifferentbygreater
thanorlessthantheleastsignificantdifference:
Ifthecirclesintersectsothattheoutsideangleisgreaterthanarightangle,thenthemeans
arenotsignificantlydifferent.Ifthecirclesintersectsothattheoutsideangleislessthana
rightangle,thenthemeansaresignificantlydifferent.Anoutsideangleoflessthan90
degreesindicatesthatthemeansarefartherapartthantheleastsignificantdifference.
Ifthecirclesdonotintersect,thentheyaresignificantlydifferent.Iftheynest,theyarenot
significantlydifferent.SeeFigure 6.12.
Thesamegraphicaltechniqueworksformanymultiplecomparisontests,substitutinga
differentprobabilityquantilevaluefortheStudentst.
204
Oneway Analysis
Statistical Details for the Oneway Platform
Chapter 6
Basic Analysis
1=r1isthenumeratordf.
2=r(n1)isthedenominatordf.
nisthenumberpergroup.
risthenumberofgroups.
nc=n(CSS)/2isthenoncentralityparameter.
r
CSS =
g isthecorrectedsumofsquares.
g=1
gisthemeanofthegthgroup.
istheoverallmean.
2isestimatedbythemeansquarederror(MSE).
freedom.SeeTheAnalysisofVarianceReportonpage 159.
Chapter 6
Basic Analysis
Oneway Analysis
Statistical Details for the Oneway Platform
205
Statistical Details for the Tests That the Variances Are Equal Report
FRatio
OBrienstestconstructsadependentvariablesothatthegroupmeansofthenewvariable
equalthegroupsamplevariancesoftheoriginalresponse.TheOBrienvariableiscomputed
asfollows:
2
wherenrepresentsthenumberofyijkobservations.
BrownForsytheisthemodelFstatisticfromanANOVAon z ij = y ij y i where y i isthe
medianresponsefortheithlevel.Ifanyzijiszero,thenitisreplacedwiththenextsmallestzij
inthesamelevel.Thiscorrectsfortheartificialzerosoccurringinlevelswithoddnumbersof
observations.
TheLeveneFisthemodelFstatisticfromanANOVAon z ij = y ij y i. where y i. isthemean
responsefortheithlevel.
Bartlettstestiscalculatedasfollows:
vi
v log ---- s 2i v i log s i2
i v i
T = ---------------------------------------------------------------------- where v i = n i 1 and v = v i
1 1
i
---- ---
i v i v
1 + --------------------
3 k 1
andniisthecountontheithlevelandsi2istheresponsesamplevarianceontheithlevel.The
Bartlettstatistichasa2distribution.DividingtheChisquareteststatisticbythedegreesof
freedomresultsinthereportedFvalue.
WelchsTestFRatio
TheWelchsTestFRatioiscomputedasfollows:
wi y i y ..
i
-----------------------------------k1
ni
w i y i.
F = -----------------------------------------------------------------------w 2 where w i = ----- , u = w i , y .. = ------------ ,
1 ------i
2
u
i
si
i
2------------------k 2
u-
---------------------+
1
n 1
2
i
i
1
k
206
Oneway Analysis
Statistical Details for the Oneway Platform
Chapter 6
Basic Analysis
andniisthecountontheithlevel, y i. isthemeanresponsefortheithlevel,andsi2isthe
responsesamplevariancefortheithlevel.
WelchsTestDFDen
TheWelchapproximationforthedenominatordegreesoffreedomisasfollows:
1
df = -------------------------------------------------w2
1 ------i
3
u ---------------------------- 2 n 1
k 1 i i
wherewi,ni,anduaredefinedasintheFratioformula.
Chapter 7
Contingency Analysis
Examine Relationships between Two Categorical Variables
TheContingencyorFitYbyXplatformletsyouexplorethedistributionofacategorical
(nominalorordinal)variableYacrossthelevelsofasecondcategoricalvariableX.The
ContingencyplatformisthecategoricalbycategoricalpersonalityoftheFitYbyXplatform.The
analysisresultsincludeamosaicplot,frequencycounts,andproportions.Youcan
interactivelyperformadditionalanalysesandtestsonyourdata,suchasanAnalysisofMeans
forProportions,acorrespondenceanalysisplot,andsoon.
Figure 7.1ExampleofContingencyAnalysis
Contents
ExampleofContingencyAnalysis................................................. 209
LaunchtheContingencyPlatform................................................. 210
TheContingencyReport ......................................................... 210
ContingencyPlatformOptions .................................................... 212
MosaicPlot ..................................................................... 214
ContingencyTable............................................................... 216
Tests........................................................................... 218
AnalysisofMeansforProportions ................................................. 220
CorrespondenceAnalysis ........................................................ 220
CochranMantelHaenszelTest .................................................... 222
AgreementStatistic .............................................................. 222
RelativeRisk.................................................................... 223
TwoSampleTestforProportions.................................................. 224
MeasuresofAssociation.......................................................... 224
CochranArmitageTrendTest ..................................................... 226
ExactTest ...................................................................... 226
AdditionalExamplesoftheContingencyPlatform ................................... 228
StatisticalDetailsfortheContingencyPlatform...................................... 240
Chapter 7
Basic Analysis
Contingency Analysis
Example of Contingency Analysis
209
210
Contingency Analysis
The Contingency Report
Chapter 7
Basic Analysis
FromthemosaicplotandlegendinFigure 7.2,noticethefollowing:
VeryfewJapanesecarsfallintotheLargesizecategory.
ThemajorityoftheEuropeancarsfallintotheSmallandMediumsizecategories.
ThemajorityoftheAmericancarsfallintotheLargeandMediumsizecategories.
TolaunchtheContingencyplatform,fromtheJMPStarterwindow,clickontheBasic
categoryandclickContingency.
Figure 7.3TheContingencyLaunchWindow
Formoreinformationaboutthislaunchwindow,seeIntroductiontoFitYbyXchapteron
page 95.
Chapter 7
Basic Analysis
Contingency Analysis
The Contingency Report
211
Figure 7.4ExampleofaContingencyReport
Note: AnyrowsthatareexcludedinthedatatablearealsohiddenintheMosaicPlot.
TheContingencyreportinitiallyshowsaMosaicPlot,aContingencyTable,andaTestsreport.
Youcanaddotheranalysesandtestsusingtheoptionsthatarelocatedwithintheredtriangle
menu.Fordetailsaboutallofthesereportsandoptions,seeContingencyPlatformOptions
onpage 212.
212
Contingency Analysis
Contingency Platform Options
Chapter 7
Basic Analysis
Agraphicalrepresentationofthedatainthe
ContingencyTable.SeeMosaicPlotonpage 214.
Contingency Table
Atwowayfrequencytable.Thereisarowforeach
factorlevelandacolumnforeachresponselevel.See
ContingencyTableonpage 216.
Tests
AnalogoustotheAnalysisofVariancetablefor
continuousdata.Thetestsshowthattheresponselevel
ratesarethesameacrossXlevels.SeeTestson
page 218.
Set level
Changesthealphalevelusedinconfidenceintervals.
Selectoneofthecommonvalues(0.10,0.05,0.01)or
selectaspecificvalueusingtheOtheroption.
Onlyappearsiftheresponsehasexactlytwolevels.
ComparesresponseproportionsfortheXlevelstothe
overallresponseproportion.SeeAnalysisofMeansfor
Proportionsonpage 220.
Correspondence Analysis
Showswhichrowsorcolumnsofafrequencytablehave
similarpatternsofcounts.Inthecorrespondence
analysisplot,thereisapointforeachrowandforeach
columnofthecontingencytable.SeeCorrespondence
Analysisonpage 220.
Testsifthereisarelationshipbetweentwocategorical
variablesafterblockingacrossathirdclassification.See
CochranMantelHaenszelTestonpage 222.
Chapter 7
Basic Analysis
Contingency Analysis
Contingency Platform Options
213
Agreement Statistic
OnlyappearswhenboththeXandYvariableshavethe
samelevels.DisplaystheKappastatistic(Agresti1990),
itsstandarderror,confidenceinterval,hypothesistest,
andBowkerstestofsymmetry,alsoknowasMcNemars
test.SeeAgreementStatisticonpage 222.
Relative Risk
Calculatesriskratios.AppearsonlywhenboththeXand
Yvariableshaveonlytwolevels.SeeRelativeRiskon
page 223.
Odds Ratio
Appearsonlywhenthereareexactlytwolevelsforeach
variable.Producesareportoftheoddsratio.Formore
information,seeStatisticalDetailsfortheOddsRatio
Optiononpage 240.
Thereportalsogivesaconfidenceintervalforthisratio.
YoucanchangethealphalevelusingtheSet Level
option.
Performsatwosampletestforproportions.Thistest
comparestheproportionsoftheYvariablebetweenthe
twolevelsoftheXvariable.Appearsonlywhenboththe
XandYvariableshaveonlytwolevels.SeeTwoSample
TestforProportionsonpage 224.
Measures of Association
Describestheassociationbetweenthevariablesinthe
contingencytable.SeeMeasuresofAssociationon
page 224.
Testsfortrendsinbinomialproportionsacrosslevelsofa
singlevariable.Thistestisappropriateonlywhenone
variablehastwolevelsandtheothervariableisordinal.
SeeCochranArmitageTrendTestonpage 226.
Exact Test
Providesexactversionsofthefollowingtests:
FishersTest
CochranArmitageTrendTest
AgreementTest
SeeExactTestonpage 226.
214
Contingency Analysis
Mosaic Plot
Chapter 7
Basic Analysis
Mosaic Plot
ThemosaicplotisagraphicalrepresentationofthetwowayfrequencytableorContingency
Table.Amosaicplotisdividedintorectangles,sothattheareaofeachrectangleis
proportionaltotheproportionsoftheYvariableineachleveloftheXvariable.Themosaic
plotwasintroducedbyHartiganandKleinerin1981andrefinedbyFriendly(1994).
ToproducetheplotshowninFigure 7.5,followtheinstructionsinExampleofContingency
Analysisonpage 209.
Figure 7.5ExampleofaMosaicPlot
NotethefollowingaboutthemosaicplotinFigure 7.5:
TheproportionsonthexaxisrepresentthenumberofobservationsforeachleveloftheX
variable,whichiscountry.
TheproportionsontheyaxisatrightrepresenttheoverallproportionsofSmall,Medium,
andLargecarsforthecombinedlevels(American,European,andJapanese).
Thescaleoftheyaxisatleftshowstheresponseprobability,withthewholeaxisbeinga
probabilityofone(representingthetotalsample).
Clickingonarectangleinthemosaicplothighlightstheselectionandhighlightsthe
correspondingdataintheassociateddatatable.
Replacevariablesinthemosaicplotbydragginganddroppingavariable,inoneoftwoways:
swapexistingvariablesbydragginganddroppingavariablefromoneaxistotheotheraxis;
or,clickonavariableintheColumnspaneloftheassociateddatatableanddragitontoan
axis.
Chapter 7
Basic Analysis
Contingency Analysis
Mosaic Plot
215
Context Menu
Rightclickonthemosaicplottochangecolorsandlabelthecells.
Table 7.1ContextMenuOptions
Set Colors
Showsthecurrentassignmentofcolorstolevels.SeeSetColors
onpage 215.
Cell Labeling
Specifyalabeltobedrawninthemosaicplot.Selectoneofthe
followingoptions:
Unlabeled
Showsnolabels,andremovesanyoftheotheroptions.
eachcell.
Showstherowlabelsforalloftherows
representedbythecell.
Note: Fordescriptionsoftheremainderoftherightclickoptions,seetheUsingJMPbook.
SetColors
WhenyouselecttheSet Colorsoption,theSelectColorsforValueswindowappears.
Figure 7.6SelectColorsforValuesWindow
Thedefaultmosaiccolorsdependonwhethertheresponsecolumnisordinalornominal,and
whetherthereisanexistingValueColorscolumnproperty.Tochangethecolorforanylevel,
clickontheovalinthesecondcolumnofcolorsandpickanewcolor.
216
Contingency Analysis
Contingency Table
Chapter 7
Basic Analysis
Table 7.2DescriptionoftheSelectColorsforValuesWindow
Macros
Computesacolorgradientbetweenanytwolevels,as
follows:
Ifyouselectarangeoflevels(bydraggingthemouse
overthelevelsthatyouwanttoselect,orpressingthe
SHIFTkeyandclickingthefirstandlastlevel),the
Gradient Between Selected Points optionappliesacolor
gradienttothelevelsthatyouhaveselected.
UndoanyofyourchangesbyselectingRevert to Old
Colors.
Color Theme
Changesthecolorsforeachvaluebasedonacolortheme.
Ifyouselectthischeckbox,anewcolumnproperty(Value
Colors)isaddedtothecolumnintheassociateddatatable.
Toeditthispropertyfromthedatatable,selectCols >
Column Info.
Contingency Table
TheContingencyTableisatwowayfrequencytable.Thereisarowforeachfactorlevelanda
columnforeachresponselevel.
ToproducetheplotshowninFigure 7.7,followtheinstructionsinExampleofContingency
Analysisonpage 209.
Chapter 7
Basic Analysis
Contingency Analysis
Contingency Table
217
Figure 7.7ExampleofaContingencyTable
NotethefollowingaboutContingencytables:
TheCount,Total%,Col%,andRow%correspondtothedatawithineachcellthathasrow
andcolumnheadings(suchasthecellunderAmericanandLarge).
Thelastcolumncontainsthetotalcountsforeachrowandpercentagesforeachrow.
Thebottomrowcontainstotalcountsforeachcolumnandpercentagesforeachcolumn.
Forexample,inFigure 7.7,focusonthecarsthatarelargeandcomefromAmerica.The
followingtableexplainstheconclusionsthatyoucanmakeaboutthesecarsusingthe
ContingencyTable.
Table 7.3ConclusionsBasedonExampleofaContingencyTable
Number
Description
Label in Table
36
Numberofcarsthatarebothlargeandcomefrom
America
Count
11.88%
Percentageofallcarsthatarebothlargeandcome
fromAmerica(36/303)a.
Total%
85.71%
PercentageoflargecarsthatcomefromAmerica(36/
42)b
Col%
31.30%
PercentageofAmericancarsthatarelarge(36/115)c.
Row%
37.95%
PercentageofallcarsthatcomefromAmerica(115/
303).
(none)
13.86%
Percentageofallcarsthatarelarge(42/303).
(none)
218
Contingency Analysis
Tests
Chapter 7
Basic Analysis
a.303isthetotalnumberofcarsinthepoll.
b.42isthetotalnumberoflargecarsinthepoll.
c.115isthetotalnumberofAmericancarsinthepoll.
Tip: ToshoworhidedataintheContingencyTable,fromtheredtrianglemenunextto
ContingencyTable,selecttheoptionthatyouwanttoshoworhide.
Table 7.4DescriptionoftheContingencyTable
Count
Cellfrequency,margintotalfrequencies,andgrandtotal(total
samplesize).
Total%
Percentofcellcountsandmargintotalstothegrandtotal.
Row%
Percentofeachcellcounttoitsrowtotal.
Col%
Percentofeachcellcounttoitscolumntotal.
Expected
Expectedfrequency(E)ofeachcellundertheassumptionof
independence.Computedastheproductofthecorrespondingrow
totalandcolumntotaldividedbythegrandtotal.
Deviation
Observedcellfrequency(O)minustheexpectedcellfrequency(E).
CellChiSquare
Chisquarevaluescomputedforeachcellas(O E)2 / E.
ColCum
Cumulativecolumntotal.
ColCum%
Cumulativecolumnpercentage.
RowCum
Cumulativerowtotal.
RowCum%
Cumulativerowpercentage.
Tests
TheTestsreportshowstheresultsfortwoteststodeterminewhethertheresponselevelrates
arethesameacrossXlevels.
ToproducethereportshowninFigure 7.8,followtheinstructionsinExampleof
ContingencyAnalysisonpage 209.
Chapter 7
Basic Analysis
Contingency Analysis
Tests
219
Figure 7.8ExampleofaTestsReport
NotethefollowingabouttheChisquarestatistics:
Whenbothcategoricalvariablesareresponses(Yvariables),theChisquarestatisticstest
thattheyareindependent.
YoumighthaveaYvariablewithafixedXvariable.Inthiscase,theChisquarestatistics
testthatthedistributionoftheYvariableisthesameacrosseachXlevel.
Table 7.5DescriptionoftheTestsReport
N
Totalnumberofobservations.
DF
Recordsthedegreesoffreedomassociatedwiththetest.
Thedegreesoffreedomareequalto(c 1)(r 1),wherecisthenumberof
columnsandristhenumberofrows.
LogLike
Negativeloglikelihood,whichmeasuresfitanduncertainty(muchlike
sumsofsquaresincontinuousresponsesituations).
Rsquare(U)
Portionofthetotaluncertaintyattributedtothemodelfit.
AnR2of1meansthatthefactorscompletelypredictthecategorical
response.
AnR2of0meansthatthereisnogainfromusingthemodelinsteadof
fixedbackgroundresponserates.
Formoreinformation,seeStatisticalDetailsfortheTestsReporton
page 241.
Test
ListstwoChisquarestatisticaltestsofthehypothesisthattheresponse
ratesarethesameineachsamplecategory.Formoreinformation,see
StatisticalDetailsfortheTestsReportonpage 241.
Prob>ChiSq
Liststheprobabilityofobtaining,bychancealone,aChisquarevalue
greaterthantheonecomputedifnorelationshipexistsbetweenthe
responseandfactor.Ifbothvariableshaveonlytwolevels,Fishersexact
probabilitiesfortheonetailedtestsandthetwotailedtestalsoappear.
220
Contingency Analysis
Analysis of Means for Proportions
Chapter 7
Basic Analysis
Selectsthealphalevelusedintheanalysis.
Producesareportthatshowstheresponseproportionswith
decisionlimitsforeachleveloftheXvariable.Thereport
indicateswhetheralimithasbeenexceeded.
Changestheresponsecategoryusedintheanalysis.
Display Options
Showsorhidesthedecisionlimits,decisionlimitshading,
centerline,andpointoptions.
RelatedInformation
ExampleofAnalysisofMeansforProportionsonpage 228
Correspondence Analysis
Correspondenceanalysisisagraphicaltechniquetoshowwhichrowsorcolumnsofa
frequencytablehavesimilarpatternsofcounts.Inthecorrespondenceanalysisplot,thereisa
pointforeachrowandforeachcolumn.UseCorrespondenceAnalysiswhenyouhavemany
levels,makingitdifficulttoderiveusefulinformationfromthemosaicplot.
Chapter 7
Basic Analysis
Contingency Analysis
Correspondence Analysis
221
Producesa3Dscatterplot.
Takestheorderofthelevelssortedbythefirst
correspondencescorecoefficientandmakesacolumn
propertyforboththeXandYcolumns.
Providesthesingularvaluedecompositionofthecontingency
table.Fortheformula,seeStatisticalDetailsfortheDetails
Reportonpage 242.
Inertia
Liststhesquareofthesingularvalues,reflectingtherelative
variationaccountedforinthecanonicaldimensions.
Portion
Portionofinertiawithrespecttothetotalinertia.
Cumulative
Showsthecumulativeportionofinertia.Ifthefirsttwosingular
valuescapturethebulkoftheinertia,thenthe2D
correspondenceanalysisplotissufficienttoshowthe
relationshipsinthetable.
222
Contingency Analysis
Cochran-Mantel-Haenszel Test
Chapter 7
Basic Analysis
ThevaluesplottedontheCorrespondenceAnalysisplot
(Figure 7.11).
Yvariable(Response)
c1,c2,c3
ThevaluesplottedontheCorrespondenceAnalysisplot
(Figure 7.11).
RelatedInformation
ExampleofCorrespondenceAnalysisonpage 229
Cochran-Mantel-Haenszel Test
TheCochranMantelHaenszeltestdiscoversifthereisarelationshipbetweentwocategorical
variablesafterblockingacrossathirdclassification.
Table 7.9DescriptionoftheCochranMantelHaenszelTest
CorrelationofScores
ApplicablewheneithertheYorXisordinal.The
alternativehypothesisisthatthereisalinearassociation
betweenYandXinatleastoneleveloftheblocking
variable.
RowScorebyColCategories
ApplicablewhenYisordinalorinterval.Thealternative
hypothesisisthat,foratleastoneleveloftheblocking
variable,themeanscoresoftherrowsareunequal.
ColScorebyRowCategories
ApplicablewhenXisordinalorinterval.Thealternative
hypothesisisthat,foratleastoneleveloftheblocking
variable,themeanscoresoftheccolumnsareunequal.
GeneralAssoc.ofCategories
Teststhatforatleastoneleveloftheblockingvariable,
thereissometypeofassociationbetweenXandY.
RelatedInformation
Agreement Statistic
Whenthetwovariableshavethesamelevels,theAgreement Statisticoptionisavailable.This
optionshowstheKappastatistic(Agresti1990),itsstandarderror,confidenceinterval,
hypothesistest,andBowkerstestofsymmetry.
Chapter 7
Basic Analysis
Contingency Analysis
Relative Risk
223
TheKappastatisticandassociatedpvaluegiveninthissectionareapproximate.Anexact
versionoftheagreementtestisavailable.SeeExactTestonpage 226.
Table 7.10DescriptionoftheAgreementStatisticReport
Kappa
ShowstheKappastatistic.
StdErr
ShowsthestandarderroroftheKappastatistic.
Lower95%
ShowsthelowerendpointoftheconfidenceintervalforKappa.
Upper95%
ShowstheupperendpointoftheconfidenceintervalforKappa.
Prob>Z
ShowsthepvalueforaonesidedtestforKappa.Thenull
hypothesistestsifKappaequalszero.
Prob>|Z|
ShowsthepvalueforatwosidedtestforKappa.
ChiSquare
ShowstheteststatisticforBowkerstest.ForBowkerstestof
symmetry,thenullhypothesisisthattheprobabilitiesinthe
squaretablesatisfysymmetry,orthatpij=pjiforallpairsoftable
cells.WhenbothXandYhavetwolevels,thistestisequalto
McNemarstest.
Prob>ChiSq
ShowsthepvaluefortheBowkerstest.
RelatedInformation
ExampleoftheAgreementStatisticOptiononpage 234
StatisticalDetailsfortheAgreementStatisticOptiononpage 240
Relative Risk
Calculateriskratiosfor2x2contingencytablesusingtheRelative Riskoption.Confidence
intervalsalsoappearinthereport.Youcanfindmoreinformationaboutthismethodin
Agresti(1990)section3.4.2.
TheChooseRelativeRiskCategorieswindowappearswhenyouselecttheRelative Risk
option.Youcanselectasingleresponseandfactorcombination,oryoucancalculatetherisk
ratiosforallcombinationsofresponseandfactorlevels.
RelatedInformation
ExampleoftheRelativeRiskOptiononpage 235
224
Contingency Analysis
Two Sample Test for Proportions
Chapter 7
Basic Analysis
Showsthetestbeingperformed.
Proportion
Difference
ShowsthedifferenceintheproportionsbetweenthelevelsoftheX
variable.
Lower95%
Showsthelowerendpointoftheconfidenceintervalforthe
difference.BasedontheadjustedWaldconfidenceinterval.
Upper95%
Showstheupperendpointoftheconfidenceintervalforthe
difference.BasedontheadjustedWaldconfidenceinterval.
AdjustedWaldTest
Showstwotailedandonetailedtests.
Prob
Showsthepvaluesforthetests.
Response<variable>
categoryofinterest
Selectwhichresponseleveltouseinthetest.
RelatedInformation
ExampleofaTwoSampleTestforProportionsonpage 237
Measures of Association
Youcanrequestseveralstatisticsthatdescribetheassociationbetweenthevariablesinthe
contingencytablebyselectingtheMeasures of Associationoption.
Fordetailsaboutmeasuresofassociation,seethefollowingreferences:
BrownandBenedetti(1977)
GoodmanandKruskal(1979)
KendallandStuart(1979)
SnedecorandCochran(1980)
Somers(1962)
Chapter 7
Basic Analysis
Contingency Analysis
Measures of Association
225
Table 7.12DescriptionoftheMeasuresofAssociationReport
Gamma
Basedonthenumberofconcordantanddiscordantpairsand
ignorestiedpairs.Takesvaluesintherange1to1.
KendallsTaub
SimilartoGammaandusesacorrectionforties.Takesvaluesin
therange1to1.
StuartsTauc
SimilartoGammaandusesanadjustmentfortablesizeanda
correctionforties.Takesvaluesintherange1to1.
SomersD
AnasymmetricmodificationofTaub.
TheC|RdenotesthattherowvariableXisregardedasan
independentvariableandthecolumnvariableYisregardedas
dependent.
Similarly,theR|CdenotesthatthecolumnvariableYis
regardedasanindependentvariableandtherowvariableXis
dependent.
SomersDdiffersfromTaubinthatitusesacorrectionforties
onlywhenthepairistiedontheindependentvariable.Ittakes
valuesintherange1to1.
LambdaAsymmetric
ForC|R,isinterpretedastheprobableimprovementin
predictingthecolumnvariableYgivenknowledgeoftherow
variableX.
ForR|C,isinterpretedastheprobableimprovementin
predictingtherowvariableXgivenknowledgeaboutthe
columnvariableY.
Takesvaluesintherange0to1.
LambdaSymmetric
LooselyinterpretedastheaverageofthetwoLambdaAsymmetric
measures.Takesvaluesintherange0to1.
UncertaintyCoef
ForC|R,istheproportionofuncertaintyinthecolumn
variableYthatisexplainedbytherowvariableX.
ForR|C,isinterpretedastheproportionofuncertaintyinthe
rowvariableXthatisexplainedbythecolumnvariableY.
Takesvaluesintherange0to1.
UncertaintyCoef
Symmetric
SymmetricversionofthetwoUncertaintyCoefmeasures.Takes
valuesintherange0to1.
226
Contingency Analysis
Cochran Armitage Trend Test
Chapter 7
Basic Analysis
Eachstatisticappearswithitsstandarderrorandconfidenceinterval.Notethefollowing:
Gamma,KendallsTaub,StuartsTauc,andSomersDaremeasuresofordinalassociation
thatconsiderwhetherthevariableYtendstoincreaseasXincreases.Theyclassifypairsof
observationsasconcordantordiscordant.Apairisconcordantifanobservationwitha
largervalueofXalsohasalargervalueofY.Apairisdiscordantifanobservationwitha
largervalueofXhasasmallervalueofY.Thesemeasuresareappropriateonlywhenboth
variablesareordinal.
TheLambdaandUncertaintymeasuresareappropriateforordinalandnominalvariables.
RelatedInformation
ExampleoftheMeasuresofAssociationOptiononpage 238
ExampleoftheCochranArmitageTrendTestonpage 239
Exact Test
Thefollowingtabledescribestheexactversionsofthreeofthetestsavailableinthe
Contingencyplatform.
Chapter 7
Basic Analysis
Contingency Analysis
Exact Test
227
Table 7.13ExactTests
FishersExactTest
PerformsFishersExacttestforanrxctable.This
isatestforassociationbetweentwovariables.
Fishersexacttestassumesthattherowand
columntotalsarefixed,andusesthe
hypergeometricdistributiontocompute
probabilities.
Thistestdoesnotdependonanylargesample
distributionassumptions.Thismeansitis
appropriateforsituationswheretheLikelihood
RatioandPearsontestsbecomelessreliable,like
forsmallsamplesizesorsparsetables.
Thereportincludesthefollowinginformation:
Table Probability (P) givestheprobabilityforthe
observedtable.Thisisnotthepvalueforthe
test.
Two-sided Prob P givesthepvalueforthe
twosidedtest.
For2x2tables,theFishersExacttestis
automaticallyperformed.SeeTestsonpage 218.
ExactCochranArmitageTrendTest
PerformstheexactversionoftheCochran
ArmitageTrendTest.Thistestisavailableonly
whenoneofthevariableshastwolevels.Formore
detailsaboutthetrendtest,seeCochran
ArmitageTrendTestonpage 226.
ExactAgreementTest
Performsanexacttestfortestingagreement
betweenvariables.Thisisanexacttestforthe
Kappastatistic.Thisisavailableonlywhenthetwo
variableshavethesamelevels.Formoredetails
aboutagreementtesting,seeAgreementStatistic
onpage 222.
228
Contingency Analysis
Additional Examples of the Contingency Platform
Chapter 7
Basic Analysis
Chapter 7
Basic Analysis
Contingency Analysis
Additional Examples of the Contingency Platform
229
Figure 7.9ExampleofAnalysisofMeansforProportions
Figure 7.9showstheproportionofpatientswhowereontimefromeachclinic.From
Figure 7.9,noticethefollowing:
TheproportionofontimearrivalsisthehighestforclinicF,followedbyclinicB.
ClinicDhasthelowestproportionofontimearrivals,followedbyclinicA.
ClinicEandclinicCareclosetotheaverage,anddonotexceedthedecisionlimits.
230
Contingency Analysis
Additional Examples of the Contingency Platform
Chapter 7
Basic Analysis
Figure 7.10MosaicPlotfortheCheeseData
FromthemosaicplotinFigure 7.10,younoticethatthedistributionsdonotappearalike.
However,itischallengingtomakesenseofthemosaicplotacrossninelevels.A
correspondenceanalysiscanhelpdefinerelationshipsinthistypeofsituation.
7. Toseethecorrespondenceanalysisplot,fromtheredtrianglemenunexttoContingency
Analysis,selectCorrespondence Analysis.
Figure 7.11ExampleofaCorrespondenceAnalysisPlot
least liked
neutral
most liked
Figure 7.11showsthecorrespondenceanalysisgraphically,withtheplotaxeslabeledc1and
c2.Noticethefollowing:
c1seemstocorrespondtoageneralsatisfactionlevel.Thecheesesonthec1axisgofrom
leastlikedatthetoptomostlikedatthebottom.
c2seemstocapturesomequalitythatmakesBandDdifferentfromAandC.
CheeseDisthemostlikedcheese,withresponsesof8and9.
CheeseBistheleastlikedcheese,withresponsesof1,2,and3.
Chapter 7
Basic Analysis
Contingency Analysis
Additional Examples of the Contingency Platform
231
CheesesCandAareinthemiddle,withresponsesof4,5,6,and7.
8. FromtheredtrianglemenunexttoCorrespondenceAnalysis,select3D Correspondence
Analysis.
Figure 7.12Exampleofa3DScatterplot
FromFigure 7.12,noticethefollowing:
Lookingatthec1axis,responses1through5appeartotherightof0(positive).Responses
6through9appeartotheleftof0(negative).
Lookingatthec2axis,AandCappeartotherightof0(positive).BandDappeartothe
leftof0(negative).
Youcanhavetwoconclusions:c1correspondstothegeneralsatisfaction(fromleastto
mostliked).c2correspondstoaqualitythatmakesBandDdifferentfromAandC.
232
Contingency Analysis
Additional Examples of the Contingency Platform
Chapter 7
Basic Analysis
4. SelectFromandclickX, Factor.
5. ClickOK.
6. FromtheredtrianglemenunexttoContingencyTable,deselecteverythingexceptCount.
Figure 7.13ContingencyAnalysisforEmailData
Lookingatthefrequencytable,younoticethefollowingdifferences:
JeffsendsmessagestoeveryonebutreceivesmessagesonlyfromMichael.
MichaelandJohnsendmanymoremessagesthantheothers.
Michaelsendsmessagestoeveryone.
JohnsendsmessagestoeveryoneexceptJeff.
KatherineandAnnsendmessagesonlytoMichaelandJohn.
Furthervisualizetheresultsofthecontingencytablewithacorrespondenceanalysis.From
theredtrianglemenunexttoContingencyAnalysis,selectCorrespondence Analysis.
Chapter 7
Basic Analysis
Contingency Analysis
Additional Examples of the Contingency Platform
233
Figure 7.14CorrespondenceAnalysisforEmailData
FromtheDetailsreportinFigure 7.14,younoticethefollowing:
ThePortioncolumnshowsthatthebulkofthevariation(56% + 42%)ofthemailsending
patternissummarizedbyc1andc2,fortheToandFromgroups.
TheCorrespondenceAnalysisplotofc1andc2showsthepatternofmaildistribution
amongthemailgroup,asfollows:
KatherineandAnnhavesimilarsendingandreceivingpatterns;theybothsendemails
toMichaelandJohnandreceiveemailsfromMichael,John,andJeff.
JeffandMichaellieinasingledimension,buthaveoppositesendingandreceiving
patterns.JeffsendsemailstoeveryoneandreceivesemailsonlyfromMichael.
Michaelsendsemailtoeveryoneandreceivesemailfromeveryone.
Johnspatternsdifferfromtheothers.HesendsemailtoAnn,Katherine,andMichael,
andreceivesemailfromeveryone.
234
Contingency Analysis
Additional Examples of the Contingency Platform
Chapter 7
Basic Analysis
6. FromtheredtrianglemenunexttoContingencyAnalysis,select
Cochran Mantel Haenszel.
7. SelectProtein/FatasthegroupingvariableandclickOK.
Figure 7.15ExampleofaCochranMantelHaenszelTest
FromFigure 7.15,younoticethefollowing:
TheTestsreportshowsamarginallysignificantChisquareprobabilityofabout0.0799,
indicatingsomesignificanceintherelationshipbetweenhotdogtasteandtype.
However,theCochranMantelHaenszelreportshowsnorelationshipatallbetweenhot
dogtasteandtypeafteradjustingfortheproteinfatratio.TheChisquaretestforthe
adjustedcorrelationhasaprobabilityof0.857,andtheChisquareprobabilityassociated
withthegeneralassociationofcategoriesis0.282.
Chapter 7
Basic Analysis
Contingency Analysis
Additional Examples of the Contingency Platform
235
3. SelectAandclickY, Response.
4. SelectBandclickX, Factor.
5. ClickOK.
6. FromtheredtrianglemenunexttoContingencyAnalysis,selectAgreement Statistic.
Figure 7.16ExampleoftheAgreementStatisticReport
FromFigure 7.16,younoticethattheagreementstatisticof0.86ishigh(closeto1)andthe
pvalueof<.0001issmall.Thisreinforcesthehighagreementseenbylookingatthediagonal
ofthecontingencytable.Agreementbetweentheratersoccurswhenbothratersgivearating
of0orbothgivearatingof1.
236
Contingency Analysis
Additional Examples of the Contingency Platform
Chapter 7
Basic Analysis
6. FromtheredtrianglemenunexttoContingencyAnalysis,selectRelative Risk.
TheChooseRelativeRiskCategorieswindowappears.
Figure 7.17TheChooseRelativeRiskCategoriesWindow
NotethefollowingabouttheChooseRelativeRiskCategorieswindow:
Ifyouareinterestedinonlyasingleresponseandfactorcombination,youcanselectthat
here.Forexample,ifyouclickedOKinthewindowinFigure 7.17,thecalculationwould
beasfollows:
P Y = Married X = Female
--------------------------------------------------------------------------P Y = Married X = Male
Ifyouwouldliketocalculatetheriskratiosforall( 2 2 =4)combinationsofresponseand
factorlevels,selecttheCalculate All Combinationscheckbox.SeeFigure 7.18.
Chapter 7
Basic Analysis
Contingency Analysis
Additional Examples of the Contingency Platform
237
Toseehowtherelativeriskiscalculated,proceedasfollows:
1. ExaminethefirstentryintheRelativeRiskreport,whichisP(Married|Female)/
P(Married|Male).
2. YoucanfindtheseprobabilitiesintheContingencyTable.Sincetheprobabilitiesare
computedbasedontwolevelsofsex,whichdiffersacrosstherowsofthetable,usethe
Row%toreadtheprobabilities,asfollows:
P(Married|Female)=0.6884
P(Married|Male)=0.6121
Therefore,thecalculationsareasfollows:
0.6884
0.6121
238
Contingency Analysis
Additional Examples of the Contingency Platform
Chapter 7
Basic Analysis
Figure 7.19ExampleoftheTwoSampleTestforProportionsReport
Inthisexample,youarecomparingtheprobabilityofbeingmarriedbetweenfemalesand
males.SeetheRow%intheContingencyTabletoobtainthefollowing:
P(Married|Female)=0.6884
P(Married|Male)=0.6121
Thedifferencebetweenthesetwonumbers,0.0763,istheProportionDifferenceshowninthe
report.Thetwosidedpvalue(0.1686)islarge,indicatingthatthereisnosignificantdifference
betweentheproportions.
Chapter 7
Basic Analysis
Contingency Analysis
Additional Examples of the Contingency Platform
239
Figure 7.20ExampleoftheMeasuresofAssociationReport
Sincethevariablesthatyouwanttoexamine(sexandmaritalstatus)arenominal,usethe
LambdaandUncertaintymeasures.Theconfidenceintervalsincludezero,soyouconclude
thatthereisnoassociationbetweensexandmaritalstatus.
Thetwosidedpvalue(0.7094)islarge.Fromthis,youcanconcludethatthereisnotrendin
theproportionofmaleandfemalesthatpurchasedifferentsizesofcars.
240
Contingency Analysis
Statistical Details for the Contingency Platform
Chapter 7
Basic Analysis
Theasymptoticvarianceofthesimplekappacoefficientisestimatedbythefollowing:
2
2
A + B Cvar = ------------------------where A = p ii 1 p i. + p .i 1 , B = 1 2 p ij p .i + p j. and
2
ij
1 Pe n
i
2
C = P e 1
SeeFleiss,Cohen,andEveritt(1969).
ForBowkerstestofsymmetry,thenullhypothesisisthattheprobabilitiesinthetwobytwo
tablesatisfysymmetry(pij=pji).
wherepijisthecountintheithrowandjthcolumnofthe2x2table.
Chapter 7
Basic Analysis
Contingency Analysis
Statistical Details for the Contingency Platform
241
Thetotalnegativeloglikelihoodisfoundbyfittingfixedresponseratesacrossthetotal
sample.
Test
ThetwoChisquaretestsareasfollows:
TheLikelihood RatioChisquaretestiscomputedastwicethenegativeloglikelihoodfor
ModelintheTeststable.SomebooksusethenotationG2forthisstatistic.Thedifferenceof
twonegativeloglikelihoods,onewithwholepopulationresponseprobabilitiesandonewith
eachpopulationresponserates,iswrittenasfollows:
2
G = 2
ij
n ij
N
and p j = ------j
= -----N
N
Thisformulacanbemorecompactlywrittenasfollows:
n ij
2
n ln -----G = 2 ij e
ij
i j
ThePearsonChisquareiscalculatedbysummingthesquaresofthedifferencesbetweenthe
observedandexpectedcellcounts.ThePearsonChisquareexploitsthepropertythat
frequencycountstendtoanormaldistributioninverylargesamples.Thefamiliarformofthis
Chisquarestatisticisasfollows:
2
O E
--------------------E
whereOistheobservedcellcountsandEistheexpectedcellcounts.Thesummationisover
allcells.Thereisnocontinuitycorrectiondonehere,asissometimesdonein2by2tables.
242
Contingency Analysis
Statistical Details for the Contingency Platform
Dr
0.5
P rc' D c
where:
Pisthematrixofcountsdividedbythetotalfrequency
randcarerowandcolumnsumsofP
theDsarediagonalmatricesofthevaluesofrandc
Chapter 7
Basic Analysis
Chapter 8
Logistic Analysis
Examine Relationships between a Categorical Y and a Continuous X
Variable
TheLogisticplatformfitstheprobabilitiesforresponsecategoriestoacontinuousxpredictor.
Thefittedmodelestimatesprobabilitiesforeachxvalue.TheLogisticplatformisthenominal
orordinalbycontinuouspersonalityoftheFitYbyXplatform.Thereisadistinctionbetween
nominalandordinalresponsesonthisplatform:
Nominallogisticregressionestimatesasetofcurvestopartitiontheprobabilityamongthe
responses.
Ordinallogisticregressionmodelstheprobabilityofbeinglessthanorequaltoagiven
response.Thishastheeffectofestimatingasinglelogisticcurve,whichisshifted
horizontallytoproduceprobabilitiesfortheorderedcategories.Thismodelisless
complexandisrecommendedfororderedresponses.
Figure 8.1ExamplesofLogisticRegression
Contents
OverviewofLogisticRegression .................................................. 245
NominalLogisticRegression................................................... 245
OrdinalLogisticRegression.................................................... 245
ExampleofNominalLogisticRegression........................................... 246
LaunchtheLogisticPlatform ..................................................... 248
TheLogisticReport .............................................................. 248
LogisticPlot................................................................. 249
Iterations.................................................................... 250
WholeModelTest............................................................ 250
ParameterEstimates.......................................................... 252
LogisticPlatformOptions ........................................................ 253
ROCCurves ................................................................. 254
SaveProbabilityFormula...................................................... 255
InversePrediction ............................................................ 256
AdditionalExamplesofLogisticRegression ........................................ 256
ExampleofOrdinalLogisticRegression ......................................... 256
AdditionalExampleofaLogisticPlot ........................................... 258
ExampleofROCCurves ...................................................... 260
ExampleofInversePredictionUsingtheCrosshairTool........................... 261
ExampleofInversePredictionUsingtheInversePredictionOption................. 262
StatisticalDetailsfortheLogisticPlatform .......................................... 264
StatisticalDetailsfortheWholeModelTestReport ............................... 264
Chapter 8
Basic Analysis
Logistic Analysis
Overview of Logistic Regression
245
246
Logistic Analysis
Example of Nominal Logistic Regression
Chapter 8
Basic Analysis
Chapter 8
Basic Analysis
Logistic Analysis
Example of Nominal Logistic Regression
Figure 8.2ExampleofNominalLogisticReport
Theplotshowsthefittedmodel,whichisthepredictedprobabilityofbeingcured,asa
functionofln(dose).Thepvalueissignificant,indicatingthatthedosageamountshavea
significanteffectonwhethertherabbitsarecured.
247
248
Logistic Analysis
The Logistic Report
Chapter 8
Basic Analysis
TolaunchtheLogisticplatform,fromtheJMPStarterwindow,clickontheBasiccategory
andthenclickLogistic.
Figure 8.3TheLogisticLaunchWindow
Formoreinformationaboutthislaunchwindow,seeIntroductiontoFitYbyXchapteron
page 95.
Chapter 8
Basic Analysis
Logistic Analysis
The Logistic Report
249
Figure 8.4ExampleofaLogisticReport
TheLogisticreportwindowcontainstheLogisticplot,theIterationsreport,theWholeModel
Testreport,andtheParameterEstimatesreport.
Note: Theredtrianglemenuprovidesmoreoptionsthatcanaddtotheinitialreportwindow.
SeeLogisticPlatformOptionsonpage 253.
Logistic Plot
Thelogisticprobabilityplotgivesacompletepictureofwhatthelogisticmodelisfitting.At
eachxvalue,theprobabilityscaleintheydirectionisdividedup(partitioned)into
probabilitiesforeachresponsecategory.Theprobabilitiesaremeasuredasthevertical
distancebetweenthecurves,withthetotalacrossallYcategoryprobabilitiessummingto1.
250
Logistic Analysis
The Logistic Report
Chapter 8
Basic Analysis
Replacevariablesintheplotinoneoftwoways:swapexistingvariablesbydraggingand
droppingavariablefromoneaxistotheotheraxis;or,clickonavariableintheColumns
paneloftheassociateddatatableanddragitontoanaxis.
RelatedInformation
AdditionalExampleofaLogisticPlotonpage 258
Iterations
TheIterationsreportshowseachiterationandtheevaluatedcriteriathatdeterminewhether
themodelhasconverged.Iterationsappearonlyfornominallogisticregression.
TheReducedmodelonlycontainsanintercept.
(sometimes
calledSource)
TheFullmodelcontainsalloftheeffectsaswellasthe
intercept.
TheDifferenceisthedifferenceoftheloglikelihoodsofthefull
andreducedmodels.
DF
Recordsthedegreesoffreedomassociatedwiththemodel.
Chapter 8
Basic Analysis
Logistic Analysis
The Logistic Report
Measuresvariation,sometimescalleduncertainty,inthesample.
Full(thefullmodel)isthenegativeloglikelihood(oruncertainty)
calculatedafterfittingthemodel.Thefittingprocessinvolves
predictingresponserateswithalinearmodelandalogistic
responsefunction.Thisvalueisminimizedbythefittingprocess.
Reduced (thereducedmodel)isthenegativeloglikelihood(or
uncertainty)forthecasewhentheprobabilitiesareestimatedby
fixedbackgroundrates.Thisisthebackgrounduncertaintywhen
themodelhasnoeffects.
Thedifferenceofthesetwonegativeloglikelihoodsisthe
reductionduetofittingthemodel.Twotimesthisvalueisthe
likelihoodratioChisquareteststatistic.
ChiSquare
ThelikelihoodratioChisquaretestofthehypothesisthatthe
modelfitsnobetterthanfixedresponseratesacrossthewhole
sample.ItistwicetheLogLikelihoodfortheDifferenceModel.Itis
twotimesthedifferenceoftwonegativeloglikelihoods,onewith
wholepopulationresponseprobabilitiesandonewith
eachpopulationresponserates.
Formoreinformation,seeStatisticalDetailsfortheWholeModel
TestReportonpage 264.
Prob>ChiSq
Theobservedsignificanceprobability,oftencalledthepvalue,for
theChisquaretest.Itistheprobabilityofgetting,bychancealone,
aChisquarevaluegreaterthantheonecomputed.Modelsare
oftenjudgedsignificantifthisprobabilityisbelow0.05.
Rsquare(U)
Theproportionofthetotaluncertaintythatisattributedtothe
modelfit.
Totestthatthefactorvariablehasnoeffectontheresponse,lookat
thedifferencebetweenthefollowing:
theloglikelihoodfromthefittedmodel
theloglikelihoodfromthemodelthatuseshorizontallines
Formoreinformation,seeStatisticalDetailsfortheWholeModel
TestReportonpage 264.
AICc
ThecorrectedAkaikeInformationCriterion.
BIC
TheBayesianInformationCriterion.
251
252
Logistic Analysis
The Logistic Report
Chapter 8
Basic Analysis
Thetotalsamplesizeusedincomputations.Ifyouspecifieda
Weightvariable,thisisthesumoftheweights.
Theavailablemeasuresoffitareasfollows:
Entropy RSquare comparestheloglikelihoodsfromthefitted
modelandtheconstantprobabilitymodel.
isageneralizationoftheRsquaremeasure
thatsimplifiestotheregularRsquareforcontinuousnormal
responses.ItissimilartotheEntropyRSquare,butinsteadof
usingtheloglikelihood,itusesthe2/nrootofthelikelihood.
Generalized RSquare
probabilityassociatedwiththeeventthatoccurred.
RMSE istherootmeansquareerror,wherethedifferencesare
betweentheresponseandp(thefittedprobabilityfortheevent
thatactuallyoccurred).
istheaverageoftheabsolutevaluesofthe
differencesbetweentheresponseandp(thefittedprobability
fortheeventthatactuallyoccurred).
withthehighestfittedprobabilityisnottheobservedcategory.
ForEntropyRSquareandGeneralizedRSquare,valuescloserto1
indicateabetterfit.ForMeanLogp,RMSE,MeanAbsDev,and
MisclassificationRate,smallervaluesindicateabetterfit.
Training
Thevalueofthemeasureoffit.
Definition
Thealgebraicdefinitionofthemeasureoffit.
Parameter Estimates
Thenominallogisticmodelfitsaparameterfortheinterceptandslopeforeachof k 1 logistic
comparisons,wherekisthenumberofresponselevels.TheParameterEstimatesreportlists
theseestimates.Eachparameterestimatecanbeexaminedandtestedindividually,although
thisisseldomofmuchinterest.
Chapter 8
Basic Analysis
Logistic Analysis
Logistic Platform Options
253
Table 8.2DescriptionoftheParameterEstimatesReport
Term
Listseachparameterinthelogisticmodel.Thereisaninterceptanda
slopetermforthefactorateachleveloftheresponsevariable,exceptthe
lastlevel.
Estimate
Liststheparameterestimatesgivenbythelogisticmodel.
StdError
Liststhestandarderrorofeachparameterestimate.Theyareusedto
computethestatisticalteststhatcompareeachtermtozero.
ChiSquare
ListstheWaldtestsforthehypothesesthateachoftheparametersis
zero.TheWaldChisquareiscomputedas(Estimate/StdError)2.
Prob>ChiSq
ListstheobservedsignificanceprobabilitiesfortheChisquaretests.
Covariance of Estimates
Reportstheestimatedvariancesoftheparameterestimates,andtheestimatedcovariances
betweentheparameterestimates.Thesquarerootofthevarianceestimatesisthesameas
thosegivenintheStd Errorsection.
AddsoddsratiostotheParameterEstimatesreport.For
moredetails,seetheFittingLinearModelsbook.
Thisoptionisavailableonlyforaresponsewithtwolevels.
Inverse Prediction
Predictionofxvaluesfromgivenyvalues.Formore
information,seeInversePredictiononpage 256.
Thisoptionisavailableonlyforaresponsewithtwolevels.
Logistic Plot
Showsorhidesthelogisticplot.
254
Logistic Analysis
Logistic Platform Options
Plot Options
Chapter 8
Basic Analysis
ThePlotOptionsmenuincludesthefollowingoptions:
Show Points Togglesthepointsonoroff.
Isusefulonlyifyouhaveseveralpoints
foreachxvalue.Inthesecases,yougetreasonable
estimatesoftherateateachvalue,andcomparethisrate
withthefittedlogisticcurve.Topreventtoomany
degeneratepoints,usuallyatzeroorone,JMPonly
showstheratevalueifthereareatleastthreepointsat
thexvalue.
Line Color
Enablesyoutopickthecoloroftheplotcurves.
ROC Curve
AReceiverOperatingCharacteristiccurveisaplotof
sensitivityby(1specificity)foreachvalueofx.SeeROC
Curvesonpage 254.
Lift Curve
Producesaliftcurveforthemodel.Aliftcurveshowsthe
sameinformationasaROCcurve,butinawayto
dramatizetherichnessoftheorderingatthebeginning.The
Yaxisshowstheratioofhowrichthatportionofthe
populationisinthechosenresponselevelcomparedtothe
rateofthatresponselevelasawhole.SeetheFittingLinear
Modelsbookfordetailsaboutliftcurves.
Createsnewdatatablecolumnsthatcontainformulas.See
SaveProbabilityFormulaonpage 255.
Script
Thismenucontainsoptionsthatareavailabletoall
platforms.Theyenableyoutoredotheanalysisorsavethe
JSLcommandsfortheanalysistoawindoworafile.For
moreinformation,seeUsingJMP.
ROC Curves
Supposeyouhaveanxvaluethatisadiagnosticmeasurementandyouwanttodeterminea
thresholdvalueofxthatindicatesthefollowing:
Aconditionexistsifthexvalueisgreaterthanthethreshold.
Aconditiondoesnotexistifthexvalueislessthanthethreshold.
Chapter 8
Basic Analysis
Logistic Analysis
Logistic Platform Options
255
Forexample,youcouldmeasureabloodcomponentlevelasadiagnostictesttopredictatype
ofcancer.Nowconsiderthediagnostictestasyouvarythethresholdand,thus,causemoreor
fewerfalsepositivesandfalsenegatives.Youthenplotthoserates.Theidealistohaveavery
narrowrangeofxcriterionvaluesthatbestdividestruenegativesandtruepositives.The
ReceiverOperatingCharacteristic(ROC)curveshowshowrapidlythistransitionhappens,
withthegoalbeingtohavediagnosticsthatmaximizetheareaunderthecurve.
Twostandarddefinitionsusedinmedicineareasfollows:
Sensitivity,theprobabilitythatagivenxvalue(atestormeasure)correctlypredictsan
existingcondition.Foragivenx,theprobabilityofincorrectlypredictingtheexistenceofa
conditionis1 sensitivity.
Specificity,theprobabilitythatatestcorrectlypredictsthataconditiondoesnotexist.
AROCcurveisaplotofsensitivityby(1 specificity)foreachvalueofx.Theareaunderthe
ROCcurveisacommonindexusedtosummarizetheinformationcontainedinthecurve.
Whenyoudoasimplelogisticregressionwithabinaryoutcome,thereisaplatformoptionto
requestaROCcurveforthatanalysis.AfterselectingtheROC Curveoption,awindowasks
youtospecifywhichleveltouseaspositive.
Ifatestpredictedperfectly,itwouldhaveavalueabovewhichtheentireabnormalpopulation
wouldfallandbelowwhichallnormalvalueswouldfall.Itwouldbeperfectlysensitiveand
thenpassthroughthepoint(0,1)onthegrid.TheclosertheROCcurvecomestothisideal
point,thebetteritsdiscriminatingability.Atestwithnopredictiveabilityproducesacurve
thatfollowsthediagonalofthegrid(DeLong,etal.1988).
TheROCcurveisagraphicalrepresentationoftherelationshipbetweenfalsepositiveand
truepositiverates.Astandardwaytoevaluatetherelationshipiswiththeareaunderthe
curve,shownbelowtheplotinthereport.Intheplot,ayellowlineisdrawnata45degree
angletangenttotheROCCurve.Thismarksagoodcutoffpointundertheassumptionthat
falsenegativesandfalsepositiveshavesimilarcosts.
RelatedInformation
ExampleofROCCurvesonpage 260
formulasforlinearcombinations(typicallycalledlogits)ofthexfactor
predictionformulasfortheresponselevelprobabilities
apredictionformulathatgivesthemostlikelyresponse
256
Logistic Analysis
Additional Examples of Logistic Regression
Chapter 8
Basic Analysis
Inverse Prediction
Inversepredictionistheoppositeofprediction.Itisthepredictionofxvaluesfromgiveny
values.Butinlogisticregression,insteadofayvalue,youhavetheprobabilityattributedto
oneoftheYlevels.Thisfeatureonlyworkswhentherearetworesponsecategories(abinary
response).
TheFitModelplatformalsohasanoptionthatgivesaninversepredictionwithconfidence
limits.TheFittingLinearModelsbookgivesmoreinformationaboutinverseprediction.
RelatedInformation
ExampleofInversePredictionUsingtheCrosshairToolonpage 261
ExampleofInversePredictionUsingtheInversePredictionOptiononpage 262
Chapter 8
Basic Analysis
Logistic Analysis
Additional Examples of Logistic Regression
257
Figure 8.5ExampleofOrdinalLogisticReport
Youinterpretthisreportthesamewayasthenominalreport.SeeTheLogisticReporton
page 248.
Intheplot,markersforthedataaredrawnattheirxcoordinate.Whenseveraldatapoints
appearatthesameyposition,thepointsarejittered.Thatis,smallspacesappearbetweenthe
datapointssoyoucanseeeachpointmoreclearly.
Wheretherearemanypoints,thecurvesarepushedapart.Wheretherearefewtonopoints,
thecurvesareclosetogether.Thedatapushesthecurvesinthatwaybecausethecriterionthat
ismaximizedistheproductoftheprobabilitiesfittedbythemodel.Thefittriestoavoid
pointsattributedtohaveasmallprobability,whicharepointscrowdedbythecurvesoffit.
SeetheFittingLinearModelsbookformoreinformationaboutcomputationaldetails.
258
Logistic Analysis
Additional Examples of Logistic Regression
Chapter 8
Basic Analysis
FordetailsabouttheWholeModelTestreportandtheParameterEstimatesreport,seeThe
LogisticReportonpage 248.IntheParameterEstimatesreport,aninterceptparameteris
estimatedforeveryresponselevelexceptthelast,butthereisonlyoneslopeparameter.The
interceptparametersshowthespacingoftheresponselevels.Theyalwaysincrease
monotonically.
Chapter 8
Basic Analysis
Logistic Analysis
Additional Examples of Logistic Regression
259
InFigure 8.6,notethefollowingobservations:
Thefirst(bottom)curverepresentstheprobabilitythatacaratagivenweightisSporty.
ThesecondcurverepresentstheprobabilitythatacarisSmallorSporty.Lookingonlyat
thedistancebetweenthefirstandsecondcurvescorrespondstotheprobabilityofbeing
Small.
Asyoumightexpect,heaviercarsaremorelikelytobeLarge.
Markersforthedataaredrawnattheirxcoordinate,withtheypositionjitteredrandomly
withintherangecorrespondingtotheresponsecategoryforthatrow.
Ifthexvariablehasnoeffectontheresponse,thenthefittedlinesarehorizontalandthe
probabilitiesareconstantforeachresponseacrossthecontinuousfactorrange.Figure 8.7
showsalogisticplotwhereWeightisnotusefulforpredictingType.
Figure 8.7ExamplesofSampleDataTableandLogisticPlotShowingNoybyxRelationship
260
Logistic Analysis
Additional Examples of Logistic Regression
Chapter 8
Basic Analysis
Figure 8.8ExamplesofSampleDataTableandLogisticPlotShowinganAlmostPerfectybyx
Relationship
Chapter 8
Basic Analysis
Logistic Analysis
Additional Examples of Logistic Regression
261
Figure 8.9ExamplesofROCCurveandTable
SincetheROCcurveiswellaboveadiagonalline,youconcludethatthemodelhasgood
predictiveability.
262
Logistic Analysis
Additional Examples of Logistic Regression
Chapter 8
Basic Analysis
7. Placethehorizontalcrosshairlineatabout0.5onthevertical(Response)probabilityaxis.
8. Movethecrosshairintersectiontothepredictionline,andreadtheIn(dose)valuethat
showsonthehorizontalaxis.
Inthisexample,arabbitwithaIn(dose) ofapproximately0.9isequallylikelytobecuredasit
istodie.
Figure 8.10ExampleofCrosshairToolonLogisticPlot
Chapter 8
Basic Analysis
Logistic Analysis
Additional Examples of Logistic Regression
263
10. ClickOK.
TheInversePredictionplotappears.
Figure 8.11InversePredictionWindow
Figure 8.12ExampleofInversePredictionPlot
Theestimatesofthexvaluesandtheconfidenceintervalsareshowninthereportaswellasin
theprobabilityplot.Forexample,thevalueofln(Dose)thatresultsina90%probabilityof
beingcuredisestimatedtobebetween0.526and0.783.
264
Logistic Analysis
Statistical Details for the Logistic Platform
Chapter 8
Basic Analysis
G = 2 ln p background ln p model
wherethesummationsareoverallobservationsinsteadofallcells.
Rsquare(U)
Theratioofthisteststatistictothebackgroundloglikelihoodissubtractedfrom1tocalculate
R2.Moresimply,Rsquare(U)iscomputedasfollows:
--------------------------------------------------------------------------loglikelihoodforDifference loglikelihoodforReduced
usingquantitiesfromtheWholeModelTestreport.
Chapter 9
Matched Pairs Analysis
Compare Measurements on the Same Subject
TheMatchedPairsplatformcomparesthemeansbetweentwoormorecorrelatedvariables
andassessesthedifferences.Forexample,youmightcompareabloodpressuremeasurement
takenonthesamesubjectbeforeatreatmentandagainafterthetreatment.Astatistical
methodcalledthepairedttesttakesthecorrelatedresponsesintoaccount.
Theplatformproducesagraphofthepaireddifferencesbythepairedmeans,andthepaired
ttestresultsforallthreealternativehypotheses.Additionalfeaturesprovideformorethan
twomatchedresponsesandforagroupingcolumntotestacrosssamples,inasimpleversion
ofrepeatedmeasuresanalysis.
Figure 9.1ExampleofMatchedPairsAnalysis
Contents
OverviewoftheMatchedPairsPlatform ........................................... 267
ExampleofComparingMatchedPairs ............................................. 267
LaunchtheMatchedPairsPlatform ................................................ 268
MultipleYColumns.......................................................... 269
TheMatchedPairsReport........................................................ 270
DifferencePlotandReport .................................................... 271
AcrossGroups ............................................................... 271
MatchedPairsPlatformOptions................................................... 272
ExampleComparingMatchedPairsacrossGroups .................................. 272
StatisticalDetailsfortheMatchedPairsPlatform .................................... 274
GraphicsforMatchedPairs .................................................... 274
CorrelationofResponses ...................................................... 275
Chapter 9
Basic Analysis
267
UsetheSplit optionintheTablesmenutosplitthecolumnofmeasurementsintotwo
columns.ThenyoucanusetheMatchedPairsplatform.
Fortworesponsecolumns,createathirdcolumnthatcalculatesthedifferencebetweenthe
tworesponses.Thentestthatthemeanofthedifferencecolumniszerowiththe
Distributionplatform.
Forthetworesponsesstoredinasinglecolumn,youcandoatwowayanalysisof
variance.Onefactor(theIDvariable)identifiesthetworesponsesandtheotherfactor
identifiesthesubject.UsetheFitYbyXOnewayplatformwithablockingvariable(the
subjectcolumn),orusetheFitModelplatformtodoatwowayANOVA.Thetestonthe
IDfactorisequivalenttothepairedttest.
Note: Ifthedataarepaired,donotdoaregularindependentttest.Donotstackthedatainto
onecolumnandusetheFitYbyXOnewayANOVAontheIDwithoutspecifyingablock
variable.Todothishastheeffectofignoringthecorrelationbetweentheresponses.This
causesthetesttooverestimatetheeffectifresponsesarenegativelycorrelated,orto
underestimatetheeffectifresponsesarepositivelycorrelated.
268
Chapter 9
Basic Analysis
Thereportwindowappears.
Figure 9.2TheMatchedPairsReportWindow
Theresultsshowthat,onaverage,thetympanicthermometermeasures1.12degreeshigher
thantheoralthermometer.Thesmallpvalue(Prob>|t|)indicatesthatthisdifferenceis
statisticallysignificant,andnotduetochance.
Notethatthismatchedpairsanalysisdoesnotindicatewhichthermometeriscorrect(if
either),butonlyindicatesthatthereisadifferencebetweenthethermometers.
Chapter 9
Basic Analysis
269
Figure 9.3TheMatchedPairsLaunchWindow
Table 9.1DescriptionoftheMatchedPairsLaunchWindow
Y, Paired Response
Providethetworesponsecolumns.Forinformationabout
analyzingmorethantworesponses,seeMultipleYColumns
onpage 269.
X, Grouping
(Optional)Provideagroupingvariabletocomparethe
differencesacrossgroups.Formoreinformation,seeAcross
Groupsonpage 271.
Weight
(Optional)Identifiesonecolumnwhosenumericvaluesassign
aweighttoeachrowintheanalysis.
Freq
(Optional)Identifiesonecolumnwhosenumericvaluesassign
afrequencytoeachrowintheanalysis.
By
(Optional)Performsaseparatematchedpairsanalysisforeach
leveloftheByvariable.
Formoreinformationaboutthelaunchwindow,seeUsingJMP.
Multiple Y Columns
Youcanhavemorethantworesponses.Ifthenumberofresponsesisodd,allpossiblepairs
areanalyzed.Thefollowingtableshowsanexampleforthreeresponses.
Y1byY2
Y1byY3
Y2byY3
270
Chapter 9
Basic Analysis
Ifthenumberofresponsesiseven,theMatchedPairsplatformaskswhetheryouwanttodo
allpossiblepairs.Ifyoudonotdoallpossiblepairs,adjacentresponsesareanalyzedasapair.
Thefollowingtableshowsthearrangementofanalysesforfourresponses.
Y1byY2
Y3byY4
Note: Theredtrianglemenuprovidesadditionaloptionsthatcanaddreportstotheinitial
reportwindow.SeeMatchedPairsPlatformOptionsonpage 272.
Chapter 9
Basic Analysis
271
Themeandifferenceisshownasthehorizontalline,withthe95%confidenceinterval
aboveandbelowshownasdottedlines.Iftheconfidenceregionincludeszero,thenthe
meansarenotsignificantlydifferentatthe0.05level.Inthisexamplethedifferenceis
significant.
Ifyouaddareferenceframe,themeanofpairsisshownbytheverticalline.Fordetails
aboutareferenceframe,seeMatchedPairsPlatformOptionsonpage 272.
TheDifferencereportshowsthemeanofeachresponse,thedifferenceofthemeans,anda
confidenceintervalforthedifference.TheDifferencereportalsoshowstheresultsofthe
pairedttest.
Across Groups
Note: TheAcrossGroupsreportappearsonlyifyouhavespecifiedanX, Groupingvariable.
TheAcrossGroupsanalysiscorrespondstoasimplerepeatedmeasuresanalysis.(Youcanget
thesametestresultsusingtheManovapersonalityoftheFitModelplatform.)
Table 9.2DescriptionoftheAcrossGroupsReport
Mean Difference
Showsthemeanofthedifferenceacrossrowsineach
groupbetweenthetwopairedcolumns.Inotherwords,
thisisthewithinsubjectbyacrosssubjectinteraction,
orsplitplotbywholeplotinteraction.
Mean Mean
Showsthemeanofthemeanacrossrowsineachgroup
acrossthetwopairedcolumns.Inotherwords,thisis
theacrosssubjectorwholeploteffect.
TwoFtestsdeterminewhethertheacrossgroups
valuesaredifferent:
Mean Differenceteststhatthechangeacrossthepair
ofresponsesisdifferentindifferentgroups.
Mean Meanteststhattheaverageresponsefora
subjectisdifferentindifferentgroups
RelatedInformation
ExampleComparingMatchedPairsacrossGroupsonpage 272
272
Chapter 9
Basic Analysis
Showsorhidestheplotofthepaireddifferencesbypaired
means.Foradetaileddescriptionofthisplot,seeDifference
PlotandReportonpage 271.
Showsorhidestheplotofpaireddifferencesbyrownumber.
Reference Frame
ShowsorhidesthereferenceframeonthePlotDifbyMean
plot.ArectangleshowingwhereaplotofY2byY1wouldbe
locatedinsidetheplot,tiltedandpossiblysquished.Avertical
redlineisshownrepresentingthemeanofmeans.The
referenceframeisshowninitiallywhentherangeofthe
differencesisgreaterthanhalftherangeofthedata.
ShowsorhidestheWilcoxonSignedRanktest.Thisisa
nonparametricversionofthepairedttestthatcomparesthe
sizesofthepositivedifferencestothesizesofthenegative
differences.Thistestdoesassumethatthedistributionofthe
differencesissymmetric.(SeeConover1999,page350.)
Sign Test
ShowsorhidestheSignTest.Thisisanonparametricversion
ofthepairedttestthatusesonlythesign(positiveor
negative)ofthedifferenceforthetest.
Set Level
Changesthealphalevelusedintheanalyses.Affectsthe
confidenceintervalsinthereportandontheplot.
Script
Thismenucontainsoptionsthatareavailabletoallplatforms.
TheyenableyoutoredotheanalysisorsavetheJSL
commandsfortheanalysistoawindoworafile.Formore
information,seeUsingJMP.
Chapter 9
Basic Analysis
273
274
Chapter 9
Basic Analysis
TheFRatiofortheMeanDifferenceintheAcrossGroupsreportcorrespondstotheFRatiofor
Time*drugundertheWithinSubjectsreport.TheFRatiofortheMeanMeanintheAcross
GroupsreportcorrespondstotheFRatiofordrugunderBetweenSubjectsintheManovaFit
report.
Chapter 9
Basic Analysis
275
Figure 9.6ExampleofTransformingtoDifferencebyMean,Rotatedby45Degrees
Before rotation, the axes
represent y1 and y2.
1 1 y1 = y2 + y1
1 1 y2
y2 y1
After rotation, the axes represent
a sum and difference.
y2
y2
y2 y1
y1
y1
y2 + y1
region where
y1 < y2.
mean difference
and 95% CI
line where
y1 = y2.
y2 y1
region where
y1 > y2.
y2 + y1
2
rescaled to mean
of responses
Correlation of Responses
Inmostcaseswherethepairofmeasurementsistakenfromthesameindividualatdifferent
times,theyarepositivelycorrelated.However,iftheyrepresentcompetingresponses,the
correlationcanbenegative.
Chapter 9
Basic Analysis
Figure 9.7showshowthepositivecorrelationofthetworesponsesbecomesthesmallvariance
onthedifference(theyaxis).Ifthecorrelationisnegative,theellipseisorientedintheother
directionandthevarianceoftherotatedgraphislargeontheyaxis.
difference
Y2
Figure 9.7ExamplesofPositiveCorrelationBeforeandAfterRotation
Y2
276
Y1
Y1
mean
Chapter 10
Bootstrapping
Approximate the Distribution of a Statistic through Resampling
Bootstrappingisaresamplingmethodforapproximatingthesamplingdistributionofa
statistic.Thedataisresampledwithreplacementandthestatisticiscomputed.Thisprocessis
repeatedtoproduceadistributionofvaluesforthestatistic.
Bootstrappingisusefulwhenestimatingpropertiesofastatistic(mean,standarderror,andso
on)andperforminginference,inthefollowingsituations:
Thetheoreticaldistributionofthestatisticiscomplicatedorunknown.
Inferenceusingparametricmethodsisnotpossibleduetoviolationsofassumptions.
JMPprovidesbootstrappingforstatisticalplatformsthatsupportFrequencycolumnsin
whichtherowsareassumedtobeindependent.
TheBootstrapoptionisontherightclickmenu,separatefromstandardplatformcommands.
Figure 10.1ExampleoftheDistributionforBootstrappingResults
Contents
ExampleofBootstrapping ........................................................ 279
PerformaBootstrapAnalysis..................................................... 280
BootstrapWindowOptions ....................................................... 280
StackedResultsTable............................................................ 281
UnstackedResultsTable.......................................................... 282
AnalysisofBootstrapResults..................................................... 282
Chapter 10
Basic Analysis
Bootstrapping
Example of Bootstrapping
279
Example of Bootstrapping
Atiremanufacturerwantstopredictanengineshorsepowerfromtheenginesdisplacement
(in3).Thecompanyismostinterestedinestimatingtheslopeoftherelationshipbetweenthe
variables.Theslopevalueshelpthecompanypredictthecorrespondingchangein
horsepowerwhenthedisplacementchanges.
1. OpentheCar Physical Data.jmpsampledatatable.
2. SelectAnalyze > Fit Y by X.
3. AssignHorsepowertotheY, Responserole.
4. AssignDisplacementtotheX, Factorrole.
5. ClickOK.
6. SelectFit LinefromtheBivariateFitredtrianglemenu.
7. IntheParameter Estimatesreport,rightclicktheEstimatecolumnandselectBootstrap
(Figure 10.2).
Figure 10.2SelecttheBootstrapOption
280
Bootstrapping
Bootstrap Window Options
Chapter 10
Basic Analysis
Tip: MakesurethatthecorrectdatatableisselectedpriortorunningtheDistribution,orelse
theBootstrapConfidenceLimitsreportwillnotappear.
12. AssignDisplacementtotheY, Columnsrole.
13. ClickOK.
TheDistributionreportincludestheBootstrapConfidenceLimitsreport(Figure 10.3).
Figure 10.3BootstrapReport
Inthisexample,theestimateoftheslopefromtheoriginaldatais0.504.For95%coverageof
thebootstrapresults,thecompanycanestimatetheslopetobebetween0.40186and0.616.
Whenthedisplacementischangedbyoneunit,thehorsepowerchangesbysomeamount
between0.40186and0.616.
Chapter 10
Basic Analysis
Fractional Weights
Bootstrapping
Stacked Results Table
281
Usesfractionalweightsintheresamplingprocess.
IfFractional Weightsisnotselected,thefollowingobservationsaretrue:
Thesamplingweightsassignedtotheoriginalobservationsareintegers.
Theinfluenceofanobservationisdeterminedbytherelativeweights.Forexample,if
anobservationisassignedaweightof2,thenthatobservationappearstwiceinthe
sampleforthecurrentbootstrapiteration.
Thesamplingweightscanbezero.Observationswithaweightofzeroarenotincluded
intheestimationprocessforthecurrentbootstrapiteration.
IfFractional Weightsisselected,thefollowingobservationsaretrue:
Thesamplingweightscanbenonintegers.Theinfluenceofanobservationis
determinedbytherelativeweights.
Thesamplingweightscannotbezero.Alloftheobservationsareincludedinthe
bootstrapprocess.
Inbothcases,thesumoftheweightsisn(thenumberofobservationsintheoriginaldata).
Split Selected Column Splitsastackedcolumnofbootstrapresultsintoseparatecolumns,one
foreachstatistic.Forexample,saythatthecolumnthatyourightclickedbeforechoosing
Bootstraphasthreestatisticsinit.Inadditiontothemainresultstablewithallthestatistics
stacked,anotherresultstableiscreatedwiththeresultsforthatcolumn,unstackedinto
separatecolumns.Foranexample,seeStackedResultsTableonpage 281.
Discard Stacked Table if Split Works (ApplicableonlyiftheSplit Selected Columnoptionis
282
Bootstrapping
Unstacked Results Table
Chapter 10
Basic Analysis
Notethefollowingaboutthestackedresultstable:
Thedatatablecolumnsusedintheanalysisappearinthetable.
Theresultstableincludeseveryelementintheplatformreportthatyouarebootstrapping.
Inthisexample,thereportistheParameterEstimatesreportfromaBivariatemodelinFit
YbyX.TheelementsareTerm,Bias,Estimate,Std Error,t Ratio,Prob>|t|.
TheBootIDcolumnidentifiestheiterationofthebootstrapprocess.Therowswhere
BootID=0correspondtotheoriginalestimates.ThoserowsaremarkedwithanXand
havetheexcludedrowstate.
TheBootIDcolumnidentifiestheiterationofthebootstrapprocess.TherowwhereBootID=
0correspondtotheoriginalestimates.ThatrowismarkedwithanXandhastheexcluded
rowstate.
Chapter 10
Basic Analysis
Bootstrapping
Analysis of Bootstrap Results
283
Figure 10.6BootstrapConfidenceLimitsReport
TheBootstrapConfidenceLimitsreportshowsthefollowingquantilesofthebootstrap
results:
Coverage percentageassociatedwiththequantile
Pct Lower
the100((1p)/2)thquantileofthebootstrapresults,wherepistheCoveragevalue
Pct Upper
the100(p+(1p)/2)thquantileofthebootstrapresults,wherepistheCoverage
value
Original Estimate
estimateofthestatisticusingtheoriginaldata
FormoreinformationaboutinterpretingtheBootstrapConfidenceLimitsreport,see
ExampleofBootstrappingonpage 279.
284
Bootstrapping
Analysis of Bootstrap Results
Chapter 10
Basic Analysis
Chapter 11
Tabulate
Create Summary Tables Interactively
UsetheTabulateplatformtointeractivelyconstructtablesofdescriptivestatistics.The
Tabulateplatformisaneasyandflexiblewaytopresentsummarydataintabularform.Tables
arebuiltfromgroupingcolumns,analysiscolumns,andstatisticskeywords.
Figure 11.1TabulateExamples
Contents
ExampleoftheTabulatePlatform ................................................. 287
LaunchtheTabulatePlatform ..................................................... 290
UsetheDialog............................................................... 292
AddStatistics ................................................................ 294
TheTabulateOutput............................................................. 296
AnalysisColumns ............................................................ 298
GroupingColumns ........................................................... 298
ColumnandRowTables ...................................................... 299
EditTables .................................................................. 300
TabulatePlatformOptions........................................................ 300
ShowTestBuildPanel ........................................................ 301
RightClickMenuforColumns................................................. 302
AdditionalExamplesoftheTabulatePlatform ...................................... 302
ExampleofCombiningColumnsintoaSingleTable .............................. 306
ExampleUsingaPageColumn................................................. 308
Chapter 11
Basic Analysis
Tabulate
Example of the Tabulate Platform
287
1. OpentheBig Class.jmpsampledatatable.
2. SelectAnalyze > Tabulate.
Sinceheightisthevariableyouareexamining,youwantittoappearatthetopofthetable.
3. ClickheightanddragitintotheDropzoneforcolumns.
Figure 11.3HeightVariableAdded
288
Tabulate
Example of the Tabulate Platform
Youwantthestatisticsbysex,andyouwantsextoappearontheside.
4. Clicksexanddragitintotheblankcellnexttothenumber2502.0.
Figure 11.4SexVariableAdded
Insteadofthesum,youwantittoshowthemean.
5. ClickMeananddragitontopofSum.
Chapter 11
Basic Analysis
Chapter 11
Basic Analysis
Tabulate
Example of the Tabulate Platform
289
Figure 11.5MeanStatisticAdded
Youalsowanttoseethecombinedmeanformalesandfemales.
6. ClickAllanddragitontopofsex.Or,youcansimplyselecttheAdd Aggregate Statistics
checkbox.
290
Tabulate
Example of the Tabulate Platform
Chapter 11
Basic Analysis
Figure 11.6AllStatisticAdded
7. (Optional)ClickDone.
Thecompletedtableshowsthemeanheightforfemales,males,andthecombinedmean
heightforboth.
Chapter 11
Basic Analysis
Tabulate
Example of the Tabulate Platform
291
Figure 11.7TheTabulateInteractiveTable
Interactive table/dialog
incorporateit.SeeAddStatisticsonpage 294.
Drop zone for columns Draganddropcolumnsorstatisticsheretocreatecolumns.
Note: IfthedatatablecontainscolumnswithnamesequaltothoseintheStatisticsoptions,be
suretodraganddropthecolumnnamefromthecolumnlist;otherwise,JMPmaysubstitute
thestatisticofthesamenameinthetable.
Drop zone for rows Draganddropcolumnsorstatisticsheretocreaterows.
292
Tabulate
Example of the Tabulate Platform
Resulting cells
Chapter 11
Basic Analysis
Showstheresultingcellsbasedonthecolumnsorstatisticsthatyoudragand
drop.
Identifiesthedatatablecolumnwhosevaluesassignafrequencytoeachrow.This
optionisusefulwhenafrequencyisassignedtoeachrowinsummarizeddata.
Freq
Weight Identifiesthedatatablecolumnwhosevariablesassignweight(suchasimportance
orinfluence)tothedata.
Page Column Generatesseparatetablesforeachcategoryofanominalorordinalcolumn.
SeeExampleUsingaPageColumnonpage 308.
Include missing for grouping columns Createsaseparategroupformissingvaluesin
groupingcolumns.Whenunchecked,missingvaluesarenotincludedinthetable.Note
thatanymissingvaluecodesthatyouhavedefinedascolumnpropertiesaretakeninto
account.
Changestheorderofthetabletobeinascendingorder
ofthevaluesofthegroupingcolumns.
Addsaggregatestatisticsforallrowsandcolumns.
dropanalysisornonanalysis(forexample,grouping)columns.
Enablesyoutochangethenumericformatfordisplayingspecificstatistics.
SeeChangeNumericFormatsonpage 296.
Change Format
Enablesyoutospecifyauniformcustomscale.
Uniform plot scale (OnlyappearsifShow Chartisselectedfromtheredtrianglemenu.)
Deselectthisboxforeachcolumnofbarstousethescaledeterminedseparatelyfromthe
dataineachdisplayedcolumn.
Chapter 11
Basic Analysis
Tabulate
Example of the Tabulate Platform
293
Figure 11.8UsingtheDialog
Thedialogcontainsthefollowingoptions:
Include marginal statistics Aggregatessummaryinformationforcategoriesofagrouping
column.
For quantile statistics, enter value (%) Typethevalueatwhichthespecificpercentageofthe
argumentislessthanorequalto.Forexample,75%ofthedataislessthanthe75th
quantile.Thisappliestoallgroupingcolumns.
Statistics Onceyouveselectedacolumn,selectastandardstatistictoapplytothatcolumn.
SeeAddStatisticsonpage 294
Grouping (row labels) Selectthecolumntouseastherowlabel.
Grouping (column labels)
Selectthecolumntouseasthecolumnlabel.
294
Tabulate
Example of the Tabulate Platform
Chapter 11
Basic Analysis
Add Statistics
Tip: Youcanselectbothacolumnandastatisticatthesametimeanddragthemintothetable.
Tabulatesupportsalistofstandardstatistics.Thelistisdisplayedinthecontrolpanel.Youcan
draganykeywordfromthatlisttothetable,justlikeyoudowiththecolumns.Notethe
following:
Thestatisticsassociatedwitheachcellarecalculatedonvaluesoftheanalysiscolumns
fromallobservationsinthatcategory,asdefinedbythegroupingcolumns.
Alloftherequestedstatisticshavetoresideinthesamedimension,eitherintherowtable
orinthecolumntable.
Ifyoudragacontinuouscolumnintoadataarea,itistreatedasananalysiscolumn.
Tabulateusesthefollowingkeywords:
N
Providesthenumberofnonmissingvaluesinthecolumn.Thisisthedefaultstatistic
whenthereisnoanalysiscolumn.
Providesthearithmeticmeanofacolumnsvalues.Itisthesumofnonmissingvalues
(andifdefined,multipliedbytheweightvariable)dividedbytheSum Wgt.
Mean
Providesthesamplestandarddeviation,computedforthenonmissingvalues.Itis
thesquarerootofthesamplevariance.
Std Dev
Min
Providesthesmallestnonmissingvalueinacolumn.
Max
Providesthelargestnonmissingvalueinacolumn.
Range
ProvidesthedifferencebetweenMaxandMin.
Computesthepercentageoftotalofthewholepopulation.Thedenominatorused
inthecomputationisthetotalofalltheincludedobservations,andthenumeratoristhe
totalforthecategory.Ifthereisnoanalysiscolumn,the%ofTotalisthepercentageoftotal
ofcounts.Ifthereisananalysiscolumn,the%ofTotalisthepercentageofthetotalofthe
sumoftheanalysiscolumn.Thus,thedenominatoristhesumoftheanalysiscolumnover
alltheincludedobservations,andthenumeratoristhesumoftheanalysiscolumnforthat
category.Youcanrequestdifferentpercentagesbydraggingthekeywordintothetable.
% of Total
Droppingoneormoregroupingcolumnsfromthetabletothe%ofTotalheading
changesthedenominatordefinition.Forthis,Tabulateusesthesumofthesegrouping
columnsforthedenominator.
Togetthepercentageofthecolumntotal,dragallthegroupingcolumnsontherow
tableanddropthemontothe% of Totalheading(sameasColumn%).Similarly,toget
thepercentageoftherowtotal,dragallgroupingcolumnsonthecolumntableand
dropthemontothe% of Totalheading(sameasRow%).
N Missing
Providesthenumberofmissingvalues.
Chapter 11
Basic Analysis
Tabulate
Example of the Tabulate Platform
N Categories
295
Providesthenumberofdistinctcategories.
Providesthesumofallvaluesinthecolumn. Thisisthedefaultstatisticforanalysis
columnswhentherearenootherstatisticsforthetable.
Sum
Providesthesumofallweightvaluesinacolumn.Or,ifnocolumnisassignedthe
weightrole,Sum Wgtisthetotalnumberofnonmissingvalues.
Sum Wgt
Providesthesamplevariance,computedforthenonmissingvalues.Itisthesumof
squareddeviationsfromthemean,dividedbythenumberofnonmissingvaluesminus
one.
Variance
squarerootofN.Ifacolumnisassignedtheroleofweight,thenthedenominatoristhe
squarerootofthesumoftheweights.
CV
(CoefficientofVariation)Providesthemeasureofdispersion,whichisthestandard
deviationdividedbythemeanmultipliedbyonehundred.
Providesthe50thpercentile,whichisthevaluewherehalfthedataarebelowand
halfareaboveorequaltothe50thquantile(median).
Median
Interquartile Range
Providesthedifferencebetweenthe3rdquartileand1stquartile.
Providesthevalueatwhichthespecificpercentageoftheargumentislessthanor
equalto.Forexample,75%ofthedataislessthanthe75thquantile.Youcanrequest
differentquantilesbyclickinganddraggingtheQuantileskeywordintothetable,andthen
enteringthequantileintotheboxthatappears.
Quantiles
Providesthepercentofeachcellcounttoitscolumntotalifthereisnoanalysis
column.Ifthereisananalysiscolumn,theColumn%isthepercentofthecolumntotalof
thesumoftheanalysiscolumn.Fortableswithstatisticsonthetop,youcanaddColumn
%totableswithmultiplerowtables(stackedvertically).
Column %
Providesthepercentofeachcellcounttoitsrowtotalifthereisnoanalysiscolumn.
Ifthereisananalysiscolumn,theRow%isthepercentoftherowtotalofthesumofthe
analysiscolumn.Fortableswithstatisticsontheside,youcanaddRow%totableswith
multiplecolumntables(sidebysidetables).
Row %
All
Aggregatessummaryinformationforcategoriesofagroupingcolumn.
296
Tabulate
The Tabulate Output
Chapter 11
Basic Analysis
Chapter 11
Basic Analysis
Tabulate
The Tabulate Output
297
Figure 11.10TabulateOutput
Creatingatableinteractivelyisaniterativeprocess:
Clicktheitems(columnsorstatistics)fromtheappropriatelist,anddragthemintothe
dropzone(forrowsorcolumns).SeeEditTablesonpage 300,andColumnandRow
Tablesonpage 299.
Addtothetablebyrepeatingthedraganddropprocess.Thetableupdatestoreflectthe
latestaddition.Iftherearealreadycolumnheadingsorrowlabels,youcandecidewhere
theadditiongoesrelativetotheexistingitems.
Notethefollowingaboutclickinganddragging:
JMPusesthemodelingtypetodetermineacolumnsrole.Continuouscolumnsare
assumedtobeanalysiscolumns.SeeAnalysisColumnsonpage 298.Ordinalornominal
columnsareassumedtobegroupingcolumns.SeeGroupingColumnsonpage 298.
298
Tabulate
The Tabulate Output
Chapter 11
Basic Analysis
Whenyoudraganddropmultiplecolumnsintotheinitialtable:
Ifthecolumnsshareasetofcommonvalues,theyarecombinedintoasingletable.A
crosstabulationofthecolumnnamesandthecategoriesgatheredfromthesecolumnsis
generated.Eachcellisdefinedbyoneofthecolumnsandoneofthecategories.
Ifthecolumnsdonotsharecommonvalues,theyareputintoseparatetables.
Youcanalwayschangethedefaultactionbyrightclickingonacolumnandselecting
Combine TablesorSeparate Tables.Formoredetails,seeRightClickMenufor
Columnsonpage 302.
Tonestcolumns,createatablewiththefirstcolumn,andthendragtheadditionalcolumns
intothefirstcolumn.
Inaproperlycreatedtable,allgroupingcolumnsaretogether,allanalysiscolumnsare
together,andallstatisticsaretogether.Therefore,JMPdoesnotintersperseastatistics
keywordwithinalistofanalysiscolumns.JMPalsodoesnotinsertananalysiscolumn
withinalistofgroupingcolumns.
YoucandragcolumnsfromtheTablepanelinthedatatableontoaTabulatetableinstead
ofusingtheTabulateControlPanel.
Analysis Columns
Analysiscolumnsareanynumericcolumnsforwhichyouwanttocomputestatistics.They
arecontinuouscolumns.Tabulatecomputesstatisticsontheanalysiscolumnsforeach
categoryformedfromthegroupingcolumns.
Notethatalltheanalysiscolumnshavetoresideinthesamedimension,eitherintherow
tableorinthecolumntable.
Grouping Columns
Groupingcolumnsarecolumnsthatyouwanttousetoclassifyyourdataintocategoriesof
information.Theycanhavecharacter,integer,orevendecimalvalues,butthenumberof
uniquevaluesshouldbelimited.Groupingcolumnsareeithernominalorordinal.
Notethefollowing:
Ifgroupingcolumnsarenested,Tabulateconstructsdistinctcategoriesfromthe
hierarchicalnestingofthevaluesofthecolumns.Forexample,fromthegroupingcolumns
SexwithvaluesFandM,andthegroupingcolumnMaritalStatuswithvaluesMarried
andSingle,Tabulateconstructsfourdistinctcategories:FandMarried,FandSingle,M
andMarried,MandSingle.
Youcanspecifygroupingcolumnsforcolumntablesaswellasrowtables.Togetherthey
generatethecategoriesthatdefineeachtablecell.
Chapter 11
Basic Analysis
Tabulate
The Tabulate Output
299
Tabulatedoesnotincludeobservationswithamissingvalueforoneormoregrouping
columnsbydefault.YoucanincludethembycheckingtheInclude missing for grouping
columnsoption.
Tospecifycodesorvaluesthatshouldbetreatedasmissing,usetheMissingValueCodes
columnproperty.YoucanincludethesebycheckingtheInclude missing for grouping
columnsoption.FormoredetailsaboutMissingValueCodes,seetheUsingJMPbook.
300
Tabulate
Tabulate Platform Options
Chapter 11
Basic Analysis
Formultiplecolumntables,thelabelsonthesidearesharedacrossthecolumntables.Inthis
instance,countryandsexaresharedacrossthetables.Similarly,formultiplerowtables,the
headingsonthetoparesharedamongtherowtables.Inthisinstance,bothageandtypeare
sharedamongthetables.
Edit Tables
Thereareseveralwaystoedittheitemsthatyouaddtoatable.
Delete Items
Afteryouadditemstothetable,youcanremovetheminanyoneofthefollowingways:
Dragtheitemawayfromthetable.
Toremovethelastitem,clickUndo.
RightclickonanitemandselectDelete.
Displaysthesummarizeddataintabularform.
Chapter 11
Basic Analysis
Tabulate
Tabulate Platform Options
301
statistics.Thesimplebarchartenablesvisualcomparisonoftherelativemagnitudeofthe
summarystatistics.Bydefault,allcolumnsofbarssharethesamescale.Youcanhaveeach
columnofbarsusethescaledeterminedseparatelyfromthedataineachdisplayed
column,byclearingtheUniform plot scalecheckbox.Youcanspecifyauniformcustom
scaleusingtheChange Plot Scalebutton.Thechartsareeither0basedorcenteredon0.If
thedataareallnonnegative,orallnonpositive,thechartsbaselineisat0.Otherwise,the
chartsarecenteredon0.
Show Control Panel Displaysthecontrolpanelforfurtherinteraction.
Show Shading Displaysgrayshadingboxesinthetablewhentherearemultiplerows.
Show Tooltip Displaystipsthatappearwhenyoumovethemouseoverareasofthetable.
Show Test Build Panel Displaysthecontrolareathatletsyoucreateatestbuildusinga
randomsamplefromtheoriginaltable.Thisisparticularlyusefulwhenyouhavelarge
amountsofdata.SeeShowTestBuildPanelonpage 301.
Make Into Data Table Makesadatatablefromthereport.Thereisonedatatableforeachrow
table,becauselabelsofdifferentrowtablesmightnotbemappedtothesamestructure.
Displaysoptionsforsavingscripts,redoinganalyses,andviewingthedatatable.For
details,seetheUsingJMPbook.
Script
Note: ForadescriptionoftheoptionsintheSelectColumnsredtrianglemenu,seetheUsing
JMPbook.
302
Tabulate
Additional Examples of the Tabulate Platform
Chapter 11
Basic Analysis
3. ClickResample.
4. ToseethesampleddatainaJMPdatatable,clicktheTest Data Viewbutton.Whenyou
dismissthetestbuildpanel,Tabulateusesthefulldatatabletoregeneratethetablesas
designed.
(Appearsonlyforseparateornestedcolumns)
Combinesseparateornestedcolumns.SeeExampleofCombiningColumnsintoaSingle
Tableonpage 306.
column.
Nest Grouping Columns
Nestsgroupingcolumnsverticallyorhorizontally.
Chapter 11
Basic Analysis
Tabulate
Additional Examples of the Tabulate Platform
Figure 11.13TableShowingCountsofCarOwnership
1. OpentheCar Poll.jmpsampledatatable.
2. SelectAnalyze > Tabulate.
3. ClickcountryanddragitintotheDropzoneforrows.
4. Clicksizeanddragittotherightofthecountryheading.
Figure 11.14CountryandSizeAddedtotheTable
303
304
Tabulate
Additional Examples of the Tabulate Platform
Chapter 11
Basic Analysis
CreateaTableShowingStatistics
Supposethatyouwouldliketoseethemean(average)andthestandarddeviationoftheage
ofpeoplewhoowneachsizecar.YouwantthetabletolooklikeFigure 11.15.
Figure 11.15TableShowingMeanandStandardDeviationbyAge
1. StartfromFigure 11.14.Clickageanddragittotherightofthesizeheading.
2. ClickMeananddragitoverSum.
3. ClickStd DevanddragitbelowMean.
StdDevisplacedbelowMeaninthetable.DroppingStdDevaboveMeanplacesStdDev
aboveMeaninthetable.
Chapter 11
Basic Analysis
Tabulate
Additional Examples of the Tabulate Platform
Figure 11.16Age,Mean,andStdDevAddedtotheTable
RearrangetheTableContents
Supposethatyouwouldprefersizetobeontop,showingacrosstablayout.Youwantthe
tabletolooklikeFigure 11.17.
Figure 11.17SizeonTop
305
306
Tabulate
Additional Examples of the Tabulate Platform
Chapter 11
Basic Analysis
Torearrangethetablecontents,proceedasfollows:
1. StartfromFigure 11.16.Clickonthesizeheadinganddragittotherightofthetable
headings.SeeFigure 11.18.
Figure 11.18Movingsize
Chapter 11
Basic Analysis
Tabulate
Additional Examples of the Tabulate Platform
Figure 11.19AddingDemographicData
1. OpentheChildrens Popularity.jmpsampledatatable.
2. SelectAnalyze > Tabulate.
3. SelectGrades,Sports,Looks,andMoneyanddragthemintotheDropzoneforrows.
Figure 11.20ColumnsbyCategories
Noticethatasingle,combinedtableappears.
Tabulatethepercentageoftheonetofourratingsofeachcategory.
4. DragGenderintotheemptyheadingatleft.
307
308
Tabulate
Additional Examples of the Tabulate Platform
Chapter 11
Basic Analysis
5. Drag% of Totalabovethenumberedheadings.
6. DragAllbesidethenumber4.
Figure 11.21Gender,%ofTotal,andAllAddedtotheTable
Breakdownthetabulationfurtherbyaddingdemographicdata.
7. DragUrban/Ruralbelowthe% of Totalheading.
Figure 11.22Urban/RuralAddedtotheTable
Youcanseethatforboysinrural,suburban,andurbanareas,sportsarethemostimportant
factorforpopularity.Forgirlsinrural,suburban,andurbanareas,looksarethemost
importantfactorforpopularity.
Chapter 11
Basic Analysis
Tabulate
Additional Examples of the Tabulate Platform
309
Figure 11.23MeanHeightofStudentsbySex
Females
Males
1. OpentheBig Class.jmpsampledatatable.
2. SelectAnalyze > Tabulate.
Sinceheightisthevariableyouareexamining,youwantittoappearatthetopofthetable.
3. ClickheightanddragitintotheDropzoneforcolumns.
Youwantthestatisticsbyage,andyouwantagetoappearontheside.
4. Clickageanddragitintotheblankcellnexttothenumber2502.
5. ClicksexanddragitintoPage Column.
6. SelectFfromthePageColumnlisttoshowthemeanheightsforonlyfemales.
7. SelectMfromthePageColumnlisttoshowthemeanheightsforonlymales.Youcanalso
selectNone Selectedtoshowallvalues.
310
Tabulate
Additional Examples of the Tabulate Platform
Figure 11.24UsingaPageColumn
Chapter 11
Basic Analysis
Appendix A
References
Agresti,A.(1984),AnalysisofOrdinalCategoricalData,NewYork:JohnWileyandSons,Inc.
Agresti,A.(1990),CategoricalDataAnalysis,NewYork:JohnWileyandSons,Inc.
Agresti,A.,andCoull,B.(1998),ApproximateisBetterThanExactforIntervalEstimationof
BinomialProportions,TheAmericanStatistician,52,119126
Akaike,H.(1974),FactorAnalysisandAIC,Pschychometrika,52,317332.
Akaike,H.(1987),AnewLookattheStatisticalIdentificationModel,IEEETransactionson
AutomaticControl,19,716723.
AmericanSocietyforQualityStatisticsDivision(2004),GlossaryandTablesforStatisticalQuality
Control,FourthEdition,Milwaukee:QualityPress.
Bartlett,M.S.andD.G.Kendall(1946),TheStatisticalAnalysisofVariancesHeterogeneity
andtheLogarithmicTransformation,JRSSSuppl8,128138.
Bartlett,M.S.(1966),AnIntroductiontoStochasticProcesses,SecondEdition,Cambridge:
CambridgeUniversityPress.
Bissell,A.F.(1990),HowReliableisYourCapabilityIndex?,AppliedStatistics,30,331340.
Brown,M.B.andBenedetti,J.K.(1977),SamplingBehaviorofTestsforCorrelationin
TwoWayContingencyTables,JournaloftheAmericanStatisticalAssociation72,305315.
Brown,M.B.andForsythe,A.B.(1974a),TheSmallSampleBehaviorofSomeStatisticsWhich
TesttheEqualityofSeveralMeans,Technometrics16:1,129132.
Brown,M.B.andForsythe,A.B.(1974),RobusttestsfortheequalityofvariancesJournalof
theAmericanStatisticalAssociation,69,364367.
Chou,Y.M.,Owen,D.B.,andBorrego,S.A.(1990),Lowerconfidencelimitsonprocess
capabilityindices,JournalofQualityTechnology,22(3):223229.
Cleveland,W.S.(1979).RobustLocallyWeightedRegressionandSmoothingScatterplots.
JournaloftheAmericanStatisticalAssociation74(368):829836.
Cohen,J.(1960),Acoefficientofagreementfornominalscales,EducationPsychological
Measurement,20:3746.
Cochran,W.G.andCox,G.M.(1957),ExperimentalDesigns,SecondEdition,NewYork:John
WileyandSons.
Conover,W.J.(1972).AKolmogorovGoodnessoffitTestforDiscontinuousDistributions.
JournaloftheAmericanStatisticalAssociation67:591596.
Conover,W.J.(1980),PracticalNonparametricStatistics,NewYork:JohnWileyandSons,Inc.
312
References
Appendix A
Basic Analysis
DeLong,E.,Delong,D,andClarkePearson,D.L.(1988),ComparingtheAreasUnderTwoor
moreCorrelatedReceiverOperatingCharacteristicCurves:ANonparametricApproach,
Biometrics44,837845.
Devore,J.L.(1995),ProbabilityandStatisticsforEngineeringandtheSciences,DuxburyPress,
CA.
Dunn,O.J.(1964),MultipleComparisonsUsingRankSums,Technometrics6,241252.
Dunnett,C.W.(1955),Amultiplecomparisonprocedureforcomparingseveraltreatments
withacontrolJournaloftheAmericanStatisticalAssociation,50,10961121.
Dwass,M.(1955),ANoteonSimultaneousConfidenceIntervals,AnnalsofMathematical
Statistics26:146147.
Eubank,R.L.(1999),NonparametricRegressionandSplineSmoothing,SecondEdition,Boca
Raton,Florida:CRC.
Fisher,L.andVanNess,J.W.(1971),AdmissibleClusteringProcedures,Biometrika,58,
91104.
Fleiss,J.L.,CohenJ.,andEveritt,B.S.(1969),LargeSampleStandardErrorsofKappaand
WeightedKappa,PsychologicalBulletin,72:323327.
Friendly,M.(1991),MosaicDisplaysforMultiwayContingencyTables,NewYorkUniversity
DepartmentofPsychologyReports:195.
Goodman,L.A.andKruskal,W.H.(1979),MeasuresofAssociationforCrossClassification,New
York:SpringerVerlag(reprintofJASAarticles).
Gupta,S.S.(1965),OnSomeMultipleDecision(selectionandranking),Rules.,Technometrics7,
225245.
Hahn,G.J.andMeeker,W.Q.(1991)StatisticalIntervals:AGuideforPractitioners.NewYork:
Wiley.
Hajek,J.(1969),ACourseinNonparametricStatistics,SanFrancisco:HoldenDay.
Hartigan,J.A.andKleiner,B.(1981),MosaicsforContingencyTables,Proceedingsofthe13th
SymposiumontheInterfacebetweenComputerScienceandStatistics,Ed.Eddy,W.F.,NewYork:
SpringerVerlag,268273.
Hayter,A.J.(1984),AproofoftheconjecturethattheTukeyKramermultiplecomparisons
procedureisconservative,AnnalsofMathematicalStatistics,126175.
Hosmer,D.W.andLemeshow,S.(1989),AppliedLogisticRegression,NewYork:JohnWileyand
Sons.
HotDogs,(1986),ConsumerReports(June),364367.
Hsu,J.(1981),Simultaneousconfidenceintervalsforalldistancesfromthebest,Annalsof
Statistics,9,10261034.
Hsu,J.(1989),TutorialNotesonMultipleComparisons,AmericanStatisticalAssociation,
Washington,DC.
Hsu,J.C.(1996),MultipleComparisons:TheoryandMethods,ChapmanandHall.
Huber,PeterJ.(1973),RobustRegression:Asymptotics,Conjecture,andMonteCarlo,
AnnalsofStatistics,Volume1,Number5,799821.
Appendix A
Basic Analysis
References
313
Huber,P.J.andRonchetti,E.M.(2009),RobustStatistics,SecondEdition,Wiley.
Iman,R.L.(1974),UseofatstatisticasanApproximationtotheExactDistributionof
WilcoxonSignedRanksTestStatistic,CommunicationsinStatisticsSimulationand
Computation,795806.
Kendall,M.andStuart,A.(1979),TheAdvancedTheoryofStatistics,Volume2,NewYork:
MacmillanPublishingCompany,Inc.
Kramer,C.Y.(1956),Extensionofmultiplerangeteststogroupmeanswithunequalnumbers
ofreplications,Biometrics,12,309310.
Levene,H.(1960),RobusttestsfortheequalityofvariancesInI.Olkin(ed),Contributionsto
probabilityandstatistics,StanfordUniv.Press.
McCullagh,P.andNelder,J.A.(1983),GeneralizedLinearModels,London:ChapmanandHall
Ltd.
Meeker,W.Q.andEscobar,L.A.(1998),StatisticalMethodsforReliabilityData,pp.6062,New
York:JohnWileyandSons.
Nelson,P.R.,Wludyka,P.S.,andCopeland,K.A.F.(2005),TheAnalysisofMeans:AGraphical
MethodforComparingMeans,Rates,andProportions,Philadelphia:SocietyforIndustrialand
AppliedMathematics.
Neter,J.,Wasserman,W.andKutner,M.H.(1990),AppliedLinearStatisticalModels,Third
Edition,Boston:Irwin,Inc.
OBrien,R.G.(1979),AgeneralANOVAmethodforrobusttestsofadditivemodelsfor
variances,JournaloftheAmericanStatisticalAssociation,74,877880.
OBrien,R.,andLohr,V.(1984),PowerAnalysisForLinearModels:TheTimeHasCome,
ProceedingsoftheNinthAnnualSASUsersGroupInternationalConference,840846.
Odeh,R.E.andOwen,D.B.(1980)TablesforNormalToleranceLimits,SamplingPlans,and
Screening.NewYork:MarcelDekker,Inc.
Olejnik,S.F.andAlgina,J.(1987),TypeIErrorRatesandPowerEstimatesofSelected
ParametricandNonparametricTestsofScale,JournalofEducationalStatistics12,4561.
Reinsch,C.H.(1967),SmoothingbySplineFunctions,NumerischeMathematik,10,177183.
Rousseuw,P.J.andLeroy,A.M.(1987),RobustRegressionandOutlierDetection,NewYork:John
WileyandSons.
Slifker,J.F.andShapiro,S.S.(1980).Technometrics,22,239246.
Snedecor,G.W.andCochran,W.G.(1980),StatisticalMethods,7thedition,Ames,Iowa:Iowa
StateUniversityPress.
Somers,R.H.(1962),ANewAsymmetricMeasureofAssociationforOrdinalVariables,
AmericanSociologicalReview,27,799811.
Tan,CharlesY.,andIglewicz,Boris(1999),MeasurementmethodsComparisonsandLinear
StatisticalRelationship,Technometrics,41:3,192201.
Tamhane,A.C.andDunlop,D.D.(2000)StatisticsandDataAnalysis.PrenticeHall.
Tukey,J.(1953),Aproblemofmultiplecomparisons,Dittoedmanuscriptof396pages,
PrincetonUniversity.
314
References
Appendix A
Basic Analysis
Welch,B.L.(1951),Onthecomparisonofseveralmeanvalues:analternativeapproach,
Biometrika38,330336.
Wludyka,P.S.andNelson,P.R.(1997),AnAnalysisofMeansTypeTestforVariancesFrom
NormalPopulations,Technometrics,39:3,274285.
Index
Basic Analysis
Numerics
3DCorrespondenceAnalysisoption 221231
5%Contoursoption 130
A
AgreementStatisticoption 213,222223,240
Allfits 6388
AllGraphsoption 153154
AllPairs,TukeyHSDtest 166,169189
AnalysisofMeans
charts 164184
forProportionsoption 212,220
options 165
AnalysisofMeansMethods 150,163165
AnalysisofVariance
SeealsoOneway
report 114116,159160
ANOM 163
SeealsoAnalysisofMeans
forVariances 164,185
forVarianceswithLevene(ADM) 164
withTransformedRanks 164
B
Bartletttest 177
BetaBinomialfit 8990
Betafit 84
Binomialfit 89
BivariateNormalEllipsemenu 127130
Bivariateplatform 101
example 103
launching 103
options 105108,127131
reportwindow 104
Blockbutton 98
BlockMeansreport 161
bootstrapping 277283
Boxplots 153
BrownForsythetest 176
C
CapabilityAnalysisoption 5961,7981
CDFplot 152153,182
CellLabelingoption 215
ChangeFormatoption 292
ChangeItemLabel 300
ChangePlotScaleoption 292
closingJMPStarterwindow 27
CochranArmitageTrendTest 213,226
CochranMantelHaenszeltest 212,222
CoefficientofVariation 295
ColorbyDensityQuantileoption 130
ColorThemeoption 216
columntables 299
CompareDensitiesoption 183
CompareMeansoptions 151,166171
comparisoncircles 154,167168,202203
CompositionofDensitiesoption 183
ConfidCurvesFitoption 128
ConfidCurvesIndivoption 128
ConfidQuantileoption 170
ConfidShadedFitoption 129
ConfidShadedIndivoption 129
ConfidenceIntervaloptions 47,57
ConfidenceLimitsoption 66
ConnectMeansoption 154155
ConnectingLettersReport 170
Contingencyplatform 207
example 209210
launching 210
options 212213
reportwindow 210
ContingencyTable 212,216218
316
ContinuousFitoptions 6288
ContourFilloption 130
ContourLinesoption 130
Correlationreport 124
CorrespondenceAnalysisoption 212,220233
CountAxisoption 46,49
CovarianceofEstimatesreport 253
D
Densitiesoptions 152,183
DensityAxisoption 46,49
DensityCurveoption 64
DensityEllipseoption 107,123124
densityestimationfits 108
DetailedComparisonsReport 171
Detailsreport 221
DiagnosticPlotoption 6466
DifferenceMatrixoption 170
DiscreteFitoptions 6390
DisplayOptions 153155,165,220
forcategoricalvariables 45
forcontinuousvariables 4748
Distributionplatform 31
categoricalvariablesin 33
continuousvariablesin 33
example 3335
launching 35
options 4458
reportwindow 3638
DunnAllPairsforJointRankstest 173
DunnWithControlforJointRankstest 173
E
EachPair,Studentsttest 166,168187
EqualVariancesoption 122
EquivalenceTestoption 151,179194
ExactTest 213,226
SeealsoFishersExactTest
options 172
Exponentialfit 83
ExtremeValuefit 83
F
FishersExactTest 220
Index
Basic Analysis
FitEachValue
command 107,121
menu 127129
report 121
FitLinecommand 107,110117,139
FitMean
command 107109
menu 127129
report 109
FitOrthogonalcommand 107,122126,139
FitPolynomialcommand 107,110141
FitRobustcommand 107
FitSpecialcommand 107,117119
FitSplinecommand 107,119120,139
FitXtoYoption 122
FitYbyXplatform 95
FixParametersoption 64
Frequencies
option 45
report 41
G
Gammafit 84
GammaPoissonfit 8889
GLogfit 87
GoodnessofFit
option 64
tests 6692
GradientBetweenEndsoption 216
GradientBetweenSelectedPointsoption 216
GrandMeanoption 154
GraphinVarianceScaleoption 165166
GroupByoption 106,127
H
HistogramOptions 48
histograms 3868
bordersoption 106,126
coloroption 46,49
creatingsubsets 38
highlightingdata 38,40
optioninOneway 155
optionsforcategoricalvariables 46
optionsforcontinuousvariables 4849
rescaling 38
317
Index
Basic Analysis
resizing 3839
selectingdata 40
specifyingdata 39
HorizontalLayoutoption 45,48
WholeModelTestreport 250252
logisticregression 243,245
LogNormalfit 82
LSDThresholdMatrixoption 170
Includemissingforgroupingcolumns
option 292
InversePredictionoption 253
Iterationsreport 250
Macrosoption 216
MakeIntoDataTableoption 301
MannWhitneytest,seeWilcoxonTest
MatchedPairsplatform 265,267
examples 267268,272274
launching 268
multipleYcolumns 269
options 272
reportwindow 270271
statisticaldetails 274
Tukeymeandifferenceplot 270271
MatchingColumnoption 152,183184
MatchingDottedLinesoption 154
MatchingLinesoption 154,183
Max(summarystatistics) 294
MeanCILinesoption 154
MeanDiamondsoption 153,161162
MeanErrorBarsoption 154,162
MeanLinesoption 154,162
MeanofMeansoption 154
MeansandStdDevoption 150
MeansforOnewayAnovareport 161
Means/Anovaoption 150,156
Means/Anova/Pooledtoption 150,156
MeasuresofAssociationoption 213,224225
Median(summarystatistics) 295
MedianReferenceLineoption 66
MedianTest 172
menutips 26
MeshPlotoption 130
Min(summarystatistics) 294
ModelClusteringoption 130
MosaicPlot
inContingency 212,214216
optioninDistribution 46
J
jitter 257
JMPStarter 27
JMPtutorials 25
Johnsonfits 8687
K
KernelControloption 130
KernelSmoothercommand 107
KolmogorovSmirnovTest 172
KruskalWallistest,seeWilcoxonTest
L
LackofFitreport 112114
Levenetest 176
LiftCurveoption 254
LineColoroption 128
LineofFitoption 66,128,182
LineStyleoption 128
LineWidthoption 128
LinearFit
menu 127130
report 110141
Logisticplatform 243
Seealsologisticregression
CovarianceofEstimatesreport 253
examples 246247,256260
Iterationsreport 250
launching 248
logisticplot 249,253
options 253256
ParameterEstimatesreport 252
reportwindow 248253
N
NandNMissing(summarystatistics) 294
NCategories(summarystatistics) 295
318
NonparDensitycommand 107,125126
Nonparametric
options 151
tests 171
NonparametricBivariateDensityreport 125
NonparametricMultipleComparisons
tests 173
Normalfit 82
NormalMixturesfits 85
NormalQuantilePlot 4951,152,182
O
OBrientest 176
OddsRatiosoption 213,253
Onewayplatform 143,145
example 145147
launching 147
options 148155
plot 147
opening
JMPStarterwindow 27
Orderbycountofgroupingcolumns
option 292
OrderByoption 46
OrderedDifferencesReport 171
OrthogonalFitRatiomenu 127129
OrthogonalRegressionreport 122
OutlierBoxPlot 5152
P
pairedttest 265,267
ParameterEstimatesreport 116141,252
PlotActualbyQuantileoption 182
PlotDifbyMeanoption 272
PlotDifbyRowoption 272
PlotOptionsmenuinLogistic 254
PlotQuantilebyActualoption 182
PlotResidualsoption 129
PointsJitteredoption 154
Pointsoption 153
PointsSpreadoption 154
Poissonfit 88
PolynomialFitDegreemenu 127130
PolynomialFitreport 110141
Poweroption 151,180182,204
Index
Basic Analysis
PredictionIntervaloption 5871,77
ProbAxisoption 46,49
ProportionofDensitiesoption 183
Q
QuantileBoxPlot 5253
QuantileDensityContoursmenu 127131
Quantiles
option 47,64,74,150,155156
report 41
R
ranges,summarystatistics 294
ReferenceFrameoption 272
regressionfitsforBivariate 108
RelativeRiskoption 213,223
RemoveColumnLabel 300
RemoveFitoption 65,129
Reportoption 128
RestoreColumnLabel 300
ReverttoOldColorsoption 216
ROCCurveoption 254
Rotateoption 66
rowprofile 221
rowtables 299
S
samplesize,intabulate 301
SaveCoefficientsoption 130
SaveColorstoColumnoption 216
SavecommandsinDistribution 47,5758
SaveDensityFormulacommand 65
SaveDensityGridoption 131
SaveDensityQuantileoption 130
SaveFittedQuantilescommand 65
SaveforAdobeFlashplatform(.SWF)
option 45
Saveoptions
inOneway 153
SavePredictedsoption 129
SaveProbabilityFormulaoption 254
SaveResidualsoption 129
SaveSpecLimitscommand 65
SaveValueOrderingoption 221
319
Index
Basic Analysis
SavedTransformedcommand 65
SelectColorsforValueswindow 215
SelectPointsbyDensityoption 130
SelectPointsInsideoption 130
SelectPointsOutsideoption 130
SeparateBarsoption 46
SetAlphaLeveloption
forBivariate 130
forContingency 212,220
forMatchedPairs 272
forOneway 151,165
SetBinWidthoption 49
SetColorsoption 215
SetSpecLimitsforKSigmaoption 64
ShadedContouroption 130
Shadowgramoption 48
ShowCenterLineoption 165
ShowChartoption 301
ShowControlPaneloption 301
ShowCountsoption 46,49
ShowDecisionLimitShadingoption 165
ShowDecisionLimitsoption 165
ShowPercentsoption 46,49
ShowPointsoption
forBivariate 106
ShowShadingoption 301
ShowSummaryReportoption 165,220
ShowTableoption 300
ShowTestBuildPaneloption 301
Showtooltipoption 300
SignTest 272
SmoothCurvefit 85
SmoothingSplineFit
menu 127
report 119
SpecLimitsoption 64,67
SpecifiedVarianceRatiooption 122
SpecifyTransformationorConstraint
window 118,132
Stackoption 45
StandardDeviation 294
StandardError 295
StdDevandStdErr(summarystatistics) 294
StdDevLinesoption 154,162
StdErrorBarsoption 46,49
SteelWithControltest 173
SteelDwassAllPairstest 173
StemandLeafplot 53
Studentsttest 166,168187
Subsetoption 38
SummaryofFitreport 111112,157
SummaryStatistics
customize 48
options 44
report 41,48
SwitchResponseLevelforProportion
option 220
T
ttest
option 150
report 158159
Tabulate 285310
TestMeanoption 5556
TestProbabilitiesoption 4670
TestStdDevoption 56
Tests
optioninContingency 212
report 218219
ToleranceIntervaloption 5972,7879
tooltips 26
TransformedFit
menu 127129
report 118
Tukeymeandifferenceplot 270271
tutorials 25
TwoSampleTestforProportions 213,224
U
UnequalVariances 151,176177
Uniformplotscaleoption 292
UniformScalingoption 44
UnivariateVariances,PrinCompoption 122
V
vanderWaerdenTest 172
Variance(summarystatistics) 295
Verticaloption 46,48
320
W-Z
Weibullfits 83
Welchstest 177
WholeModelTestreport 250252
Wilcoxon
EachPairtest 173
SignedRanktest 272
Test 171
WithBest,HsuMCBtest 166,169190
WithControl,Dunnettstest 167,170
XAxisproportionaloption 154,161162
X,ContinuousRegressorbutton 98
X,Groupingbutton 98
X,GroupingCategorybutton 99
X,Regressorbutton 98
Y,CategoricalResponsebutton 98
Y,ResponseCategorybutton 99
Index
Basic Analysis