Académique Documents
Professionnel Documents
Culture Documents
Lab Exercise 1:
First date with JMP Pro
Learning Outcome
1
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
Versions history
Version Description of changes Date Editor
1.0.0 First working draft 20/12/2013 Kam
1.2.0 Final proof-reading 02/01/2013 Kam
1.3.0 Minor editing and replacing the data 23/08/2013 Kam
1.4.0 Revise base on JMP Pro 11 05/01/2014 Kam
1.5.0 Minor editing and replacing the data 20/08/2014 Kam
1.6.0 Replaced the hands-on dataset and minor revision 9/1/2014 Kam
1.7.0 Revised based on JMP Pro 12 2/9/2015 Kam
1.8.0 Replaced the hands-on dataset and minor revision 14/01/2016 Kam
1.9.0 Minor revision 12/08/2016 Kam
2.0.0 Revised based on JMP Pro 13 and change dataset 12/01/2017 Kam
2
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
Contents
1.0 Overview ..................................................................................................................................... 4
1.1 Getting Start with JMP ............................................................................................................ 4
1.1.1 Launching JMP and getting started................................................................................. 4
1.1.3 Loading data into JMP ..................................................................................................... 5
1.2 Data Preparation ................................................................................................................... 10
1.2.1 Using Tabulate functions to check data accuracy ......................................................... 10
1.2.2 Correct the error using recode function ....................................................................... 13
1.3 Reformatting data type ......................................................................................................... 15
1.4 Using table functions to derive summary statistics .............................................................. 20
1.5 Working with interactive tabular analysis ............................................................................ 23
3
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
1.0 Overview
This exercise aims to familiarise you with the basic components of JMP. You will also learn
how to use the table manipulation and tabulation functions of JMP to perform data import,
data cleaning, data recoding, data transformation and interactive tabulation analysis.
When JMP is launched, you will see two windows, namely: the Tip of the Day window and
the JMP Home window.
The Tip of the Day window gives useful hints on using JMP. The tip displayed is randomly
selected and changes every time JMP program is launched.
The JMP Home Window menus, across the top, can also be used to perform JMP functions.
The JMP toolbar, located beneath the Menu bar, provides many shortcuts and useful tools.
4
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
At the Tip of the Day window, click on the Close button to close the window.
In this section, you will learn how to import an existing data file in Comma-Separated Values
(http://en.wikipedia.org/wiki/Comma-separated_values) format into JMP as JMP’s Data
Table. The file name is REALIS2016.csv (Note: assume that it is found in \ISSS602\lab01
folder). It provides private property transaction records from 1 January 2016 to 31
December 2016.
5
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
At the Files of type:, select Text File (*.txt, *.csv, *.dat, *.tsv).
For Open as, click on the radio button in front of Data with Preview.
6
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
The Import dialog window appears. You will use this interface to specify the characteristics
of the data.
Your screen should look similar to the figure below. This interface allows you to specify the
data types such as numerical or character.
7
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
File Name
Tables Panel
Columns Pane
Row Pane
The Columns Pane shows that there are a total of 20 variables in the data table. The
categorical variables (11) are indicated by the icon and the continuous variables (9) are
indicated by icon.
The Row Pane shows that this data table contains a total of 20,266 records.
DIY: With reference to the lab exercise note, try to understand the
various components such as Table Panel, Column Panel and Row
Panel of JMP data table.
8
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
For Save as type:, select JMP Data Table (*.jmp) from the drop-down list.
9
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
Notice that the name of the data table located at the upper right hand corner of the data
table has been changed to REALIS2016.
Best Practice: Once data is imported into JMP, you should examine the Data Table carefully
for accuracy and completeness. Accuracy refers to the degree that data types in the
imported dataset matches the original dataset. Completeness refers to the degree that the
number of records and fields in the imported dataset matches the original dataset.
In this section you will learn how to use the Tabulate Functions, an interactive tabulating
environment of JMP, to detect inconsistent records in a data table.
10
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
11
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
A frequency table showing the count of records in each Planning Area appears.
Notice that there are two Yishun planning areas. One is in upper case and the other one is
in lower case. This will affect subsequent analysis if it is not been corrected.
12
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
In this section, you will learn how to correct the error identified in section 1.2.1 using the
Recode function of JMP.
At the REALIS2016 Data Table, click on the Planning Area field name.
13
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
Notice that JMP replaced all records with the word YISHUN under the Planning Area field to
Yishun.
14
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
JMP is very picky as to the data type of the variables that are going to be analysed. For
certain operations, the data must be continuous and others categorical (i.e. nominal or
ordinal). For example, the Postal District, Postal Sector and Postal Code variables should be
in categorical data format instead of continuous data format. The Sale Date field should be
in month/year format.
In this section, you will learn how to change the above variables into the appropriate data
types.
First, you will learn how to change the data type of the Sale Date field.
15
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
The dialog box containing the Column Info of Sale Date appears.
At the Format field, select Date, then m/y from the drop-down list.
16
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
Notice that the records under the Sale Date field have changed to month/year format (i.e.
01/2014).
Next, you are going to change the data type of Postal District, Postal Sector and Postal Code
to categorical.
Press on the Ctrl button of the keyboard then click on Postal District, Postal Sector
and Postal Code fields.
Right click and select Column Info from the context menu.
17
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
18
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
At the Data Type field of Postal District, select Character from the drop-down list.
Notice that JMP will automatically change the Modeling Type from Continuous to Nominal.
DIY: Repeat the same steps to change the data type of Postal Sector and
Postal Code.
Notice that the symbols in front of Postal District, Postal Sector and Postal Code in the
Column Pane are now represented by red histogram symbols.
19
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
In this section, you will learn how to construct a summary table using the Tables functions
of JMP.
First, you will use the static Table -> Summary function of JMP.
20
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
Click on No. of Units and select Sum from the Statistics drop-down list.
21
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
Next,
Click on Area (sq m) and select Mean from the Statistics drop-down list.
Click on Transacted Price and select Mean from the Statistics drop-down list.
Click on Transacted Price and select Median from the Statistics drop-down list.
Click on Unit Price (psm) and select Mean from the Statistics drop-down list.
Click on Unit Price (psm) and select Median from the Statistics drop-down list.
22
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
Now you will learn how to construct summary table interactively using the Tabulate
function of JMP.
At the menu bar of REALIS2016 data table window, select Analyze -> Tabulate.
First, you will create a summary table of numbers of properties transacted by property type.
Next, you will compute the mean and median unit selling price of each property type.
Select New Column Table with Analysis Columns from the context menu.
23
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
The beauty of Tabulate is that it is highly interactive. Users can change the table structure
easily by dragging and dropping the variables to the appropriate row or column dimensions.
For example, you can create a cross-table showing the mean transaction price by Property
Type and Planning Region.
24
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
Drag-and-drop it on to the Drop zone for columns dimension. Place it on top of the
Unit Price ($ pcf).
25
ISSS602 Data Analytics Lab
Lab Exercise 1: First date with JMP Pro
26