Vous êtes sur la page 1sur 108

ONTAP Performance Analysis

Exercise Guide
Content Version 1
NETAPP UNIVERSITY

ONTAP Performance Analysis

Exercise Guide
Course ID: STRSW-ILT-PERFCDOT-REV01
Catalog Number: STRSW-ILT-PERFCDOT-REV01-EG

NetApp University - Do Not Distribute


ATTENTION
The information contained in this course is intended only for training. This course contains information and activities that,
while beneficial for the purposes of training in a closed, non-production environment, can result in downtime or other severe
consequences in a production environment. This course material is not a technical reference and should not, under any
circumstances, be used in production environments. To obtain reference materials, refer to the NetApp product
documentation that is located at http://now.netapp.com/.

COPYRIGHT
© 2017 NetApp, Inc. All rights reserved. Printed in the U.S.A. Specifications subject to change without notice.
No part of this document covered by copyright may be reproduced in any form or by any means—graphic, electronic, or
mechanical, including photocopying, recording, taping, or storage in an electronic retrieval system—without prior written
permission of NetApp, Inc.

U.S. GOVERNMENT RIGHTS


Commercial Computer Software. Government users are subject to the NetApp, Inc. standard license agreement and
applicable provisions of the FAR and its supplements.

TRADEMARK INFORMATION
NETAPP, the NETAPP logo, and the marks listed at http://www.netapp.com/TM are trademarks of NetApp, Inc. Other
company and product names may be trademarks of their respective owners.

E-2 ONTAP Performance Analysis: Welcome

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


TABLE OF CONTENTS
WELCOME..................................................................................................................................................... E-1
MODULE 1: HOW A NETAPP STORAGE SYSTEM WORKS................................................................... E1-1
MODULE 2: PERFORMANCE ANALYSIS TOOLS ................................................................................... E2-1
MODULE 3: ONCOMMAND PERFORMANCE MANAGER ....................................................................... E3-1
MODULE 4: CPU AND MEMORY PERFORMANCE ................................................................................. E4-1
MODULE 5: WAFL PERFORMANCE MONITORING AND ANALYSIS .................................................... E5-1
MODULE 6: DISK I/O PERFORMANCE ..................................................................................................... E6-1
MODULE 7: FLASH CACHE AND FLASH POOL PERFORMANCE ........................................................ E7-1
MODULE 8: CLUSTER INTERCONNECT PERFORMANCE .................................................................... E8-1
MODULE 9: STORAGE QOS...................................................................................................................... E9-1
MODULE 10: NETWORK-ATTACHED STORAGE PERFORMANCE .................................................... E10-1
MODULE 11: STORAGE AREA NETWORK PERFORMANCE .............................................................. E11-1

E-3 ONTAP Performance Analysis: Welcome

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Getting Started
Study Aid Icons
In your exercises, you might see one or more of the following icons.
Warning
If you misconfigure a step that is marked with this icon, later steps might not work
properly. Check the step carefully before you move forward.
Attention
Review this step or comment carefully to save time, learn a best practice, or avoid errors.
Information
A comment that is labeled with this icon provides information about the topic or
procedure.
References
A comment that is labeled with this icon identifies reference material that provides more
information.

Exercise 0: Checking the Lab Equipment


In this exercise, you familiarize yourself with your equipment, synchronize system time, assign a Network
Time Protocol (NTP) time server, and ensure that licenses are installed.

Objectives
This exercise focuses on enabling you to do the following:
 Ensure connectivity to an ONTAP cluster
 Synchronize system time between the ONTAP clusters and the Windows domain controller
 Assign an NTP server IP address for the ONTAP clusters
 Ensure that required licenses are installed on the ONTAP clusters

Exercise Equipment
Your lab contains the following virtual machines:
 One Windows Server 2012 R2 system
 One CentOS Linux 6.5 Server system
 One ONTAP 9.1 2-node cluster (svl-nau)
 One OnCommand Unified Manager system
 One OnCommand Performance Manager system

When you use the connection information that your instructor assigned to you, you are first connected
through a Remote Desktop Connection to a Windows Server 2012 system. From this Windows desktop, you
connect to the other servers in your exercise environment.

E-4 ONTAP Performance Analysis: Welcome

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Windows Server 2012 R2
Laptop or Domain Controller ONTAP 9.1 Cluster
Classroom svl-nau
Desktop Remote
Desktop

svl-nau-01 svl-nau-02
(node1) (node2)

Performance Manager Unified Manager CentOS 6.5

OPM UM

opm ocum linux1

Machine Host Name IP Address User Name Password

Windows Server 2012 w2k12 192.168.0.11 LEARN\Administrator Netapp123


Linux CentOS Server CentOS64 192.168.0.10 root Netapp123
OnCommand Unified
ocum 192.168.0.15 admin Netapp123
Manager
OnCommand
opm 192.168.0.16 admin Netapp123
Performance Manager
ONTAP Cluster Mgmt
svl-nau 192.168.0.50 admin Netapp123
LIF
ONTAP Cluster Node 1 svl-nau-01 192.168.0.91 admin Netapp123
ONTAP Cluster Node 2 svl-nau-02 192.168.0.92 admin Netapp123
SMB Server SVM_white 192.168.0.140 vsadmin Netapp123
NFS Server SVM_teal 192.168.0.141 vsadmin Netapp123
SVM_red 192.168.0.51 vsadmin Netapp123
iSCSI Servers SVM_orange 192.168.0.52 vsadmin Netapp123
SVM_yellow 192.168.0.53 vsadmin Netapp123

E-5 ONTAP Performance Analysis: Welcome

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Task 1: Ensure connectivity to your ONTAP Cluster
In this task, you familiarize yourself with the Windows Server 2012 desktop. You ensure connectivity to the
ONTAP cluster and verify the health of the ONTAP cluster.

Step Action

1-1 Ensure that you see the Start page of your assigned Windows Server 2012 system.

1-2 On the Windows Server 2012 Start page, scroll down and click the Desktop tile.

1-3 Verify that you see the desktop and that it contains the shortcut to the PuTTY program.

1-4 To connect to the ONTAP cluster web UI, you browse to the System Manager URL,
which is in ONTAP 9.1.
To connect to the CLI of the ONTAP cluster, you use PuTTY. PuTTY is a UI for the
Telnet and Secure Shell (SSH) protocols.

E-6 ONTAP Performance Analysis: Welcome

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-5 Double-click the PuTTY shortcut.

1-6 In the PuTTY Configuration dialog box, verify that the svl-nau-mgmt saved session is listed,
and then double-click svl-nau-mgmt.

1-7 1. You can also connect to the ONTAP cluster CLI by connecting to any node in
the cluster: svl-nau-01 (node 1) or svl-nau-02 (node 2).

E-7 ONTAP Performance Analysis: Welcome

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-8 In the PuTTY Security Alert message, click Yes.


You receive an alert message the first time that you connect.

1-9 At the ONTAP cluster login prompt, provide the svl-nau credentials:
 Log in as: admin
 Password: Netapp123

The ONTAP cluster CLI prompt and cursor appear.

1-10 If you have difficulty logging in to the ONTAP cluster CLI, refer to the following
table. Ensure that you are using the correct user name and password in the correct
case (both are case-sensitive).
Host
System Name IP Address User Name Password
ONTAP cluster admin (case
svl-nau 192.168.0.50 Netapp123
management LIF sensitive)

E-8 ONTAP Performance Analysis: Welcome

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-11 Verify that both nodes of the ONTAP cluster are healthy and eligible:
cluster show

1-12 If the health or eligibility of either node is listed as false, alert your instructor.

E-9 ONTAP Performance Analysis: Welcome

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Task 2: Verify that required license codes are installed
Many of the advanced features of ONTAP software require licenses. In later exercises, you use several
licensed features of ONTAP software. In this task, you ensure that the necessary licenses were preinstalled.
Your instructor will provide you with license codes for any software that you need to install.

Step Action

2-1 In the svl-nau-mgmt CLI, enter the following command:


license show

2-2 Verify that the required license codes (that is, codes for the licenses that enable the features of
ONTAP that are required for later exercises) are installed.

2-3 If any license is not installed, contact NetApp University Support.

E-10 ONTAP Performance Analysis: Welcome

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Task 3: Set the ONTAP command line system timeout value (optional)
In this optional task, you disable the auto-logout feature by setting the ONTAP command-line system timeout
value to zero.
Step Action

3-1 Open a PuTTY session with the NetApp cluster.

3-2 Check the current system timeout value.


system timeout show

3-3 Set the system timeout to 0 (no timeout).


system timeout modify -timeout 0

End of Exercise

E-11 ONTAP Performance Analysis: Welcome

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Module 1: How a NetApp Storage System Works
Exercise 1: Identifying Cluster Components
Before you can troubleshoot a performance issue, you must understand the working environment of the
system. In this exercise, you log into a NetApp cluster and identify the components and configuration of the
cluster.

Objectives
This exercise focuses on enabling you to do the following:
 Log in to the exercise environment
 Identify the components of ONTAP software
 Set the ONTAP command-line system timeout value (optional)

Exercise Equipment
In this exercise, you use the elements that are highlighted (not the elements that are grayed-out).

Windows Server 2012 R2


Domain Controller ONTAP 9.1 Cluster
Laptop or
Classroom svl-nau
Desktop Remote
Desktop

svl-nau-01 svl-nau-02
(node1) (node2)

Performance Manager Unified Manager CentOS 6.5

OPM UM

opm ocum linux1

Machine Host Name IP Address User Name Password


Windows Server 2012 w2k12 192.168.0.11 LEARN\Administrator Netapp123
ONTAP Cluster Mgmt LIF svl-nau 192.168.0.50 admin Netapp123
ONTAP Cluster Node 1 svl-nau-01 192.168.0.91 admin Netapp123
ONTAP Cluster Node 2 svl-nau-02 192.168.0.92 admin Netapp123

E1-1 ONTAP Performance Analysis: How NetApp Storage Works

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Task 1: Access OnCommand System Manager
In this task, you confirm access to the cluster management port and launch System Manager.
NOTE: For more information about how to configure a storage system with System Manager, see the ONTAP
9 Administration course.
Step Action

1-1 On the Windows Server desktop, launch Windows Internet Explorer.

1-2 Connect to the System Manager through the cluster management interface.
https://192.168.0.50

E1-2 ONTAP Performance Analysis: How NetApp Storage Works

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-3 Log in to the cluster OnCommand System Manager interface.


User name = admin
Password = Netapp123

1-4 View the Cluster Details page for the cluster.

E1-3 ONTAP Performance Analysis: How NetApp Storage Works

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Task 2: Identify ONTAP components
In this task, you use either System Manager or the ONTAP CLI to identify key ONTAP components.
Step Action

2-1 Identify the feature licenses that are installed on the cluster by selecting the appropriate
checkboxes.
NFS
CIFS
iSCSI
SnapRestore
SnapMirror
SnapVault
FCP
FlexClone
SnapLock
SnapManagerSuite

2-2 Record the number of aggregates that are present in the cluster. ________________________

2-3 Enter information about each aggregate in the following table.


Aggregate Owning Disk Disk Disk Is a Number RAID
Name Node Count Type RPM Flash of Group
Pool RAID Size
Present? Groups

2-4 Record the number of storage virtual machines (SVMs) that are present in the cluster.
________________________

E1-4 ONTAP Performance Analysis: How NetApp Storage Works

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

2-5 Enter information about each SVM in the following table.


SVM Volume Allowed Protocols
Name Type

2-6 Record the number of volumes that are present in the cluster. ________________________

2-7 Record the number of volumes in the cluster, if any, that are over 80% full.
________________________

2-8 Record the number of LUNs that are present in the cluster. ________________________

2-9 Enter information about each LUN in the following table.


LUN SVM Operating Initiator Initiators LUN
Name Name System Group ID
Type

2-10 Record the number of SMB shares that are present in the cluster. ________________________

2-11 Enter information about each SMB share in the following table.
Share SVM Share Share Access Share ACL
Name Name Path Control List (ACL) Permission
User and Group

E1-5 ONTAP Performance Analysis: How NetApp Storage Works

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

2-12 Record the number of Ethernet ports that are present in the cluster.
________________________

2-13 Record the number of 10-Gigabit Ethernet (GbE) ports that are present in the cluster.
________________________

2-14 Record the number of FC ports that are present in the cluster. ________________________

2-15 Record the number of 8-Gb FC ports that are present in the cluster.
________________________

2-16 Record the number of LIFs that are present in the cluster. ________________________

2-17 Record the number of LIFs that are present in the cluster that have the “data” role.
________________________

2-18 Record the number of LIFs that are present in the cluster that have the data protocol of iSCSI.
________________________

2-19 Record the number of LIFs that are present in the cluster that have the data protocol of CIFS?
________________________

2-20 Record any LIFs that are not present on their home port. ________________________

End of Exercise

E1-6 ONTAP Performance Analysis: How NetApp Storage Works

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Module 2: Performance Analysis Tools
Exercise 1: Analyzing Performance Statistics
In this exercise, you examine the different variations of the statistics command. You also identify
storage system workloads and potential bottlenecks.

Objectives
This exercise focuses on enabling you to do the following:
 Perform initial health checks on a cluster
 Perform baseline performance monitoring and analysis from the cluster shell
 Use the statistics catalog command
 Use the statistics start and statistics show commands
 Define workload characteristics

Exercise Equipment
In this exercise, you use the elements that are highlighted (not the elements that are grayed-out).

Windows Server 2012 R2


Domain Controller ONTAP 9.1 Cluster
Laptop or
Classroom svl-nau
Desktop Remote
Desktop

svl-nau-01 svl-nau-02
(node1) (node2)

Performance Manager Unified Manager CentOS 6.5

OPM UM

opm ocum linux1

Machine Host Name IP Address User Name Password


Windows Server 2012 w2k12 192.168.0.11 LEARN\Administrator Netapp123
ONTAP Cluster Mgmt LIF svl-nau 192.168.0.50 admin Netapp123
ONTAP Cluster Node 1 svl-nau-01 192.168.0.91 admin Netapp123
ONTAP Cluster Node 2 svl-nau-02 192.168.0.92 admin Netapp123

E2-1 ONTAP Performance Analysis: Performance Tools

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Task 1: Perform initial health checks on the cluster
In this task, you query the cluster to assess the initial health status.
Step Action

1-1 Open a PuTTY session with the NetApp cluster.

1-2 Determine whether all nodes are healthy.


cluster show

1-3 Determine whether the replication rings have the same masters or consistent masters.
Before you use this command, remember to set the advanced privilege level.
set advanced
cluster ring show

1-4 Using the advanced privilege level, determine whether the cluster connectivity is healthy. If the
cluster is a single-node cluster, this command fails.
cluster ping-cluster -node local

1-5 Determine the health of the cluster.


system health status show

1-6 Determine the health of each cluster subsystem.


system health subsystem show

1-7 Determine whether any health alerts were issued for the cluster.
system health alert show

1-8 Display a list of health alerts and their descriptions.


system health alert definition show

1-9 Determine whether any volumes are not online.


volume show -state !online

1-10 Determine whether any aggregates are not online.


storage aggregate show -state !online

E2-2 ONTAP Performance Analysis: Performance Tools

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-11 Determine whether any disks are in a broken state.


storage disk show -state broken

Task 2: Perform baseline performance monitoring from the cluster shell


In this task, you use cluster shell commands to monitor cluster performance.
Step Action

2-1 Display an overview of the performance of the entire cluster.


statistics show-periodic –iterations 4

2-2 Determine the performance status of each node in the cluster.


statistics show-periodic –object node –instance svl-nau-01
-iterations 4

statistics show-periodic –object node –instance svl-nau-02


-iterations 4

2-3 Display performance statistics for all aggregates in the cluster.


statistics aggregate show

2-4 Display performance statistics for all flash pool aggregates in the cluster.
statistics cache flash-pool show

2-5 Display performance statistics for all disks in the cluster.


statistics disk show

2-6 Display performance statistics for all LIFs in the cluster.


statistics lif show

2-7 Display performance statistics for all LUNs in the cluster.


statistics lun show

2-8 Display performance statistics for all nodes in the cluster.


statistics node show

2-9 Display performance statistics for all volumes in the cluster.


statistics volume show

E2-3 ONTAP Performance Analysis: Performance Tools

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

2-10 Display performance statistics for all storage virtual machines (SVMs) in the cluster.
statistics vserver show

2-11 Display performance statistics for all workloads in the cluster.


statistics workload show

2-12 Check the overall latency for each quality-of-service (QoS) policy group in the cluster.
qos statistics latency show –iterations 3

2-13 Check the overall latency for each QoS workload in the cluster.
qos statistics workload latency show –iterations 3

Task 3: Examine the statistics catalog command


In this task, you issue the three statistics catalog commands and exercise the supporting parameters,
using different privilege levels.
Step Action

3-1 Examine the statistics objects that are available at the admin privilege level.
statistics catalog object show

3-2 Change to the advanced privilege level, and reissue the command.

3-3 Record the command syntax that displays only the advanced level statistics objects without their
descriptions. ___________________________

3-4 Record the command syntax that displays statistics objects that are associated with SVMs.
____________________________________
You should still be in the advanced privilege level.
Record the number of statistics objects that appear in the list. ________________

3-5 Change to the admin privilege level, and examine the instance names that are available for the
statistics object “volume.”
statistics catalog instance show -object volume

3-6 Record the command syntax that displays the instance names that are available for the statistics
object that represents SVMs that have LIFs associated with them.
____________________________________
Record the number of instance names that appear in the list. _____________

E2-4 ONTAP Performance Analysis: Performance Tools

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

3-7 Record the command syntax that displays the instance names that are available for the statistics
object that represents SVMs that have volumes associated with them.
____________________________________
Record the number of instance names that appear in the list. _____________

3-8 Record the command syntax that displays the instance names that are available for the statistics
object that represents the disk that is associated with the second node in the cluster (svl-nau-02).
____________________________________

3-9 Examine the counters that are available for the statistics object “disk,” and show the detailed
information.
statistics catalog counter show -object disk –describe

3-10 Change command privilege level to advanced privilege level, and rerun the command.
Remember to reset the privilege level to admin when you are done.

3-11 Examine the counters that are available for the statistics object “aggregate,” and show the
detailed information.
statistics catalog counter show -object aggregate -describe

3-12 Examine the counters that are available for the statistics object “volume,” and show the detailed
information.
statistics catalog counter show -object volume -describe

Task 4: Examine the statistics start and statistics show


commands
In this task, you issue the statistics start and statistics show commands and exercise the
supporting parameters, using different privilege levels.
Step Action

4-1 Using the admin privilege level, attempt to start statistics data collection on the statistics object
“nfsv3.”
statistics start -object nfsv3 -sample-id sample_nfsv3_adm

4-2 Using the advanced privilege level, start statistics data collection on the statistics object “nfsv3.”
set advanced
statistics start -object nfsv3 -sample-id sample_nfsv3_adv

4-3 Display the counters that are associated with the statistics object “nfsv3” instance “SVM_teal”
for both samples.
statistics show -object nfsv3 -instance SVM_teal -counter * -sample-id
sample_nfsv3_adm
statistics show -object nfsv3 -instance SVM_teal -counter * -sample-id
sample_nfsv3_adv
Record any difference in the displays that you observe. __________________

E2-5 ONTAP Performance Analysis: Performance Tools

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

4-4 Using the advanced privilege level, start statistics data collection on the statistics object “disk.”
statistics start -object disk -sample-id sample_disk

4-5 Record the command syntax that displays all latency related counters for disk v4.20.
___________________________
Record the number of counters that appear in the list. _______________

4-6 Record the command syntax that displays all user_read_latency counters for all disks in the
cluster. ___________________________

Record the number of counters that appear in the list. _______________

4-7 Display only the disks with the user_read_latency counter between 1000us and 10000us.
statistics show -object disk -instance * -counter user_read_latency
-sample-id sample_disk -value 1000..10000

4-8 Record the command syntax that displays all user_read_latency counters that are greater than
10000us for all disks in the cluster, and limit the display to only the counter and value fields.

4-9 Using the advanced privilege level, start statistics data collection on the statistics object
“aggregate.”
statistics start -object aggregate -sample-id sample_aggr

4-10 Display nonzero user_reads and user_writes counters for each aggregate.
statistics show -object aggregate -instance * -counter
user_reads|user_writes -sample-id sample_aggr -value >0

4-11 Using the advanced privilege level, start statistics data collection on the statistics object
“volume.”
statistics start -object volume -sample-id sample_volume

4-12 Display all volume activity by displaying all data counters for each volume.
statistics show -object volume -instance * -counter *data -sample-id
sample_volume

4-13 Record the command syntax that displays all latency counters for all volumes in the cluster.
__________________________
Now, record the command syntax that displays all nonlatency counters for all volumes in the
cluster. __________________________

4-14 Using the advanced privilege level, start statistics data collection on the statistics object
“processor.”
statistics start -object processor -sample-id sample_processor

E2-6 ONTAP Performance Analysis: Performance Tools

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

4-15 Record the command syntax that displays all counters for all processors in the cluster.
__________________________

4-16 Using the advanced privilege level, start statistics data collection on the statistics object
“workload.”
statistics start -object workload -sample-id sample_workload

4-17 Display all latency counters for each workload, and limit the display to the counter and value
fields.
statistics show -object workload -instance * -counter *latency -sample-
id sample_workload -fields counter,value

4-18 Stop all statistics data collection.


statistics stop -sample-id sample_nfsv3_adm
statistics stop -sample-id sample_nfsv3_adv
statistics stop -sample-id sample_disk
statistics stop -sample-id sample_aggr
statistics stop -sample-id sample_volume
statistics stop -sample-id sample_processor
statistics stop -sample-id sample_workload

4-19 Delete old sample files.


statistics samples show
statistics samples delete -sample-id *

Task 5: Define workload characteristics


In this task, you gather statistical data and evaluate the data to determine the workload characteristics.
Step Action

5-1 Turn off the Snapshot policy for volume SVM_teal_vol1.


volume modify -vserver SVM_teal -volume SVM_teal_vol1 -snapshot-policy
none

5-2 On your Windows Server desktop, double-click the CourseFiles shortcut.

5-3 Copy the file 300mfile to the Z: drive.

E2-7 ONTAP Performance Analysis: Performance Tools

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

5-4 Normally you would start data collection for all protocols that are served by the cluster, but for
this exercise you use only the SMB protocol.
Using the diagnostic privilege level, start statistics data collection on the objects “cifs,”
“volume,” and “readahead.”
Diagnostic privilege level commands are required to capture the cifs_read_size_histo,
cifs_write_size_histo, rand_read_req, and seq_read_req counters.
set diagnostic
statistics start -object cifs|volume|readahead -sample-id sample_cifs1

5-5 Display the QoS workload characterization for the volume in real time.
qos statistics volume characteristics show vserver volume

5-6 On the Windows Server desktop, double-click the Command Prompt.

5-7 Drag the SIO.exe shortcut to the Command Prompt window.

5-8 Press the space bar, type the following list of parameters, and then press Enter.
0 0 32k 300m 30 1 Z:\300mfile

E2-8 ONTAP Performance Analysis: Performance Tools

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

5-9 After the command ends and the command prompt returns, go to the PuTTY session and
terminate the QoS statistics command, by using ‘^C’.
Record the volume workload characteristics:
IOPS: _______________
Throughput: ______________
Request Size: ______________
Read %: __________
Concurrency: ____________

5-10 Turn off the performance data collection.


statistics stop -sample-id sample_cifs1

5-11 Start statistics data collection again on the same objects, but this time change the sample name
to “sample_cifs2.”
statistics start -object cifs|volume|readahead -sample-id sample_cifs2

5-12 Display the QoS workload characterization for the volume in real time.
qos statistics volume characteristics show vserver volume

5-13 In the Command Prompt window, press the Up arrow, change the SIO parameters to the
following, and then press Enter:
100 0 32k 300m 30 4 Z:\300mfile

5-14 After the command ends and the command prompt returns, go to the PuTTY session and
terminate the QoS statistics command using ‘^C’.
Record the volume workload characteristics:
IOPS: _______________
Throughput: ______________
Request Size: ______________
Read %: __________
Concurrency: ____________

5-15 Turn off the performance data collection.


statistics stop -sample-id sample_cifs2

5-16 Start statistics data collection again on the same objects, but this time change the sample name
to sample_cifs3.
statistics start -object cifs|volume|readahead -sample-id sample_cifs3

5-17 Display the QoS workload characterization for the volume in real time.
qos statistics volume characteristics show vserver volume

E2-9 ONTAP Performance Analysis: Performance Tools

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

5-18 In the Command Prompt window, press the Up arrow, change the SIO parameters to the
following parameters, and then press Enter:
70 60 4k 300m 30 32 Z:\300mfile

5-19 After the command ends and the command prompt returns, go to the PuTTY session and
terminate the QoS statistics command, by using ‘^C’.
Record the volume workload characteristics:
IOPS: _______________
Throughput: ______________
Request Size: ______________
Read %: __________
Concurrency: ____________

5-20 Turn off the performance data collection.


statistics stop -sample-id sample_cifs3

5-21 The QoS workload characterization commands provide useful information, but they don’t
provide the same level of detail that you obtain by using the statistics show command.
The following commands help to define the CIFS workload characteristics in each sample.
Use the commands to complete the table in 5-22.

statistics show -object cifs -instance * -counter cifs_read_ops


-sample-id sample_cifs3
statistics show -object cifs -instance * -counter cifs_write_ops
-sample-id sample_cifs3

statistics show -object cifs -instance * -counter cifs_read_size_histo


-sample-id sample_cifs3
statistics show -object cifs -instance * -counter cifs_write_size_histo
-sample-id sample_cifs3

statistics show -object volume -instance white_vol1 -counter


cifs_read_latency -sample-id sample_cifs3

statistics show -object volume -instance white_vol1 -counter


cifs_write_latency -sample-id sample_cifs3

statistics show -object readahead -instance * -counter rand_read_reqs


-sample-id sample_cifs3
statistics show -object readahead -instance * -counter seq_read_reqs
-sample-id sample_cifs3

E2-10 ONTAP Performance Analysis: Performance Tools

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

5-22 Record your observations for sample_cifs3 in the table that follows.

CHARACTERISTIC OBSERVED VALUE

Read Write

Throughput

Latency

Operation Size

Concurrency

Randomness

5-23 Calculate the concurrency for the workload (throughput * latency), and enter the result into the
table in 5-22.

5-24 Record the type of workload characteristics that you observed in your analysis of sample_cifs3.
__________________________

End of Exercise

E2-11 ONTAP Performance Analysis: Performance Tools

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Module 3: OnCommand Performance Manager
Exercise 1: OnCommand Performance Manager Thresholds, Events, and
Alerts
In this exercise, you explore the OnCommand Performance Manager user interface, including events and
alerts. This exercise is based on a customized version of Performance Manager with a database of real-world
performance data that gives you a realistic view of the tool. The data was collected between March and
September 2015. Ongoing data collection is disabled.

Objectives
This exercise focuses on enabling you to do the following:
 Log in to Performance Manager and explore the Performance Manager UI
 Explore the Performance Manager dashboard

Exercise Equipment
In this exercise, you use the elements that are highlighted (not the elements that are grayed-out).

Windows Server 2012 R2


Laptop or Domain Controller ONTAP 9.1 Cluster
Classroom svl-nau
Desktop Remote
Desktop

svl-nau-01 svl-nau-02
(node1) (node2)

Performance Manager Unified Manager CentOS 6.5

OPM UM

opm ocum linux1

Machine Host Name IP Address User Name Password


Windows Server 2012 w2k12 192.168.0.11 LEARN\Administrator Netapp123
OnCommand Unified
ocum 192.168.0.15 admin Netapp123
Manager
OnCommand Performance
opm 192.168.0.16 admin Netapp123
Manager

E3-1 ONTAP Performance Analysis: OnCommand Performance Manager

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Task 1: Log in to Performance Manager and explore the Performance
Manager UI
Step Action

1-1 Open a browser window and enter the IP address of Performance Manager:
https://192.168.0.16

1-2 In the “There is a problem” dialog box, click Continue to this website.

1-3 Sign in to Performance Manager as the administrator (the user name and password are case-
sensitive):
 User name: admin
 Password: Netapp123

1-4 Performance Manager is fully configured in your exercise kit, so the Performance Manager
Dashboard appears:

1-5 Because this customized version of Performance Manager does not collect live data,
a red warning appears below the cluster name.

E3-2 ONTAP Performance Analysis: OnCommand Performance Manager

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-6 On the Dashboard page, position your cursor over the heading for each pane. Review the
definitions of each performance-measurement counter.

1-7 Record the reason that the squares in the performance panes are green.
_____________________

1-8 In the Utilization panel of the cluster dashboard, click the Nodes checkbox.

1-9 Record the number of nodes that appear in cluster opm-capacity. ______________

1-10 To sort by percentage of utilization, lowest to highest, click the Utilization column header.
Record the node that is the most heavily utilized. __________________

1-11 Use the browser back button to return to the Performance Dashboard.

E3-3 ONTAP Performance Analysis: OnCommand Performance Manager

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-12 In the Latency panel of the cluster dashboard, click the SVMs checkbox.

1-13 Record the number of storage virtual machines (SVMs) that appear on the cluster opm-capacity.
_________________

1-14 To sort the SVMs by IOPS, in descending order, click the IOPS column header twice.
Record the name of the busiest SVM. _______________

1-15 Use the browser back button to return to the Performance Dashboard.

1-16 In the Utilization panel of the cluster dashboard, click the Aggregates checkbox.

E3-4 ONTAP Performance Analysis: OnCommand Performance Manager

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-17 To sort by utilization, lowest to highest, click the Utilization column header. To toggle the sort
order from highest to lowest, click the Utilization column header again. Record the name of the
aggregate that is the most heavily utilized. _________________

1-18 To sort the aggregates in descending order, based on response time, click the Latency column
header twice. Record the name of the aggregate that is the slowest. _________________

1-19 Use the browser back button to return to the Performance Dashboard.

1-20 Click View Cluster Details.

E3-5 ONTAP Performance Analysis: OnCommand Performance Manager

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-21 Observe that the Cluster Details page appears.

1-22 Verify that you see the number of cluster resources that are being monitored for performance.

1-23 To display the top 10 charts, click the Top Performers tab.

1-24 Scroll down to the Top 10 Volumes chart, and change the metric to IOPS.

1-25 Record the three volumes that experienced the highest IOPS over the previous 72 hours.
_________________________

E3-6 ONTAP Performance Analysis: OnCommand Performance Manager

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Task 2: Explore the Performance Manager dashboard
Step Action
2-1. Click Storage and select Nodes:

2-2. In the Nodes page, click the link for node opm-capacity-01.

2-3. Use the Time Range menu to specify a Custom Range of time. Set the starting time to 12:00
am on August 1 and the ending time to 8:00 am on August 2.

E3-7 ONTAP Performance Analysis: OnCommand Performance Manager

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action
2-4. Scroll down to observe the Latency, IOPS and MBps (megabytes per second) charts.
Record the approximate average latency during the time period. _______________
Record the approximate average number of IOPS during the time period. ________________
Record the approximate average throughput during the time period. __________________

2-5. In the View and Compare panel, click the Add button for node opm-capacity-02.

E3-8 ONTAP Performance Analysis: OnCommand Performance Manager

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action
2-6. Observe that the performance data for node opm-capacity-02 was added to the charts for
comparison purposes.

2-7. In the Comparing panel, click the X symbol to remove the data for node opm-capacity-02 from
the charts.

E3-9 ONTAP Performance Analysis: OnCommand Performance Manager

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action
2-8. In the Explorer page for node opm-capacity-01, position your cursor over the last critical-level
event. Click the View Event Detail button.
Record the event name and number. ______________
Record the time when the event was triggered. ______________
Record the performance policy that was triggered. ______________
Record the Performance Capacity Used value that triggered the event. ______________

E3-10 ONTAP Performance Analysis: OnCommand Performance Manager

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action
2-9. Return to the Node Explorer page. Use the Choose Charts menu to display the charts for only
Events, Perf. Capacity Used, and Perf. Capacity Used (Advanced).

2-10. In the Performance Capacity Used chart, position your cursor over the spike in the chart that
aligns with the last critical event.
Record the value of the optimal Performance Capacity Used counter at that point. ___________

E3-11 ONTAP Performance Analysis: OnCommand Performance Manager

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action
2-11. In the Perf. Capacity Used (Advanced) chart, position your cursor over the spike in the chart that
aligns with the last critical event.
Record the percentage of the optimal Performance Capacity Used that is consumed by user
protocols. _______________
Record the percentage of the optimal Performance Capacity Used that is consumed by
background processing. _______________

End of Exercise

E3-12 ONTAP Performance Analysis: OnCommand Performance Manager

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Module 4: CPU and Memory Performance
Exercise 1: Identifying and resolving storage controller performance
issues
In this exercise, you identify and resolve performance issues with the CPU and memory subsystems of a
NetApp storage system.

Objectives
This exercise focuses on enabling you to do the following:
 Display the performance characteristics of a cluster node
 Identify performance issues that are caused by a lack of controller resources
 Resolve performance issues that are caused by a lack of controller resources

Exercise Equipment
In this exercise, you use the elements that are highlighted (not the elements that are grayed-out).

Windows Server 2012 R2


Domain Controller ONTAP 9.1 Cluster
Laptop or
Classroom svl-nau
Desktop Remote
Desktop

svl-nau-01 svl-nau-02
(node1) (node2)

Performance Manager Unified Manager CentOS 6.5

OPM UM

opm ocum linux1

Machine Host Name IP Address User Name Password


Windows Server 2012 w2k12 192.168.0.11 LEARN\Administrator Netapp123

ONTAP Cluster Mgmt LIF svl-nau 192.168.0.50 admin Netapp123


OnCommand Unified
ocum 192.168.0.15 admin Netapp123
Manager
OnCommand Performance
opm 192.168.0.16 admin Netapp123
Manager

E4-1 ONTAP Performance Analysis: CPU and Memory Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Task 1: Display node resource performance
In this task, you use the cluster shell to display and analyze the processing and memory performance
characteristics of a cluster node.
Step Action

1-1 Open a PuTTY session with the NetApp cluster.

1-2 Using the advanced privilege level, start statistics data collection on the objects “volume,”
“aggregate,” and “disk”.
set diag
statistics start -object system:node|bufcache -sample-id sample_CPUMem

1-3 On the Windows system desktop, open a PowerShell window.

1-4 Drag the sio_ntap_win32.exe shortcut from the Windows system desktop into the PowerShell
window.

1-5 Press the space bar, type the following list of parameters, and then press Enter:
0 0 32k 900m 90 1 Y:\900mfile

1-6 While the SIO program is running, display the clusterwide performance statistics.
cluster statistics show
Record the value for the CPU Busy field. _______________

1-7 While the SIO program is running, display the cluster node performance statistics.
statistics node show
Record the value for the CPU % field on each node. _______________

1-8 While the SIO program is running, display the performance statistics for quality-of-service
(QoS) workloads.
qos statistics workload performance show -node svl-nau-01

1-9 While the SIO program is running, display the CPU performance statistics for QoS workloads
qos statistics workload resource cpu show -node svl-nau-01

1-10 While the SIO program is running, display the latency statistics for QoS workloads
qos statistics workload latency show -node svl-nau-01
Record whether any latency is attributed to the Network or Data subsystems. _______________

1-11 Display CPU utilization for each CPU in the cluster node.
node run local sysstat –m –c 5

1-12 Determine whether any consistency points are being triggered by a lack of memory.
node run local sysstat –u –c 5
Record any consistency points that are triggered by a lack of Write Anywhere File Layout
(WAFL) buffers (CP Type=H or L). _______________
Record any consistency points that are triggered by a lack of Network buffers (CP Type=M).
_______________

E4-2 ONTAP Performance Analysis: CPU and Memory Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-13 After the SIO program completes, from the cluster shell prompt, stop the collection of
performance data.
statistics stop -sample-id sample_CPUMem

1-14 Display the CPU busy counters.


statistics show -counter avg_processor_busy|total_processor_busy
-sample-id sample_CPUMem

1-15 Display WAFL cache activity.


statistics show –object bufcache -sample-id sample_CPUMem
Record the number of buffers that are allocated. _______________
Record the amount of memory that is consumed by the WAFL cache. _______________
Record the number of buffers that are empty. _______________
Record the number of buffers that were stolen by other subsystems. _______________
Record whether the cluster memory is constrained. _______________

1-16 Delete any old sample files.


statistics samples show
statistics samples delete -sample-id sample_CPUMem

Task 2: Use Performance Manager to analyze a performance issue


In this task, you use OnCommand Performance Manager to identify the root cause of a performance issue.
Consider a scenario: You receive an alert email from Performance Manager that indicates that the
performance capacity of a node has been exceeded. The email message includes the event number “8208.”
Starting with the event, investigate and identify the cause of the performance event.

Step Action

2-1 Log in to OnCommand Performance Manager.

2-2 To navigate to the Events page, click the Events tab.

E4-3 ONTAP Performance Analysis: CPU and Memory Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

2-3 Enter the event number that was provided by the alert email message into the search text box,
and click Search.

2-4 Confirm that the event is found. Record the event severity level. _______________

2-5 To identify the event type, position your cursor over the description field.

2-6 To navigate to the Event Details page, click the event name.

Record the full name of the event. _______________


Record the node that is affected. _______________
Record when the event was triggered. _______________
Record the threshold policy that was violated (if any). _______________

2-7 Click the link for the affected node.

2-8 In the Explorer tab of the Node Details page, use the Time Range menu to set the time range of
the charts to six hours before the event and three hours after the event. Click Apply Range.

E4-4 ONTAP Performance Analysis: CPU and Memory Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

2-9 In the Events chart, position your cursor over the dot for the event that occurred at the time of
the event that you saw in the Events page. Record whether this event matches the event that you
identified in the Events page. _______________

2-10 In the Explorer tab of the Node Details page, scroll down to the Performance Capacity Used
Chart. Record whether the spike in capacity used corresponds to the time of the event.
_______________

2-11 Position your cursor over the spike in the Performance Capacity Used chart. Record the value of
the counter at the top of the spike. _______________ Record whether this value matches the
value that was reported in the Event Details page. _______________

2-12 To limit the display to the Performance Capacity Used charts, select the Choose charts menu.

E4-5 ONTAP Performance Analysis: CPU and Memory Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

2-13 In the View and Compare menu, select Aggregates on this Node.

2-14 In the View and Compare panel, click the Performance Capacity Used column header twice, to
sort the aggregates in descending order based upon the percentage of their optimal performance
capacity used.
Record the aggregates that are using the most performance capacity. _______________

2-15 In the View and Compare panel, click the Add button next to the two aggregates that are
consuming the highest percentage of their optimal performance capacity. The metrics for these
aggregates are added to the performance charts for the node.

2-16 Based on the information displayed in the Performance Capacity Used chart, record the
aggregate that is contributing most to the used performance capacity of the node during the
event. _______________
If you find it difficult to discern the busiest aggregate, you can perform the following actions:
Use the Zoom View button to display the chart in a larger format, position your cursor over the
spike in the chart to display the metric values in text form, or drag your pointer over a section of
the chart to zoom in to a closer view.

2-17 Click the link for the aggregate with the highest performance capacity used during the event
time frame. The Aggregate Details page opens.

E4-6 ONTAP Performance Analysis: CPU and Memory Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

2-18 In the Explorer tab of the Aggregate Details page, use the Time Range menu to set the time
range of the charts to six hours before the event and three hours after the event. Click Apply
Range.

2-19 In the Explorer tab of the Aggregate Details page, scroll down to the Performance Capacity
Used Chart. Record whether the spike in capacity used corresponds to the time of the event.
_______________

2-20 Now, scroll up to the Throughput (MBps, or megabytes per second) chart. Record whether the
spike in the Throughput chart corresponds to the time of the event. _______________

2-21 In the View and Compare panel, use the menu to select Volumes on this Aggregate.

2-22 In the View and Compare panel, click the Add button to add the metrics for the volume to the
charts.

E4-7 ONTAP Performance Analysis: CPU and Memory Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

2-23 Record whether the activity on the volume correlates with the activity on the aggregate.
_______________
Record your explanation of how the volume can show more activity than the aggregate shows.
_______________
If you view the Volume Details page, you see that the cache miss ratio is zero for this volume
during this timeframe. Therefore, this volume is not the volume that generates I/O on the
aggregate.

Task 3: Use Perfomance Manager to resolve a performance issue


In this task, you use OnCommand Performance Manager to identify the best corrective action to take to
resolve a performance issue. You identified the volume that is stressing the node. Now you can resolve the
issue. One solution is to migrate the volume to an aggregate on a different node in the cluster. To do so, you
first must identify another node that is less busy. Then, you choose an aggregate on that node that can manage
the additional load that the volume poses.

Step Action

3-1 In the OnCommand Performance Manager user interface, use the Storage menu to navigate to
the Volumes page.

3-2 Record the total capacity of the busiest volume in the aggregate that you identified in Task 2.
_______________
Record the average throughput of the volume over the previous 72 hours. _______________
Record the average IOPS of the volume over the previous 72 hours. _______________

3-3 Use the Storage menu to navigate to the Nodes page.

3-4 Use the search text box to limit the display to only nodes that are members of the opm-capacity
cluster.

3-5 To sort the nodes by the percentage of optimal performance capacity used, in ascending order,
click the Performance Capacity Used column header.

Observe that node opm-capacity-01 is the node with the most available capacity over the
previous 72 hours. However, node opm-capacity-01 is also the node with the highest throughput
during that time. You might be able to improve performance by migrating the volume to node
opm-capacity-02.

E4-8 ONTAP Performance Analysis: CPU and Memory Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

3-6 Use the Storage menu to navigate to the Aggregates page.

3-7 Use the search text box to limit the display to only aggregates that are owned by node opm-
capacity-02.

3-8 To sort the aggregates by the percentage of optimal performance capacity used, in ascending
order, click the Performance Capacity Used column header.
Record the aggregate that has the most available performance capacity. _______________
Record the reason that you might or might not want to migrate the volume to this aggregate.
_______________
Record whether sufficient free space to store the volume is available in the aggregate.
_______________

3-9 Now, you must use OnCommand System Manager or the ONTAP CLI to migrate the volume to
another aggregate on the same node or on a different node.

End of Exercise

E4-9 ONTAP Performance Analysis: CPU and Memory Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Module 5: WAFL Performance Monitoring and Analysis
Your instructor will provide you with the license codes that you need for the exercises.

Exercise 1: WAFL Performance


In this exercise, you identify and resolve performance issues that are related to the Write Anywhere File
Layout (WAFL) subsystem of ONTAP software.

Objectives
This exercise focuses on enabling you to do the following:
 Use the cluster shell to display the performance characteristics of the WAFL subsystem
 Use the cluster shell to resolve performance issues that are related to volumes and aggregates

Exercise Equipment
In this exercise, you use the elements that are highlighted (not the elements that are grayed-out).

Windows Server 2012 R2


Laptop or Domain Controller ONTAP 9.1 Cluster
Classroom svl-nau
Desktop Remote
Desktop

svl-nau-01 svl-nau-02
(node1) (node2)

Performance Manager Unified Manager CentOS 6.5

OPM UM

opm ocum linux1

Machine Host Name IP Address User Name Password


Windows Server 2012 w2k12 192.168.0.11 LEARN\Administrator Netapp123
ONTAP Cluster Mgmt LIF svl-nau 192.168.0.50 admin Netapp123
ONTAP Cluster Node 1 svl-nau-01 192.168.0.91 admin Netapp123
ONTAP Cluster Node 2 svl-nau-02 192.168.0.92 admin Netapp123

E5-1 ONTAP Performance Analysis: WAFL Performance Monitoring and Analysis

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Task 1: Display WAFL subsystem performance
In this task, you use the cluster shell to display the performance characteristics of the WAFL subsystem.
Step Action

1-1 On the Windows system desktop, open a PowerShell window.

1-2 From the PowerShell window, create a 900-megabyte (MB) file in the Z: drive.
fsutil file createnew Z:900mfile 900000000

1-3 Open a PuTTY session with the cluster.

1-4 From the diagnostic privilege level, start the collection of WAFL performance statistics.
set diag
statistics start –object raid|wafl –sample-id sample_wafl

1-5 Drag the sio_ntap_win32.exe shortcut from the Windows system desktop into the PowerShell
window.

1-6 Press the space bar, type the following list of parameters, and then press Enter:
0 0 32k 900m 90 1 Z:\900mfile

1-7 After the SIO program completes, from the cluster shell prompt, stop the collection of
performance data.
statistics stop -sample-id sample_wafl

1-8 Examine the WAFL write request sizes. The histogram depicts the number of write requests of
each size that is passed from WAFL to RAID.
statistics show -object raid -instance raid -counter
write_io_depth_histo -sample-id sample_wafl

1-9 Display the number of RAID stripes written per second.


statistics show -object raid -instance raid -counter stripes_written
-sample-id sample_wafl

1-10 Record the number of these write requests that write a full RAID stripe. _______________
Record the number of these write requests that result in a partial stripe write.
___________________
statistics show -object raid -instance raid -counter
full_stripes|stripes_written|partial_stripes -sample-id sample_wafl
A high percentage of write requests resulting in partial stripe writes might indicate
fragmentation, especially if the WAFL write request sizes exceed the stripe width.

1-11 Compare the number of calculate parity read operations to the total number of blocks that are
written by WAFL.
statistics show -object wafl -instance wafl -counter
wafl_total_blk_writes|wafl_cp_blks_read|wafl_cp_read_time -sample-id
sample_wafl
A high percentage of write requests that result in a calculate parity read operation might indicate
fragmentation.

E5-2 ONTAP Performance Analysis: WAFL Performance Monitoring and Analysis

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Task 2: Resolve volume performance issues
In this task, you use cluster shell commands to resolve performance issues that are related to volumes.
Step Action

2-1 Determine whether a volume is overfull.


storage volume show-space –vserver <svm_name> -volume <vol_name>

2-2 Determine whether a volume is fragmented.


volume reallocation measure –vserver <svm_name> -path <vol_path> -once

2-3 Display the progress of the reallocate measure scan.


volume reallocation show

2-4 Display the results of the reallocate measure scan.


event log show -event wafl.reallocate
Record the volume layout optimization value. __________________

2-5 To consolidate the disk blocks that belong to the files in the volume, perform reallocation on the
volume.
volume reallocation start –vserver <svm_name> -path <vol_path> -once
–no-check

2-6 To consolidate the free blocks into contiguous disk blocks, perform reallocation on the
aggregate.
aggregate reallocation start –aggregate <aggr_name> -once

2-7 Determine whether read reallocation is enabled on the volume.


volume show <vol_name> -fields read-realloc

2-8 Enable read reallocation on the volume.


volume modify –vserver <svm_name> -volume <vol_name> -read-realloc
space-optimized

2-9 Determine whether free space reallocation is enabled on the aggregate.


aggregate show <aggr_name> -fields free-space-realloc

2-10 Enable free space reallocation on the aggregate.


aggregate modify <aggr_name> -free-space-realloc on

E5-3 ONTAP Performance Analysis: WAFL Performance Monitoring and Analysis

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Task 3: Resolve aggregate performance issues
In this task, you use cluster shell commands to resolve performance issues that are related to aggregates.
Step Action

3-1 Identify volumes and workloads in the aggregate.


volume show –aggregate <aggr_name>

3-2 Identify the most busy and least busy volumes in the aggregate.
statistics volume show –aggregate <aggr_name>

3-3 Identify the other nodes and or aggregates that can host the volumes.
statistics aggregate show
Which data (non-root) aggregate has the lowest number of I/O operations?

3-4 Move the least busy volume to the least busy aggregate.
volume move start –vserver <vs_name> –volume <volume>
-destination-aggregate <aggr_name>

End of Exercise

E5-4 ONTAP Performance Analysis: WAFL Performance Monitoring and Analysis

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Module 6: Disk I/O Performance
Exercise 1: Identifying and resolving disk I/O bottlenecks
In this exercise, you identify and resolve performance issues that are related to the disk I/O subsystem of
ONTAP.

Objectives
This exercise focuses on enabling you to do the following:
 Identify the components of the disk I/O subsystem
 Use the cluster shell to monitor and display disk subsystem performance
 Use the cluster shell to configure RAID background tasks
 Use Performance Manager to identify and resolve disk performance issues

Exercise Equipment
In this exercise, you use the elements that are highlighted (not the elements that are grayed-out).

Windows Server 2012 R2


Domain Controller ONTAP 9.1 Cluster
Laptop or
Classroom svl-nau
Desktop Remote
Desktop

svl-nau-01 svl-nau-02
(node1) (node2)

Performance Manager Unified Manager CentOS 6.5

OPM UM

opm ocum linux1

Machine Host Name IP Address User Name Password


Windows Server 2012 w2k12 192.168.0.11 LEARN\Administrator Netapp123
ONTAP Cluster Mgmt LIF svl-nau 192.168.0.50 admin Netapp123
OnCommand Unified
ocum 192.168.0.15 admin Netapp123
Manager
OnCommand Performance
opm 192.168.0.16 admin Netapp123
Manager

E6-1 ONTAP Performance Analysis: Disk I/O Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Task 1: Identify the components of the disk I/O subsystem
In this task, you use the cluster shell to identify the components of the disk I/O subsystem.
Step Action

1-1 Open a PuTTY session with the ONTAP cluster.

1-2 Switch to the diagnostics privilege level of the cluster shell.


set diag

1-3 Identify the number of disks and types of disks that are present in the cluster.
storage disk show

1-4 Determine whether the cluster uses partitioned disks.


storage disk partition show
Record whether the cluster uses partitioned disks. ___________________
The lab equipment might not include disk partitions, so this command might return a result of
“This table is currently empty.”

1-5 Identify the aggregates that are present in the cluster.


storage aggregate show
Record the amount of space in the aggregate that is in use. ___________________

1-6 Identify how the space in an aggregate is used.


storage aggregate show-space –aggregate <aggr_name>
Record the amount of space in the aggregate that is consumed by metadata.
___________________

1-7 Identify the configuration of the aggregates that are present in the cluster.
storage aggregate show-status

1-8 Identify the configuration of a specific aggregate.


storage aggregate show-status –aggregate <aggr_name>
Record the number of plexes that are present in the aggregate. ___________________
Record the number of RAID groups that are present in the aggregate. ___________________
Record the number of disks per RAID group that are present in the aggregate.
___________________
Record the disk type and RPM. ___________________

1-9 Identify the disk shelves that are present in the cluster.
storage shelf show
Record the number of shelves that are present in the cluster. ___________________
The lab equipment might not include disk shelves, so this command might return a result of
“There are no entries matching your query,” or “This table is currently empty.”

E6-2 ONTAP Performance Analysis: Disk I/O Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-10 Identify the connection type of the disk shelves.


storage shelf show -port
Record the negotiated speed of the SAS port. ________________
The lab equipment might not include disk shelves, so this command might return a result of
“This table is currently empty.”

1-11 Determine whether any aggregates are not online.


storage aggregate show -state !online

1-12 Determine whether any disks are in a broken state.


storage disk show -state broken

Task 2: Use the cluster shell to monitor disk performance


In this task, you use cluster shell commands to monitor cluster disk I/O performance.
Step Action

2-1 Check the performance of all aggregates in the cluster.


statistics aggregate show

Record the aggregate that performs the most I/O operations. ________________

2-2 Display the 10 busiest disks.


statistics disk show –sort-key disk_busy –max 10

Record the busiest disk. ________________

2-3 Display the 10 slowest disks.


statistics disk show –sort-key operation_latency –max 10

Record the slowest disk. ________________

2-4 Identify the aggregate to which the slowest disk belongs.


storage aggregate show –disklist <disk_name>
Record the aggregate that contains the slowest disk. ________________

2-5 Display the latency of the disks that belong to the busiest aggregate.
storage disk show –aggregate <aggr_name> -fields average_latency

Record the slowest disk. ________________

E6-3 ONTAP Performance Analysis: Disk I/O Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Task 3: Use the cluster shell to display disk performance
In this task, you use the cluster shell to collect and display the performance characteristics of the disk I/O
subsystem.

Step Action

3-1 On the Windows system desktop, open a PowerShell window.

3-2 Using the advanced privilege level, start statistics data collection on the objects “volume,”
“aggregate,” and “disk.”
set advanced
statistics start -object volume|aggregate|disk -sample-id sample_disk

3-3 Drag the sio_ntap_win32.exe shortcut from the Windows system desktop into the PowerShell
window.

3-4 Press the space bar, type the following list of parameters, and then press Enter:
0 0 32k 900m 90 1 Z:\900mfile

3-5 While the SIO program is running, display the disk% busy value and determine if back-to-back
consistency points are occurring.
node run local sysstat –u –c 5

3-6 While the SIO program is running, display the volume performance statistics.
statistics volume show

3-7 After the SIO program completes, from the cluster shell prompt, stop the collection of
performance data.
statistics stop -sample-id sample_disk

3-8 Check the latency for each volume in the cluster.


statistics show-periodic -object volume –counter avg_latency -sample-id
sample_disk

3-9 Check volume latency for a specific volume.


statistics show -object volume -instance white_vol1 –counter
avg_latency -sample-id sample_disk

3-10 Check volume activity.


statistics show -object volume -counter *data -sample-id sample_disk

3-11 Check the latency for each aggregate in the cluster.


statistics show -object aggregate -counter user_reads|user_writes
-sample-id sample_disk

3-12 Check disk latency.


statistics show -object disk -counter *latency -sample-id sample_disk

E6-4 ONTAP Performance Analysis: Disk I/O Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

3-13 Delete the performance sample files.


statistics samples show
statistics samples delete -sample-id *

Task 4: Use the cluster shell to configure RAID background tasks


In this task, you use cluster shell commands to determine the configuration of RAID subsystem background
tasks.
Step Action

4-1 Determine whether media scrubs are enabled.


storage raid-options show –node * -name raid.media_scrub.enable

4-2 Determine the rate at which media is scrubbed.


storage raid-options show –node * -name raid.media_scrub.rate

4-3 Determine whether RAID scrubs are enabled.


storage raid-options show –node * -name raid.scrub.enable

4-4 Determine the performance impact level of RAID scrubs.


storage raid-options show –node * -name raid.scrub.perf_impact

4-5 Determine the duration of RAID scrubs.


storage raid-options show –node * -name raid.scrub.duration

4-6 Determine when RAID scrubs are scheduled to run.


storage raid-options show –node * -name raid.scrub.schedule

Task 5: Use Performance Manager to identify and resolve disk


performance issues
In this task, you use Performance Manager to diagnose the cause of the overuse of an aggregate.
Step Action

5-1 Open a web browser, and connect to the OnCommand Performance Manager interface.
https://opm

5-2 Log in to Performance Manager:


User name: admin
Password: Netapp123

5-3 Navigate to the Events page.

5-4 Use the Search Event Id textbox to search for an event with event ID of 8002.

E6-5 ONTAP Performance Analysis: Disk I/O Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

5-5 Record the event description. ________________

You might need to position your cursor over the event description to see all of the descriptive
text.

5-6 To navigate to the Event Details page, click the event name.

Record the full name of the event. ________________

Record when the event was triggered. ________________

Record when the event concluded. ________________

Record the aggregate that is affected. ________________

Record the event description. ________________

5-7 To navigate to the Aggregate Details page, click the link of the aggregate that is affected.

5-8 Set the time range to encompass the start and end time of the event. For example, set the time
range from 9:00 PM July 30, 2016 through 7:00 AM July 31, 2016.

E6-6 ONTAP Performance Analysis: Disk I/O Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

5-9 Position your cursor over the last warning level event for this aggregate. Record the event name.
________________

5-10 Scroll down to the Perf. Capacity Used chart.

5-11 Position your cursor over the Perf. Capacity Used chart and identify the amount of performance
capacity that was used by Aggr5 at 3:54 am on July 31.

5-12 Change the list of objects that you want to view and compare to “Volumes on this Aggregate.”

E6-7 ONTAP Performance Analysis: Disk I/O Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

5-13 To add each volume that resides in this aggregate to the performance charts, click the
appropriate Add buttons.

5-14 Examine the IOPS and MBps (megabytes per second) charts. Record the volumes that contribute
to the traffic that goes to the aggregate. ________________

5-15 Click the name of the busiest volume in the aggregate.

5-16 Set the time range to encompass the start and end time of the event. For example, set the time
range from 9:00 PM July 30, 2016 through 7:00 AM July 31, 2016.

E6-8 ONTAP Performance Analysis: Disk I/O Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

5-17 Examine the volume IOPS and MBps charts.

5-18 Change the list of the objects that you want to view and compare to “SVM of this Volume.”

5-19 Add the storage virtual machine (SVM) to the charts.

E6-9 ONTAP Performance Analysis: Disk I/O Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

5-20 Examine the IOPS and MBps charts.

Record whether this SVM accounts for the activity to the volume. ________________

5-21 Performance Manager identified that an aggregate exceeded its permitted utilization threshold
and triggered an event alert. You used Performance Manager to identify the aggregate that
generated the event, to determine the volume in the aggregate that is driving the I/O, and to
determine the SVM that owns the volume.
You now can use OnCommand System Manager or OnCommand Unified Manager to determine
the client hosts that are accessing the volume and to determine the connection methods.

End of Exercise

E6-10 ONTAP Performance Analysis: Disk I/O Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Module 7: Flash Cache and Flash Pool Performance
Exercise 1: Exploring Cache Performance
In this exercise, you identify and resolve performance issues that are related to the caching subsystems of a
NetApp storage system. The caching of data in the optional Flash Cache modules and Flash Pool aggregates
is included in the exercise.

Objectives
This exercise focuses on enabling you to do the following:
 Analyze the performance of the caching subsystems
 Configure Flash Cache caching policies
 Configure the aggregate Automated Workload Analyzer (AWA) feature
 Observe and adjust Flash Pool caching policies

Exercise Equipment
In this exercise, you use the elements that are highlighted (not the elements that are grayed-out).

Windows Server 2012 R2


Laptop or Domain Controller ONTAP 9.1 Cluster
Classroom svl-nau
Desktop Remote
Desktop

svl-nau-01 svl-nau-02
(node1) (node2)

Performance Manager Unified Manager CentOS 6.5

OPM UM

opm ocum linux1

Machine Host Name IP Address User Name Password


Windows Server 2012 w2k12 192.168.0.11 LEARN\Administrator Netapp123
ONTAP Cluster Mgmt LIF svl-nau 192.168.0.50 admin Netapp123
ONTAP Cluster Node 1 svl-nau-01 192.168.0.91 admin Netapp123
ONTAP Cluster Node 2 svl-nau-02 192.168.0.92 admin Netapp123

E7-1 ONTAP Performance Analysis: Flash Cache and Flash Pool Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Task 1: Analyze the performance of the caching subsystems
In this task, you use the cluster shell to identify the performance characteristics of the caching subsystems.

Step Action

1-1 Open a PuTTY session with the cluster.

1-2 Display the current Flash Cache configuration settings.


system node external-cache show

1-3 To simulate the presence of a Flash Cache module, enable the Predictive Cache Statistics (PCS)
feature.
system node external-cache modify -node * -is-enabled true

1-4 Adjust the size of the simulated Flash Cache module.


vserver options -option-name flexscale.pcs_size -option-value 16GB

1-5 Display the current Flash Cache configuration settings.


system node external-cache show

1-6 Using the advanced privilege level, start statistics data collection on the “wafl” and
“ext_cache_obj” objects.
set advanced
statistics start -object wafl|ext_cache_obj -sample-id sample_cache

1-7 Drag the sio_ntap_win32.exe shortcut from the Windows system desktop into the PowerShell
window.

1-8 Press the space bar, type the following list of parameters, and then press Enter:
60 80 32k 900m 90 1 Z:\900mfile

1-9 While SIO is running, observe the effectiveness of the Flash Cache cache.
statistics cache flash-pool show

1-10 Stop the collection of performance statistics.


statistics stop -sample-id sample_cache

1-11 Display the read_io_type counter values.


statistics show -object wafl –counter read_io_type -sample-id
sample_cache
Where are the majority of the read requests being satisfied from. ___________

1-12 Obtain a description of the cache counters.


catalog counter show –object ext_cache_obj –counter
hit|hit_percentage|disk_reads_replaced -describe

E7-2 ONTAP Performance Analysis: Flash Cache and Flash Pool Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-13 Display the cache counter values.


statistics show -object ext_cache_obj –counter
hit|hit_percent|disk_reads_replaced -sample-id sample_cache
Record the cache hit rate. ___________
Record the number of disk read operations that are replaced by reads from cache. ___________

Task 2: Configure Flash Cache caching policies


In this task, you use the cluster shell and node shell to adjust the caching policies of the Flash Cache module
and observe the results.
Step Action

2-1 Display Flash Cache counter values.


statistics show -object ext_cache_obj –counter
hit|hit_percent|disk_reads_replaced -sample-id sample_cache
Record the cache hit rate. ___________

2-2 Observe the caching policies for readahead blocks and low-priority blocks.
vserver options -option-name flexscale.readahead_blocks
vserver options -option-name flexscale.lopri_blocks

2-3 Using the advanced privilege level, start statistics data collection on the object “ext_cache_obj”
to a different sample file.
statistics start -object ext_cache_obj -sample-id sample_cache2

2-4 Drag the sio_ntap_win32.exe shortcut from the Windows system desktop into the PowerShell
window.

2-5 Press the space bar, type the following list of parameters, and then press Enter:
60 80 32k 900m 90 1 Y:\900mfile

2-6 Display cache counter values.


statistics show -object ext_cache_obj –counter
hit|hit_percent|disk_reads_replaced -sample-id sample_cache2
Record the cache hit rate. ___________
Record whether the cache hit rate improved. ___________
Record whether the disk read operations replaced rate increased. ___________

2-7 Disable PCS.


system node external-cache modify –node local -is-enabled false

E7-3 ONTAP Performance Analysis: Flash Cache and Flash Pool Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Task 3: Configure the aggregate AWA feature
In this task, you use the cluster shell and node shell to enable the AWA feature to predict the effectiveness of
using a Flash Pool aggregate.
Step Action

3-1 Launch the node shell on cluster node 1.


node run –node svl-nau-01

3-2 Switch to the advanced privilege level.


priv set advanced

3-3 Display the command syntax to manage AWA.


wafl awa

3-4 Start AWA on an aggregate.


wafl awa start fcal_data_102

3-5 Drag the sio_ntap_win32.exe shortcut from the Windows system desktop into the PowerShell
window.

3-6 Press the space bar, type the following list of parameters, and then press Enter:
60 80 32k 900m 90 1 Z:\900mfile

3-7 Display the results of AWA on an aggregate.


wafl awa print

3-8 Record the recommended Flash Pool cache size. ___________

3-9 Record the predicted cache hit ratio if only 60% of recommended Flash Pool cache size is
deployed. ___________

3-10 Display a list of the volumes in the aggregate that benefit the most from caching.
wafl awa print -t

3-11 Stop AWA on an aggregate.


wafl awa stop fcal_data_102

3-12 Exit from the node shell.


exit

Task 4: Observe and adjust Flash Pool caching policies


In this task, you use cluster shell commands to observe and adjust the caching policies of a volume that is
contained in a Flash Pool aggregate.
Step Action

4-1 Display the Flash Pool caching policy that is assigned to all volumes.
volume show –fields caching-policy

E7-4 ONTAP Performance Analysis: Flash Cache and Flash Pool Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

4-2 Display the caching policies that can be assigned to a volume.


volume modify -caching-policy <TAB>

4-3 Change the Flash Pool caching policy that is assigned to a volume.
volume modify -vserver SVM_white -volume white_vol1 -caching-policy all

4-4 Display the Flash Pool caching priority that is assigned to a volume.
volume show –fields cache-retention-priority

4-5 Display the caching priorities that can be assigned to a volume.


volume modify -cache-retention-priority <TAB>

4-6 Change the Flash Pool caching priority that is assigned to a volume.
volume modify -vserver SVM_white -volume white_vol1
-cache-retention-priority high

4-7 Display the Flash Pool caching attributes that are assigned to volumes that belong to a storage
virtual machine (SVM).
volume show -vserver SVM_white –fields caching-policy,cache-retention-
priority

End of Exercise

E7-5 ONTAP Performance Analysis: Flash Cache and Flash Pool Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Module 8: Cluster Interconnect Performance
Exercise 1: Cluster Interconnect Performance
In this exercise, you use the cluster shell to identify performance issues with the cluster interconnect network.
You resolve the issues by adding cluster interconnect ports and LIFs to the cluster nodes.

Objectives
This exercise focuses on enabling you to do the following:
 Identify performance issues with the cluster interconnect network
 Resolve performance issues with the cluster interconnect network by adding cluster ports to the node

Exercise Equipment
In this exercise, you use the elements that are highlighted (not the elements that are grayed-out).

Windows Server 2012 R2


Laptop or Domain Controller ONTAP 9.1 Cluster
Classroom svl-nau
Desktop Remote
Desktop

svl-nau-01 svl-nau-02
(node1) (node2)

Performance Manager Unified Manager CentOS 6.5

OPM UM

opm ocum linux1

Machine Host Name IP Address User Name Password


Windows Server 2012 w2k12 192.168.0.11 LEARN\Administrator Netapp123
ONTAP Cluster Mgmt LIF svl-nau 192.168.0.50 admin Netapp123
ONTAP Cluster Node 1 svl-nau-01 192.168.0.91 admin Netapp123
ONTAP Cluster Node 2 svl-nau-02 192.168.0.92 admin Netapp123

Task 1: Display cluster interconnect performance


In this task, you use the cluster shell to identify performance issues with the cluster interconnect.
Step Action

1-1 Open a PuTTY session with the NetApp cluster.

E8-1 ONTAP Performance Analysis: Cluster Interconnect Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-2 Determine which ports are currently used for the cluster interconnect network.
network port show –ipspace Cluster
Record the ports that are used for the cluster interconnect network. __________________
Record the status of the ports. __________________
Record the speed of the ports. __________________

1-3 Determine if any cluster interconnect links are down.


network interface show –role cluster
Record the LIFs that are used for the cluster interconnect network. __________________
Record the status of the LIFs. __________________

1-4 Verify cluster interconnect network connectivity.


set advanced
cluster ping-cluster <node_name>
Record whether basic connectivity fails on any paths. __________________

1-5 Use the ping command to observe round trip times. Perform this step on each cluster LIF.
network ping –show-detail true –vserver <SVM_name> -lif <lif_name>
-use-source-port true –destination <remote_IP_addr> -count 3
Record the average round trip time for each of the paths. __________________

1-6 Observe the amount of traffic on the cluster interconnect links.


system cluster-switch interface show -counters
The lab environment does not contain physical network switches, so this command might return
the message, “This table is currently empty.”

Task 2: Resolve cluster interconnect performance issues


In this task, to resolve performance issues, you add and enable network ports for use with the cluster
interconnect network.

Step Action

2-1 Identify the available physical network ports.


network port show
Record the speed and frame size of the cluster ports. __________________

2-2 If the frame size is not 9000, then increase the frame size that is used on the cluster ports that are
members of the Cluster broadcast domain. Otherwise, skip this step.
network port broadcast-domain modify –ipspace Cluster –broadcast-domain
Cluster –mtu 9000

E8-2 ONTAP Performance Analysis: Cluster Interconnect Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

2-3 Select the network data ports to convert into cluster interconnect ports, and record the node and
port names.
Cluster node name = __________________
Port name = ___________________

2-4 Determine whether there are data LIFs residing on the ports that will be converted into cluster
ports.
network interface show –curr-node <node_name> –curr-port <port_name>

2-5 If data LIFs reside on the ports that will be converted into cluster ports, assign the data LIFs to a
different data port.
network interface modify –vserver <svm_name> -lif <lif_name> -home-node
<node_name> -home-port <port_name>

2-6 Move the data LIF to the new home port.


network interface revert –vserver <svm_name> -lif <lif_name>

2-7 Verify that the LIF is now on the new home port.
network interface show –vserver <svm_name> -lif <lif_name>
–fields curr-node,curr-port

2-8 Determine to which IP space and broadcast domain the new cluster ports currently belong, and
record the information.
network port show –port <port_name>
IPspace = _______________________
Broadcast = ______________________

2-9 Remove the new cluster ports from the current broadcast domain.
network port broadcast-domain remove-ports –ipspace <IP_space>
-broadcast-domain <bcast_dom> -ports <node_name:port_name>

2-10 Assign the new cluster ports to the Cluster broadcast domain.
network port broadcast-domain add-ports –ipspace Cluster -broadcast-
domain Cluster -ports <node_name:port_name>

2-11 Disable the new cluster ports, so that you can move the network cables to the cluster
interconnect network.
network port modify –up-admin false -node <node_name> -port <port_name>
During this exercise, you do not move the cluster cables.

2-12 Create a new cluster LIF for the new cluster port.
network interface create –vserver Cluster –lif <lif_name> –role cluster
–home-node <node_name> -home-port <port_name> -auto true

E8-3 ONTAP Performance Analysis: Cluster Interconnect Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

2-13 After you move the network cables to the cluster interconnect network, enable the new cluster
ports.
network port modify –up-admin true -node <node_name> -port <port_name>
During this exercise, you do not move the cluster cables, so leave the new cluster port disabled.

End of Exercise

E8-4 ONTAP Performance Analysis: Cluster Interconnect Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Module 9: Storage QoS
Exercise 1: Workload Management with Storage QoS
In this exercise, you learn how to work with the storage quality of service (QoS) feature to prevent problem
workloads and tenants from affecting other workloads and tenants.

Objectives
This exercise focuses on enabling you to do the following:
 Reactively limit throughput to a workload by associating the workload with a QoS policy group
 Proactively monitor workload performance by associating a workload with a QoS policy group
 Isolate a tenant workload by associating the workload with a QoS policy group

Exercise Equipment
In this exercise, you use the elements that are highlighted (not the elements that are grayed-out).

Windows Server 2012 R2


Laptop or Domain Controller ONTAP 9.1 Cluster
Classroom svl-nau
Desktop Remote
Desktop

svl-nau-01 svl-nau-02
(node1) (node2)

Performance Manager Unified Manager CentOS 6.5

OPM UM

opm ocum linux1

Machine Host Name IP Address User Name Password


Windows Server 2012 w2k12 192.168.0.11 LEARN\Administrator Netapp123
ONTAP Cluster Mgmt LIF svl-nau 192.168.0.50 admin Netapp123
ONTAP Cluster Node 1 svl-nau-01 192.168.0.91 admin Netapp123
ONTAP Cluster Node 2 svl-nau-02 192.168.0.92 admin Netapp123

Task 1: Reactively limit throughput to a workload


In this task, you use the Iometer tool to generate a 50% write and 50% read workload on a LUN. You then set
up a storage QoS limit to throttle the workload that is associated with the LUN.

E9-1 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-1 On the landing host, launch the Iometer load generator. On the taskbar of the landing host, click
the Iometer icon and, if prompted, approve the Iometer End User License Agreement, EULA.

The Iometer window should open.

1-2 In the left pane, select the W2K12-01 node.

E9-2 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-3 On the Disk Targets tab, specify the following:


 Targets: Y:orange_lun1 drive
 Maximum Disk Size: 80000
 Starting Disk Sector: 0
 # of Outstanding I/Os: 16

1-4 Click the Access Specifications tab.

E9-3 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-5 Under Global Access Specifications, select 512B; 50% Read; 0% random.

1-6 Click Add.

1-7 Click the Test Setup tab.

E9-4 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-8 On the Test Setup tab, specify the following:


 Test Description: QoS Exercise
 Ramp Up Time: 4 seconds
 Record Results: None

1-9 Click the Results Display tab.

E9-5 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-10 Change the Update Frequency to 4 seconds.


This action reduces the load on the Windows computer that is required to update the Iometer
interface.

1-11 In the toolbar, click the Start Tests button (a green flag).

E9-6 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-12 Open Windows Explorer to the Y:orange_lun1 drive and notice a new file, iobw.tst.
This is the test file that is increasing in size until it is 40,000 kilobytes (KB) (80,000 x 512 bytes
[B]).

1-13 Notice that, when the file reaches its maximum size, the ramp-up time begins to count down for
four seconds.
The ramp-up time ensures that the storage is stable before the testing begins.

1-14 When the 50% read and 50% write test results begin to appear, notice that the total IOPS are
recorded on the top row of the display output.

On the test system, the total IOPS probably remain in a range below the range of a clustered
system that operates on nonvirtualized equipment. Remember that this exercise uses a Windows
system and a cluster that operates in a shared virtualized environment.

1-15 Leave the test running.

E9-7 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-16 Open a PuTTY session with the NetApp cluster.

1-17 Using the statistics command, identify the volume or LUN that shows the most IOPS.
statistics volume show
statistics lun show

1-18 Display the current storage QoS policies.


qos policy-group show

1-19 To see the current total IOPS, switch back to the Iometer tool.

1-20 Create a new storage QoS policy.

If the current total IOPS on the Iometer tool is higher than 1000, set the -max-throughput
parameter to 1000iops; otherwise, set the parameter to 500iops.
qos policy-group create -policy-group maxtp_policy -vserver SVM_orange
-max-throughput 1000iops

1-21 Display the current storage QoS policies.


qos policy-group show

1-22 Associate the policy to the hot volume or LUN.

You can associate a storage QoS policy to a storage virtual machine (SVM) hosting FlexVol
volumes, LUNs, volumes, or files.

1-23 Verify the current throughput from the storage.


qos statistics performance show –iterations 4

E9-8 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-24 Wait a few moments and watch the total IOPS on the Iometer decrease.

You can continue to run the previous command to observe the current storage readings. A few
minutes might elapse before the IOPS start to decrease.

1-25 When your testing is complete, stop the Iometer test by clicking the Stop button on the
toolbar.

1-26 In the PuTTY window, delete the policy.


qos policy-group delete -policy-group maxtp_policy

1-27 Close the Iometer tool.

1-28 On the landing host, launch the Iometer load generator. On the taskbar of the landing host, click
the Iometer icon and, if prompted, approve the Iometer EULA.

The Iometer window should open.

Task 2: Proactively monitor workload performance


In this task, you set up a storage QoS limit for a volume and use Iometer to generate a 50% write and 50%
read workload on the volume.
Step Action

2-1 Open a PuTTY session with the NetApp cluster.

E9-9 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

2-2 Create a new storage QoS policy.


qos policy-group create -policy-group prevent_policy -vserver
SVM_orange
-max-throughput 500iops

2-3 Display the current storage QoS policies.


qos policy-group show

2-4 You identified orange_vol1 as a new and potentially runaway workload.


Associate the storage QoS policy to the orange_vol1 volume.
You can associate a storage QoS policy to an SVM hosting FlexVol volumes, LUNs, volumes,
or files.
volume modify SVM_teal_vol1 -vserver SVM_teal -qos-policy-group
prevent_policy

2-5 Verify the current throughput from the storage.


qos statistics performance show –iterations 4

2-6 On the taskbar, click the Iometer icon.

The Iometer window should open.

E9-10 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

2-7 In the left pane, select the W2K12-01 node.

2-8 On the Disk Targets tab, specify the following:


 Targets: Y drive
 Maximum Disk Size: 20000
 Starting Disk Sector: 0
 # of Outstanding I/Os: 16

E9-11 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

2-9 Click the Access Specifications tab.

2-10 Under Global Access Specifications, select 4K; 50% Read; 0% random.

E9-12 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

2-11 Click Add.

2-12 Click the Test Setup tab.

2-13 On the Test Setup tab, specify the following:


 Test Description: QoS Exercise
 Ramp Up Time: 4 seconds
 Record Results: None

E9-13 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

2-14 Click the Results Display tab.

2-15 Change the Update Frequency to 4 seconds.


This reduces the load on the Windows computer that is required to update the Iometer interface.

E9-14 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

2-16 In the toolbar, click the Start Tests button (a green flag).

2-17 Open Windows Explorer to the Y drive and notice a new file, iobw.tst.
This is the test file that is increasing in size until it is 80,000 KB (20,000 x 4 KB).

2-18 Notice that, when the file reaches the maximum size, the ramp-up time begins to count down for
four seconds.
The ramp-up time ensures that the storage is stable before the testing begins.

E9-15 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

2-19 When the 50% read and 50% write test results begin to appear, notice that the total IOPS are
recorded on the top row of the display output.

Remember that this is a Windows computer and a cluster that operates in a shared virtualized
environment.

2-20 Leave the test running.

2-21 Notice the total IOPS.

E9-16 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

2-22 In the PuTTY window, verify the current throughput from the storage.
qos statistics performance show –iterations 4

2-23 When your testing is complete, stop the Iometer test by clicking the Stop button on the
toolbar.

2-24 In the PuTTY window, delete the policy.


qos policy-group delete -policy-group prevent_policy

2-25 Close the Iometer tool.

Task 3: Isolate a tenant workload


In this task, you set up a storage QoS tenant limit at the SVM level and use the Iometer tool to generate a 50%
write and 50% read workload on one of the tenants.
Step Action

3-1 Open a PuTTY session with the NetApp cluster.

3-2 Create three new storage QoS policies.


qos policy-group create -policy-group red_policy -vserver SVM_red
-max-throughput 500iops
qos policy-group create -policy-group orange_policy -vserver SVM_orange
-max-throughput 500iops
qos policy-group create -policy-group yellow_policy -vserver SVM_yellow
-max-throughput 500iops

3-3 Display the current storage QoS policies.


qos policy-group show

3-4 Associate each policy to the corresponding SVM.


You can associate a storage QoS policy to an SVM with FlexVol volumes, LUNs, volumes, or
files.
vserver modify -vserver SVM_red -qos-policy-group red_policy
vserver modify -vserver SVM_orange -qos-policy-group orange_policy
vserver modify -vserver SVM_yellow -qos-policy-group yellow_policy

3-5 Verify the current throughput from the storage.


qos statistics performance show –iterations 4

E9-17 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

3-6 On the taskbar, click the Iometer icon.

The Iometer window should open.

3-7 In the left pane, select the W2K12-01 node.

E9-18 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

3-8 On the Disk Targets tab, specify the following:


 Targets: Y drive
 Maximum Disk Size: 20000
 Starting Disk Sector: 0
 # of Outstanding I/Os: 16

3-9 Click the Access Specifications tab.

E9-19 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

3-10 Under Global Access Specifications, select 4K; 50% Read; 0% random.

3-11 Click Add.

3-12 Click the Test Setup tab.

E9-20 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

3-13 On the Test Setup tab, specify the following:


 Test Description: QoS Exercise
 Ramp Up Time: 4 seconds
 Record Results: None

3-14 Click the Results Display tab.

E9-21 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

3-15 Change the Update Frequency to 4 seconds.


This reduces the load on the Windows computer that is required to update the Iometer interface.

3-16 In the toolbar, click the Start Tests button (a green flag).

E9-22 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

3-17 Open File Explorer to the Y drive and notice a new file, iobw.tst.
This is the test file that is increasing in size until it is 80,000 KB (20,000 x 4 KB).

3-18 Notice that, when the file reaches the maximum size, the ramp-up time begins to count down for
four seconds.
The ramp-up time ensures that the storage is stable before the testing begins.

E9-23 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

3-19 When the 50% read and 50% write test results begin to appear, notice that the total IOPS are
recorded on the top row of the display output.

Remember that this is a Windows computer and a cluster that operates in shared virtualized
environment.

3-20 Leave the test running.

3-21 In the PuTTY window, verify the current throughput from the storage.
qos statistics performance show –iterations 4

E9-24 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

3-22 To see the current total IOPS, switch back to the Iometer tool.

3-23 When your testing is complete, stop the Iometer test by clicking the Stop button on the
toolbar.

3-24 Close the Iometer tool.

End of Exercise

E9-25 ONTAP Performance Analysis: Storage QoS

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Module 10: Network-Attached Storage Performance
Exercise 1: NAS Performance
In this exercise, you use the cluster shell to identify and resolve performance issues that relate to NAS
protocols.

Objectives
This exercise focuses on enabling you to do the following:
 Display NAS protocol performance statistics
 Resolve performance issues with NFS
 Resolve performance issues with SMB

Exercise Equipment
In this exercise, you use the elements that are highlighted (not the elements that are grayed-out).

Windows Server 2012 R2


Laptop or Domain Controller ONTAP 9.1 Cluster
Classroom svl-nau
Desktop Remote
Desktop

svl-nau-01 svl-nau-02
(node1) (node2)

Performance Manager Unified Manager CentOS 6.5

OPM UM

opm ocum linux1

Machine Host Name IP Address User Name Password


Windows Server 2012 w2k12 192.168.0.11 LEARN\Administrator Netapp123
Linux Server CentOS64 192.168.0.10 root Netapp123
ONTAP Cluster Mgmt LIF svl-nau 192.168.0.50 admin Netapp123
ONTAP Cluster Node 1 svl-nau-01 192.168.0.91 admin Netapp123
ONTAP Cluster Node 2 svl-nau-02 192.168.0.92 admin Netapp123

E10-1 ONTAP Performance Analysis: Network-Attached Storage Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Task 1: Display NAS protocol performance statistics
In this task, you use the cluster shell commands to identify performance bottlenecks in NFS and CIFS
protocol traffic.
Step Action

1-1 Open a PuTTY session with the cluster.

1-2 Using the advanced privilege level, start statistics data collection on the objects “nfsv3” and
“cifs”.
set advanced
statistics start -object nfsv3|cifs -sample-id sample_NAS

1-3 Open a PuTTY session with the Linux host.

1-4 On the Linux host, open a command-line window.

1-5 On the Linux host, create a large file in the NFS mounted directory.
cp –r /var/log /mnt/SVM_teal/logdata
cp –r /mnt/SVM_teal/logdata /mnt/SVM_teal/logdata2

1-6 On the Windows system desktop, open a PowerShell window.

1-7 Drag the sio_ntap_win32.exe shortcut from the Windows system desktop into the PowerShell
window.

1-8 Press the space bar, type the following list of parameters, and then press Enter:
0 0 32k 900m 90 60 1 Z:\900mfile

1-9 While the SIO program is running, in the ONTAP Cluster shell, display the cluster performance
statistics.
cluster statistics show

1-10 While the SIO program is running, display the storage virtual machine (SVM) performance
statistics.
statistics vserver show –vserver SVM_white

1-11 While the SIO program is running, display the LIF performance statistics.
statistics lif show –lif SVM_white_cifs_lif1

1-12 After the SIO program completes, from the cluster shell prompt, stop the collection of
performance data.
statistics stop -sample-id sample_NAS

1-13 Display the NFS read and write operation counters.


statistics show -object nfsv3 –instance SVM_teal -counter
nfsv3_read_ops|nfsv3_write_ops -sample-id sample_NAS
Record the read/write ratio of the I/O. _____________________

E10-2 ONTAP Performance Analysis: Network-Attached Storage Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-14 Display the NFS latency counters.


statistics show -object nfsv3 -counter nfsv3_latency -sample-id
sample_NAS
Record the average latency of the NFS I/O traffic. _____________________

1-15 Display the CIFS read and write operation counters.


statistics show -object cifs –instance SVM_white -counter cifs
_read_ops| cifs _write_ops -sample-id sample_NAS
Record the read/write ratio of the I/O. _____________________

1-16 Display the CIFS latency counters.


statistics show -object cifs –instance SVM_white -counter cifs_latency
-sample-id sample_NAS
Record the average latency of the CIFS I/O traffic. _____________________

1-17 Delete the performance sample files.


statistics samples show
statistics samples delete -sample-id sample_NAS

Task 2: Resolve NFS performance issues


In this task, you resolve performance issues with the cluster interconnect network by adding and enabling
network ports for use with the cluster interconnect network.

Step Action

2-1 Display the NFS configuration of a NAS SVM.


vserver nfs show –vserver SVM_teal
Record whether NFSv4 client referrals are enabled. _____________________

2-2 Change the TCP and User Datagram Protocol (UDP) maximum transfer size for a NAS SVM.
vserver nfs modify –vserver SVM_teal -tcp-max-transfer-size 65536
vserver nfs modify –vserver SVM_teal -udp-max-transfer-size 65536

E10-3 ONTAP Performance Analysis: Network-Attached Storage Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

2-3 Migrate an NFS LIF to another node in the cluster.


a. Identify the amount of traffic that is passing though the NFS LIF.
statistics lif show –lif SVM_teal_nfs_lif1
b. Determine the port that is hosting the NFS LIF.
network interface show –lif SVM_teal_nfs_lif1
c. Identify the port speed and configuration settings.
network port show -node svl-nau-01 -port e0e
d. Identify another port that has sufficient available I/O bandwidth.
statistics show-periodic -object nic_common -counter
instance_name|rx_total_bytes|tx_total_bytes|instance -iterations
1 -interval 1 -node svl-nau-02 -instance e0f
e. Move the LIF to the selected port.
migrate -vserver SVM_teal -lif SVM_teal_nfs_lif1 -destination-
node svl-nau-02 -destination-port e0f
You might need to disable NFSv4 referrals on the SVM.
vserver nfs modify –vserver SVM_teal -v4.0-referrals disabled

Task 3: Resolve CIFS performance issues


In this task, you use cluster shell commands to resolve performance issues with the CIFS file sharing
protocol.
Step Action

3-1 Display a list of SVMs that have CIFS enabled.


vserver cifs show

3-2 Display the CIFS configuration options for an SVM.


vserver cifs option show –vserver <svm_name>
Record whether client referrals are enabled. _____________________
Record whether copy offloads are enabled. _____________________

3-3 Migrate a CIFS LIF to another node in the cluster.


a. Identify the amount of traffic that is passing though the CIFS LIF.
statistics lif show –lif SVM_white_cifs_lif1
b. Identify another port that has sufficient available I/O bandwidth.
statistics show-periodic -object nic_common -counter
instance_name|rx_total_bytes|tx_total_bytes|instance -iterations
1
-interval 1 -node svl-nau-02 -instance e0f
c. Move the LIF to the selected port.
migrate -vserver SVM_white -lif SVM_white_cifs_lif1 -destination-
node svl-nau-02 -destination-port e0f
You might need to disable SMB referrals on the SVM.
vserver cifs option modify –vserver SVM_white -is-referral-enabled
false

E10-4 ONTAP Performance Analysis: Network-Attached Storage Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

3-4 Add more CIFS LIFs.


a. Identify another port that has sufficient available I/O bandwidth.
statistics show-periodic -object nic_common -counter
instance_name|rx_total_bytes|tx_total_bytes|instance -iterations
1
-interval 1 -node svl-nau-02 -instance e0f
b. Create a CIFS LIF on the selected port.
network interface create -vserver SVM_white -lif
SVM_white_cifs_lif2 -role data -data-protocol cifs -home-node
svl-nau-02 -home-port e0f …
You might need to disable SMB referrals on the SVM.
vserver cifs option modify –vserver SVM_white -is-referral-enabled
false

Task 4: Use Perfstat to resolve CIFS performance issues


In this task, you use the data that Perfstat gathers to resolve performance issues with the CIFS file sharing
protocol. In this scenario, the customer is experiencing very slow performance when accessing “image”
volumes.

Step Action

4-1 Navigate to the folder that contains the performance data files that Perfstat collected.
CourseFiles\Perfstat\371007

4-2 Examine the contents of the df.txt file.


In the following table, record the names of the “image” volumes and the amount of capacity that
is used by each volume.
Volume Name Capacity Used

E10-5 ONTAP Performance Analysis: Network-Attached Storage Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

4-3 Examine the contents of the sysstat_x_1sec.txt file.


Record the number of NFS operations that are performed per second, on average.
_____________________
Record the number of CIFS operations that are performed per second, on average.
_____________________
Record the number of iSCSI operations that are performed per second, on average.
_____________________
Record the number of FCP operations that are performed per second, on average.
_____________________
Record the amount of data that is read from and written to the network per second, on average.
_____________________
Record what this information indicates about the type of I/O that this system is performing.
____________________

4-4 Continue to examine the contents of the sysstat_x_1sec.txt file.


Record the amount of data that is read from and written to the hard disk drives per second, on
average. _____________________
Record how busy the hard disk drives (HDDs) are, on average. _____________________
Record what this information indicates about the state of the disk I/O subsystem.
_____________________

4-5 Continue to examine the contents of the sysstat_x_1sec.txt file.


Record the average CPU usage. _____________________
Record the cache hit rate. _____________________
Record what percentage of the time the system is processing consistency points.
_____________________
Record whether this information indicates a performance bottleneck. _____________________

E10-6 ONTAP Performance Analysis: Network-Attached Storage Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

4-6 Examine the contents of the stats_stop_-l_workload_volume.txt file.


In the following table, for each “image” volume, record the number of read and write operations,
the amount of data read and written, and the latencies.

Volume Read Write Other Read Write Read Write


Ops Ops Ops Data Data Latency Latency
Image_full
Image_plus
Image_thumb
Image_med

Based on the results in the table, record where the majority of the workload is present on the
“image” volumes. _____________________

4-7 Examine the contents of the stats_stop_-l_readahead.txt file.


In the following table, record the value for each counter.
Counter Value
readahead:readahead:requested
readahead:readahead:read
readahead:readahead:incore
readahead:readahead:dir_blocks

Based on the results in the table, record the type of data that is being accessed.
_____________________

4-8 To confirm your conclusions, examine the wafl_susp_-w.txt file.


Locate the section that is titled “blocks read ahead by type.”
Record the type of blocks that are read most by the readahead mechanism.
_____________________
Record how this action might affect disk I/O performance. _____________________

End of Exercise

E10-7 ONTAP Performance Analysis: Network-Attached Storage Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Module 11: Storage Area Network Performance
Exercise 1: SAN Protocol Performance
In this exercise, you use the cluster shell to identify and resolve performance issues that relate to SAN
protocols.

Objectives
This exercise focuses on enabling you to do the following:
 Display SAN protocol performance statistics
 Resolve LUN performance issues
 Resolve SAN LIF performance issues

Exercise Equipment
In this exercise, you use the elements that are highlighted (not the elements that are grayed-out).

Windows Server 2012 R2


Laptop or Domain Controller ONTAP 9.1 Cluster
Classroom svl-nau
Desktop Remote
Desktop

svl-nau-01 svl-nau-02
(node1) (node2)

Performance Manager Unified Manager CentOS 6.5

OPM UM

opm ocum linux1

Machine Host Name IP Address User Name Password


Windows Server 2012 w2k12 192.168.0.11 LEARN\Administrator Netapp123
ONTAP Cluster Mgmt LIF svl-nau 192.168.0.50 admin Netapp123
ONTAP Cluster Node 1 svl-nau-01 192.168.0.91 admin Netapp123
ONTAP Cluster Node 2 svl-nau-02 192.168.0.92 admin Netapp123

Task 1: Display SAN protocol performance statistics


In this task, you use the cluster shell to identify performance issues that are related to SAN protocol activity.
Step Action

1-1 Open a PuTTY session with the NetApp cluster.

E11-1 ONTAP Performance Analysis: Storage Area Network Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-2 Using the advanced privilege level, start statistics data collection on the objects “volume,”
“aggregate,” and “disk.”
set advanced
statistics start -object lun|iscsi -sample-id sample_SAN

1-3 On the Windows system desktop, copy the 900mfile from the CourseFiles folder to the X: drive.

1-4 On the Windows system desktop, open a PowerShell window.

1-5 Drag the sio_ntap_win32.exe shortcut from the Windows system desktop into the PowerShell
window.

1-6 Press the space bar, type the following list of parameters, and then press Enter:
0 0 32k 900m 90 X:\900mfile

1-7 While the SIO program is running, display the storage virtual machine (SVM) performance
statistics.
statistics lun show –vserver SVM_red

1-8 While the SIO program is running, display the LIF performance statistics.
statistics lif show –vserver SVM_red

1-9 Examine the FC port queue depths.


network fcp adapter show –fields queue-depth

1-10 While the SIO program is running, switch into the diagnostic privilege level and display the
LUN alignment statistics.
set diag
lun alignment show -vserver SVM_red -path /vol/red_vol1/red_lun1

1-11 After the SIO program completes, from the cluster shell prompt, stop the collection of
performance data.
statistics stop -sample-id sample_SAN

1-12 Display the LUN read and write operation counters.


statistics show -object lun -instance /vol/red_vol1/red_lun1 -counter
read_ops|read_data|write_ops|write_data -sample-id sample_SAN

1-13 Display the LUN read and write size histograms.


statistics show -object lun -instance /vol/red_vol1/red_lun1 -counter
read_io_len_hist|write_io_len_hist -sample-id sample_SAN

1-14 Display the LUN I/O latency counters.


statistics show -object lun -instance /vol/red_vol1/red_lun1 -counter
avg_read_latency|avg_write_latency -sample-id sample_SAN

1-15 Display the percentage of read and write operations that are partial blocks.
statistics show -object lun -instance /vol/red_vol1/red_lun1 -counter
read_partial_blocks|write_partial_blocks -sample-id sample_SAN

E11-2 ONTAP Performance Analysis: Storage Area Network Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute


Step Action

1-16 Display the LUN I/O alignment histograms.


statistics show -object lun -instance /vol/red_vol1/red_lun1 -counter
read_align_histo|write_align_histo -sample-id sample_SAN

1-17 Delete the performance sample files.


statistics samples show
statistics samples delete -sample-id sample_SAN

Task 2: Resolve LUN performance issues


In this task, you resolve performance issues with LUNs.

Step Action

2-1 If poor LUN performance is caused by LUN misalignment, then you must realign the LUN.
Use the NetApp Virtual Storage Console (VSC) or other tools to realign the LUN.

2-2 If poor LUN performance is caused by congestion with other workloads that are accessing the
same aggregate, then migrate the LUN to another aggregate or node in the cluster:
a. Identify the LUN that is experiencing the highest latency.
b. Identify the volume in which the LUN resides.
c. Identify the aggregate in which the volume resides.
d. Determine the other volumes that reside in the same aggregate.
e. Decide whether to move this LUN or volume to another aggregate or whether to move a
different volume.
f. Identify another aggregate that has sufficient free space and I/O bandwidth headroom to
accommodate the volume that you want to move.
g. Move the volume to the selected aggregate.

The asymmetric logical unit access (ALUA) should redirect the client hosts to use a direct
path to the node that now contains the LUN.

Task 3: Resolve SAN performance issues


In this task, you explore ways to resolve performance issues with the ports that are connected to a SAN.

Step Action

3-1 Migrate a SAN LIF to another node in the cluster.


If the problem is occurring because the node controller is overwhelmed, and you cannot move
the LUN to another aggregate, then you can assign the SAN LIF on the saturated node to a port
on different node in the cluster. This action slightly reduces the burden on the saturated node,
because the processing of the SAN protocol is offloaded onto a different node.

3-2 Add a SAN LIF.


If the problem is caused by a saturated port on the cluster, then you can create a SAN LIF on the
SVM and assign the SAN LIF to a different port on the same node.

End of Exercise

E11-3 ONTAP Performance Analysis: Storage Area Network Performance

© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

NetApp University - Do Not Distribute

Vous aimerez peut-être aussi