STRSW Ilt Perfcdot Rev01 Exerciseguide

ONTAP Performance Analysis
Exercise Guide
Content Version 1
NETAPP UNIVERSITY
ONTAP Performance Analysis
Exercise Guide
Course ID: STRSW-ILT-PERFCDOT-REV01
Catalog Number: STRSW-ILT-PERFCDOT-REV01-EG
NetApp University - Do Not Distribute

ATTENTION
The information contained in this course is intended only for training. This course contains information and activities that,
while beneficial for the purposes of training in a closed, non-production environment, can result in downtime or other severe
consequences in a production environment. This course material is not a technical reference and should not, under any
circumstances, be used in production environments. To obtain reference materials, refer to the NetApp product
documentation that is located at http://now.netapp.com/.
COPYRIGHT
© 2017 NetApp, Inc. All rights reserved. Printed in the U.S.A. Specifications subject to change without notice.
No part of this document covered by copyright may be reproduced in any form or by any means—graphic, electronic, or
mechanical, including photocopying, recording, taping, or storage in an electronic retrieval system—without prior written
permission of NetApp, Inc.
U.S. GOVERNMENT RIGHTS

Commercial Computer Software. Government users are subject to the NetApp, Inc. standard license agreement and
applicable provisions of the FAR and its supplements.
TRADEMARK INFORMATION
NETAPP, the NETAPP logo, and the marks listed at http://www.netapp.com/TM are trademarks of NetApp, Inc. Other
company and product names may be trademarks of their respective owners.
E-2 ONTAP Performance Analysis: Welcome
© 2017 NetApp, Inc. This material is intended only for training. Reproduction is not authorized.

TABLE OF CONTENTS
WELCOME..................................................................................................................................................... E-1
MODULE 1: HOW A NETAPP STORAGE SYSTEM WORKS................................................................... E1-1
MODULE 2: PERFORMANCE ANALYSIS TOOLS ................................................................................... E2-1
MODULE 3: ONCOMMAND PERFORMANCE MANAGER ....................................................................... E3-1
MODULE 4: CPU AND MEMORY PERFORMANCE ................................................................................. E4-1
MODULE 5: WAFL PERFORMANCE MONITORING AND ANALYSIS .................................................... E5-1
MODULE 6: DISK I/O PERFORMANCE ..................................................................................................... E6-1
MODULE 7: FLASH CACHE AND FLASH POOL PERFORMANCE ........................................................ E7-1
MODULE 8: CLUSTER INTERCONNECT PERFORMANCE .................................................................... E8-1
MODULE 9: STORAGE QOS...................................................................................................................... E9-1
MODULE 10: NETWORK-ATTACHED STORAGE PERFORMANCE .................................................... E10-1
MODULE 11: STORAGE AREA NETWORK PERFORMANCE .............................................................. E11-1

Getting Started
Study Aid Icons
In your exercises, you might see one or more of the following icons.
Warning
If you misconfigure a step that is marked with this icon, later steps might not work
properly. Check the step carefully before you move forward.
Attention
Review this step or comment carefully to save time, learn a best practice, or avoid errors.
Information
A comment that is labeled with this icon provides information about the topic or
procedure.
References
A comment that is labeled with this icon identifies reference material that provides more
information.
Exercise 0: Checking the Lab Equipment

In this exercise, you familiarize yourself with your equipment, synchronize system time, assign a Network
Time Protocol (NTP) time server, and ensure that licenses are installed.
Objectives
This exercise focuses on enabling you to do the following:
 Ensure connectivity to an ONTAP cluster
 Synchronize system time between the ONTAP clusters and the Windows domain controller
 Assign an NTP server IP address for the ONTAP clusters
 Ensure that required licenses are installed on the ONTAP clusters
Exercise Equipment
Your lab contains the following virtual machines:
 One Windows Server 2012 R2 system
 One CentOS Linux 6.5 Server system
 One ONTAP 9.1 2-node cluster (svl-nau)
 One OnCommand Unified Manager system
 One OnCommand Performance Manager system
When you use the connection information that your instructor assigned to you, you are first connected
through a Remote Desktop Connection to a Windows Server 2012 system. From this Windows desktop, you
connect to the other servers in your exercise environment.

Windows Server 2012 R2
Laptop or Domain Controller ONTAP 9.1 Cluster
Classroom svl-nau
Desktop Remote
Desktop
svl-nau-01 svl-nau-02
(node1) (node2)
Performance Manager Unified Manager CentOS 6.5
OPM UM
opm ocum linux1
Machine Host Name IP Address User Name Password
Windows Server 2012 w2k12 192.168.0.11 LEARN\Administrator Netapp123

Linux CentOS Server CentOS64 192.168.0.10 root Netapp123
OnCommand Unified
ocum 192.168.0.15 admin Netapp123
Manager
OnCommand
opm 192.168.0.16 admin Netapp123
Performance Manager
ONTAP Cluster Mgmt
svl-nau 192.168.0.50 admin Netapp123
LIF
ONTAP Cluster Node 1 svl-nau-01 192.168.0.91 admin Netapp123
SMB Server SVM_white 192.168.0.140 vsadmin Netapp123
NFS Server SVM_teal 192.168.0.141 vsadmin Netapp123
SVM_red 192.168.0.51 vsadmin Netapp123
iSCSI Servers SVM_orange 192.168.0.52 vsadmin Netapp123
SVM_yellow 192.168.0.53 vsadmin Netapp123

Task 1: Ensure connectivity to your ONTAP Cluster
In this task, you familiarize yourself with the Windows Server 2012 desktop. You ensure connectivity to the
ONTAP cluster and verify the health of the ONTAP cluster.
Step Action
1-1 Ensure that you see the Start page of your assigned Windows Server 2012 system.
1-2 On the Windows Server 2012 Start page, scroll down and click the Desktop tile.
1-3 Verify that you see the desktop and that it contains the shortcut to the PuTTY program.
1-4 To connect to the ONTAP cluster web UI, you browse to the System Manager URL,
which is in ONTAP 9.1.
To connect to the CLI of the ONTAP cluster, you use PuTTY. PuTTY is a UI for the
Telnet and Secure Shell (SSH) protocols.

Step Action
1-5 Double-click the PuTTY shortcut.
1-6 In the PuTTY Configuration dialog box, verify that the svl-nau-mgmt saved session is listed,
and then double-click svl-nau-mgmt.
1-7 1. You can also connect to the ONTAP cluster CLI by connecting to any node in
the cluster: svl-nau-01 (node 1) or svl-nau-02 (node 2).

Step Action
1-8 In the PuTTY Security Alert message, click Yes.

You receive an alert message the first time that you connect.
1-9 At the ONTAP cluster login prompt, provide the svl-nau credentials:
 Log in as: admin
 Password: Netapp123
The ONTAP cluster CLI prompt and cursor appear.
1-10 If you have difficulty logging in to the ONTAP cluster CLI, refer to the following
table. Ensure that you are using the correct user name and password in the correct
case (both are case-sensitive).
Host
System Name IP Address User Name Password
ONTAP cluster admin (case
svl-nau 192.168.0.50 Netapp123
management LIF sensitive)

Step Action
1-11 Verify that both nodes of the ONTAP cluster are healthy and eligible:
cluster show
1-12 If the health or eligibility of either node is listed as false, alert your instructor.

Task 2: Verify that required license codes are installed
Many of the advanced features of ONTAP software require licenses. In later exercises, you use several
licensed features of ONTAP software. In this task, you ensure that the necessary licenses were preinstalled.
Your instructor will provide you with license codes for any software that you need to install.
Step Action
2-1 In the svl-nau-mgmt CLI, enter the following command:

license show
2-2 Verify that the required license codes (that is, codes for the licenses that enable the features of
ONTAP that are required for later exercises) are installed.
2-3 If any license is not installed, contact NetApp University Support.

Task 3: Set the ONTAP command line system timeout value (optional)
In this optional task, you disable the auto-logout feature by setting the ONTAP command-line system timeout
value to zero.
Step Action
3-1 Open a PuTTY session with the NetApp cluster.
3-2 Check the current system timeout value.

system timeout show
3-3 Set the system timeout to 0 (no timeout).

system timeout modify -timeout 0
End of Exercise

Module 1: How a NetApp Storage System Works
Exercise 1: Identifying Cluster Components
Before you can troubleshoot a performance issue, you must understand the working environment of the
system. In this exercise, you log into a NetApp cluster and identify the components and configuration of the
cluster.
Objectives
 Log in to the exercise environment
 Identify the components of ONTAP software
 Set the ONTAP command-line system timeout value (optional)
Exercise Equipment
In this exercise, you use the elements that are highlighted (not the elements that are grayed-out).

Domain Controller ONTAP 9.1 Cluster
Laptop or
Classroom svl-nau
Desktop Remote
Desktop
(node1) (node2)
OPM UM
opm ocum linux1

ONTAP Cluster Mgmt LIF svl-nau 192.168.0.50 admin Netapp123
E1-1 ONTAP Performance Analysis: How NetApp Storage Works

Task 1: Access OnCommand System Manager
In this task, you confirm access to the cluster management port and launch System Manager.
NOTE: For more information about how to configure a storage system with System Manager, see the ONTAP
9 Administration course.
Step Action
1-1 On the Windows Server desktop, launch Windows Internet Explorer.
1-2 Connect to the System Manager through the cluster management interface.
https://192.168.0.50

Step Action
1-3 Log in to the cluster OnCommand System Manager interface.

User name = admin
Password = Netapp123
1-4 View the Cluster Details page for the cluster.

Task 2: Identify ONTAP components
In this task, you use either System Manager or the ONTAP CLI to identify key ONTAP components.
Step Action
2-1 Identify the feature licenses that are installed on the cluster by selecting the appropriate
checkboxes.
NFS
CIFS
iSCSI
SnapRestore
SnapMirror
SnapVault
FCP
FlexClone
SnapLock
SnapManagerSuite
2-2 Record the number of aggregates that are present in the cluster. ________________________
2-3 Enter information about each aggregate in the following table.

Aggregate Owning Disk Disk Disk Is a Number RAID
Name Node Count Type RPM Flash of Group
Pool RAID Size
Present? Groups
2-4 Record the number of storage virtual machines (SVMs) that are present in the cluster.
________________________

Step Action
2-5 Enter information about each SVM in the following table.

SVM Volume Allowed Protocols
Name Type
2-6 Record the number of volumes that are present in the cluster. ________________________
2-7 Record the number of volumes in the cluster, if any, that are over 80% full.
________________________
2-8 Record the number of LUNs that are present in the cluster. ________________________
2-9 Enter information about each LUN in the following table.

LUN SVM Operating Initiator Initiators LUN
Name Name System Group ID
Type
2-10 Record the number of SMB shares that are present in the cluster. ________________________
2-11 Enter information about each SMB share in the following table.
Share SVM Share Share Access Share ACL
Name Name Path Control List (ACL) Permission
User and Group

Step Action
2-12 Record the number of Ethernet ports that are present in the cluster.
________________________
2-13 Record the number of 10-Gigabit Ethernet (GbE) ports that are present in the cluster.
________________________
2-14 Record the number of FC ports that are present in the cluster. ________________________
2-15 Record the number of 8-Gb FC ports that are present in the cluster.
________________________
2-16 Record the number of LIFs that are present in the cluster. ________________________
2-17 Record the number of LIFs that are present in the cluster that have the “data” role.
________________________
2-18 Record the number of LIFs that are present in the cluster that have the data protocol of iSCSI.
________________________
2-19 Record the number of LIFs that are present in the cluster that have the data protocol of CIFS?
________________________
2-20 Record any LIFs that are not present on their home port. ________________________
End of Exercise

Module 2: Performance Analysis Tools
Exercise 1: Analyzing Performance Statistics
In this exercise, you examine the different variations of the statistics command. You also identify
storage system workloads and potential bottlenecks.
Objectives
 Perform initial health checks on a cluster
 Perform baseline performance monitoring and analysis from the cluster shell
 Use the statistics catalog command
 Use the statistics start and statistics show commands
 Define workload characteristics
Exercise Equipment

Laptop or
Classroom svl-nau
Desktop Remote
Desktop
(node1) (node2)
OPM UM
opm ocum linux1

E2-1 ONTAP Performance Analysis: Performance Tools

Task 1: Perform initial health checks on the cluster
In this task, you query the cluster to assess the initial health status.
Step Action
1-2 Determine whether all nodes are healthy.

cluster show
1-3 Determine whether the replication rings have the same masters or consistent masters.
Before you use this command, remember to set the advanced privilege level.
set advanced
cluster ring show
1-4 Using the advanced privilege level, determine whether the cluster connectivity is healthy. If the
cluster is a single-node cluster, this command fails.
cluster ping-cluster -node local
1-5 Determine the health of the cluster.

system health status show
1-6 Determine the health of each cluster subsystem.

system health subsystem show
1-7 Determine whether any health alerts were issued for the cluster.
system health alert show
1-8 Display a list of health alerts and their descriptions.

system health alert definition show
1-9 Determine whether any volumes are not online.

volume show -state !online
1-10 Determine whether any aggregates are not online.

storage aggregate show -state !online

Step Action
1-11 Determine whether any disks are in a broken state.

storage disk show -state broken
Task 2: Perform baseline performance monitoring from the cluster shell

In this task, you use cluster shell commands to monitor cluster performance.
Step Action
2-1 Display an overview of the performance of the entire cluster.

statistics show-periodic –iterations 4
2-2 Determine the performance status of each node in the cluster.

statistics show-periodic –object node –instance svl-nau-01
-iterations 4
statistics show-periodic –object node –instance svl-nau-02

-iterations 4
2-3 Display performance statistics for all aggregates in the cluster.

statistics aggregate show
2-4 Display performance statistics for all flash pool aggregates in the cluster.
statistics cache flash-pool show
2-5 Display performance statistics for all disks in the cluster.

statistics disk show
2-6 Display performance statistics for all LIFs in the cluster.

statistics lif show
2-7 Display performance statistics for all LUNs in the cluster.

statistics lun show
2-8 Display performance statistics for all nodes in the cluster.

statistics node show
2-9 Display performance statistics for all volumes in the cluster.

statistics volume show

Step Action
2-10 Display performance statistics for all storage virtual machines (SVMs) in the cluster.
statistics vserver show
2-11 Display performance statistics for all workloads in the cluster.

statistics workload show
2-12 Check the overall latency for each quality-of-service (QoS) policy group in the cluster.
qos statistics latency show –iterations 3
2-13 Check the overall latency for each QoS workload in the cluster.
qos statistics workload latency show –iterations 3
Task 3: Examine the statistics catalog command

In this task, you issue the three statistics catalog commands and exercise the supporting parameters,
using different privilege levels.
Step Action
3-1 Examine the statistics objects that are available at the admin privilege level.
statistics catalog object show
3-2 Change to the advanced privilege level, and reissue the command.
3-3 Record the command syntax that displays only the advanced level statistics objects without their
descriptions. ___________________________
3-4 Record the command syntax that displays statistics objects that are associated with SVMs.
____________________________________
You should still be in the advanced privilege level.
Record the number of statistics objects that appear in the list. ________________
3-5 Change to the admin privilege level, and examine the instance names that are available for the
statistics object “volume.”
statistics catalog instance show -object volume
3-6 Record the command syntax that displays the instance names that are available for the statistics
object that represents SVMs that have LIFs associated with them.
____________________________________
Record the number of instance names that appear in the list. _____________

Step Action
object that represents SVMs that have volumes associated with them.
____________________________________
Record the number of instance names that appear in the list. _____________
object that represents the disk that is associated with the second node in the cluster (svl-nau-02).
____________________________________
3-9 Examine the counters that are available for the statistics object “disk,” and show the detailed
information.
statistics catalog counter show -object disk –describe
3-10 Change command privilege level to advanced privilege level, and rerun the command.
Remember to reset the privilege level to admin when you are done.
3-11 Examine the counters that are available for the statistics object “aggregate,” and show the
detailed information.
statistics catalog counter show -object aggregate -describe
3-12 Examine the counters that are available for the statistics object “volume,” and show the detailed
information.
statistics catalog counter show -object volume -describe
Task 4: Examine the statistics start and statistics show

commands
In this task, you issue the statistics start and statistics show commands and exercise the
supporting parameters, using different privilege levels.
Step Action
4-1 Using the admin privilege level, attempt to start statistics data collection on the statistics object
“nfsv3.”
statistics start -object nfsv3 -sample-id sample_nfsv3_adm
4-2 Using the advanced privilege level, start statistics data collection on the statistics object “nfsv3.”
set advanced
statistics start -object nfsv3 -sample-id sample_nfsv3_adv
4-3 Display the counters that are associated with the statistics object “nfsv3” instance “SVM_teal”
for both samples.
statistics show -object nfsv3 -instance SVM_teal -counter * -sample-id
sample_nfsv3_adm
statistics show -object nfsv3 -instance SVM_teal -counter * -sample-id
sample_nfsv3_adv
Record any difference in the displays that you observe. __________________

Step Action
4-4 Using the advanced privilege level, start statistics data collection on the statistics object “disk.”
statistics start -object disk -sample-id sample_disk
4-5 Record the command syntax that displays all latency related counters for disk v4.20.
___________________________
Record the number of counters that appear in the list. _______________
4-6 Record the command syntax that displays all user_read_latency counters for all disks in the
cluster. ___________________________
Record the number of counters that appear in the list. _______________
4-7 Display only the disks with the user_read_latency counter between 1000us and 10000us.
statistics show -object disk -instance * -counter user_read_latency
-sample-id sample_disk -value 1000..10000
4-8 Record the command syntax that displays all user_read_latency counters that are greater than
10000us for all disks in the cluster, and limit the display to only the counter and value fields.
4-9 Using the advanced privilege level, start statistics data collection on the statistics object
“aggregate.”
statistics start -object aggregate -sample-id sample_aggr
4-10 Display nonzero user_reads and user_writes counters for each aggregate.
statistics show -object aggregate -instance * -counter
user_reads|user_writes -sample-id sample_aggr -value >0
“volume.”
statistics start -object volume -sample-id sample_volume
4-12 Display all volume activity by displaying all data counters for each volume.
statistics show -object volume -instance * -counter *data -sample-id
sample_volume
4-13 Record the command syntax that displays all latency counters for all volumes in the cluster.
__________________________
Now, record the command syntax that displays all nonlatency counters for all volumes in the
cluster. __________________________
“processor.”
statistics start -object processor -sample-id sample_processor

Step Action
4-15 Record the command syntax that displays all counters for all processors in the cluster.
__________________________
“workload.”
statistics start -object workload -sample-id sample_workload
4-17 Display all latency counters for each workload, and limit the display to the counter and value
fields.
statistics show -object workload -instance * -counter *latency -sample-
id sample_workload -fields counter,value
4-18 Stop all statistics data collection.

statistics stop -sample-id sample_nfsv3_adm
statistics stop -sample-id sample_nfsv3_adv
statistics stop -sample-id sample_disk
statistics stop -sample-id sample_aggr
statistics stop -sample-id sample_volume
statistics stop -sample-id sample_processor
statistics stop -sample-id sample_workload
4-19 Delete old sample files.

statistics samples show
statistics samples delete -sample-id *
Task 5: Define workload characteristics

In this task, you gather statistical data and evaluate the data to determine the workload characteristics.
Step Action
5-1 Turn off the Snapshot policy for volume SVM_teal_vol1.

volume modify -vserver SVM_teal -volume SVM_teal_vol1 -snapshot-policy
none
5-2 On your Windows Server desktop, double-click the CourseFiles shortcut.
5-3 Copy the file 300mfile to the Z: drive.

Step Action
5-4 Normally you would start data collection for all protocols that are served by the cluster, but for
this exercise you use only the SMB protocol.
Using the diagnostic privilege level, start statistics data collection on the objects “cifs,”
“volume,” and “readahead.”
Diagnostic privilege level commands are required to capture the cifs_read_size_histo,
cifs_write_size_histo, rand_read_req, and seq_read_req counters.
set diagnostic
statistics start -object cifs|volume|readahead -sample-id sample_cifs1
5-5 Display the QoS workload characterization for the volume in real time.
qos statistics volume characteristics show vserver volume
5-6 On the Windows Server desktop, double-click the Command Prompt.
5-7 Drag the SIO.exe shortcut to the Command Prompt window.
5-8 Press the space bar, type the following list of parameters, and then press Enter.
0 0 32k 300m 30 1 Z:\300mfile

Step Action
5-9 After the command ends and the command prompt returns, go to the PuTTY session and
terminate the QoS statistics command, by using ‘^C’.
Record the volume workload characteristics:
IOPS: _______________
Throughput: ______________
Request Size: ______________
Read %: __________
Concurrency: ____________
5-10 Turn off the performance data collection.

statistics stop -sample-id sample_cifs1
5-11 Start statistics data collection again on the same objects, but this time change the sample name
to “sample_cifs2.”
5-13 In the Command Prompt window, press the Up arrow, change the SIO parameters to the
following, and then press Enter:
100 0 32k 300m 30 4 Z:\300mfile
terminate the QoS statistics command using ‘^C’.
IOPS: _______________
Throughput: ______________
Request Size: ______________
Read %: __________

5-16 Start statistics data collection again on the same objects, but this time change the sample name
to sample_cifs3.

Step Action
5-18 In the Command Prompt window, press the Up arrow, change the SIO parameters to the
following parameters, and then press Enter:
70 60 4k 300m 30 32 Z:\300mfile
terminate the QoS statistics command, by using ‘^C’.
IOPS: _______________
Throughput: ______________
Request Size: ______________
Read %: __________

5-21 The QoS workload characterization commands provide useful information, but they don’t
provide the same level of detail that you obtain by using the statistics show command.
The following commands help to define the CIFS workload characteristics in each sample.
Use the commands to complete the table in 5-22.
statistics show -object cifs -instance * -counter cifs_read_ops

-sample-id sample_cifs3
statistics show -object cifs -instance * -counter cifs_write_ops
statistics show -object cifs -instance * -counter cifs_read_size_histo

statistics show -object cifs -instance * -counter cifs_write_size_histo
statistics show -object volume -instance white_vol1 -counter

cifs_read_latency -sample-id sample_cifs3
statistics show -object volume -instance white_vol1 -counter

cifs_write_latency -sample-id sample_cifs3
statistics show -object readahead -instance * -counter rand_read_reqs

statistics show -object readahead -instance * -counter seq_read_reqs

Step Action
5-22 Record your observations for sample_cifs3 in the table that follows.
CHARACTERISTIC OBSERVED VALUE
Read Write
Throughput
Latency
Operation Size
Concurrency
Randomness
5-23 Calculate the concurrency for the workload (throughput * latency), and enter the result into the
table in 5-22.
5-24 Record the type of workload characteristics that you observed in your analysis of sample_cifs3.
__________________________
End of Exercise

Module 3: OnCommand Performance Manager
Exercise 1: OnCommand Performance Manager Thresholds, Events, and
Alerts
In this exercise, you explore the OnCommand Performance Manager user interface, including events and
alerts. This exercise is based on a customized version of Performance Manager with a database of real-world
performance data that gives you a realistic view of the tool. The data was collected between March and
September 2015. Ongoing data collection is disabled.
Objectives
 Log in to Performance Manager and explore the Performance Manager UI
 Explore the Performance Manager dashboard
Exercise Equipment

Classroom svl-nau
Desktop Remote
Desktop
(node1) (node2)
OPM UM
opm ocum linux1

OnCommand Unified
Manager
OnCommand Performance
Manager
E3-1 ONTAP Performance Analysis: OnCommand Performance Manager

Task 1: Log in to Performance Manager and explore the Performance
Manager UI
Step Action
1-1 Open a browser window and enter the IP address of Performance Manager:
https://192.168.0.16
1-2 In the “There is a problem” dialog box, click Continue to this website.
1-3 Sign in to Performance Manager as the administrator (the user name and password are case-
sensitive):
 User name: admin
 Password: Netapp123
1-4 Performance Manager is fully configured in your exercise kit, so the Performance Manager
Dashboard appears:
1-5 Because this customized version of Performance Manager does not collect live data,
a red warning appears below the cluster name.

Step Action
1-6 On the Dashboard page, position your cursor over the heading for each pane. Review the
definitions of each performance-measurement counter.
1-7 Record the reason that the squares in the performance panes are green.
_____________________
1-8 In the Utilization panel of the cluster dashboard, click the Nodes checkbox.
1-9 Record the number of nodes that appear in cluster opm-capacity. ______________
1-10 To sort by percentage of utilization, lowest to highest, click the Utilization column header.
Record the node that is the most heavily utilized. __________________
1-11 Use the browser back button to return to the Performance Dashboard.

Step Action
1-12 In the Latency panel of the cluster dashboard, click the SVMs checkbox.
1-13 Record the number of storage virtual machines (SVMs) that appear on the cluster opm-capacity.
_________________
1-14 To sort the SVMs by IOPS, in descending order, click the IOPS column header twice.
Record the name of the busiest SVM. _______________
1-16 In the Utilization panel of the cluster dashboard, click the Aggregates checkbox.

Step Action
1-17 To sort by utilization, lowest to highest, click the Utilization column header. To toggle the sort
order from highest to lowest, click the Utilization column header again. Record the name of the
aggregate that is the most heavily utilized. _________________
1-18 To sort the aggregates in descending order, based on response time, click the Latency column
header twice. Record the name of the aggregate that is the slowest. _________________
1-20 Click View Cluster Details.

Step Action
1-21 Observe that the Cluster Details page appears.
1-22 Verify that you see the number of cluster resources that are being monitored for performance.
1-23 To display the top 10 charts, click the Top Performers tab.
1-24 Scroll down to the Top 10 Volumes chart, and change the metric to IOPS.
1-25 Record the three volumes that experienced the highest IOPS over the previous 72 hours.
_________________________

Task 2: Explore the Performance Manager dashboard
Step Action
2-1. Click Storage and select Nodes:
2-2. In the Nodes page, click the link for node opm-capacity-01.
2-3. Use the Time Range menu to specify a Custom Range of time. Set the starting time to 12:00
am on August 1 and the ending time to 8:00 am on August 2.

Step Action
2-4. Scroll down to observe the Latency, IOPS and MBps (megabytes per second) charts.
Record the approximate average latency during the time period. _______________
Record the approximate average number of IOPS during the time period. ________________
Record the approximate average throughput during the time period. __________________
2-5. In the View and Compare panel, click the Add button for node opm-capacity-02.

Step Action
2-6. Observe that the performance data for node opm-capacity-02 was added to the charts for
comparison purposes.
2-7. In the Comparing panel, click the X symbol to remove the data for node opm-capacity-02 from
the charts.

Step Action
2-8. In the Explorer page for node opm-capacity-01, position your cursor over the last critical-level
event. Click the View Event Detail button.
Record the event name and number. ______________
Record the time when the event was triggered. ______________
Record the performance policy that was triggered. ______________
Record the Performance Capacity Used value that triggered the event. ______________

Step Action
2-9. Return to the Node Explorer page. Use the Choose Charts menu to display the charts for only
Events, Perf. Capacity Used, and Perf. Capacity Used (Advanced).
2-10. In the Performance Capacity Used chart, position your cursor over the spike in the chart that
aligns with the last critical event.
Record the value of the optimal Performance Capacity Used counter at that point. ___________

Step Action
2-11. In the Perf. Capacity Used (Advanced) chart, position your cursor over the spike in the chart that
aligns with the last critical event.
Record the percentage of the optimal Performance Capacity Used that is consumed by user
protocols. _______________
Record the percentage of the optimal Performance Capacity Used that is consumed by
background processing. _______________
End of Exercise

Module 4: CPU and Memory Performance
Exercise 1: Identifying and resolving storage controller performance
issues
In this exercise, you identify and resolve performance issues with the CPU and memory subsystems of a
NetApp storage system.
Objectives
 Display the performance characteristics of a cluster node
 Identify performance issues that are caused by a lack of controller resources
 Resolve performance issues that are caused by a lack of controller resources
Exercise Equipment

Laptop or
Classroom svl-nau
Desktop Remote
Desktop
(node1) (node2)
OPM UM
opm ocum linux1


OnCommand Unified
Manager
Manager
E4-1 ONTAP Performance Analysis: CPU and Memory Performance

Task 1: Display node resource performance
In this task, you use the cluster shell to display and analyze the processing and memory performance
characteristics of a cluster node.
Step Action
1-2 Using the advanced privilege level, start statistics data collection on the objects “volume,”
“aggregate,” and “disk”.
set diag
statistics start -object system:node|bufcache -sample-id sample_CPUMem
1-3 On the Windows system desktop, open a PowerShell window.
1-4 Drag the sio_ntap_win32.exe shortcut from the Windows system desktop into the PowerShell
window.
1-5 Press the space bar, type the following list of parameters, and then press Enter:
0 0 32k 900m 90 1 Y:\900mfile
1-6 While the SIO program is running, display the clusterwide performance statistics.
cluster statistics show
Record the value for the CPU Busy field. _______________
1-7 While the SIO program is running, display the cluster node performance statistics.
statistics node show
Record the value for the CPU % field on each node. _______________
1-8 While the SIO program is running, display the performance statistics for quality-of-service
(QoS) workloads.
qos statistics workload performance show -node svl-nau-01
1-9 While the SIO program is running, display the CPU performance statistics for QoS workloads
qos statistics workload resource cpu show -node svl-nau-01
1-10 While the SIO program is running, display the latency statistics for QoS workloads
qos statistics workload latency show -node svl-nau-01
Record whether any latency is attributed to the Network or Data subsystems. _______________
1-11 Display CPU utilization for each CPU in the cluster node.
node run local sysstat –m –c 5
1-12 Determine whether any consistency points are being triggered by a lack of memory.
node run local sysstat –u –c 5
Record any consistency points that are triggered by a lack of Write Anywhere File Layout
(WAFL) buffers (CP Type=H or L). _______________
Record any consistency points that are triggered by a lack of Network buffers (CP Type=M).
_______________

Step Action
1-13 After the SIO program completes, from the cluster shell prompt, stop the collection of
performance data.
statistics stop -sample-id sample_CPUMem
1-14 Display the CPU busy counters.

statistics show -counter avg_processor_busy|total_processor_busy
-sample-id sample_CPUMem
1-15 Display WAFL cache activity.

statistics show –object bufcache -sample-id sample_CPUMem
Record the number of buffers that are allocated. _______________
Record the amount of memory that is consumed by the WAFL cache. _______________
Record the number of buffers that are empty. _______________
Record the number of buffers that were stolen by other subsystems. _______________
Record whether the cluster memory is constrained. _______________
1-16 Delete any old sample files.

statistics samples delete -sample-id sample_CPUMem
Task 2: Use Performance Manager to analyze a performance issue

In this task, you use OnCommand Performance Manager to identify the root cause of a performance issue.
Consider a scenario: You receive an alert email from Performance Manager that indicates that the
performance capacity of a node has been exceeded. The email message includes the event number “8208.”
Starting with the event, investigate and identify the cause of the performance event.
Step Action
2-1 Log in to OnCommand Performance Manager.
2-2 To navigate to the Events page, click the Events tab.

Step Action
2-3 Enter the event number that was provided by the alert email message into the search text box,
and click Search.
2-4 Confirm that the event is found. Record the event severity level. _______________
2-5 To identify the event type, position your cursor over the description field.
2-6 To navigate to the Event Details page, click the event name.
Record the full name of the event. _______________

Record the node that is affected. _______________
Record when the event was triggered. _______________
Record the threshold policy that was violated (if any). _______________
2-7 Click the link for the affected node.
2-8 In the Explorer tab of the Node Details page, use the Time Range menu to set the time range of
the charts to six hours before the event and three hours after the event. Click Apply Range.

Step Action
2-9 In the Events chart, position your cursor over the dot for the event that occurred at the time of
the event that you saw in the Events page. Record whether this event matches the event that you
identified in the Events page. _______________
2-10 In the Explorer tab of the Node Details page, scroll down to the Performance Capacity Used
Chart. Record whether the spike in capacity used corresponds to the time of the event.
_______________
2-11 Position your cursor over the spike in the Performance Capacity Used chart. Record the value of
the counter at the top of the spike. _______________ Record whether this value matches the
value that was reported in the Event Details page. _______________
2-12 To limit the display to the Performance Capacity Used charts, select the Choose charts menu.

Step Action
2-13 In the View and Compare menu, select Aggregates on this Node.
2-14 In the View and Compare panel, click the Performance Capacity Used column header twice, to
sort the aggregates in descending order based upon the percentage of their optimal performance
capacity used.
Record the aggregates that are using the most performance capacity. _______________
2-15 In the View and Compare panel, click the Add button next to the two aggregates that are
consuming the highest percentage of their optimal performance capacity. The metrics for these
aggregates are added to the performance charts for the node.
2-16 Based on the information displayed in the Performance Capacity Used chart, record the
aggregate that is contributing most to the used performance capacity of the node during the
event. _______________
If you find it difficult to discern the busiest aggregate, you can perform the following actions:
Use the Zoom View button to display the chart in a larger format, position your cursor over the
spike in the chart to display the metric values in text form, or drag your pointer over a section of
the chart to zoom in to a closer view.
2-17 Click the link for the aggregate with the highest performance capacity used during the event
time frame. The Aggregate Details page opens.

Step Action
2-18 In the Explorer tab of the Aggregate Details page, use the Time Range menu to set the time
range of the charts to six hours before the event and three hours after the event. Click Apply
Range.
2-19 In the Explorer tab of the Aggregate Details page, scroll down to the Performance Capacity
Used Chart. Record whether the spike in capacity used corresponds to the time of the event.
_______________
2-20 Now, scroll up to the Throughput (MBps, or megabytes per second) chart. Record whether the
spike in the Throughput chart corresponds to the time of the event. _______________
2-21 In the View and Compare panel, use the menu to select Volumes on this Aggregate.
2-22 In the View and Compare panel, click the Add button to add the metrics for the volume to the
charts.

Step Action
2-23 Record whether the activity on the volume correlates with the activity on the aggregate.
_______________
Record your explanation of how the volume can show more activity than the aggregate shows.
_______________
If you view the Volume Details page, you see that the cache miss ratio is zero for this volume
during this timeframe. Therefore, this volume is not the volume that generates I/O on the
aggregate.
Task 3: Use Perfomance Manager to resolve a performance issue

In this task, you use OnCommand Performance Manager to identify the best corrective action to take to
resolve a performance issue. You identified the volume that is stressing the node. Now you can resolve the
issue. One solution is to migrate the volume to an aggregate on a different node in the cluster. To do so, you
first must identify another node that is less busy. Then, you choose an aggregate on that node that can manage
the additional load that the volume poses.
Step Action
3-1 In the OnCommand Performance Manager user interface, use the Storage menu to navigate to
the Volumes page.
3-2 Record the total capacity of the busiest volume in the aggregate that you identified in Task 2.
_______________
Record the average throughput of the volume over the previous 72 hours. _______________
Record the average IOPS of the volume over the previous 72 hours. _______________
3-3 Use the Storage menu to navigate to the Nodes page.
3-4 Use the search text box to limit the display to only nodes that are members of the opm-capacity
cluster.
3-5 To sort the nodes by the percentage of optimal performance capacity used, in ascending order,
click the Performance Capacity Used column header.
Observe that node opm-capacity-01 is the node with the most available capacity over the
previous 72 hours. However, node opm-capacity-01 is also the node with the highest throughput
during that time. You might be able to improve performance by migrating the volume to node
opm-capacity-02.

Step Action
3-6 Use the Storage menu to navigate to the Aggregates page.
3-7 Use the search text box to limit the display to only aggregates that are owned by node opm-
capacity-02.
3-8 To sort the aggregates by the percentage of optimal performance capacity used, in ascending
order, click the Performance Capacity Used column header.
Record the aggregate that has the most available performance capacity. _______________
Record the reason that you might or might not want to migrate the volume to this aggregate.
_______________
Record whether sufficient free space to store the volume is available in the aggregate.
_______________
3-9 Now, you must use OnCommand System Manager or the ONTAP CLI to migrate the volume to
another aggregate on the same node or on a different node.
End of Exercise

Module 5: WAFL Performance Monitoring and Analysis
Your instructor will provide you with the license codes that you need for the exercises.
Exercise 1: WAFL Performance

In this exercise, you identify and resolve performance issues that are related to the Write Anywhere File
Layout (WAFL) subsystem of ONTAP software.
Objectives
 Use the cluster shell to display the performance characteristics of the WAFL subsystem
 Use the cluster shell to resolve performance issues that are related to volumes and aggregates
Exercise Equipment

Classroom svl-nau
Desktop Remote
Desktop
(node1) (node2)
OPM UM
opm ocum linux1

E5-1 ONTAP Performance Analysis: WAFL Performance Monitoring and Analysis

Task 1: Display WAFL subsystem performance
In this task, you use the cluster shell to display the performance characteristics of the WAFL subsystem.
Step Action
1-2 From the PowerShell window, create a 900-megabyte (MB) file in the Z: drive.
fsutil file createnew Z:900mfile 900000000
1-3 Open a PuTTY session with the cluster.
1-4 From the diagnostic privilege level, start the collection of WAFL performance statistics.
set diag
statistics start –object raid|wafl –sample-id sample_wafl
window.
0 0 32k 900m 90 1 Z:\900mfile
performance data.
statistics stop -sample-id sample_wafl
1-8 Examine the WAFL write request sizes. The histogram depicts the number of write requests of
each size that is passed from WAFL to RAID.
statistics show -object raid -instance raid -counter
write_io_depth_histo -sample-id sample_wafl
1-9 Display the number of RAID stripes written per second.

statistics show -object raid -instance raid -counter stripes_written
-sample-id sample_wafl
1-10 Record the number of these write requests that write a full RAID stripe. _______________
Record the number of these write requests that result in a partial stripe write.
___________________
statistics show -object raid -instance raid -counter
full_stripes|stripes_written|partial_stripes -sample-id sample_wafl
A high percentage of write requests resulting in partial stripe writes might indicate
fragmentation, especially if the WAFL write request sizes exceed the stripe width.
1-11 Compare the number of calculate parity read operations to the total number of blocks that are
written by WAFL.
statistics show -object wafl -instance wafl -counter
wafl_total_blk_writes|wafl_cp_blks_read|wafl_cp_read_time -sample-id
sample_wafl
A high percentage of write requests that result in a calculate parity read operation might indicate
fragmentation.

Task 2: Resolve volume performance issues
In this task, you use cluster shell commands to resolve performance issues that are related to volumes.
Step Action
2-1 Determine whether a volume is overfull.

storage volume show-space –vserver <svm_name> -volume <vol_name>
2-2 Determine whether a volume is fragmented.

volume reallocation measure –vserver <svm_name> -path <vol_path> -once
2-3 Display the progress of the reallocate measure scan.

volume reallocation show
2-4 Display the results of the reallocate measure scan.

event log show -event wafl.reallocate
Record the volume layout optimization value. __________________
2-5 To consolidate the disk blocks that belong to the files in the volume, perform reallocation on the
volume.
volume reallocation start –vserver <svm_name> -path <vol_path> -once
–no-check
2-6 To consolidate the free blocks into contiguous disk blocks, perform reallocation on the
aggregate.
aggregate reallocation start –aggregate <aggr_name> -once
2-7 Determine whether read reallocation is enabled on the volume.

volume show <vol_name> -fields read-realloc
2-8 Enable read reallocation on the volume.

volume modify –vserver <svm_name> -volume <vol_name> -read-realloc
space-optimized
2-9 Determine whether free space reallocation is enabled on the aggregate.

aggregate show <aggr_name> -fields free-space-realloc
2-10 Enable free space reallocation on the aggregate.

aggregate modify <aggr_name> -free-space-realloc on

Task 3: Resolve aggregate performance issues
In this task, you use cluster shell commands to resolve performance issues that are related to aggregates.
Step Action
3-1 Identify volumes and workloads in the aggregate.

volume show –aggregate <aggr_name>
3-2 Identify the most busy and least busy volumes in the aggregate.
statistics volume show –aggregate <aggr_name>
3-3 Identify the other nodes and or aggregates that can host the volumes.
Which data (non-root) aggregate has the lowest number of I/O operations?
3-4 Move the least busy volume to the least busy aggregate.
volume move start –vserver <vs_name> –volume <volume>
-destination-aggregate <aggr_name>
End of Exercise

Module 6: Disk I/O Performance
Exercise 1: Identifying and resolving disk I/O bottlenecks
In this exercise, you identify and resolve performance issues that are related to the disk I/O subsystem of
ONTAP.
Objectives
 Identify the components of the disk I/O subsystem
 Use the cluster shell to monitor and display disk subsystem performance
 Use the cluster shell to configure RAID background tasks
 Use Performance Manager to identify and resolve disk performance issues
Exercise Equipment

Laptop or
Classroom svl-nau
Desktop Remote
Desktop
(node1) (node2)
OPM UM
opm ocum linux1

OnCommand Unified
Manager
Manager
E6-1 ONTAP Performance Analysis: Disk I/O Performance

Task 1: Identify the components of the disk I/O subsystem
In this task, you use the cluster shell to identify the components of the disk I/O subsystem.
Step Action
1-1 Open a PuTTY session with the ONTAP cluster.
1-2 Switch to the diagnostics privilege level of the cluster shell.

set diag
1-3 Identify the number of disks and types of disks that are present in the cluster.
storage disk show
1-4 Determine whether the cluster uses partitioned disks.

storage disk partition show
Record whether the cluster uses partitioned disks. ___________________
The lab equipment might not include disk partitions, so this command might return a result of
“This table is currently empty.”
1-5 Identify the aggregates that are present in the cluster.

storage aggregate show
Record the amount of space in the aggregate that is in use. ___________________
1-6 Identify how the space in an aggregate is used.

storage aggregate show-space –aggregate <aggr_name>
Record the amount of space in the aggregate that is consumed by metadata.
___________________
1-7 Identify the configuration of the aggregates that are present in the cluster.
storage aggregate show-status
1-8 Identify the configuration of a specific aggregate.

storage aggregate show-status –aggregate <aggr_name>
Record the number of plexes that are present in the aggregate. ___________________
Record the number of RAID groups that are present in the aggregate. ___________________
Record the number of disks per RAID group that are present in the aggregate.
___________________
Record the disk type and RPM. ___________________
1-9 Identify the disk shelves that are present in the cluster.
storage shelf show
Record the number of shelves that are present in the cluster. ___________________
The lab equipment might not include disk shelves, so this command might return a result of
“There are no entries matching your query,” or “This table is currently empty.”

Step Action
1-10 Identify the connection type of the disk shelves.

storage shelf show -port
Record the negotiated speed of the SAS port. ________________
The lab equipment might not include disk shelves, so this command might return a result of
“This table is currently empty.”
1-11 Determine whether any aggregates are not online.

storage aggregate show -state !online
1-12 Determine whether any disks are in a broken state.

storage disk show -state broken
Task 2: Use the cluster shell to monitor disk performance

In this task, you use cluster shell commands to monitor cluster disk I/O performance.
Step Action
2-1 Check the performance of all aggregates in the cluster.

Record the aggregate that performs the most I/O operations. ________________
2-2 Display the 10 busiest disks.

statistics disk show –sort-key disk_busy –max 10
Record the busiest disk. ________________
2-3 Display the 10 slowest disks.

statistics disk show –sort-key operation_latency –max 10
Record the slowest disk. ________________
2-4 Identify the aggregate to which the slowest disk belongs.

storage aggregate show –disklist <disk_name>
Record the aggregate that contains the slowest disk. ________________
2-5 Display the latency of the disks that belong to the busiest aggregate.
storage disk show –aggregate <aggr_name> -fields average_latency
Record the slowest disk. ________________

Task 3: Use the cluster shell to display disk performance
In this task, you use the cluster shell to collect and display the performance characteristics of the disk I/O
subsystem.
Step Action
“aggregate,” and “disk.”
set advanced
statistics start -object volume|aggregate|disk -sample-id sample_disk
window.
0 0 32k 900m 90 1 Z:\900mfile
3-5 While the SIO program is running, display the disk% busy value and determine if back-to-back
consistency points are occurring.
node run local sysstat –u –c 5
3-6 While the SIO program is running, display the volume performance statistics.
performance data.
statistics stop -sample-id sample_disk
3-8 Check the latency for each volume in the cluster.

statistics show-periodic -object volume –counter avg_latency -sample-id
sample_disk
3-9 Check volume latency for a specific volume.

statistics show -object volume -instance white_vol1 –counter
avg_latency -sample-id sample_disk
3-10 Check volume activity.

statistics show -object volume -counter *data -sample-id sample_disk
3-11 Check the latency for each aggregate in the cluster.

statistics show -object aggregate -counter user_reads|user_writes
-sample-id sample_disk
3-12 Check disk latency.

statistics show -object disk -counter *latency -sample-id sample_disk

Step Action
3-13 Delete the performance sample files.

statistics samples delete -sample-id *
Task 4: Use the cluster shell to configure RAID background tasks

In this task, you use cluster shell commands to determine the configuration of RAID subsystem background
tasks.
Step Action
4-1 Determine whether media scrubs are enabled.

storage raid-options show –node * -name raid.media_scrub.enable
4-2 Determine the rate at which media is scrubbed.

storage raid-options show –node * -name raid.media_scrub.rate
4-3 Determine whether RAID scrubs are enabled.

storage raid-options show –node * -name raid.scrub.enable
4-4 Determine the performance impact level of RAID scrubs.

storage raid-options show –node * -name raid.scrub.perf_impact
4-5 Determine the duration of RAID scrubs.

storage raid-options show –node * -name raid.scrub.duration
4-6 Determine when RAID scrubs are scheduled to run.

storage raid-options show –node * -name raid.scrub.schedule
Task 5: Use Performance Manager to identify and resolve disk

performance issues
In this task, you use Performance Manager to diagnose the cause of the overuse of an aggregate.
Step Action
5-1 Open a web browser, and connect to the OnCommand Performance Manager interface.
https://opm
5-2 Log in to Performance Manager:

User name: admin
Password: Netapp123
5-3 Navigate to the Events page.
5-4 Use the Search Event Id textbox to search for an event with event ID of 8002.

Step Action
5-5 Record the event description. ________________
You might need to position your cursor over the event description to see all of the descriptive
text.
5-6 To navigate to the Event Details page, click the event name.
Record the full name of the event. ________________
Record when the event was triggered. ________________
Record when the event concluded. ________________
Record the aggregate that is affected. ________________
Record the event description. ________________
5-7 To navigate to the Aggregate Details page, click the link of the aggregate that is affected.
5-8 Set the time range to encompass the start and end time of the event. For example, set the time
range from 9:00 PM July 30, 2016 through 7:00 AM July 31, 2016.

Step Action
5-9 Position your cursor over the last warning level event for this aggregate. Record the event name.
________________
5-10 Scroll down to the Perf. Capacity Used chart.
5-11 Position your cursor over the Perf. Capacity Used chart and identify the amount of performance
capacity that was used by Aggr5 at 3:54 am on July 31.
5-12 Change the list of objects that you want to view and compare to “Volumes on this Aggregate.”

Step Action
5-13 To add each volume that resides in this aggregate to the performance charts, click the
appropriate Add buttons.
5-14 Examine the IOPS and MBps (megabytes per second) charts. Record the volumes that contribute
to the traffic that goes to the aggregate. ________________
5-15 Click the name of the busiest volume in the aggregate.
5-16 Set the time range to encompass the start and end time of the event. For example, set the time
range from 9:00 PM July 30, 2016 through 7:00 AM July 31, 2016.

Step Action
5-17 Examine the volume IOPS and MBps charts.
5-18 Change the list of the objects that you want to view and compare to “SVM of this Volume.”
5-19 Add the storage virtual machine (SVM) to the charts.

Step Action
5-20 Examine the IOPS and MBps charts.
Record whether this SVM accounts for the activity to the volume. ________________
5-21 Performance Manager identified that an aggregate exceeded its permitted utilization threshold
and triggered an event alert. You used Performance Manager to identify the aggregate that
generated the event, to determine the volume in the aggregate that is driving the I/O, and to
determine the SVM that owns the volume.
You now can use OnCommand System Manager or OnCommand Unified Manager to determine
the client hosts that are accessing the volume and to determine the connection methods.
End of Exercise

Module 7: Flash Cache and Flash Pool Performance
Exercise 1: Exploring Cache Performance
In this exercise, you identify and resolve performance issues that are related to the caching subsystems of a
NetApp storage system. The caching of data in the optional Flash Cache modules and Flash Pool aggregates
is included in the exercise.
Objectives
 Analyze the performance of the caching subsystems
 Configure Flash Cache caching policies
 Configure the aggregate Automated Workload Analyzer (AWA) feature
 Observe and adjust Flash Pool caching policies
Exercise Equipment

Classroom svl-nau
Desktop Remote
Desktop
(node1) (node2)
OPM UM
opm ocum linux1

E7-1 ONTAP Performance Analysis: Flash Cache and Flash Pool Performance

Task 1: Analyze the performance of the caching subsystems
In this task, you use the cluster shell to identify the performance characteristics of the caching subsystems.
Step Action
1-2 Display the current Flash Cache configuration settings.

system node external-cache show
1-3 To simulate the presence of a Flash Cache module, enable the Predictive Cache Statistics (PCS)
feature.
system node external-cache modify -node * -is-enabled true
1-4 Adjust the size of the simulated Flash Cache module.

vserver options -option-name flexscale.pcs_size -option-value 16GB
1-5 Display the current Flash Cache configuration settings.

system node external-cache show
1-6 Using the advanced privilege level, start statistics data collection on the “wafl” and
“ext_cache_obj” objects.
set advanced
statistics start -object wafl|ext_cache_obj -sample-id sample_cache
window.
60 80 32k 900m 90 1 Z:\900mfile
1-9 While SIO is running, observe the effectiveness of the Flash Cache cache.
statistics cache flash-pool show
1-10 Stop the collection of performance statistics.

statistics stop -sample-id sample_cache
1-11 Display the read_io_type counter values.

statistics show -object wafl –counter read_io_type -sample-id
sample_cache
Where are the majority of the read requests being satisfied from. ___________
1-12 Obtain a description of the cache counters.

catalog counter show –object ext_cache_obj –counter
hit|hit_percentage|disk_reads_replaced -describe

Step Action
1-13 Display the cache counter values.

statistics show -object ext_cache_obj –counter
hit|hit_percent|disk_reads_replaced -sample-id sample_cache
Record the cache hit rate. ___________
Record the number of disk read operations that are replaced by reads from cache. ___________
Task 2: Configure Flash Cache caching policies

In this task, you use the cluster shell and node shell to adjust the caching policies of the Flash Cache module
and observe the results.
Step Action
2-1 Display Flash Cache counter values.

hit|hit_percent|disk_reads_replaced -sample-id sample_cache
2-2 Observe the caching policies for readahead blocks and low-priority blocks.
vserver options -option-name flexscale.readahead_blocks
vserver options -option-name flexscale.lopri_blocks
2-3 Using the advanced privilege level, start statistics data collection on the object “ext_cache_obj”
to a different sample file.
statistics start -object ext_cache_obj -sample-id sample_cache2
window.
60 80 32k 900m 90 1 Y:\900mfile
2-6 Display cache counter values.

hit|hit_percent|disk_reads_replaced -sample-id sample_cache2
Record whether the cache hit rate improved. ___________
Record whether the disk read operations replaced rate increased. ___________
2-7 Disable PCS.

system node external-cache modify –node local -is-enabled false

Task 3: Configure the aggregate AWA feature
In this task, you use the cluster shell and node shell to enable the AWA feature to predict the effectiveness of
using a Flash Pool aggregate.
Step Action
3-1 Launch the node shell on cluster node 1.

node run –node svl-nau-01
3-2 Switch to the advanced privilege level.

priv set advanced
3-3 Display the command syntax to manage AWA.

wafl awa
3-4 Start AWA on an aggregate.

wafl awa start fcal_data_102
window.
60 80 32k 900m 90 1 Z:\900mfile
3-7 Display the results of AWA on an aggregate.

wafl awa print
3-8 Record the recommended Flash Pool cache size. ___________
3-9 Record the predicted cache hit ratio if only 60% of recommended Flash Pool cache size is
deployed. ___________
3-10 Display a list of the volumes in the aggregate that benefit the most from caching.
wafl awa print -t
3-11 Stop AWA on an aggregate.

wafl awa stop fcal_data_102
3-12 Exit from the node shell.

exit
Task 4: Observe and adjust Flash Pool caching policies

In this task, you use cluster shell commands to observe and adjust the caching policies of a volume that is
contained in a Flash Pool aggregate.
Step Action
4-1 Display the Flash Pool caching policy that is assigned to all volumes.
volume show –fields caching-policy

Step Action
4-2 Display the caching policies that can be assigned to a volume.

volume modify -caching-policy <TAB>
4-3 Change the Flash Pool caching policy that is assigned to a volume.
volume modify -vserver SVM_white -volume white_vol1 -caching-policy all
4-4 Display the Flash Pool caching priority that is assigned to a volume.
volume show –fields cache-retention-priority
4-5 Display the caching priorities that can be assigned to a volume.

volume modify -cache-retention-priority <TAB>
4-6 Change the Flash Pool caching priority that is assigned to a volume.
volume modify -vserver SVM_white -volume white_vol1
-cache-retention-priority high
4-7 Display the Flash Pool caching attributes that are assigned to volumes that belong to a storage
virtual machine (SVM).
volume show -vserver SVM_white –fields caching-policy,cache-retention-
priority
End of Exercise

Module 8: Cluster Interconnect Performance
Exercise 1: Cluster Interconnect Performance
In this exercise, you use the cluster shell to identify performance issues with the cluster interconnect network.
You resolve the issues by adding cluster interconnect ports and LIFs to the cluster nodes.
Objectives
 Identify performance issues with the cluster interconnect network
 Resolve performance issues with the cluster interconnect network by adding cluster ports to the node
Exercise Equipment

Classroom svl-nau
Desktop Remote
Desktop
(node1) (node2)
OPM UM
opm ocum linux1

Task 1: Display cluster interconnect performance

In this task, you use the cluster shell to identify performance issues with the cluster interconnect.
Step Action
E8-1 ONTAP Performance Analysis: Cluster Interconnect Performance

Step Action
1-2 Determine which ports are currently used for the cluster interconnect network.
network port show –ipspace Cluster
Record the ports that are used for the cluster interconnect network. __________________
Record the status of the ports. __________________
Record the speed of the ports. __________________
1-3 Determine if any cluster interconnect links are down.

network interface show –role cluster
Record the LIFs that are used for the cluster interconnect network. __________________
Record the status of the LIFs. __________________
1-4 Verify cluster interconnect network connectivity.

set advanced
cluster ping-cluster <node_name>
Record whether basic connectivity fails on any paths. __________________
1-5 Use the ping command to observe round trip times. Perform this step on each cluster LIF.
network ping –show-detail true –vserver <SVM_name> -lif <lif_name>
-use-source-port true –destination <remote_IP_addr> -count 3
Record the average round trip time for each of the paths. __________________
1-6 Observe the amount of traffic on the cluster interconnect links.

system cluster-switch interface show -counters
The lab environment does not contain physical network switches, so this command might return
the message, “This table is currently empty.”
Task 2: Resolve cluster interconnect performance issues

In this task, to resolve performance issues, you add and enable network ports for use with the cluster
interconnect network.
Step Action
2-1 Identify the available physical network ports.

network port show
Record the speed and frame size of the cluster ports. __________________
2-2 If the frame size is not 9000, then increase the frame size that is used on the cluster ports that are
members of the Cluster broadcast domain. Otherwise, skip this step.
network port broadcast-domain modify –ipspace Cluster –broadcast-domain
Cluster –mtu 9000

Step Action
2-3 Select the network data ports to convert into cluster interconnect ports, and record the node and
port names.
Cluster node name = __________________
Port name = ___________________
2-4 Determine whether there are data LIFs residing on the ports that will be converted into cluster
ports.
network interface show –curr-node <node_name> –curr-port <port_name>
2-5 If data LIFs reside on the ports that will be converted into cluster ports, assign the data LIFs to a
different data port.
network interface modify –vserver <svm_name> -lif <lif_name> -home-node
<node_name> -home-port <port_name>
2-6 Move the data LIF to the new home port.

network interface revert –vserver <svm_name> -lif <lif_name>
2-7 Verify that the LIF is now on the new home port.
network interface show –vserver <svm_name> -lif <lif_name>
–fields curr-node,curr-port
2-8 Determine to which IP space and broadcast domain the new cluster ports currently belong, and
record the information.
network port show –port <port_name>
IPspace = _______________________
Broadcast = ______________________
2-9 Remove the new cluster ports from the current broadcast domain.
network port broadcast-domain remove-ports –ipspace <IP_space>
-broadcast-domain <bcast_dom> -ports <node_name:port_name>
2-10 Assign the new cluster ports to the Cluster broadcast domain.
network port broadcast-domain add-ports –ipspace Cluster -broadcast-
domain Cluster -ports <node_name:port_name>
2-11 Disable the new cluster ports, so that you can move the network cables to the cluster
interconnect network.
network port modify –up-admin false -node <node_name> -port <port_name>
During this exercise, you do not move the cluster cables.
2-12 Create a new cluster LIF for the new cluster port.
network interface create –vserver Cluster –lif <lif_name> –role cluster
–home-node <node_name> -home-port <port_name> -auto true

Step Action
2-13 After you move the network cables to the cluster interconnect network, enable the new cluster
ports.
network port modify –up-admin true -node <node_name> -port <port_name>
During this exercise, you do not move the cluster cables, so leave the new cluster port disabled.
End of Exercise

Module 9: Storage QoS
Exercise 1: Workload Management with Storage QoS
In this exercise, you learn how to work with the storage quality of service (QoS) feature to prevent problem
workloads and tenants from affecting other workloads and tenants.
Objectives
 Reactively limit throughput to a workload by associating the workload with a QoS policy group
 Proactively monitor workload performance by associating a workload with a QoS policy group
 Isolate a tenant workload by associating the workload with a QoS policy group
Exercise Equipment

Classroom svl-nau
Desktop Remote
Desktop
(node1) (node2)
OPM UM
opm ocum linux1

Task 1: Reactively limit throughput to a workload

In this task, you use the Iometer tool to generate a 50% write and 50% read workload on a LUN. You then set
up a storage QoS limit to throttle the workload that is associated with the LUN.
E9-1 ONTAP Performance Analysis: Storage QoS

Step Action
1-1 On the landing host, launch the Iometer load generator. On the taskbar of the landing host, click
the Iometer icon and, if prompted, approve the Iometer End User License Agreement, EULA.
The Iometer window should open.
1-2 In the left pane, select the W2K12-01 node.

Step Action
1-3 On the Disk Targets tab, specify the following:

 Targets: Y:orange_lun1 drive
 Maximum Disk Size: 80000
 Starting Disk Sector: 0
 # of Outstanding I/Os: 16
1-4 Click the Access Specifications tab.

Step Action
1-5 Under Global Access Specifications, select 512B; 50% Read; 0% random.
1-6 Click Add.
1-7 Click the Test Setup tab.

Step Action
1-8 On the Test Setup tab, specify the following:

 Test Description: QoS Exercise
 Ramp Up Time: 4 seconds
 Record Results: None
1-9 Click the Results Display tab.

Step Action
1-10 Change the Update Frequency to 4 seconds.

This action reduces the load on the Windows computer that is required to update the Iometer
interface.
1-11 In the toolbar, click the Start Tests button (a green flag).

Step Action
1-12 Open Windows Explorer to the Y:orange_lun1 drive and notice a new file, iobw.tst.
This is the test file that is increasing in size until it is 40,000 kilobytes (KB) (80,000 x 512 bytes
[B]).
1-13 Notice that, when the file reaches its maximum size, the ramp-up time begins to count down for
four seconds.
The ramp-up time ensures that the storage is stable before the testing begins.
1-14 When the 50% read and 50% write test results begin to appear, notice that the total IOPS are
recorded on the top row of the display output.
On the test system, the total IOPS probably remain in a range below the range of a clustered
system that operates on nonvirtualized equipment. Remember that this exercise uses a Windows
system and a cluster that operates in a shared virtualized environment.
1-15 Leave the test running.

Step Action
1-17 Using the statistics command, identify the volume or LUN that shows the most IOPS.
statistics lun show
1-18 Display the current storage QoS policies.

qos policy-group show
1-19 To see the current total IOPS, switch back to the Iometer tool.
1-20 Create a new storage QoS policy.
If the current total IOPS on the Iometer tool is higher than 1000, set the -max-throughput
parameter to 1000iops; otherwise, set the parameter to 500iops.
qos policy-group create -policy-group maxtp_policy -vserver SVM_orange
-max-throughput 1000iops

1-22 Associate the policy to the hot volume or LUN.
You can associate a storage QoS policy to a storage virtual machine (SVM) hosting FlexVol
volumes, LUNs, volumes, or files.
1-23 Verify the current throughput from the storage.

qos statistics performance show –iterations 4

Step Action
1-24 Wait a few moments and watch the total IOPS on the Iometer decrease.
You can continue to run the previous command to observe the current storage readings. A few
minutes might elapse before the IOPS start to decrease.
1-25 When your testing is complete, stop the Iometer test by clicking the Stop button on the
toolbar.
1-26 In the PuTTY window, delete the policy.

qos policy-group delete -policy-group maxtp_policy
1-27 Close the Iometer tool.
1-28 On the landing host, launch the Iometer load generator. On the taskbar of the landing host, click
the Iometer icon and, if prompted, approve the Iometer EULA.
Task 2: Proactively monitor workload performance

In this task, you set up a storage QoS limit for a volume and use Iometer to generate a 50% write and 50%
read workload on the volume.
Step Action

Step Action
2-2 Create a new storage QoS policy.

qos policy-group create -policy-group prevent_policy -vserver
SVM_orange

2-4 You identified orange_vol1 as a new and potentially runaway workload.

Associate the storage QoS policy to the orange_vol1 volume.
You can associate a storage QoS policy to an SVM hosting FlexVol volumes, LUNs, volumes,
or files.
volume modify SVM_teal_vol1 -vserver SVM_teal -qos-policy-group
prevent_policy

2-6 On the taskbar, click the Iometer icon.

Step Action

 Targets: Y drive

Step Action
2-10 Under Global Access Specifications, select 4K; 50% Read; 0% random.

Step Action
2-11 Click Add.


Step Action

This reduces the load on the Windows computer that is required to update the Iometer interface.

Step Action
2-17 Open Windows Explorer to the Y drive and notice a new file, iobw.tst.
This is the test file that is increasing in size until it is 80,000 KB (20,000 x 4 KB).
2-18 Notice that, when the file reaches the maximum size, the ramp-up time begins to count down for
four seconds.

Step Action
Remember that this is a Windows computer and a cluster that operates in a shared virtualized
environment.
2-21 Notice the total IOPS.

Step Action
2-22 In the PuTTY window, verify the current throughput from the storage.
toolbar.
2-24 In the PuTTY window, delete the policy.

qos policy-group delete -policy-group prevent_policy
Task 3: Isolate a tenant workload

In this task, you set up a storage QoS tenant limit at the SVM level and use the Iometer tool to generate a 50%
write and 50% read workload on one of the tenants.
Step Action
3-2 Create three new storage QoS policies.

qos policy-group create -policy-group red_policy -vserver SVM_red
qos policy-group create -policy-group orange_policy -vserver SVM_orange
qos policy-group create -policy-group yellow_policy -vserver SVM_yellow

3-4 Associate each policy to the corresponding SVM.

You can associate a storage QoS policy to an SVM with FlexVol volumes, LUNs, volumes, or
files.
vserver modify -vserver SVM_red -qos-policy-group red_policy
vserver modify -vserver SVM_orange -qos-policy-group orange_policy
vserver modify -vserver SVM_yellow -qos-policy-group yellow_policy


Step Action
3-6 On the taskbar, click the Iometer icon.

Step Action

 Targets: Y drive

Step Action
3-10 Under Global Access Specifications, select 4K; 50% Read; 0% random.
3-11 Click Add.

Step Action


Step Action

This reduces the load on the Windows computer that is required to update the Iometer interface.

Step Action
3-17 Open File Explorer to the Y drive and notice a new file, iobw.tst.
This is the test file that is increasing in size until it is 80,000 KB (20,000 x 4 KB).
3-18 Notice that, when the file reaches the maximum size, the ramp-up time begins to count down for
four seconds.

Step Action
Remember that this is a Windows computer and a cluster that operates in shared virtualized
environment.
3-21 In the PuTTY window, verify the current throughput from the storage.

Step Action
3-22 To see the current total IOPS, switch back to the Iometer tool.
toolbar.
End of Exercise

Module 10: Network-Attached Storage Performance
Exercise 1: NAS Performance
In this exercise, you use the cluster shell to identify and resolve performance issues that relate to NAS
protocols.
Objectives
 Display NAS protocol performance statistics
 Resolve performance issues with NFS
 Resolve performance issues with SMB
Exercise Equipment

Classroom svl-nau
Desktop Remote
Desktop
(node1) (node2)
OPM UM
opm ocum linux1

Linux Server CentOS64 192.168.0.10 root Netapp123
E10-1 ONTAP Performance Analysis: Network-Attached Storage Performance

Task 1: Display NAS protocol performance statistics
In this task, you use the cluster shell commands to identify performance bottlenecks in NFS and CIFS
protocol traffic.
Step Action
1-2 Using the advanced privilege level, start statistics data collection on the objects “nfsv3” and
“cifs”.
set advanced
statistics start -object nfsv3|cifs -sample-id sample_NAS
1-3 Open a PuTTY session with the Linux host.
1-4 On the Linux host, open a command-line window.
1-5 On the Linux host, create a large file in the NFS mounted directory.
cp –r /var/log /mnt/SVM_teal/logdata
cp –r /mnt/SVM_teal/logdata /mnt/SVM_teal/logdata2
window.
0 0 32k 900m 90 60 1 Z:\900mfile
1-9 While the SIO program is running, in the ONTAP Cluster shell, display the cluster performance
statistics.
cluster statistics show
1-10 While the SIO program is running, display the storage virtual machine (SVM) performance
statistics.
statistics vserver show –vserver SVM_white
1-11 While the SIO program is running, display the LIF performance statistics.
statistics lif show –lif SVM_white_cifs_lif1
performance data.
statistics stop -sample-id sample_NAS
1-13 Display the NFS read and write operation counters.

statistics show -object nfsv3 –instance SVM_teal -counter
nfsv3_read_ops|nfsv3_write_ops -sample-id sample_NAS
Record the read/write ratio of the I/O. _____________________

Step Action
1-14 Display the NFS latency counters.

statistics show -object nfsv3 -counter nfsv3_latency -sample-id
sample_NAS
Record the average latency of the NFS I/O traffic. _____________________
1-15 Display the CIFS read and write operation counters.

statistics show -object cifs –instance SVM_white -counter cifs
_read_ops| cifs _write_ops -sample-id sample_NAS
Record the read/write ratio of the I/O. _____________________
1-16 Display the CIFS latency counters.

statistics show -object cifs –instance SVM_white -counter cifs_latency
-sample-id sample_NAS
Record the average latency of the CIFS I/O traffic. _____________________

statistics samples delete -sample-id sample_NAS
Task 2: Resolve NFS performance issues

In this task, you resolve performance issues with the cluster interconnect network by adding and enabling
network ports for use with the cluster interconnect network.
Step Action
2-1 Display the NFS configuration of a NAS SVM.

vserver nfs show –vserver SVM_teal
Record whether NFSv4 client referrals are enabled. _____________________
2-2 Change the TCP and User Datagram Protocol (UDP) maximum transfer size for a NAS SVM.
vserver nfs modify –vserver SVM_teal -tcp-max-transfer-size 65536
vserver nfs modify –vserver SVM_teal -udp-max-transfer-size 65536

Step Action
2-3 Migrate an NFS LIF to another node in the cluster.

a. Identify the amount of traffic that is passing though the NFS LIF.
statistics lif show –lif SVM_teal_nfs_lif1
b. Determine the port that is hosting the NFS LIF.
network interface show –lif SVM_teal_nfs_lif1
c. Identify the port speed and configuration settings.
network port show -node svl-nau-01 -port e0e
d. Identify another port that has sufficient available I/O bandwidth.
statistics show-periodic -object nic_common -counter
instance_name|rx_total_bytes|tx_total_bytes|instance -iterations
1 -interval 1 -node svl-nau-02 -instance e0f
e. Move the LIF to the selected port.
migrate -vserver SVM_teal -lif SVM_teal_nfs_lif1 -destination-
node svl-nau-02 -destination-port e0f
You might need to disable NFSv4 referrals on the SVM.
vserver nfs modify –vserver SVM_teal -v4.0-referrals disabled
Task 3: Resolve CIFS performance issues

In this task, you use cluster shell commands to resolve performance issues with the CIFS file sharing
protocol.
Step Action
3-1 Display a list of SVMs that have CIFS enabled.

vserver cifs show
3-2 Display the CIFS configuration options for an SVM.

vserver cifs option show –vserver <svm_name>
Record whether client referrals are enabled. _____________________
Record whether copy offloads are enabled. _____________________
3-3 Migrate a CIFS LIF to another node in the cluster.

a. Identify the amount of traffic that is passing though the CIFS LIF.
statistics lif show –lif SVM_white_cifs_lif1
b. Identify another port that has sufficient available I/O bandwidth.
1
-interval 1 -node svl-nau-02 -instance e0f
c. Move the LIF to the selected port.
migrate -vserver SVM_white -lif SVM_white_cifs_lif1 -destination-
node svl-nau-02 -destination-port e0f
You might need to disable SMB referrals on the SVM.
vserver cifs option modify –vserver SVM_white -is-referral-enabled
false

Step Action
3-4 Add more CIFS LIFs.

a. Identify another port that has sufficient available I/O bandwidth.
1
-interval 1 -node svl-nau-02 -instance e0f
b. Create a CIFS LIF on the selected port.
network interface create -vserver SVM_white -lif
SVM_white_cifs_lif2 -role data -data-protocol cifs -home-node
svl-nau-02 -home-port e0f …
You might need to disable SMB referrals on the SVM.
vserver cifs option modify –vserver SVM_white -is-referral-enabled
false
Task 4: Use Perfstat to resolve CIFS performance issues

In this task, you use the data that Perfstat gathers to resolve performance issues with the CIFS file sharing
protocol. In this scenario, the customer is experiencing very slow performance when accessing “image”
volumes.
Step Action
4-1 Navigate to the folder that contains the performance data files that Perfstat collected.
CourseFiles\Perfstat\371007
4-2 Examine the contents of the df.txt file.

In the following table, record the names of the “image” volumes and the amount of capacity that
is used by each volume.
Volume Name Capacity Used

Step Action
4-3 Examine the contents of the sysstat_x_1sec.txt file.

Record the number of NFS operations that are performed per second, on average.
_____________________
Record the number of CIFS operations that are performed per second, on average.
_____________________
Record the number of iSCSI operations that are performed per second, on average.
_____________________
Record the number of FCP operations that are performed per second, on average.
_____________________
Record the amount of data that is read from and written to the network per second, on average.
_____________________
Record what this information indicates about the type of I/O that this system is performing.
____________________
4-4 Continue to examine the contents of the sysstat_x_1sec.txt file.

Record the amount of data that is read from and written to the hard disk drives per second, on
average. _____________________
Record how busy the hard disk drives (HDDs) are, on average. _____________________
Record what this information indicates about the state of the disk I/O subsystem.
_____________________
4-5 Continue to examine the contents of the sysstat_x_1sec.txt file.

Record the average CPU usage. _____________________
Record the cache hit rate. _____________________
Record what percentage of the time the system is processing consistency points.
_____________________
Record whether this information indicates a performance bottleneck. _____________________

Step Action
4-6 Examine the contents of the stats_stop_-l_workload_volume.txt file.

In the following table, for each “image” volume, record the number of read and write operations,
the amount of data read and written, and the latencies.
Volume Read Write Other Read Write Read Write

Ops Ops Ops Data Data Latency Latency
Image_full
Image_plus
Image_thumb
Image_med
Based on the results in the table, record where the majority of the workload is present on the
“image” volumes. _____________________
4-7 Examine the contents of the stats_stop_-l_readahead.txt file.

In the following table, record the value for each counter.
Counter Value
readahead:readahead:requested
readahead:readahead:read
readahead:readahead:incore
readahead:readahead:dir_blocks
Based on the results in the table, record the type of data that is being accessed.
_____________________
4-8 To confirm your conclusions, examine the wafl_susp_-w.txt file.

Locate the section that is titled “blocks read ahead by type.”
Record the type of blocks that are read most by the readahead mechanism.
_____________________
Record how this action might affect disk I/O performance. _____________________
End of Exercise

Module 11: Storage Area Network Performance
Exercise 1: SAN Protocol Performance
In this exercise, you use the cluster shell to identify and resolve performance issues that relate to SAN
protocols.
Objectives
 Display SAN protocol performance statistics
 Resolve LUN performance issues
 Resolve SAN LIF performance issues
Exercise Equipment

Classroom svl-nau
Desktop Remote
Desktop
(node1) (node2)
OPM UM
opm ocum linux1

Task 1: Display SAN protocol performance statistics

In this task, you use the cluster shell to identify performance issues that are related to SAN protocol activity.
Step Action
E11-1 ONTAP Performance Analysis: Storage Area Network Performance

Step Action
“aggregate,” and “disk.”
set advanced
statistics start -object lun|iscsi -sample-id sample_SAN
1-3 On the Windows system desktop, copy the 900mfile from the CourseFiles folder to the X: drive.
window.
0 0 32k 900m 90 X:\900mfile
1-7 While the SIO program is running, display the storage virtual machine (SVM) performance
statistics.
statistics lun show –vserver SVM_red
1-8 While the SIO program is running, display the LIF performance statistics.
statistics lif show –vserver SVM_red
1-9 Examine the FC port queue depths.

network fcp adapter show –fields queue-depth
1-10 While the SIO program is running, switch into the diagnostic privilege level and display the
LUN alignment statistics.
set diag
lun alignment show -vserver SVM_red -path /vol/red_vol1/red_lun1
performance data.
statistics stop -sample-id sample_SAN
1-12 Display the LUN read and write operation counters.

statistics show -object lun -instance /vol/red_vol1/red_lun1 -counter
read_ops|read_data|write_ops|write_data -sample-id sample_SAN
1-13 Display the LUN read and write size histograms.

read_io_len_hist|write_io_len_hist -sample-id sample_SAN
1-14 Display the LUN I/O latency counters.

avg_read_latency|avg_write_latency -sample-id sample_SAN
1-15 Display the percentage of read and write operations that are partial blocks.
read_partial_blocks|write_partial_blocks -sample-id sample_SAN

Step Action
1-16 Display the LUN I/O alignment histograms.

read_align_histo|write_align_histo -sample-id sample_SAN

statistics samples delete -sample-id sample_SAN
Task 2: Resolve LUN performance issues

In this task, you resolve performance issues with LUNs.
Step Action
2-1 If poor LUN performance is caused by LUN misalignment, then you must realign the LUN.
Use the NetApp Virtual Storage Console (VSC) or other tools to realign the LUN.
2-2 If poor LUN performance is caused by congestion with other workloads that are accessing the
same aggregate, then migrate the LUN to another aggregate or node in the cluster:
a. Identify the LUN that is experiencing the highest latency.
b. Identify the volume in which the LUN resides.
c. Identify the aggregate in which the volume resides.
d. Determine the other volumes that reside in the same aggregate.
e. Decide whether to move this LUN or volume to another aggregate or whether to move a
different volume.
f. Identify another aggregate that has sufficient free space and I/O bandwidth headroom to
accommodate the volume that you want to move.
g. Move the volume to the selected aggregate.
The asymmetric logical unit access (ALUA) should redirect the client hosts to use a direct
path to the node that now contains the LUN.
Task 3: Resolve SAN performance issues

In this task, you explore ways to resolve performance issues with the ports that are connected to a SAN.
Step Action
3-1 Migrate a SAN LIF to another node in the cluster.

If the problem is occurring because the node controller is overwhelmed, and you cannot move
the LUN to another aggregate, then you can assign the SAN LIF on the saturated node to a port
on different node in the cluster. This action slightly reduces the burden on the saturated node,
because the processing of the SAN protocol is offloaded onto a different node.
3-2 Add a SAN LIF.

If the problem is caused by a saturated port on the cluster, then you can create a SAN LIF on the
SVM and assign the SAN LIF to a different port on the same node.
End of Exercise

STRSW Ilt Perfcdot Rev01 Exerciseguide

Transféré par

Informations du document

Titre original

Copyright

Formats disponibles

Partager ce document

Partager ou intégrer le document

Options de partage

Avez-vous trouvé ce document utile ?

Ce contenu est-il inapproprié ?

Droits d'auteur :

Formats disponibles

STRSW Ilt Perfcdot Rev01 Exerciseguide

Transféré par

Droits d'auteur :

Formats disponibles

ONTAP Performance Analysis

ONTAP Performance Analysis

NetApp University - Do Not Distribute

U.S. GOVERNMENT RIGHTS

E-2 ONTAP Performance Analysis: Welcome

NetApp University - Do Not Distribute

E-3 ONTAP Performance Analysis: Welcome

NetApp University - Do Not Distribute

Exercise 0: Checking the Lab Equipment

E-4 ONTAP Performance Analysis: Welcome

NetApp University - Do Not Distribute

Performance Manager Unified Manager CentOS 6.5

opm ocum linux1

Machine Host Name IP Address User Name Password

Windows Server 2012 w2k12 192.168.0.11 LEARN\Administrator Netapp123

E-5 ONTAP Performance Analysis: Welcome

NetApp University - Do Not Distribute

E-6 ONTAP Performance Analysis: Welcome

NetApp University - Do Not Distribute

1-5 Double-click the PuTTY shortcut.

E-7 ONTAP Performance Analysis: Welcome

NetApp University - Do Not Distribute

1-8 In the PuTTY Security Alert message, click Yes.

The ONTAP cluster CLI prompt and cursor appear.

E-8 ONTAP Performance Analysis: Welcome

NetApp University - Do Not Distribute

E-9 ONTAP Performance Analysis: Welcome

NetApp University - Do Not Distribute

2-1 In the svl-nau-mgmt CLI, enter the following command:

2-3 If any license is not installed, contact NetApp University Support.

E-10 ONTAP Performance Analysis: Welcome

NetApp University - Do Not Distribute

3-1 Open a PuTTY session with the NetApp cluster.

3-2 Check the current system timeout value.

3-3 Set the system timeout to 0 (no timeout).

E-11 ONTAP Performance Analysis: Welcome

NetApp University - Do Not Distribute

Windows Server 2012 R2

Performance Manager Unified Manager CentOS 6.5

opm ocum linux1

Machine Host Name IP Address User Name Password

E1-1 ONTAP Performance Analysis: How NetApp Storage Works

NetApp University - Do Not Distribute

1-1 On the Windows Server desktop, launch Windows Internet Explorer.

E1-2 ONTAP Performance Analysis: How NetApp Storage Works

NetApp University - Do Not Distribute

1-3 Log in to the cluster OnCommand System Manager interface.

1-4 View the Cluster Details page for the cluster.

E1-3 ONTAP Performance Analysis: How NetApp Storage Works

NetApp University - Do Not Distribute

2-3 Enter information about each aggregate in the following table.

E1-4 ONTAP Performance Analysis: How NetApp Storage Works

NetApp University - Do Not Distribute

2-5 Enter information about each SVM in the following table.

2-9 Enter information about each LUN in the following table.

E1-5 ONTAP Performance Analysis: How NetApp Storage Works

NetApp University - Do Not Distribute

E1-6 ONTAP Performance Analysis: How NetApp Storage Works

NetApp University - Do Not Distribute

Windows Server 2012 R2

Performance Manager Unified Manager CentOS 6.5

opm ocum linux1

Machine Host Name IP Address User Name Password