Académique Documents
Professionnel Documents
Culture Documents
Cloud Era?
An Intelligent Roadmap for Capacity Planning
BACKGROUND
In any utility environment (electricity, gas, water, mobile telecoms) ensuring there is enough
resource or capacity available to meet demands is critical in satisfying the needs of the
consumer.
At the same time, it is important to ensure that there is not an over supply of capacity in a
market, as this will ultimately impact profitability and viability of the suppliers business model.
These rules not only apply to traditional utilities, but also to the provisioning of IT infrastructure
used to host applications that support the business.
Virtualization accelerated this concept through its ability to share compute resources amongst
multiple application workloads, and the agility it provides to rapidly provision and reconfigure
compute resources through software.
Whilst virtualization is a great enabler to get a better return on investment from virtualized IT
infrastructure, many Enterprise IT departments and Service Providers have relatively immature
capacity management processes and are not exploiting the latest innovations that would
enable them to transform their situation.
2
ARE YOUR CAPACITY MANAGEMENT PROCESSES FIT FOR THE CLOUD ERA?
This is typically based on a resource allocation model which takes the total amount of memory
and CPU capacity allocated to all virtual machines in a compute cluster and a level of over
provisioning (e.g. 2:1, 4:1, 8:1, 12:1 ) is assumed in order to calculate the requirement for physical
resources.
The level of over provisioning is often directly related to different tiers of infrastructure and the
service levels offered by these. For example, a Platinum Service level may be offered on a
Compute Cluster that has a conservative overprovision ratio of 2:1, and a Bronze Service might
be offered on a Compute Cluster with a more aggressive overprovision ratio of 12:1.
The application owner will make a decision about the tier of infrastructure on which they want
their application to run based on a trade off between the level of risk they are willing to assume
and the cost.
3
Figure 1 shows a compute cluster implemented with a ratio of 4:1 over provisioning. Utilization
levels are very low relative to the point where there would be any risk of resource contention that
would impact application performance. The result is a highly inefficient use of resources.
In compute clusters with higher ratios of over provisioning, there is greater risk of performance
issues, because hypervisor schedulers focus on balancing resources based on the deviation of
memory or CPU utilization across hosts within a cluster. They do not look at the problem from the
perspective of how to best meet the current and historical resource requirements of the actual
application workloads. In addition to this, they do not consider key resources such as CPU
scheduling, Memory Ballooning
and Swapping, I/O, and
network resources in their
decision of where to place
workloads. In brief, they are
simple much to the detriment
of efficiency.
4
ARE YOUR CAPACITY MANAGEMENT PROCESSES FIT FOR THE CLOUD ERA?
Infrastructure Constraints The way that networks and storage are provisioned constrains where
specific workloads can be placed and moved within the environment. These constraints should
be taken into account when making resource allocation decisions to reflect what is really
possible in the real world.
Business Constraints Organizations typically have business policies that also restrict where
workloads can run within the environment. These constraints may assist business continuity by
ensuring that redundant application instances do not co-exist on the same physical resource.
They may also support software-licensing requirements or adhere to security policies. These rules,
while essential, constrain how the available compute resources can be exploited.
Time When making resource allocation decisions, you need to consider the time dimension of
the past, because application workloads fluctuate and peak at different times, driven by
fluctuations in the business process they support. They may also grow or contract over time in their
consumption of resources. These characteristics should be factored into any workload placement
and resource allocation decisions.
The Future - Most organizations are constantly changing their virtualized data center, driven by
the need to deliver new projects, execute ongoing tech refreshes, and/or support organic
growth in demand of existing applications. Having the ability to look at the past and accurately
predict the impact of changes in the future is critical in planning what resource allocation
decisions should be taken to accommodate planned changes.
5
Complex Puzzle What resource allocation decisions
should be taken on an ongoing basis to accommodate
application workload demand?
This puzzle is not one that can be solved by human brainpower at least not within the necessary
time constraint and therefore most organizations accept the costly tradeoff of over investing in
infrastructure capacity to try and mitigate risk. Even when organizations do this, they may still
experience quality of service issues due to complexity of the Intelligent Workload Management
Problem.
The alternative to overinvestment is to employ state of the art capacity management software
that employs a mathematical approach to solve the Intelligent Workload Management
Problem.
There are literally hundreds of toolsets in the marketplace today that provide monitoring and
reporting capabilities, and they are a necessity in all mission-critical IT environments.
Many of the solutions available today have embedded analytical capabilities to assist users in
turning large volumes of monitoring data into information that can be consumed by IT Operations
and Engineering.
Examples of data reduction techniques include threshold alerting based on real time exceptions,
or trending in resource consumption to predict when resources will run out in the future. More
recently, solutions in this space have embedded analytics that focus on anomaly detection.
These solutions seek abnormal patterns in IT monitoring data, and infer pending issues that could
impact service delivery. These types of analytics can be classified as Data Analysis.
6
ARE YOUR CAPACITY MANAGEMENT PROCESSES FIT FOR THE CLOUD ERA?
Whilst these solutions may focus Operations on potential issues that could impact service delivery,
they do not provide the Decision Analytics required to determine what actions should be taken
to prevent and/or resolve resource contention and/or drive greater efficiency from the IT
infrastructure. In aggregate, these decisions could range from the hundreds to the thousands for
any single point in time. The inability of any human to not only interpret Data Analysis output, but
also act on such output highlights an important gap that can be addressed by a new category
of solution.
Virtualization has enabled us to change the paradigm of how we manage IT service delivery
through the ability to dynamically allocate resources through software controls. Whilst hypervisor
scheduling is marketed as a mechanism to control resource allocation in virtualized data centers,
the scope of decision-making and control they provide is very limited.
By maintaining the environment in the Desired State, these functions prevent resource
contention by sustaining the service levels of virtualized applications, and cut down the number
of problems and incidents that
Operations must handle.
7
in infrastructure efficiency.
Figure 3 shows the improvement that intelligent workload placement decisions had on a
compute cluster bearing a 12:1 over provisioning strategy. The risks of application performance
issues were significantly reduced
and a more efficient use of
resources was achieved.
Figures 5 and 6 illustrate examples Figure 4: Existing workload can be accommodated on a much
of the types of resource allocation smaller hardware footprint without impacting service.
decisions that can be driven through Decision Analysis & Control functions to control the
environment in the "Desired State", assuring applications get the resources they need to
operate reliably whilst maximizing infrastructure efficiency.
Figure 5: The resource management decisions that need to be taken to address existing and future performance risks
and inefficiencies.
8
ARE YOUR CAPACITY MANAGEMENT PROCESSES FIT FOR THE CLOUD ERA?
Figure 6: Consolidate workloads on fewer physical resources and reclaim unused virtual resources.
SUMMARY
1. Review your organizations current capacity management strategy, and identify
opportunities to transform the approach by better aligning resource supply and demand.
2. When assessing your performance and capacity management capabilities as part of
systems management strategy, make sure you differentiate between monitoring and
reporting functions from the Decision Analysis and Control functions required to
proactively maintain your environment in the Desired State on an ongoing basis.
3. Deploy a Proof of Concept in an operational environment to benchmark the business
benefit that can be realistically derived by maintaining your environment in the Desired
State, reducing risk and driving greater operational and infrastructure efficiency.
9
ABOUT VMTURBO
Founded in 2009, VMTurbo is a company founded on the belief that IT operations
management needs to be fundamentally changed to allow your organization to unlock the
full value of todays virtualized infrastructure and cloud services. Our charter is to transform IT
operations in cloud and virtualized environments from a complex, labor intensive, and
volatile process to one that is simple, automated and predictabledelivering greater
control in maintaining a healthy state and consistent service delivery.
VMTurbo offers an innovative control system for virtualized data centers. By leveraging the
dynamic resource allocation abilities of virtualization and automating decisions for resource
allocation and workload placement in software, our solution ensures applications get the
resources required while maximizing utilization of IT assets. Over 9,000 enterprises worldwide
have selected VMTurbo, including British Telecom, Colgate, CSC and the London School of
Economics.
www.vmturbo.com
10