See
See
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
2 © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
The goal of this lesson is to help you understand cloud architecting. You will learn
about the AWS Well-Architected Framework, which you can use to improve your
cloud architecture. You can also use the Well-Architected Framework to gain a better
understanding of how design decisions can impact your business. You will review
each of the pillars that make up the Well-Architected Framework: operational
excellence, security, reliability, performance efficiency, cost optimization, and
sustainability.
Features of the Well-Architected Framework
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
What is the Well-Architected Framework?
4 © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Architecture is the art and science of designing and building large structures. Large
systems require architects to manage their size and complexity. Cloud architects
engage with decision makers to identify business goals, align technology deliverables
with those goals, and work with delivery teams to implement appropriate solutions.
The Well-Architected Framework helps cloud architects assess and improve their
architectures and get a better understanding of how their design decisions can
impact their business. It provides a set of questions that AWS experts have developed
to help customers think critically about their architecture, such as "Does your
infrastructure follow best practices?"
5 © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
The AWS WA Tool, available at no cost in the AWS Management Console, provides a
mechanism for regularly evaluating workloads, identifying high-risk issues, and
recording improvements.
AWS also provides access to a network of hundreds of members in the AWS Well-
Architected Partner Program. You can engage a partner in your area to help analyze
and review your applications.
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Well-Architected Framework pillars
Recover from
Protect and Eliminate Minimize
Deliver business failure and Use resources
monitor systems. unneeded environmental
value. mitigate sparingly.
expense. impacts.
disruption.
7 © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
The Well-Architected Framework helps you design your architecture from different
perspectives, or pillars. The pillars are operational excellence, security, reliability,
performance efficiency, cost optimization, and sustainability. Each pillar contains a set
of design principles and best practices.
You will learn about each pillar in more detail and discuss the design principles for
each pillar in the next slides.
Operational excellence (1 of 2)
The ability to monitor systems to do the
following:
Operational
excellence • Deliver business value.
• Continually improve supporting processes and
procedures.
Key topics:
• Manage and automate changes.
• Respond to events.
• Define standards to manage daily operations.
Deliver business
value.
8 © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
The first pillar is the operational excellence pillar. This pillar includes how your
organization supports your business objectives and your ability to run workloads
effectively. It also includes how your organization supports your ability to gain insight
into their operations and to continuously improve supporting processes and
procedures to deliver business value.
Key topics include managing and automating changes, responding to events, and
defining standards to successfully manage daily operations.
Operational excellence (2 of 2)
Operational excellence design principles:
Operational • Perform operations as code.
excellence • Make frequent, small, reversible changes.
• Refine operations procedures frequently.
• Anticipate failure.
• Learn from all operational events and failures.
Deliver business
value.
9 © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Perform operations as code: In the cloud, you can define your entire workload
(applications, infrastructure, and more) as code and update it with code. You can
script your operations’ procedures and automatically start them by initiating them in
response to events. By performing operations as code, you limit human error and
help ensure consistent responses to events.
Refine operations procedures frequently: As you use operations procedures, look for
opportunities to improve them. As you evolve your workload, evolve your procedures
appropriately. Set up regular game days to review and validate that all procedures are
effective and that teams are familiar with them.
Learn from all operational failures: Drive improvement through lessons learned from
all operational events and failures. Share what is learned across teams and through
the entire organization.
9
Security (1 of 2)
The ability to do the following:
• Monitor and protect information, systems, and
Security assets.
• Deliver business value through risk assessments and
mitigation strategies.
Key topics:
• Identify and manage who can do what.
• Establish controls to detect security events.
• Protect systems and services.
Protect and • Protect the confidentiality and integrity of data.
monitor systems.
10 © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
The security pillar describes how to take advantage of cloud technologies to protect
data, systems, and assets in a way that can improve your security posture.
The security pillar involves the ability to monitor and protect systems while delivering
business value through risk assessments and mitigation strategies. An example of
security in the cloud would be staying up to date with AWS and industry
recommendations and threat intelligence. Automation can be used for security
processes, testing, and validation to scale security operations.
Key topics include protecting data confidentiality and integrity, identifying and
managing who can do what (privilege management), protecting systems, and
establishing controls to detect security events.
Security (2 of 2)
Security design principles:
• Implement a strong identity foundation.
Security
• Enable traceability.
• Apply security at all layers.
• Automate security best practices.
• Protect data in transit and at rest.
• Keep people away from data.
• Prepare for security events.
Protect and
monitor systems.
11 © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Design principles that can help you strengthen your workload security are as follows:
• Implement a strong identity foundation: Implement the principle of least
privilege, and enforce separation of duties with appropriate authorization for each
interaction with your AWS resources. Centralize identity management, and aim to
eliminate reliance on long-term static credentials.
• Enable traceability: Monitor, alert, and audit actions and changes to your
environment in real time. Integrate log and metric collection with systems to
automatically investigate and take action.
• Apply security at all layers: Apply a defense in-depth approach with multiple
security controls. Apply to all layers (for example, edge of network, virtual private
cloud [VPC], load balancing, every instance and compute service, operating
system, application, and code).
• Automate security best practices: Automated software-based security
mechanisms improve your ability to securely scale more rapidly and cost-
effectively. Create secure architectures, including the implementation of controls
that are defined and managed as code in version-controlled templates.
• Protect data in transit and at rest: Classify your data into sensitivity levels and use
mechanisms, such as encryption, tokenization, and access control where
appropriate.
• Keep people away from data: Use mechanisms and tools to reduce or eliminate
the need for direct access or manual processing of data. This practice reduces the
risk of mishandling, modification, or human error when handling sensitive data.
• Prepare for security events: Prepare for an incident by having incident
management and investigation policy and processes that align to your
organizational requirements. Run incident response simulations, and use tools
with automation to increase your speed for detection, investigation, and recovery.
11
Reliability (1 of 2)
The ability of a system to do the
following:
Reliability • Recover from infrastructure or service failures.
• Dynamically acquire computing resources to meet
demand.
• Mitigate disruptions, such as the following:
• Misconfigurations
• Transient network issues
Recover from
failure and
mitigate
disruption.
12 © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
The reliability pillar encompasses the ability of a workload to perform its intended
function correctly and consistently when it is expected to. This ability includes
operating and testing the workload through its total lifecycle.
By carefully evaluating each of these elements, you can anticipate, respond to, and
prevent failures.
12
Reliability (2 of 2)
Reliability design principles:
• Test recovery procedures.
Reliability
• Automatically recover from failure.
• Scale horizontally.
• Stop guessing capacity.
• Manage change in automation.
Recover from
failure and
mitigate
disruption.
13 © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
13
Performance efficiency (1 of 2)
The ability to do the following:
Performance • Use computing resources efficiently to meet system
efficiency requirements.
• Maintain that efficiency as demand changes and
technologies evolve.
Use resources
sparingly.
14 © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
The performance efficiency pillar refers to using computing resources efficiently while
meeting system requirements. At the same time, it is important to maintain that
efficiency as demand fluctuates and technologies evolve. To implement performance
efficiency, take a data-driven approach to building a high-performance architecture.
Gather data on all aspects of the architecture from the high-level design to the
selection and configuration of resource types.
Reviewing your choices on a regular basis helps ensure that you are taking advantage
of the continually evolving AWS Cloud. Monitoring helps ensure that you are aware of
any deviance from expected performance. Make trade-offs in your architecture to
improve performance, such as using compression or caching, or relaxing consistency
requirements.
Factors that influence performance efficiency in the cloud include the following:
• Selection: It is important to choose the best solution that will optimize your
architecture. Solutions vary based on the kind of workload that you have, and you
can use AWS to customize your solutions in many different ways and
configurations.
• Review: You can continually innovate your solutions and take advantage of the
newer technologies and approaches that become available. Any of these newer
releases could improve the performance efficiency of your architecture.
• Monitoring: After you implement your architecture, you must monitor
performance to help ensure that you can remediate any issues before customers
are affected and aware of them. With AWS, you can use automation and monitor
your architecture with tools such as Amazon CloudWatch, Amazon Kinesis,
Amazon Simple Queue Service (Amazon SQS), and AWS Lambda.
• Trade-offs: An example of a trade-off that helps ensure an optimal approach is
trading consistency, durability, and space against time or latency to deliver higher
performance.
14
Performance efficiency (2 of 2)
Performance efficiency design principles:
Performance • Democratize advanced technologies.
efficiency • Go global in minutes.
• Use serverless architecture.
• Experiment more often.
• Consider mechanical sympathy.
Use resources
sparingly.
15 © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
The following design principles can help you achieve and maintain efficient workloads
in the cloud:
• Democratize advanced technologies: Technologies that are difficult to implement
can become simpler to consume by pushing that knowledge and complexity into
the cloud vendor’s domain. Instead of having your IT team learn how to host and
run a new technology, they can consume it as a service.
• Go global in minutes: With AWS, you can deploy your system in multiple AWS
Regions around the world. At the same time, you provide a lower latency and
better experience for your customers at minimal cost.
• Use a serverless architecture: Serverless computing is a cloud computing runtime
model where the cloud provider dynamically manages the allocation of machine
resources. Pricing is based on the actual amount of resources that an application
consumes instead of on pre-purchased units of capacity. In the cloud, you can use
serverless computing to reduce the need to run and maintain traditional servers
for compute activities. It also removes the operational burden and can lower
transactional costs.
• Experiment more often: With virtual and automatable resources, you can quickly
carry out comparative testing by using different types of instances, storage, or
configurations.
• Have mechanical sympathy: This principle suggests that you use the technology
approach that best aligns to what you are trying to achieve. For example, consider
data access patterns when you select database or storage approaches.
Cost optimization (1 of 2)
The ability to avoid or eliminate the
following:
Cost
• Unneeded costs
optimization
• Suboptimal resources
Eliminate
unneeded
expense.
16 © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Cost optimization refers to the ability to avoid or eliminate unneeded expenses and
resources. It is a continual process of refinement and improvement over the span of a
workload’s lifecycle.
Similar to the other pillars within the Well-Architected Framework, cost optimization
has trade-offs to consider. For example, you want to consider whether to optimize for
speed-to-market or for cost. In some cases, it’s best to optimize for speed—going to
market quickly, shipping new features, or meeting a deadline—rather than investing
in upfront cost optimization.
Design decisions are sometimes directed by haste rather than data. The temptation
always exists to overcompensate rather than spend time benchmarking for the most
cost-optimal deployment. Overcompensation can lead to over-provisioned and
under-optimized deployments. However, it might be a reasonable choice if you must
lift and shift resources from your on-premises environment to the cloud and then
optimize afterward.
It's important to invest the right amount of effort in a cost optimization strategy
upfront. By doing so, you can realize the economic benefits of the cloud more readily
by helping ensure a consistent adherence to best practices and avoiding unnecessary
overprovisioning. The following sections provide techniques and best practices for
the initial and ongoing implementation of cloud financial management and cost
optimization for your workloads.
16
Cost optimization (2 of 2)
Cost optimization design principles:
Cost • Implement cloud financial management.
optimization • Adopt a consumption model.
• Measure overall efficiency.
• Reduce spending on data center operations.
• Analyze and attribute expenditures.
Eliminate
unneeded
expense.
17 © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
17
Sustainability (1 of 2)
The ability to minimize the following:
• Impact of workloads on the environment
Sustainability
• Carbon emissions
• Energy consumptions
• Waste
Minimize
environmental
impacts.
18 © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
When building cloud workloads, the practice of sustainability includes the following:
• Understanding the impacts of the services used
• Quantifying impacts through the entire workload lifecycle
• Applying design principles and best practices to reduce these impacts
You can use the AWS Cloud to run workloads designed to support your wider
sustainability challenges. Examples of these challenges include reducing carbon
emissions, lowering energy consumption, recycling water, or reducing waste in other
areas of your business or organization.
Sustainability through the cloud is when you use AWS technology to solve a broader
sustainability challenge. For example, you can use a machine learning service like
Amazon Monitron to detect abnormal behavior in industrial machinery. Using this
detection data, you can conduct preventive maintenance to reduce the risk of
environmental incidents caused by unexpected equipment failures. Thus, you can
help ensure that the machinery continues to operate at peak efficiency.
18
Sustainability (2 of 2)
Sustainability design principles:
• Understand your impact.
Sustainability
• Establish sustainability goals.
• Maximize utilization.
• Anticipate and adopt new, more efficient hardware
and software offerings.
• Use managed services.
• Reduce the downstream impact of your cloud
workloads.
Minimize
environmental
impacts.
19 © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Apply these design principles when architecting your cloud workloads to maximize
sustainability and minimize impact:
• Understand your impact: Measure the impact of your cloud workload, and model
the future impact of your workload. Include all sources of impact, including
impacts that result from customer use of your products and from their eventual
decommissioning and retirement. Compare the productive output with the total
impact of your cloud workloads by reviewing the resources and emissions required
per unit of work. Use this data to establish KPIs, evaluate ways to improve
productivity while reducing impact, and estimate the impact of proposed changes
over time.
• Establish sustainability goals: For each cloud workload, establish long-term
sustainability goals, such as reducing the compute and storage resources required
per transaction. Model the ROI of sustainability improvements for existing
workloads, and give owners the resources that they need to invest in sustainability
goals. Plan for growth. Architect your workloads so that growth results in reduced
impact intensity measured against an appropriate unit, such as per user or per
transaction. These goals help you support the wider sustainability goals of your
business or organization, identify regressions, and prioritize areas of potential
improvement.
• Maximize utilization: Right-size workloads and implement efficient design to help
ensure high utilization and maximize the energy efficiency of the underlying
hardware. Two hosts running at 30 percent utilization are less efficient than one
host running at 60 percent due to baseline power consumption per host. At the
same time, eliminate or minimize idle resources, processing, and storage to reduce
the total energy required to power your workload.
• Anticipate and adopt more efficient hardware and software offerings: Support
upstream improvements that your partners and suppliers make to reduce the
impact of your cloud workloads. Continually monitor and evaluate new, more
efficient hardware and software offerings. Design for flexibility to facilitate the
rapid adoption of new efficient technologies.
• Use managed services: Sharing services across a broad customer base helps
maximize resource utilization, which reduces the amount of infrastructure needed
to support cloud workloads. For example, customers can share the impact of
common data center components like power and networking by migrating
workloads to the AWS Cloud and adopting managed services. Use managed
services that can help minimize your impact, such as automatically moving
infrequently accessed data to cold storage, to adjust capacity to meet demand. For
example, use Amazon Simple Storage Service (Amazon S3) Lifecycle configurations
or Amazon EC2 Auto Scaling.
• Reduce the downstream impact of your cloud workloads: Reduce the amount of
energy or resources required to use your services. Reduce or eliminate the need
for customers to upgrade their devices to use your services. Test using device
farms to understand expected impact, and test with customers to understand the
actual impact from using your services.
19
Checkpoint questions
20 © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
21 © 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2022 Amazon Web Services, Inc. or its affiliates. All rights reserved. This work may not be reproduced or redistributed, in whole or in part, without prior written permission from Amazon Web Services, Inc.
Commercial copying, lending, or selling is prohibited. Corrections, feedback, or other questions? Contact us at https://support.aws.amazon.com/#/contacts/aws-training. All trademarks are the property of their
owners.