The Wayback Machine - https://web.archive.org/web/20180303124653/http://devops.sys-con.com:80/node/4213558

Welcome!

@DevOpsSummit Authors: Angsuman Dutta, Stackify Blog, Elizabeth White, Derek Weeks, Liz McMillan

Related Topics: @CloudExpo, Microservices Expo, Containers Expo Blog, @DevOpsSummit

@CloudExpo: Article

Test Environment Stability | @DevOpsSummit #APM #CloudNative #Monitoring

Find confidence in the systems you are delivering

How often is an environment unavailable due to factors within your project's control? How often is an environment unavailable due to external factors? Is the software and hardware in the environment up to date with the target production systems? How often do you have to resort to manual workarounds due to an environment?

These are all questions that you should ask yourself if testing environments are consistently unavailable and affected by outages. Here are three key metrics that you can track that can help avoid costly downtime.

Metric: Availability and Uptime Percentage
QA and staging environments seldom require the same level of uptime as production, but tell that to a team of developers working 24/7 on a project that has an aggressive deadline and you will have a very unhappy team. As a test environment manager, you know that when a QA system is unavailable, you get immediate calls from developers and managers.

Understanding why an outage happened is critical for communicating with a development team. If you follow a problem management process for production outages, similar to the process throughout the rest of your IT environment, then you should also follow this process with test environment management. Very often a QA environment will become unavailable due to a factor outside the control of a test environment manager. If one team pushes bad code that interrupts the QA process for all teams you need to be able to identify this clearly.

How to measure availability and uptime?
To truly understand availability and uptime, you must keep track of system availability with a standard monitoring tool such as Zabbix or Nagios. If your systems are visible to the public internet, you can also use hosted platforms like Pingdom to measure system availability.

An uptime of 95% is usually sufficient for a QA or staging environment. If your development is limited to a few time zones, you can also further qualify this by only measuring availability during development hours. While production availability commitment is often closer to 99% or 99.5%, you don't have to treat every QA outage as an emergency. But, be aware that developers may have other opinions-95% uptime still allows for eight hours of downtime a week, so it may be best to aim higher.

How does this metric motivate concrete action?
When you measure system availability and make these numbers public, you encourage test environment managers to make a commitment to uptime. This results in fewer obstacles for QA and development, allowing them to deliver software faster. There's nothing more debilitating to an organization than disruptions in QA and testing. Keeping a close eye on availability and tracking as a metric also encourages movement towards always-available QA systems.

Metric: Mean Time Between Outages
If a system has a 95% availability, then almost 75 minutes of downtime is acceptable every day. If the system fails for ten minutes every hour during an eight-hour work day due to a build or deployment, you'll be creating a QA or staging environment that has a 5% chance of losing developer and QA confidence. To get an accurate picture of system availability you need to couple an availability percentage metric with the mean time between outages (MTBO).

How to Measure MTBO?
If you follow a process that keeps track of outages and strives to understand the root causes of these outages, you'll amass a database of issues that can be used to derive the MTBO. With a monitoring system configured to calculate availability percentages automatically, you can use this same system to record your MTBO.

Example Metric: Goal for MTBO
A goal for MTBO, depends greatly on your availability goal. The lower your availability goal, the higher your MTBO should be. For example, if you have a 95% uptime commitment then your outages need to be spaced over a day or a week. You might have eight hours of downtime each weekend to perform system upgrades or a nightly build and deploy process that takes about an hour, but what you can't have is an MTBO of 45-60 minutes. This will mean that QA and staging systems will be unavailable for a few minutes every hour, which will result in dissatisfied customers.

How does this metric motivate concrete action?
If your MBTO is very short, this suggests that build and deploy activity from a continuous integration environment is frequently interrupting both development and QA. If your MBTO is very high, but your availability is very low (95% or lower) this means that you are experiencing multi-hour downtime at least once a day.

When measuring MBTO, encourage your release engineers and test environment managers to work together to create build and deployment scripts that don't affect availability, and encourage staff to approach QA and staging uptime with care. Without this metric, you run the risk of having teams grow complacent with frequent, low-level unavailability as long as they satisfy overall availability metrics.

Metric: Downtime requirement for a test environment build and deploy
When software is deployed to any system, there is a natural tendency for disruption. If new code is being deployed to an application server, that server often requires a restart so that new code can be loaded. If a web server, such as Apache or Nginx, is being reconfigured this often requires a fast restart measured in seconds.

Some of these build and deploy related disruptions can be avoided through the use of load balancers and clusters of machines. On the largest projects, this is essential in both production as well as staging and QA systems.

An example is a QA system for a large bank's transaction processing system. There are so many teams that depend on this system to be up and running 24/7 that causing any disruption would run the risk of freezing the QA process across the entire company.

Other build and deploy downtimes are unavoidable. A frequent example is when there are changes to a database schema. Certain changes to tables and indexes require systems to be stopped and rebooted to reach a state where database activity isn't competing with DDL statements.

The downtime requirement for a given build and deploy to a test environment is a central measure that is directly related to the availability metrics already discussed.

How to Measure Build/Deploy Downtime?
It's simple: run a build and deployment and keep track of the downtime that falls into the timespan of each build and deploy function. If you have a continuous integration system, such as Jenkins or Bamboo, grab the timestamps of the last few builds and look at your monitoring metrics on QA and staging to see if there is a system impact.

Example Metric: Goal for Build/Deploy Downtime
Your goal for this metric depends on your level of availability. If you are working on a shared service, build and deploy downtime requirements should be as close to zero as possible. If you are working on a less critical application, then build and deploy downtime should be measured in minutes or seconds.

How does this Metric Motivate Concrete Action?
This metric encourages release engineers and test environment managers to drive build and deploy downtime to zero. With the tools available to developers and DevOps professionals, it is possible to achieve zero-downtime deployments to QA and staging systems. Doing this will give internal customers more confidence in the systems you are delivering.

DXWorldEXPO LLC, the producer of the world's most influential technology conferences and trade shows has announced the conference tracks for CloudEXPO | DXWorldEXPO 2018 New York.

DXWordEXPO, colocated with Cloud Expo will be held November 11-13, 2018, in New York City.

Digital Transformation (DX) is a major focus with the introduction of DXWorldEXPO within the program. Successful transformation requires a laser focus on being data-driven and on using all the tools available that enable transformation if they plan to survive over the long term.

A total of 88% of Fortune 500 companies from a generation ago are now out of business. Only 12% still survive. Similar percentages are found throughout enterprises of all sizes.

Register for Full Conference "Gold Pass" ▸ Here (Expo Hall ▸ Here)

DXWordEXPO 2018 New York and Cloud Expo 2018 New York agenda presents 222 rockstar faculty members, 200 sessions and 22 keynotes and general sessions in 10 distinct conference tracks.

  • Cloud-Native | Serverless
  • DevOpsSummit
  • FinTechEXPO - New York Blockchain Event
  • CloudEXPO - Enterprise Cloud
  • DXWorldEXPO - Digital Transformation (DX)
  • Smart Cities | IoT | IIoT
  • AI | Machine Learning | Cognitive Computing
  • BigData | Analytics
  • The API Enterprise | Mobility | Security
  • Hot Topics | FinTech | WebRTC

Register for Full Conference "Gold Pass" ▸ Here (Expo Hall ▸ Here)

2018 Conference Agenda and Tracks, November 11-13, New York City

CloudEXPO covers all of these tools, with the most comprehensive program and with 222 top world-class speakers throughout our industry presenting Keynotes, General Sessions, Breakout Sessions along 10 focused tracks, as well as our signature Power Panels. Our Expo Floor brings together the world's leading companies throughout the world of Cloud Computing, Digital Transformation, and all they entail.

As your enterprise creates a vision and strategy that enables you to create your own unique, long-term success, learning about all the technologies involved is essential. Companies today not only form multi-cloud and hybrid cloud architectures, but create them with built-in cognitive capabilities.

Cloud-native thinking is now the norm in financial services, manufacturing, telco, healthcare, transportation, energy, media, entertainment, retail and other consumer industries, as well as the public sector.

CloudEXPO is the world's most important, independent event where technology buyers and vendors meet to experience and discuss the big picture of Digital Tranformation and all of the strategies, tactics, and tools they need to realize their goals.

FinTech Is Now Part of the DXWorldEXPO / CloudEXPO Program!

Register for Full Conference "Gold Pass" ▸ Here (Expo Hall ▸ Here)

Accordingly, attendees at the upcoming 22nd DXWorldEXPO | CloudEXPO November 11-13, 2018 in New York City will find fresh new content in two new tracks called:

  • FinTechEXPO
  • New York Blockchain Event

which will incorporate machine learningartificial intelligencedeep learning, and blockchain in two distinct tracks.

Financial enterprises in New York City, London, Singapore, and other world financial capitals are embracing a new generation of smart, automated FinTech that eliminates many cumbersome, slow, and expensive intermediate processes from their businesses.

FinTech brings efficiency as well as the ability to deliver new services and a much improved customer experience throughout the global financial services industry. FinTech is a natural fit with cloud computing, as new services are quickly developed, deployed, and scaled on public, private, and hybrid clouds.

More than US$20 billion in venture capital is being invested in FinTech this year. DXWorldEXPO / CloudEXPO is pleased to bring you the latest FinTech developments as an integral part of our program.

CloudEXPO is accepting submissions for this new track, so please visit www.CloudComputingExpo.com for the latest information or contact us at [email protected].

Register for Full Conference "Gold Pass" ▸ Here (Expo Hall ▸ Here)

Download Slide Deck ▸ Here

Only CloudEXPO brings together all this in a single location:

Attend CloudEXPO. Build your own custom experience. Learn about the world's latest technologies and chart your course to Digital Transformation.

22nd International CloudEXPO, taking place November 11-13, 2018, in New York City, will feature technical sessions from a rock star conference faculty and the leading industry players in the world.

Sponsorship Opportunities

Download Slide Deck: ▸ Here

Cloud computing is now being embraced by a majority of enterprises of all sizes. Yesterday's debate about public vs. private has transformed into the reality of hybrid cloud: a recent survey shows that 74% of enterprises have a hybrid cloud strategy. Meanwhile, 94% of enterprises are using some form of XaaS - software, platform, and infrastructure as a service.

With major technology companies and startups seriously embracing Cloud strategies, now is the perfect time to attend and learn what is going on, contribute to the discussions, and ensure that your enterprise is on the right path to Digital Transformation.

Every Global 2000 enterprise in the world is now integrating cloud computing in some form into its IT development and operations. Midsize and small businesses are also migrating to the cloud in increasing numbers.

Download Slide Deck: ▸ Here

Companies are each developing their unique mix of cloud technologies and services, forming multi-cloud and hybrid cloud architectures and deployments across all major industries. Cloud-driven thinking has become the norm in financial services, manufacturing, telco, healthcare, transportation, energy, media, entertainment, retail and other consumer industries, and the public sector.

CloudEXPO is the single show where technology buyers and vendors can meet to experience and discus cloud computing and all that it entails. Sponsors of CloudEXPO will benefit from unmatched branding, profile building and lead generation opportunities through:

  • Featured on-site presentation and ongoing on-demand webcast exposure to a captive audience of industry decision-makers.
  • Showcase exhibition during our new extended dedicated expo hours
  • Breakout Session Priority scheduling for Sponsors that have been guaranteed a 35-minute technical session
  • Online advertising on 4,5 million article pages in SYS-CON's i-Technology Publications
  • Capitalize on our Comprehensive Marketing efforts leading up to the show with print mailings, e-newsletters and extensive online media coverage.
  • Unprecedented PR Coverage: Unmatched editorial coverage on Cloud Computing Journal.
  • Tweetup to over 100,000 plus Twitter followers
  • Press releases sent on major wire services to over 500 industry analysts.

For more information on sponsorship, exhibit, and keynote opportunities, contact [email protected].

Speaking Opportunities

Download Slide Deck: ▸ Here

The upcoming 22nd International DXWorldEXPOCloudEXPO, November 11-13, 2018 in New York City, NY announces that its Call For Papers for speaking opportunities is open.

Submit your speaking proposal ▸ Here or by email [email protected].

About DXWorldEXPO LLC

Download Slide Deck: ▸ Here

DXWorldEXPO LLC is a Lighthouse Point, Florida-based trade show company and the creator of DXWorldEXPODigital Transformation Conference & Expo. The company produces and presents CloudEXPO, DevOpsSummitFinTechEXPO Blockchain Event, the world's most influential conferences and trade shows.

More Stories By Dalibor Siroky

Dalibor Siroky, CEO and Co-Founder, Plutora: Dalibor has close to 15 years of leadership, consulting, enterprise product, and operations experience in Australia, Asia, and Europe. Before co-founding Plutora, Dalibor was the founder and managing director of Finotaur, a leading provider of independent management consulting services to Wealth Management, Investment Management, Private Banking, and Payment institutions within the Asia Pacific region. Earlier in his career, Dalibor served as the CIO of financial advisory software at Macquarie Bank, head of solution architecture at Commonwealth Bank of Australia, and as a management consultant at PricewaterhouseCoopers. Dalibor holds an MS in Software Engineering with distinction from the University of Oxford and an MBA with honors from the University of Chicago. Dalibor is also a graduate of the Royal Military College of Australia and served as a Captain in the Army.

@DevOpsSummit Stories
Let's do a visualization exercise. Imagine it's December 31, 2018, and you're ringing in the New Year with your friends and family. You think back on everything that you accomplished in the last year: your company's revenue is through the roof thanks to the success of your product, and you were promoted to Lead Developer. 2019 is poised to be an even bigger year for your company because you have the tools and insight to scale as quickly as demand requires. You're a happy human, and it's not just because of the bubbly in your glass. Now how does one turn this visualization into reality? You start by setting yourself up with the right technology to succeed. Behind every great cloud app is a fleet of powerful cloud monitoring tools that provide insight and direction for improving your product. This is the ticket for turning 2018 into the year of your dreams.
Enterprises are adopting Kubernetes to accelerate the development and the delivery of cloud-native applications. However, sharing a Kubernetes cluster between members of the same team can be challenging. And, sharing clusters across multiple teams is even harder. Kubernetes offers several constructs to help implement segmentation and isolation. However, these primitives can be complex to understand and apply. As a result, it’s becoming common for enterprises to end up with several clusters. This leads to a waste of cloud resources and increased operational overhead.
“delaPlex Software provides software outsourcing services. We have a hybrid model where we have onshore developers and project managers that we can place anywhere in the U.S. or in Europe,” explained Manish Sachdeva, CEO at delaPlex Software, in this SYS-CON.tv interview at CloudEXPO, held at the Javits Center in New York City, NY.
You know you need the cloud, but you're hesitant to simply dump everything at Amazon since you know that not all workloads are suitable for cloud. You know that you want the kind of ease of use and scalability that you get with public cloud, but your applications are architected in a way that makes the public cloud a non-starter. You're looking at private cloud solutions based on hyperconverged infrastructure, but you're concerned with the limits inherent in those technologies. What do you do?
As Cybric's Chief Technology Officer, Mike D. Kail is responsible for the strategic vision and technical direction of the platform. Prior to founding Cybric, Mike was Yahoo's CIO and SVP of Infrastructure, where he led the IT and Data Center functions for the company. He has more than 24 years of IT Operations experience with a focus on highly-scalable architectures.