, , , , , , , ,

Dynatrace AI predicts SLO violations and pinpoints root causes proactively

Dynatrace enables Site Reliability Engineering (SRE) teams to proactively ensure the highest service quality levels. Davis, the Dynatrace AI engine, identifies potential contributors to SLO violations in real time, before thresholds…
, , , , , , , , ,

Architected for resiliency: How Dynatrace withstands data center outages

Software reliability and resiliency don’t just happen by simply moving your software to a modern stack, or by moving your workloads to the cloud. There is no “Resiliency as a Service” you can connect to via an API that makes your service…
, , , , , ,

What is continuous delivery and what are best practices for implementing it?

In my previous article about continuous integration and continuous delivery (CI/CD), I defined CI/CD and explained how these practices work together to help DevOps teams deliver quality software faster. In this article, I take a deeper look…
, ,

Key takeaways from o11yfest 2021 – SRE, Observability, and OpenTelemetry

From May 17 to May 18, 2021, the Open-Source Engineering team at Dynatrace attended the virtual observability conference, o11yfest. The conference aims to increase the “awareness in OpenTelemetry and other relevant projects and techniques…
,

SRE vs DevOps: What you need to know

The events of 2020 accelerated the trend of organizations shifting to cloud-native technologies in response to the dramatic increase in demand for online services. Cloud-native environments bring speed and agility to software development and…
, , , , ,

A guide to event-driven SRE-inspired DevOps

“This is a mouthful of buzzwords” is how I started my recent presentations at the Online Kubernetes Meetup as well as the DevOps Fusion 2020 Online Conference when explaining the three big challenges we are trying to solve with Keptn –…
, , , , , ,

Tutorial: Guide to automated SRE-driven performance engineering

In this blog, I will be going through a step-by-step guide on how to automate SRE-driven performance engineering. You may have seen over the past few months we have been extensively promoting Service Level Indicators (SLIs) and Service Level…
, , , , , , , , , , , ,

Quickstart to Autonomous Cloud with Keptn on GKE

Self-Service Progressive Delivery of Microservices, Automated SLI/SLO based Quality Gates, Continuous Feedback through ChatOps and Automatic Remediation of Production Issues are some of the capabilities you expect from a modern cloud-native…
, , , , ,

Shift-Left SRE: Building Self-Healing into your Cloud Delivery Pipeline

Site Reliability Engineering is an exciting discipline in our industry. There are many aspects to building reliable and resilient systems which I won’t be able to cover in a single blog post. What I learned from many conversations with…