Posts in tag

SRE


When you’re troubleshooting an issue, finding the root cause often involves finding specific logs generated by infrastructure and application code. The faster you can find logs, the faster you can confirm or refute your hypothesis about the root cause and resolve the issue. Today, we’re pleased to announce a simpler way to find logs in …

New Relic (NYSE: NEWR), the observability company, announced the general availability of a new infrastructure monitoring experience to empower DevOps, SRE and ITOps teams to proactively identify and resolve issues in their public, private and hybrid cloud infrastructure. The modernized experience allows engineers to instantly isolate bottlenecks by filtering and sorting based on golden signal conditions, analyze all …

Founded by Google SRE alumni, it is no surprise that Loon’s Production Engineering/SRE team instituted a culture of blameless postmortems that became a key feature of Loon’s approach to incident response. Blameless postmortems originated as an aerospace practice in the mid-20th century, so it was particularly fitting that they came full circle to be used …

Terraform is an open source Infrastructure as Code tool that is popular with platform developers building reusable cloud automation. The Terraform Provider for Google Cloud Platform continues to add support for the latest Google Cloud features, such as Anthos on GKE, and our teams continue to expand Terraform integrations including Cloud Foundation Toolkit and Terraform …

Reliability matters. When users can’t access your application, if it’s slow to respond, or it behaves unexpectedly, they don’t get the value that you intend to provide. That’s why at Google we like to say that reliability is the most important feature of any system. Its impact can be seen all the way to the …

Editor’s note: Today we hear from Kenny Kon, an SRE Director at Sabre. Kenny shares about how they have been able to successfully adopt Google’s SRE framework by leveraging their partnership with Google Cloud.  As a leader in the travel industry, Sabre Corporation is driving innovation in the global travel industry and developing solutions that …

Software intelligence company Dynatrace (NYSE: DT) announced the findings from an independent global survey of 1,300 development and DevOps leaders, which revealed the primary challenges organizations are facing as they attempt to keep up with demand for digital innovation. The research highlighted that scaling DevOps and SRE practices is critical to accelerating the release of high-quality …

Over the past seven years, more than 32,000 professionals worldwide have taken part in the Accelerate State of DevOps reports, making it the largest and longest-running research of its kind. Year over year, the Accelerate State of DevOps reports provide data-driven industry insights that examine the capabilities and practices that drive software delivery as well as …

One facet of our work as Customer Reliability Engineers—Google Site Reliability Engineers (SREs) tapped to help Google Cloud customers develop that practice in their own organizations—is advising operations or SRE teams to improve their operational maturity. We’ve noticed a recurring question cropping up across many of these discussions, usually phrased along the lines of “is what …

Site Reliability Engineering (SRE) is a hot topic, but what exactly does it entail? And do you have to follow the principles to a T in order to achieve benefits from it? If you’re searching for answers to these common questions, look no further. In this episode of the Cloud & Culture podcast, VMware Tanzu’s …