aster.cloud aster.cloud
  • /
  • Platforms
    • Public Cloud
    • On-Premise
    • Hybrid Cloud
    • Data
  • Architecture
    • Design
    • Solutions
    • Enterprise
  • Engineering
    • Automation
    • Software Engineering
    • Project Management
    • DevOps
  • Programming
    • Learning
  • Tools
  • About
  • /
  • Platforms
    • Public Cloud
    • On-Premise
    • Hybrid Cloud
    • Data
  • Architecture
    • Design
    • Solutions
    • Enterprise
  • Engineering
    • Automation
    • Software Engineering
    • Project Management
    • DevOps
  • Programming
    • Learning
  • Tools
  • About
aster.cloud aster.cloud
  • /
  • Platforms
    • Public Cloud
    • On-Premise
    • Hybrid Cloud
    • Data
  • Architecture
    • Design
    • Solutions
    • Enterprise
  • Engineering
    • Automation
    • Software Engineering
    • Project Management
    • DevOps
  • Programming
    • Learning
  • Tools
  • About
  • DevOps
  • Platforms

DevOps Vs. SRE Vs. Platform Engineering? The Gaps Might Be Smaller Than You Think

  • aster.cloud
  • July 5, 2022
  • 7 minute read

Platform engineering is the new cool kid on the block that everyone wants to be friends with. However, many are still confused where this new discipline comes from and how it differentiates from more established practices like DevOps and SRE. In this article, we provide some historical context and explain how they all relate to each other.

With the platform engineering movement picking up steam at an impressive pace over the last two years, there’s a lot of excitement as well as confusion around the definition of this new space and how it compares to the more established SRE and DevOps categories. On the Platform Engineering Slack we see many new members coming over from traditional SRE and DevOps roles. Some want a change in their career, some think platform engineering is the new hot thing they can leverage for a salary raise, some are confused what the difference to their current role is.


Partner with aster.cloud
for your next big idea.
Let us know here.



From our partners:

CITI.IO :: Business. Institutions. Society. Global Political Economy.
CYBERPOGO.COM :: For the Arts, Sciences, and Technology.
DADAHACKS.COM :: Parenting For The Rest Of Us.
ZEDISTA.COM :: Entertainment. Sports. Culture. Escape.
TAKUMAKU.COM :: For The Hearth And Home.
ASTER.CLOUD :: From The Cloud And Beyond.
LIWAIWAI.COM :: Intelligence, Inside and Outside.
GLOBALCLOUDPLATFORMS.COM :: For The World's Computing Needs.
FIREGULAMAN.COM :: For The Fire In The Belly Of The Coder.
ASTERCASTER.COM :: Supra Astra. Beyond The Stars.
BARTDAY.COM :: Prosperity For Everyone.

This blog post is aimed at giving an overview of the trend of platform engineering and hopefully bring some clarity around the definitions and differences between these terms. This is just one opinion so feel free to add to it, bash it or share your own in Slack or maybe directly at PlatformCon.

Evolution of DevOps

Let’s start from the beginning of cloud times, back in 2006 when Werner Vogels, CTO of Amazon, famously yelled on stage “you build it, you run it”. A new DevOps era was ushered into existence, together with AWS, and the convergence of developers and operations seemed the brave new world. The DevOps bible Accelerate described the 4 key DevOps metrics (lead time, deployment frequency, change failure rate and mean time to recovery or MTTR), which have since become an industry gold standard. Leading engineering organizations leveraged this new philosophy to develop, deliver and ship software faster and better than ever. Some were now able to deploy 100s or 1000s of times a day and deliver value to their customer at a speed that was unthinkable in the old throw-over-the-fence days.

Sounds great right? And it is. But only for a selected few.

DevOps evolution
Source: 2020 State of DevOps Report by Puppet

While many organizations are advancing on their DevOps journey, studies like the State of DevOps Report by Puppet or Humanitec’s Benchmarking Study have shown that too many teams are still stuck in the middle and can’t cross what Humanitec calls the DevOps Mountain of Tears.

DevOps Mountain of Tears: DevOps score 0-100 among respondents. 
DevOps Mountain of Tears: DevOps score 0-100 among respondents.
Source: DevOps Benchmarking Study 2021 

Why are these organizations stuck? Why can’t they enter the brave new world of DevOps? There can be a number of reasons of course. But a consistent theme that we see across industry studies, as well as in our Platform Engineering community, is that most of them are overwhelmed by the sheer complexity of their shiny new cloud native setups. Teams often embark on their cloud journeys thinking a microservice architecture running on Kubernetes will fix all their problems.

Read More  Hewlett Packard Enterprise Announces Simple And Affordable Next Generation Storage Solution For Small Businesses
Daniel Bryant
Source: Daniel Bryant @ PlatformCon 2022

The reality is usually very different. Cognitive load on developers constantly increased during the last 10 years. And in most cases such migrations are massively underestimated, ending up in a setup that is very hard to maintain, with teams not having all the right skill sets in place to make the most of their brand new infrastructure. Simply shifting left and telling developers they now do DevOps is a recipe for disaster, which often results in more senior developers doing “shadow ops”, while the actual Infra or Ops team is completely overwhelmed with tickets.

‍

The reality is usually very different and in most cases such migrations are massively underestimated, ending up in a setup that is very hard to maintain, with teams not having all the right skill sets in place to make the most of their brand new infrastructure. Simply shifting left and telling developers they now do DevOps is a recipe for disaster, which often results in more senior developers doing “shadow ops”, while the actual Infra or Ops team is completely overwhelmed with tickets.

Fake SRE

The SRE story is similar. Established and popularized by Google, this concept was sold to many engineering organizations as the dream culture everyone should aspire to have. And there is nothing against that at all. SLOs and error budgets are great metrics. There is nothing more valuable than having a reliable production environment with an uptime of 99,9%.

🔮 The widespread adoption of fake-SRE is going to be a disaster for organizations and the reputation of the IT industry, undoing a decade of good practices.

It will, however, enrich some unscrupulous IT outsourcing companies. 💸 #SRE #fakeSRE https://t.co/Cv2j6438p1— Matthew Skelton #BLM 💙🌻 (@matthewpskelton) August 26, 2021

But the reality often looks a bit different. What do you do if your quarterly error budget is already eaten up after two weeks? What happens when your SREs are constantly overworked and close to burnout because of too many unplanned night shifts? Suddenly, SRE can become a pretty restrictive function and your SRE team will think twice before they accept any further deployments.

Unlike at Google, SREs in most organizations do not have the capacity to constantly think about better ways to enable developer self-service, improve architecture and infrastructure tooling, while also establishing an observability and tracing setup. Most SRE teams are just trying to survive.

This leads pretty often to a very conservative mindset. Many SREs – for good reasons – see themselves as gatekeepers to prevent the next disaster.

Read More  Top 25 Cloud Computing Service Provider Companies (2021)
Dev vs. SRE
Source: DevOps Topologies

We see many teams striving to implement best practices from elite engineering organizations, without taking into account the key differences between the respective setups and, especially, resources. The quote below from DevOps Topologies captures this anti-pattern quite well:

 “The Ops engineers now get to call themselves SREs but little else has changed. Devs still throw software that is only ‘feature-complete’ over the wall to SREs. Software operability still suffers because Devs are no closer to actually running the software that they build, and the SREs still don’t have time to engage with Devs to fix problems when they arise.”

Platform teams to the rescue!

So how can you avoid such anti-patterns? You guessed it, you set up an internal platform team whose main objective is to build an Internal Developer Platform, or IDP. The book Team Topologies has become a standard reference on this and clearly shows how structuring an engineering organization to have dedicated platform teams that provide an IDP as a product to their customers, the developers, is the best way to overcome the fake SRE or shadow Ops anti-patterns.

This is backed up by the research and data in the Puppet’s State of DevOps report for both 2020 and 2021, which found a strong correlation between the increased performance of an engineering organization and the usage of internal platforms.

use of internal platforms and level of Devops evolution
Source: 2020 State of DevOps Report by Puppet

The core of platform engineering: product mindset!

This is something that most vendors of platform tooling like control planes tend not to tell you. They want you to believe using their product is the solution to all your problems: “Great, I don’t have to hire a PM for this, I can just outsource it to this tool!”.

But this is extremely unlikely to work. A functional Internal Developer Platform needs to be owned by an internal platform team that treats it as a product and keeps iterating on it, based on feedback from the rest of the engineering organization. If you want to dive deeper, check out this recent talk by Manuel Pais (co-author Team Topologies) on why you should treat your Platform as a Product at PlatformCon 2022.

Making sure you enforce a product mindset when setting up your platform engineering team is key for long term success. What you definitely don’t want your platform team to become is a glorified help desk that is constantly trying to keep up with a bunch of support operations tickets.

This was formulated very clearly in the dedicated section on the Thoughtworks Tech Radar:

“Using a product-thinking approach can help you clarify what each of your internal platforms should provide, depending on its customers. Companies that put their platform teams behind a ticketing system like an old-school operations silo find the same disadvantages of misaligned prioritization: slow feedback and response, resource allocation contention and other well-known problems caused by the silo. We’ve also seen several new tools and integration patterns for teams and technologies emerge, allowing more effective partitioning of both.”

This also means your platform team should build out a dedicated product management functionality. You should treat this team as a full blown product team, not just a new operations hybrid with a trendier name.

Read More  Securing Kubernetes Secrets With HashiCorp Vault

Here’s a handful of guiding principles we gathered from some of the top performing engineering organizations that have built successful platform cultures

  • Optimize for speed, not for cost (Courtney Kissler)
  • Optimize for better developer experience and reduction of cognitive load
  • Do your user research on what a great developer experience means to your dev team; otherwise you will end up like this
  • Come up with a good balance of the cognitive load tradeoff and find the right degree of abstraction for your team (see Humanitec’s DevOps Benchmarking Study 2021)
  • Provide golden paths, not cages
  • Evangelize your platform internally. Train and onboard users like you’d do for any other product. For more inspiration check Galo Navarro’s recent talk Salesman tricks for the platform engineer.

The goal should always be to build an Internal Developer Platform that serves the needs of your internal customers, the engineers. The results will hopefully be more stability and less configuration drift (SRE, you did it!), while also creating a true you build, you run it culture (DevOps, yey!).

This is really exciting, if you can execute on it well. There’s a constantly growing amount of conversations and threads around these topics in the Platform Engineering community. I was just speaking to a platform PM the other day who was quite enthusiastic about it. In his words, he “finally found a space and community that shares this product mindset when building platforms. Most other SRE or traditional DevOps communities tend to have a much narrower focus.”

I am particularly excited about the latest initiative by the community, the first PlatformCon earlier in June this year. It was great to see how more than 6.000 platform practitioners, DevOps and SRE folks, and product managers engaged with the speakers and with one another. We had more than 70 speakers, among them thought leaders like Gregor Hohpe (author Cloud Strategy), Nicki Watt (OpenCredo) and Nigel Kersten (Puppet). We saw great real-life implementation examples from the Netflix team, nesto or Frontside to name just a few. We can’t wait for PlatformCon23!

 

Guest post originally published on the Humanitec blog by Luca Galante
Source CNCF


For enquiries, product placements, sponsorships, and collaborations, connect with us at [email protected]. We'd love to hear from you!

Our humans need coffee too! Your support is highly appreciated, thank you!

aster.cloud

Related Topics
  • CNCF
  • DevOps Topologies
  • SRE
You May Also Like
Google Cloud and Smart Communications
View Post
  • Platforms
  • Technology

Smart Communications, Inc. Dials into Google Cloud AI to Help Personalize Digital Services for Filipinos

  • October 25, 2024
View Post
  • Platforms
  • Public Cloud

Empowering builders with the new AWS Asia Pacific (Malaysia) Region

  • August 30, 2024
Red Hat and Globe Telecoms
View Post
  • Platforms
  • Technology

Globe Collaborates with Red Hat Open Innovation Labs to Modernize IT Infrastructure for Greater Agility and Scalability

  • August 19, 2024
Huawei Cloud Cairo Region Goes Live
View Post
  • Cloud-Native
  • Computing
  • Platforms

Huawei Cloud Goes Live in Egypt

  • May 24, 2024
Asteroid
View Post
  • Computing
  • Platforms
  • Technology

Asteroid Institute And Google Cloud Identify 27,500 New Asteroids, Revolutionizing Minor Planet Discovery With Cloud Technology

  • April 30, 2024
IBM
View Post
  • Hybrid Cloud
  • Platforms

IBM To Acquire HashiCorp, Inc. Creating A Comprehensive End-to-End Hybrid Cloud Platform

  • April 24, 2024
View Post
  • Platforms
  • Technology

Canonical Delivers Secure, Compliant Cloud Solutions for Google Distributed Cloud

  • April 9, 2024
Redis logo
View Post
  • Platforms
  • Software

Redis Moves To Source-Available Licenses

  • April 2, 2024

Stay Connected!
LATEST
  • college-of-cardinals-2025 1
    The Definitive Who’s Who of the 2025 Papal Conclave
    • May 7, 2025
  • conclave-poster-black-smoke 2
    The World Is Revalidating Itself
    • May 6, 2025
  • 3
    Conclave: How A New Pope Is Chosen
    • April 25, 2025
  • Getting things done makes her feel amazing 4
    Nurturing Minds in the Digital Revolution
    • April 25, 2025
  • 5
    AI is automating our jobs – but values need to change if we are to be liberated by it
    • April 17, 2025
  • 6
    Canonical Releases Ubuntu 25.04 Plucky Puffin
    • April 17, 2025
  • 7
    United States Army Enterprise Cloud Management Agency Expands its Oracle Defense Cloud Services
    • April 15, 2025
  • 8
    Tokyo Electron and IBM Renew Collaboration for Advanced Semiconductor Technology
    • April 2, 2025
  • 9
    IBM Accelerates Momentum in the as a Service Space with Growing Portfolio of Tools Simplifying Infrastructure Management
    • March 27, 2025
  • 10
    Tariffs, Trump, and Other Things That Start With T – They’re Not The Problem, It’s How We Use Them
    • March 25, 2025
about
Hello World!

We are aster.cloud. We’re created by programmers for programmers.

Our site aims to provide guides, programming tips, reviews, and interesting materials for tech people and those who want to learn in general.

We would like to hear from you.

If you have any feedback, enquiries, or sponsorship request, kindly reach out to us at:

[email protected]
Most Popular
  • 1
    IBM contributes key open-source projects to Linux Foundation to advance AI community participation
    • March 22, 2025
  • 2
    Co-op mode: New partners driving the future of gaming with AI
    • March 22, 2025
  • 3
    Mitsubishi Motors Canada Launches AI-Powered “Intelligent Companion” to Transform the 2025 Outlander Buying Experience
    • March 10, 2025
  • PiPiPi 4
    The Unexpected Pi-Fect Deals This March 14
    • March 13, 2025
  • Nintendo Switch Deals on Amazon 5
    10 Physical Nintendo Switch Game Deals on MAR10 Day!
    • March 9, 2025
  • /
  • Technology
  • Tools
  • About
  • Contact Us

Input your search keywords and press Enter.