wiki
  • My Knowledge Wiki
  • Microservices
  • Domain-driven design (DDD) learning resources
  • Go Programming Language
  • Kotlin Programming Language
  • Java Programming Language
  • Hibernate
  • Reactive Programming
  • Kubernetes
  • Apache Kafka
  • Apache Beam
  • Google Cloud
  • Serverless
  • Janusgraph
  • Event Sourcing & CQRS
  • Performance Engineering
  • Site Reliability Engineering
  • Software Security
  • Scalability
  • Software Craftsmanship
  • Books and Papers
  • Conference and meetup videos
Powered by GitBook
On this page

Was this helpful?

Site Reliability Engineering

PreviousPerformance EngineeringNextSoftware Security

Last updated 5 years ago

Was this helpful?

Site Reliability Engineering (SRE) "is a discipline that incorporates aspects of software engineering and applies them to IT operations problems. The main goals are to create ultra-scalable and highly reliable software systems. According to Ben Treynor, founder of Google's Site Reliability Team, SRE is "what happens when a software engineer is tasked with what used to be called operations." "Fundamentally, it's what happens when you ask a software engineer to design an operations function. So SRE is fundamentally doing work that has historically been done by an operations team, but using engineers with software expertise, and banking on the fact that these engineers are inherently both predisposed to, and have the ability to, substitute automation for human labor." - Ben Treynor

  • AWS re:Invent 2015: DevOps at Amazon: A Look at Our Tools and Processes (DVO202)

https://en.wikipedia.org/wiki/Site_Reliability_Engineering
https://github.com/dastergon/awesome-sre
https://github.com/binhnguyennus/awesome-scalability
https://landing.google.com/sre/
https://landing.google.com/sre/books/
https://landing.google.com/sre/interview/ben-treynor/
https://github.com/dastergon/awesome-sre#books
https://medium.com/@jdavidmitchell/principles-of-site-reliability-engineering-at-google-8382b054e498
https://hackernoon.com/so-you-want-to-be-an-sre-34e832357a8c
https://hackernoon.com/the-must-know-checklist-for-devops-system-reliability-engineers-f74c1cbf259d
https://thenewstack.io/site-reliability-engineering-kind-magic/
https://thenewstack.io/the-new-devops-site-reliability-engineering-comes-of-age/
https://cloud.kapostcontent.net/pub/1418185e-b325-49d3-b65c-de338e45cb6f/ebook-10-essential-skills-of-a-site-reliability-engineer-sre.pdf
https://www.slideshare.net/initcron/devops-skills-survey
https://cloudacademy.com/blog/what-is-the-role-of-a-site-reliability-engineer
https://www.youtube.com/watch?v=esEFaY0FDKc
https://github.com/dastergon/awesome-chaos-engineering
https://blog.packagecloud.io/eng/2016/04/05/the-definitive-guide-to-linux-system-calls/