# Site Reliability Engineering

**Site Reliability Engineering** (**SRE**) "is a discipline that incorporates aspects of software engineering and applies them to IT operations problems. The main goals are to create ultra-scalable and highly reliable software systems. According to Ben Treynor, founder of Google's Site Reliability Team, SRE is 'what happens when a software engineer is tasked with what used to be called operations.'"\
<https://en.wikipedia.org/wiki/Site_Reliability_Engineering>\
\
"Fundamentally, it's what happens when you ask a software engineer to design an operations function. So SRE is fundamentally doing work that has historically been done by an operations team, but using engineers with software expertise, and banking on the fact that these engineers are inherently both predisposed to, and have the ability to, substitute automation for human labor." - Ben Treynor

* <https://github.com/dastergon/awesome-sre>
* <https://github.com/binhnguyennus/awesome-scalability>
* <https://landing.google.com/sre/>
* <https://landing.google.com/sre/books/>
* <https://landing.google.com/sre/interview/ben-treynor/>
* <https://github.com/dastergon/awesome-sre#books>
* <https://medium.com/@jdavidmitchell/principles-of-site-reliability-engineering-at-google-8382b054e498>
* <https://hackernoon.com/so-you-want-to-be-an-sre-34e832357a8c>
* <https://hackernoon.com/the-must-know-checklist-for-devops-system-reliability-engineers-f74c1cbf259d>
* <https://thenewstack.io/site-reliability-engineering-kind-magic/>
* <https://thenewstack.io/the-new-devops-site-reliability-engineering-comes-of-age/>
* <https://cloud.kapostcontent.net/pub/1418185e-b325-49d3-b65c-de338e45cb6f/ebook-10-essential-skills-of-a-site-reliability-engineer-sre.pdf>
* <https://www.slideshare.net/initcron/devops-skills-survey>
* <https://cloudacademy.com/blog/what-is-the-role-of-a-site-reliability-engineer/>
* AWS re:Invent 2015: DevOps at Amazon: A Look at Our Tools and Processes (DVO202) <https://www.youtube.com/watch?v=esEFaY0FDKc>
* <https://github.com/dastergon/awesome-chaos-engineering>
* <https://blog.packagecloud.io/eng/2016/04/05/the-definitive-guide-to-linux-system-calls/>

