Site Reliability Engineering (SRE) Resources¶
⚙️ SRE Resources
Resources for Site Reliability Engineering, reliability, and production systems.
📖 Books¶
Essential SRE Books
- Site Reliability Engineering Book - Google SRE Team (Free)
- The Site Reliability Workbook - Google SRE Team (Free)
- Seeking SRE by David Blank-Edelman - O'Reilly
- Building Secure and Reliable Systems - Google (Free)
- The DevOps Handbook - SRE practices
📄 Research Papers¶
SRE Research
- Google SRE Papers - Google research
- The Datacenter as a Computer - Warehouse-scale machines
- Borg: Large-scale Cluster Management - Google's system
- Reliability Engineering Papers - SRE resources
⭐ GitHub Repositories¶
Important SRE Repos
- Google SRE Book - Official SRE book
- Awesome SRE - Curated SRE resources
- SRE Interview Prep - Interview questions
- Production Engineering - Facebook's practices
- SRE Resources - Community resources
🎥 Videos & Courses¶
Video Resources
- Google SRE Talks - SRE conference talks
- SREcon - SRE conference videos
- Reliability Engineering - Coursera course
- Incident Response - Incident management
📰 Articles & Blogs¶
Recommended Blogs
- Google SRE Blog - Google SRE updates
- PagerDuty Blog - Incident management
- Honeycomb Blog - Observability
- Lightstep Blog - Distributed tracing
- Charity Majors Blog - SRE insights
🔗 Recommended Reading¶
Additional Resources
- SRE Google Site - Official SRE resources
- SRE Principles - Core principles
- SLI/SLO/SLA Guide - Service level objectives
- Error Budgets - Reliability targets