- Добавил: literator
- Дата: Сегодня, 00:49
- Комментариев: 0
Автор: Florian Hoeppner, Francesco Sbaraglia
Издательство: Apress
Год: 2025
Страниц: 326
Язык: английский
Формат: True PDF, True EPUB
Размер: 12.8 MB
Transform enterprise IT by adopting site reliability engineering (SRE) practices that reduce downtime, build resilience, and drive business value. This book is a comprehensive guide designed to help site reliability engineers, DevOps teams, and platform engineers identify, address, and mitigate system weaknesses before they become significant critical failures. Authors Francesco Sbaraglia and Florian Hoeppner highlight the paradigm shift from IT as a cost center to a core business function, emphasizing the central role of developers and the need for speed and reliability. Specific SRE capabilities like chaos engineering, observability, and toil management are explored, along with strategies for successful implementation, including building a Center of Excellence, selecting the right tools, and fostering a culture of collaboration and continuous improvement. By the end of this book, you’ll know how to apply core SRE practices to strengthen reliability: establishing a chaos engineering practice led by SREs, running reliability-focused “game days,” improving observability, troubleshooting failure scenarios, and fortifying the digital resilience of your systems and teams. For professionals, architects, engineers, and practitioners eager to design, plan and implement enterprise system resilience with proven SRE practices.
