What is Site Reliability Engineering (SRE)?

A discipline that incorporates aspects of software engineering and applies them to infrastructure and operations problems.


SRE is a set of practices and a job role that focuses on the reliable operation of computer systems. It was pioneered at Google and is often summarized by the phrase: 'SRE is what happens when you ask a software engineer to design an operations team.' SREs use software tools to manage systems, solve problems, and automate operations tasks.

Related terms: site-reliability-engineering (/terms/site-reliability-engineering), api-development (/terms/api-development), agile-methodology (/terms/agile-methodology)


🧒 Explain It Like I'm 5

SRE is like a team of super-smart doctors for a giant robot. They don't just fix the robot when it breaks; they build special tools that help the robot fix itself and monitor its health so it never gets sick in the first place. Their goal is to make sure the robot is always 'available' to do its job.

🤓 Expert Deep Dive

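SRE introduces key concepts such as Service Level Indicators (SLIs), Service Level Objectives (SLOs), and error budgets. An SLI is a measured signal of service health, for example the fraction of requests that succeed. An SLO is the target set for that signal, and the error budget is the amount of unreliability the SLO permits (1 minus the SLO). The core philosophy is that 'hope is not a strategy'. SREs aim to spend at least 50% of their time on engineering and automation work, and are empowered to push back on developers, for example by pausing releases, when a system becomes too unstable.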
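To make the relationship between an SLI, an SLO, and an error budget concrete, here is a minimal sketch in Python. It assumes a simple request-based SLI (good requests divided by total requests). The names `SLO`, `sli`, `error_budget_remaining`, and `can_release` are hypothetical and chosen for illustration only; real SRE teams usually compute these values from monitoring data over a rolling window.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SLO:
    """Service Level Objective: target fraction of good events (hypothetical type)."""
    target: float  # e.g. 0.999 means 99.9% of requests should succeed


def sli(good_events: int, total_events: int) -> float:
    """Service Level Indicator: the measured fraction of good events."""
    if total_events == 0:
        # Assumption for this sketch: no traffic counts as fully compliant.
        return 1.0
    return good_events / total_events


def error_budget_remaining(slo: SLO, good_events: int, total_events: int) -> float:
    """Fraction of the error budget still unspent (negative when overspent)."""
    allowed_bad = (1.0 - slo.target) * total_events  # failures the SLO tolerates
    actual_bad = total_events - good_events          # failures actually observed
    if allowed_bad == 0:
        # A 100% target (or no traffic) leaves no room for any failure.
        return 1.0 if actual_bad == 0 else float("-inf")
    return 1.0 - actual_bad / allowed_bad


def can_release(slo: SLO, good_events: int, total_events: int) -> bool:
    """Simplified policy: allow risky changes only while budget remains."""
    return error_budget_remaining(slo, good_events, total_events) > 0


if __name__ == "__main__":
    slo = SLO(target=0.999)
    print(sli(999_400, 1_000_000))
    print(error_budget_remaining(slo, 999_400, 1_000_000))
    print(can_release(slo, 998_000, 1_000_000))
```

With a 99.9% target and 1,000,000 requests, the SLO tolerates about 1,000 failures. If 600 requests failed, roughly 40% of the budget is left and releases can continue; if 2,000 failed, the budget is overspent and this simplified policy would pause releases. Actual error budget policies vary by organization.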

📚 Sources