Software Reliability Engineering

👋 Welcome to the SRE Community!

Whether you're just learning about Site Reliability Engineering/Software Reliability Engineering or you're a seasoned on-call warrior, you're in the right place.

SRE (Site Reliability Engineering - AKA Software Reliability Engineering) is a discipline that uses software engineering principles to ensure that systems are reliable, scalable, and resilient. It’s about balancing feature velocity with system stability—keeping things running even when they shouldn’t.

💬 What can you post here?

Here are a few ideas to get started:

🌐 Be excellent to each other

This community is part of the programming.dev network. Please make sure to:

founded 5 days ago

Whether you're ironing out kinks in disk provisioning at scale or implementing a service mesh for microservices, let's talk about the problems we're currently tackling.

(Without violating NDAs, of course.)

SREsyphus (programming.dev)
submitted 5 days ago by [email protected] to c/[email protected]
 
 

Maybe, one of these days, my SRE team can define SLOs that actually make sense. This "alert on whatever the customer wants" stance really, really sucks.