DevOpsAndSre Hub

This cluster covers the operational discipline of running software in production — automated delivery, deployment patterns, observability, on-call practice, and the SRE core principles. The orientation is concrete: practices that make the difference between a stable production system and an unstable one.

Delivery

DevOpsFundamentals — What DevOps actually changed; what it did not
CiCdPipelines — Pipeline design, stages, the patterns that scale
TrunkBasedDevelopment — Trunk vs. GitFlow, the case for trunk
GitWorkflows — Branch strategies, merge vs. rebase, commit hygiene
MonorepoVsPolyrepo — The trade-offs at scale
FeatureToggleManagement — Flag types, lifecycle, retirement
ReleaseEngineering — Release artifacts, signing, rollback
ReleasePlanning — Sequencing, dependencies, communication

Operations and Resiliency

OnCallPractices — Rotation, escalation, blameless postmortems
RunbookAutomation — Runbooks that work; automating the recoverable
StatusPageBestPractices — Public status pages, customer communication
ToilReductionStrategies — Identifying and eliminating operational toil
ScheduledTaskManagement — Cron, scheduled jobs, the patterns that survive
Auto Scaling Strategies — Horizontal vs. Vertical, predictive scaling, and cost control
Health Check Patterns — Liveness, readiness, and deep-health checks in distributed systems

Observability Implementation

Technical standards for monitoring and insight across the project ecosystem.

Observability and Monitoring Blueprint — Unified standard for OTel, Prometheus, and Grafana
Monitoring and Alerting — The architecture of insight: metrics, logs, and traces
AI Observability in Production — Monitoring LLM drift, safety, and evaluation metrics

Infrastructure and Tooling

Kubernetes Basics — Pods, Deployments, Services, and the K8s object model
Docker Deployment — Containerizing applications for portable production
Secrets Management — Storing and rotating credentials in a secure pipeline
Rate Limiting and Throttling — Protecting services from resource exhaustion
ServiceMeshArchitecture — When the mesh is worth the complexity
Container Security — Hardening the runtime and the image supply chain

Adjacent clusters

Cloud Platforms Hub — Where DevOps practices land in cloud
Software Engineering Practices Hub — Code-side disciplines
Web Services and APIs Hub — Service-level concerns