A checklist to choose a monitoring system (pinned, Feb 25, 2024). Monitoring is extremely important to understand the system's health. Monitoring tools range from open-source tools such as Prometheus to…
Laffer’s Curve and Reliability of Software Systems (pinned, published in Cloud Native Daily, Jun 6, 2023). Laffer Curve and software reliability: Some valuable insights into the optimal level of investment needed to ensure dependable and robust…
Are you following the SRE way? (Jun 11, 2023). Site Reliability Engineering (SRE) is an established and critical practice, widely recognized for its importance in modern software…
Anatomy of Metrics (published in Cloud Native Daily, Jun 11, 2023). Metrics are a crucial element in the context of Observability and Monitoring.
The Fallacies of Distributed Systems (Jun 9, 2023). In the realm of computer science, distributed systems have revolutionized how we perceive, manage, and employ data processing. A…
Metrics vs. Logs: A Detailed Exploration (Jun 3, 2023). In the complex landscape of modern computing, two key concepts reign supreme: metrics and logs. While both are instrumental in…
A day in the life of an SRE (Mar 20, 2023). I always look forward to hearing stories from people, their workflows, and how they improve their craft. As part of Last9, where we build…
Starting o11y.wiki (Mar 18, 2023). When I joined Last9 three years back, I knew little about SRE and Observability. The situation has not changed after three years :) There…