As Site Reliability Engineers it is our mission to ensure our services are highly available, secure, and scalable. With hundreds or thousands of different metrics across a (potentially) distributed system that you could monitor and alert on, where do we begin? How do we define what it means for a service to be "healthy"? This lightning talk focuses on the four golden signals of monitoring that...
Learn for free, join the best tech learning community
Event notifications, weekly newsletter
Access to all content