Conf42: Site Reliability Engineering 2022

...

Hodor: Detecting and addressing overload in LinkedIn microservices

Bryan Barkley
Senior Staff Engineer @ LinkedIn

Bryan Barkley's LinkedIn account

Vivek Deshpande
Senior Software Engineer @ LinkedIn

Vivek Deshpande's LinkedIn account


When pushed hard enough any system will eventually suffer, and ultimately fail unless relief is provided in some form. At LinkedIn, we have developed a framework for our microservices to help with these issues: Hodor (Holistic Overload Detection & Overload Remediation).

As the name suggests, it is designed to detect service overloads from multiple potential root causes, and to automatically improve the situation by dropping just enough traffic to allow the service to recover.

Hodor then maintains an optimal traffic level to prevent the service from reentering overload. All of this is done without manual tuning or specifying thresholds. In this talk, we will introduce Hodor, provide an overview of the framework, describe how it detects overloads, and how requests are dropped to provide relief.

Awesome conferences for

Priority access to all content

Community Discord

Exclusive promotions and giveaways