JANUARY 23 2020
TICKETS AVAILABLE NOW:
both for the conference and the workshop
JAN 23 2020
THE CHAOS WORKSHOP/
JAN 24 2020
The Comedy Store,
1a Oxendon St,
ABOUT THE EVENT /
Chaos Engineering for practitioners and adopters.
Come and break things on purpose with us!
CO-FOUNDER AND CTO
SCHEDULE / JANUARY 23 2020
THE COMEDY STORE
8:35 - 9:30 AM
How to Be Wrong
KEYNOTE BY RUSS MILES,
CEO @ CHAOSIQ
Being wrong is often seen as the WORST THING THAT CAN HAPPEN(tm), especially when you’re build business critical applications and services. But the increased velocity of modern software development, plus the increased need for our systems to be resilient, reliable, and RIGHT has increased the pressure on developers exponentially. Never before have software owners had such an opportunity, or the power, to BE WRONG!
We need to get better at being wrong, and that’s what this keynote is all about.
In this keynote talk Russ Miles, CEO of ChaosIQ,, will share the tools and techniques he uses to turn inevitably BEING WRONG, into BEING SUCCESSFUL at BEING WRONG. BEING WRONG can be turned to our advantage, and in this talk Russ will share stories of how this has happened, and also the challenges to look out for.
The myth of always being right when you create and operate software is over! You’re going to BE WRONG most of the time’s time to get better at BEING WRONG, learning to turn “accidents” such as outages into opportunities…
9:30 - 10:00 AM
for Spring Boot
TALK BY JOHN FLETCHER
AND MANUEL WESSNER,
CHAOS MONKEY EVANGELISTS
Everything you want to know about the useful and popular chaos engineering tool Chaos Monkey for Spring Boot (CMSB) from two of its maintainers.
- How to easily get started with your first Chaos Experiments.
- More exotic applications like dual attacks.
- How to integrate CMSB with automation tools like Chaos Toolkit and Chaos Mesh in order to run tests in your build chain.
- An overview of the history and the changes in the latest version.
- Who should get involved in the project, and how.
- A sneak peek into the next release.
The talk will primarily consist of live coding.
10:00 - 10:30 AM
Chaos Engineering in JVM
TALK BY LONG ZHANG,
PHD STUDENT IN COMPUTER SCIENCE
@ KTH ROYAL INSTITUTE OF TECHNOLOGY
During the talk, I will introduce the main research work we did recently on chaos engineering. We focus a lot on application-level chaos engineering in JVM. For example, ChaosMachine provides unique and actionable analysis on exception-handling capabilities in production, at the level of try-catch blocks. TripleAgent combines monitoring, perturbation, and failure-obliviousness for automated resilience improvement, at the level of methods.
Currently, we are exploring a new idea about chaos experiments for containerized Java applications. When the conference is held, it is promising to share some interesting findings of this work as well.
For sake of open-science, the code is made publicly available at
10:30 - 10:45 AM
10:45 - 11:30 AM
Security Precognition: Crafting Secure and Resilient Distributed Systems using Chaos Engineering
KEYNOTE BY AARON RINEHART
CO-FOUNDER AND CTO @ VERICA
Modern systems pose a number of thorny challenges and securing the transformation from legacy monolithic systems to distributed systems demands a change in mindset and engineering toolkit. The security engineering toolkit is unfortunately out-of-style and outdated with today's approach to building, security and operating distributed systems.
The speed, scale, and complex operations within microservice architectures make them tremendously difficult for humans to mentally model their behaviour. If the latter is even remotely true how is it possible to adequately secure services that are not even fully comprehended by the engineering teams that built them.
Security Chaos Engineering helps teams realign the actual state of operational security as well as build confidence that their security actually works the way the think it does. Chaos Engineering allows for security teams to proactively experiment on recurring incident patterns to derive new information about underlying factors that were previously unknown by reversing the postmortem and preparation phases. This is done by developing live-fire exercises that can be measured, managed, and automated.
It develops teams by building a learning culture around system failure to challenge engineering teams to proactively, safely discover system weakness before they disrupt business outcomes. In this session, we will introduce a new concept known as Security Chaos Engineering and how it can be applied to create highly secure, performant, and resilient distributed systems.
11:30 - 11:40 AM
Who is responsible for chaos?
A TALK BY JOYCE LIN
DEVELOPER ADVOCATE LEAD @ POSTMAN
If you’re thinking of starting a chaos program, you might be wondering which job functions are typically responsible for managing chaos within their organizations. This talk will look across a number of companies to determine who historically initiates chaos programs, as well as reveal new trends in this space.
11:50 AM - 13:10 PM
AMA SESSIONS WITH CONFERENCE PATRONS
13:10 - 13:50 PM
Shipping quality software in hostile environments
A TALK BY LUKA KLADARIC
FOUNDER & CHAOS MANAGEMENT SPECIALIST @ SEKURA
Everyone loves features, right? Product loves features. Management loves features. The board loves features. Features are what make the users use and the investors invest, right? They certainly make the media pay attention.
What happens when, for 8 years straight, all you care about is features? Productivity grinds to a halt, production outages are a given, post-mortems are a joke and job satisfaction and happiness are flatlining.
Lessons learned unravelling layers and layers of terribleness to rediscover productivity and job satisfaction while also improving security and robustness of the products.
13:50 - 14:20 PM
Chaos Engineering for SQL Server
A TALK BY ANDREW PRUSKI
SQL SERVER DBA @ CHANNEL ADVISOR
In this session we’ll look at how Chaos Engineering can be implemented with regard to SQL Server. SQL has various different high availability solutions but can we be sure that they’ll react as expected to a real world issue? Has the HA architecture only ever been tested in a planned maintenance window?
We’ll explore SQL Server’s built-in high availability features and take a look at Kubernetes, a brand new platform for SQL Server.
We’ll also have some fun by looking at KubeInvaders, a chaos engineering tool for Kubernetes…using Space Invaders!
14:20 - 14:50 PM
14:50 - 15:20 PM
getting out of the starting blocks.
A TALK BY ADRIAN HORNSBY
PRINCIPAL TECHNOLOGIST, ARCHITECTURE
@ AMAZON WEB SERVICES (AWS)
Architectures are growing increasingly distributed and hard to understand. As a result, software systems have become extremely difficult to debug and test, which increases the risk of failure. With these new challenges, chaos engineering ha become attractive to many organizations as a mechanism for underling the behavior of systems under expected circumstances.
Whilst interest is growing, few have managed to build sustainable chaos engineering practices. In this talk, I will review the state of chaos engineering, the issues customers are facing, based on my learning as an AWS Solution Architect and Technologist focusing on Chaos Engineering and explain why I started to build tools to help with failure injection.
15:20 - 15:50 PM
Cloud-Native Chaos Engineering
A TALK BY UMASANKAR MUKKARA
CO-FOUNDER AND COO @ MAYADATA
The cloud-native approach has taken the DevOps world by a pleasant surprise by the welcome adoption of Kubernetes across all categories - from Developers to SREs to VP of digital transformation. As the huge mass of legacy applications move Cloud-Native platforms, an important problem arises. How do SREs make sure the systems do not have weaknesses and have the required level of resilience? A well thought out chaos engineering methodology is the right answer. And for a large number of fast-changing applications and infrastructure, finding the right set of chaos experiments and identifying if the impact of chaos has resulted in showing up a weakness in the system is almost an impossible task.
In Cloud-Native Chaos engineering, the developers develop chaos tests as an extension of the development process. These tests are developed using standard Kubernetes Custom Resources or CRs so that they are easier to manipulate according to the environment. These chaos experiments are groomed in CI pipelines and finally published in the Chaos Hub so that they are available to SREs using the Cloud-Native applications in production. SREs use such chaos experiments of various microservices to schedule chaos in a random fashion to find weaknesses in their deployments, which leads to increased reliability.
15:50 - 16:20 PM
of Chaos Engineering
KEYNOTE BY MATTY STRATTON
DEVOPS ADVOCATE @ PAGERDUTY
Chaos Engineering, failure injection, and similar practices have verified benefits to the resilience of systems and infrastructure. But can they provide similar resilience to teams and people? What are the effects and impacts on the humans involved in the systems? This talk will delve into both positive and negative outcomes to all the groups of people involved - including users, engineers, product, and business owners.
Using case studies from organizations where chaos engineering has been implemented, we will explore the changes in attitude that these practices create. This talk will include a brief overview of chaos engineering practices for unfamiliar members of the audience, but the main focus will be on human elements. I will discuss successful implementations, as well as challenges faced in teams where chaos was a “success” from a technical perspective, but contained negative impact for the people involved.
16:20 - 16:50 PM
Journey to resilience : building the ramp to accelerate adoption of Chaos Engineering practices
A TALK BY VILAS VERARAGHAVAN
DIRECTOR OF ENGINEERING
@ WALMART LABS
Chaos engineering has come a long way from its early days at Netflix. Its importance is no longer questioned in the community but as it has gone mainstream, teams quickly learn that adoption is not a given. In this talk, we talk about the challenges that we encountered at Walmart and the techniques used to break through them.
We will discuss our successes and failures on the journey to resilience, highlighting the major barriers to adoption. The talk also will discuss the strategies we used to build tooling to guide teams in addition to a gamified approach to motivate them.
THE COMEDY PUB
(to be confirmed)
THE Chaos WORKSHOP
ABOUT THE WORKSHOP /
Let experts guide you through the world of chaos!
Featuring an In-depth Hands-on Workshop!
In this hands-on workshop, we will cover open source tools like Prometheus, Chaos Monkey, PowerfulSeal, and Pumba to introduce chaos into an existing system.
Participants will learn how to:
- gain visibility necessary to assess their hypotheses
- plan and implement meaningful chaos experiments using popular tools
- run their own Game Days at work
Required: a laptop and a strong will to learn :)
9:00 - 16:00
THE CHAOS WORKSHOP
12:00 - 13:00
(to be defined)