Running Kubernetes in production: A million ways to crash your cluster

Share

Bootstrapping a Kubernetes cluster is easy, rolling it out to nearly 200 engineering teams and operating it at scale, however, is a challenge. In this talk, Henning Jacobs presents Zalando’s approach to Kubernetes provisioning on AWS, operations and developer experience for our growing Zalando developer base.

He will walk you through his horror stories of operating 80+ clusters and share the insights we gained from incidents, failures, user reports, and general observations. Most of the experiences shared in this talk, apply to other Kubernetes infrastructures (EKS, GKE, ..) as well. With this session, Jacobs aims to reduce your unknown unknowns about running Kubernetes in production.

wuegqwpö

Henning Jacobs joined Zalando at the beginning of 2010 and accompanied the transformation of Zalando’s technology department through the eras of PHP/MySQL and Java/PostgreSQL to the new world of “Radical Agility”. He helped to build the AWS/STUPS cloud infrastructure to make innovation scale across autonomous teams. Henning is currently responsible for the developer journey at Zalando. His five teams help streamline the developer experience by providing a cloud-native application runtime to 200+ engineering teams.

eqwueg

ueqoiqhew

The post Running Kubernetes in production: A million ways to crash your cluster appeared first on JAXenter.

Source : JAXenter