From docker to kubernetes: running Apache Hadoop in a cloud native way

06/12/2018 - 14:50 to 15:30
long talk (40 min)

Session abstract: 

Creating containers for an application is easy (even if it’s a goold old distributed application like Apache Hadoop), just a few steps of packaging.

The hard part isn't packaging: it's deploying

How can we run the containers together? How to configure them? How do the services in the containers find and talk to each other? How do you deploy and manage clusters with hundred of nodes?

Modern cloud native tools like Kubernetes or Consul/Nomad could help a lot but they could be used in different way.

It this presentation I will demonstrate multiple solutions to manage containerized clusters with different cloud-native tools including kubernetes, and docker-swarm/compose.

No matter which tools you use, the same questions of service discovery and configuration management arise. This talk will show the key elements needed to make that containerized cluster work.


  • kubernetes, docker-swam, docker-compose, consul, consul-template, nomad
  • together with: Hadoop, Yarn, Spark, Kafka, ...