Satellite Events


Fast Data Processing Pipeline for Predicting Flight Delays Using Apache APIs: Kafka, Spark ML, Drill, with MapR-DB JSON

The ability to blend machine learning with real-time transactional data flowing through a single platform opens up new possibilities, such as enabling organizations to act on opportunities as they arise. Doing so requires fast, scalable data processing pipelines that process, analyze, and store events as they arrive.

In this deep dive we will look at the architecture of a data pipeline that combines streaming data with machine learning to predict flight delays. You will see the end-to-end process required to build this application using Apache APIs for Kafka, Spark, Drill and other technologies:

  • Apache Spark Machine Learning to build a model to predict flight delays.
  • Kafka and Spark Streaming: Using the ML model with streaming data to do real-time analysis of flight delays.
  • Spark Streaming and fast storage with MapR-DB JSON.
  • Analysis of flight delay data and predictions stored in MapR-DB with Apache Spark, Apache Drill and OJAI.

The format will consist of lecture and lab sessions on Zeppelin notebooks. The Zeppelin notebook code will be provided for download so that developers can try it out on their own after the workshop. Developers can also download a complimentary ebook from MapR, which explains the code examples and more.

About the instructor:

Carol is a Solutions Architect at MapR. Previously she was an Apache Spark instructor and curriculum developer at MapR. Carol has experience working with Java technologies in many roles, including software development, training, technology evangelism and developer outreach. She has extensive experience as a software developer and architect, building complex mission-critical applications in the banking, health insurance and telecom industries. Before joining MapR, Carol was a Technology Evangelist at Sun, travelling worldwide to speak and give trainings. She also worked as a senior developer for a health information exchange and as an architect on a massive OLTP Spring application managing more than 10 million loans for the consumer credit division of a leading automobile manufacturer and a leading bank. Carol worked on pharmaceutical intranet applications for Roche in Switzerland, a telecom network management application for HP in France, an email server for IBM in Germany, and as a student intern for the National Security Agency. Carol holds an M.S. in Computer Science from the University of Tennessee and a B.S. in Geology from Vanderbilt University, and is a Sun Certified Java Architect and Java Language Programmer. Carol is also fluent in French and German.


Only 20 seats available.


plista GmbH, Torstraße 35, 5th floor, 10119 Berlin
tube: Rosa-Luxemburg-Platz (U2)



Friday, 14 June 2019

10am - 5pm CEST

Doors open at 9 am

Register here:

Apache Spark Workshop

Join us for a full day of Apache Spark!
On June 19, 2019 we are holding an Apache Spark workshop for beginners presented by Marcin Szymaniuk at idealo Berlin.
Tickets are limited to 20 seats. Available here for only €395
Each ticket includes food and drink and after-workshop beer, kindly provided by idealo GmbH.

About the workshop
The goal is to provide an overview of the most important Spark features so that you can start building your first Spark applications. No prior experience with Spark is required. All hands-on exercises will be in Scala, but they will be simple enough for anybody with good knowledge of any modern programming language. The course emphasises practical aspects of Spark as well as common problems and misconceptions. All participants should have VirtualBox installed on their laptop so they can do the hands-on exercises and fully benefit from the workshop.

About the instructor
Marcin is a data developer and architect with experience in data infrastructure administration. He has proven knowledge of real-life big data problems, which he solves on a daily basis. He has worked for companies like Spotify and Apple and is currently consulting on big data projects.

Program overview

1. Introduction to Spark

2. Basic building blocks in Spark

4. Writing and deploying Spark applications

5. Spark on a cluster

6. SQL-like Spark features

7. Spark use cases overview

Bonus exercises: Spark-SQL aggregations, Page Rank, Data generation with Spark, Broadcast join, Skewed join problem, AggregateByKey challenge, Tree-reduce.


idealo GmbH, Ritterstr. 11, 10969 Berlin
Nearest tube stations: Moritzplatz or Kottbusser Tor


19 June 2019, 9am - 5pm, doors open at 8:30 am

Register here:

Beam Summit Europe

The Beam Summit is a 2-day, multitrack event that aims to bring together experts on Beam, new contributors and other participants interested in learning more about Apache Beam. Registration is free.

If you are interested in speaking or hosting a workshop at the Beam Summit Europe, please submit your session at Beam Summit CfP @ Sessionize. The deadline for submissions is March 31st. If you have additional questions, please contact the organizers at




June 19-20, 2019

Register here:

Elasticsearch and Elastic Stack: Search and Beyond

Elasticsearch is the most widely used full-text search engine, but it is also very common for logging, metrics, and analytics. This workshop shows you what all the rage is about:

1. Overview of Elasticsearch and how it became the Elastic Stack.

2. Full-text search deep dive:

  • How full-text search works in general and how it differs from databases.
  • How the score or quality of a search result is calculated.
  • How to handle languages, search for terms and phrases, run boolean queries, add suggestions, work with ngrams, and more with Elasticsearch.

3. Going from search to logging, metrics, and analytics:

  • System metrics: Keep track of network traffic and system load.
  • Application logs: Collect structured logs in a central location from your systems and applications.
  • Uptime monitoring: Ping services and actively monitor their availability and response time.
  • Application metrics: Get information from applications such as nginx, MySQL, or your custom Java applications.
  • Request tracing: Trace requests through an application and show how long each call takes and where errors are happening.

And we will do all of that live, since it is so easy and much more interactive that way.

Only 20 places available, so secure your ticket now here

Instructor: Philipp Krenn, Developer Advocate @elastic
Philipp lives to demo interesting technology. Having worked as a web, infrastructure, and database engineer for more than ten years, Philipp is now a developer advocate at Elastic, the company behind the open source Elastic Stack consisting of Elasticsearch, Kibana, Beats, and Logstash. Based in Vienna, Austria, he is constantly traveling Europe and beyond to speak about and discuss open source software, search, databases, infrastructure, and security.





idealo GmbH, Ritterstr. 11, 10969 Berlin
Nearest tube stations: Moritzplatz or Kottbusser Tor


19 June 2019, 9am - 5pm, doors open at 8:30 am

Register here:

MICES - Mix-Camp E-Commerce Search

MICES is a one-day event on e-commerce search. The goal of the event is to bring together participants from different backgrounds (IT, product managers, UX designers, search managers, information retrieval specialists, search technology vendors, …) to discuss challenges, ideas, best practices and case studies in the e-commerce search domain.

The format of the event will be a mix of scheduled talks and self-organising sessions.

The call for talks closes on 14 April 2019.


myToys, Potsdamer Straße 192, 10783 Berlin


19 June 2019, 9am to 7pm (doors open at 8:30am)

Register here:

ticket shop satellite events 2019


The satellite events above take place at various locations across Berlin on June 14th and 19th. All events have a limited number of spaces available.
For the workshops, please register in the ticket shop below for only €395 each. You can save 15% on your workshop ticket if you order it together with your regular #bbuzz ticket. Get your ticket here; you can choose your preferred workshop during the ordering process in our Berlin Buzzwords ticket shop. If you have already purchased a ticket for Berlin Buzzwords, please contact us for a discount code.
MICES is free. Please register on
