Spark Summit 2018 — 10 Key Talks
In San Francisco I attended the Spark Summit 2018. It’s a candy store for data engineers and scientists, because Apache Spark
is a powerful open-source cluster-computing framework. It enables my
colleagues and me to solve complex machine learning problems at RTL.
Spark splits jobs into multiple tasks, which results in ‘embarrassingly parallelism’.
Originally it was developed by Matei Zaharia around the corner at the
University of California, Berkeley. Now it is maintained by Apache, with
Databricks as an important contributor.
Spark
is popular, proven by a record number of 4000 attendees. Actually the
event was called the Spark+AI summit this year, but let’s skip the
buzzwords before it’s called the
Spark+AI+Blockchain+AutonomousVehicles+… Summit.
The
Moscone Center was the theatre of 191 talks in 2 days. Creating a
classic fear of missing out. Luckily, all talks are recorded and
available online. It was hard to compare many great talks, but these are my favourite ten, in alphabetical order.
Keep reading