This year we continue the theme of Big Data, Machine Learning, Migration from Java to Scala, as well as topics related to efficient and reliable messaging using Scala, best practices and common pitfalls of using standard and third party libraries, architectural approaches and third party services, live codding sessions, implementing fancy programming patterns in Scala, memory optimization in Scala and safety.
Valerii Veseliak
Valerii Veseliak
Senior BigData Developer
Introduction to scalable Machine Learning pipelines with Apache Spark
Abstract
Apache Spark is famous framework for working with Big Data. In this presentation we will cover some background about main concepts of Apache Spark and Machine Learning. During the presentation we will review common machine learning and statistical algorithms which are implemented in Spark. Also we will talk about how to use Spark to build scalable machine learning pipelines with Spark MLLib.