NewCircle Developer Stream
Stream is a constantly updated source of free, educational content on open source development. Also, check out our bookshelf for in-depth tutorials.
In this article, I am going to show how to identify some common Spark issues the easy way: by looking at a particularly informative graphical report that is built into the Spark Web UI – the Web UI Stage Detail view.
A close look at the ways Spark ML models can be put into production, which patterns work best in which situations, and why.
Learn some key performance patterns and anti-patterns that will help you get the most out of Spark 2.0.
A hands-on tutorial using Spark SQL and DataFrames to retrieve insights and visualizations from datasets published by the City of San Francisco.
In this intermediate-level tutorial, I'll address the question of which Apache Spark APIs to use, with a series of brief technical explanations and demos that highlight best practices, latest APIs, and new features in Spark 2.0.
In this tour from QCon SF, I’ll show you Spark's ability to rapidly process Big Data. I'll demonstrate extracting information with RDDs, querying data using DataFrames, and visualizing and plotting data, and I'll show you how to create a machine-learning pipeline with Spark ML and MLlib. We'll also discuss the internals that make Spark 10-100 times faster than Hadoop MapReduce and Hive.
Video covering Spark Streaming from my presentation at the Philly Area Scala Meetup.
Video and slides from my full-day Apache Spark training workshop at Spark Summit 2015.
Today, according to Dean Wampler, Scala has successfully taken over the Big Data world. This is a talk about why.
Let's understand the basics of how Hadoop and HDFS work, with the help of one of our favorite childhood toys.
The data analysis framework Spark may play a big role in the future of development. This talk introduces you to five projects related to big data and the framework.