Tag

spark

Consuming records with Spark

From Spark in Action, Second Edition by Jean Georges Perrin

This article explores consuming records in files with Spark.

Aggregating Your Data with Spark

From Spark in Action, Second Edition by Jean-Georges Perrin

This article teaches you how to perform an aggregation using Apache Spark. You first look at the definition of an aggregation. You may already know and use aggregations in your job, and this might be a reminder for you. If this is the case, you can safely skim through it: Apache Spark’s aggregations are standard. The second part of this section shows you how to transform a SQL aggregation statement to Spark.

The Inner Workings of Spark

spark_in_act

From Spark in Action, Second Edition by Jean George Perrin

Ingesting Data from Files with Spark, Part 3

By Jean Georges Perrin

This is the third in a series of 4 articles on the topic of ingesting data from files with Spark. This section deals with ingesting a XML file.

Ingesting Data from Files with Spark, Part 2

By Jean Georges Perrin This is the second in a series of 4 articles on the topic of ingesting data from files with Spark. This section deals with ingesting a JSON file.

Ingesting Data from Files with Spark, Part 1

From Spark in Action, 2nd Ed. by Jean Georges Perrin

This is the first in a series of 4 articles on the topic of ingesting data from files with Spark. This section deals with ingesting data from CSV.

The Majestic Role of the Dataframe in Spark

From Spark with Java by Jean Georges Perrin

In this article, you’ll learn what a dataframe is, how it’s organized, and about immutability.

Build a Full-Featured Data Solution


slideshare-build-a-full-featured-data-solution

From Fusion in Action by Guy Sperry

What Happens behind the Scenes with Spark

From Spark with Java by Jean Georges Perrin

You’ve probably seen a simple use-case where Spark ingests data from a CSV file, then performs a simple operation, and then stores the result in the database. In this article, you’re going to see what happened behind the scenes.

Using Apache Spark with Java

From Spark in Action, Second Edition by Jean-Georges Perrin

© 2019 Manning — Design Credits