Tag

big data

Aggregating Your Data with Spark

From Spark in Action, Second Edition by Jean-Georges Perrin

Function Pipelines for Mapping Complex Transformations

From Mastering Large Datasets with Python by J.T. Wolohan

This article covers

· Using map to do complex data transformations

· Chaining together small functions into pipelines

· Applying these pipelines in parallel on large datasets

Maximise Customer Retention

From Fighting Churn with Data by Carl Gold


Working with Large Datasets Faster: using the map function

From Mastering Large Datasets with Python by J.T. Wolohan

This article explores using the map function creatively in a data project.

Modern Data Solutions with Python

From Mastering Large Datasets with Python by John T. Wolohan

The Inner Workings of Spark

From Spark in Action, Second Edition by Jean-Georges Perrin

Ingesting Data from Files with Spark, Part 4

From Spark in Action, 2nd Ed. by Jean-Georges Perrin

This is the last in a series of 4 articles on the topic of ingesting data from files with Spark. This section deals with ingesting a TXT file.

Ingesting Data from Files with Spark, Part 3

By Jean-Georges Perrin

This is the third in a series of 4 articles on the topic of ingesting data from files with Spark. This section deals with ingesting an XML file.

Ingesting Data from Files with Spark, Part 2

By Jean-Georges Perrin

This is the second in a series of 4 articles on the topic of ingesting data from files with Spark. This section deals with ingesting a JSON file.

Ingesting Data from Files with Spark, Part 1

From Spark in Action, 2nd Ed. by Jean-Georges Perrin

This is the first in a series of 4 articles on the topic of ingesting data from files with Spark. This section deals with ingesting data from CSV.

© 2019 Manning