Tag

big data

Why Choose Azure for Data Engineering?

From Azure Data Engineering by Richard Nuckolls

This article delves into Azure’s tools for data engineering and why you should consider using them.

Aggregating Your Data with Spark

From Spark in Action, Second Edition by Jean-Georges Perrin

This article teaches you how to perform an aggregation using Apache Spark. You first look at the definition of an aggregation. You may already know and use aggregations in your job, so this might be a reminder; if so, you can safely skim through it, because Apache Spark’s aggregations are standard. The second part of the article shows you how to translate a SQL aggregation statement into Spark.

Function Pipelines for Mapping Complex Transformations

From Mastering Large Datasets with Python by J.T. Wolohan

This article covers

· Using map to do complex data transformations

· Chaining together small functions into pipelines

· Applying these pipelines in parallel on large datasets

Maximise Customer Retention

From Fighting Churn with Data by Carl Gold



Working with Large Datasets Faster: using the map function

From Mastering Large Datasets with Python by J.T. Wolohan

This article explores using the map function creatively in a data project.

Modern Data Solutions with Python

From Mastering Large Datasets with Python by John T. Wolohan


The Inner Workings of Spark


From Spark in Action, Second Edition by Jean-Georges Perrin

Ingesting Data from Files with Spark, Part 4

From Spark in Action, Second Edition by Jean-Georges Perrin

This is the last in a series of 4 articles on the topic of ingesting data from files with Spark. This section deals with ingesting a TXT file.

Ingesting Data from Files with Spark, Part 3

By Jean-Georges Perrin

This is the third in a series of 4 articles on the topic of ingesting data from files with Spark. This section deals with ingesting an XML file.

Ingesting Data from Files with Spark, Part 2

By Jean-Georges Perrin

This is the second in a series of 4 articles on the topic of ingesting data from files with Spark. This section deals with ingesting a JSON file.

© 2019 Manning