big data

Managing Data Sources in Machine Learning

From Graph-Powered Machine Learning by Alessandro Negro

This article discusses managing data in graph-powered machine learning projects.

Creating a Bipartite Graph for a User-Item Dataset

By Graph-Powered Machine Learning Alessandro Negro

This article discusses creating a bigraph for a user-item dataset.

Processing Covid-19 Data with Apache Spark

In this video, Jean-Georges showcases how to use JHU data to predict new Covid-19 cases using Apache Spark.

Why Choose Azure for Data Engineering?

From Azure Storage, Streaming, and Batch Analytics by Richard Nuckolls

This article delves into Azure’s tools for data engineering and why you should consider using them.

Function Pipelines for Mapping Complex Transformations

From Mastering Large Datasets with Python by J.T. Wolohan

This article covers

· Using map to do complex data transformations

· Chaining together small functions into pipelines

· Applying these pipelines in parallel on large datasets

Maximise Customer Retention

From Fighting Churn with Data by Carl Gold

Working with Large Datasets Faster: using the map function

From Mastering Large Datasets by JT Wolohan

This article explores using the map function creatively in a data project.

Modern Data Solutions with Python

From Mastering Large Datasets with Python by John T. Wolohan




The Inner Workings of Spark


From Spark in Action, Second Edition by Jean George Perrin

Ingesting Data from Files with Spark, Part 4

From Spark in Action, 2nd Ed. by Jean Georges Perrin

This is the last in a series of 4 articles on the topic of ingesting data from files with Spark. This section deals with ingesting a TXT file.

© 2021 Manning — Design Credits