From Azure Storage, Streaming, and Batch Analytics by Richard Nuckolls
This article explores Azure’s tools for data engineering and explains why you should consider using them.
From Mastering Large Datasets with Python by J.T. Wolohan
This article covers
· Using map to do complex data transformations
· Chaining together small functions into pipelines
· Applying these pipelines in parallel on large datasets
From Mastering Large Datasets by J.T. Wolohan
This article explores using the map function creatively in a data project.
From Spark in Action, Second Edition by Jean Georges Perrin
From Spark in Action, 2nd Ed. by Jean Georges Perrin
This is the last in a series of 4 articles on the topic of ingesting data from files with Spark. This section deals with ingesting a TXT file.
By Jean Georges Perrin
This is the third in a series of 4 articles on the topic of ingesting data from files with Spark. This section deals with ingesting an XML file.
By Jean Georges Perrin
This is the second in a series of 4 articles on the topic of ingesting data from files with Spark. This section deals with ingesting a JSON file.
From Spark in Action, 2nd Ed. by Jean Georges Perrin
This is the first in a series of 4 articles on the topic of ingesting data from files with Spark. This section deals with ingesting data from CSV.