Tag

data

Aggregating Your Data with Spark

From Spark in Action, Second Edition by Jean-Georges Perrin

Function Pipelines for Mapping Complex Transformations

From Mastering Large Datasets with Python by J.T. Wolohan

This article covers

· Using map to do complex data transformations

· Chaining together small functions into pipelines

· Applying these pipelines in parallel on large datasets

Maximise Customer Retention

From Fighting Churn with Data by Carl Gold


slideshare-maximise-customer-retention

The Data Scientist’s Survival Guide

From Build Your Career in Data Science by Emily Robinson and Jacqueline Nolis


slideshare-the-data-scientists-survival-guide

Working with Large Datasets Faster: using the map function

From Mastering Large Datasets by JT Wolohan

This article explores using the map function creatively in a data project.

Beyond Beyond Spreadsheets

Six Questions for Jonathan Carroll, author of Beyond Spreadsheets with R

By Frances Lefkowitz

Jonathan Carroll is a data science consultant providing R programming services. He holds a PhD in theoretical physics.

The Inner Workings of Spark

spark_in_act

From Spark in Action, Second Edition by Jean George Perrin

Develop Superior Machine Learning Algorithms

From Graph-Powered Machine Learning by Alessandro Negro


slideshare-develop-superior-machine-learning-algorithms

Analyzing Stock Price Time Series with Fortran Arrays, Part 2

From Modern Fortran by Milan Curcic

What do Cooking Pasta and Data Science Have in Common?

From Data Science at Scale with Python and Dask by Jesse C. Daniel

This article discusses Dask, how it compares to Apache Spark, and how to create and understand directed acyclic graphs using the example of the delicious Italian pasta dish bucatini all’Amatriciana.

© 2019 Manning — Design Credits