Subject

Big Data

The Majestic Role of the Dataframe in Spark

From Spark with Java by Jean Georges Perrin

In this article, you’ll learn what a dataframe is, how it’s organized, and about immutability.

What Are GANs?

By Vladimir Bok, author of GANs in Action

This article discusses the history and meaning of Generative Adversarial Networks, and their potential.

Build a Full-Featured Data Solution


slideshare-build-a-full-featured-data-solution

From Fusion in Action by Guy Sperry

What Happens behind the Scenes with Spark

From Spark with Java by Jean Georges Perrin

You’ve probably seen a simple use-case where Spark ingests data from a CSV file, then performs a simple operation, and then stores the result in the database. In this article, you’re going to see what happened behind the scenes.

Privacy, Twitter, and Machine Learning: six questions with Andrew Trask

Privacy, Twitter, and Machine Learning

Andrew Trask, author of Grokking Deep Learning

By Frances Lefkowitz, Manning Development Editor

Andrew Trask is a researcher pursuing a Doctorate at Oxford University, where he focuses on Deep Learning with an emphasis on human language. He is also a leader at OpenMined.org, an open-source community of researchers and developers working on creating free and accessible tools for secure AI. Previously, Andrew was analytics product manager at Digital Reasoning, where he trained the world’s largest artificial neural network (with over 160 billion parameters) and helped guide the analytics for the Synthesys cognitive computing platform, which tackles problems in government intelligence, finance, and healthcare. Grokking Deep Learning is his first book.

Find Andrew online at his blog (iamtrask.github.io) and @iamtrask on Twitter.

PyTorch Crash Course, Part 1

From Deep Learning with PyTorch by Eli Stevens and Luca Antiga

This article introduces you to PyTorch and discusses why you might want to use it in your deep learning projects.

Crunching Data with Dask


slideshare-crunching-data-with-dask

From Data Science at Scale with Python and Dask
By Jesse C. Daniel

Streaming Data with KSQL

From Kafka Streams in Action by Bill Bejeck

This article discusses KSQL, a brand-new open source, Apache 2.0 streaming SQL engine that enables stream processing with Kafka. Basically, it makes it easy to read, write, and process streaming data in real-time, at scale, using SQL-like semantics.

Using Apache Spark with Java


slideshare-using-apache-spark-with-java

From Spark with Java
By Jean Georges Perrin

Getting up and Running with Spark


slideshare-getting-up-and-running-with-spark

From Spark in Motion
By Jason Kolter

© 2018 Manning — Design Credits