What Happens behind the Scenes with Spark

From Spark with Java by Jean Georges Perrin

You’ve probably seen a simple use case where Spark ingests data from a CSV file, performs a simple operation, and then stores the result in a database. In this article, you’re going to see what happens behind the scenes.

Using Apache Spark with Java


From Spark with Java
By Jean Georges Perrin

Getting up and Running with Spark


From Spark in Motion
By Jason Kolter

Running Spark: an overview of Spark’s runtime architecture

From Spark in Action by Petar Zečević and Marko Bonaći

When talking about Spark runtime architecture, we can distinguish the specifics of various cluster types from the typical Spark components shared by all. Here we describe typical Spark components that are the same regardless of the runtime mode you choose.


Spark in Action: The Notion of Resilient Distributed Dataset (RDD)

By Marko Bonaći and Petar Zečević

In this article, excerpted from Spark in Action, we talk about RDD, the fundamental abstraction in Spark.

How to start developing Spark applications in Eclipse

By Marko Bonaći, author of Spark in Action
In this article, you will learn to write Spark applications using Eclipse, the most widely used development environment for JVM-based languages.

How to start developing Spark applications in Eclipse (PDF)

What’s the Advantage of a Property Graph vs. RDF graph?

From Spark GraphX in Action

© 2018 Manning