From Spark in Action, Second Edition by Jean-Georges Perrin

Take 42% off the entire book. Just enter code slperrinĀ into the discount code box at checkout at


When you’re doing analytics on big data systems, it can be a challenge to efficiently query, stream, filter, and consolidate the data distributed across a cluster, network, or cloud system. Built especially for efficiently operating over large distributed datasets, the Spark data processing engine makes handling that data so much easier! Spark’s Java APIs provide an easy-to-use interface, near-limitless upgrade potential, and performance you’ve dreamed about all using the Java programming skills you already have! Learn more in the slide deck below.