Subject

Articles

Clustering Data into Groups, Part 2

From Data Science Bookcamp by Leonard Apeltsin

This 3-part article series covers:

Clustering data by centrality
Clustering data by density
Trade-offs between clustering algorithms
Executing clustering using the scikit-learn library
Iterating over clusters using Pandas

Clustering Data into Groups, Part 1

From Data Science Bookcamp by Leonard Apeltsin

This 3-part article series covers:

Clustering data by centrality
Clustering data by density
Trade-offs between clustering algorithms
Executing clustering using the scikit-learn library
Iterating over clusters using Pandas

Building a Memory Game using Unity’s 2D Functionality

From Unity in Action, Third Edition by Joe Hocking

A First asyncio Application

From Python Concurrency with asyncio by Matthew Fowler

This article shows how you might make your first application that leverages asyncio.

Akka’s Actors

From Akka in Action, Second Edition by Francisco Lopez-Sancho Abraham

This article discusses messaging in Akka and its new typed actors.

Big Data is Just a Lot of Small Data: using pandas UDF, part 2

From Data Analysis with Python and PySpark by Jonathan Rioux

This article covers

·         Using pandas Series UDF to accelerate column transformation compared to Python UDF.

·         Addressing the cold start of some UDF using Iterator of Series UDF.

Big Data is Just a Lot of Small Data: using pandas UDF

From Data Analysis with Python and PySpark by Jonathan Rioux

This article covers

·   Using pandas Series UDF to accelerate column transformation compared to Python UDF.

·   Addressing the cold start of some UDF using Iterator of Series UDF.

Dive into clojure.java.io

From Clojure, The Essential Reference by Renzo Borgatti

clojure.java.io contains a collection of functions to simplify the interaction with the Java Input/Output system (or simply IO). Over the years, Java evolved the original InputStream and OutputStream abstractions into Reader and Writer, eventually adding also asynchronous IO. During this transformation, Java put a lot of effort in maintaining backward compatibility, a principle also shared with Clojure. Unfortunately, there are now coexisting IO APIs that impact negatively on usability, forcing Java developers through bridges and adapters to move between different styles of IO.

Parameter Server Pattern: Tagging Entities in 8 Millions of YouTube Videos

From Distributed Machine Learning Patterns by Yuan Tang

In this article, we introduce the parameter server pattern which comes handy for situations where the model is too large to fit in a single machine such as one we would have to build for tagging entities in the 8 millions of YouTube videos.

Fetching Data from the Database

From Data-Oriented Programming by Yehonathan Sharvit

This article explores how data-oriented programming deals with retrieving data from a database.

© 2022 Manning — Design Credits