From Data Science Bookcamp by Leonard Apeltsin
This 3-part article series covers:
Clustering data by centrality
Clustering data by density
Trade-offs between clustering algorithms
Executing clustering using the scikit-learn library
Iterating over clusters using Pandas
From Unity in Action, Third Edition by Joe Hocking
From Python Concurrency with asyncio by Matthew Fowler
This article shows how to build your first application with asyncio.
From Akka in Action, Second Edition by Francisco Lopez-Sancho Abraham
This article discusses messaging in Akka and its new typed actors.
From Data Analysis with Python and PySpark by Jonathan Rioux
This article covers:
· Using pandas Series UDFs to accelerate column transformations compared to Python UDFs.
· Addressing the cold start of some UDFs using Iterator of Series UDFs.
From Clojure, The Essential Reference by Renzo Borgatti
clojure.java.io contains a collection of functions that simplify interaction with the Java input/output (IO) system. Over the years, Java evolved the original InputStream and OutputStream abstractions into Reader and Writer, and eventually added asynchronous IO as well. Throughout this evolution, Java put a lot of effort into maintaining backward compatibility, a principle that Clojure shares. Unfortunately, the result is several coexisting IO APIs that hurt usability, forcing Java developers to use bridges and adapters to move between different styles of IO.
From Distributed Machine Learning Patterns by Yuan Tang
In this article, we introduce the parameter server pattern, which comes in handy when the model is too large to fit on a single machine, such as one we would have to build for tagging entities in 8 million YouTube videos.
From Data-Oriented Programming by Yehonathan Sharvit
This article explores how data-oriented programming deals with retrieving data from a database.