Entries by Marcin Gorczyński

Introduction to Machine Learning with Spark and MLlib (DataFrame API)

Introduction to Machine Learning with Spark and MLlib (DataFrame API) A pretty hot topic lately is machine learning – the inter-sectional discipline closely related to computational statistics that let’s computers learn without being explicitly programmed. It has found to be of significant use in the field of data analytics – from estimating loan and insurance […]

Introduction to Streams in Akka

A very common scenario in many kinds of software is when the input data is potentially unlimited and it can appear at arbitrary intervals. The common way of handling such cases is using the Observer pattern in it’s imperative form – callbacks. But this approach creates what’s commonly called “Callback Hell”. It’s a concept basically […]

Handling failure using Xor and Validated data types

Introduction Any application sooner or later will fail. Imperative style programming usually handles this using side-effects by propagating exceptions and handling them later on. This approach introduces statefulness and deferring the error to outer bounds of the application. This creates hidden control-flow paths, that are difficult to reason about and debug properly when the code […]

Handling Split Brain scenarios with Akka

When operating an Akka cluster the developer must consider how to handle network partitions (Split Brain scenarios) and machine crashes. There are multiple strategies to handle such erratic behavior and, after a deeper explanation of the problem we are facing, I will try to present them along with their pros and cons using the Split […]