Exploring Tagless Final pattern for extensive and readable Scala code
In this post, I will try to share with you all a functional pattern I stumbled upon recently – Tagless Final. This pattern tries to address a vital problem for every software engineer: how to make sure the programs we write are correct?
I will try to explain how Tagless Final works and how it can be applied in practice while keeping things down to earth and as practical as possible. Of course, I didn’t invent it from scratch, but I would like to share what I’ve learned and maybe popularize this solution.
Kudos to Oleg Kiselyov for describing the pattern in depth and John De Goes for inspiring me to write this post.
Let’s get started.
Introducing Tagless Final
Tagless Final allows you to build a subset of the host language which is sound, typesafe, and predictable. When designed properly this subset makes it easy to write correct programs and hard to write incorrect ones. In fact, invalid states can’t be expressed at all! Later, the solution written in the hosted language is safely run to return the value to the hosting language.
For us, Scala will be our hosting language, but the pattern also applies to other ecosystems. Tagless Final can be seen as a composite pattern, it builds on top of or is inspired by many other patterns like type classes or Free monads. So you might spot some similarities.
On a conceptual level, our Scala implementation has 3 distinct parts:
- Language, that defines a subset of operations that the hosted language allows
- Bridges, helpers that express Scala values and business logic in the Language
- Interpreters, dual to Bridges run logic expressed as the Language and get the final value (This is not the official lingo, but I will be using this naming here to make explanations simpler.)
Let’s explore this with a simple example – basic math. You can find the whole code in the repo.
Language is the heart of our DSL. It defines what can we do with the hosted language.
Above we defined a few basic operations. The trait type definitely caught your eye. The
Language is parameterized by a
Wrapper type which itself has an internal type. Both type parameters will be useful for us later.
Wrapper will allow us to change the “package” on which we operate, while the type of the
Wrapper makes sure our API is typesafe.
These methods are called with or return
Wrapper[String] in contrast to those above. The consequence is that you cannot call
toUpper on an Int, which makes the API much harder to use incorrectly. In fact, to combine those two methods sets we would need a method to covert between the types, as the one below:
Fairly standard stuff: one method to create
Wrapper[X] values and a set of operations on those values.
Having a Language in place, we now need to “bridge” the gap between the hosting language (Scala) and our hosted language. Here we will build an interface for a generic bridge to do just that:
As you can see it takes an implicit
Language to build expressions and after calling
apply returns a result wrapped in the desired
Wrapper class. Although he trait might seem confusing, taking a look at some examples hopefully will clear things up:
Our bridges are really simple. They take a single Scala value
number: Int and convert it into an expression in our language. Since we are operating only on given
L we cannot represent incorrect logic, like incrementing a String. This approach for bridges is “fine-grained” – we build only simple expressions. But can easily combine those simple expressions into bigger ones:
or just follow a “coarse grained” approach where we express bigger algorithms straight away
We have defined what our meta-language can do. We expressed our problems in the language. Now it’s time to make it run. For instance with an interpreter like this:
First, we need to define a
Wrapper for us. Here we don’t need anything fancy so we will just operate on plain values. Our
NoWrap will literally default to the type it was parameterized with. After this is done, we simply need to implement our interface. Nothing fancy here.
Note: As David Barri pointed out in his comment below. The
Id type aliases might be tricky to use in production code, due to their eager evaluation and other concerns. Here we stick to it only for simplicity.
Having all 3 items in place we can run them together:
Simple enough, but why should we bother with the
Wrappers in the first place? If you ever played with Free monads you probably know why – to build multiple, specialized interpreters. To give you an example, let’s build another one. This time we will build a utility interpreter that will be helpful in later stages. The new interpreter will pretty-print our math expressions in a Lisp-like syntax, so we can easily spot mistakes in our code.
So far, so good. To reiterate: we defined the possible operations in a form of a
Language trait, we defined helpers to convert Scala values into expressions using
Language, we implemented our meta-language and run the code. Every element above is just plain Scala, but thanks to the way it is composed we achieve a few nice properties.
You are probably familiar with the expression problem, one of the classical concepts in computer science. Long story short: having an interface and a set of implementations of that interface, we want to be able to easily add operations to the interface and new interface implementations.
Ideally, because usually, OOP makes it hard to add interface methods but easy to add implementations. In FP, on the other hand, it’s easy to add new methods, but harder to add implementations.
This is a big deal for every programmer, even though we might not think about it every day. No useful software is static – written once and left unchanged for years. Our software needs to evolve, hence extensibility is an important factor when picking patterns.
Tagless Final finds itself closer to the FP part of the extensibility spectrum. Which is ok in this case. We have tools to easily and safely modify our Language, which (I presume) is the more common operation. Creating interpreters might be harder, I agree, but the cool thing is that you don’t always need to update them. Your change can be made in a non-breaking way (or at least keeping the damage to a minimum).
To explain how this works, let’s go back to the example from before. We had one math operation –
add, but let’s say we wanted another one called
multiply. We could just go and add the new method to
Language. That would be really simple, but unfortunately also caused a ripple effect throughout the system as you would have to update every interpreter that uses the language. Not good. So let’s do something else instead.
Conceptually what we did is we created a child language, which has the whole API of it’s parent, but also few new things. From this point on we can just do the same things we did for the parent.
Define a bridge:
Build an interface:
As you can see it required a bit of writing (could be less if we delegated/inherited in
interpretWithMul the definition). But the good part is that at no point we had to touch the older part – it runs as it did before.
Another interesting property that emerges from this pattern is composability. The ability to combine small parts into bigger, more powerful entities with new properties.
Let’s discuss it in a simple example. Using our original language we were able to express logic like this:
The code compiles just fine, but there’s something we could do better – performance.
10 + (((0 + 1)+1)+1) is not the optimal form for this operation.
10 + (0 + 3) or simply
10 + 3 would be much simpler and more performant.
Of course here the difference is not huge, but if our language would be doing API calls or DB queries the differences would be much bigger. Conceptually we would like to simplify our expression into a more convenient form. Here we will do that naively for one case – flattening many nested
inccalls into a single
But how to do that? The tagless final approach brings us 3 “moving parts”: Language, Bridge, and Interpreter. We cannot add this optimization to the Language itself, as this could mean that our DSL would be in constant flux. And, as we mentioned above, we would prefer not to touch the once-defined language.
Adding the optimization into bridges is plausible, but unfortunately, at call time they don’t know much about the shape we are building. Even worse, users of our language can build their own bridges which makes distributing the improved version harder. We are left with just one place – an interpreter. We will have to figure something out, so our end users will be able to opt-in to the optimization easily when they find it helpful.
One interesting solution to this problem is interpreter composability. To recap: the idea of interpreters is that they take expressions in our custom language and turn that into plain Scala values. So, if we would write a specific interpreter that takes an expression, and turns it into plain Scala, but that result would happen to be a bridge, then we could interpret his output later on.
In short: instead of interpreting into (say) a Scala Int, we will interpret into a rewritten interpretation.
Of course, the code is very simplistic, but you get the idea. During the interpretation process, we can collect more detailed information about the expression and make decisions based on that.
Here’s the output:
As you can see the expression has been rewritten into a different form. Later, this new form can be interpreted as usual.
Having explained the pattern, let’s build something that solves a more typical problem. Our toy example will use Slick to make simple queries to the database.
We will start from the language, where we will define some basic operations on our
Person case class, together with some helper classes to pass data around and make the API more typesafe.
Then we follow the routine that gets us into the Slick interpreter
Our interpreter doesn’t do anything fancy – it keeps the table definitions internally, then between calls state is accumulated in a form of QueryObj being passed around. When we reach the final point –
run, the query is being executed to retrieve the data.
And tada, we used Tagless Final with Slick.
And that’s it. We applied the pattern in Scala. Wrote a code that can be extended and composed with ease.
I hope this post inspired you to take a look at Tagless Final on your own, the same way I was inspired. Thanks for the time and keep on hacking!
- Quark: A Purely-Functional Scala DSL for Data Processing & Analytics
- Typed Tagless Final Interpreters
- From Object Algebras to Finally Tagless Interpreters
- Introduction to Tagless final
Do you like this post? Want to stay updated? Follow us on Twitter or subscribe to our Feed.