Understanding monads — Rahul Goma Phulore

(On 2013.09.23, Aleksander Sumowski and I gave a talk on monads at the Developer Meetup, ThoughtWorks Pune. This post is a transcript of that talk.)

I have come across many people who are somewhat familiar with functional programming but have a hard time understanding the idea of monads. I believe monads are a simple concept, and the intent of this talk is to provide some intuition for them. And we promise not to use burritos in our explanation.

If you have not heard of monads before, this should still be approachable.

I will try to motivate the case for monads with some day-to-day code examples. The language used for the examples is Scala.

Example 1: Presence or absence of a value

You are given a string. You have to look up this string in a map named foo. The length of the value obtained is to be used as a key in another map, named bar, to get the final value.

This is how you might do it:

1
// snippet 1.1
2
import java.{util => ju}
3
import scala.collection.JavaConverters._
4
val fooJ: ju.Map[String, String] = Map("a" -> "xoxox", "b" -> "xoxo").asJava
5

6
val barJ: ju.Map[Int, String] = Map(4 -> "P", 5 -> "Q").asJava
7

8
def calc(s: String): String = {
9
  val x = fooJ.get(s)
10
  val y = barJ.get(x.length)
11
  y
12
}

But wait. This code is incorrect. What if the key is missing in the first map itself? In that case the map will return null, leading to a NullPointerException in the next operation.

Clearly we need something that guards us against null, so we add an if check.

1
// snippet 1.2
2
def calc(s: String): String = {
3
  val x = fooJ.get(s)
4
  val y = if (x != null) {
5
    barJ.get(x.length)
6
  } else {
7
    null
8
  }
9
  y
10
}

What was wrong with snippet 1.1 was that the two operations were sequenced wrongly. There were some additional effects necessary in between, and they were missing.

It would be nice if we could write code whose shape is like that of snippet 1.1 but which behaves like the code in snippet 1.2.

Example 2: Exceptions

You have to look up a couple of keys in a JSON object and then concatenate the obtained values with a space in between. Again, either or both keys might be absent, thus leading to an exception. In such an event, the exception should bubble up to the caller.

This is how you might do it:

1
// snippet 2.1
2
import play.api.libs.json._
3

4
def calc(json: JsObject): String = {
5
  val name = (json \ "name").as[String]
6
  val address = (json \ "address").as[String]
7
  name + " " + address
8
}

This code works as expected. The necessary sequencing, or effects, here are provided by a first-class language feature called exceptions. Imagine a language without exceptions.

What might you do in such a language? Perhaps send a tuple of error code and result, or some equivalent structure.

Is it possible to write code whose shape and behaviour are like that of snippet 2.1, but which does not use first-class exceptions?

Example 3: Futures and promises

A future is an abstraction of a value that will be available at some point in time. This is a concurrency abstraction invented in the late 1970s but popularised by Java and JavaScript. Scala’s variation is a bit better.

Consider a simple, unrealistic task. We need to make two web service calls. Something mandates that the second call must be made after the first one is over. Then, after obtaining both results, you sum the lengths of the response bodies.

In a good old serial, blocking, non-future API, this might look like this:

1
// snippet 3.1
2
def calc(s: String): Int = {
3
  val googleResponse = makeWebServiceCall("https://www.google.com/#q=" + s)
4
  val bingResponse = makeWebServiceCall("http://www.bing.com/search?q=" + s)
5
  googleResponse.body.length + bingResponse.body.length
6
}

Now let us rewrite this with an API that uses futures. How are we going to sequence operations in such an API? The popular solution seems to be callbacks. With such an approach the code might look like this:

1
// snippet 3.2
2
import concurrent.{Promise, Future}
3
import play.api.libs.ws.WS
4

5
def calc(s: String): Future[Int] = {
6
  val promise = Promise[Int]()
7
  val googleResponse = WS.url("https://www.google.com/#q=" + s).get()
8
  googleResponse onSuccess { case res1 =>
9
    val bingResponse = WS.url("http://www.bing.com/search?q=" + s).get()
10
    bingResponse onSuccess { case res2 =>
11
      promise.success(res1.body.length + res2.body.length)
12
    }
13
  }
14
  promise.future
15
}

The onSuccess here provides the necessary effects in between the operations.

This code is fairly straightforward, but it has the following problems:

It exposes Promise, an implementation detail of futures.
The onSuccess plus promise pattern gets repetitive fast and soon becomes an eyesore.

It would be nice if we could write code whose shape is like that of our serial code but which has future-like semantics.

Enter Type-Driven Development

Let me introduce you to a style of programming popular in statically typed functional languages: Type-Driven Development, or TyDD.

In TyDD, you encode as much semantic information as possible in your types and values. Of course there is a lot more to TyDD than that, but it is not the subject of this talk.

Let us revisit our examples and try applying TyDD to them.

Revisiting example 1

Scala has a data type named Option, which is a sum type that encodes presence or absence semantics. Languages with different type systems might use different encodings. It is roughly defined as:

1
// snippet 1.3
2
sealed abstract class Option[+A]
3
case class Some[+A](value: A) extends Option[A]
4
case object None extends Option[Nothing]

As you would expect, calling get on Scala’s Map returns an Option value. With Scala maps, our code might look like this:

1
// snippet 1.4
2
val fooS: Map[String, String] = Map("a" -> "xoxox", "b" -> "xoxo")
3

4
val barS: Map[Int, String] = Map(4 -> "P", 5 -> "Q")
5

6
def calc(s: String): Option[String] = {
7
  fooS.get(s) match {
8
    case Some(x) => barS.get(x.length)
9
    case None => None
10
  }
11
}

One advantage of the TyDD approach should be immediately clear by now: the compiler will not allow you to treat Option[A] as A, and so you cannot inadvertently write incorrect code like snippet 1.1. You are forced to deal with optionality explicitly.

But all that sequencing with pattern matching looks yucky, and it will only get yuckier with every additional operation. Wouldn’t it be nice if the data type in question could take care of that sequencing itself?

As it happens, Option does have a method that takes care of this sequencing. The method is called flatMap, and when we use it, the code looks like this:

1
// snippet 1.5
2
def calc(s: String): Option[String] = {
3
  fooS.get(s) flatMap { x =>
4
    barS.get(x.length) flatMap { y =>
5
      Some(y)
6
    }
7
  }
8
}

In the innermost block, you need to put the value back in Option. Some is the data constructor we use for that.

If you compare this code to snippet 1.1, you will notice that their shape is similar. The code is somewhat inverted along the vertical axis, but as you will soon see, there is a fix for that.

Revisiting example 2

Scala has a data type named Try that encodes success and failure semantics. It is roughly defined as:

1
// snippet 2.2
2
sealed abstract class Try[+A]
3
case class Success[+A](value: A) extends Try[A]
4
case class Failure(throwable: Throwable) extends Try[Nothing]

Let us imagine JsObject has a method named asTry[A] that returns Try[A]. With that, the code would look like this:

1
// snippet 2.3
2
import play.api.libs.json._
3
import util.{Try, Success, Failure}
4

5
def calc(json: JsObject): Try[String] = {
6
  (json \ "name").asTry[String] match {
7
    case Success(name) =>
8
      (json \ "address").asTry[String] match {
9
        case Success(address) => Success(name + " " + address)
10
        case Failure(ex) => Failure(ex)
11
      }
12
    case Failure(ex) => Failure(ex)
13
  }
14
}

That is a fair amount of boilerplate.

As you might expect at this point, Try also has a method named flatMap that helps you do away with this boilerplate. This is how the code looks with flatMap:

1
// snippet 2.4
2
import play.api.libs.json._
3
import util.{Try, Success, Failure}
4

5
def calc(json: JsObject): Try[String] = {
6
  (json \ "name").asTry[String] flatMap { name =>
7
    (json \ "address").asTry[String] flatMap { address =>
8
      Success(name + " " + address)
9
    }
10
  }
11
}

In the innermost block, you need to put the value in Try, and we use the Success data constructor for that.

This code is very similar in shape to the code in snippet 2.1.

Revisiting example 3

Our results are already wrapped in Future, so we are already using a distinct type to encode the semantics.

Now Future also happens to have a flatMap method. Let us rewrite the code in snippet 3.2 using it.

1
// snippet 3.3
2
import concurrent.{Promise, Future}
3
import play.api.libs.ws.WS
4

5
def calc(s: String): Future[Int] = {
6
  WS.url("https://www.google.com/#q=" + s).get() flatMap { googleResponse =>
7
    WS.url("http://www.bing.com/search?q=" + s).get() flatMap { bingResponse =>
8
      Future.successful(googleResponse.body.length + bingResponse.body.length)
9
    }
10
  }
11
}

In the innermost block, we create a future that is already completed with our value.

Again, it is similar in shape to the code in snippet 3.1.

Monad comprehensions

As we have seen, flatMap allows our code to have a simple shape, with the details of sequencing abstracted away. Now monads are so common in functional programming that functional languages often provide a syntactic sugar on top of them which makes the inversion along the vertical axis go away. This sugar is called monad comprehensions. Scala calls them for-comprehensions. The similarity to for loops is superficial and should be ignored.

Here is how our code snippets might look once we start using this notation:

1
// snippet 1.6
2
def calc(s: String): Option[String] = {
3
  for {
4
    x <- fooS.get(s)
5
    y <- barS.get(x.length)
6
  } yield y
7
}
8

9
// snippet 2.5
10
import play.api.libs.json._
11
import util.{Try, Success, Failure}
12

13
def calc(json: JsObject): Try[String] = {
14
  for {
15
    name <- (json \ "name").asTry[String]
16
    address <- (json \ "address").asTry[String]
17
  } yield name + " " + address
18
}
19

20
// snippet 3.4
21
import concurrent.{Promise, Future}
22
import play.api.libs.ws.WS
23

24
def calc(s: String): Future[Int] = {
25
  for {
26
    googleResponse <- WS.url("https://www.google.com/#q=" + s).get()
27
    bingResponse <- WS.url("http://www.bing.com/search?q=" + s).get()
28
  } yield googleResponse.body.length + bingResponse.body.length
29
}

These are even closer in appearance to snippets 1.1, 2.1, and 3.1 respectively.

Note that this is just syntactic sugar and compiles down, roughly, to the code we wrote before with flatMap.

So what is a monad?

A monad is essentially a two-method interface, which can be defined as follows:

1
// snippet 4.1
2
trait Monad[M[_]] {
3
  def flatMap[A, B](x: M[A], f: A => M[B]): M[B]
4
  def point[A](x: A): M[A]
5
  def map[A, B](x: M[A], f: A => B): M[B] = flatMap(x, a => point(f(a)))
6
}

I am using the word interface here in its generic sense, not in the OO sense. A more specific term for this abstraction mechanism would be type class, and you can learn more about it here.

About the methods:

flatMap: we already talked about it.
point: it is basically a method that allows you to put a value in the monad’s context in the innermost block.
map: it is defined by default in terms of flatMap and point, and you can override it if a more performant implementation specific to the data type is possible.

As an example, here is a monad implementation for Option:

1
// snippet 4.2
2
implicit object OptionMonad extends Monad[Option] {
3
  def flatMap[A, B](x: Option[A], f: A => Option[B]): Option[B] = x match {
4
    case Some(a) => f(a)
5
    case None => None
6
  }
7

8
  def point[A](x: A): Option[A] = Some(x)
9

10
  override def map[A, B](x: Option[A], f: A => B): Option[B] = x match {
11
    case Some(a) => Some(f(a))
12
    case None => None
13
  }
14
}

The implementation of a Monad type class also needs to obey some laws, which we will not go into here.

Intuition for monads

You use the monad abstraction when:

You need customised sequencing for operations, that is, you need additional effects in between computations and want them abstracted away.
You need to abstract over the sequencing details, that is, write code that is parameterised over M, some monad, and plug in a specific monad instance as necessary. This technique is used in Precog code to good effect.

Someone once described monads as something that lets you overload the semicolon. Since Scala does not use semicolons at the end of statements or expressions, the analogy is perhaps a little less vivid here, but the basic idea still holds.

Other monads

We have covered only three monads here, Option, Try, and Future, but there are many more:

Seq for non-determinism.
Either for binary type disjunction, often used for error handling.
Reader for an implicit context passed around from which values can be read.
Writer for logging.
State for stateful computations.
ST for localised mutation.
Undo for the ability to undo and redo operations.

Kleisli composition

Given two functions f: A => B and g: B => C, it is fairly trivial to compose them together. Here is one way to write such a function:

1
// snippet 5.1
2
def compose[A, B, C](f: A => B, g: B => C): A => C =
3
  a => g(f(a))

Now what happens when I have two effectful functions instead, say f: A => M[B] and g: B => M[C]? Composing these is not trivial, because the output type of f and the input type of g do not match. We need a new composition function with a signature like this:

1
// snippet 5.2
2
def mcompose[A, B, C, M[_]](f: A => M[B], g: B => M[C]): A => M[C] =
3
  ???

It is not possible to write this generically without knowing something about M. How the operations are composed depends on the structure of M, so we let M tell us how to wire them up:

1
// snippet 5.3
2
def mcompose[A, B, C, M[_]](f: A => M[B], g: B => M[C])(implicit e: Monad[M]): A => M[C] =
3
  a => {
4
    val mb = f(a)
5
    e.flatMap(mb, { b =>
6
      val mc = g(b)
7
      e.flatMap(mc, { c =>
8
        e.point(c)
9
      })
10
    })
11
  }

This kind of composition is called Kleisli composition and is often denoted with the symbol >=>.

Monads are not alone

I hope this talk helped you gain some intuition for what a monad is and when to use this abstraction.

As it happens, monads are not alone. There is a whole family of type classes that let you abstract over computational patterns. Some key examples include Functor, Applicative, MonadPlus, Arrow, and Alternative. If you find monads interesting or useful, I recommend exploring the rest of the family as well. The best medium for doing that is probably Haskell, and here is a good book for learning the language.

Some more notes

The names flatMap and point are not standard, and different languages use different names. Some other names for flatMap are >>=, bind, m-bind, and SelectMany. Some other names for point are pure and return.
Scala’s Try violates certain monad laws and is therefore not quite a monad. However, those details are irrelevant to the basic nature of this talk. I could have used Either, but its implementation would require touching on too many tangential ideas.
A question often asked is whether monads are possible or useful in dynamic languages, given how centred they are around types. Yes, they are possible in dynamic languages, but much of their utility is lost, and they require a somewhat different encoding. I intend to blog about this separately.
Scala’s standard library does not have a Monad type class. How do its for-comprehensions work then? The answer is that the Scala compiler blindly translates them to flatMap and map method calls on that data type. Yes, map, not point. This approach has its pros and cons. Nevertheless, a Monad type class can be useful in many cases, and there are libraries that provide it.
If you want to understand monads more thoroughly, here is a very good tutorial. It is fairly long, but worth the time if the topic clicks for you.
Monad comprehension sugar is neat, but it still leaves much to be desired. People have taken the idea further in many different directions and come up with better syntaxes, such as this one.