I have been using Slick on and off since 2011, when it was called ScalaQuery, and it remains my favourite database abstraction library to this day.

We at DigitalGenius are in the process of migrating a codebase that made heavy use of Doobie to Slick. With Doobie, you end up throwing out type safety and doing stringly typed programming. Since Doobie queries are essentially strings, your best shot at abstraction is concatenating string fragments. Our tables at DG follow many common patterns in how they store and retrieve data, but with mere strings at your disposal, it is terribly error-prone and often impossible to abstract over those patterns. I have expressed my displeasure with this sort of thing before.

Slick is different. It uses Scala’s abstraction mechanisms, both term and type level, to faithfully represent the SQL layer while avoiding the failings of ORMs. This allows you to compose, abstract, and model things in a typeful fashion.

We have been delighted with the migration overall, but we have stumbled into some problems along the way: bugs, unimplemented features, and jarring innards. This post covers one of the more interesting patterns that came out of that work: a way to make Slick table updates composable without giving up type safety.

Before we begin, let’s get the setup out of the way.

Setup

We will be using PostgreSQL version 9.6, Scala version 2.12.6, Slick version 3.2.2, and Slick-PG version 0.16.0.

Start a PostgreSQL instance, and create the following table in your database.

CREATE TABLE "employees" (
  "id"         TEXT    PRIMARY KEY,
  "name"       TEXT    NOT NULL,
  "department" TEXT    NOT NULL,
  "age"        INTEGER NOT NULL,
  "salary"     INTEGER NOT NULL
);

We will be using the following skeletons for our Scala code. In subsequent code snippets, we will only pull out the relevant sections from them.

package dbinfra

object MyPostgresProfile {
  // Copy this from Slick-PG README
}

object SlickExtensions {
  import MyPostgresProfile.api._
  // ...
}

package usage

import dbinfra._
import MyPostgresProfile.api._

final case class Employee(
  id: String,
  name: String,
  department: String,
  age: Int,
  salary: Int
)

final class EmployeesTable(tag: Tag) extends Table[Employee](tag, "employees") {
  def id         = column[String]("id")
  def name       = column[String]("name")
  def department = column[String]("department")
  def age        = column[Int]("age")
  def salary     = column[Int]("salary")

  def * = (id, name, department, age, salary) <> (Employee.tupled, Employee.unapply)
}

object EmployeesTable extends TableQuery(new EmployeesTable(_))

final class EmployeeRepository {
  // ...
}

In a real codebase, you would probably use UUIDs for IDs, newtypes rather than naked strings for textual fields, enums for departments, and so on. Since none of that is relevant to this post, we stick with plain strings here. We will take more such shortcuts in the code snippets that follow to avoid losing focus.

We will be testing this code out at the REPL. The following prelude will be needed.

// Prelude for a REPL session
import MyPostgresProfile.api._
import SlickExtensions._
import dbinfra._
import usage._
import scala.concurrent.duration._
import scala.concurrent.Await
import scala.concurrent.ExecutionContext.Implicits.global
val db = Database.forDriver(new org.postgresql.Driver, "jdbc:postgresql://localhost:54321/db-name", "user-name", "password")
def runDB[A](action: DBIO[A]): A = Await.result(db.run(action), 1.minute)

Problem

Imagine you are implementing a PATCH request for a simple domain entity, something that maps to both a REST resource and an SQL representation almost as-is. In such a case, your repository layer might have an .update method that looks something like this:

final class EmployeeRepository {
  def update(
    id: String,
    name: Option[String],
    department: Option[String],
    age: Option[Int],
    salary: Option[Int],
  ): DBIO[Int] = {
    ???
  }
}

The id will be used for a lookup, and then the other fields will be set to the provided values, if Some, or left unaltered otherwise.
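Before wiring in the database, it helps to pin down the intended semantics. Here is an illustration-only helper expressing the same "Some means overwrite, None means keep" rule on the in-memory model (the Employee case class is repeated so the snippet stands alone):

```scala
final case class Employee(
  id: String,
  name: String,
  department: String,
  age: Int,
  salary: Int
)

// Illustration only: what .update should do, phrased as a pure function.
def patched(
  e: Employee,
  name: Option[String],
  department: Option[String],
  age: Option[Int],
  salary: Option[Int]
): Employee =
  e.copy(
    name       = name.getOrElse(e.name),             // Some => overwrite
    department = department.getOrElse(e.department), // None => keep as-is
    age        = age.getOrElse(e.age),
    salary     = salary.getOrElse(e.salary)
  )
```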

Non-solution

This is how you normally update a single field with Slick:

scala> runDB { EmployeesTable += Employee("x6", "Rahul", "engg", 28, 1000) }
res19: Int = 1
scala> runDB { EmployeesTable.filter(_.id === "x6").map(_.name).update("Luffy") }
res20: Int = 1
scala> runDB { EmployeesTable.filter(_.id === "x6").result }
res21: Seq[usage.EmployeesTable#TableElementType] = Vector(Employee(x6,Luffy,engg,28,1000))

And this is how you normally update multiple fields, all at once:

scala> runDB { EmployeesTable.filter(_.id === "x6").map(r => (r.name, r.department)).update(("Ruhi", "product")) }
res23: Int = 1
scala> runDB { EmployeesTable.filter(_.id === "x6").result }
res24: Seq[usage.EmployeesTable#TableElementType] = Vector(Employee(x6,Ruhi,product,28,1000))

If you want to update more than two fields, you can do it in one of the following two ways, and more still by defining custom Shapes.

scala> runDB { EmployeesTable.filter(_.id === "x6").map(r => (r.name, r.department, r.age)).update(("Ruhi", "product", 29)) }
res0: Int = 1
scala> runDB { EmployeesTable.filter(_.id === "x6").result }
res1: Seq[usage.EmployeesTable#TableElementType] = Vector(Employee(x6,Ruhi,product,29,2000))
scala> runDB { EmployeesTable.filter(_.id === "x6").map(r => (r.name, (r.department, r.age))).update(("Ruhi", ("product", 29))) }
res2: Int = 1
scala> runDB { EmployeesTable.filter(_.id === "x6").result }
res3: Seq[usage.EmployeesTable#TableElementType] = Vector(Employee(x6,Ruhi,product,29,2000))

The first approach is syntactically nicer, while the second one is more compositional and therefore easier to abstract over. Scala 3 might combine the two: the former syntax with the latter representation.

In our case, each update is conditional on whether a new value was supplied. So we cannot pass them all at once as shown in the examples above; we must be able to stack them one after another. In this respect, Slick update queries do not compose.

scala> runDB { EmployeesTable.filter(_.id === "x6").map(_.name).update("Luffy").map(_.department).update("engg") }
<console>:33: error: value department is not a member of Int
runDB { EmployeesTable.filter(_.id === "x6").map(_.name).update("Luffy").map(_.department).update("engg") }
^

This happens because when you invoke .update, Slick already turns your SQL Query into a DBIO, making further query modifications impossible.
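Writing out the types at each step makes the breakage visible. This is a sketch that assumes the imports from the Setup section are in scope:

```scala
// Still in Query-land: composable, modifiable.
val query: Query[EmployeesTable, Employee, Seq] =
  EmployeesTable.filter(_.id === "x6")

val nameColumn: Query[Rep[String], String, Seq] =
  query.map(_.name)

// .update leaves Query-land entirely: the result is a database action.
val action: DBIO[Int] = nameColumn.update("Luffy")

// From here on, .map is DBIO's map — it transforms the Int row count,
// not the query, so there is no .department left to select.
```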

One way to do what we want with the available machinery is to pattern-match over all possible combinations of given update values, as shown below.

scala> val id = "x6"
id: String = x6
scala> val name: Option[String] = Some("Ruhi")
name: Option[String] = Some(Ruhi)
scala> val department: Option[String] = None
department: Option[String] = None
scala> val age: Option[Int] = Some(27)
age: Option[Int] = Some(27)
scala> val salary: Option[Int] = Some(1500)
salary: Option[Int] = Some(1500)
scala> val employeeWithGivenId = EmployeesTable.filter(_.id === id)
scala> (name, department, age, salary) match {
     |   case (None, None, None, None) => employeeWithGivenId.result.map(_ => 0)
     |   case (Some(name), None, None, None) => employeeWithGivenId.map(_.name).update(name)
     |   case (Some(name), Some(department), None, None) => employeeWithGivenId.map(e => (e.name, e.department)).update((name, department))
     |   // ...
     | }

The above “works”, but now we have 2^(number of fields) = 2^4 = 16 branches. When we add an extra field, we will have to add 16 more branches.

Clearly this does not scale, and we need something better.

First stab at a solution

If updates in Slick were first-class values, we could produce them independently of their application. We could put them in a list, compose them with each other, and so on.

The easiest way to model this would be a data type that represents a delayed .map(field).update(newValue) application. We can navigate to Slick’s codebase and copy over the arguments of .map and .update. Let’s build this data type, and also add an extension method to Query to apply this update.

object SlickExtensions {
  import slick.dbio.{DBIOAction, Effect}
  import slick.jdbc.{PositionedParameters, PositionedResult}
  import slick.lifted.{BaseColumnExtensionMethods, CanBeQueryCondition, FlatShapeLevel, OptionMapper2, Shape}
  import slick.sql.{FixedSqlAction, SqlAction}

  final case class Update[Record, Field, Value](field: Record => Field, value: Value)

  implicit class RichQuery[Record, U, C[_]](val underlying: Query[Record, U, C]) {
    def applyUpdate[Field, Value](
      update: Update[Record, Field, Value]
    )(
      implicit
      shape: Shape[_ <: FlatShapeLevel, Field, Value, Field]
    ): FixedSqlAction[Int, NoStream, Effect.Write] = {
      underlying.map(update.field).update(update.value)
    }
  }
}

Let’s test this out.

scala> runDB { EmployeesTable.filter(_.id === "x6").applyUpdate(Update(_.name, "Monkey")) }
res5: Int = 1
scala> runDB { EmployeesTable.filter(_.id === "x6").result }
res6: Seq[usage.EmployeesTable#TableElementType] = Vector(Employee(x6,Monkey,product,28,1000))

It works!

Now we need a way to compose these updates. Let’s add a combinator, .and, to Update to enable that.

final case class Update[Record, Field, Value](field: Record => Field, value: Value) {
  def and[Field2, Value2](that: Update[Record, Field2, Value2]): Update[Record, (Field, Field2), (Value, Value2)] = {
    Update(record => (this.field(record), that.field(record)), (this.value, that.value))
  }
}

If, say, we had updates u1, u2, and u3, then the expression u1 and u2 and u3 gives us a composed update that represents updates on three different fields. Applying this composed update should work as long as Slick can figure out the needed Shape.

scala> val update = Update[EmployeesTable, Rep[String], String](_.name, "Pintya").
| and(Update[EmployeesTable, Rep[String], String](_.department, "management")).
| and(Update[EmployeesTable, Rep[Int], Int](_.salary, 2000))
update: utils.slick.SlickExtensions.Update[usage.EmployeesTable,((utils.slick.DgPostgreSqlProfile.api.Rep[String], utils.slick.DgPostgreSqlProfile.api.Rep[String]), utils.slick.DgPostgreSqlProfile.api.Rep[Int]),((String, String), Int)] = Update(utils.slick.SlickExtensions$Update$$Lambda$7649/987431776@3f774e9b,((Pintya,management),2000))
scala> runDB { EmployeesTable.filter(_.id === "x6").applyUpdate(update) }
res1: Int = 1
scala> runDB { EmployeesTable.filter(_.id === "x6").result }
res2: Seq[usage.EmployeesTable#TableElementType] = Vector(Employee(x6,Pintya,management,28,2000))

Woot! This works too.

Let’s now move on to optional updates. To make this work, we will need a way of representing “no update”, a null update object, if you will. We could revise our Update into a sum type like this:

sealed trait Update[Record, Field, Value] extends Product with Serializable

object Update {
  final case class Perform[Record, Field, Value](field: Record => Field, value: Value) extends Update[Record, Field, Value]
  final case class Pass[Record, Field, Value]() extends Update[Record, Field, Value]
}

But you will notice that defining .and on this revised Update type proves impossible. What do you do when you have a Perform[R, F1, V1] on one side, and a Pass[R, F2, V2] on the other? You are obliged to produce an Update[R, (F1, F2), (V1, V2)], yet the Pass side contributes no field selector of type R => F2 and no value of type V2 to pair with.
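To make the dead end concrete, here is a sketch of the attempted .and on the sum type. The mixed Perform/Pass case has nothing to fill the holes with:

```scala
sealed trait Update[Record, Field, Value] extends Product with Serializable {
  def and[F2, V2](that: Update[Record, F2, V2]): Update[Record, (Field, F2), (Value, V2)] =
    (this, that) match {
      case (Update.Perform(f1, v1), Update.Perform(f2, v2)) =>
        // Both sides perform: pairing the selectors and values works fine.
        Update.Perform(r => (f1(r), f2(r)), (v1, v2))
      case (Update.Perform(f1, v1), Update.Pass()) =>
        // Stuck. We owe the caller an Update[Record, (Field, F2), (Value, V2)],
        // but Pass gives us no Record => F2 and no V2 to pair with.
        ???
      case _ => ???
    }
}

object Update {
  final case class Perform[Record, Field, Value](field: Record => Field, value: Value) extends Update[Record, Field, Value]
  final case class Pass[Record, Field, Value]() extends Update[Record, Field, Value]
}
```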

We got very far with this approach, but we cannot seem to get any further. How could we make this work?

Existential types to the rescue

If we go back to our original Update formulation, we can make one important observation: the Update type need not wear the Field and Value type parameters on its sleeve. All we care about in our Update values is that 1) they work against EmployeesTable, and 2) the types Field and Value are internally consistent, that is, .map and .update work together. As long as that happens, we do not care what those types specifically are.

Existential types are a mechanism to ensure such internal consistency.
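A minimal, Slick-free sketch of the idea: below, the element type A is hidden from the outside, yet field and show are guaranteed to agree on it, which is exactly the internal consistency we need. All names here are illustrative, not part of Slick.

```scala
sealed trait Formatter[Record] {
  type A                         // existential: invisible to callers
  def field: Record => A
  def show: A => String
  final def apply(r: Record): String = show(field(r))
}

object Formatter {
  // Smart constructor: A0 is statically known here, then hidden.
  def apply[Record, A0](f: Record => A0, s: A0 => String): Formatter[Record] =
    new Formatter[Record] {
      type A = A0
      def field: Record => A = f
      def show: A => String = s
    }
}

case class Person(name: String, age: Int)

// Different internal types, same external type — so they fit in one list:
val fs: List[Formatter[Person]] = List(
  Formatter[Person, String](_.name, identity),
  Formatter[Person, Int](_.age, n => s"$n years")
)

fs.map(_(Person("Ada", 36)))  // List("Ada", "36 years")
```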

Here is what a reformulation along these lines looks like.

sealed trait Update[Record] { self =>
  type Field
  type Value

  def field: Record => Field
  def newValue: Value
  def shape: Shape[_ <: FlatShapeLevel, Field, Value, Field]

  final def apply[U, C[_]](query: Query[Record, U, C]): FixedSqlAction[Int, NoStream, Effect.Write] = {
    query.map(field)(shape).update(newValue)
  }
}

object Update {
  def apply[Record, _Field, _Value](
    _field: Record => _Field,
    _newValue: _Value
  )(
    implicit
    _shape: Shape[_ <: FlatShapeLevel, _Field, _Value, _Field]
  ): Update[Record] = {
    new Update[Record] {
      type Field = _Field
      type Value = _Value
      def field: Record => Field = _field
      def newValue: Value = _newValue
      def shape: Shape[_ <: FlatShapeLevel, Field, Value, Field] = _shape
    }
  }
}

There is a lot happening here, so let’s unpack it slowly.

Just like before, this Update type is nothing but a delayed application of .map and .update. Only the record type Record shows up on the outside; every other type parameter has been made internal. Along with all the types, any values that might use them have moved inside too, including the implicit needed by .map. Since all of these things are now internal, we also have to move the application of the update inside. Hence Update#apply.

Update.apply, the smart constructor in the companion object, acts as a seam from the point where all the types are statically known to where some of them become existential.
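For instance, assuming the Setup is in scope, two updates on differently typed columns now share one outer type:

```scala
// Field/Value are inferred at construction, then hidden:
val u1: Update[EmployeesTable] = Update((_: EmployeesTable).name, "Monkey")
val u2: Update[EmployeesTable] = Update((_: EmployeesTable).salary, 1200)
// Internally Rep[String]/String vs Rep[Int]/Int — externally identical.
```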

Now this is how you can define .and on this type.

sealed trait Update[Record] { self =>
  type Field
  type Value

  def field: Record => Field
  def newValue: Value
  def shape: Shape[_ <: FlatShapeLevel, Field, Value, Field]

  final def apply[U, C[_]](query: Query[Record, U, C]): FixedSqlAction[Int, NoStream, Effect.Write] = {
    query.map(field)(shape).update(newValue)
  }

  final def and(another: Update[Record]): Update[Record] = {
    new Update[Record] {
      type Field = (self.Field, another.Field)
      type Value = (self.Value, another.Value)
      def field: Record => Field = record => (self.field(record), another.field(record))
      def newValue: Value = (self.newValue, another.newValue)
      def shape: Shape[_ <: FlatShapeLevel, Field, Value, Field] = {
        Shape.tuple2Shape(self.shape, another.shape)
      }
    }
  }
}

Since Update[Record] does not track the fields it updates in its type, we can even create a dynamic list of updates, that is, a List[Update[Record]]. This removes the need for a no-update case, as that is already captured by an empty list. Here is how we can rewrite our Query extensions.

object SlickExtensions {
  implicit class RichQuery[Record, U, C[_]](val underlying: Query[Record, U, C]) {
    def applyUpdate(update: Update[Record]): FixedSqlAction[Int, NoStream, Effect.Write] = {
      update.apply(underlying)
    }

    def applyUpdates(updates: List[Update[Record]])(implicit ec: ExecutionContext): DBIOAction[Int, NoStream, Effect.Write with Effect.Read] = {
      updates.reduceLeftOption(_ and _) match {
        case Some(composedUpdate) => underlying.applyUpdate(composedUpdate)
        case None                 => underlying.result.map(_ => 0)
      }
    }
  }
}

With this new formulation, we can finally write EmployeeRepository#update as follows.

final class EmployeeRepository {
  def update(
    id: String,
    name: Option[String],
    department: Option[String],
    age: Option[Int],
    salary: Option[Int],
  ): DBIO[Int] = {
    val updates: List[Update[EmployeesTable]] = List(
      name.map(value => Update((_: EmployeesTable).name, value)),
      department.map(value => Update((_: EmployeesTable).department, value)),
      age.map(value => Update((_: EmployeesTable).age, value)),
      salary.map(value => Update((_: EmployeesTable).salary, value))
    ).flatten

    EmployeesTable.filter(_.id === id).applyUpdates(updates)
  }
}
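A quick sanity check at the REPL might look like the following. No output is shown here, since the exact results depend on your data:

```scala
val repository = new EmployeeRepository

// Overwrites name and salary; leaves department and age untouched.
runDB { repository.update("x6", name = Some("Zoro"), department = None, age = None, salary = Some(3000)) }
runDB { EmployeesTable.filter(_.id === "x6").result }
```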

And there you have it: composable updates with Slick.

A note on using Option to model update inputs

In our Employee model, none of the fields were optional. So the meaning of Option[A] for updates was clear: set the value if given, Some[A], else leave it unaltered, None.

Imagine we had an optional field, say, pensionPlan: Option[PensionPlan]. In this case, the meaning of None becomes ambiguous. Does the caller want to set pensionPlan to None, or leave it unaltered? There is no way to know. Option is a bad fit for representing patches in general. Consider defining a custom sum type like this instead:

sealed trait Patch[+A] extends Product with Serializable

object Patch {
  final case class Set[+A](value: A) extends Patch[A]
  case object Keep extends Patch[Nothing]
  case object Delete extends Patch[Nothing]
}
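One way to read the three cases when applying a patch to a nullable column: the outer Option decides whether the column is touched at all, and the inner one is the nullable value to write. The helper below is illustrative, not part of the post's API; the Patch definition is repeated so the snippet stands alone.

```scala
sealed trait Patch[+A] extends Product with Serializable

object Patch {
  final case class Set[+A](value: A) extends Patch[A]
  case object Keep extends Patch[Nothing]
  case object Delete extends Patch[Nothing]
}

// Some(Some(a)) => write a; Some(None) => write NULL; None => skip the column.
def toWrite[A](p: Patch[A]): Option[Option[A]] = p match {
  case Patch.Set(a) => Some(Some(a))
  case Patch.Keep   => None
  case Patch.Delete => Some(None)
}
```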

Do not shy away from defining small sum types like these. Ambiguities can cost you big.

Exercise for the reader: Redefine Update so that value has type Patch, or similar, instead.

Book recommendation

It had been a while since I last used Slick. Richard Dallaway and Jonathan Ferguson’s book Essential Slick was a great help in getting me back up to speed. If you want to get the most out of Slick, I highly recommend reading it.

Thanks to Tom Wadeson and Amar Potghan for reviewing the blog post and for valuable feedback.