New database tooling & migrations

I've been looking into edgedb, prisma, neon & planetscale out of curiosity for new database tooling. Most of them have a migration story, from prisma's migration and ORM generators to planetscale's migration branches and live schema changes. It got me wondering what are my real pain points with migrations and what would I look for in new tooling.

Here is a short list of issues I've experienced and some possible solutions provided by these tools.

Migration generation tools are not language agnostic.

Issue

At my previous job, we used Alembic which is a tool that will auto-generate migrations based on SQLchemy models. It worked fairly well for simple cases, but if I move to a new stack I'll need to learn a new tool. It would be great if there was a language agnostic migration generation tool we could all rally around.

Solution

I think having a dedicated schema language and a tool which generates migrations based on changes to that schema manifest is a good solution. Both Prisma and edgedb have taken this approach.

Having a separate manifest for your data models has other tangential benefits too like:

Automatically generated type declarations for your data.
Automatically generated ORMs. (I find ORMs useful in unit tests and simple cases but generally prefer SQL queries for most business logic.)
Simplifies some of the complexities around declaring sql schema. For instance creating relationships.
Generated migrations should encourage best practices think using CONCURRENTLY for building indexes.
Creates a single source of truth for data models without needing access to the database.

Migration generation tools don't handle complex cases.

Issue

Auto-generated migration tools like Alembic only cover the naive approach. Which means you'll usually have to manually write migrations and run a drawn out multi-stage migration or briefly take down your service while you lock a table.

For instance if you wanted to change a column type safely. You would have to follow these five steps each time.

Add the new column.
Add code to write to the new & old column.
Backfill the new column.
Remove code writing to the old column.
Drop the old column.

In this case we have clearly traded speed for uptime. Couldn't we have both?

Solution

Online schema changes can solve this, they work by creating a copy of the table and streaming new updates & inserts via triggers or replication stream to the new table, once the copy is complete it replaces the old table name with the newly copied one. This is a complex process as described by Planetscale here. Planetscale currently support this but there are also lots of other single purpose tools github's gh-ost and facebook's OSC.

A database for every PR is hard.

Issue

Vercel popularized the notion of per branch instances of your application. These preview instances allow PR reviewers and the rest of your org test features before merging and deploying to shared environments. However generally the database is an issue with these setups. Most times I've seen these preview environments point at a single database which doesn't work if you have different branches with differing schema.

Solution

The simplest solution is to run a new database for every branch, but this can be expensive and slow. Products like neon have made this simple. Because they have isolated the storage and the compute of postgres they can provide features like branching. Where you could spawn a branch with "production" data and run your migrations and previews. I imagine sqlite databases like cloud flares D1 database would also enable these sorts of use cases.

Predictions

Database's have a huge problem space to tackle, I think that's why we are seeing a trend where new database are being built around existing databases like postgres, mysql & sqlite rather than being built from scratch. It allows for new features while still offering an escape hatch when things are unsupported.

I'd bet that in the future there will be a standardised "schema" language to define data models. These new databases will accept these declarative schema files and migrate their schema and data to the new desired state, therefore making migrations declarative and mostly invisible to the application developer.