Fencing Tokens in Node.js
18 Feb 2017 | node-js distributed-systems

In my experience, the average Node.js developer tends to be less concerned about thread safety issues and race conditions than the average developer who uses more traditional languages such as Java, C# or C++. Perhaps the reason is that Node.js code is executed in a single-threaded environment, so the risk of multiple threads mutating the same shared memory at once is avoided. However, this does not mean that we're entirely free from race conditions. For example, our database can be thought of as shared state in which race conditions may manifest.
A good start would be to recap the definition of a race condition. According to Wikipedia:
> A race condition [in a software system] is the behavior where the output of the code becomes dependent on the sequence or timing of other uncontrollable events.
To make this more concrete, let’s consider a simple example. Assume we write a service to manage an inventory of books. For simplicity, each book only holds a title and an integer price. Our service should allow clients to update the price of a given book.
Even in this minimalist scenario, we may encounter a race condition if two requests to update the price of the same book arrive simultaneously. In this post we'll describe a method called a Fencing Token that aims to prevent such race conditions. To set expectations right, by preventing the race condition I mean that it will be possible to predict the final price of the book in the face of concurrent updates.
Our solution will use two common Node.js libraries:
- knex.js - a flexible SQL query builder
- bookshelf.js - an ORM (object-relational mapper) built on knex.js. It provides goodies such as schema definition using migrations and lifecycle callbacks
This post will be quite code-heavy, and most of the code is presented here. You can find the full version on GitHub.
Simultaneous updates
We'll start by describing the main book model, using a knex.js migration file:
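A minimal sketch of such a migration might look like this (the file name and exact column definitions are assumptions):

```js
// migrations/20170218_create_books.js -- a sketch; names are assumptions
exports.up = (knex) =>
  knex.schema.createTable('books', (table) => {
    table.increments('id');               // auto-incrementing primary key
    table.string('title').notNullable();  // the book's title
    table.integer('price').notNullable(); // the book's price
  });

exports.down = (knex) => knex.schema.dropTable('books');
```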
And here’s our minimal service that allows users to update a book’s price.
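A sketch of what that service could look like, assuming a plain bookshelf `Book` model mapped to the books table (the module layout and names are assumptions):

```js
// bookService.js -- a minimal sketch of the service; names are assumptions
const Book = require('./models/book'); // a plain bookshelf model for the books table

class BookService {
  // Updates the price of the book with the given id.
  updatePrice(bookId, newPrice) {
    return Book.forge({ id: bookId }).save({ price: newPrice }, { patch: true });
  }
}

module.exports = new BookService();
```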
The following code illustrates two simultaneous updates to the same book. In this case, it's not deterministic what the final price would be; it could be either 20.0 or 30.0, depending on various timings that are out of our control.
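For instance, something along these lines (the book id of 1 is an assumption):

```js
// Two clients update the same book (id 1 is an assumption) "at the same time".
const bookService = require('./bookService');

Promise.all([
  bookService.updatePrice(1, 20.0),
  bookService.updatePrice(1, 30.0),
]).then(() => {
  // The final price is either 20.0 or 30.0 -- we cannot tell in advance.
});
```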
Our aim, therefore, is to be able to determine the final book price in the face of simultaneous updates.
Fencing Tokens
A fencing token is simply a positive number that only ever increases. We maintain a separate token for each model. Before attempting to update a model instance, we generate a new token (read: increment the token) and include it in the update attributes. Before applying an update request, we make sure the provided token is not lower than the latest token; otherwise, the update request is discarded.
We start by creating a new table to hold the fencing tokens. We’ll use one row for each model. Here’s the migration file:
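A sketch of what that migration might look like (table and column names are assumptions):

```js
// migrations/20170218_create_fencing_tokens.js -- a sketch; names are assumptions
exports.up = (knex) =>
  knex.schema.createTable('fencing_tokens', (table) => {
    // One row per model (e.g. 'books'). Making `model` the primary key gives us
    // the uniqueness that the ON CONFLICT clause below relies on.
    table.string('model').primary();
    table.integer('token').notNullable().defaultTo(0);
  });

exports.down = (knex) => knex.schema.dropTable('fencing_tokens');
```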
Next, the `FencingToken` model:
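A sketch of the model, assuming a configured bookshelf instance is exported from a local `./bookshelf` module (module paths and method names are assumptions):

```js
// models/fencingToken.js -- a sketch, assuming a configured bookshelf instance
// is exported from ../bookshelf; module paths and method names are assumptions.
const bookshelf = require('../bookshelf');
const knex = bookshelf.knex;

const FencingToken = bookshelf.Model.extend({
  tableName: 'fencing_tokens',
}, {
  // Inserts the first token row for a model unless it already exists, using
  // PostgreSQL's ON CONFLICT (a.k.a. UPSERT) clause.
  initialize(model) {
    return knex.raw(
      'insert into fencing_tokens (model, token) values (?, 0) on conflict (model) do nothing',
      [model]
    );
  },

  // Atomically increments the token of a model and resolves with the new value.
  generate(model) {
    return knex('fencing_tokens')
      .where('model', model)
      .increment('token', 1)
      .returning('token')
      .then((rows) => {
        // Depending on the knex version, returning() yields plain values or
        // row objects; normalize to a number either way.
        const row = rows[0];
        return typeof row === 'object' ? row.token : row;
      });
  },
});

module.exports = FencingToken;
```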
Two points are worth mentioning in the above code:
- We used knex.js' `increment` command to produce a query that increments the token value atomically (as opposed to first querying the current value and then updating it in a separate query, which is not atomic and should not be used). The generated query looks roughly like the snippet shown right after this list.
- The `initialize` method is used to insert the first token for a model unless it already exists. To do that, we used PostgreSQL's relatively new `ON CONFLICT` clause (also known as `UPSERT`).
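Assuming the table and column names used in the sketches above, the query knex generates for the atomic increment looks roughly like this (PostgreSQL dialect):

```sql
update "fencing_tokens"
set "token" = "token" + 1
where "model" = 'books'
returning "token"
```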
Now, let's look at the `Book` model:
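A sketch of such a `Book` model, with the fencing check hooked into bookshelf's `saving` event (module paths and the error message are assumptions):

```js
// models/book.js -- a sketch of the Book model; the fencing check is wired in
// through bookshelf's 'saving' event. Module paths and messages are assumptions.
const bookshelf = require('../bookshelf');
const FencingToken = require('./fencingToken');

const Book = bookshelf.Model.extend({
  tableName: 'books',

  initialize() {
    // Run the fencing check before every save (insert or update).
    this.on('saving', (model) => model.verifyFencingToken());
  },

  // Discards the update if the provided token is lower than the latest one.
  verifyFencingToken() {
    const token = this.get('token');
    if (token === undefined) {
      return; // no token provided (e.g. when creating the book) -- skip the check
    }
    // Make sure we never try to write the token into the books table itself.
    delete this.attributes.token;
    return FencingToken.forge({ model: 'books' }).fetch().then((latest) => {
      if (token < latest.get('token')) {
        throw new Error(`Stale fencing token: ${token} < ${latest.get('token')}`);
      }
    });
  },
});

module.exports = Book;
```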
Notice the `verifyFencingToken` method that runs before updating a book. It checks the provided fencing token value against the latest fencing token value in the database: if the provided value is lower, we discard the update by throwing an exception. `delete this.attributes.token` is used so we don't actually attempt to write the token value into the books table.
Lastly, our modified `BookService`, which handles updating a book's price:
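A sketch of the modified service (the module paths follow the earlier sketches and are assumptions):

```js
// bookService.js -- a sketch of the token-aware service; names are assumptions.
const Book = require('./models/book');
const FencingToken = require('./models/fencingToken');

class BookService {
  // Generates a fresh fencing token, then includes it in the save arguments.
  updatePrice(bookId, newPrice) {
    return FencingToken.generate('books').then((token) =>
      Book.forge({ id: bookId })
        .save({ price: newPrice, token })
        .then(() => token)
    );
  }
}

module.exports = new BookService();
```

Resolving with the generated token is just a convenience, so that the test sketched below can record which price was sent with which token.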
Notice how we first generate a new token before including it in the `save` arguments.
These are all the building blocks we need. We are now ready to write a small stress test:
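A rough sketch of such a test (the number of clients, the delay range and the book details are arbitrary assumptions):

```js
// stressTest.js -- a rough sketch; the number of clients, the delay range and
// the book details are arbitrary assumptions.
const Book = require('./models/book');
const FencingToken = require('./models/fencingToken');
const bookService = require('./bookService');

const CLIENTS = 1000;
const randomDelay = () =>
  new Promise((resolve) => setTimeout(resolve, Math.floor(Math.random() * 100)));

FencingToken.initialize('books')
  .then(() => Book.forge({ title: 'SICP', price: 1 }).save())
  .then((book) => {
    const applied = []; // updates that were not discarded, as { token, price }
    const clients = [];
    for (let price = 1; price <= CLIENTS; price++) {
      clients.push(
        randomDelay()
          .then(() => bookService.updatePrice(book.id, price))
          .then((token) => applied.push({ token, price }))
          .catch(() => { /* stale token -- this update was discarded */ })
      );
    }
    return Promise.all(clients)
      .then(() => Book.forge({ id: book.id }).fetch())
      .then((updated) => {
        // The surviving price should be the one that carried the highest token.
        const winner = applied.reduce((a, b) => (a.token > b.token ? a : b));
        console.log('final price:', updated.get('price'), 'expected:', winner.price);
      });
  });
```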
In this test we create one book and 1,000 clients. Each client is delayed for some time and then tries to update the book's price. What this test demonstrates is that we can anticipate the final price of the book by identifying the update that carries the maximum token value.