How to use statecharts
This page tries to describe some aspects of employing statecharts in your day-to-day coding routine.
Determine scope
When you first learn about statecharts, you might get the feeling that statecharts can be used to describe the entire behaviour of an application, all the way from which screens show as part of logging in, to the state of each checkbox and text field in every screen, all represented in one big statechart. That would be a statechart from hell, and an even bigger maintenance burden. Instead, the focus should be to get a grip on the behaviour at the component level, whatever a component might be. A single screen would be a component, for sure. A single text field that might have some particular internal behaviour (e.g. it changes color based on various flags like empty, required or invalid) might warrant that to be wrapped in a component with a statechart to describe its behaviour.
In general, structure your code as you did before, by dividing things up into components. Use statecharts to describe the behaviour of each component in isolation. Use events and so on to get communication between components going just like before; keep the statechart internal to the component itself. The user of the component shouldn’t need to know that there’s a statechart within it, other than by inspecting the code or guessing (since it behaves so well).
Starting from scratch or not
If you’re starting from scratch, and you envision the component you’re building to have any form of difference of behaviour after it’s instantiated (most do), then consider adding a simple statechart from the get-go, and start out with letting the statechart determine which behaviour is appropriate. Even if you don’t have the other behaviour available, it’s always easier if the component already has a simple (one-state?) statechart.
If you’re not starting from scratch, it means you have code that reacts to events already. Speaking generally, when your code receives an action, it usually checks some booleans or the value of some internal state (e.g. if a value is greater than some threshold) before performing some updates. It is this code that the introduction of statecharts aims to replace. Throughout your event handlers, those conditionals all make up the implicit state machine. After introducing statecharts, you should end up with more or less the same state machine, but an explicit one.
Distill the API between the component and the statechart
To start using a statechart, the tangled mess that might be your component and its different behaviours need to be disentangled: The what / how needs to be separated from the why. You should end up with event handlers that send the event to the statechart, and then does what the statechart says to do, typically by invoking “actionable” items on behalf of the statechart.
Before the introduction of a statechart, you will have code with the following structure:
- Something happens in the world
- The event handler checks some booleans
- Based on those booleans, you decide to do some real work
Those booleans end up being littered throughout various event handlers, for better or worse. It’s called the “Event-Action” paradigm of programming.
After introducing statecharts, you should have code with the following structure:
- Something happens in the world
- The event handler informs the statechart
- The statechart might transition to a new state
- Based on the new state, you decide to do some real work
The big difference is that the event handler now no longer makes any decisions on what to do, it allows the statechart to make those decisions on its behalf.
The communication between your component and the statechart therefore happens as follows:
- Your object tells the statechart about an event — something happening either outside or inside the component. e.g. a keystroke, a HTTP request response arrived.
- The statechart asks the world about some thing, known as a guard — e.g. is the text field empty, or is the HTTP response complete.
- The statechart tells your object to perform some action or control some activity — e.g. tell the field that it’s “invalid”, or start parsing the results, or stop a HTTP request.
- The statechart tells you which new state your component is in.
These are the four touchpoints between the statechart and the outside world (your component). Statecharts fit into an event driven system. It accepts events, and turns them into actions.
Designing a statechart
This is the biggest hurdle if you’re new to statecharts, mainly because it is often such a foreign concept. You need to decide on the notation, if you want the benefit of a graphical representation, and ultimately which tools you’ll choose.
Tools aside, the process of designing a component’s statechart is to start by discovering the modes of the component in question. Those are good candidates for top level states of your statechart. Next, you can start to think about what events that take you between those states. Remember, at the “top level” things don’t need to be 100% precise; this is what the substates are for.
Once you have some high level states and what events cause the behaviour to change, you can add this to the statechart, and your code will be better off for it. Later, we will show how you can move more things into the states.
Example
In order to explain this process, I’m going to try to walk you through the process that went into the design of the following component: A simple search form. It is modeled after the same UI that is presented in Robust React User Interfaces with Finite State Machines.
To repeat the requirements from that article:
- Show a search input and a search button that allows the user to search for photos
- When the search button is clicked, fetch photos with the search term from Flickr
- Display the search results in a grid of small sized photos
- When a photo is clicked/tapped, show the full size photo
- When a full-sized photo is clicked/tapped again, go back to the gallery view
Off the top of my head I can think of the following top level “modes” or “states” of this application:
- Initial — no search results are available
- Searching — when the search button was clicked
- Displaying results — when displaying results
- Zoomed in — when a photo is zoomed in on
If I put my statechart hat on, these can be thought of as “top level states”:
Note that this set of states might change, since we haven’t really understood what a “mode” is. The main thing to look for is a different behaviour, i.e. that the component in question reacts in a different way to events. For example, the search app should react differently to a click on a photo when displaying results vs when zoomed in on a photo.
From the requirements it’s also pretty easy to think up the transitions between those states too. Here’s the “happy path”:
- Initial → Searching: someone typed something and hit the Search button — I’ll call this the search event
- Searching → Displaying results: the HTTP request completed with some data, the UI can be populated with stuff—I’ll call this the results event.
- Displaying results → Zoomed in: The user clicked a photo and we now zoom in on a particular photo. I’ll call this the zoom event
- Zoomed in → Displaying results: The user clicked a zoomed in photo and we now zoom out back to the results. I can call this the zoom_out event.
Again, these can be drawn into the statechart’s “top level”:
Now, if you’re an experienced statechart designer, you can probably already see one big shortcoming of this statechart. Luckily, with a diagram, they are extremely easy to discover: There is no way to get from the “results” state back into the searching state. There are no direct arrows, and there is no path to get there. (It was also not explicitly stated in the requirements document!) I’m going to ignore this problem for now, because I want to show (later) how you can fix such problems _purely_ by making changes to the state machine. So, if you spotted this by yourself, pat yourself on the back now.
Initial implementation
At this point we have enough stuff to work on to be able to get an initial implementation running too, just to get the happy path running. We can then check them off the list of “problems” that typically plague a quick implementation.
I will show the statecharts in both SCXML and XState flavours. Both will give us nice diagrams and they’re both executable statecharts, meaning I don’t have to do any manual translation from this representation to code.
First off, the top level states:
SCXML:
<scxml>
<state id="initial">
</state>
<state id="searching">
</state>
<state id="displaying_results">
</state>
<state id="zoomed_in">
</state>
</scxml>
XState:
{
"initial": "initial",
"states": {
"initial": {},
"searching": {},
"displaying_results": {},
"zoomed_in": {}
}
}
Let’s add the transitions
SCXML:
<scxml>
<state id="initial">
<transition event="search" target="searching"/>
</state>
<state id="searching">
<transition event="results" target="displaying_results"/>
</state>
<state id="displaying_results">
<transition event="zoom" target="zoomed_in"/>
</state>
<state id="zoomed_in">
<transition event="zoom_out" target="displaying_results"/>
</state>
</scxml>
XState:
{
"initial": "initial",
"states": {
"initial": {
"on": {
"search": "searching"
}
},
"searching": {
"on": {
"results": "displaying_results"
}
},
"displaying_results": {
"on": {
"zoom": "zoomed_in"
}
},
"zoomed_in": {
"on": {
"zoom_out": "displaying_results"
}
}
}
}
API Surface
If we look at our API surface—the set of events, guards and actions that we have—we can start to compile a list of things that our UI needs to provide:
- Events:
search
,results
,zoom
, andzoom_out
- States:
initial
,searching
,showing_results
andzoomed_in
Absence of data transfer
Note the absence of any data being passed back and forth: The events themselves are pretty anonymous; this is about high level things that happen in the UI.
This absence of data transfer also means that the component still needs to keep track of the “business state” — the variables and stuff that the component is busy working on. The statechart doesn’t need or care about those things, it is only concerned with triggering the right actions at the right times.
Events
The first part of the API is actually sending events from the world and to the state machine. This is also somewhat independent of the actual state machine library, as most of them take an event name in the form of a string.
To recap the events were: search
, results
, zoom
, and zoom_out
— these are all things that “happen” in the real world, that need to tell the statechart about
We want something that’s similar to this:
searchButton.onclick = statemachine.send("search");
httpRequest.then(() => statemachine.send("results"));
resultspanel.onclick = statemachine.send("zoom_in");
zoomedphoto.onclick = statemachine.send("zoom_out");
Essentially the buttons, HTTP requests and other things that generate events don’t need to know what’s going to happen; they shouldn’t. They should just blindly send the information to the state machine, and let the state machine figure out what to do.
States
Finally, the component is allowed to see which state is the “current” state. This of course depends on the statechart library, but most will happily tell you the name of the current state, typically as a string:
statemachine.currentState.value; // "searching"
Implementation
We’ve now explained all of the parts necessary to get our UI and the component talking together.
Decouple the component from the statechart?
At this point in time I think it’s very useful to point out dependency that might come creeping. The component in question easily becomes dependent on the states in the statechart. A decision has to be made, or else it will be made for you.
The statechart invariably starts out as a reflection of the modes that the component has, e.g. enabled, disabled, loading and so on. It is therefore common to use the “current state” of the statechart to reflect it in the component somewhere. For a HTML based component, this might translate into a top level CSS element, like state-enabled
and state-loading
. For a React app, it might be that your app renders different things based on the top level state. This is completely natural, and introduces an implied coupling between the statechart and the component.
This coupling may or may not be beneficial, depending on how you end up using the statechart, but you should be aware of the coupling and the problems it introduces.
Decoupled | Coupled |
---|---|
The component doesn’t know which state it’s in | The component knows which state it’s in |
The component is explicitly told when to change its mode, because the statechart says when entering this state, enter this mode | The component changes its mode automatically: whenever the statechart has handled an event, the component asks the statechart which state it’s in and uses that |
The component is explicitly told when to do stuff, because the statechart says when entering this state, do this | The component does things based on the “current state”: Whenever the statechart has handled an event, the component asks the statechart which state it’s in and executes various functions |
- If you decide to keep them decoupled, it comes at the increased cost of having to define additional actions—an increased “API surface” if you will. Additionally the statechart needs to have entry handlers (and possibly exit handlers) to turn on (and off) modes in the component. The statechart needs to be able to control explicitly what the component should be doing at any time.
- If you decide to keep them coupled, it comes at the cost of being unable to make changes to the statechart itself. Introducing a new state to make a behavioural change can no longer be done purely on the statechart side, because this new state might affect the component when it should not, or it might not affect it when it should. Often a change in the statechart has to be done along with a change in the component.
Decoupled statecharts FTW
When starting out with statecharts, it’s often easiest to get going with coupled statecharts, since it’s one less thing to learn.
For long term durability and maintainability of statecharts, it is probably best to go decoupled: remove the direct dependency between the statechart and the component. If you choose to do this, there are some notable things that the component should not be worrying about, such as:
- Which state is the statechart in? — It really doesn’t matter. What matters are the actions that are called.
- Which transition just fired? — This too doesn’t matter.
The things that matter in a decoupled statechart are: events, guards and actions.
Coding up the API
The choice of UI framework should really not be very important, since what we’re trying to describe is what to do when certain events happen. We aim to replace code that litter the UI code to hide and show components based on the “state” of various variables, and replace them with actions that control the behaviour of the UI component.
Actions
First off, let’s take a look at the actions; the desired output of the state machine. We need a component that can react to what the state machine tells it to do.
To recap, the actions that the state machine can perform are: startHttpRequest
, cancelHttpRequest
, showResults
, zoomIn
, and zoomOut
A simple performAction
function can do nicely:
function performAction(event) {
if (event === "startHttpRequest") {
// get value of text field
// construct URL
// send HTTP request
}
if (event === "cancelHttpRequest") {
// cancel HTTP request (if it's still active)
}
if (event === "showResults") {
}
if (event === "zoomIn") {
}
if (event === "zoomOut") {
}
}
Forgive the naive if handler, as I opted for something readable. I’ll leave it to the readers to figure out ways of improving this code.