Will Wilson
CEO

How Antithesis finds bugs (with help from the Super Mario Bros.)

Snouty navigates a tricky level of his own.

A year and a half ago, our friends at TigerBeetle invited me to give a sneak preview of Antithesis at Systems Distributed. This talk was recorded, but never published (since we were in stealth at the time).

We’re publishing it now because it answers one of the most common questions we get: how exactly does Antithesis explore the state spaces of complex systems, and how does it find bugs so quickly? It also explains something Alex alluded to in his post: our deterministic hypervisor isn’t just about getting perfect reproducibility of the bugs we find, it also helps us find the bugs in the first place.

And it explains all of this using everybody’s favorite Italian plumber. Check it out:

As we announced before, our platform has been playing and beating Nintendo games for many years. Hopefully you now have some idea of why we use these programs for our bug-finding research. Far from being simple toy problems, they’re actually strictly harder to explore than “real” software in several important ways.

In the talk above I skipped to the end of the Super Mario Brothers solution that our platform autonomously discovered, but we’ve embedded the full video below (sped up so you can enjoy the whole thing). Antithesis can find this solution in about forty-five minutes on a 2018-era workstation CPU. The only hints our platform received were the locations in memory that hold Mario’s X and Y coordinates in a level, and which level he’s on.

Along the way, we can draw heatmaps of where Antithesis spends most of its time in the game. Since clipping through a wall is a relatively rare event, these heatmaps end up visibly reproducing some very recognizable level outlines:

The heatmap of our fuzzer's path through Mario

And here we are (sped up) playing a Kaizo ROM hack with the exact same configuration – zero tweaking, zero hyperparameter tuning, and zero iteration on the platform. This happened literally the first time we turned the system on with the new levels. Notice that we actually get stuck at the end, but we do play the game pretty well (and we find a bug!).

Finally, the answer to the question I’m sure you’ve all been wondering about this whole time! Does Antithesis make it to the Minus World? You better believe it does:

Super Mario Bros. is a simple game, but its state space is inconceivably vast. As far as we know, we're the first autonomous system to explore that state space efficiently enough to beat the game (albeit with one or two hints).

That same capability for efficient state space exploration is what enables us to find bugs and deliver value to our customers. If you're interested in learning whether Antithesis can improve your software quality or development velocity, contact us!