Autotune (Bandits)

Autotune and Autotune AI are Multi-Armed Bandit solutions that automatically find the best variant among a group of candidates, while dynamically allocating traffic to optimize for a single target metric.

Autotune, the Multi-Armed Bandit solution, allocates traffic towards high-performing variants and can eventually identify a winning variant.

How Autotune works

Autotune is Statsig's Bayesian Multi-Armed Bandit. It tests different variations and measures their effect on a target outcome. The bandit continuously shifts traffic towards the best-performing variations until it can confidently pick a winner, which then receives 100% of traffic.

Bandits seek to balance the "explore"/"exploit" tradeoff: "exploiting" the current best-known solution versus "exploring" to gather more information about the other solutions.
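To make the explore/exploit tradeoff concrete, here is a minimal sketch of Thompson sampling, one common Bayesian bandit technique. It is an illustration only: this page does not say which algorithm Autotune uses internally, and the names `BetaArm`, `choose_arm`, and `simulate` are hypothetical. Each variant keeps a Beta posterior over its conversion rate. Uncertain variants sometimes produce high draws (exploration), while variants with strong evidence win most draws (exploitation).

```python
import random


class BetaArm:
    """One variant, with a Beta posterior over its (unknown) conversion rate."""

    def __init__(self, name):
        self.name = name
        self.successes = 0
        self.failures = 0

    def sample(self, rng):
        # Draw a plausible conversion rate from Beta(successes + 1, failures + 1),
        # i.e. a uniform prior updated with the outcomes observed so far.
        return rng.betavariate(self.successes + 1, self.failures + 1)

    def update(self, converted):
        if converted:
            self.successes += 1
        else:
            self.failures += 1


def choose_arm(arms, rng):
    # Thompson sampling: sample one rate per arm and serve the arm with the highest draw.
    return max(arms, key=lambda arm: arm.sample(rng))


def simulate(true_rates, n_users, seed=0):
    """Simulate traffic allocation; true_rates is assumed data for the demo only."""
    rng = random.Random(seed)
    arms = [BetaArm(name) for name in true_rates]
    pulls = {name: 0 for name in true_rates}
    for _ in range(n_users):
        arm = choose_arm(arms, rng)
        pulls[arm.name] += 1
        arm.update(rng.random() < true_rates[arm.name])
    return pulls


if __name__ == "__main__":
    print(simulate({"A": 0.05, "B": 0.10, "C": 0.30}, n_users=5000))
```

In a run like this, most of the traffic should end up on the variant with the highest true rate, while the weaker variants still receive a small share early on.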

Our blog posts on Multi-Armed Bandits and Contextual Bandits go into depth on use cases and considerations.

	A/B/n Test	Multi-Armed Bandit (Autotune)	Contextual Bandit (Autotune AI)	Ranking Engine
Typical # Variants	2-3	4-8	4-8	Arbitrary #
Personalization Factor	None	None	Moderate	High
Input Data Required	None	Very Little (100+ samples)	Little (1000+ samples)	Tens of thousands to millions of samples
Model Efficacy	None	Basic	Moderate	High
Identifies Best Variant	Yes	Yes	No	No
Consistent User Assignment	Yes	No	No	No

Implementing Autotune

Implementing an Autotune is as simple as checking an experiment in Statsig. On client SDKs after initialization, and on server SDKs, the check has sub-millisecond latency.

Each Autotune variant has an associated JSON config, which the SDK returns. You can use it to modify elements of your webpage (e.g. an image URL or button color), or simply to identify the variant so that you know which code path to run.
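As a rough illustration of consuming a variant's config, the sketch below treats the config as a plain dictionary. The keys `button_color` and `image_url`, the `DEFAULTS` values, and the `render_hero` function are all hypothetical, and the SDK call that would fetch the config is deliberately left out; consult the SDK documentation for the actual API.

```python
# Hypothetical fallback values used when a variant does not set a key.
DEFAULTS = {"button_color": "blue", "image_url": "/img/default.png"}


def render_hero(variant_config):
    """Build a snippet of HTML from a variant's JSON config (passed in as a dict)."""
    # Variant values override the defaults; missing keys fall back safely.
    settings = {**DEFAULTS, **variant_config}
    return (
        '<img src="{image_url}">'
        '<button style="background:{button_color}">Sign up</button>'
    ).format(**settings)


if __name__ == "__main__":
    # Stand-in for the config the SDK would return for the assigned variant.
    print(render_hero({"button_color": "green"}))
```

Falling back to defaults keeps the page rendering sensibly if a variant's config omits a key.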

When to use Autotune

Autotune has two major differences from A/B testing (Statsig Experiments):

The traffic split isn't fixed over the duration of the test. This allows Autotune to divert more traffic to the winner and less to the losers, so fewer users see an inferior variant. However, this means the user experience may not be consistent across repeated visits.
Autotune can only optimize for a single metric. Autotune can't accurately measure a collection of metrics, and isn't a great way to understand secondary effects of your changes. Because of this, it works best when the metric is well-understood and has a direct and immediate relationship to the change being tested.

Because of these differences, Statsig recommends Autotune in the following scenarios:

The cost of exposing users to a losing treatment is high. For example, sending new users to a landing page that is inferior may result in lost revenue or churn. While this may be a one-time loss, testing two user registration flows may result in users that never sign up. In this case, Autotune avoids permanently losing users since it can quickly adapt to feedback unlike a static A/B test.
You want the decision to be automated. Because Autotune automatically selects the winner, it requires no human decision-making. This is great for launching dozens of simultaneous tests, or for running a long-term unmonitored test.
When it's okay for users to be exposed to different experiences upon return visits. For example, changing text or recommendation algorithms.
When you have one simple metric to optimize for (e.g. click-through rate) that is an immediate effect of the test.
When you want to test multiple variations. Autotune can quickly rule out really poor performers while focusing traffic on the best variants.

Autotune should be avoided in the following scenarios:

When you have a complex ecosystem and want to understand secondary effects, tradeoffs between variants, and user behavior.
When you are optimizing for complex metrics or delayed effects.

For these cases, we recommend A/B testing with Experiments. In general, it is also a best practice to run Autotune within an experiment in which a small group of users does not receive the Autotune, so that you can measure the impact of the Autotune itself.
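The holdout idea can be sketched as follows. This is an illustrative example under stated assumptions, not how Statsig assigns users: the hash-based `in_holdout` function, the salt, the 5% holdout size, and `relative_lift` are all hypothetical. In practice you would configure the holdout as an experiment in Statsig.

```python
import hashlib


def in_holdout(user_id, salt="autotune_holdout", holdout_pct=5):
    """Deterministically place roughly holdout_pct% of users in a group that skips the Autotune."""
    digest = hashlib.sha256(f"{salt}:{user_id}".encode()).hexdigest()
    bucket = int(digest[:8], 16) % 100  # stable bucket in [0, 100)
    return bucket < holdout_pct


def relative_lift(autotune_conversions, autotune_users, holdout_conversions, holdout_users):
    """Relative change in conversion rate for Autotune users versus the holdout."""
    autotune_rate = autotune_conversions / autotune_users
    holdout_rate = holdout_conversions / holdout_users
    return (autotune_rate - holdout_rate) / holdout_rate


if __name__ == "__main__":
    share = sum(in_holdout(f"user-{i}") for i in range(10_000)) / 10_000
    print(f"holdout share: {share:.3f}")
    # Made-up counts, for illustration only.
    print(f"lift: {relative_lift(120, 1000, 10, 100):.2f}")
```

Because holdout users keep a fixed experience, comparing their metric with the Autotune group's gives an estimate of what the Autotune contributed overall.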
