Skip to main content

Winsorization

Winsorization

Winsorization is a common technique for removing noise in experiment results, specifically from outliers.

Winsorization refers to the practice of measuring the percentile Px of a metric and setting all values over Px to Px.

Statsig computes the Px value using all non-zero and non-null values of the metric.

Statsig Cloud

At Statsig, the default percentile for winsorization is 99.9%. This reduces the influence of extreme outliers caused by factors such as logging errors or bad actors.

Winsorization is applied to to sum and event count metrics, including imported metrics. Winsorization will not be applied to Mean, Ratio, Funnel, Participation, or User Accounting metrics.


Warehouse Native

Statsig Warehouse Native lets you configure this per metric - and choose explicitly the upper and/or lower bounds to apply. image

Winsorization is applied to to sum and event count metrics. Winsorization will not be applied to Mean, Ratio, Funnel or Participation.

Metric Capping

This is a very simple, but effective technique to handle outliers. With this capability, you can define max values for a metric for whatever unit type(s) are configured for this metric. Any value surpassing the set cap will automatically be adjusted downward to match it.

For instance, if you determine that purchases greater than $10,000 per day on your E-commerce platform should not skew analysis, any transaction exceeding this threshold will be adjusted downward to this limit, ensuring the integrity of your experiment analysis. Capped metrics are available for Event Count and Aggregation (sum) metric types.