Two-Sample Tests
Two-Sided z-Test
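The z-statistic (a.k.a. z-score) of a two-sample z-test can be written in several equivalent forms:

$$Z = \frac{\bar{X}_t - \bar{X}_c}{\sqrt{\mathrm{var}(\Delta\bar{X})}} = \frac{\bar{X}_t - \bar{X}_c}{\sqrt{\mathrm{var}(\bar{X}_t) + \mathrm{var}(\bar{X}_c)}} = \frac{\bar{X}_t - \bar{X}_c}{\sqrt{\sigma_{\bar{X}_t}^2 + \sigma_{\bar{X}_c}^2}}$$

where the subscripts $t$ and $c$ denote the treatment and control groups, and:
- $Z$ is the observed z-statistic (not the z-critical value)
- $\mathrm{var}(\Delta\bar{X})$ is the variance of the absolute delta of means
- $\mathrm{var}(\bar{X})$ is the variance of the sample mean of either the control or treatment group (details here)
- $\sigma_{\bar{X}}$ is the standard error of the mean of either the control or treatment group (these are the terms you can find in Pulse under the Statistics tab of a metric)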
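As a rough illustration of the z-statistic described above, the sketch below computes it from the two group means and their standard errors of the mean. The function name `z_statistic`, its parameters, and the input numbers are hypothetical and chosen only for this example; it assumes the variance of the delta is the sum of the two squared standard errors.

```python
import math


def z_statistic(mean_t, mean_c, se_t, se_c):
    """Two-sample z-statistic from group means and standard errors of the mean.

    Hypothetical helper for illustration; not part of any product API.
    """
    # Variance of the delta of means: sum of the variances of the two
    # sample means, i.e. the squared standard errors.
    var_delta = se_t ** 2 + se_c ** 2
    return (mean_t - mean_c) / math.sqrt(var_delta)


# Made-up inputs: treatment mean 10.5, control mean 10.0,
# standard errors 0.3 and 0.4, so sqrt(0.09 + 0.16) = 0.5.
z = z_statistic(mean_t=10.5, mean_c=10.0, se_t=0.3, se_c=0.4)
print(round(z, 6))  # approximately 1.0
```

Because the sign of the delta is preserved, swapping the treatment and control arguments flips the sign of the result.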
Welch’s t-test
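For smaller sample sizes, Welch’s t-test is the preferred statistical test because it yields lower false positive rates when the groups have unequal sizes and variances. In Pulse, Welch’s t-test is automatically applied when the degrees of freedom $\nu$ fall below a fixed threshold. We compute the t-statistic (a.k.a. t-score) identically to the two-sample z-statistic above. Additionally, we compute the degrees of freedom using:

$$\nu = \frac{\left(\mathrm{var}(\bar{X}_t) + \mathrm{var}(\bar{X}_c)\right)^2}{\dfrac{\mathrm{var}(\bar{X}_t)^2}{N_t - 1} + \dfrac{\mathrm{var}(\bar{X}_c)^2}{N_c - 1}}$$

where $\mathrm{var}(\bar{X}_t)$ and $\mathrm{var}(\bar{X}_c)$ are the variances of the treatment and control sample means (the squared standard errors), and $N_t$ and $N_c$ are the corresponding sample sizes. The p-value is then obtained from the t-distribution with $\nu$ degrees of freedom.
One-Sided z-Test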
The one-sided z-test computes the z-statistic in the same way as the two-sided test above. The one-sided p-value is also obtained from the standard normal cumulative distribution function $\Phi$, but the tail used depends on the direction being tested:

$$p = \begin{cases} 1 - \Phi(Z) & \text{when testing for an increase} \\ \Phi(Z) & \text{when testing for a decrease} \end{cases}$$

where:
- $Z$ is the z-statistic computed above in the two-sided test. Note that this uses the signed z-statistic, not the absolute value of the z-statistic as in the two-sided p-value.
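To make the p-value and degrees-of-freedom steps in this section concrete, here is a minimal sketch using only the Python standard library. It assumes the usual conventions: the upper tail of the standard normal when testing for an increase, the lower tail when testing for a decrease, twice the upper tail of the absolute z-statistic for the two-sided case, and the standard Welch-Satterthwaite formula for the degrees of freedom. All function and parameter names are hypothetical. A t-distribution p-value is left out because the standard library has no t-distribution CDF; a statistics library would be needed for that step.

```python
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal: mean 0, standard deviation 1


def one_sided_p_value(z, direction="increase"):
    """One-sided p-value from the signed z-statistic (assumed convention)."""
    phi = _STD_NORMAL.cdf(z)
    if direction == "increase":
        return 1.0 - phi  # upper tail
    if direction == "decrease":
        return phi  # lower tail
    raise ValueError("direction must be 'increase' or 'decrease'")


def two_sided_p_value(z):
    """Two-sided p-value, which uses the absolute value of the z-statistic."""
    return 2.0 * (1.0 - _STD_NORMAL.cdf(abs(z)))


def welch_degrees_of_freedom(se_t, se_c, n_t, n_c):
    """Welch-Satterthwaite degrees of freedom from standard errors and sizes."""
    var_t, var_c = se_t ** 2, se_c ** 2  # variances of the sample means
    numerator = (var_t + var_c) ** 2
    denominator = var_t ** 2 / (n_t - 1) + var_c ** 2 / (n_c - 1)
    return numerator / denominator


# Made-up inputs for illustration only.
print(one_sided_p_value(1.96, "increase"))
print(two_sided_p_value(-1.96))
print(welch_degrees_of_freedom(se_t=1.0, se_c=1.0, n_t=11, n_c=11))
```

With equal standard errors and equal sample sizes, the degrees of freedom reduce to $N_t + N_c - 2$, which is a convenient sanity check on the formula.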