A/B Test Power Calculator
Before you run an A/B test, you need to know how many visitors each variant requires. Enter your baseline conversion rate and the minimum improvement you want to detect, and this calculator tells you exactly how large your sample needs to be.
Your current conversion rate before the test.
A 20% relative lift on a 5% baseline = detecting a rise to 6%.
Confidence level
Statistical power
Results
Visitors per variant
Total visitors needed
Baseline rate
Target rate
Confidence level
Statistical power
Estimated runtime
Based on your daily visitor count split 50/50 across both variants.
Test complete?
Once you've hit your sample size, check whether your results are statistically significant.
Why You Need a Power Test Before Your A/B Test
Most A/B test failures are not caused by a bad variant. They are caused by a badly sized experiment.
Underpowered tests miss real improvements
If your sample is too small, you won't have enough statistical sensitivity to reliably detect the lift you're looking for — even if it's genuinely there. You'll call the test inconclusive and move on, leaving a real winner on the table. A power test tells you the minimum sample required to avoid this.
Overpowered tests waste time and budget
Running a test for twice as long as you need ties up your traffic, delays other experiments, and in paid media contexts, extends the period of suboptimal spend. Calculate the right sample size upfront and stop exactly when you have enough data.
It forces you to commit to an MDE
Deciding your minimum detectable effect before the test starts is a forcing function. It makes you answer: what improvement is actually worth acting on? If a 5% lift would not change your marketing decisions, set your MDE higher. This prevents you from fishing for significance on tiny, meaningless effects.
It prevents peeking and early stopping
When you know exactly how many visitors you need, you have a hard stopping criterion. Without one, it's tempting to stop as soon as results look good — which inflates your false-positive rate dramatically. Once you hit your target sample size, take your results to our statistical significance calculator to get your p-value.
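To see how badly peeking distorts results, here is a minimal simulation sketch in plain Python (the parameters, names, and checking schedule are illustrative, not how any particular testing tool works). Both variants convert at the same 5% rate, so every "significant" result is a false positive. A single check at the pre-committed sample size holds the error rate near 5%; testing after every 500 visitors and stopping at the first "win" inflates it several-fold.

```python
import random
from statistics import NormalDist

def two_sided_p(conv_a, n_a, conv_b, n_b):
    """Two-sided p-value from a pooled two-proportion z-test."""
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = (pooled * (1 - pooled) * (1 / n_a + 1 / n_b)) ** 0.5
    if se == 0:
        return 1.0
    z = abs(conv_a / n_a - conv_b / n_b) / se
    return 2 * (1 - NormalDist().cdf(z))

random.seed(1)
TRIALS, N, CHECK_EVERY, RATE = 1000, 5_000, 500, 0.05  # A/A test: no true lift
final_hits = peek_hits = 0

for _ in range(TRIALS):
    conv_a = conv_b = 0
    peeked = False
    for i in range(1, N + 1):
        conv_a += random.random() < RATE
        conv_b += random.random() < RATE
        # Peeking rule: test at every interim check, stop at the first "win".
        if (not peeked and i % CHECK_EVERY == 0
                and two_sided_p(conv_a, i, conv_b, i) < 0.05):
            peek_hits += 1
            peeked = True
    # Disciplined rule: one test at the pre-committed sample size.
    final_hits += two_sided_p(conv_a, N, conv_b, N) < 0.05

print(f"single final check: {final_hits / TRIALS:.1%} false positives")
print(f"peeking every {CHECK_EVERY}: {peek_hits / TRIALS:.1%} false positives")
```

With ten interim looks, the peeking arm typically lands at three to four times the nominal 5% rate; peek more often and it climbs further, toward the 25%+ figure cited in the FAQ below.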
Test already running? Once you've collected enough data, check if your results are real.
Statistical Significance Calculator →
How Statistical Power Works
Four concepts that determine your required sample size — and how to set each one.
Baseline conversion rate
Your current conversion rate before the test. The lower your baseline, the more visitors you need to detect the same absolute change. A test on a 1% baseline requires far more visitors than the same test on a 10% baseline. Use the last 30 days of data from your analytics.
Minimum detectable effect (MDE)
The smallest improvement that would justify launching the variant. Smaller MDEs require dramatically larger samples — halving your MDE roughly quadruples the required sample size. Be honest: if you would not launch a variant for a 5% lift, don't power your test to detect 5%.
Confidence level (1 − α)
The probability of avoiding a false positive, i.e. declaring a winner when there is none. 95% is standard: a variant with no real effect will still test significant about 1 time in 20 by chance. Lowering to 90% reduces the required sample size but doubles that false-positive risk; use it only for low-stakes tests.
Statistical power (1 − β)
The probability of detecting a real effect when it exists. 80% is the industry standard — meaning a 20% chance of a false negative. Higher power (95%) dramatically increases required sample size. Use 80% unless the cost of missing a true improvement is unusually high.
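All four concepts combine in the standard two-proportion sample-size formula, which is what calculators like this one generally implement (the exact method behind the tool above may differ slightly). A minimal sketch in plain Python; the function name and defaults are illustrative:

```python
from math import ceil, sqrt
from statistics import NormalDist

def sample_size_per_variant(baseline, relative_mde, alpha=0.05, power=0.80):
    """Visitors per variant for a two-sided, two-proportion z-test."""
    p1 = baseline
    p2 = baseline * (1 + relative_mde)             # target rate implied by the MDE
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # 1.96 at 95% confidence
    z_power = NormalDist().inv_cdf(power)          # 0.84 at 80% power
    p_bar = (p1 + p2) / 2
    n = ((z_alpha * sqrt(2 * p_bar * (1 - p_bar))
          + z_power * sqrt(p1 * (1 - p1) + p2 * (1 - p2))) ** 2
         / (p2 - p1) ** 2)
    return ceil(n)

# 5% baseline, 20% relative MDE (5% -> 6%), 95% confidence, 80% power:
print(sample_size_per_variant(0.05, 0.20))   # ≈ 8,159 per variant

# Halving the MDE roughly quadruples the sample:
print(sample_size_per_variant(0.05, 0.10))   # ≈ 31,234 per variant

# The same relative MDE is far more expensive on a low baseline:
print(sample_size_per_variant(0.01, 0.20))   # ≈ 42,693 per variant
print(sample_size_per_variant(0.10, 0.20))   # ≈ 3,841 per variant
```

The last four lines make the claims above concrete: halving the MDE roughly quadruples the sample, and a 1% baseline needs an order of magnitude more visitors than a 10% baseline for the same relative lift.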
Frequently Asked Questions
What is statistical power in A/B testing?
Statistical power is the probability that your test will detect a real effect when one exists. A power of 80% means there is a 20% chance of missing a genuine improvement — a false negative. Higher power requires more visitors but reduces the risk of incorrectly calling a test inconclusive.
What is a minimum detectable effect (MDE)?
The minimum detectable effect is the smallest improvement you want to reliably detect. Setting a smaller MDE requires more visitors. Choose your MDE based on the minimum business impact that would justify acting on the test — if a 5% lift would not change your decisions, set a higher MDE.
Should I use relative or absolute MDE?
Relative MDE is a percentage improvement on your baseline — a 20% relative lift on a 5% baseline means detecting a rise to 6%. Absolute MDE is a direct percentage point change — 5% to 6% is a 1pp absolute lift. Relative is more intuitive for most marketers; both give the same required sample size if they describe the same underlying change.
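The conversion between the two is simple arithmetic. A quick sketch using the numbers from the answer above (all values illustrative):

```python
baseline = 0.05                       # 5% baseline conversion rate

relative_mde = 0.20                   # 20% relative lift
target_from_relative = baseline * (1 + relative_mde)  # 0.06 -> 6%

absolute_mde = 0.01                   # 1 percentage point (pp) lift
target_from_absolute = baseline + absolute_mde        # 0.06 -> 6%

# Same underlying change, so a power calculation gives the same sample size.
assert abs(target_from_relative - target_from_absolute) < 1e-12
```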
What power level should I use — 80% or 95%?
80% is the industry standard for most tests. It accepts a 20% risk of missing a real effect. Use 95% when the cost of missing a true improvement is very high, but note that it significantly increases the required sample size — often by 60% or more.
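The "60% or more" figure falls out of the formula: required sample size scales roughly with the square of the summed z-scores, so moving power from 80% to 95% at 95% confidence gives (a back-of-envelope normal-approximation sketch, ignoring the small change in the variance terms):

```python
from statistics import NormalDist

z = NormalDist().inv_cdf
ratio = ((z(0.975) + z(0.95)) / (z(0.975) + z(0.80))) ** 2
print(f"{ratio:.2f}x")   # ≈ 1.66x, i.e. about 66% more visitors per variant
```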
What confidence level should I use?
95% is the standard. It means you accept a 5% chance of a false positive — declaring a winner when there is none. Use 90% for low-stakes, high-frequency tests where you can tolerate slightly more noise. Use 99% only for very high-stakes decisions where a false positive would be costly.
How long should I run my A/B test?
Run the test until you reach your required sample size per variant, and for at least one to two full weeks to average out day-of-week effects. Enter your daily visitor count into the calculator above to get a runtime estimate. Never stop early because results look significant — this is peeking and inflates your false-positive rate.
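A minimal sketch of that runtime arithmetic, assuming the calculator's 50/50 traffic split and treating the two-week minimum as a hard floor (the function name and parameters are illustrative, not the tool's actual code):

```python
from math import ceil

def estimated_runtime_days(per_variant, daily_visitors, min_days=14):
    """Days to reach the required total sample, floored at two full weeks."""
    days_for_sample = ceil(2 * per_variant / daily_visitors)  # 50/50 split
    return max(days_for_sample, min_days)

# e.g. ≈8,159 visitors per variant at 800 visitors/day -> 21 days
print(estimated_runtime_days(8159, 800))
```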
What happens if I stop before reaching the required sample size?
Stopping early — especially when results look promising — is called peeking, and it inflates your false-positive rate from 5% to 25% or higher. Always run to your pre-calculated sample size. Once you hit it, use our statistical significance calculator to evaluate your results.
Work with Jarrah
Ready to scale your winners?
We run paid media and CRO programs built on rigorous testing — not hunches.