Hypothesis

Analytics

A testable prediction about what change will improve a specific metric, used as the starting point for any meaningful experiment.

What is Hypothesis?

A hypothesis is a specific, testable prediction about what change will improve a defined outcome. In B2B marketing and sales testing, a hypothesis takes the form: 'if we change X, then Y will improve because Z.' The change is the variable, Y is the metric you are optimising, and Z is the reasoning based on evidence or logical inference about why the change should matter.

Writing hypotheses forces discipline in testing. Without one, teams run tests without a clear basis for expecting improvement and interpret results without a framework for learning. With a hypothesis, every test produces a learning regardless of whether it confirms or refutes the prediction: either you have evidence supporting the reasoning, or evidence against it, and both are useful.

The quality of a hypothesis determines the value of the test. Weak hypotheses like 'changing the subject line will improve opens' generate weak learnings. Strong hypotheses like 'a subject line referencing the prospect's specific industry challenge will outperform a generic curiosity line because our audience responds to evidence of relevance over curiosity gaps' generate learnings that inform future decisions beyond the immediate test.

Analytics terms are useful only when they change a decision. A metric can look sophisticated and still be low value if nobody knows how it is calculated, which segment matters, or what action should follow when it moves. It usually becomes more useful when it is defined alongside A/B test, Iteration, and Baseline.

Hypothesis — example

An outbound team's sequence has a 3% positive reply rate on email 3. The campaign manager hypothesises: "If we replace the product-feature focus in email 3 with a question about the prospect's specific operational challenge, positive reply rate will improve because our ICP responds to pain acknowledgment over feature description." They test it on 400 sends per variant. The pain-focused version achieves 5.8% versus 2.9% for the feature-focused version, confirming the hypothesis and establishing a principle about message framing that applies to future sequences.

A B2B team uses Hypothesis to compare sources that look similar at the lead level but perform very differently once quality and pipeline impact are included. The metric becomes more useful once it is reviewed by segment instead of in aggregate. They also make sure it connects cleanly to A/B test and Iteration so the definition is not trapped inside one team.

Frequently asked questions

What makes a good hypothesis for an outreach test?

Specificity and falsifiability. A good hypothesis names the specific variable being changed, predicts the direction of change in a specific metric, and states the reasoning. If the test cannot produce a clear yes or no answer, the hypothesis is not specific enough. If the result could be interpreted as confirming the hypothesis regardless of outcome, it is not falsifiable.

How do I generate useful hypotheses when I do not know what to test?

Review your current performance data and identify where the biggest gap between actual and target performance exists. Form a hypothesis about the most likely cause of that gap based on what you know about your audience. If reply rates are low, hypothesis candidates include: wrong ICP, weak opening line, irrelevant offer, or wrong timing. Pick the most likely cause based on evidence and test it first.

How many hypotheses should I test simultaneously?

One per controlled test. However, you can run multiple independent tests simultaneously if they target different stages of the funnel or different segments with no overlap. Running two subject line tests simultaneously on different audience segments is fine. Running two competing changes in the same email to the same audience prevents you from attributing results.

Should I document failed hypotheses?

Yes, and they are often more valuable than successful ones. A failed hypothesis tells you that the assumed mechanism does not work for your specific audience. Documenting it prevents you from retesting the same thing later and builds institutional knowledge about what your audience does not respond to.

What is the minimum sample size for a hypothesis test in outbound?

For subject line open rate tests: 100 sends per variant minimum, 200 preferred. For email body reply rate tests: 200 to 400 per variant minimum because positive reply rates are lower and variance is higher. For landing page conversion tests: 200 unique visitors per variant minimum. Below these thresholds, results are too noisy to draw reliable conclusions.