Published on *Explorable.com* (https://explorable.com)

Statistical hypothesis testing is used to determine whether an experiment conducted provides enough evidence to reject a proposition.

It is also used to remove the chance process in an experiment and establish its validity [3] and relationship with the event under consideration.

For example, suppose you want to study the effect of smoking on the occurrence of lung cancer cases. If you take a small group, it may happen that there appears no correlation [4] at all, and you find that there are many smokers with healthy lungs and many non-smokers with lung cancer.

However, it can just happen that this is by chance, and in the overall population this isn't true. In order to remove this element of chance and increase the reliability [5] of our hypothesis [6], we use statistical hypothesis testing. [7]

In this, you will first assume a hypothesis that smoking and lung cancer are unrelated. This is called the 'null hypothesis [8]', which is central to any statistical hypothesis testing.

You should therefore first choose a distribution [9] for the experimental group. Normal distribution [10] is one of the most common distributions encountered in nature, but it can be different in different special cases.

There should then be limits set on the critical value, beyond which you can assume that the experiment proves that the null hypothesis is false and therefore using statistical hypothesis testing, the experiment shows there is enough evidence to reject the null hypothesis. This is generally set at 5% or 1% chance probability.

This means that if the experiment suggests that the probability of a chance event in the experiment is less than this critical value, then the null hypothesis can be rejected.

If the null hypothesis is rejected, then we need to look for an alternative hypothesis that is in line with the experimental observations.

There is also the gray area in between, like at the 15-20% level, in which it is hard to say whether the null hypothesis can be rejected. In such cases, we can say that there is reason enough to doubt the validity of the null hypothesis but there isn't enough evidence to suggest that we reject the null hypothesis altogether.

A result in the gray area often leads to more exploration before concluding [11] anything.

The other thing with statistical hypothesis testing is that there can only be an experiment [12] performed that doubts the validity of the null hypothesis, but there can be no experiment that can somehow demonstrate that the null hypothesis is actually valid. This because of the falsifiability-principle in the scientific method.

Therefore it is a tricky situation for someone who wants to show the independence of the two events, like smoking and lung cancer in our previous example.

This problem can be overcome using a confidence interval [13] and then arguing that the experimental data reveals that the first event has a negligible (as much as the confidence interval) effect, if at all, on the second event.

In the figure below, we can see that one can argue the independence is within 0.05 times the standard deviation [14].

**Links**

[1] https://explorable.com/statistical-hypothesis-testing

[2] https://explorable.com/users/siddharth

[3] https://explorable.com/validity-and-reliability

[4] https://explorable.com/statistical-correlation

[5] https://explorable.com/statistical-reliability

[6] https://explorable.com/research-hypothesis

[7] http://itl.nist.gov/div898/handbook/prc/section1/prc13.htm

[8] https://explorable.com/null-hypothesis

[9] https://explorable.com/frequency-distribution

[10] https://explorable.com/normal-probability-distribution

[11] https://explorable.com/drawing-conclusions

[12] https://explorable.com/experimental-research

[13] https://explorable.com/statistics-confidence-interval

[14] https://explorable.com/calculate-standard-deviation