Statistical Reliability

49.4K reads

Statistical reliability is needed in order to ensure the validity and precision of the statistical analysis.

It refers to the ability to reproduce the results again and again as required. This is essential as it builds trust in the statistical analysis and the results obtained.

For example, suppose you are studying the effect of a new drug on the blood pressure in mice. You would want to do a number of tests and if the results are found to be good in controlling blood pressure, you might want to try it out in humans too.

The statistical reliability is said to be low if you measure a certain level of control at one point and a significantly different value when you perform the experiment at another time.

However, if the reliability is low, this means that the experiment that you have performed is difficult to be reproduced with similar results then the validity of the experiment decreases. This means that people will not trust in the abilities of the drug based on the statistical results you have obtained.

Validity and Reliability

In many cases, you can improve the reliability by taking in more number of tests and subjects. Simply put, reliability is a measure of consistency.

Reliability can be measured and quantified using a number of methods.

Consider the previous example, where a drug is used that lowers the blood pressure in mice. Depending on various initial conditions, the following table is obtained for the percentage reduction in the blood pressure level in two tests. (Disclaimer: This is just an illustrative example - no test has actually been conducted)

Time after injection Test 1 Test 2
1 min 5.86 5.89
2 min 6.35 6.41
3 min 7.12 6.95
4 min 9.18 9.01
5 min 12.36 12.13
6 min 14.26 14.93
7 min 16.96 15.89

Ideally, the two tests should yield the same values, in which case the statistical reliability will be 100%. However, this doesn't happen in practice, and the results are shown in the figure below. The dotted line indicates the ideal value where the values in Test 1 and Test 2 coincide.

Statistical Reliability

Using the above data, one can use the change in mean, study the types of errors in the experimentation including Type-I and Type-II errors or using retest correlation to quantify the reliability.

The use of statistical reliability is extensive in psychological studies, and therefore there is a special way to quantify this in such cases, using Cronbach's Alpha. This gives a measure of reliability or consistency.

With an increase in correlation between the items, the value of Cronbach's Alpha increases, and therefore in psychological tests and psychometric studies, this is used to study relationship between parameters and rule out chance processes.

Full reference: 

(Oct 1, 2009). Statistical Reliability. Retrieved Sep 18, 2018 from

You Are Allowed To Copy The Text

The text in this article is licensed under the Creative Commons-License Attribution 4.0 International (CC BY 4.0).

This means you're free to copy, share and adapt any parts (or all) of the text in the article, as long as you give appropriate credit and provide a link/reference to this page.

That is it. You don't need our permission to copy the article; just include a link/reference back to this page. You can use it freely (with some kind of link), and we're also okay with people reprinting in publications like books, blogs, newsletters, course-material, papers, wikipedia and presentations (with clear attribution).

Search over 500 articles on psychology, science, and experiments.

Want to stay up to date? Follow us!