Pearson Product-Moment Correlation

The Pearson Product-Moment Correlation is a measure of correlation that quantifies both the strength and the direction of the linear relationship between two variables. It is usually denoted by the Greek letter ρ (rho) for a population, and by r when it is computed from a sample.


In the study of relationships, two variables are said to be correlated if a change in one variable is accompanied by a change in the other, either in the same or in the opposite direction.


When to Use This Statistic

This coefficient is used only when two conditions are satisfied:

  1. the variables are measured on an interval or ratio scale
  2. a linear relationship between the variables is suspected




Positive and Negative Correlation

The coefficient (ρ) is calculated as the ratio of the covariance between the variables to the product of their standard deviations, that is, ρ = cov(X, Y) / (σX σY). This formulation is very useful for two key reasons.

First, it tells us the direction of the relationship. Once the coefficient is computed, ρ > 0 indicates a positive relationship, ρ < 0 indicates a negative relationship, and ρ = 0 indicates that there is no linear relationship.

Second, it ensures (mathematically) that the numerical value of ρ ranges from -1.0 to +1.0. This enables us to get an idea of the strength of relationship - or rather the strength of linear relationship between the variables. The closer the coefficient is to +1.0 or -1.0, the greater the strength of the linear relationship.
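
The calculation described above can be sketched in a few lines of Python. This is a minimal illustration rather than a reference implementation: the function name pearson and the temperature and rainfall values are made up for the example, and the population (divide by n) forms of the covariance and standard deviations are assumed. The choice between n and n - 1 does not affect the result, because the factor cancels in the ratio.

```python
import math

def pearson(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Population covariance and standard deviations (divide by n).
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    if sd_x == 0 or sd_y == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return cov / (sd_x * sd_y)

# Hypothetical data: daily maximum temperature and rainfall.
temperature = [20.0, 22.0, 25.0, 27.0, 30.0]
rainfall = [12.0, 10.0, 7.0, 6.0, 2.0]

r = pearson(temperature, rainfall)
print(round(r, 3))  # negative here: rainfall falls as temperature rises
```

The sign of the result gives the direction of the relationship, and its distance from zero gives the strength of the linear relationship.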

As a rule of thumb, the following guidelines are often useful (though many experts would disagree somewhat on the choice of boundaries).

Range of ρ

Value of ρ                       Strength of relationship
-1.0 to -0.5 or 0.5 to 1.0       Strong
-0.5 to -0.3 or 0.3 to 0.5       Moderate
-0.3 to -0.1 or 0.1 to 0.3       Weak
-0.1 to 0.1                      None or very weak
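
The rule of thumb can be written as a small lookup. The sketch below is only one way to encode it: the function name strength is hypothetical, and because the ranges in the table share their end points, it assumes that a boundary value such as 0.5 falls into the stronger category.

```python
def strength(rho):
    """Label the strength of a linear relationship using the rule-of-thumb table."""
    if not -1.0 <= rho <= 1.0:
        raise ValueError("rho must lie between -1.0 and +1.0")
    size = abs(rho)  # strength depends on magnitude only, not on direction
    # Assumption: a value exactly on a boundary goes to the stronger category.
    if size >= 0.5:
        return "Strong"
    if size >= 0.3:
        return "Moderate"
    if size >= 0.1:
        return "Weak"
    return "None or very weak"

for value in (-0.7, 0.35, 0.2, 0.05):
    print(value, strength(value))
```

Since experts disagree on the boundaries, the thresholds are best treated as adjustable defaults rather than fixed cut-offs.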

Properties of ρ

This measure of correlation has interesting properties:

  1. It is independent of any units of measurement. For example, the ρ value between the daily maximum temperature (in Centigrade) and rainfall per day (in mm) is expressed neither in Centigrade nor in mm. This is because it does not express a quantity, but a relationship between quantities.

  2. It is symmetric. This means that ρ between X and Y is exactly the same as ρ between Y and X.

  3. Pearson's correlation coefficient is independent of change in origin and scale, provided the scale factors are positive. This means that ρ between temperature (in Centigrade) and rainfall (in mm) would numerically be equal to ρ between temperature (in Fahrenheit) and rainfall (in cm).

  4. If the variables are truly independent of each other, then one would obtain ρ = 0. However, the converse is not true. In other words, ρ = 0 does not imply that the variables are independent; it only indicates that there is no linear relationship. You may also arrive at this result in error if your variables are not measured on an interval or ratio scale.
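
Properties 2 to 4 can be checked numerically. The sketch below uses made-up temperature and rainfall values and a hypothetical helper named pearson; it illustrates the properties on one small data set and is not a proof.

```python
import math

def pearson(x, y):
    """Pearson correlation computed from sums of squares and cross products."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical measurements.
celsius = [18.0, 21.0, 24.0, 27.0, 31.0]
rain_mm = [9.0, 14.0, 8.0, 4.0, 3.0]

# Property 2: swapping the variables leaves the coefficient unchanged.
symmetric = math.isclose(pearson(celsius, rain_mm), pearson(rain_mm, celsius))

# Property 3: changing origin and scale (Centigrade to Fahrenheit, mm to cm).
fahrenheit = [c * 9 / 5 + 32 for c in celsius]
rain_cm = [r / 10 for r in rain_mm]
invariant = math.isclose(pearson(celsius, rain_mm), pearson(fahrenheit, rain_cm))

# Property 4: y is completely determined by x, yet the coefficient is zero,
# because the relationship is a symmetric curve rather than a line.
x = [-2.0, -1.0, 0.0, 1.0, 2.0]
y = [v ** 2 for v in x]
rho_parabola = pearson(x, y)

print(symmetric, invariant, rho_parabola)
```

The last case is the one most often overlooked: y depends entirely on x, but the positive and negative halves of the curve cancel in the covariance.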

Caveats and Warnings

While ρ is a powerful tool, it is a much abused one and hence has to be handled carefully.

  1. People often forget or gloss over the fact that ρ measures only linear relationship. Consequently, a small value of ρ is often taken to mean that no relationship exists, when it actually indicates only that there is no linear relationship, or at best a very weak one.

    In such cases a (possibly strong!) nonlinear relationship may still exist.

    It's best to construct a scatter diagram to reveal any nonlinear relationship before firmly concluding that no relationship exists. If the scatter diagram points to a nonlinear relationship, an appropriate transformation can often achieve linearity, in which case ρ can be recomputed on the transformed data.
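
    The effect of a transformation can be illustrated with a small sketch. The data here are artificial (y grows exponentially with x), the helper pearson is hypothetical, and the logarithm is assumed to be the appropriate transformation only because the curve is known in advance; with real data the scatter diagram would suggest which transformation to try.

```python
import math

def pearson(x, y):
    """Pearson correlation computed from sums of squares and cross products."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Artificial data with an exact but nonlinear (exponential) relationship.
x = [float(i) for i in range(1, 11)]
y = [math.exp(v) for v in x]

# On the raw data the coefficient understates how tightly y follows x.
rho_raw = pearson(x, y)

# Taking logarithms turns the curve into a straight line, so the
# recomputed coefficient is essentially 1.
log_y = [math.log(v) for v in y]
rho_transformed = pearson(x, log_y)

print(round(rho_raw, 3), round(rho_transformed, 3))
```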

  2. One has to be careful in interpreting the value of ρ, specifically when it makes no obvious sense to connect the variables in the first place.

    For example, one could compute ρ between shoe size and intelligence, or height and income. Irrespective of the value of ρ, such a correlation makes no sense and is hence termed a chance or nonsense correlation.

  3. As with many related statistics, ρ should not be used to make claims about a cause and effect relationship. Put differently, by examining the value of ρ, we can only conclude that variables X and Y are related.

    However, the value of ρ does not tell us whether X influences Y or the other way round, a fact that is of key importance in regression analysis.

Full reference: 

Explorable.com, (Oct 8, 2009). Pearson Product-Moment Correlation. Retrieved Nov 09, 2024 from Explorable.com: https://explorable.com/pearson-product-moment-correlation

You Are Allowed To Copy The Text

The text in this article is licensed under the Creative Commons-License Attribution 4.0 International (CC BY 4.0).

This means you're free to copy, share and adapt any parts (or all) of the text in the article, as long as you give appropriate credit and provide a link/reference to this page.

That is it. You don't need our permission to copy the article; just include a link/reference back to this page. You can use it freely (with some kind of link), and we're also okay with people reprinting in publications like books, blogs, newsletters, course-material, papers, wikipedia and presentations (with clear attribution).




