Correlation & Causation


Correlation does not imply causation.

If you work in data science, you have probably heard the phrase “Correlation does not imply causation” more than a few times. Let’s dig into it to find the crux of the saying.

There is an interesting XKCD comic about correlation.

[Image: XKCD comic on correlation. Source: xkcd]

What is correlation?

In statistics, the association between two random variables in a dataset is called correlation. Most of the time, what we measure is the strength of the linear relationship between the two variables.

E.g., A and B tend to vary together.
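The linear relationship described above is most commonly summarized with the Pearson correlation coefficient, which ranges from -1 to 1. The snippet below is a minimal sketch of that calculation; the function name `pearson_r` and the sample lists `A`, `B` and `C` are made up for illustration and are not from any particular library.

```python
import math


def pearson_r(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(a) != len(b) or len(a) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    # Sum of products of deviations from the means (unnormalized covariance)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    # Sums of squared deviations (unnormalized variances)
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_a * var_b)


# Illustrative data (assumed, not taken from a real dataset)
A = [1, 2, 3, 4, 5]
B = [2, 4, 6, 8, 10]  # rises in step with A
C = [10, 8, 6, 4, 2]  # falls as A rises

r_ab = pearson_r(A, B)  # perfect positive linear relationship
r_ac = pearson_r(A, C)  # perfect negative linear relationship
print(r_ab, r_ac)
```

A value near 1 or -1 only says that the two variables move together along a line. It says nothing about which one, if either, drives the other.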

What is causation?

Causality, also referred to as causation, is a property that connects one process with another: the first is understood to be at least partly responsible for the second, and the second depends on the first. Hence we can say that causation is a (possibly partial) guarantee that, given an event A (the cause), event B (the effect) follows.

E.g., A -> B (A implies B: if A, then B)

Let’s return to our original statement: “Correlation does not imply causation.”

I feel:

Correlation is a mathematical quantity, whereas causation is a physical one (an observation).

Since nothing in the mathematical quantity establishes the physical one, we cannot say that two random variables have a cause-and-effect relationship just because they are correlated.
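One way to see this is a small simulation. It is a sketch under assumed conditions, not a general proof: a hypothetical hidden variable `z` drives both `a` and `b`, while neither of the two is ever computed from the other. All names, noise levels and the seed below are illustrative choices.

```python
import math
import random


def pearson_r(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


random.seed(0)  # fixed seed so the run is repeatable
n = 5000

# Hypothetical hidden common cause
z = [random.gauss(0, 1) for _ in range(n)]

# a and b each depend on z plus their own noise; b never reads a
a = [zi + random.gauss(0, 0.5) for zi in z]
b = [zi + random.gauss(0, 0.5) for zi in z]
r_observed = pearson_r(a, b)

# "Intervention": set a by hand, independently of z, and leave b as it was.
# If a caused b, changing a this way would still be reflected in b.
a_forced = [random.gauss(0, 1) for _ in range(n)]
r_intervened = pearson_r(a_forced, b)

print(f"observed correlation:      {r_observed:.2f}")
print(f"after forcing a by hand:   {r_intervened:.2f}")
```

With these settings the observed correlation should come out strongly positive (about 0.8 in expectation), yet it vanishes to roughly zero once `a` is set independently. The correlation was real, but it came from the shared cause rather than from A acting on B.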


Kaustubh

I look after Technology at Thinkitive. Interested in Machine Learning, Deep Learning, IoT, TinyML and many more areas of application of machine learning.
