
Recently, co-bloggers Scott Sumner and Kevin Corcoran had a series of excellent posts on causation, coincidence, and identities (Scott’s post is here and Kevin’s are here, here, and here). I want to add my two cents to the conversation with some readings and thoughts for interested readers.
A theme that runs through both their posts is the idea of coincidence: that two events happen together without any apparent causation. Coincidence happens quite frequently. Two silly examples:
- July 14 I bowled the best two games of my life: 161 and 157. Even the third game, at 118, was better than average. My average is 110, so this was quite an improvement. It was also the first time in months I carried a $5 bill in my pocket. Did the $5 in my pocket cause my bowling game to improve? It can’t be a coincidence!
- On July 3, the Boston Red Sox visited Donald Trump in the White House. Then, they won 10 games in a row and went from bottom dwellers to just a few games behind the division-leading Toronto Blue Jays. Did Trump boost the Sox? It can’t be a coincidence!
Of course, both these examples are silly. Anyone claiming a $5 bill and merely being in Trump’s presence caused these events would get laughed out of a room. Indeed, there is plenty of counterevidence to indicate the combination of time and place is a coincidence: the Washington Nationals visited Trump in 2019 after their World Series victory and they’ve had a losing record ever since. It’s unlikely Trump caused either the Red Sox’s win streak or the Nationals losing streak.
Distinguishing causation from coincidence requires a good theory. Theory helps us see what is coincidence and what is causation. Theory, rigorously tested, is a vital lens to understanding the world. Bad theory leads to confusing coincidence with causation.
Of course, this is not to say that even rigorously-tested theories are ultimately correct. Miasma theory, for example, survived millennia of testing. Indeed, a lot of evidence existed to support it: bad air tended to congregate around disease. And the bad air often preceded the disease outbreak. But, after some careful study and a bit of luck, miasma theory eventually unraveled. John Snow hypothesized that certain diseases were not caused by bad air, but rather something else (he would die before germs were discovered, but he could see their existence in the data). The bad air was not causing the disease, but rather caused by the disease. (For interested readers, I highly recommend The Ghost Map by Steven Johnson.)
Determining causation is quite a tricky problem. Judea Pearl, a brilliant statistician at UCLA, has a series of books exploring causation from a statistical point of view. His technical book is called Causality and it is a difficult read. While no one will confuse me with a top-tier statistician, even those who are well-versed in the subject find it difficult.
For those of us who are not Turing Prize winners, he has a more accessible book: The Book of Why. In this book, he goes over the history of thought in causation and where we are now. Short version: We really don’t know when two things are causal. We do our best, but it’s quite a difficult problem. All models of causation have assumptions, some quite strong, and we can never be sure they actually hold.
Which brings me to my final point: the phrase “It can’t be a coincidence!” is quite likely the least scientific phrase in the English language. Not just because it is often invoked by conspiracy theorists or poor thinkers looking to push their latest half-baked idea, but also because it invokes a level of certainty one cannot have. Coincidences happen all the time. There is some probability that the causation is a coincidence. Even a claim of statistical significance (eg “P<0.05”) is a statement of probability (subject to the aforementioned modeling assumptions). Those who invoke such certainly usually do so because they lack sufficient theory and evidence to justify their claim.
When we consider the assumptions required to show causation, it should cause us to be humble enough to say, “It’s possible I am wrong.”
READER COMMENTS
David Seltzer
Jul 24 2025 at 10:28am
Jon, nicely done. Another “coincidence.” When street lights are turned on, causing the sun to set.
Robert EV
Jul 24 2025 at 1:06pm
This is why mechanistic hypotheses are very helpful. And even without mechanisms one can sometimes identify before and after (prerequisites), though this does assume that time is a directional arrow.
I’d rephrase this as “there is some possibility that the assumed causation is coincidence or indicative of a shared underlying causation”. Though depending on what you intended by the sentence it’s possible I am wrong.
Jon Murphy
Jul 24 2025 at 1:09pm
I think your rephrase is helpful actually
David Seltzer
Jul 24 2025 at 1:49pm
Jon: Some years ago, I did some Bayesian work on cause and effect in capital markets. Our well compensated quants tried to isolate cause and then measure the effects of causes. In your long study of econ, I suspect you know Bayesian inference considers the observed values of quantities to be realizations of random variables and the unobserved values to be unobserved random variables. The purpose; (G)estimate the effects of causes, not causes of effect.
Mactoul
Jul 25 2025 at 12:31am
To impute causation is always a leap of faith. The equations of physics have no direction. Force is mass times acceleration but it doesn’t say force causes acceleration or vice-versa. This isn’t a good example, by the way.
A better example is the relation between mass-energy and spacetime curvature. We do say that mass-energy causes spacetime curvature but the equation doesn’t say this. This imputation of A causing B is extraneous to the equations. The causation is entirely in our understanding and can not be formalized.
Robert EV
Jul 25 2025 at 12:58am
There appears to be something called causal calculus, and directional graphs, under which putative causation could be presumably formalized. I get that you’re saying that this isn’t done in the mathematical equations of physics that we know about, but not being a physicist I’m not willing to make the leap that what I know of physical equations pertaining to spacetime curvature, or what have you, is what actual physicists use when mathematicking about it.
Mark Barbieri
Jul 25 2025 at 4:01am
I love Tyler Vigen’s page of supposedly spurious correlations. He calls them spurious, but can anyone seriously look at the correlation between the popularity of the name ‘Sunny’ and the amount of solar power generated in Egypt and not realize there is a connection? Yeah, I guess they can.
https://www.tylervigen.com/spurious-correlations
Jon Murphy
Jul 25 2025 at 5:22am
That’s one of my favorite websites
Matthias
Jul 25 2025 at 8:19am
John Snow got very lucky. His methods could have just as well given him opposite results. He made lots of rather questionable when producing his map.
Jon Murphy
Jul 25 2025 at 8:24am
Two quick things:
1) Is the word “assumptions” missing from your last sentence?
2) Agreed on the luck thing. I’m retrospect, it looks brilliant (and it is). But at the time, it was a huge leap of faith.
Half of everything is luck.