No Paper Is That Good

Bryan Caplan

Categories: Behavioral Economics Economic Methods

By Bryan Caplan, Jul 12 2018

Last year, Noah Smith proposed his Two Paper Rule:

If you want me to read the vast literature, cite me two papers that are exemplars and paragons of that literature. Foundational papers, key recent innovations – whatever you like (but no review papers or summaries). Just two. I will read them.

If these two papers are full of mistakes and bad reasoning, I will feel free to skip the rest of the vast literature. Because if that’s the best you can do, I’ve seen enough.

If these two papers contain little or no original work, and merely link to other papers, I will also feel free to skip the rest of the vast literature. Because you could have just referred me to the papers cited, instead of making me go through an extra layer, I will assume your vast literature is likely to be a mud moat.

And if you can’t cite two papers that serve as paragons or exemplars of the vast literature, it means that the knowledge contained in that vast literature must be very diffuse and sparse. Which means it has a high likelihood of being a mud moat.

I never faced Noah’s challenge. Why not? To be totally honest, because I don’t know of any empirical papers that meet Noah’s standards. Yes, there are some literature reviews that I consider excellent, like Clemens’ “trillion-dollar bills on the sidewalk” article or Barnett and Ceci’s piece on Transfer of Learning. But I’d be loathe to point to any specific piece of research and call it a “paragon” or “exemplar.” Every article I’ve carefully examined has issues -no matter how I firmly agree with the conclusions. The highest compliments I’m comfortable paying a paper are “careful” and “cool,” never “compelling” or “clearly right.”

My slogan: No Paper Is That Good.

What’s wrong with every specific empirical paper?

First and foremost, external validity is always debatable. If you use data from 1950 to 2010, you can reasonably wonder, “But are the results relevant now?” If you use data from the 50 U.S. states, you can reasonably wonder, “But are the results relevant for Canada, or Germany, or China?” If you set up a pristine experiment, the problem just gets worse; the experiment might not even be relevant in the real world the day it was performed.

Second, identification is always debatable. Identifying a genuine “natural experiment” requires wisdom and patience. Plenty of smart people lack one or both. Calling something a “natural experiment” doesn’t make it so.

Third, even smart human beings are prone to big careless mistakes. A paper that seems impeccable to a casual reader might be based on miscoded data. Or crucial variable names could have been switched.

Fourth, although researchers like to pretend that they base their conclusions purely on “the evidence,” their priors always matter. If A seems initially obvious to you, and paper X confirms A, even researchers who know better must struggle not to say, “X shows that A is true.” The problem isn’t confidence in A, which may be completely warranted. The problem is the pretense that you believe in A because X confirms A, even though you would believe in A no matter how X came out.

Fifth, most researchers’ priors are heavily influenced by some extremely suspicious factors. Factors like: social acceptability, ideological palatability, and what you thought when you were an ignorant teenager.

To be clear, I freely admit that some papers are better than others. My claim is simply that the best existing papers are still underwhelming – and probably always will be. As Saint Paul preaches, “For all have sinned and fall short of the glory of God.”

Imagine placing papers on a continuum of convincingness from 0 to 1. 0=”provides no information at all.” 1=”decisively answers its question.” At least for questions that anyone cares about, I say the median paper hovers around .05. The best papers get up to around .20. Again, No Paper Is That Good. If you demur, consider this: In twenty years, will you still hold up the best papers of today as “paragons” or “exemplars” of compelling empirical work? If not, you already agree with me. The best papers are relatively good but absolutely mediocre. And no, you can’t just staple five top-scoring papers together to hit 1.0.

Does all of this hold for my papers, too? Of course. The most I can claim is that I am hyper-aware of my own epistemic frailty, and have a litany of self-imposed safeguards. But I totally understand why my critics would look at my best papers and say, “Meh, doesn’t really prove anything.”

Given my grim view of research, how can I remain a professional researcher? By the power of Stoicism. I do my best to reach truth despite the fact that No Paper Is That Good. I read voraciously in all the disciplines relevant to the questions on my mind – especially those review articles that Noah holds in low esteem. He’s right that research is “full of mistakes and bad reasoning”; I probably perceive even more mistakes and worse reasoning than he does. But my goal as a reader is to discover whether a paper has anything of value in it. When I toss a paper in the trash, it’s mostly because I decide the author doesn’t even aspire to answer an important question.

More fundamentally, though, I try to set aside controversial priors in favor of common sense, stay calm, and view all my identities with suspicion. And I bet. I wish there were a better way – an algorithm that assures truth. But I see no sign that such an algorithm exists.

READER COMMENTS

READ COMMENT POLICY

Daine Lee Danielson

Jul 12 2018 at 10:45pm

Highly recommended as exemplars in physics, are Einstein’s original papers on General Relativity.

robc

Jul 13 2018 at 9:03am

http://www.fisica.net/quantica/millikan_a_direct_photoelectric_determination_of_plancks_h.pdf

I couldn’t find the link to Einstein’s original paper on the photoelectric effect, but the combination of that one and the Millikan one above easily fit the Noah Smith rule.

I will let someone who actually has a PhD in physics give the exact score, but this PhD dropout thinks they are very close to 1.0.

jack pq

Jul 13 2018 at 9:27am

@robc: I suspect Bryan’s rule is true for social science, but may be less true for the natural and physical sciences, or at least true experimental sciences such as physics. Specifically, problems 1,2, 4 and 5 are much less relevant in physics. It only leaves 3.

David

Jul 13 2018 at 5:31pm

I’d have really liked to see some examples of papers Bryan considers to be at different levels of convincingness.

A Country Farmer

Jul 30 2018 at 10:04pm

I know this comment is superfluous, but I just want to encourage Dr. Caplan to post, post, and post! After reading so many of his posts, I save them off to a “wisdom” folder that is mostly filled with other Caplan posts. And many are timeless and applicable to so many areas of life.

Comments are closed.

Michael Clemens and Kate Gough: Or Means Or

David Henderson

Congress created the H-2A as a visa for temporary farmworkers: people who do not intend to settle in the United States and eventually become permanent residents. The executive branch has interpreted that to mean seasonal farmworkers. So the H-2A program currently allows workers only for parts of agriculture whose labor...

Jul 12 2018

Politics and Economics

Does immigration help the Dems?

Scott Sumner

In a previous post I argued that the answer is no: I don't believe that immigration will make America more diverse, nor do I think it will make the electorate vote more Democratic. That's because immigration from Asia and Latin America has made earlier immigrants from southern and eastern Europe seem less different,...

Jul 12 2018

Behavioral Economics

No Paper Is That Good

Bryan Caplan

Last year, Noah Smith proposed his Two Paper Rule: If you want me to read the vast literature, cite me two papers that are exemplars and paragons of that literature. Foundational papers, key recent innovations - whatever you like (but no review papers or summaries). Just two. I will read them. If these two papers a...

COLLECTION: BEHAVIORAL ECONOMICS

The article you’re reading is part of Econlib’s Behavioral Economics collection. Explore other Behavioral Economics articles:

Feb 19 2024

Does it Matter Whether Addiction is a Disease?

Scott Sumner
Jan 7 2024

Poverty: Is it Circumstances or Decision-Making?

Scott Sumner
Feb 15 2023

Correlation and causation

Scott Sumner
Feb 13 2023

What the Peltzman Effect Is and Isn't

David Henderson

No Paper Is That Good

RELATED CONTENT

Noah Smith on Whether Economics is a Science

READER COMMENTS

Daine Lee Danielson

Jul 12 2018 at 10:45pm

robc

Jul 13 2018 at 9:03am

jack pq

Jul 13 2018 at 9:27am

David

Jul 13 2018 at 5:31pm

A Country Farmer

Jul 30 2018 at 10:04pm

RECENT POST

Michael Clemens and Kate Gough: Or Means Or

Does immigration help the Dems?

No Paper Is That Good