President Trump fired the Bureau of Labor Statistics commissioner, Erika McEntarfer, after July’s jobs report showed very little job growth over the past quarter. Initially, the President accused her of “rigging” the numbers to make him look bad. More recently, members of his administration have tried to reduce the criticism to just that of substantial revisions (one such representative case is Casey Mulligan’s tweet here).
Let’s take the less inflammatory reason (unreliable jobs figures) as the true motivation here to ask a probing question: What would a successful change to the statistics program look like?
It would not be the case that revisions would disappear. With statistics, there will always be revisions. Any statistical report is necessarily built on various assumptions. Ultimately, you are collecting a sample that you use to, based on assumptions and stylized facts, make claims about the entire population. Ideally, one would survey the entire population, but that is cost-prohibitive, both in terms of money and time. So, one uses an (ideally) representative sample of the population. If those assumptions and stylized facts change or are no longer useful, then the model must be revised. Revision will, in turn, change the results of the claims the sample can support. In such a case, the presence of revised data is a sign of an improvement to the model. Without revisions, the model will become less useful over time.
What about the size of revisions? That, of course, is a concern. If the model’s revisions frequently swing by huge amounts, then the model is fundamentally flawed. But University of Central Arkansas economist Jeremy Horpedahl shows that the BLS’s data revisions have shrunk over time (see also this post by University of Louisiana economist Gary Wagner). Not much room for improvement there.
Size and frequency of revisions will depend on the sample, and most importantly, on the response rate of the sample. A major problem with the BLS data in general is that response rates have been falling. Falling response rates mean that larger and larger imputations have to be made with less data. Not ideal. Improving response rates could be a sign of better quality data.
We could also see how the BLS data correspond to other sources. ADP, the payroll company, puts out their own monthly survey of jobs. It’s not quite identical to the BLS report (see their FAQ at the bottom for differences), but it is a useful comparison tool. Indeed, the revisions to the BLS data (and ADP’s own revisions) tend to bring the two data sets closer together. Over time, the BLS’s private employment numbers and ADP’s private employment numbers differ, with ADP Report on average 1,000 jobs lower than the BLS report. Given we are talking job gains/losses in the tens, if not hundreds of thousands, each month, such a discrepancy is not bad at all.[1] Lower discrepancy between the two data sets would be a sign of improvement.
Improvements to economic data are a good thing. But any improvement will be a difficult process. One must be very, very careful about how one evaluates whether a change is an improvement.
—
[1] Note: All data are using non-seasonally adjusted figures. Since seasonal adjustment is a function of models chosen by each agency, NSA provide the best apples-to-apples comparison. However, using seasonally-adjusted figures doesn’t alter this much. The discrepancy rises to 5,000 employees per month.
READER COMMENTS
Craig
Aug 29 2025 at 10:15am
Likely double counting the overemployed remote workers. If remote is 15% of labor force and 80% have multiple screens up, I have also read 70% and a few other numbers, nobody seems to know definitively, but if true that would likely paint a rosier picture than actually exists.
Jon Murphy
Aug 29 2025 at 11:43am
Why would that be a “rosier picture than exists”?
Jon Murphy
Aug 29 2025 at 11:45am
I also hasten to add that the BLS does already track and report multiple job holders.
Craig
Aug 29 2025 at 12:04pm
They do. If you’re working, say, at Walmart and then on the weekend you get a job at McDonalds and you take the survey, yes, you’ll say that you’re working two jobs, badge of honor of sorts even. You can’t work at Walmart and McDonalds at the same time, working one job precludes working at the other, but remote work while that can be the case this is often two jobs overlapping and these people are likely going to be coy about it.
Jon Murphy
Aug 29 2025 at 12:18pm
Ok. So?
Craig
Aug 29 2025 at 12:43pm
Because they don’t know the proper offset for the double counting.
Jon Murphy
Aug 29 2025 at 1:00pm
There’s no double-counting going on. The BLS is looking at the number of jobs filled.
Jon Murphy
Aug 29 2025 at 1:03pm
In the BLS report, there are two measurements we’re talking here:
First: the number of people employed. These are the number of people currently working. Whether they are working one job or 50, it still only counts as one person employed.
Second: the number of jobs fulfilled. These are the number of jobs that are currently filled. The same person could be occupying multiple jobs. So, these are not tracking people but jobs.
In either case, there is no double-counting.
Craig
Aug 29 2025 at 1:05pm
https://recruitonomics.com/the-diverging-job-numbers-and-which-one-is-correct/
“Usually, these two surveys show similar trends. But as of last year, an extremely large gap has opened. The payroll survey shows a rapidly growing labor market with large employment gains, while the household survey shows job losses since the end of 2023!”
This was a 2024 article, but it has often been said that the economy is a ‘tale of two surveys’ and my suspicion is that at least part of that divergence are overemployed remote workers. Of course that depends on the scale of it and from the surveys alone its hard for them to tell.
Jon Murphy
Aug 29 2025 at 1:17pm
Ok? Not really sure what that article is saying that I am not.
Craig
Aug 29 2025 at 1:19pm
https://www.atlantafed.org/-/media/documents/research/publications/policy-hub/2024/08/12/06–what-accounts-for-growing-divergence-between-employment-measures.pdf
Atlanta Fed:
“The Current Employment Statistics (CES) measure of employment is likely overstating
the level of employment in 2024. Our analysis suggests that the CES forecast for the
net birth-death contribution to employment overstates the cumulative contribution to
employment by 440,000.
2. This downward adjustment to CES employment only explains a small share of the
current gap of 3.9 million between the CES and Current Population Survey (CPS)
measures of employment (after adjusting the CPS to be consistent with the CES
measure).
3. The remaining discrepancy is like due to an underestimation of population growth in the
CPS, likely to the result of an unexpected increase in net immigration. ”
Atlanta Fed could be correct but if 80% of 15% of your workforce is working two jobs that could help to explain the divergence as well.
Jon Murphy
Aug 29 2025 at 1:25pm
Ok?
I’m not sure where you’re going with all this.
There’s no double-counting going on.
These surveys are measuring different things (as both articles you cite) say.
Jon Murphy
Aug 29 2025 at 4:52pm
I guess these are the questions I need answered to clear up my confusion (as succulently as possible and with evidence, please):
First: Where is the double-counting?
Second: why would the presence of remote workers create a jobs report that suggests the jobs situation is “rosier than it is”?
Matthias
Aug 29 2025 at 11:09am
Well, you could also leave the statistics to the private sector?
If you need any extra statistics that the private sector agencies aren’t already publishing, you could pay them (via a competitive tender) to furnish them?
It looks like ADP is already doing a decent job for this specific data set? Perhaps they are less prone to political interference.
Craig
Aug 29 2025 at 11:21am
Also of note is that the government, aside from these surveys simply has access to large amounts of data as an incidence of tax collection. So with respect to, for instance, the total number of people employed, I always wonder to myself, “Why don’t they already just know this number?” After all those people are subject to withholding and then annually people file. The surveys should somewhat reconcile with those. Another one would be unemployment, state unemployment systems pay people on unemployment, now that might not capture discouraged workers, but they obviously report this on other government data and release things like ‘initial jobless claims’ and ‘continued unemployment claims’
steve
Aug 29 2025 at 2:00pm
They file annually but we want monthly reports. A private company cold do it but they would run into the same issues. If you want it to be more accurate you need larger samples. If you want larger samples it will cost more. There is no evidence of malfeasance so Trump’s advisors are walking this back. What we really had was someone fired because Trump had a temper tantrum and didnt like the news. This is the same guy who now holds control over our much of our economy and is trying to claim even more control.
Note that this is different than what we have seen in the past. In the past we had govt overreach but usually based upon a group or groups of government officials making decisions. Now we have one person deciding stuff. Its centralized government centered into one person. A person prone to temper tantrums.
Steve
Jon Murphy
Aug 29 2025 at 11:44am
You could do that. Some countries do, I believe.
TMC
Aug 29 2025 at 4:19pm
Most of the monthly revisions aren’t too bad, but those two >800k revisions looked terrible.
Jon Murphy
Aug 30 2025 at 9:53am
“Looked terrible” in terms of what? I mean, yeah, it’s a big scary number, but it represents just about a 0.24% revision on the total (if I’m remembering the calculations correctly. I’m currently traveling and cannot check the math).
David Seltzer
Aug 30 2025 at 5:17pm
Jon, BLS uses monthly labor force survey estimates from time-domain time-series sample data. This approach reduces the high variability in monthly survey estimates that are properties of small survey sample sizes. Survey estimates are the sum of a stochastic true labor force series (signal) and error (noise) generated by sampling a portion of the population. To wit.
y(t) = Y(t) +e (t)
y(t) =survey estimate
Y(t) labor force value (determinate)
e(t) = sampling error (Stochastic) with properties mean, variance and autocorrelation. I suspect the revisions and updates are done with a recursive Kalman Filter algo. Kalman filters estimate the state of a dynamic system from noisy measurements by minimizing the mean-square error. The Kalman filter consists of two primary steps. First predict, then update. (prior, posterior) In the prediction step, the filter uses the system’s model to predict the next state. In the update step, the filter uses the new measurement to update the state estimate. A little technical, but I hope this sheds some light on this.
Craig
Aug 31 2025 at 11:14am
With respect to both Household and Establishments both are relatively high sample sizes. Establiahment does have one wrinkle though which ia that the data comes in and then they have additional responses after the reporting period.
Jon Murphy
Aug 31 2025 at 11:38am
The sample size isn’t the problem here. The response rate is.
Pierre Lemieux
Sep 3 2025 at 10:46pm
Interesting take, Jon. And thanks for the link to the Horpedahl calculations.
Comments are closed.