Why I don't like Bayesian statistics

| 19 Comments

Bayesian inference is a coherent mathematical theory but I wouldn't trust it in scientific applications. Subjective prior distributions don't inspire confidence, and there's no good objective principle for choosing a noninformative prior (even if that concept were mathematically defined, which it's not). Where do prior distributions come from, anyway? I don't trust them and I see no reason to recommend that other people do, just so that I can have the warm feeling of philosophical coherence.

Bayesian theory requires a great deal of thought about the given situation to apply sensibly, and recommending that scientists use Bayes' theorem is like giving the neighborhood kids the key to your F-16. I'd rather start with tried and true methods, and then generalizing using something I can trust, like statistical theory and minimax principles, that don't depend on your subjective beliefs. Especially when the priors I see in practice are typically just convenient conjugate forms. What a coincidence that, of all the infinite variety of priors that could be chosen, it always seems like the normal, gamma, beta, etc., that turn out to be the right choice?

To restate these concerns mathematically: I like unbiased estimates and I like confidence intervals that really have their advertised confidence coverage. I know that these aren't always going to be possible, but I think the right way forward is to get as close to these goals as possible and to develop robust methods that work with minimal assumptions. The Bayesian approach--to give up even trying to approximate unbiasedness and to instead rely on stronger and stronger assumptions--that seems like the wrong way to go.

In the old days, Bayesian methods at least had the virtue of being mathematically clean. Nowadays, they all seem to be computed using Markov chain Monte Carlo, which means that, not only can you not realistically evaluate the statistical properties of the method, you can't even be sure it's converged, just adding one more item to the list of unverifiable assumptions.

People tend to believe results that support their preconceptions and disbelieve results that surprise them. Bayesian methods encourage this undisciplined mode of thinking. I'm sure that many individual Bayesian statisticians and are acting in good faith, but they're providing encouragement to sloppy and unethical scientists everywhere. And, probably worse, Bayesian techniques motivate even the best-intentioned researchers to get stuck in the rut of prior beliefs.

Bayesianism assumes: (a) Either a weak or uniform prior, in which case why bother?, (b) Or a strong prior, in which case why collect new data?, (c) Or more realistically, something in between, in which case Bayesianism always seems to duck the issue.

Nowadays people use a lot of empirical Bayes methods. I applaud the Bayesians' newfound commitment to empiricism but am skeptical of this particular approach, which always seems to rely on an assumption of "exchangeability." I do a lot of work in political science, where people are embracing Bayesian statistics as the latest methodological fad. Well, let me tell you something. The 50 states aren't exchangeable. I've lived in a few of them and visited nearly all the others, and calling them exchangeable is just silly. Calling it a hierarchical or a multilevel model doesn't change things--it's an additional level of modeling that I'd rather not do. Call me old-fashioned, but I'd rather let the data speak without applying a probability distribution to something like the 50 states which are neither random nor a sample.

Also, don't these empirical Bayes methods use the data twice? If you're going to be Bayesian, then be Bayesian: it seems like a cop-out and contradictory to the Bayesian philosophy to estimate the prior from the data. If you want to do hierarchical modeling, I prefer a method such as generalized estimating equations that makes minimal assumptions.

And don't even get me started on what Bayesians say about data collection. The mathematics of Bayesian decision theory lead inexorably to the idea that random sampling and random treatment allocation are inefficient, that the best designs are deterministic. I have no quarrel with the mathematics here--the mistake lies deeper in the philosophical foundations, the idea that the goal of statistics is to make an optimal decision. A Bayes estimator is a statistical estimator that minimizes the average risk, but when we do statistics, we're not trying to "minimize the average risk," we're trying to do estimation and hypothesis testing. If the Bayesian philosophy of axiomatic reasoning implies that we shouldn't be doing random sampling, then that's a strike against the theory right there. Bayesians also believe in the irrelevance of stopping times--that, if you stop an experiment based on the data, it doesn't change your inference. Unfortunately for the Bayesian theory, the p-value _does_ change when you alter the stopping rule, and no amount of philosophical reasoning will get you around that point.

I can't keep track of what all those Bayesians are doing nowadays--unfortunately, all sorts of people are being seduced by the promises of automatic inference through the "magic of MCMC"--but I wish they would all just stop already and get back to doing statistics the way it should be done, back in the old days when a p-value stood for something, when a confidence interval meant what it said, and statistical bias was something to eliminate, not something to embrace.

19 Comments

got me.

OK. I get it. I can read a calendar as well as the next guy. The only problem is, I agree with most of it.

I'm with jfalk.

At least the blogger and I can agree 1/365 of the time.

Finally Gelman talks some sense!

Don't ruin the perfectly good Apr 1 joke by replying today, but TOMORROW I'm expecting an overview of your counterargument :-)

So, for April 2nd I assume you'll give your responses to some of these objections? I just bought your Bayesian book, but I'd be curious to read your informal thoughts on these issues.

Well played, sir. I was very confused there for a bit. I actually briefly considered the possibility that the Andrew Gelman who coauthored Bayesian Data Analysis and the Andrew Gelman posting on this blog were two different people.

The last paragraph is beautiful.

Well,
I am a graduate student at Universidade de São Paulo, Brazil, who always have read your blog, but never happend to me to say anything. But now I have to say: what a joke! I expect you response to some claims, which are good one indeed - at least to me.

I'll come here next day while I wait for your answer!!

regards

you had me going for a sec there; you are not afraid to be controversial after all...

See you in 15 days and we can debate this issue some more ;)

Yes, please make your next post a picking-apart of this one. That would be edifying.

Brilliant!

"Bayesians also believe in the irrelevance of stopping times...."

Isn't that one of the methods so-called psychics use to fool researchers & the general public?

You almost got me fooled! The tone is so unlike you. So it is an April 1 joke!!

Yes, please write a response to this post, professor Gelman!

I am so embarrassed that I fell for the joke that I opt to remain anonymous. A counter argument would be cool. While you are at it, if you could explain how 90000 parameters could be estimated from data with 1200 observations using Bayesian MCMC methods, I would be much obliged.

Shouldn't "the old days" be "the good old days"?

Best nerdy laugh I've had in a while. Thanks Andy...

Apparently this is the one day a year that it's worth reading a Bayesian blog. A better signal to noise ratio than we normally expect from Bayesians. :)

Re the "good old days", I prefer to use modern methods of statistical inference (approximately 50 to 70 years old) rather than superceded methods which are more than 240-years-old.

Leave a comment

Subscribe to Entry

Email:

Recent Comments

  • JR: Re the "good old days", I prefer to use modern read more
  • Anonymous: Apparently this is the one day a year that it's read more
  • Simon Jackman: Best nerdy laugh I've had in a while. Thanks Andy... read more
  • Seth Roberts: Shouldn't "the old days" be "the good old days"? read more
  • Anonymous: I am so embarrassed that I fell for the joke read more
  • Anonymous: Yes, please write a response to this post, professor Gelman! read more
  • Yu-Sung: You almost got me fooled! The tone is so unlike read more
  • John: "Bayesians also believe in the irrelevance of stopping times...." Isn't read more
  • greg: Brilliant! read more
  • John: Yes, please make your next post a picking-apart of this read more
  • Brad Carlin: you had me going for a sec there; you are read more
  • Manoel Neto: Well, I am a graduate student at Universidade de São read more
  • noahpoah: Well played, sir. I was very confused there for a read more
  • charlie w: So, for April 2nd I assume you'll give your responses read more
  • Daniel Lakeland: Don't ruin the perfectly good Apr 1 joke by replying read more
  • Andrew: Finally Gelman talks some sense! read more
  • 1/356 : I'm with jfalk. At least the blogger and I can read more
  • jfalk: OK. I get it. I can read a calendar as read more
  • GoodnessOfFit: got me. read more