Daniel Lippman sent me this news article which would be an excellent thing to give your statistics students to read if you're covering confidence intervals and sampling.
6 Comments
Leave a comment
Subscribe to Entry
Email:
For more info on our research:
Blogroll
Statistics:
- Chance News
- The Endeavour
- Christian Robert
- Revolution Computing
- Yu-Sung Su's Blog
- The Numbers Guy
- Messy Matters
Visualization:
Cognitive and Behavioral Science:
- Decision Science News
- British Psychological Society Research Digest
- Seth Roberts [experimental psychology]
- Criteria's Employee Testing Blog
Social and Political Science:
- Monthly Labor Review Precis
- Marginal Revolution [economics]
- Language Log
- Social Science Statistics
- The Monkey Cage [political science]
- The Baby Name Wizard
- Vox EU
Machine Learning:
Cultural:
Pages
Research supported by the National Science Foundation
National Institutes of Health
Yahoo Research
Search
Recent Comments
- ollie: Your explanation is fine (at least for me); my writing read more
- current grad student: Ollie, A frequentist 95% confidence interval should be interpreted as read more
- ollie: Ok, for the sake of discussion. Say you take your read more
- Andrew: Vince, Ollie: I think the article's description is pretty good. read more
- ollie: Re: previous comment. I think that explanation is just plain read more
- Vince: You could ask students what they think of the author's read more
Recent Entries
- What is Info-Gap Theory?
- This is horrible
- Our new book!
- Survey analysis in R
- Conjugate prior to Dirichlets?
- Weather and Death in India: Mechanisms and Implications for Climate Change
- Funny graph
- In what settings do voters prefer candidates who look like them?
- Using a robot arm to feel a mathematical function or a statistical distribution
- Why it's not so weird that so many dentists are named Dennis: a story of conditional probability
- Connected
- Huh?
- "A gift to the audience rather than a plea for attention": Brad Paley's tips on encouraging seminar participants to ask so many damn questions you have to tell them to shut the heck up already so you can hear the rest of the damn talk
- A funny thing happened when fitting a hierarchical model . . .
- It's all downhill from here . . .
- p.bound in multilevel logistic regression
- Two kinds of book
- Home-base effect and social networks
- More on data visualization, beauty, etc.
- Discussion of Red State, Blue State
Categories
- Administrative (14)
- Art (32)
- Bayesian Statistics (215)
- Causal Inference (73)
- Decision Theory (137)
- Economics (160)
- Literature (114)
- Miscellaneous Science (106)
- Miscellaneous Statistics (377)
- Multilevel Modeling (157)
- Political Science (546)
- Public Health (113)
- Sociology (221)
- Sports (17)
- Statistical computing (110)
- Statistical graphics (129)
- Teaching (130)

You could ask students what they think of the author's explanation of margin of error. The article says:
"In theory, when a poll says 40 percent, with a 3 percentage point margin of sampling error, that means that 19 times out of 20, the number would be between 37 and 43 if you interviewed every adult American. But it’s a bell curve. So 40 is the most likely number. Thirty-nine and forty-one are a little less likely. Thirty-eight and 42 are even less likely than 39 and 41. And 37 and 43 are still less likely."
Re: previous comment. I think that explanation is just plain wrong. Setting a 95 percent confidence interval says that there is a 95 percent probability that the observation came from a distribution where p was between 37-43.
Ironically, the probability is actually quite low that the observation came from one where, say, p was every close to .4.
Vince, Ollie:
I think the article's description is pretty good. It's a Bayesian inference based on a flat prior distribution on the unknown probability. The flat prior distribution is not perfect, and problems certainly show up when sample sizes are small, but it's a good start, and it's consistent with how polls are summarized.
Ok, for the sake of discussion. Say you take your poll and calculate, say, a 50% confidence interval. That confidence interval would be a fairly narrow band about 40, no?
Now suppose we took a 25 percent confidence interval. It is even narrower, and centered about 40, right? Now drop the interval to 10%. It still contains 40 at the center, right?
I guess that my beef with this explanation (the one from the article) is that it gives too much sway to the actual statistic; that is why you see people saying "oh, Thompson's support is dropping since it was 40 today and was 42 yesterday." In fact, it is "probable" that the 40 and the 42 were samples from the same distribution.
I welcome correction and/or feedback as my questions are designed to strengthen my own understanding!
Ollie,
A frequentist 95% confidence interval should be interpreted as follows:
In repeated sampling, the confidence interval constructed from the data will cover the true value of the parameter 95% of the time.
If you are constructing a 95% bayesian interval, then the posterior probability that p belongs to the interval (.37,.43) is .95. If you construct an interval with a lower confidence level, then the probability that p belongs to the interval constructed will be smaller. The width of the confidence interval gives you some indication as to how much information you have about the "true" location of the parameter. If your 95% confidence interval is fairly wide, then the posterior probability that the parameter is within an extremely small region around phat is going to be really close to zero. However, the posterior mode is still the most likely estimate for p, even though the probability that p is exactly equal to the posterior mode is extremely unlikely.
Is it obvious that I'm a graduate student in statistics? I think that my explanation would make sense to a statistician, though I think I probably just failed in my attempt to explain a confidence interval to a non-statistician.
Your explanation is fine (at least for me); my writing is not. :)
I am not a statistician; I am a topologist, and therefore my understanding of statistics is basic.
I know that the measured p(hat) is the MLE estimator for the parameter p.
So, the explanation in the Times article is, while not precise, is ok for the lay-person, though I'd be a bit more cautious to explain that the observed "p" from a poll is reall a "point" estimate of the parameter "p".