<?xml version='1.0' encoding='UTF-8'?><rss xmlns:atom='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:thr='http://purl.org/syndication/thread/1.0' version='2.0'><channel><atom:id>tag:blogger.com,1999:blog-10065122</atom:id><lastBuildDate>Wed, 12 May 2010 06:49:30 +0000</lastBuildDate><title>Worth noting statistically</title><description></description><link>http://www.stat.columbia.edu/~tzheng/tianblog/</link><managingEditor>noreply@blogger.com (Tian)</managingEditor><generator>Blogger</generator><openSearch:totalResults>137</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-9093199073409073949</guid><pubDate>Fri, 22 Jan 2010 18:30:00 +0000</pubDate><atom:updated>2010-01-22T13:40:41.460-05:00</atom:updated><title>WSJ article: When Combined Data Reveal the Flaw of Averages</title><atom:summary type='text'>An example of Simpson's paradox. [Read more]This article was emailed to me from Jacob Weaver, an alumni of our department.</atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2010/01/wsj-article-when-combined-data-reveal.html</link><author>noreply@blogger.com (Tian)</author><thr:total>1</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-4482189214746796139</guid><pubDate>Fri, 22 Jan 2010 18:14:00 +0000</pubDate><atom:updated>2010-01-22T13:17:55.842-05:00</atom:updated><title>Investor sets eyes on "data-as-a-service" start-ups. Or should it be "Stat-as-a-service"?</title><atom:summary type='text'>Entrepreneurs in the social and real-time data sectors would do well to study up on Venrock’s Brian Ascher. At the spritely age of 43 with several big tech exits under his belt (Adify, DatAllegro, Unicru), the investor is bound to be a force in the global Silicon Valley for a while to come. He’s interested in startups that follow what he calls the "data-as-a-service" model: gather tons of data by</atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2010/01/entrepreneurs-in-social-and-real-time.html</link><author>noreply@blogger.com (Tian)</author><thr:total>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-4702362445085824316</guid><pubDate>Thu, 06 Aug 2009 19:35:00 +0000</pubDate><atom:updated>2010-01-22T14:13:26.836-05:00</atom:updated><title>NY Times: "For Today’s Graduate, Just One Word: Statistics"</title><atom:summary type='text'>Here is an article from the NY Times that outlines the up-and-coming status of statisticians in today's world. Enjoy!</atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2009/08/ny-times-for-todays-graduate-just-one.html</link><author>noreply@blogger.com (Tian)</author><thr:total>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-5372809837704096865</guid><pubDate>Fri, 10 Jul 2009 14:24:00 +0000</pubDate><atom:updated>2009-07-10T10:42:26.572-04:00</atom:updated><title>A make-over for your correlation matrix</title><atom:summary type='text'> This morning I received a pleasant surprise in my inbox. Taiyun Wei , a student from Hunan, China emailed me about his/her R package, corrplot. This package can produce a number of pretty "stylish" visualization of your correlation matrices. Next time I need to present such a matrix of mine, I may give it a make-over from the "wardrobe" of Taiyun's functions. More of Taiyun's art-like </atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2009/07/make-over-for-your-correlation-matrix.html</link><author>noreply@blogger.com (Tian)</author><thr:total>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-4776305189541178717</guid><pubDate>Mon, 06 Jul 2009 17:26:00 +0000</pubDate><atom:updated>2009-07-06T13:37:46.903-04:00</atom:updated><title>Election-fraud detection</title><atom:summary type='text'>WSJ.com published an article on "Rise and Flaw of Internet's Election-Fraud Hunters: Benford's Law, Which Tests Numbers for Authenticity, Might Detect Vote-Rigging but Can't Prove It" that describes how one can use the limited information in the election count data to possbly detect fraud. Because of the limited nature of the data, some "model" has to be assumed, which can be wrong in the first </atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2009/07/election-fraud-detection.html</link><author>noreply@blogger.com (Tian)</author><thr:total>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-9008997013122360785</guid><pubDate>Wed, 17 Jun 2009 19:35:00 +0000</pubDate><atom:updated>2009-06-17T16:06:41.116-04:00</atom:updated><title>Accepting rejections by Xiao-Li Meng</title><atom:summary type='text'>"Xiao-Li Meng took a humorous approach to the sore subject of handling rejections whenhe was part of a panel, “Reflections on Rejections,” sponsored by Harvard’s Bureau ofStudy Counsel (BSC). We publish his article here to amuse and comfort the many IMSmembers who have to deal with rejections of their research papers. ..."For full article, see page 16 of IMS Bulletin Vol 38 Issue 5.</atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2009/06/accepting-rejections-by-xiao-li-meng.html</link><author>noreply@blogger.com (Tian)</author><thr:total>4</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-2466508736522032452</guid><pubDate>Sun, 22 Mar 2009 01:42:00 +0000</pubDate><atom:updated>2009-03-21T21:42:36.814-04:00</atom:updated><title>A couple of good examples of observational studies for Intro Stats</title><atom:summary type='text'>Recently I noticed a couple of good examples of observational studies that can be used in my future Intro Stats teaching.1) In the Parents magazine, it reported that a study found a positive association between the frequency and severity of heartburn during pregnancy of the mom and how much hair the newborn baby has. According to the study, about two thirds of women who reported to have modest to</atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2009/03/couple-of-good-examples-of.html</link><author>noreply@blogger.com (Tian)</author><thr:total>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-7602879556321737104</guid><pubDate>Tue, 22 Jul 2008 15:05:00 +0000</pubDate><atom:updated>2008-07-22T11:13:15.369-04:00</atom:updated><title>Up and downs of percentage of working men and women</title><atom:summary type='text'>New York Times published an article today on "Women Are Now Equal as Victims of Poor Economy". The whole article can be found here. I found the time plots used in this article is very interesting. It has some good graphing tactics: use of color/shade, for example and the Y axis is actually from 0-100%.</atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2008/07/up-and-downs-of-percentage-of-working.html</link><author>noreply@blogger.com (Tian)</author><thr:total>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-3765763407401526957</guid><pubDate>Mon, 21 Jul 2008 21:15:00 +0000</pubDate><atom:updated>2008-07-22T11:01:53.902-04:00</atom:updated><title>The Black Swan</title><atom:summary type='text'>"The Black Swan" is a best-seller by Nassim Nicholas Taleb, the author of another bigger best-seller "Fooled by Randomness." Both books received more than 300 reviews on Amazon.com. I flipped through the second book in the bookstore and thought it was an interesting book, especially for teaching statistics to curious and skeptical undergraduate students.Today, I stumbled over a special issue in </atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2008/07/black-swan.html</link><author>noreply@blogger.com (Tian)</author><thr:total>6</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-9107509378090906040</guid><pubDate>Mon, 21 Jul 2008 20:38:00 +0000</pubDate><atom:updated>2008-07-21T16:53:18.023-04:00</atom:updated><title>Solution to improve integrity in medical research was proposed surrounding statistician's participation</title><atom:summary type='text'>This may be one of the most interesting articles from the president's column of AMSTAT news I have read recently. It is a reaction to an editorial of the Journal of American Medical Association that "impugning the integrity of medical science." As part of the proposed solution for the integrity problem in the pharmaceutical industry, it is suggested that clinical trial data collected by a </atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2008/07/solution-to-improve-integrity-in.html</link><author>noreply@blogger.com (Tian)</author><thr:total>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-362542123181325874</guid><pubDate>Mon, 25 Feb 2008 17:38:00 +0000</pubDate><atom:updated>2008-02-25T13:43:36.610-05:00</atom:updated><title>Identifying gene-gene interaction that is relevant to a disease outcome</title><atom:summary type='text'>In collaboration with Dr. Dimitris Anastassiou (EE, Columbia), we just published a paper on "Identification of gene interactions associated with disease from gene expression data using synergy networks" in BMC System Biology.</atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2008/02/identifying-gene-gene-interaction-that.html</link><author>noreply@blogger.com (Tian)</author><thr:total>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-5822432891179433211</guid><pubDate>Tue, 19 Feb 2008 16:48:00 +0000</pubDate><atom:updated>2008-02-19T11:56:28.642-05:00</atom:updated><title>Tiling Arrays</title><atom:summary type='text'>Here are some references on this technology for my own use.Global Identification of Human Transcribed Sequences with Genome Tiling Arrays Science 24 December 2004:Vol. 306. no. 5705, pp. 2242 - 2246DOI: 10.1126/science.1103388Paul Bertone,1* Viktor Stolc,1,2* Thomas E. Royce,3 Joel S. Rozowsky,3 Alexander E. Urban,1 Xiaowei Zhu,1 John L. Rinn,3 Waraporn Tongprasit,4 Manoj Samanta,2 Sherman </atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2008/02/tiling-arrays.html</link><author>noreply@blogger.com (Tian)</author><thr:total>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-5981943030232184926</guid><pubDate>Tue, 19 Feb 2008 16:45:00 +0000</pubDate><atom:updated>2008-02-19T11:47:56.161-05:00</atom:updated><title>Standard Deviation Function in R</title><atom:summary type='text'>I always thought there was no function for standard deviation in R. Since in Splus, it was stdev() if I remember correctly. I never help.seached it. I only found out that there was one: sd().</atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2008/02/standard-deviation-function-in-r.html</link><author>noreply@blogger.com (Tian)</author><thr:total>2</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-6864805998793405854</guid><pubDate>Tue, 13 Nov 2007 14:18:00 +0000</pubDate><atom:updated>2007-11-13T09:23:00.757-05:00</atom:updated><title>Temp Directory and Working Director of R</title><atom:summary type='text'>For a while, I have been using the setwd() command at the beginning of a project's code. This allows one to save data, graph, output easily to a project's result folder without typing long path names. Yesterday, I was using a contributed package in R that requires the access to R's temp directory. The exact location of this folder can be found out using the command tempdir(). It just happened </atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2007/11/temp-directory-and-working-director-of.html</link><author>noreply@blogger.com (Tian)</author><thr:total>1</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-6659259294767034504</guid><pubDate>Mon, 12 Nov 2007 14:32:00 +0000</pubDate><atom:updated>2007-11-12T09:36:26.506-05:00</atom:updated><title>Useful websites for graphics in R</title><atom:summary type='text'>R graphical manuals (a nice collection of examples and codes)http://cged.genes.nig.ac.jp/RGM2/index.php?clear=allCodes from the book "R graphics"http://www.stat.auckland.ac.nz/~paul/RGraphics/rgraphics.html</atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2007/11/useful-websites-for-graphics-in-r.html</link><author>noreply@blogger.com (Tian)</author><thr:total>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-3219051988921509911</guid><pubDate>Fri, 02 Nov 2007 23:24:00 +0000</pubDate><atom:updated>2007-11-02T19:37:05.496-04:00</atom:updated><title>Fast permuting r by c two-way tables</title><atom:summary type='text'>Today, I need to permute many two-way tables of the same dimension (say 2 by 3) to carry out a test of independence. My data looks like a big matrix with each row corresponding to a two-way table (arranged by row). The permutation is just to draw a random 2 by 3 table given independence between the two dimensions, conditioning on the observed marginal distributions. I didn't find (or didn't have </atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2007/11/fast-permuting-r-by-c-two-way-tables.html</link><author>noreply@blogger.com (Tian)</author><thr:total>1</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-7639507545025002788</guid><pubDate>Wed, 24 Oct 2007 13:12:00 +0000</pubDate><atom:updated>2007-10-24T09:26:49.119-04:00</atom:updated><title>Tropical infectious diseases moving north</title><atom:summary type='text'>Today, on CNN, it was said that some experts argued several epidemic consequences of global warming. One of them is tropical diseases are moving north. For example, malaria cases were reported in New Jersey.I found this claim is a little far-fetch. According to scientists, the temperature change due to global warming is about (1.33 ± 0.32 °F) during the last 100 years. The difference between the </atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2007/10/tropical-infectious-diseases-moving.html</link><author>noreply@blogger.com (Tian)</author><thr:total>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-8257465693989531588</guid><pubDate>Mon, 17 Sep 2007 23:11:00 +0000</pubDate><atom:updated>2007-09-17T19:23:36.686-04:00</atom:updated><title>Probing genetic overlap among complex human phenotypes</title><atom:summary type='text'>Andrey Rzhetsky (a collaborator of mine) called me today. Among many things we discussed, he told me that our recent paper on estimating genetic overlap from phenotype data (time at diagnosis) had attracted quite a bit of popularity. It was the 24th most read paper of PNAS for the month of July 2007. And it was covered in MIT technology review and the Wired Science blog, among many other more </atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2007/09/probing-genetic-overlap-among-complex.html</link><author>noreply@blogger.com (Tian)</author><thr:total>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-3774069813060866074</guid><pubDate>Sun, 16 Sep 2007 18:55:00 +0000</pubDate><atom:updated>2007-09-16T15:00:58.076-04:00</atom:updated><title>Correction: Chair Mao didn't say everything after all.</title><atom:summary type='text'>As Li pointed out, I had the wrong memory. I never aced any of these courses in my school days. BUT, I should have GOOGLED!!! :) That saying was published on May 11th, 1978. However, it was not proposed by Deng Xiaoping, despite the common perception. Deng only publicized it a lot. This is again a good example of Stigler's law of eponymy.</atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2007/09/correction-chair-mao-didnt-say.html</link><author>noreply@blogger.com (Tian)</author><thr:total>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-8211909843695548315</guid><pubDate>Fri, 07 Sep 2007 22:32:00 +0000</pubDate><atom:updated>2007-09-07T18:56:05.844-04:00</atom:updated><title>Chairman Mao was right!</title><atom:summary type='text'>I did not grow as a generation fully influenced by Mao Zedong. Actually, several days after I was born, he passed away. However, we still got to study quite a number of his quotes. One of them is "practice is the only way to validate the truth" (pardon my unpolished interpretation). 实践是检验真理的唯一标准。I have never given this quote much thought before yesterday.Yesterday, I received an email regarding a</atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2007/09/chairman-mao-was-right.html</link><author>noreply@blogger.com (Tian)</author><thr:total>1</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-4072159393606944905</guid><pubDate>Fri, 25 May 2007 20:51:00 +0000</pubDate><atom:updated>2007-05-25T17:01:30.145-04:00</atom:updated><title>I want a PlayStation 3 ... for research</title><atom:summary type='text'>Next time I go to visit some of my friends, I will pay more attention to their little black gaming devices, called PS3. It turns out the best computer processor of our times is used in it. It is the Cell Chip developed jointly by IBM, SONY and Toshiba. According to some very scientific study comparing the Cell Chip with conventionally designed chipset, the performance for chips with comparable </atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2007/05/i-want-playstation-3-for-research.html</link><author>noreply@blogger.com (Tian)</author><thr:total>4</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-9122344200891085304</guid><pubDate>Tue, 22 May 2007 22:43:00 +0000</pubDate><atom:updated>2007-05-22T18:44:39.694-04:00</atom:updated><title>citeulike</title><atom:summary type='text'>I came across this website that allows you to organize citations online. I think it is pretty neat. Especially it has Bibtex format right on the screen.</atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2007/05/citeulike.html</link><author>noreply@blogger.com (Tian)</author><thr:total>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-5744013625152240089</guid><pubDate>Mon, 14 May 2007 15:58:00 +0000</pubDate><atom:updated>2007-05-22T18:46:20.657-04:00</atom:updated><title>Mother's day spending</title><atom:summary type='text'>Men are more likely to spend money on Mother's day than women (in both percentage and amount). This is a fact but this is also a nice example of "confounding" for intro stats. I heard about this in the news (people were joking about this as a "mom's boy" phenomenon) but can't find that exact news article today. I found the data on Mother's day 2005 (a pretty neat collection of numbers). Here is </atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2007/05/mothers-day-spending.html</link><author>noreply@blogger.com (Tian)</author></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-5056810631624210383</guid><pubDate>Wed, 09 May 2007 12:03:00 +0000</pubDate><atom:updated>2007-05-09T08:16:34.186-04:00</atom:updated><title>Divided opinion?</title><atom:summary type='text'>Anderson Copper 360 is a show on CNN that gives in-depth coverage of current events. In one of its recent airing, Anderson Copper discussed a question occurred during the recent Republican presidential debate: "do you believe in evolution?" One of the candidates added after the poll that he believed in evolution and also believed in God when watching things such as the sunset at Grand Canyon. </atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2007/05/divided-opinion.html</link><author>noreply@blogger.com (Tian)</author><thr:total>0</thr:total></item><item><guid isPermaLink='false'>tag:blogger.com,1999:blog-10065122.post-6154821343372357874</guid><pubDate>Mon, 30 Apr 2007 23:12:00 +0000</pubDate><atom:updated>2007-04-30T19:33:23.401-04:00</atom:updated><title>Ten Simple Rules for Making Good Oral Presentations</title><atom:summary type='text'>PLoS Computational Biology's "Ten Simple Rules" series are pretty nice guidelines, including:Bourne PE (2005) Ten simple rules for getting published. PLoS Comp Biol 1: e57. Bourne PE, Chalupa LM (2006) Ten simple rules for getting grants. PLoS Comp Biol 2: e12. Bourne PE, Korngreen A (2006) Ten simple rules for reviewers. PLoS Comp Biol 2: e110. Bourne PE, Friedberg I (2006) Ten simple rules for </atom:summary><link>http://www.stat.columbia.edu/~tzheng/tianblog/2007/04/ten-simple-rules-for-making-good-oral.html</link><author>noreply@blogger.com (Tian)</author><thr:total>0</thr:total></item></channel></rss>