Storks deliver babies!

3
concerned that, as electronic access to full text becomes freely available at members’ libraries, people may no longer join to receive a dis- counted journal subscription. In this case, other member bene¢ts become all the more im- portant. We o¡er several electronic services for society members, many via Websites that we have created and maintain for our society partners. These include password-protected online problem solving, data sets, book reviews online, preprint access and searchable cumulative indexes. By working proactively with societies, libraries, third party providers, consortia and subscription agents, we aim to o¡er cost-e¡ective solutions to suit our journals’ needs. For more information on Blackwell Publishers, and links to other statistics sites, please see our Website at http://www.blackwellpublishers.co.uk Storks Deliver Babies (p 0.008) KEYWORDS: Teaching; Correlation; Signi¢cance; p-values. Robert Matthews Aston University, Birmingham, England. e-mail: [email protected] Summary This article shows that a highly statistically signi¢cant correlation exists between stork populations and human birth rates across Europe. While storks may not deliver babies, unthinking interpretation of correlation and p-values can certainly deliver unreliable conclusions. ^ INTRODUCTION ^ I ntroductory statistics textbooks routinely warn of the dangers of confusing correlation with causation, pointing out that while a high corre- lation coe/cient is indicative of (linear) association, it cannot be taken as a measure of causation. Such warnings are typically accompanied by illustrative examples, such as the correlation between the reading skills of children and their shoe size, or the apparent relationship between educational level and unemployment (see e.g. Freedman et al. 1998). However, such examples are often either trivially explained via an obvious confounder (e.g. age, in the case of reading age and shoe size) or are not obviously cases of mere association (e.g. educational level may indeed be at least partly responsible for time spent unemployed). In what follows, I give an example based on genuine data of an association which is clearly ludicrous, but which cannot be so easily dismissed as non-causal via an obvious confounder. My starting point is the familiar folk tale that babies are delivered by storks. The origins of this connection are believed to lie partly in the association between storks and the concept of women as bringers of life, and also in the bird’s feeding habits, which were once regarded as a search for embryonic life in water (Cooper 1992). The legend lives on to this day, with neonate- bearing storks being a regular feature of greetings cards celebrating births. While it is (I trust) obvious that the legend is complete nonsense, it is legitimate to ask precisely how one might set about refuting it scienti¢cally. If one were approaching the question in the same way that many other links are investigated (e.g. suspected links between diet and cancer risk), one may well decide to carry out a correlational study, to see if the number of storks in a country bears a simple relationship to the number of human births in that country. Although the presence of a statistically signi¢cant degree of correlation cannot be taken to imply causation, its absence would certainly constitute evidence against a simple relationship. This possibility can quickly be investigated in the present case using standard hypothesis testing, with the null hypothesis being the absence of any correlation between the number of storks and the number of live births in a particular country. This I now proceed to do. 36 . Teaching Statistics. Volume 22, Number 2, Summer 2000

Transcript of Storks deliver babies!

Page 1: Storks deliver babies!

concerned that, as electronic access to full textbecomes freely available at members' libraries,people may no longer join to receive a dis-counted journal subscription. In this case, othermember bene¢ts become all the more im-portant. We o¡er several electronic services forsociety members, many via Websites that wehave created and maintain for our societypartners. These include password-protected onlineproblem solving, data sets, book reviews online,

preprint access and searchable cumulative indexes.

By working proactively with societies, libraries,third party providers, consortia and subscriptionagents, we aim to o¡er cost-e¡ective solutions tosuit our journals' needs.

For more information on Blackwell Publishers,and links to other statistics sites, please see ourWebsite at http://www.blackwellpublishers.co.uk

Storks Deliver Babies (p � 0.008)

KEYWORDS:Teaching;Correlation;Signi¢cance;p-values.

Robert MatthewsAston University, Birmingham, England.e-mail: [email protected]

SummaryThis article shows that a highly statisticallysigni¢cant correlation exists between storkpopulations and human birth rates across Europe.While storks may not deliver babies, unthinkinginterpretation of correlation and p-values cancertainly deliver unreliable conclusions.

^ INTRODUCTION ^

I ntroductory statistics textbooks routinely warnof the dangers of confusing correlation with

causation, pointing out that while a high corre-lation coe¤cient is indicative of (linear) association,it cannot be taken as a measure of causation. Suchwarnings are typically accompanied by illustrativeexamples, such as the correlation between thereading skills of children and their shoe size, or theapparent relationship between educational leveland unemployment (see e.g. Freedman et al. 1998).However, such examples are often either triviallyexplained via an obvious confounder (e.g. age, inthe case of reading age and shoe size) or are notobviously cases of mere association (e.g.educational level may indeed be at least partlyresponsible for time spent unemployed). In whatfollows, I give an example based on genuine dataof an association which is clearly ludicrous, butwhich cannot be so easily dismissed as non-causalvia an obvious confounder.

My starting point is the familiar folk tale thatbabies are delivered by storks. The origins of thisconnection are believed to lie partly in the

association between storks and the concept ofwomen as bringers of life, and also in the bird'sfeeding habits, which were once regarded as asearch for embryonic life in water (Cooper 1992).The legend lives on to this day, with neonate-bearing storks being a regular feature of greetingscards celebrating births.

While it is (I trust) obvious that the legend iscomplete nonsense, it is legitimate to ask preciselyhow one might set about refuting it scienti¢cally. Ifone were approaching the question in the sameway that many other links are investigated (e.g.suspected links between diet and cancer risk), onemay well decide to carry out a correlational study,to see if the number of storks in a country bears asimple relationship to the number of human birthsin that country. Although the presence of astatistically signi¢cant degree of correlation cannotbe taken to imply causation, its absence wouldcertainly constitute evidence against a simplerelationship. This possibility can quickly beinvestigated in the present case using standardhypothesis testing, with the null hypothesis beingthe absence of any correlation between the numberof storks and the number of live births in aparticular country. This I now proceed to do.

36 . Teaching Statistics. Volume 22, Number 2, Summer 2000

Page 2: Storks deliver babies!

^ TESTING THE STORK-BIRTH ^RELATIONSHIP

The white stork (Ciconia ciconia) is a surprisinglycommon bird in many parts of Europe, and dataon the number of breeding pairs are available for17 European countries (Harbard 1999, pers.comm.); the latest ¢gures, covering the period from1980 to 1990, are given in table 1, along withdemographic data taken from BritannicaYearbook for 1990.

Plotting the number of stork pairs against thenumber of births in each of the 17 countries, onecan discern signs of a possible correlation betweenthe two (see ¢gure 1).

The existence of this correlation is con¢rmed byperforming a linear regression of the annualnumber of births in each country (the ¢nal columnin table 1) against the number of breeding pairsof white storks (column 3). This leads to acorrelation coe¤cient of r � 0:62, whose statisticalsigni¢cance can be gauged using the standardt-test, where t � r � p��nÿ 2�=�1ÿ r2�� and n is thesample size. In our case, n � 17 so that t � 3:06,which for �nÿ 2� � 15 degrees of freedom leads toa p-value of 0.008.

^ ANALYSIS ^

What are we to make of this result, which points

to a highly statistically signi¢cant degree ofcorrelation between stork populations and birthrates? The correlation coe¤cient is not particularlyhigh, but according to its p-value, there is only a1 in 125 chance of obtaining at least as impressivea value assuming the null hypothesis of nocorrelation were true. Yet as with any p-value (andcontrary to what unwary users of them believe),

Country Area(km2)

Storks(pairs)

Humans(106)

Birth rate(103/yr)

Albania 28,750 100 3.2 83

Austria 83,860 300 7.6 87

Belgium 30,520 1 9.9 118

Bulgaria 111,000 5000 9.0 117

Denmark 43,100 9 5.1 59

France 544,000 140 56 774

Germany 357,000 3300 78 901

Greece 132,000 2500 10 106

Holland 41,900 4 15 188

Hungary 93,000 5000 11 124

Italy 301,280 5 57 551

Poland 312,680 30,000 38 610

Portugal 92,390 1500 10 120

Romania 237,500 5000 23 367

Spain 504,750 8000 39 439

Switzerland 41,290 150 6.7 82

Turkey 779,450 25,000 56 1576

Table 1. Geographic, human and stork data for 17European countries

Fig 1. How the number of human births varies with stork populations in 17 European countries.

Teaching Statistics. Volume 22, Number 2, Summer 2000 . 37

Page 3: Storks deliver babies!

this does not imply that the probability that mere£uke really is the correct explanation is just 1 in125; still less does it imply a 124=125 � 99:2%probability that storks really do deliver babies.

Such apparent nit-picking distinctions are fre-quently overlooked by consumers of p-values. Inthe case of the correlation between storks andhuman births, however, they no longer seem sopedantic: indeed, they provide the very welcome`escape route' by which to avoid a patentlyludicrous inference. The most plausible explan-ation of the observed correlation is, of course,the existence of a confounding variable: somefactor common to both birth rates and thenumber of breeding pairs of storks which ^ likeage in the reading skill/shoe-size correlation ^ canlead to a statistical correlation between twovariables which are not directly linked themselves.One candidate for a potential confounding variableis land area: readers are invited to investigate thispossibility using the data in table 1.

^ CONCLUSION ^

Standard statistical texts routinely warn of thefallacy of mistaking correlation for causation, butthe examples they provide are usually either trivial,

with obvious confounders, or lack clear non-causality. The empirical relationship between thenumber of stork breeding pairs and human birthrates in 17 European countries provides a non-trivialexample of a correlation which is highly statisticallysigni¢cant, not immediately explicable and yetcausally nonsensical. Indeed, its sheer absurdity haspedagogic value beyond the correlation/causationfallacy alone, as it compels greater attention to bepaid to the precise meaning of p-values, andpromotes greater recognition of the fact thatrejection of the null hypothesis does not imply thecorrectness of the substantive hypothesis.

Acknowledgements

The author is very grateful to Chris Harbard ofthe Royal Society for the Protection of Birds forsupplying the stork data, and to Professor DennisLindley for valuable discussions.

ReferencesCooper, J.C. (ed.) (1992). Brewer's Myth and

Legend. London: Cassell.Freedman, D., Pisani, R. and Purves, R.

(1998). Statistics (3rd edn). New York:W.W. Norton.

The Big Ticket: How Not to Design a Game Show

KEYWORDS:Teaching;Probability.

David BurghesUniversity of Exeter, England.e-mail: [email protected]

SummaryThis article analyses a television game show andsuggests improvements.

^ INTRODUCTION ^

Y ou may have seen this game show, whichwas broadcast for 8^9 weeks on Saturdays

on television in the United Kingdom in the spring/summer of 1998 as an extended National Lotteryshow. I only saw parts of some of the shows, whilewaiting for something else, or by mistake! I wasamazed that such an unappealing programme wasbroadcast at all.

At the end of each show there was ridiculousover-promotion of a game in which the ¢nalcontestants were certain to win substantial sums ofmoney. Just in case you missed the shows, Isummarize in ¢gure 1 the way this game worked.

The game seemed incredibly predictable as it wasimpossible not to win a substantial amount, with alow ceiling on the maximum that could be won,compared with the guaranteed minimum winningamount.

38 . Teaching Statistics. Volume 22, Number 2, Summer 2000