Zum Hauptinhalt springen
Nicht aus der Schweiz? Besuchen Sie lehmanns.de

A Panorama of Statistics (eBook)

Perspectives, Puzzles and Paradoxes in Statistics
eBook Download: EPUB
2017
John Wiley & Sons (Verlag)
978-1-119-07584-4 (ISBN)

Lese- und Medienproben

A Panorama of Statistics - Eric Sowey, Peter Petocz
Systemvoraussetzungen
35,99 inkl. MwSt
(CHF 35,15)
Der eBook-Verkauf erfolgt durch die Lehmanns Media GmbH (Berlin) zum Preis in Euro inkl. MwSt.
  • Download sofort lieferbar
  • Zahlungsarten anzeigen

A Panorama of Statistics: Perspectives, Puzzles and Paradoxes in Statistics

 

Eric Sowey, School of Economics, The University of New South Wales, Sydney, Australia

Peter Petocz, Department of Statistics, Macquarie University, Sydney, Australia

This book is a stimulating panoramic tour - quite different from a textbook journey - of the world of statistics in both its theory and practice, for teachers, students and practitioners.
At each stop on the tour, the authors investigate unusual and quirky aspects of statistics, highlighting historical, biographical and philosophical dimensions of this field of knowledge. Each chapter opens with perspectives on its theme, often from several points of view. Five original and thought-provoking questions follow. These aim at widening readers' knowledge and deepening their insight. Scattered among the questions are entertaining puzzles to solve and tantalising paradoxes to explain. Readers can compare their own statistical discoveries with the authors' detailed answers to all the questions.

 

The writing is lively and inviting, the ideas are rewarding, and the material is extensively cross-referenced.

 

A Panorama of Statistics:

  • Leads readers to discover the fascinations of statistics.
  • Is an enjoyable companion to an undergraduate statistics textbook.
  • Is an enriching source of knowledge for statistics teachers and practitioners.
  • Is unique among statistics books today for its memorable content and engaging style.

 

Lending itself equally to reading through and to dipping into, A Panorama of Statistics will surprise teachers, students and practitioners by the variety of ways in which statistics can capture and hold their interest.


This book is a stimulating panoramic tour quite different from a textbook journey of the world of statistics in both its theory and practice, for teachers, students and practitioners.At each stop on the tour, the authors investigate unusual and quirky aspects of statistics, highlighting historical, biographical and philosophical dimensions of this field of knowledge. Each chapter opens with perspectives on its theme, often from several points of view. Five original and thought-provoking questions follow. These aim at widening readers knowledge and deepening their insight. Scattered among the questions are entertaining puzzles to solve and tantalising paradoxes to explain. Readers can compare their own statistical discoveries with the authors detailed answers to all the questions. The writing is lively and inviting, the ideas are rewarding, and the material is extensively cross-referenced. A Panorama of Statistics: Leads readers to discover the fascinations of statistics. Is an enjoyable companion to an undergraduate statistics textbook. Is an enriching source of knowledge for statistics teachers and practitioners. Is unique among statistics books today for its memorable content and engaging style. Lending itself equally to reading through and to dipping into, A Panorama of Statistics will surprise teachers, students and practitioners by the variety of ways in which statistics can capture and hold their interest.Reviews:"e;As befits the authors' statement that 'this is not a textbook', the structure is unusual. There are twenty-five chapters organised in five sections, each beginning with a brief perspective of a theme in statistics and finishing with five questions related to that theme. The answers provided to the questions, in section six, are as discursive and illuminating as the main body of the text. Even if you are pretty sure you know the answer, it is always worth checking what the authors have to say. Chances are that you will learn something every time. The glimpses and insights given into this enormous and far-reaching discipline succeed in being bewitching, entertaining and inviting; coverage was never the aim."e; "e;In summary, this splendid book lives up to the four 'p-values' of its title. It is panoramic in the scope of its survey of statistics, it is full of illuminating perspectives, it sets entertaining and challenging puzzles, and it explores fascinating paradoxes. Read it, enjoy it and learn from it."e;From Neil Sheldon, Teaching Statistics, volume 9, no. 2, May 2017

Eric Sowey, School of Economics, The University of NSW, Australia. Eric has 40 years' experience in teaching statistics and econometrics at all undergraduate and postgraduate levels and holds a UNSW Vice-Chancellor's Award for Excellence in Teaching. Peter Petocz, has been a statistics educator and researcher in statistics education for several decades. He is the author of numerous academic journal articles, book chapters and refereed conference presentations on various aspects of statistics pedagogy (over 40 during the last decade). Peter has been co-editor of the Statistics Education Research Journal (SERJ) since 2008, and has been involved with organising the two most recent International Conferences on Teaching Statistics.

Preface ix

Acknowledgments xi

Part I Introduction 1

1 Why is statistics such a fascinating subject? 3

2 How statistics differs from mathematics 13

3 Statistical literacy - essential in the 21st century! 20

4 Statistical inquiry on the web 26

Part II Statistical description 33

5 Trustworthy statistics are accurate, meaningful and relevant 35

6 Let's hear it for the standard deviation! 43

7 Index numbers - time travel for averages 50

8 The beguiling ways of bad statistics I 59

9 The beguiling ways of bad statistics II 66

Part III Preliminaries to inference 75

10 Puzzles and paradoxes in probability 77

11 Some paradoxes of randomness 84

12 Hidden risks for gamblers 91

13 Models in statistics 99

14 The normal distribution: history, computation and curiosities 107

Part IV Statistical inference 115

15 The pillars of applied statistics I - estimation 117

16 The pillars of applied statistics II - hypothesis testing 122

17 'Data snooping' and the significance level in multiple testing 129

18 Francis Galton and the birth of regression 135

19 Experimental design - piercing the veil of random variation 141

20 In praise of Bayes 149

Part V Some statistical byways 157

21 Quality in statistics 159

22 History of ideas: statistical personalities and the personalities of statisticians 165

23 Statistical eponymy 178

24 Statistical 'laws' 182

25 Statistical artefacts 192

Part VI Answers 197

26 Answers to the chapter questions 199

Index 306

"Emphatically, this is not a textbook, but it could be an immensely useful resource to teachers who wish to demonstrate the power and diversity of statistical reasoning, or set interesting tasks for their charges."
(John Haigh, Significance, Volume 14, Issue 4, August 2017)

1
Why is statistics such a fascinating subject?


In the real world, little is certain. Almost everything that happens is influenced, to a greater or lesser degree, by chance. As we shall explain in this chapter, statistics is our best guide for understanding the behaviour of chance events that are, in some way, measurable. No other field of knowledge is as vital for the purpose. This is quite a remarkable truth and, statisticians will agree, one source of the subject’s fascination.

You may know the saying: data are not information and information is not knowledge. This is a useful reminder! Even more useful is the insight that it is statistical methods that play the major role in turning data into information and information into knowledge.

In a world of heavily promoted commercial and political claims, a familiarity with statistical thinking can bring enormous personal and social benefits. It can help everyone to judge better what claims are trustworthy, and so become more competent and wiser as citizens, as consumers and as voters. In short, it can make ours not only a more numerate, but also a more accurately informed, society. This is an ideal we shall return to in CHAPTER 3.

Chance events are studied in the physical, biological and social sciences, in architecture and engineering, in medicine and law, in finance and marketing, and in history and politics. In all these fields and more, statistics has well‐established credentials. To use John Tukey’s charming expression, ‘being a statistician [means] you get to play in everyone’s backyard’. (There is more about this brilliant US statistician in CHAPTER 22, FIGURE 22.2.)

‐‐‐oOo‐‐‐

To gain a bird’s eye view of the kinds of practical conclusions this subject can deliver, put yourself now in a situation that is typical for an applied statistician.

Suppose you have collected some data over a continuous period of 150 weekdays on the daily number of employees absent from work in a large insurance company. These 150 numbers will, at first, seem to be just a jumble of figures. However, you – the statistician – are always looking for patterns in data, because patterns suggest the presence of some sort of systematic behaviour that may turn out to be interesting. So you ask yourself: can I find any evidence of persisting patterns in this mass of figures? You might pause to reflect on what sorts of meaningful patterns might be present, and how you could arrange the data to reveal each of them. It is clear that, even at this early stage of data analysis, there is lots of scope for creative thinking.

Exercising creativity is the antithesis of following formalised procedures. Unfortunately, there are still textbooks that present statistical analysis as no more than a set of formalised procedures. In practice, it is quite the contrary. Experience teaches the perceptive statistician that a sharpened curiosity, together with some preliminary ‘prodding’ of the data, can often lead to surprising and important discoveries. Tukey vigorously advocated this approach. He called it ‘exploratory data analysis’. Chatfield (2002) excellently conveys its flavour.

In this exploratory spirit, let’s say you decide to find out whether there is any pattern of absenteeism across the week. Suppose you notice at once that there seem generally to be more absentees on Mondays and Fridays than on the other days of the week. To confirm this impression, you average the absentee numbers for each of the days of the week over the 30 weeks of data. And, indeed, the averages are higher for Mondays and Fridays.

Then, to sharpen the picture further, you put the Monday and Friday averages into one group (Group A), and the Tuesday, Wednesday and Thursday averages into a second group (Group B), then combine the values in each group by averaging them. You find the Group A average is 104 (representing 9.5% of staff) and the Group B average is 85 (representing 7.8% of staff).

This summarisation of 30 weeks of company experience has demonstrated that staff absenteeism is, on average, 1.7 percentage points higher on Mondays and Fridays as compared with Tuesdays, Wednesdays and Thursdays. Quantifying this difference is a first step towards better understanding employee absenteeism in that company over the longer term – whether your primary interest is possible employee discontent, or the financial costs of absenteeism to management.

Creating different kinds of data summaries is termed statistical description. Numerical and graphical methods for summarising data are valuable, because they make data analysis more manageable and because they can reveal otherwise unnoticed patterns.

Even more valuable are the methods of statistics that enable statisticians to generalise to a wider setting whatever interesting behaviour they may have detected in the original data. The process of generalisation in the face of the uncertainties of the real world is called statistical inference. What makes a statistical generalisation so valuable is that it comes with an objective measure of the likelihood that it is correct.

Clearly, a generalisation will be useful in practice only if it has a high chance of being correct. However, it is equally clear that we can never be sure that a generalisation is correct, because uncertainty is so pervasive in the real world.

To return to the example we are pursuing, you may be concerned that the pattern of absenteeism detected in 30 weeks of data might continue indefinitely, to the detriment of the company. At the same time, you may be unsure that that pattern actually is a long‐term phenomenon. After all, it may have appeared in the collected data only by chance. You might, therefore, have good reason to widen your focus, from absenteeism in a particular 30‐week period to absenteeism in the long term.

You can test the hypothesis that the pattern you have detected in your data occurred by chance alone against the alternative hypothesis that it did not occur by chance alone. The alternative hypothesis suggests that the pattern is actually persistent – that is, that it is built into the long‐term behaviour of the company if there are no internal changes (by management) or external impacts (from business conditions generally). As just mentioned, the statistical technique for performing such a hypothesis test can also supply a measure of the likelihood that the test result is correct. For more on hypothesis testing, see CHAPTER 16.

When you do the test, suppose your finding is in favour of the alternative hypothesis. (Estimating the likelihood that this finding is correct requires information beyond our scope here, but there are ways of testing which optimise that likelihood.) Your finding suggests a long‐term persisting pattern in absenteeism. You then have grounds for recommending a suitable intervention to management.

Generalising to ‘a wider setting’ can also include to ‘a future setting’, as this example illustrates. In other words, statistical inference, appropriately applied, can offer a cautious way of forecasting the future – a dream that has fascinated humankind from time immemorial.

In short, statistical inference is a logical process that deals with ‘chancy’ data and generalises what those data reveal to wider settings. In those wider settings, it provides precise (as opposed to vague) conclusions which have a high chance of being correct.

‐‐‐oOo‐‐‐

But this seems paradoxical! What sort of logic is it that allows highly reliable conclusions to be drawn in the face of the world’s uncertainties? (Here, and in what follows, we say ‘highly reliable’ as a shorter way of saying ‘having a high chance of being correct’.)

To answer this pivotal question, we need first to offer you a short overview of the alternative systems of logic that philosophers have devised over the centuries. For an extended exposition, see Barker (2003).

A system of logic is a set of rules for reasoning from given assumptions towards reliable conclusions. There are just two systems of logic: deduction and induction. Each system contains two kinds of rules:

  1. rules for drawing precise conclusions in all contexts where that logic is applicable; and
  2. rules for objectively assessing how likely it is that such precise conclusions are actually correct.

The conclusions that each system yields are called deductive inferences and inductive inferences, respectively.

It’s worth a moment’s digression to mention that there are two other thought processes – analogy and intuition – which are sometimes used in an attempt to draw reliable conclusions. However, these are not systems of logic, because they lack rules, either of the second kind (analogy) or of both kinds (intuition). Thus, conclusions reached by analogy or by intuition are, in general, less reliable than those obtained by deduction or induction. You will find in QUESTIONS 9.4 and 9.5, respectively, examples of the failure of analogy and of intuition.

In what kind of problem setting is deduction applicable? And in what kind of setting is induction applicable? The distinguishing criterion is whether the setting is (or is assumed to be) one of complete certainty.

In a setting of complete certainty, deduction is applicable,...

Erscheint lt. Verlag 8.3.2017
Sprache englisch
Themenwelt Mathematik / Informatik Mathematik Statistik
Mathematik / Informatik Mathematik Wahrscheinlichkeit / Kombinatorik
Technik
Schlagworte biographical and philosophical dimensions • Historical • paradoxes • Populärwissenschaftliche Statistik • Popular Interest Statistics • Populärwissenschaftliche Statistik • Puzzles • Statistics • Statistics - Text & Reference • Statistik • Statistik / Lehr- u. Nachschlagewerke
ISBN-10 1-119-07584-X / 111907584X
ISBN-13 978-1-119-07584-4 / 9781119075844
Informationen gemäß Produktsicherheitsverordnung (GPSR)
Haben Sie eine Frage zum Produkt?
EPUBEPUB (Adobe DRM)

Kopierschutz: Adobe-DRM
Adobe-DRM ist ein Kopierschutz, der das eBook vor Mißbrauch schützen soll. Dabei wird das eBook bereits beim Download auf Ihre persönliche Adobe-ID autorisiert. Lesen können Sie das eBook dann nur auf den Geräten, welche ebenfalls auf Ihre Adobe-ID registriert sind.
Details zum Adobe-DRM

Dateiformat: EPUB (Electronic Publication)
EPUB ist ein offener Standard für eBooks und eignet sich besonders zur Darstellung von Belle­tristik und Sach­büchern. Der Fließ­text wird dynamisch an die Display- und Schrift­größe ange­passt. Auch für mobile Lese­geräte ist EPUB daher gut geeignet.

Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen eine Adobe-ID und die Software Adobe Digital Editions (kostenlos). Von der Benutzung der OverDrive Media Console raten wir Ihnen ab. Erfahrungsgemäß treten hier gehäuft Probleme mit dem Adobe DRM auf.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen eine Adobe-ID sowie eine kostenlose App.
Geräteliste und zusätzliche Hinweise

Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.

Mehr entdecken
aus dem Bereich