Statistical Analysis of Geographical Data (eBook)
John Wiley & Sons (Verlag)
978-1-118-52514-2 (ISBN)
Statistics Analysis of Geographical Data: An Introduction provides a comprehensive and accessible introduction to the theory and practice of statistical analysis in geography. It covers a wide range of topics including graphical and numerical description of datasets, probability, calculation of confidence intervals, hypothesis testing, collection and analysis of data using analysis of variance and linear regression. Taking a clear and logical approach, this book examines real problems with real data from the geographical literature in order to illustrate the important role that statistics play in geographical investigations. Presented in a clear and accessible manner the book includes recent, relevant examples, designed to enhance the reader's understanding.
Simon J. Dadson is Associate Professor of Physical Geography at Oxford University and Tutor in Geography at Christ Church.
Statistics Analysis of Geographical Data: An Introduction provides a comprehensive and accessible introduction to the theory and practice of statistical analysis in geography. It covers a wide range of topics including graphical and numerical description of datasets, probability, calculation of confidence intervals, hypothesis testing, collection and analysis of data using analysis of variance and linear regression. Taking a clear and logical approach, this book examines real problems with real data from the geographical literature in order to illustrate the important role that statistics play in geographical investigations. Presented in a clear and accessible manner the book includes recent, relevant examples, designed to enhance the reader s understanding.
Simon J. Dadson is Associate Professor of Physical Geography at Oxford University and Tutor in Geography at Christ Church.
1
Dealing with data
STUDY OBJECTIVES
- Understand the nature and purpose of statistical analysis in geography.
- View statistical analysis as a means of thinking critically with quantitative information.
- Distinguish between the different types of geographical data and their uses and limitations.
- Understand the nature of measurement error and the need to account for error when making quantitative statements.
- Distinguish between accuracy and precision and to understand how to report the precision of geographical measurements.
- Appreciate the methodological limitations of statistical data analysis.
1.1 The role of statistics in geography
1.1.1 Why do geographers need to use statistics?
Statistical analysis involves the collection, analysis and presentation of numerical information. It involves establishing the degree to which numerical summaries about observations can be justified, and provides the basis for forming judgements from empirical data.
Take the following media headlines, for example:
We know in the next 20 years the world population will increase to something like 8.3 billion people.
Sir John Beddington, UK Government Chief Scientist1
2010 hits global temperature high.
BBC News, 20th January 20112
Each of these statements invites critical scrutiny. The reliability of their sources encourages us to take them seriously, but how do we know that they are correct? It is hard enough to try to predict what one human being will do in any particular year, let alone what several billion are going to do in the next 20 years. How were these predictions made? How was the rate of change of world population calculated? What were the assumptions? What does the author mean by ‘something like’? The number 8.3 billion is quite a precise number: why didn’t the author just say 8 billion or almost 10 billion?
Similarly, how do we know that 2010 is the global temperature high, when temperature is only measured at a small number of measuring stations? How would we go on to investigate whether anthropogenic warming caused the record‐breaking temperature in 2010 or whether it was just a fluke?
Statistical analysis provides some of the tools that can answer some of these questions. This book introduces a set of techniques that allow you to make sure that the statistical statements that you make in your own work are based on a sound interpretation of the data that you collect.
There are four main reasons to use statistical techniques:
- to describe and measure the things that you observe;
- to characterize measurement error in your observations;
- to test hypotheses and theories;
- to predict and explain the relationships between variables.
1.2 About this book
One of the best ways to learn any mathematical skill is through repeated practice, so the approach taken in this book uses many examples. The presentation of each topic begins with an introduction to the theoretical principles: this is then followed by a worked example. Additional exercises are given to allow the reader to develop their understanding of the topics involved.
The use of computer packages is now common in statistical analysis in geography: it removes many of the tedious aspects of statistical calculation leaving the analyst to focus on experimental design, data collection, and interpretation. Nevertheless, it is essential to understand how the properties of the underlying data affect the value of the resulting statistics or the outcome of the test under evaluation.
Two kinds of computer software are referred to in this book. The more basic calculations can be performed using a spreadsheet such as Microsoft Excel. The advantages of Excel are that its user interface is well‐known and it is almost universally available in university departments and on student computers. For more advanced analysis, and in situations where the user wishes to process large quantities of data automatically, more specialized statistical software is better. This book also refers to the open‐source statistical package called ‘R’ which is freely available from http://www.r‐project.org/. In addition to offering a comprehensive collection of well‐documented statistical routines, the R software provides a scripting facility for automation of complex data analysis tasks and can produce publication‐quality graphics.
1.3 Data and measurement error
1.3.1 Types of geographical data: nominal, ordinal, interval, and ratio
Four main types of data are of interest to geographers: nominal, ordinal, interval, and ratio. Nominal data are recorded using categories. For example, if you were to interview a group of people and record their gender, the resulting data would be on a nominal, or categorical, scale. Similarly, if an ecologist were to categorize the plant species found in an area by counting the number of individual plants observed in different categories, the resulting dataset would be categorical, or nominal. The distinguishing property of nominal data is that the categories are simply names – they cannot be ranked relative to each other.
Observations recorded on an ordinal scale can be put into an order relative to one another. For example, a study in which countries are ranked by their popularity as tourist destinations would result in an ordinal dataset. A requirement here is that it is possible to identify whether one observation is larger or smaller than another, based on some measure defined by the analyst.
In contrast with nominal and ordinal scale data, interval scale data are measured on a continuous scale where the differences between different measurements are meaningful. A good example is air temperature, which can be measured to a degree of precision dictated by the quality of the thermometer being used, among other factors. Whilst it is possible to add and subtract interval scale data, they cannot be multiplied or divided. For example, it is correct to say that 30 degrees is 10 degrees hotter than 20 degrees, but it is not correct to say that 200 degrees is twice as hot as 100 degrees. This is because the Celsius temperature scale, like the Fahrenheit scale, has an arbitrarily defined origin.
Ratio scale data are similar to interval scale data but a true zero point is required, and multiplication and division are valid operations when dealing with ratio scale data. Mass is a good example: an adult with a mass of 70 kg is twice as heavy as a child with a mass of 35 kg. Temperature measured on the Kelvin scale, which has an absolute zero point, is also defined as a ratio scale measurement.
It is important from the outset of any investigation to be aware of the different types of geographical data that can be recorded, because some statistical techniques can only be applied to certain types of data. Whilst it is usually possible to convert interval data into ordinal or nominal data (e.g. rainfall values can be ranked or put into categories), it is not possible to make the conversion the other way around.
1.3.2 Spatial data types
Geographers collect data about many different subjects. Some geographical datasets have distinctly spatial components to them. In other words, they contain information about the location of a particular entity, or information about how a particular quantity varies across a region of interest. In many contexts, it is advantageous to collect information on the locations of objects in space, or to record details of the spatial relationships between entities. The two main types of spatial data that can be used are vector data and raster (or gridded) data. Vector data consist of information that is stored as a set of points that are connected to known locations in space (e.g. to represent towns, sampling locations, or places of interest). The points may be connected to form lines (e.g. to represent linear features such as roads, rivers and railways), and the lines may be connected to form polygons (e.g. to represent areas of different land cover, geological units, or administrative units).
The locations of points must be given with reference to a coordinate system which may be rectangular (i.e. given using eastings and northings in linear units such as metres), or spherical (i.e. given using latitudes and longitudes in angular units such as degrees), but which always requires the definition of unit vectors and a fixed point of origin. The most common spherical coordinate system is that of latitude and longitude, which measures points by their angular distance from an origin which is located at the equator (zero latitude) and the Greenwich meridian (zero longitude). Thus the latitude of Buckingham Palace in London, UK, is 0.14°W, 51.50°N indicating that it is 0.14 degrees west of Greenwich and 51.5 degrees north of the equator.
Whilst spherical coordinate systems are commonly used in aviation and marine navigation, and with the arrival of GPS, terrestrial navigation usually uses rectangular coordinate systems. In order to use rectangular coordinates, the spherical form of the Earth must be represented on a flat surface. This is achieved using a map projection. An example of a map projection that is used to obtain a rectangular coordinate system is the Great Britain National Grid, in which locations are defined in metres east and north of a fixed origin that is located to the south west of the Scilly Isles. Thus to give a grid...
| Erscheint lt. Verlag | 8.3.2017 |
|---|---|
| Sprache | englisch |
| Themenwelt | Mathematik / Informatik ► Mathematik ► Statistik |
| Naturwissenschaften ► Geowissenschaften ► Allgemeines / Lexika | |
| Naturwissenschaften ► Geowissenschaften ► Geografie / Kartografie | |
| Naturwissenschaften ► Geowissenschaften ► Geologie | |
| Technik | |
| Schlagworte | Analysis of Variance • calculation of confidence intervals • collection and analysis of data • confidence intervals • earth sciences • geographical investigations statistics • geographical literature • Geographie • Geography • Geowissenschaften • graphical description of datasets • hypothesis testing • linear regression • numerical description of datasets • physical geography • Physiogeographie • Probability • Simon James Dadson • statistical analysis in geography • Statistical Analysis of Geographical Data: An Introduction • Statistics • Statistics Analysis of Geographical Data • Statistik • Statistische Analyse • uncertainty analysis |
| ISBN-10 | 1-118-52514-0 / 1118525140 |
| ISBN-13 | 978-1-118-52514-2 / 9781118525142 |
| Informationen gemäß Produktsicherheitsverordnung (GPSR) | |
| Haben Sie eine Frage zum Produkt? |
Kopierschutz: Adobe-DRM
Adobe-DRM ist ein Kopierschutz, der das eBook vor Mißbrauch schützen soll. Dabei wird das eBook bereits beim Download auf Ihre persönliche Adobe-ID autorisiert. Lesen können Sie das eBook dann nur auf den Geräten, welche ebenfalls auf Ihre Adobe-ID registriert sind.
Details zum Adobe-DRM
Dateiformat: EPUB (Electronic Publication)
EPUB ist ein offener Standard für eBooks und eignet sich besonders zur Darstellung von Belletristik und Sachbüchern. Der Fließtext wird dynamisch an die Display- und Schriftgröße angepasst. Auch für mobile Lesegeräte ist EPUB daher gut geeignet.
Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen eine
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen eine
Geräteliste und zusätzliche Hinweise
Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.
aus dem Bereich