MATH20812 - 2010/2011
- Title: Practical Statistics 1
- Unit code: MATH20812
- Credits: 10
- Prerequisites: Core courses taught at the School of Mathematics in the first 3 semesters
- Co-requisite units: MATH20802 Statistical Methods
- School responsible: Mathematics
- Members of staff responsible: Dr. G. N. Boshnakov
Specification
Aims
This course unit aims to introduce essential statistical concepts and techniques and to provide the students with experience in the use of the statistical system R.
Brief Description of the unit
In this course statistical methods and concepts are put in the context of their practical application with emphasis on model selection and diagnostics. Students do a series of small projects in class and as homework. Some projects are complete data analysis exercises centred around some statistical topic.
Intended Learning Outcomes
On completion of this unit successful students will:
- be familiar with essential ideas and techniques in statistics;
- be able to choose appropriate methods to tackle a variety of data analysis problems;
- have a working knowledge of the statistical system R;
- have gained experience in the writing of statistical reports;
- be able to set up simple simulation experiments.
Future topics requiring this course unit
The acquired knowledge and computing skills will be useful for a number of third and fourth year courses, most notably statistical ones.
Syllabus
- Exploratory data analysis (3 lectures)
- Data collection and presentation. [2]
- Organisation of data analysis in R: transformations, scripts and functions. [1]
- Correlation (3 lectures)
- Sample correlation coefficient: numerical properties and interpretation. [1]
- Estimation of population correlation and test for zero correlation. [1]
- Rank correlation. [1]
- Linear regression (4 lectures)
- Simple linear regression. [3]
- Transformations of predictor and response variables. [1]
- Writing projects (1 lecture)
- Goodness of fit tests (5 lectures)
- Testing for a single distribution (K-S test). [1]
- Testing for a family of distributions. [2]
- Analysis of residuals. [1]
- Chi-square test for discrete distributions. [1]
- Tests for bivariate data (3 lectures)
- Wilcoxon rank sum test. Binomial test. [1]
- Two-way contingency tables: chi-square test of independence, testing homogeneity, inference about the odds ratio. [2]
- Introduction to Monte Carlo methods (3 lectures)
- Estimating standard errors of estimators. [1]
- Evaluation of integrals by simple Monte Carlo. [1]
- Estimating distributions of estimators. [1]
Textbooks
This course unit is not based on a single book, some suggestions are given below. More details are available from the web page of the course.
- A reference text for probability and statistical concepts studied in the first 3 semesters is a necessity, for example: Miller, Irwin; Miller, Marylees(2004) John E. Freund's mathematical statistics with applications. 7th ed. Upper Saddle River, N.J. : Pearson Prentice Hall.
- Conover, W.J. (1999) Practical nonparametric statistics, 3rd edition (mainly Chapter 3 and Chapter 6).
- Dalgaard, Peter (2002) Introductory statistics with R, New York : Springer.
- Devore, Jay; Peck, Roxy Introductory statistics, 2nd edition, 1994 (Chapters 11 and 12 cover simple linear regression and correlation.). There are more recent books by these authors with slightly different names.
- Joaquim P. Marques de Sá (2007) Applied Statistics Using SPSS, STATISTICA, MATLAB and R. Springer Berlin Heidelberg New York.
Teaching and learning methods
Two lectures and one laboratory session each week. In addition students should expect to do at least four hours of private study each week on this course unit.
Assessment
-
100% coursework consisting of in-class tests and homework projects.
