Next: Further reading
Up: Introduction
Previous: General philosophy
The development of computer technology since the 1950s
has led to the creation of many very useful statistical software
packages for analysing data.
Off-the-shelf statistical software now makes it
possible to perform analyses on a personal computer that would have been completely
impossible in the pre-computer era.
For this reason, computational statistics is now a large
and rapidly advancing branch of modern statistics.
Many diverse statistical software packages are currently
available that offer a wide variety of capabilities.
They can be broadly classified into three main categories:
- 1.
- Powerful language-based packages
For example, Splus and SAS, which are packages that allow the user
to develop their own statistical macros and functions in addition to
the comprehensive range of statistical routines available. These
powerful language-based packages are used by many practising statisticians.
They are not particularly user-friendly but once mastered can be extremely
powerful tools.
- 2.
- Interactive packages
For example, MINITAB and SPSS, which are packages that allow the user
to perform many standard statistical operations at the click of a mouse.
These are quick and easy to use and are useful for applying standard
methods but not ideally suited for developing new functions. A big danger with such packages
is that the user can easily perform operations that they do not understand.
This can create a ``black box'' view of statistical methods that
often leads to poor interpretations.
- 3.
- Packages with statistical libraries
For example, MATLAB and PV-Wave/IDL, which are primarily
data analysis and visualization
programs/languages that also include libraries including statistical functions.
These packages can be useful in climate analysis since they can cope with the large
gridded data sets quite easily and can also be used to
quickly visualise spatial data.
A problem with these packages is that the libraries often
contain only a subset
of standard statistical functions, and do not benefit from input from
professional statisticians. This is particularly the case with certain spreadsheet
packages such as EXCEL that contain rather idiosyncratic and poorly developed
statistical libraries.
Many different statistical software packages are reviewed at
http://www.stats.gla.ac.uk/cti/activities/reviews/alphabet.html.
While there are advantages in sometimes writing your own routines
using Fortran etc., statistical packages can allow a much wider range of
analysis options and have the benefit of being tested by other users.
The Do-It-Yourself (D.I.Y.) approach often favoured by climate
researchers often leads to ignorant analysis - to avoid this, try and seek
out more expert statistical advice !
Next: Further reading
Up: Introduction
Previous: General philosophy
David Stephenson
2000-09-02