Small samples of data are best presented in the form of a table 2.1. For example, Table 2.1 below presents the age, height, and weight of a sample of some colleagues at the Department of Meteorology in the University of Reading. There are n=11 objects (people) in the sample with p=3 observed variables age, height, and weight. Note that for good clarity and typesetting, published tables should not include ANY vertical lines or shading even if certain word processors allow such features.
|
The table of numbers can be considered to be a rectangular
data matrix X having n=11 rows and p=3 columns.
The data matrix X has dimension
and elements Xij where the first subscript
is the object index and the second subscript
is the variable
index. Note: it is conventional to denote variables by columns and sample
objects by rows.
The rest of this lecture will focus on the special case of descriptive
methods for univariate data
having only one variable (p=1).
Many descriptive methods have also been developed for exploring
multivariate data having more than one (p>1) variables
and some of these will be covered in later lectures.