Sampling and Estimation from Finite Populations. Yves Tille. Читать онлайн. Newlib. NEWLIB.NET

Информация о произведении:

Автор:	Yves Tille
Издательство:	John Wiley & Sons Limited
Серия:
Жанр произведения:	Математика
Год издания:	0
isbn:	9781119071273

Скачать книгу

where images denotes the empty set.

Definition 2.1

A sampling design without replacement images is a probability distribution on images such that

Definition 2.2

A random sample images is a random variable whose values are the samples:

A random sample can also be defined as a discrete random vector composed of non‐negative integer variables images . The variable images represents the number of times unit images is selected in the sample. If the sample is without replacement then variable images can only take the values 0 or 1 and therefore has a Bernoulli distribution. In general, random variables images are not independent except in very special cases. The use of indicator variables images was introduced by Cornfield (1944) and greatly simplified the notation in survey sampling theory because it allows us to clearly separate the values of the variables images or images from the source of randomness images .

Often, we try to select the sample as randomly as possible. The usual measure of randomness of a probability distribution is the entropy.

Definition 2.3

The entropy of a sampling design is the quantity

We suppose that images

We can search for sampling designs that maximize the entropy, with constraints such as a fixed sample size or given inclusion probabilities (see Section 2.3). A very random sampling design has better asymptotic properties and allows a more reliable inference (Berger, 1996, 1998a; Brewer & Donadio, 2003).

The sample size images is the number of units selected in the sample. We can write

When the sample size is not random, we say that the sample is of fixed sample size and we simply denote it by images .

The variables are observed only on the units selected in the sample. A statistic images is a function of the values images that are observed on the random sample: images . This statistic takes the value images on the sample images . The expectation under the design is defined from the sampling design:

The variance operator is defined using the expectation operator:

2.3 Inclusion Probabilities

The inclusion probability images is the probability that unit images is selected in the sample. This probability is, in theory, derived from the sampling design:

for all images . In sampling designs without replacement, the random variables images have Bernoulli distributions with parameter images There is no particular reason to select units with equal probabilities. However, it will be seen below that it is important that all inclusion probabilities be nonzero.

The second‐order inclusion probability (or joint inclusion probability) images is the probability that units images and images are selected together in the sample:

for all images In sampling designs without replacement, when images , the second‐order inclusion probability is reduced to the first‐order inclusion probability, in other words images for all images

The variance of the indicator variable images is denoted by

which is the variance of a Bernoulli variable. The covariances between indicators are

One can also use a matrix notation. Let

be a column vector. The

Скачать книгу