Statistics and Probability with Applications for Engineers and Scientists Using MINITAB, R and JMP. Bhisham C. Gupta. Читать онлайн. Newlib. NEWLIB.NET

Автор: Bhisham C. Gupta
Издательство: John Wiley & Sons Limited
Серия:
Жанр произведения: Математика
Год издания: 0
isbn: 9781119516620
Скачать книгу
by images and is defined as the ratio of the standard deviation to the mean expressed as a percentage:

      where images is the absolute value of the mean. The coefficient of variation is a relative comparison of a standard deviation to its mean and is unitless. The cv is commonly used to compare the variability in two populations. For example, we might want to compare the disparity of earnings for technicians who have the same employer but work in two different countries. In this case, we would compare the coefficient of variation of the two populations rather than compare the variances, which would be an invalid comparison. The population with a greater coefficient of variation, generally speaking, has more variability than the other. As an illustration, we consider the following example.

      Example 2.7.3 A company uses two measuring instruments, one to measure the diameters of ball bearings and the other to measure the length of rods it manufactures. The quality control department of the company wants to find which instrument measures with more precision. To achieve this goal, a quality control engineer takes several measurements of a ball bearing by using one instrument and finds the sample average images and the standard deviation images to be 3.84 and 0.02 mm, respectively. Then, he/she takes several measurements of a rod by using the other instrument and finds the sample average images and the standard deviation images to be 29.5 and 0.035 cm, respectively. Estimate the coefficient of variation from the two sets of measurements.

      Solution: Using formula (2.7.4), we have

equation equation

      Example 2.7.4 (Bus riders) The following data gives the number of persons who take a bus during the off‐peak time schedule (3–4 pm.) from Grand Central to Lower Manhattan in New York City. Using technology, find the numerical measures for these data:

17 12 12 14 15 16 16 16 16 17 17 18 18 18 19 19 20 20 20 20
20 20 20 20 21 21 21 22 22 23 23 23 24 24 25 26 26 28 28 28

      MINITAB

      1 Enter the data in column C1.

      2 From the Menu bar, select Stat Basic Statistics Display Descriptive Statistics:

      3 In the dialog box that appears, enter C1 under variables and select the option Statistics.

      4 A new dialog box Descriptive Statistics: Statistics appears. In this dialog box, select the desired statistics and click OK in the two dialog boxes. The values of all the desired statistics as shown below will appear in the Session window.

       USING R

      We can use a few built in functions in R to get basic summary statistics. Functions ‘mean()’, ‘sd()’, and ‘var()’ are used to calculate the sample mean, standard deviation, and variance, respectively. The coefficient of variation can be calculated manually using the mean and variance results. The ‘quantile()’ function is used to obtain three quantiles and the minimum and maximum. The function ‘range()’ as shown below can be used to calculate the range of the data. The task can be completed by running the following R code in the R Console window.

0% 25% 50% 75% 100%
12 17 20 23 28

      #To obtain the range we find Max‐Min range(x)[2]‐range(x)[1] images 16

Box and whisker plot with 4 diamond markers labeled A, B, E, and F representing the lower outer fence, lower inner fence, upper inner fence, and upper outer fence, respectively, and 2 stars markers labeled S and L.

      2.8.1 Construction of a Box Plot

      1 Step 1. Find the quartiles , , and for the given data set.

      2 Step 2. Draw a box with its outer lines of the box standing at the first quartile