Chapter 2 Homework Answers

2.7 Arranging the 30 observations in order:

34.04 30.57 26.81 26.33 26.32 23.66 18.00 17.45 17.00 16.69
16.54 15.37 14.97 14.41 12.91 12.31 11.01 10.92 10.70 9.98
9.26 9.11 8.92 8.90 7.98 7.58 6.54 6.49 5.83 4.56

Here are the observed relative frequencies and the histogram heights (the relative frequencies divided by the interval widths). Because the interval widths are equally wide, the histogram looks the same as a bar chart, but has different units on the vertical axis.

Interval   Number   Relative Frequency   Histogram Height
0-9.99   11   11/30 = 0.367   0.367/10 = 0.0367
10-19.99   13   13/30 = 0.433   0.433/10 = 0.0433
20-29.99   4   4/30 = 0.133   0.133/10 = 0.0133
30-39.99   2   2/30 = 0.067   0.067/10 = 0.0067
Total   30   1.000

Here is a histogram:

The center of the data is in the interval 10-20 inches, with a range of 0 to 40 inches. The data are skewed to the right (positively skewed).

2.14 Here is the graph with the vertical axis going from 0 to 600:

Here is graph with the vertical axis going from 579 to 584:

The graph with the vertical axis going down to zero puts the decline into perspective (less than a 1 percent drop), but is otherwise useless: we can hardly tell that there has been a decline and certainly can't see the fluctuations in the data (including the rise between 1910 and 1930). In the graph with zero omitted, we can clearly see the decline and the fluctuations, but we should not let the steepness of the figure mislead us about the magnitude of the decline. We have to look at the actual data to see that over this 100-year period, the height of Lake Michigan has declined by 3.3 feet, an average of about half an inch per year.

2.18 The exposure index is the explanatory variable and cancer mortality is the dependent variable since exposure to radioactive wastes may cause cancer, but not vice versa. A scatter diagram shows a positive relationship:

2.26 The extension of a line fit to these data for 1598 to 1940 indicates negative words per sentence in the year 2200! This incautious extrapolation is clearly too ridiculous to be taken seriously. (In Chapter 11, we will see how to fit a line to the data by ordinary least squares; the equation for this line is y = 264.81 - 0.125x, which implies y = 264.81 - 0.125(2200) = -10.19 in the year 2200.)

2.34 Here are the relative frequencies:

  Households in 1890 Households in 1990
Size Number Relative Frequency Number Relative Frequency
1 0.457 0.457/12.690 = 0.036 23.0 23.0/93.8 = 0.245
2 1.675 1.675/12.690 = 0.132 30.2 30.2/93.8 = 0.322
3 2.119 2.119/12.690 = 0.167 16.1 16.1/93.8 = 0.172
4 2.132 2.132/12.690 = 0.168 14.6 14.6/93.8 = 0.156
5 1.916 1.916/12.690 = 0.151 6.2 6.2/93.8 = 0.066
6 1.472 1.472/12.690 = 0.116 2.2 2.2/93.8 = 0.023
7-10 2.919 2.919/12.690 = 0.230 1.5 1.5/93.8 = 0.016
Total 12.690 1.0000 93.8 1.0000

The two bar charts show that in 1890 household sizes were fairly symmetrical about 4, but in 1990 household sizes were concentrated at 1 and 2. Households with 4 or more members were clearly the rule in 1890 and the exception in 1990.


back