STA 6166 Unit 2 Section 2 Answers

This problem will be worked in Minitab. The associated Minitab worksheet can be downloaded from here.

Do problem 6.37 on page 319. (This problem relates to comparing alloys for the manufacture of steel beams).

a. What are the populations of interest? In this case, the populations consist of load capacities (in tons) of beams that would be constructed using the old and new alloys respectively. Note that the populations are defined in terms of the responses (load capacities) and characteristics (alloy type) that we are interested in.

b. Place 99% confidence intervals on the mean load capacity of beams produced with both the new and old alloy. Basic statistics are computed in Minitab to produce the following table.


Descriptive Statistics: Load by Beam


Variable   Beam              N       Mean     Median     TrMean      StDev
Load       new              10     28.850     28.800     29.075      2.510
           old              10     23.370     23.200     23.412      1.218

Variable   Beam        SE Mean    Minimum    Maximum         Q1         Q3
Load       new           0.794     23.600     32.300     27.650     31.125
           old           0.385     21.100     25.300     22.750     24.250

With the mean estimates and the standard errors of the means, we need only the appropriate t-critical values to construct the confidence intervals using the equation on page 234. The critical value is t(0.005,9)= 3.25 from Table 2 in the book. The confidence intervals are then computed as:

New Beam (28.85±(3.25)(0.794)) = (26.27,31.43)

Old Beam (23.370±(3.25)(0.385)) = (22.12,24.62)

We could also have obtained these confidence intervals by using the "unstacked" data columns in the worksheet and the one-sample t-tests. On the one-sample t-test dialog there is the option to set the confidence interval to 99%. Remember to set the test to "two-sided". The result is the following (in part):

Variable             99.0% CI            T      P
Load_new      (  26.270,  31.430)    36.35  0.000
Load_old      (  22.117,  24.623)    60.65  0.000

c. Is the mean load capacity of the new alloy significantly greater than the mean load capacity of the currently used alloy? Use a = 0.01. Report the p-value of your test. From Minitab we have for the separate variance two sample t-test the following:


Two-Sample T-Test and CI: Load, Beam

Beam         N      Mean     StDev   SE Mean
new         10     28.85      2.51      0.79
old         10     23.37      1.22      0.39

Difference = mu (new) - mu (old)
Estimate for difference:  5.480
95% lower bound for difference: 3.917
T-Test of difference = 0 (vs >): T-Value = 6.21  P-Value = 0.000  DF = 13

From this we would reject the null hypothesis that the two alloys produce beams with equal load capacities in favor of the one-sided alternative that the new alloy is greater than the old alloy. The p-value is < 0.001. We can redo this test assuming a common variance and using a pooled standard deviation estimate to get the following:

Two-sample T for Load

Difference = mu (new) - mu (old)
Estimate for difference:  5.480
95% lower bound for difference: 3.950
T-Test of difference = 0 (vs >): T-Value = 6.21  P-Value = 0.000  DF = 18
Both use Pooled StDev = 1.97

Note that the conclusions of the test do not change, nor is the p-value any different, but the DF for the test is now 18 versus the 13 of the previous problem. The pooled standard deviation estimate is 1.97.

d. Do the condition required for the statistical techniques used in (b) and (c) appear to be satisfied? By this we wonder if the data look sufficiently normal (symmentric, uni-modal). We could look at histograms of the data, but there is very little data here to do this. Box plots don't indicate much to worry about although the distribution do not look exactly symmetric (the median is not in the middle of the inter quartile box..

We could also look at a quantile-quantile plot to determine if the data look sufficiently normal.

In this plot, if the data are truly normally distributed, they will fall mostly on the straight line. In both cases the data points fall pretty close to the straight line and hence we would conclude that the data are sufficiently normal and that the conditions required for the t-test are met.

e. The beams produced from the new alloy are more expensive thatn the beams produced from the currently used alloy. Thus, the new alloy will be used only if the mean load capacity is at least 5 tons greater than the mean load capacity of the currently used alloy. Based on this information, would you recommend that the company use the new alloy?

To check this, we need to test whether the difference between the two are sufficiently greater than 5 tons. In Minitab, we re-perform the two sample t-test only now we set the significant difference at 5. The results are as follows:

Difference = mu (new) - mu (old)
Estimate for difference:  5.480
95% lower bound for difference: 3.950
T-Test of difference = 5 (vs >): T-Value = 0.54  P-Value = 0.297  DF = 18
Both use Pooled StDev = 1.97

Note that now the null hypothesis of the test is that the difference is 5 versus the alternative that it is greater than 5. The associated p-value is 0.297, suggesting that we cannot reject the null hypothesis that the difference is actually 5 (or less). Since we do not reject the null hypothesis we cannot conclude that the difference in mean load capacity is greater than 5 tons, we would stick with the old alloy for this task.

Note that you may be a little confused with this result. Remember what we are really trying to do is show that the new alloy is 5 tons or greater load capacity than the old alloy. Our test says it is 5 OR LESS but NOT more than 5. Hence we cannot go with the new alloy since there is a high probability the load capacity is actually less than 5 tons and a very low probability that it is greater then 5.

The above computations were run using the full Minitab (on my portable). The t-test output is from the STAT > BASIC STATISTICS > 2 SAMPLE T. There is an OPTIONS button on the dialog that allows you put in the difference. When I attempted to do this in the Student Edition of Minitab, there is no OPTION button. I didn't notice this before. So, what to do. Well, we know that subtracting a number from any random variable changes the mean, but does not change the variance. So in the 'not Stacked' data, create a new "New_New" column which is the old "NEW" data with 5 subtracted from it. Now do the two sample t-test of the "New-New" vesus the "OLD" data to get a t-test that these two groups have the same means. If the original null hypothesis was that the mean of the NEW beams was 5 tons greater than that of the OLD beams, then the mean of the NEW-5 group should be the same as the mean of the OLD group. You should get the correct result.
For the stacked data, create a new column (call it DIFF) with 5 in the rows for all NEW beam cases and 0 for all OLD beam cases. Now create a second "New_Load" column with the old response column minus DIFF. Now do the two-sample t-test (stacked) using this "New_Load" as the response.

2. Using the data from problem 6.37, perform the Wilcoxon rank sum test. Do the t-test and Wilcoxon test give different results?

We will use Minitab. Note first that the Wilcoxon Rank Sum test in Minitab is the same as the Mann-Whitney test (whose name is actually the Mann-Whitney-Wilcoxon Rank Sum test). Next we also note that the Mann-Whitney test does not use "stacked data" but used "unstacked data". Originally I entered the data as stacked data (as I would like you to get into the habit of doing), so I needed to use the Manip>Unstack Columns menu option to get the data into a stacked configuration. The worksheet I have provided to you has already done this. The results of running the Mann-Whitney test is given as:

Mann-Whitney Test and CI: Load_new, Load_old

Load_new   N =  10     Median =      28.800
Load_old   N =  10     Median =      23.200
Point estimate for ETA1-ETA2 is       5.700
95.5 Percent CI for ETA1-ETA2 is (3.801,7.400)
W = 151.0
Test of ETA1 = ETA2  vs  ETA1 > ETA2 is significant at 0.0003
The test is significant at 0.0003 (adjusted for ties)

When this says the test is significant at the 0.0003 level we interpret this to mean the associated p-value is 0.0003. With such a low p-value we would reject the null hypothesis and conclude that the load capacity of the new alloy is greater than that of the old alloy. This result coincides with the t-test results.

3. What is the 95% confidence interval for the difference between the two means?

Minitab will compute the 95% CI for the difference between the two means if you are a little clever and/or observant. Use the two independent sample t-test and test the two sided alternative, Minitab will provide you with the 95% CI (equation 1 page 317) as given below.

Difference = mu (new) - mu (old)
Estimate for difference:  5.480
95% CI for difference: (3.626, 7.334)

Doing the same thing with the Mann-Whitney-Wilcoxon two independent sample rank sum test we get a 95% confidence interval for the difference in the medians as:

Point estimate for ETA1-ETA2 is       5.700
95.5 Percent CI for ETA1-ETA2 is (3.801,7.400)

4. How many beams would be needed for each group if we wanted to estimate the difference between the average loading of the two groups to be within plus or minus 0.5 tons with 95% confidence (Hint see page 314).Use the pooled variance estimate from the t-test as if it were the true variance.

Minitab doesn't do this computation directly so we will have to do this on our own. The pooled standard deviation estimate is given as: s² =1.97. The difference of interest is E=0.5 and the P(type I error) =a = 0.05. Using equation 9 page 318 we have:

Thus we would need to test 120 beams of each type to be this precise in our estimate of the difference between the two beams.

Unit 2 Section 2 Answers

Engineering