"

9 Two-Sample Inference

 

Chapter 7 discussed methods of hypothesis testing about one-population pa- rameters. Chapter 8 discussed methods of estimating population parameters from one sample using confidence intervals. This chapter will look at methods of confidence intervals and hypothesis testing for two populations. Since there are two populations, there are two random variables, two means or proportions, and two samples (though with paired samples you usually consider there to be one sample with pairs collected). Examples of where you would do this are:

Testing and estimating the difference in testosterone levels of men before and after they had children (Gettler, McDade, Feranil & Kuzawa, 2011).

Testing the claim that a diet works by looking at the weight before and after subjects are on the diet.

Estimating the difference in proportion of those who approve of President Obama in the age group 18 to 26 year old and the 55 and over age group.

All of these are examples of hypothesis tests or confidence intervals for two populations. The methods to conduct these hypothesis tests and confidence intervals will be explored in this chapter. As a reminder, all hypothesis tests are the same process. The only thing that changes is the formula that you use and the assumptions. Confidence intervals are also the same process, except that the formula is different.

 

Two Proportions

There are times you want to test a claim about two population proportions or construct a confidence interval estimate of the difference between two population proportions. As with all other hypothesis tests and confidence intervals, the process is the same though the formulas and assumptions are different.

 

289

 

 

Hypothesis Test for Two Population Proportion (2-Prop Test)

  • State the random variables and the parameters in words.

= number of successes from group 1

= number of successes from group 2

= proportion of successes in group 1

= proportion of successes in group 2

  • State the null and alternative hypotheses and the level of significance

𝐻𝑜 ∶ 𝑝1 = 𝑝2

𝐻𝑎 ∶ 𝑝1 ≠ 𝑝2. the ≠ can be replaced with < or > depending on the question. Also, state your 𝛼 level here.

  • State and check the assumptions for a hypothesis test
  • A simple random sample of size 𝑛1 is taken from population 1, and a simple random sample of size 𝑛2 is taken from population 2.
  • The samples are independent.
  • The assumptions for the binomial distribution are satisfied for both pop- ulations.
  • image𝑝̂1𝑝̂2
    image𝑝1𝑝2
    image𝑝̂2
    image𝑝2 ∗ 𝑛2 ≥ 5𝑞2 ∗ 𝑛2 ≥ 5𝑞2 = 1 − 𝑝2
    image𝑝̂2
    image𝑝̂2
    image𝑝̂1
    image𝑝1 ∗ 𝑛1 ≥ 5𝑞1 ∗ 𝑛1 ≥ 5𝑞1 = 1 − 𝑝1
    image𝑝̂1
    image𝑝̂1
    The sampling distribution of can be approximated as a normal distri- bution. To determine the sampling distribution of , you need to show thatandwhere. If this requirement is true, then the sampling distribution of is well approximated by a normal curve. The sampling distribution of can be approximated as a normal distribution. To determine the sampling distribution of , you need to show thatandwhere. If this requirement is true, then the sampling distribution of is well approxi- mated by a normal curve. However, if you do not know and , you will need to use and instead. This is not perfect, but it is the best you can do.
  • Find the sample statistics, test statistic, and p-value On R Studio, use the command

prop.test(c(x1,x2), c(n1, n2))

 

  • Conclusion

This is where you write reject or fail to reject𝐻𝑜. The rule is: if the p-value

< 𝛼, then reject 𝐻𝑜. If the p-value ≥ 𝛼, then fail to reject 𝐻𝑜.

  • Interpretation

 

This is where you interpret in real world terms the conclusion to the test. The conclusion for a hypothesis test is that you either have enough evidence to support 𝐻𝑎, or you do not have enough evidence to support 𝐻𝑎.

Confidence Interval for the Difference Between Two Population Pro- portion (2-Prop Interval)

The confidence interval for the difference in proportions has the same random variables and proportions and the same assumptions as the hypothesis test for two proportions. If you have already completed the hypothesis test, then you do not need to state them again. If you haven’t completed the hypothesis test, then state the random variables and proportions and state and check the assumptions before completing the confidence interval step.

  • Find the sample statistics and the confidence interval

The confidence interval estimate of the difference is found using the following command in R Studio:

prop.test(c(x1,x2), c(n1, n2), conf.level=C as a decimal)

 

  • Statistical Interpretation: In general this looks like, “You are C% confident that the confidence interval contains the true difference in proportions.”
  • Real World Interpretation: This is where you state how much more (or less) the first proportion is from the second proportion.

 

Example: Hypothesis Test for Two Population Pro- portions

Do husbands cheat on their wives in a different proportion from the proportion of wives cheat on their husbands (”Statistics brain,” 2013)? Suppose you take a group of 1000 randomly selected husbands and find that 231 had cheated on their wives. Suppose in a group of 1200 randomly selected wives, 176 cheated on their husbands. Do the data show that the proportion of husbands who cheat on their wives is different from the proportion of wives who cheat on their husbands. Test at the 5% level.

Solution:

  • State the random variables and the parameters in words.

𝑥1 = number of husbands who cheat on his wife

𝑥2 = number of wives who cheat on her husband

𝑝1 = proportion of husbands who cheat on his wife

𝑝2 = proportion of wives who cheat on her husband

  • State the null and alternative hypotheses and the level of significance

 

𝐻𝑜 ∶ 𝑝1 = 𝑝2

𝐻𝑎 ∶ 𝑝1 ≠ 𝑝2

level of significance is 𝛼 = 0.05

  • State and check the assumptions for a hypothesis test
  • A simple random sample of 1000 responses about cheating from husbands is taken. This was stated in the problem. A simple random sample of 1200 responses about cheating from wives is taken. Check: This was stated in the problem.
  • The samples are independent. Check: The samples are independent. This is true since the samples involved different genders.
  • The properties of the binomial distribution are satisfied in both popula- tions. Check: This is true since there are only two responses, there are a fixed number of trials, the probability of a success is the same, and the trials are independent.
  • image𝑝̂1𝑝̂2
    image𝑛1 ∗ 𝑝1 𝑛2 ∗ 𝑝2 𝑛1 ∗ 𝑞1𝑛2 ∗ 𝑞2
    image𝑝̂1𝑝̂2
    The sampling distributions of and can be approximated with a nor- mal distribution. Check:,,, andare all greater than or equal to 5. So both sampling distributions of and can be approximated with a normal distribution.
  • Find the sample statistics, test statistic, and p-value On R use the command:

prop.test(c(231,176), c(1000, 1200))

 

##

## 2-sample test for equality of proportions with ## continuity correction

##

## data: c out of c231 out of 1000176 out of 1200 ## X-squared = 25.173, df = 1, p-value = 5.241e-07 ## alternative hypothesis: two.sided

## 95 percent confidence interval:

## 0.05050705 0.11815962

## sample estimates:

##prop 1prop 2

## 0.2310000 0.1466667

  • Conclusion

Reject 𝐻𝑜, since the p-value is less than 5%.

  • Interpretation

This is enough evidence to support that the proportion of husbands having affairs is different from the proportion of wives having affairs.

 

 

Example: Confidence Interval for Two Population Proportions

What is the difference in proportion that husbands cheat on their wives than wives cheat on the husbands (”Statistics brain,” 2013)? Suppose you take a group of 1000 randomly selected husbands and find that 231 had cheated on their wives. Suppose in a group of 1200 randomly selected wives, 176 cheated on their husbands. Estimate the difference in the proportion of husbands and wives who cheat on their spouses using a 95% confidence level.

Solution:

  • State the random variables and the parameters in words. These were stated in example #9.1.1.
  • State and check the assumptions for the confidence interval The assumptions were stated and checked in example #9.1.1.
  • Find the sample statistics and the confidence interval

On R use the command:

prop.test(c(231,176), c(1000, 1200), conf.level = .95)

 

##

## 2-sample test for equality of proportions with ## continuity correction

##

## data: c out of c231 out of 1000176 out of 1200 ## X-squared = 25.173, df = 1, p-value = 5.241e-07 ## alternative hypothesis: two.sided

## 95 percent confidence interval:

## 0.05050705 0.11815962

## sample estimates:

##prop 1prop 2

## 0.2310000 0.1466667

  • Statistical Interpretation: You are 95% confident that 0.05050705 < 𝑝1

𝑝2 < 0.11815962 contains the true difference in proportions.

  • Real World Interpretation: The proportion of husbands who cheat on their wives is anywhere from 5.05% to 11.82% higher than the proportion of wives who cheat on their husband.

 

Homework

In each problem show all steps of the hypothesis test or confidence interval. If some of the assumptions are not met, note that the results

 

 

of the test or interval may not be correct and then continue the process of the hypothesis test or confidence interval.

  • Many high school students take the AP tests in different subject areas. In 2007, of the 144,796 students who took the biology exam 84,199 of them were female. In that same year, of the 211,693 students who took the calculus AB exam 102,598 of them were female (”AP exam scores,” 2013). Is there enough evidence to show that the proportion of female students taking the biology exam is different than the proportion of female students taking the calculus AB exam? Test at the 5% level.
  • Many high school students take the AP tests in different subject areas. In 2007, of the 144,796 students who took the biology exam 84,199 of them were female. In that same year, of the 211,693 students who took the calculus AB exam 102,598 of them were female (”AP exam scores,” 2013). Estimate the difference in the proportion of female students taking the biology exam and female students taking the calculus AB exam using a 90% confidence level.
  • Many high school students take the AP tests in different subject areas. In 2007, of the 211,693 students who took the calculus AB exam 102,598 of them were female and 109,095 of them were male (”AP exam scores,” 2013). Is there enough evidence to show that the proportion of female students taking the calculus AB exam is different from the proportion of male students taking the calculus AB exam? Test at the 5% level.
  • Many high school students take the AP tests in different subject areas. In 2007, of the 211,693 students who took the calculus AB exam 102,598 of them were female and 109,095 of them were male (”AP exam scores,” 2013). Estimate using a 90% level the difference in proportion of female students taking the calculus AB exam versus male students taking the calculus AB exam.
  • Are there more children diagnosed with Autism Spectrum Disorder (ASD) in states that have larger urban areas over states that are mostly rural? In the state of Pennsylvania, a fairly urban state, there are 245 eight year old diagnosed with ASD out of 18,440 eight year old evaluated. In the state of Utah, a fairly rural state, there are 45 eight year old diagnosed with ASD out of 2,123 eight year old evaluated (”Autism and developmental,” 2008). Is there enough evidence to show that the proportion of children diagnosed with ASD in Pennsylvania is different than the proportion in Utah? Test at the 1% level.
  • Are there more children diagnosed with Autism Spectrum Disorder (ASD) in states that have larger urban areas over states that are mostly rural? In the state of Pennsylvania, a fairly urban state, there are 245 eight year old diagnosed with ASD out of 18,440 eight year old evaluated. In the state of Utah, a fairly rural state, there are 45 eight year old diagnosed with ASD out of 2,123 eight year old evaluated (”Autism and developmental,”

 

2008). Estimate the difference in proportion of children diagnosed with ASD between Pennsylvania and Utah. Use a 98% confidence level.

  • A child dying from an accidental poisoning is a terrible incident. Is it more likely that a male child will get into poison than a female child? To find this out, data was collected that showed that out of 1830 children between the ages one and four who pass away from poisoning, 1031 were males and 799 were females (Flanagan, Rooney & Griffiths, 2005). Do the data show that there is different proportion of male children dying of poisoning than female children? Test at the 1% level.
  • A child dying from an accidental poisoning is a terrible incident. Is it more likely that a male child will get into poison than a female child? To find this out, data was collected that showed that out of 1830 children between the ages one and four who pass away from poisoning, 1031 were males and 799 were females (Flanagan, Rooney & Griffiths, 2005). Compute a 99% confidence interval for the difference in proportions of poisoning deaths of male and female children ages one to four.

 

 

 

 

Paired Samples for Two Means

Are two populations the same? Is the average height of men taller than the average height of women? Is the mean weight less after a diet than before?

You can compare populations by comparing their means. You take a sample from each population and compare the statistics.

Anytime you compare two populations you need to know if the samples are independent or dependent. The formulas you use are different for different types of samples.

If how you choose one sample has no effect on the way you choose the other sample, the two samples are independent. The way to think about it is that in independent samples, the individuals from one sample are overall different from the units of observations from the other sample. This will mean that sample one has no affect on sample two. The sample values from one sample are not related or paired with values from the other sample.

If you choose the samples so that a measurement in one sample is paired with a measurement from the other sample, the samples are dependent or matched or paired. (Often a before and after situation.) You want to make sure the there is a meaning for pairing data values from one sample with a specific data value from the other sample. One way to think about it is that in dependent samples, the units of observations from one sample are the same units of observations from the other sample, though there can be other reasons to pair values. This

 

 

makes the sample values from each sample paired.

In tidy data, remember each row is a unit of measurement, and each column is a variable. In paired samples, you would have two variables that you are working with. In independent samples, you would have a variable that distinguishes a unit of observation from another unit of observation. As an example, in the Pulse data frame, consider the variables pulse_before and pulse_after. Since they are measured off the same unit of observation, then comparing the two variables would be a paired samples analysis. However, consider the pulse_after and whether a person smokes would be comparing the variable pulse_after against the variable smokes to see if smoking effects a person’s pulse rate after exercise. In this case, the units of observations would be different based on smoking yes or smoking no. Consider the variable smoking to be the factor that one is interested in seeing how it effects pulse rate.

Table #9.2.1: Data frame Pulse

imagePulse<-read.csv( “https://krkozak.github.io/MAT160/pulse.csv”)head(Pulse)

##height weight age gender smokes alcohol exercise ran

##

1

170

68

22

male

yes

yes

moderate

sat

##

2

182

75

26

male

yes

yes

moderate

sat

##

3

180

85

19

male

yes

yes

moderate

ran

##

4

182

85

20

male

yes

yes

low

sat

##

5

167

70

22

male

yes

yes

low

sat

##

6

178

86

21

male

yes

yes

low

sat

##

 

pulse_before

pulse_after

year

##

1

70

71

93

##

2

80

76

93

##

3

68

125

95

##

4

70

68

95

##

5

92

84

96

##

6

76

80

98

 

Example: Independent or Dependent Samples

Determine if the following are dependent or independent samples.

  • Randomly choose 5 men and 6 women and compare their heights

Solution:

Independent, since there is no reason that one value belongs to another. The units of observations are not the same for both samples. The units of observa- tions are definitely different. A way to think about this is that the knowledge that a man is chosen in one sample does not give any information about any of the woman chosen in the other sample.

 

 

  • Choose 10 men and weigh them. Give them a new diet drug and later weigh them again.

Solution:

Dependent, since each person’s before weight can be matched with their after weight. The units of observations are the same for both samples. A way to think about this is that the knowledge that a person weighs 400 pounds at the beginning will tell you something about their weight after the diet drug.

  • Take 10 people and measure the strength of their dominant arm and their non-dominant arm.

Solution: Dependent, since you can match the two arm strengths. The units of observations are the same for both samples. So the knowledge of one person’s dominant arm strength will tell you something about the strength of their non- dominant arm.

To analyze data when there are matched or paired samples, called dependent samples, you conduct a paired t-test. Since the samples are matched, you can find the difference between the values of the two random variables.

Hypothesis Test for Two Sample Paired t-Test

  • State the random variables and the parameters in words.

𝑥1 = random variable 1

𝑥2 = random variable 2

𝜇1 = mean of random variable 1

𝜇2 = mean of random variable 2

  • State the null and alternative hypotheses and the level of significance The hypotheses would be

𝐻𝑜 ∶ 𝜇1 = 𝜇2 or𝐻𝑜 ∶ 𝜇1 − 𝜇2 = 0

𝐻𝑎 ∶ 𝜇1 ≠ 𝜇2 or 𝐻𝑎 ∶ 𝜇1 − 𝜇2 ≠ 0

However, since you are finding the differences, then you can actually think of

𝜇1 − 𝜇2 = 𝜇𝑑.

So the hypotheses could become

𝐻𝑜 ∶ 𝜇𝑑 = 0

𝐻𝑎 ∶ 𝜇𝑑 ≠ 0 Remember, you can replace ≠ with < or >. Also, state your 𝛼 level here.

  • State and check the assumptions for the hypothesis test
  • A random sample of n pairs is taken.

 

  • The population of the difference between random variables is normally distributed. In this case the population you are interested in has to do with the differences that you find. It does not matter if each random variable is normally distributed. It is only important if the differences you find are normally distributed. Just as before, the t-test is fairly robust to the assumption if the sample size is large. This means that if this assumption isn’t met, but your sample size is quite large, then the results of the t-test are valid.
  • Find the sample statistic, test statistic, and p-value

Realize that a paired test is a one sample t-test on the difference between two variables. So you are running a one-sample t-test on a new variable known as the difference variable. You need to create this difference variable by creating a new data frame. This is done on R Studio by doing the following command (The fol- lowing shows how to create the variable difference for pulse_after-pulse_before on the data frame Pulse. Change the variables used and data frame used to your data frame and variables):

Table #9.2.2: Data frame Pulse with Difference

imagePulse<-read.csv( “https://krkozak.github.io/MAT160/pulse.csv”)Pulse_new<-Pulse%>% mutate(difference=pulse_after-pulse_before)head(Pulse_new)

## height weight age gender smokes alcohol exercise ran

##

1

170

68

22

male

yes

yes

moderate

sat

##

2

182

75

26

male

yes

yes

moderate

sat

##

3

180

85

19

male

yes

yes

moderate

ran

##

4

182

85

20

male

yes

yes

low

sat

##

5

167

70

22

male

yes

yes

low

sat

##

6

178

86

21

male

yes

yes

low

sat

##

 

pulse_before

pulse_after

year

difference

##

1

70

71

93

1

##

2

80

76

93

-4

##

3

68

125

95

57

##

4

70

68

95

-2

##

5

92

84

96

-8

##

6

76

80

98

4

Notice R Studio created a new data frame called Pulse_new and that data frame has the variable difference. Now to conduct a paired t-test use the R Studio command

t.test(~difference variable, data=New Data Frame)

Note: if the 𝐻𝑎 is <, then the command becomes

 

 

t.test(~difference variable, data=New Data Frame, alternative=”less”)

 

Similarly for > put alternative=“greater”

  • Conclusion

This is where you write reject 𝐻𝑜 or fail to reject 𝐻𝑜. The rule is: if the p-value

< 𝛼, then reject 𝐻𝑜. If the p-value ≥ 𝛼, then fail to reject 𝐻𝑜.

  • Interpretation

This is where you interpret in real world terms the conclusion to the test. The conclusion for a hypothesis test is that you either have enough evidence to support 𝐻𝑎, or you do not have enough evidence to support 𝐻𝑎.

Confidence Interval for Difference in Means from Paired Samples (t- Interval)

The confidence interval for the difference in means has the same random vari- ables and means and the same assumptions as the hypothesis test for two paired samples. If you have already completed the hypothesis test, then you do not need to state them again. If you haven’t completed the hypothesis test, then state the random variables and means, and state and check the assumptions before completing the confidence interval step.

  • Find the sample statistic and confidence interval Again, you will need to create a new data frame with a difference variable. Then on R Studio the command is

t.test(~difference variable, data=Data Frame, conf.level=C as a decimal)

 

  • Statistical Interpretation: In general this looks like, “You are C% confident that the statement contains the true mean difference.”
  • Real World Interpretation: This is where you state what interval contains the true mean difference.

How to check the assumptions of t-test and confidence interval:

In order for the t-test or confidence interval to be valid, the assumptions of the test must be met. So whenever you run a t-test or confidence interval, you must make sure the assumptions are met. So you need to check them. Here is how you do this:

  • For the assumption that the sample is a random sample, describe how you took the samples. Make sure your sampling technique is random and that the samples were dependent.
  • For the assumption that the population of the differences is normal, re- member the process of assessing normality from chapter 6. Make sure you use the difference variable that was created in the new data frame.

 

Example: Hypothesis Test for Paired Samples

Is the pulse rate after exercise different from the pulse rate before exercise for a woman who drinks alcohol? Use the data frame Pulse (table #9.2.3). Test at the 5% level.

Table #9.2.3: Data frame Pulse

imagePulse<-read.csv( “https://krkozak.github.io/MAT160/pulse.csv”)head(Pulse)

##height weight age gender smokes alcohol exercise ran

##

1

170

68

22

male

yes

yes

moderate

sat

##

2

182

75

26

male

yes

yes

moderate

sat

##

3

180

85

19

male

yes

yes

moderate

ran

##

4

182

85

20

male

yes

yes

low

sat

##

5

167

70

22

male

yes

yes

low

sat

##

6

178

86

21

male

yes

yes

low

sat

##

 

pulse_before

pulse_after

year

##

1

70

71

93

##

2

80

76

93

##

3

68

125

95

##

4

70

68

95

##

5

92

84

96

##

6

76

80

98

Code book for data frame Pulse See problem #3.2.5 in section 3.2.

Solution:

  • State the random variables and the parameters in words.

𝑥1 = pulse of a smoking woman who drinks alcohol after exercise

𝑥1 = pulse of a smoking woman who drinks alcohol before exercise

𝜇1 = mean pulse of a smoking woman who drinks alcohol after exercise

𝜇1 = mean pulse of a smoking woman who drinks alcohol after exercise

  • State the null and alternative hypotheses and the level of significance

𝐻𝑜 ∶ 𝜇1 = 𝜇2

𝐻𝑎 ∶ 𝜇1 ≠ 𝜇2

level of significance, 𝛼 = 0.05

  • State and check the assumptions for the hypothesis test
  • A random sample of 110 pairs of pulse rates after and before exercise was taken. Check: The data frame says that the data was collected from

 

students in classes for several years. Though this was not a random sample, it is probably a representative sample.

  • The population of the difference in after and before pulse rates is normally distributed. Check: To see if this is true, look at the density plot and the normal quantile plot for the difference between after and before. This variable must be created before the density plot and normal quantile plot can be created. The other thing is that a new data frame must be created for females drink alcohol.

Table #9.2.4: Data frame Pulse of Females Who Drink Alcohol

imagePulse_female<- Pulse%>%filter(gender==”female”, alcohol==”yes”)head(Pulse_female)

##height weight age gender smokes alcohol exercise ran

## 1

165

60

19 female

yes

yes

low ran

## 2

163

47

23 female

yes

yes

low ran

## 3

173

57

18 female

no

yes moderate sat

## 4

179

58

19 female

no

yes moderate ran

## 5

167

62

18 female

no

yes

high ran

## 6

173

64

18 female

no

yes

low sat

 

 

 

 

 

 

 

image##pulse_beforepulse_afteryear##18812098##27112598##3868893##48215093##59617693##6908893
Now mutate the Pulse_female data frame to include a difference variable

Table #9.2.5: Data frame Pulse of Females Who Drink Alcohol with Difference

imagePulse_female_new<-Pulse_female%>% mutate(difference=pulse_after-pulse_before)head(Pulse_female_new)

##height weight age gender smokes alcohol exercise ran

##

1

165

60

19

female

yes

yes

low

ran

##

2

163

47

23

female

yes

yes

low

ran

##

3

173

57

18

female

no

yes

moderate

sat

##

4

179

58

19

female

no

yes

moderate

ran

##

5

167

62

18

female

no

yes

high

ran

##

6

173

64

18

female

no

yes

low

sat

##pulse_before pulse_after year difference

 

 

## 1

88

120

98

32

## 2

71

125

98

54

## 3

86

88

93

2

## 4

82

150

93

68

## 5

96

176

93

80

## 6

90

88

93

-2

image
image
Now create a density plot and normal quantile plot on the difference variable.

 

 

 

0.015

 

 

 

 

 

 

imagedensity
0.010

 

 

 

 

 

 

0.005

 

 

 

 

 

 

0.000

020406080

difference

 

Figure 9.1: Density Plot of Differences in pulse rates

 

The density plot is not symmetrical and the normal quantile plot on the differ- ences is not linear. So you cannot assume that the distribution of the difference in pulse rates is normal. It is good that the t-test is robust if there is a large sample. The sample is of size 110, so that should be adequate to assume the conclusion is valid.

  • Find the sample statistic, test statistic, and p-value On R Studio, use the command:

t.test(~difference, data=Pulse_female_new)

##

## One Sample t-test ##

## data: difference

## t = 4.1353, df = 26, p-value = 0.0003283

 

 

 

 

image
image
image
image
image
80

 

 

 

 

image
image
image
image
image
image
60

 

 

 

 

imagesample
image
image
40

 

 

 

 

image
image
20

 

 

 

 

image
image
image
image
image
image
image
image
image
image
0

 

 

 

 

image
−2−1012

theoretical

 

Figure 9.2: Normal Quantile Plot of Differences in Weights

 

## alternative hypothesis: true mean is not equal to 0 ## 95 percent confidence interval:

## 11.51152 34.26625

## sample estimates:

## mean of x ## 22.88889

  • Conclusion

Since the p-value < 0.05, reject 𝐻𝑜.

  • Interpretation

There is enough evidence to support that there is a difference in pulse rate before and after exercise of females who smoke.

 

Example: Hypothesis Test for Paired Samples

The New Zealand Air Force purchased a batch of flight helmets. They then found out that the helmets didn’t fit. In order to make sure that they order the correct size helmets, they measured the head size of recruits. To save money, they wanted to use cardboard calipers, but were not sure if they will be accurate enough. So they took 18 recruits and measured their heads with the cardboard calipers and also with metal calipers. The data frame is in table #9.2.6 (Helmet

 

 

Sizes for New Zealand Airforce, 2019). Do the data provide enough evidence to show that there is a difference in measurements between the cardboard and metal calipers? Use a 5% level of significance.

Table #9.2.6: Data for Head Measurements

imageHelmet<-read.csv( “https://krkozak.github.io/MAT160/helmet.csv”)head(Helmet)

##Cardboard Metal

## 1

146145

## 2

151153

## 3

163161

## 4

152151

## 5

151145

## 6

151150

Code book for data frame Helmet

Description After purchasing a batch of flight helmets that did not fit the heads of many pilots, the NZ Airforce decided to measure the head sizes of all recruits. Before this was carried out, information was collected to determine the feasibility of using cheap cardboard calipers to make the measurements, instead of metal ones which were expensive and uncomfortable. The data lists the head diameters of 18 recruits measured once using cardboard calipers and again using metal calipers. One question is whether there is any systematic difference between the two sets of calipers. One might also ask whether there is more variability in the cardboard calipers measurement than that of the metal calipers.

This data frame contains the following columns:

Cardboard: measurement using cardboard calipers (cm) Metal: measurement using metal calipers (cm)

Source Helmet Sizes for New Zealand Airforce. (n.d.). Retrieved July 20, 2019, from http://www.statsci.org/data/oz/nzhelmet.html

References Data courtesy of Dr Stephen Legg. Seber and Lee (1998). Page 545.

Solution:

  • State the random variables and the parameters in words.

𝑥1 = head measurement of recruit using cardboard caliper

𝑥2 = head measurement of recruit using metal caliper

𝜇1= mean head measurement of recruit using cardboard caliper

 

 

𝜇2 = mean head measurement of recruit using metal caliper

  • State the null and alternative hypotheses and the level of significance

𝐻𝑜 ∶ 𝜇1 = 𝜇2

𝐻𝑎 ∶ 𝜇1 ≠ 𝜇2

level of significance, 𝛼 = 0.05

  • State and check the assumptions for the hypothesis test
  • A random sample of 18 pairs of head measures of recruits with cardboard and metal caliper was taken. Check: This was not stated, but probably could be safely assumed.
  • The population of the difference in head measurements between cardboard and metal calipers is normally distributed. Check: First create the differ- ence variable, then the density plot and normal quantile plot.

Table #9.2.7: Data for Head Measurements with Difference Variable

imageHelmet_new<-Helmet%>% mutate(difference=Cardboard-Metal)head(Helmet_new)

 

##

 

Cardboard Metal

difference

##

1

146145

1

##

2

151153

-2

##

3

163161

2

##

4

152151

1

##

5

151145

6

##

6

151150

1

gf_density(~difference, data=Helmet_new)

gf_qq(~difference, data=Helmet_new)

This density plot looks somewhat bell shaped. The normal quantile plot on the differences looks somewhat linear. So you can assume that the distribution of the difference in weights is normal.

  • Find the sample statistic, test statistic, and p-value Using R Studio the command is

t.test(~difference, data=Helmet_new)

 

##

## One Sample t-test ##

## data: difference

## t = 3.1854, df = 17, p-value = 0.005415

 

 

 

 

 

 

0.20

 

 

 

 

imagedensity
0.15

 

 

 

 

0.10

 

 

 

 

0.05

 

 

 

 

0.00


 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

image
image
−20246

difference

 

 

Figure 9.3: Desnity Plot of Differences in Head Measurements

 

 

 

image
6

 

 

 

 

 

4

 

 

 

 

 

imagesample
2

 

 

 

 

 

0

 

 

 

 

 

−2

−2−1012

theoretical

 

Figure 9.4: Normal Quantile Plot of Differences in Head Measurements

 

 

## alternative hypothesis: true mean is not equal to 0 ## 95 percent confidence interval:

## 0.5440163 2.6782060

## sample estimates:

## mean of x ## 1.611111

The sample statistic is 1.6111, the test statistic is 3.1854, and the p-value is 0.005415.

  • Conclusion

Since the p-value < 0.05, reject 𝐻𝑜.

  • Interpretation

There is enough evidence to support that the mean head measurements using the cardboard calipers are not the same as when using the metal calipers. So it looks like the New Zealand Air Force shouldn’t use the cardboard calipers.

 

Example: Confidence Interval for Paired Samples

The New Zealand Air Force purchased a batch of flight helmets. They then found out that the helmets didn’t fit. In order to make sure that they order the correct size helmets, they measured the head size of recruits. To save money, they wanted to use cardboard calipers, but were not sure if they will be accurate enough. So they took 18 recruits and measured their heads with the cardboard calipers and also with metal calipers. The data frame is in table #9.2.6 (Helmet Sizes for New Zealand Airforce, 2019). Estimate the difference in measurements between the cardboard and metal calipers using a 95% confidence interval.

Table #9.2.8: Data for Head Measurements

imageHelmet<-read.csv( “https://krkozak.github.io/MAT160/helmet.csv”)head(Helmet)

##Cardboard Metal

## 1

146145

## 2

151153

## 3

163161

## 4

152151

## 5

151145

## 6

Solution:

151150

  • State the random variables and the parameters in words. These were stated in example #9.2.2.
  • State and check the assumptions for the confidence interval

 

The assumptions were stated and checked in example #9.2.2.

  • Find the sample statistic and confidence interval Using R Studio the command is

t.test(~difference, data=Helmet_new, conf.leve=0.95)

 

##

## One Sample t-test ##

## data: difference

## t = 3.1854, df = 17, p-value = 0.005415

## alternative hypothesis: true mean is not equal to 0 ## 95 percent confidence interval:

## 0.5440163 2.6782060

## sample estimates:

## mean of x ## 1.611111

  • image𝜇2 < 2.6782060
    image0.5440163 < 𝜇1 −
    Statistical Interpretation: You are 95% confidence that

contains the true mean difference in head measurement between using the cardboard and metal calibers.

  • Real World Interpretation: The mean head measurement using the card- board calibers is anywhere from 0.54 cm to 2.68 cm more than the head measurement using the metal calibers.

Examples #9.2.2 and #9.2.4 use the same data set, but one is conducting a hypothesis test and the other is conducting a confidence interval. Notice that the hypothesis test’s conclusion was to reject and say that there was a difference in the means, and the confidence interval does not contain the number 0. If the confidence interval did contain the number 0, then that would mean that the two means could be the same. Since the interval did not contain 0, then you could say that the means are different just as in the hypothesis test. This means that the hypothesis test and the confidence interval can produce the same interpretation. Do be careful though, you can run a hypothesis test with a particular significance level and a confidence interval with a confidence level that is not compatible with your significance level. This will mean that the conclusion from the confidence interval would not be the same as with a hypothesis test. So if you want to estimate the mean difference, then conduct a confidence interval. If you want to show that the means are different, then conduct a hypothesis test. As a reminder, the American Statistical Association (ASA) suggests not conducting hypothesis tests and just create confidence intervals.

 

Homework

In each problem show all steps of the hypothesis test or confidence interval. If some of the assumptions are not met, note that the results

 

 

of the test or interval may not be correct and then continue the process of the hypothesis test or confidence interval.

  • The cholesterol level of patients who had heart attacks was measured multiple times after the heart attack. The researchers want to see if the cholesterol level of patients who have heart attacks changes as the time since their heart attack increases. The data is in table #9.2.9. Do the data show that the mean cholesterol level of patients that have had a heart attack changes as the time increases since their heart attack? Use day2 and day4 variables to answer the question. Test at the 1% level.

table #9.2.9: Cholesterol Levels of Patients After Heart Attack

imageCholesterol<-read.csv( “https://krkozak.github.io/MAT160/cholesterol.csv”)head(Cholesterol)

##patient day2 day4 day14

##

1

1 270 218

156

##

2

2 236 234

NA

##

3

3 210 214

242

##

4

4 142 116

NA

##

5

5 280 200

NA

##

6

6 272 276

256

Code book for Data Frame Cholesterol See problem 3.1.1 in Section 3.1 homework.

  • The cholesterol level of patients who had heart attacks was measured multiple times after the heart attack. The researchers want to see if the cholesterol level of patients who have heart attacks changes as the time since their heart attack increases. The data is in table #9.2.9. Calculate a 98% confidence interval for the mean difference in cholesterol levels from day two to day four.
  • All Fresh Seafood is a wholesale fish company based on the east coast of the

U.S. Catalina Offshore Products is a wholesale fish company based on the west coast of the U.S. Table #9.2.10 contains prices from both companies for specific fish types (”Seafood online,” 2013) (”Buy sushi grade,” 2013). Do the data provide enough evidence to show that fish cost different from west coast fish wholesaler and east coast wholesaler? Test at the 5% level.

Table #9.2.10: Wholesale Prices of Fish in Dollars

imagePrice <- read.csv( “https://krkozak.github.io/MAT160/price.csv”)head(Price)

##fish east west

## 1Cod 19.99 17.99

 

 

##

2

Tilapi

6.00

13.99

##

3

Farmed Salmon

19.99

22.99

##

4

Organic Salmon

24.99

24.99

##

5

Grouper Fillet

29.99

19.99

##

6

Tuna

28.99

31.99

Code book for data frame Price

Description Price of fish was collected from two websites. One for Catalina Offshore Products (west coast) and the other for All Fresh Seafood (east coast) in 2013.

This data frame contains the following columns:

fish: type of fish for sale

east: price of fish from east coast supplier ($) west: price of fish from west coast supplier ($)

Source Seafood online. (2013, November 20). Retrieved from http://www. allfreshseafood.com/

Buy sushi grade fish online. (2013, November 20). Retrieved from http://www. catalinaop.com/

References Websites of Cataline Offshore Products and All Fresh Seafood

  • All Fresh Seafood is a wholesale fish company based on the east coast of the

U.S. Catalina Offshore Products is a wholesale fish company based on the west coast of the U.S. Table #9.2.10 contains prices from both companies for specific fish types (”Seafood online,” 2013) (”Buy sushi grade,” 2013). Find a 95% confidence interval for the mean difference in wholesale price between the east coast and west coast suppliers.

  • The British Department of Transportation studied to see if people avoid driving or shopping, or have more accidents on Friday the 13th. They collected data from different locations (Friday the 13th, 2019). The data for each location on the two different dates is in table #9.2.11. Do the data show that on average different number of people are engaged in activities on Friday the 13th? Test at the 5% level.

Table #9.2.11: Traffic Count

imageTraffic <- read.csv( “https://krkozak.github.io/MAT160/traffic.csv”)head(Traffic)

##source yearmonthX6th X13th location ## 1 traffic 1990,July 139246 1385487 to 8

## 2 traffic 1990,July 134012 132908 9 to 10

## 3 traffic 1991, September 137055 1360187 to 8

 

 

## 4 traffic 1991, September 133732 131843 9 to 10

## 5 traffic 1991, December 123552 1216417 to 8

## 6 traffic 1991, December 121139 118723 9 to 10

Code book for data frame Traffic

Description This file consists of three separate data sets, all of which address the issues of how superstitions regarding Friday the 13th affect human behavior, and whether Friday the 13th is an unlucky day. Scanlon, et al. collected data on traffic and shopping patterns and accident frequency for Fridays the 6th and 13th between October of 1989 and November of 1992.

For the first data set, the researchers obtained information from the British Department of Transport regarding the traffic flows between junctions 7 to 8 and junctions 9 to 10 of the M25 motorway. They collected the numbers of shoppers in nine different supermarkets in southeast England for the second data set. The third data set contains numbers of emergency admissions to hospitals due to transport accidents.

We present the three data sets in a combined format, with the variable “Data set” as an identifier that may be used to separate them.

This data frame contains the following columns:

source: which data set the data were obtained from year: which year the data was collected from Month: the month that the Friday was in

x6th: Number of cars passing through junction (traffic data set), shoppers for each supermarket (shopping data set), or admissions due to transport accidents (accident data set) on Friday the 6th

x13th: Number of cars passing through junction (traffic data set), shoppers for each supermarket (shopping data set), or admissions due to transport accidents (accident data set) on Friday the 13th

location: Motorway junction (traffic data set), supermarket location (shopping data set) or hospital (accident data set) to which the data correspond

Source (n.d.). Retrieved from https://www3.nd.edu/~busiforc/handouts/Data and Stories/t test/Friday The Thirteenth/Friday The Thirteenth Data.html

References Scanlon, T.J., Luben, R.N., Scanlon, F.L., Singleton, N. (1993), “Is Friday the 13th Bad For Your Health?,” BMJ, 307, 1584-1586.

  • The British Department of Transportation studied to see if people avoid driving or shopping, or have more accidents on Friday the 13th. They collected data from different locations (Friday the 13th, 2019). The data for each location on the two different dates is in table #9.2.11. Do the

 

data show that on average different number of people are engaged in activ- ities on Friday the 13th? Estimate the mean difference in activity count between the 6th and the 13th using a 95% level.

  • To determine if Reiki is an effective method for treating pain, a pilot study was carried out where a certified second-degree Reiki therapist provided treatment on volunteers. Pain was measured using a visual analogue scale (VAS) and a likert scale immediately before and after the Reiki treatment (Olson & Hanson, 1997). The data is in table #9.2.12. Do the data show that Reiki treatment reduces pain? Test at the 5% level.

Table #9.2.12: Pain Measures Before and After Reiki Treatment

imageReiki<- read.csv( “https://krkozak.github.io/MAT160/reki.csv”)head(Reiki)

##vas.before vas.after likert_before likert_after

## 1

6

3

2

1

## 2

2

1

2

1

## 3

2

0

3

0

## 4

9

1

3

1

## 5

3

0

2

0

## 6

3

2

2

2

Code book for data frame Reiki see problem #3.2.7 in Section 3.2 Home- work

  • To determine if Reiki is an effective method for treating pain, a pilot study was carried out where a certified second-degree Reiki therapist provided treatment on volunteers. Pain was measured using a visual analogue scale (VAS) and a likert scale immediately before and after the Reiki treatment (Olson & Hanson, 1997). The data is in table #9.2.12. Compute a 90% confidence level for the mean difference in VAS score from before and after Reiki treatment.
  • The female labor force participation rates (FLFPR) of women in countries from 1990 to 2018 are in table #9.2.13 (Labor force participation rate, female (% of female population ages 15+) (modeled ILO estimate), 2019). Do the data show that the mean female labor force participation rate in 1990 is different from that in the 2018 using a 5% level of significance?

Table #9.2.13: Female Labor Force Participation Rates

imageLabor <- read.csv( “https://krkozak.github.io/MAT160/labor.csv”)head(Labor)

##Country.Name Country.CodeRegion ## 1ArubaABW Latin America & Caribbean

 

image##2Lowincome43.1150043.1240043.1290043.07200##3Lower middleincome74.9450074.8790074.8260074.78200##4Upper middleincome53.7710056.2960056.6870055.74700##5HighincomeNANANANA##619.1899719.2409419.1315919.29515##y1994y1995y1996y1997y1998y1999

## 2

Afghanistan

AFG

 

South Asia

## 3

Angola

AGO

Sub-Saharan Africa

## 4

Albania

ALB

Europe & Central Asia

## 5

Andorra

AND

Europe & Central Asia

## 6

Arab World

ARB

 

 

 

##

IncomeGroup

y1990

y1991

y1992

y1993

## 1

High income

NA

NA

NA

NA

 

 

 

 

 

 

##

1

NA

NA

NA

NA

NA

NA

##

2

43.00300

43.01700

42.77000

42.55400

42.41300

42.3340

##

3

74.77000

74.78400

74.78300

74.80600

74.84600

74.8940

##

4

54.90400

53.74600

53.07500

53.81200

53.15400

52.2540

##

5

NA

NA

NA

NA

NA

NA

##

6

19.64479

19.66156

19.51602

19.27293

19.07511

19.5351

##

 

y2000

y2001

y2002

y2003

y2004

y2005

##

1

NA

NA

NA

NA

NA

NA

##

2

42.27400

42.53900

42.89900

43.28600

43.66100

44.02500

##

3

74.94100

74.96200

74.98400

75.01100

75.04800

75.09400

##

4

51.76900

51.11000

50.67900

49.75900

48.87800

48.05100

##

5

NA

NA

NA

NA

NA

NA

##

6

19.59284

19.52237

19.08892

19.32403

19.44488

19.53444

##

 

y2006

y2007

y2008y2009y2010y2011

##

1

NA

NA

NANANANA

##

2

43.59700

43.19200

42.8730 42.70900 42.73500 43.32800

##

3

75.12600

75.16500

75.2090 75.25600 75.30700 75.34400

##

4

47.38900

46.80300

46.2690 44.94500 45.69300 47.10400

##

5

NA

NA

NANANANA

##

6

19.68183

20.17107

19.8473 20.05784 20.17166 20.27703

##

 

y2012

y2013

y2014y2015y2016y2017

##

1

NA

NA

NANANANA

##

2

44.11700

45.03900

46.01700 47.00100 47.76600 48.47400

##

3

75.38900

75.43300

75.46500 75.47900 75.47000 75.45100

##

4

48.80600

44.65000

44.78900 47.67600 47.45900 47.31200

##

5

NA

NA

NANANANA

##

6

20.46453

20.76731

20.70378 20.51515 20.61605 20.56842

##

 

y2018

 

 

##

1

NA

##

2

48.66000

##

3

75.41200

##

4

47.19100

##

5

NA

 

 

## 6 20.58152

Code book for data frame Labor

Description Labor force participation rate, female (% of female population ages 15+)

This data frame contains the following columns:

Country Name: The name of a country around the world Country Code: The 3 letter country code

Region: The location of the country in the world IncomeGroup: The World Bank’s income classification

y1990-y2018: Labor force participation rate, female (% of female population ages 15+) for the years 100–2018

Source Labor force participation rate, female (% of female population ages 15 ) (modeled ILO estimate). (n.d.). Retrieved July 20, 2019, from https:

//data.worldbank.org/indicator/SL.TLF.CACT.FE.ZS

References International Labour Organization, ILOSTAT database. Data re- trieved in April 2019.

  • The female labor force participation rates (FLFPR) of women in coun- tries from 1990 to 2018 are in table #9.2.13 (Labor force participation rate, female (% of female population ages 15+) (modeled ILO estimate), 2019).Estimate the mean difference in the female labor force participation rate in 1990 to 2018 using a 95% confidence level?
  • Is the pulse rate after exercise different from the pulse rate before exercise for a man who drinks alcohol but doesn’t smoke? Use the data frame Pulse (table #9.2.3). Test at the 5% level.

Table#9.2.14: Pulse Rate Before and After Exercise

imagePulse<-read.csv( “https://krkozak.github.io/MAT160/pulse.csv”)head(Pulse)

##height weight age gender smokes alcohol exercise ran

##

1

170

68

22

male

yes

yes

moderate

sat

##

2

182

75

26

male

yes

yes

moderate

sat

##

3

180

85

19

male

yes

yes

moderate

ran

##

4

182

85

20

male

yes

yes

low

sat

##

5

167

70

22

male

yes

yes

low

sat

##

6

178

86

21

male

yes

yes

low

sat

##pulse_before pulse_after year ## 1707193

## 2807693

 

 

## 3

68

125

95

## 4

70

68

95

## 5

92

84

96

## 6

76

80

98

Code book for data frame Pulse See problem 3.2.5 in section 3.2.

  • Table #9.2.14 contains pulse rates Compute a 95% confidence interval for the mean difference in pulse rates from before and after exercise for males who drink but do not smoke.

 

 

 

 

Independent Samples for Two Means

This section will look at how to analyze when two samples are collected that are independent. As with all other hypothesis tests and confidence intervals, the process is the same though the formulas and assumptions are different.

Hypothesis Test for Independent t-Test (2-Sample t-Test)

  • State the random variables and the parameters in words.

𝑥1 = random variable 1

𝑥2= random variable 2

𝜇1= mean of random variable 1

𝜇2= mean of random variable 2

  • State the null and alternative hypotheses and the level of significance The hypotheses would be

𝐻𝑜 ∶ 𝜇1 = 𝜇2

𝐻𝑎 ∶ 𝜇1 ≠ 𝜇2, the ≠ can be replaced with < or >

Also, state your 𝛼 level here.

  • State and check the assumptions for the hypothesis test
  • image𝑛2
    image𝑛1
    A random sample of size is taken from population 1. A random sample of size is taken from population 2. Note: the samples do not need to be the same size, but the test is more robust if they are.
  • The two samples are independent.
  • Population 1 is normally distributed. Population 2 is normally distributed. Just as before, the t-test is fairly robust to the assumption if the sample size is large. This means that if this assumption isn’t met, but your sample sizes are quite large, then the results of the t-test are valid.

 

  • The population variances are unknown and not assumed to be equal. The old assumption is that the variances are equal. However, this assumption is no longer an assumption that most statisticians use. This is because it isn’t really realistic to assume that the variances are equal. So just assume the assumption of the variances being unknown and not assumed to be equal is true, and it will not be checked.
  • Find the sample statistic, test statistic, and p-value The command using R is

t.test(variable~factor, data=Data Frame)

Note: if the 𝐻𝑎 is <, then the command becomes

t.test(variable~factor, data=Data Frame, alternative=”less”)

 

Similarly for > put alternative=“greater”

  • Conclusion

This is where you write reject or fail to reject . The rule is: if the p-value < 𝛼 , then reject 𝐻𝑜. If the p-value ≥ 𝛼, then fail to reject 𝐻𝑜.

  • Interpretation

This is where you interpret in real world terms the conclusion to the test. The conclusion for a hypothesis test is that you either have enough evidence to support 𝐻𝑎, or you do not have enough evidence to support 𝐻𝑎.

Confidence Interval for the Difference in Means from Two Indepen- dent Samples (2 Samp T-Int)

The confidence interval for the difference in means has the same random vari- ables and means and the same assumptions as the hypothesis test for indepen- dent samples. If you have already completed the hypothesis test, then you do not need to state them again. If you haven’t completed the hypothesis test, then state the random variables and means and state and check the assumptions before completing the confidence interval step.

Find the sample statistic and confidence interval On R Studio, the command is

t.test(variable~factor, data=Data Frame, conf.level=C as a decimal)

 

  • Statistical Interpretation: In general this looks like, “You are C% confident that the interval contains the true mean difference.”
  • Real World Interpretation: This is where you state what interval contains the true difference in means, though often you state how much more (or less) the first mean is from the second mean.

 

How to check the assumptions of two sample t-test and confidence interval:

In order for the t-test or confidence interval to be valid, the assumptions of the test must be valid So whenever you run a t-test or confidence interval, you must make sure the assumptions are valid So you need to check them. Here is how you do this:

  • For the random sample assumption, describe how you took the two sam- ples. Make sure your sampling technique is random for both samples.
  • For the independent assumption, describe how they are independent sam- ples.
  • For the assumption about each population being normally distributed, remember the process of assessing normality from chapter 6. Make sure you assess each sample separately.
  • You do not need to check the variance assumption since it is being as- sumed.

 

Example: Hypothesis Test for Two Means

The cholesterol level of people vary for many reasons. The question is do people with diabetes have different cholesterol levels from people who do not have diabetes? Use the NHANES data frame. Test at the 5% level.

Table #9.3.1: NHANES Data Frame

names(NHANES) #displays the names of the variables in a data frame

 

##

[1]

“ID”

“SurveyYr”

##

[3]

“Gender”

“Age”

##

[5]

“AgeDecade”

“AgeMonths”

##

[7]

“Race1”

“Race3”

##

[9]

“Education”

“MaritalStatus”

##

[11]

“HHIncome”

“HHIncomeMid”

##

[13]

“Poverty”

“HomeRooms”

##

[15]

“HomeOwn”

“Work”

##

[17]

“Weight”

“Length”

##

[19]

“HeadCirc”

“Height”

##

[21]

“BMI”

“BMICatUnder20yrs”

##

[23]

“BMI_WHO”

“Pulse”

##

[25]

“BPSysAve”

“BPDiaAve”

##

[27]

“BPSys1”

“BPDia1”

##

[29]

“BPSys2”

“BPDia2”

##

[31]

“BPSys3”

“BPDia3”

##

[33]

“Testosterone”

“DirectChol”

##

[35]

“TotChol”

“UrineVol1”

 

 

##

[37]

“UrineFlow1”

“UrineVol2”

##

[39]

“UrineFlow2”

“Diabetes”

##

[41]

“DiabetesAge”

“HealthGen”

##

[43]

“DaysPhysHlthBad”

“DaysMentHlthBad”

##

[45]

“LittleInterest”

“Depressed”

##

[47]

“nPregnancies”

“nBabies”

##

[49]

“Age1stBaby”

“SleepHrsNight”

##

[51]

“SleepTrouble”

“PhysActive”

##

[53]

“PhysActiveDays”

“TVHrsDay”

##

[55]

“CompHrsDay”

“TVHrsDayChild”

##

[57]

“CompHrsDayChild”

“Alcohol12PlusYr”

##

[59]

“AlcoholDay”

“AlcoholYear”

##

[61]

“SmokeNow”

“Smoke100”

##

[63]

“Smoke100n”

“SmokeAge”

##

[65]

“Marijuana”

“AgeFirstMarij”

##

[67]

“RegularMarij”

“AgeRegMarij”

##

[69]

“HardDrugs”

“SexEver”

##

[71]

“SexAge”

“SexNumPartnLife”

##

[73]

“SexNumPartYear”

“SameSex”

##

[75]

“SexOrientation”

“PregnantNow”

Code book for data frame NHANES type help(“NHANES”) in the R Console.

Solution:

  • State the random variables and the parameters in words.

𝑥1 = Cholesterol level of people with diabetes

𝑥2 = Cholesterol level of people without diabetes

𝜇1 = mean cholesterol level of people with diabetes

𝜇2 = mean cholesterol level of people without diabetes

  • State the null and alternative hypotheses and the level of significance The hypotheses would be

𝐻𝑜 ∶ 𝜇1 = 𝜇2

𝐻𝑎 ∶ 𝜇1 ≠ 𝜇2

level of significance, 𝛼 = 0.05

  • State and check the assumptions for the hypothesis test
  • A random sample of cholesterol levels of people with diabetes is taken. A random sample of cholesterol levels of people without diabetes is taken. Check: The NHANES data frame uses cluster sampling which incorpo- rates random sampling, so the sample is probably representative. This assumption has been met.

 

  • The two samples are independent. Check: This is because either they were dealing with people who have diabetes or not.
  • Population of all cholesterol levels of people who have diabetes is normally distributed. Population of all cholesterol levels of people without diabetes is normally distributed. Check:

gf_density(~TotChol|Diabetes, data=NHANES)

 

 

 

 

imageNAYesNo
0.4

 

 

 

 

 

0.3

 

 

 

 

 

imagedensity
0.2

 

 

 

 

 

0.1

 

 

 

 

 

0.0

510510510

TotChol

 

Figure 9.5: Density Plot of Cholesterol levels of people with and without Dia- betes:

 

Both the yes group and the no group look somewhat bell shaped.

gf_qq(~TotChol|Diabetes, data=NHANES)

 

Both the yes group and the no group look somewhat linear.

The population of all cholesterol levels of people who have diabetes is probably normally distributed. The population of all cholesterol levels of people who do not have diabetes is probably normally distributed.

  • Find the sample statistic, test statistic, and p-value

The variable is cholesterol (TotChol) and separating based on if a person has diabetes or not. So the factor is Diabetes. Using R Studio the command would be

 

 

 

 

 

 

 

 

 

 

 

 

 

imageNo
imageYes
imageNA
imagesample
10

 

 

 

 

 

 

 

 

5

 

 

 

 

 

 

 

−4−2024 −4−2024 −4−2024

theoretical

 

Figure 9.6: Normal Quantile Plot of Cholesterol Levels of people with and without Diabetes

 

 

t.test(TotChol~Diabetes, data=NHANES)

 

##

## Welch Two Sample t-test ##

## data: TotChol by Diabetes

## t = 2.4286, df = 809.7, p-value = 0.01537

## alternative hypothesis: true difference in means is not equal to 0 ## 95 percent confidence interval:

## 0.02105115 0.19851114

## sample estimates:

## mean in group No mean in group Yes ##4.8879364.778155

  • Conclusion

Reject 𝐻𝑜 since the p-value < 𝛼.

  • Interpretation

This is enough evidence to support that people who have diabetes have different cholesterol levels on average from people who do not have diabetes.

 

 

Example: Confidence Interval in Two Samples

The cholesterol level of people vary for many reasons. The question is how different is the cholesterol levels of people with diabetes from people who do not have diabetes? Use the NHANES data frame. Compute a 95% confidence interval.

Solution:

  • State the random variables and the parameters in words. These were stated in example #9.3.1
  • State and check the assumptions for the hypothesis test The assumptions were stated and checked in example #9.3.1.
  • Find the sample statistic and confidence interval

The variable is cholesterol (TotChol) and separating based on if a person has diabetes or not. So the factor is Diabetes. Using R Studio the command would be

t.test(TotChol~Diabetes, data=NHANES, conf.level=0.95)

 

##

## Welch Two Sample t-test ##

## data: TotChol by Diabetes

## t = 2.4286, df = 809.7, p-value = 0.01537

## alternative hypothesis: true difference in means is not equal to 0 ## 95 percent confidence interval:

## 0.02105115 0.19851114

## sample estimates:

## mean in group No mean in group Yes ##4.8879364.778155

  • Statistical Interpretation:You are 95% confident that the interval

0.02105115 < 𝜇1 − 𝜇2 < 0.19851114 contains the true difference in means.

  • Real World Interpretation: The mean cholesterol level for people with diabetes is anywhere from 0.021 mmol/L to 0.199 mmol/L more than the mean cholesterol level for people without diabetes.

 

Example: Hypothesis Test for Two Means

The amount of sodium in beef and poultry hot dogs was measured. (”SOCR 012708 id,” 2013). The data is in table #9.3.2. Is there enough evidence to show that beef has different amounts of sodium on average than poultry hot dogs? Use a 5% level of significance.

Table #9.3.2: Hotdog Data

 

 

imageHotdog<-read.csv( “https://krkozak.github.io/MAT160/hotdog_beef_poultry.csv”)head(Hotdog)

##type calories sodium

## 1 Beef

186495

## 2 Beef

181477

## 3 Beef

176425

## 4 Beef

149322

## 5 Beef

184482

## 6 Beef

190587

Code book for data frame Hotdog

Description Results of a laboratory analysis of calories and sodium content of major hot dog brands. Researchers for Consumer Reports analyzed three types of hot dog: beef, poultry, and meat (mostly pork and beef, but up to 15% poultry meat). The meat was left off this data frame so a two-sample t-test could be performed.

This data frame contains the following columns:

type: Type of hot dog (beef or poultry) calories: Calories per hot dog

sodium: Milligrams of sodium per hot dog

Source SOCR 012708 id data hotdogs. (2013, November 13). Retrieved from http://wiki.stat.ucla.edu/socr/index.php/SOCR_012708_ID_Data_HotDogs

References SOCR Home page: http://www.socr.ucla.edu

Solution:

  • State the random variables and the parameters in words.

𝑥1 = sodium level in beef hotdogs

𝑥2 = sodium level in poultry hotdogs

𝜇1 = mean sodium level in beef hotdogs

𝜇2 = mean sodium level in poultry hotdogs

  • State the null and alternative hypotheses and the level of significance The hypotheses would be

𝐻𝑜 ∶ 𝜇1 = 𝜇2

𝐻𝑜 ∶ 𝜇1 ≠ 𝜇2

level of significance: 𝛼 = 0.05

 

 

  • State and check the assumptions for the hypothesis test
  • A random sample of 20 sodium levels in beef hotdogs is taken. A random sample of 20 sodium levels in poultry hotdogs. Check: The code does not state if either sample was randomly selected, but since Consumer Reports performed the test, it is safe to assume the samples were both random.
  • The two samples are independent. Check: These are different types of hotdogs so this is true.
  • Population of all sodium levels in beef hotdogs is normally distributed. Population of all sodium levels in poultry hotdogs is normally distributed. Check:

gf_density(~sodium|type, data=Hotdog)

 

 

 

 

 

imageBeef
imagePoultry
0.004

 

 

 

 

0.003

 

 

 

 

imagedensity
0.002

 

 

 

 

0.001

 

 

 

 

0.000

300400500600300400500600

sodium

 

Figure 9.7: Density Plot of Sodium Levels in Hotdogs

 

The density plot for beef hotdogs looks somewhat bell shaped, but the density plot for poultry hotdogs does not look bell shaped.

gf_qq(~sodium|type, data=Hotdog)

 

The normal quantile plot for the sodium level in beef hotdogs looks somewhat linear. The normal quantile plot for the sodium level in poultry hotdogs does not look linear. The population of all sodium levels in beef hotdogs may be normally distributed, but the population of all sodium levels in poultry hotdogs

 

 

 

 

 

 

 

 

imageBeef
imagePoultry
600

 

 

 

 

imagesample
500

 

 

 

 

400

 

 

 

 

300

 

 

 

−2−1012 −2−1012

theoretical

 

Figure 9.8: Normal Quantile Plot of Sodium Levels in Hotdogs

 

is probably not normally distributed. The sample size is not very large so the results of the test may not be valid. A larger sample would be a good idea.

  • Find the sample statistic, test statistic, and p-value

Using R Studio the variable is sodium levels (sodium) in different types of hot dogs. So the factor is type. The command is

t.test(sodium~type, data=Hotdog)

 

##

## Welch Two Sample t-test ##

## data: sodium by type

## t = -1.8798, df = 34.983, p-value = 0.06848

## alternative hypothesis: true difference in means is not equal to 0 ## 95 percent confidence interval:

## -120.3257064.625706

## sample estimates:

##mean in group Beef mean in group Poultry ##401.15459.00

  • Conclusion: Fail to reject 𝐻𝑜 since the p-value ≥ 𝛼.
  • Interpretation

 

This is not enough evidence to support that beef hotdogs’ sodium level is differ- ent from poultry hotdogs. (Though do realize that the population assumptions is not valid, so this interpretation may be invalid.)

 

Example: Confidence Interval for Two Independent Samples

The amount of sodium in beef and poultry hotdogs was measured. (”SOCR 012708 id,” 2013). The data is in table #9.3.2. Find a 95% confidence interval for the mean difference in sodium levels between beef and poultry hotdogs.

Solution:

  • State the random variables and the parameters in words. These were stated in example #9.3.3.
  • State and check the assumptions for the hypothesis test The assumptions were stated and checked in example #9.3.3.
  • Find the sample statistic and confidence interval Using R Studio the vari- able is sodium levels (sodium) in different types of hot dogs. So the factor is type. The command is

t.test(sodium~type, data=Hotdog, conf.level=0.95)

 

##

## Welch Two Sample t-test ##

## data: sodium by type

## t = -1.8798, df = 34.983, p-value = 0.06848

## alternative hypothesis: true difference in means is not equal to 0 ## 95 percent confidence interval:

## -120.3257064.625706

## sample estimates:

##mean in group Beef mean in group Poultry ##401.15459.00

  • image−120.325706 < 𝜇1 − 𝜇2 < 4.625706
    Statistical Interpretation:You are 95% confident that the interval

contains the true difference in mean sodium level between beef and poultry hot dogs.

  • Real World Interpretation: The mean sodium level of beef hotdogs is anywhere from 120.33 mg less than the mean sodium level of poultry hotdogs to 4.63 mg more. (The negative sign on the lower limit implies that the first mean is less than the second mean. The positive sign on the upper limit implies that the first mean is greater than the second mean.)

Do realize that the population assumptions is not valid, so this interpretation may be invalid.

 

 

Homework

In each problem show all steps of the hypothesis test or confidence interval. If some of the assumptions are not met, note that the results of the test or interval may not be correct and then continue the process of the hypothesis test or confidence interval.

  • The NHANES data contains many variables. One variable is the income of households derived from the middle income of different income cate- gories. The variable is called HHIncomeMid. Is there enough evidence to show that the mean income of males is different from the mean income of females? Test at the 1% level.

 

Table #9.3.3: NHANES

 

names(NHANES)

 

## [1] “ID”

“SurveyYr”

## [3] “Gender”

“Age”

## [5] “AgeDecade”

“AgeMonths”

## [7] “Race1”

“Race3”

## [9] “Education”

“MaritalStatus”

## [11] “HHIncome”

“HHIncomeMid”

## [13] “Poverty”

“HomeRooms”

## [15] “HomeOwn”

“Work”

## [17] “Weight”

“Length”

## [19] “HeadCirc”

“Height”

## [21] “BMI”

“BMICatUnder20yrs”

## [23] “BMI_WHO”

“Pulse”

## [25] “BPSysAve”

“BPDiaAve”

## [27] “BPSys1”

“BPDia1”

## [29] “BPSys2”

“BPDia2”

## [31] “BPSys3”

“BPDia3”

## [33] “Testosterone”

“DirectChol”

## [35] “TotChol”

“UrineVol1”

## [37] “UrineFlow1”

“UrineVol2”

## [39] “UrineFlow2”

“Diabetes”

## [41] “DiabetesAge”

“HealthGen”

## [43] “DaysPhysHlthBad”

“DaysMentHlthBad”

## [45] “LittleInterest”

“Depressed”

## [47] “nPregnancies”

“nBabies”

## [49] “Age1stBaby”

“SleepHrsNight”

## [51] “SleepTrouble”

“PhysActive”

## [53] “PhysActiveDays”

“TVHrsDay”

## [55] “CompHrsDay”

“TVHrsDayChild”

## [57] “CompHrsDayChild”

“Alcohol12PlusYr”

## [59] “AlcoholDay”

“AlcoholYear”

## [61] “SmokeNow”

“Smoke100”

 

 

## [63] “Smoke100n”“SmokeAge”

## [65] “Marijuana”“AgeFirstMarij” ## [67] “RegularMarij”“AgeRegMarij” ## [69] “HardDrugs”“SexEver”

## [71] “SexAge”“SexNumPartnLife” ## [73] “SexNumPartYear”“SameSex”

## [75] “SexOrientation”“PregnantNow”

  • The NHANES data contains many variables. One variable is the income of households derived from the middle income of different income categories. The variable is called HHIncomeMid. Estimate with 95% confidence the mean difference in incomes between males and females in the U.S.
  • A study was conducted that measured the total brain volume (TBV) of patients that had schizophrenia and patients that do not have schizophre- nia. Table #9.3.4 contains the TBV of the all patients (”SOCR data oct2009,” 2013). Is there enough evidence to show that the patients with schizophrenia have a different TBV on average than a patient without schizophrenia? Test at the 10% level.

Table #9.3.4: Total Brain Volume of Patients

imageBrain <- read.csv( “https://krkozak.github.io/MAT160/brain.csv”)head(Brain)

 

##type volume ## 1n 1663407

## 2n 1583940

## 3n 1299470

## 4n 1535137

## 5n 1431890

## 6n 1578698

Code book for data frame Brain

Description A study to measure the total brain volume (TBV) (in ) of patients that had schizophrenia and patients that do not have schizophrenia.

This data frame contains the following columns:

type: whether the patient had schizophrenia (s) or did not have schizophrenia (n)

volume: the total brain volume of a patient.(𝑚𝑚3)

Source SOCR data Oct2009 id ni. (2013, November 16). Retrieved from http://wiki.stat.ucla.edu/socr/index.php/SOCR_Data_Oct2009_ID_NI

References ”SOCR data nips,” 2013

 

 

  • A study was conducted that measured the total brain volume (TBV) of patients that had schizophrenia and patients that do not have schizophre- nia. Table #9.3.4 contains the TBV of the all patients (”SOCR data oct2009,” 2013). Is there enough evidence to show that the patients with schizophrenia have a different TBV on average than a patient without schizophrenia? Test at the 10% level. Compute a 90% confidence inter- val for the difference in TBV of patients with Schizophrenia and patients without Schizophrenia.
  • The lengths (in kilometers) of rivers on the South Island of New Zealand and what body of water they flow into are listed in table #9.3.5 (Lee, 1994). Do the data provide enough evidence to show on average that the rivers that travel to the Pacific Ocean are different length than the rivers that travel to the Tasman Sea? Use a 5% level of significance.

Table #9.3.5: Lengths of Rivers (km) Flowing to Pacific Ocean

imageLength<-read.csv( “https://krkozak.github.io/MAT160/length.csv”)head(Length)

## river length flowsto

##

1

Clarence

209

Pacific

##

2

Conway

48

Pacific

##

3

Waiau

169

Pacific

##

4

Hurunui

138

Pacific

##

5

Waipara

64

Pacific

##

6

Ashley

97

Pacific

Code book for data frame Length See problem 3.1.2 in Section 3.1 home- work.

  • The lengths (in kilometers) of rivers on the South Island of New Zealand and what body of water they flow into are listed in table #9.3.5 (Lee, 1994). Estimate the difference in mean lengths of rivers between rivers in New Zealand that travel to the Pacific Ocean and ones that travel to the Tasman Sea. Use a 95% confidence level.
  • A vitamin K shot is given to infants soon after birth. Nurses at Northbay Healthcare were involved in a study to see if how they handle the infants could reduce the pain the infants feel (”SOCR data nips,” 2013). The data frame is in table #9.3.6. Is there enough evidence to show that infants cried a different amount on average when they are held by their mothers than if held using conventional methods? Test at the 5% level.

Table #9.3.6: Crying Time of Infants Given Shots Using New Meth- ods

imageCrying<- read.csv( “https://krkozak.github.io/MAT160/crying.csv”)

 

 

image## 1 convent63## 2 convent0## 3 convent2## 4 convent46## 5 convent33## 6 convent33
head(Crying) ##method crying

 

 

 

 

 

Code book for data frame Crying

Description Nurses at Northbay Healthcare were involved in a study to see if how they handle the infants could reduce the pain the infants feel. One of the measurements taken was how long, in seconds, the infant cried after being given the shot. A random sample was taken from the group that was given the shot using conventional methods, and a random sample was taken from the group that was given the shot where the mother held the infant prior to and during the shot.

This data frame contains the following columns:

method: whether the infant was given the conventional method (convent) or the new method (new) prior to being given the vitamin K shot.

crying: how long the infant cried after given a vitamin K shot. (seconds)

Source SOCR data nips infantvitK shotdata. (2013, November 16). Re- trieved from http://wiki.stat.ucla.edu/socr/index.php/SOCR_Data_NIPS_ InfantVitK_ShotData

References ”SOCR data nips,” 2013

  • A vitamin K shot is given to infants soon after birth. Nurses at Northbay Healthcare were involved in a study to see if how they handle the infants could reduce the pain the infants feel (”SOCR data nips,” 2013). The data frame is in table #9.3.6. Calculate a 95% confidence interval for the mean difference in mean crying time after being given a vitamin K shot between infants held using conventional methods and infants held by their mothers.

 

 

 

 

Which Analysis Should You Conduct?

One of the most important concept that you need to understand is deciding which analysis you should conduct for a particular situation. To help you to

 

 

figure out the analysis to conduct, there are a series of questions you should ask yourself.

  • Does the problem deal with mean or proportion?

Sometimes the problem states explicitly the words mean or proportion, but other times you have to figure it out based on the information you are given. If you counted number of individuals that responded in the affirmative to a question, then you are dealing with proportion. If you measured something, then you are dealing with mean.

  • Does the problem have one or two samples?

So look to see if one group was measured or if two groups were measured. You need to decide if the problem describes collecting data from one group or from two groups, or if you are comparing two different groups.

  • If you have two samples, then you need to determine if the samples are independent or dependent.

If the individuals are different for both samples, then most likely the samples are independent. If you can’t tell, then determine if a data value from the first sample influences the data value in the second sample. In other words, can you pair data values together so you can find the difference, and that difference has meaning. If the answer is yes, then the samples are paired. Otherwise, the samples are independent.

  • Does the situation involve a hypothesis test or a confidence interval?

If the problem talks about “do the data show”, “is there evidence of”, “test to see”, then you are doing a hypothesis test. If the problem talks about “find the value”, “estimate the” or “find the interval”, then you are doing a confidence interval.

So if you have a situation that has two samples, independent samples, involving the mean, and is a hypothesis test, then you have a two-sample independent t-test. Now you look up the assumptions and the technology process for doing this test. Every hypothesis test involves the same six steps, and you just have to use the correct assumptions and calculations. Every confidence interval has the same five steps, and again you just need to use the correct assumptions and calculations. So this is why it is so important to figure out what analysis you should conduct.

 

 

 

Data Sources:

AP exam scores. (2013, November 20). Retrieved from http://wiki.stat.ucla. edu/socr/index.php/SOCR_Data_Dinov_030708_APExamScores

 

 

Center for Disease Control and Prevention, Prevalence of Autism Spectrum Disorders – Autism and Developmental Disabilities Monitoring Network. (2008). Autism and developmental disabilities monitoring network-2012. Retrieved from website: http://www.cdc.gov/ncbddd/autism/documents/ADDM-2012- Community-Report.pdf

Flanagan, R., Rooney, C., & Griffiths, C. (2005). Fatal poisoning in child- hood, england & wales 1968-2000. Forensic Science International, 148:121-129, Retrieved from http://www.cdc.gov/nchs/data/ice/fatal_poisoning_child.pdf

Gettler, L. T., McDade, T. W., Feranil, A. B., & Kuzawa, C. W. (2011). Longitudinal evidence that fatherhood decreases testosterone in human males. The Proceedings of the National Academy of Sciences, PNAS 2011, doi: 10.1073/pnas.1105403108

Population reference bureau. (2013, October 8). Retrieved from http://www. prb.org/DataFinder/Topic/Rankings.aspx?ind=25

Helmet Sizes for New Zealand Airforce. (n.d.). Retrieved July 20, 2019, from http://www.statsci.org/data/oz/nzhelmet.html

Seafood online. (2013, November 20). Retrieved from http://www. allfreshseafood.com/

Buy sushi grade fish online. (2013, November 20). Retrieved from http://www. catalinaop.com/

(n.d.). Retrieved from https://www3.nd.edu/~busiforc/handouts/Data and Sto- ries/t test/Friday The Thirteenth/Friday The Thirteenth Data.html

Labor force participation rate, female (% of female population ages 15 ) (modeled ILO estimate). (n.d.). Retrieved July 20, 2019, from https://data.worldbank.org/indicator/SL.TLF.CACT.FE.ZS

SOCR 012708 id data hotdogs. (2013, November 13). Retrieved from http:

//wiki.stat.ucla.edu/socr/index.php/SOCR_012708_ID_Data_HotDogs

SOCR data Oct2009 id ni. (2013, November 16). Retrieved from http://wiki. stat.ucla.edu/socr/index.php/SOCR_Data_Oct2009_ID_NI

SOCR data nips infantvitK shotdata. (2013, November 16). Retrieved from http://wiki.stat.ucla.edu/socr/index.php/SOCR_Data_NIPS_InfantVitK_ ShotData

 

 

 

 

 

 

 

Chapter 10

 

License

Icon for the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License

Example for Nick Copyright © 2025 by Nick Taylor & Dr. Dana Scully is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License, except where otherwise noted.