9 Two-Sample Inference
Chapter 7 discussed methods of hypothesis testing about one-population parameters. Chapter 8 discussed methods of estimating population parameters from one sample using confidence intervals. This chapter will look at methods of confidence intervals and hypothesis testing for two populations. Since there are two populations, there are two random variables, two means or proportions, and two samples (though with paired samples you usually consider there to be one sample of collected pairs). Examples of where you would do this are:
Testing and estimating the difference in testosterone levels of men before and after they had children (Gettler, McDade, Feranil & Kuzawa, 2011).
Testing the claim that a diet works by looking at the weight before and after subjects are on the diet.
Estimating the difference in the proportion of those who approve of President Obama between the 18 to 26 year old age group and the 55 and over age group.
All of these are examples of hypothesis tests or confidence intervals for two populations. The methods to conduct these hypothesis tests and confidence intervals will be explored in this chapter. As a reminder, all hypothesis tests are the same process. The only thing that changes is the formula that you use and the assumptions. Confidence intervals are also the same process, except that the formula is different.
Two Proportions
There are times you want to test a claim about two population proportions or construct a confidence interval estimate of the difference between two population proportions. As with all other hypothesis tests and confidence intervals, the process is the same though the formulas and assumptions are different.
Hypothesis Test for Two Population Proportions (2-Prop Test)
- State the random variables and the parameters in words.
𝑥1 = number of successes from group 1
𝑥2 = number of successes from group 2
𝑝1 = proportion of successes in group 1
𝑝2 = proportion of successes in group 2
- State the null and alternative hypotheses and the level of significance
𝐻𝑜 ∶ 𝑝1 = 𝑝2
𝐻𝑎 ∶ 𝑝1 ≠ 𝑝2 (the ≠ can be replaced with < or > depending on the question). Also, state your 𝛼 level here.
- State and check the assumptions for a hypothesis test
- A simple random sample of size 𝑛1 is taken from population 1, and a simple random sample of size 𝑛2 is taken from population 2.
- The samples are independent.
- The assumptions for the binomial distribution are satisfied for both populations.
- The sampling distribution of 𝑝̂1 can be approximated as a normal distribution. To determine the sampling distribution of 𝑝̂1, you need to show that 𝑝1 ∗ 𝑛1 ≥ 5 and 𝑞1 ∗ 𝑛1 ≥ 5, where 𝑞1 = 1 − 𝑝1. If this requirement is true, then the sampling distribution of 𝑝̂1 is well approximated by a normal curve. The sampling distribution of 𝑝̂2 can be approximated as a normal distribution. To determine the sampling distribution of 𝑝̂2, you need to show that 𝑝2 ∗ 𝑛2 ≥ 5 and 𝑞2 ∗ 𝑛2 ≥ 5, where 𝑞2 = 1 − 𝑝2. If this requirement is true, then the sampling distribution of 𝑝̂2 is well approximated by a normal curve. However, if you do not know 𝑝1 and 𝑝2, you will need to use 𝑝̂1 and 𝑝̂2 instead. This is not perfect, but it is the best you can do.
- Find the sample statistics, test statistic, and p-value
On R Studio, use the command
prop.test(c(x1,x2), c(n1, n2))
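As a quick illustration of how the pieces fit together, here is a minimal sketch with made-up counts (40 successes out of 200 in group 1 and 25 out of 150 in group 2); the numbers are hypothetical and only meant to show the workflow:

x1 <- 40; n1 <- 200   # hypothetical successes and sample size for group 1
x2 <- 25; n2 <- 150   # hypothetical successes and sample size for group 2

# check the normal-approximation conditions: each value should be at least 5
c(n1*(x1/n1), n1*(1 - x1/n1), n2*(x2/n2), n2*(1 - x2/n2))

# two-sided test; add alternative = "less" or "greater" for a one-sided claim
prop.test(c(x1, x2), c(n1, n2))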
- Conclusion
This is where you write reject or fail to reject 𝐻𝑜. The rule is: if the p-value < 𝛼, then reject 𝐻𝑜. If the p-value ≥ 𝛼, then fail to reject 𝐻𝑜.
- Interpretation
This is where you interpret in real world terms the conclusion to the test. The conclusion for a hypothesis test is that you either have enough evidence to support 𝐻𝑎, or you do not have enough evidence to support 𝐻𝑎.
Confidence Interval for the Difference Between Two Population Proportions (2-Prop Interval)
The confidence interval for the difference in proportions has the same random variables and proportions and the same assumptions as the hypothesis test for two proportions. If you have already completed the hypothesis test, then you do not need to state them again. If you haven’t completed the hypothesis test, then state the random variables and proportions and state and check the assumptions before completing the confidence interval step.
- Find the sample statistics and the confidence interval
The confidence interval estimate of the difference is found using the following command in R Studio:
prop.test(c(x1,x2), c(n1, n2), conf.level=C as a decimal)
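If you only want the interval itself, you can store the result of the command above and extract it; a small sketch (result is an arbitrary object name, and x1, x2, n1, n2 stand for your counts and sample sizes):

result <- prop.test(c(x1, x2), c(n1, n2), conf.level = 0.95)
result$conf.int   # just the confidence interval for the difference in proportions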
- Statistical Interpretation: In general this looks like, “You are C% confident that the confidence interval contains the true difference in proportions.”
- Real World Interpretation: This is where you state how much more (or less) the first proportion is from the second proportion.
Example: Hypothesis Test for Two Population Proportions
Do husbands cheat on their wives in a different proportion than wives cheat on their husbands (”Statistics brain,” 2013)? Suppose you take a group of 1000 randomly selected husbands and find that 231 had cheated on their wives. Suppose in a group of 1200 randomly selected wives, 176 had cheated on their husbands. Do the data show that the proportion of husbands who cheat on their wives is different from the proportion of wives who cheat on their husbands? Test at the 5% level.
Solution:
- State the random variables and the parameters in words.
𝑥1 = number of husbands who cheat on their wives
𝑥2 = number of wives who cheat on their husbands
𝑝1 = proportion of husbands who cheat on their wives
𝑝2 = proportion of wives who cheat on their husbands
- State the null and alternative hypotheses and the level of significance
𝐻𝑜 ∶ 𝑝1 = 𝑝2
𝐻𝑎 ∶ 𝑝1 ≠ 𝑝2
level of significance is 𝛼 = 0.05
- State and check the assumptions for a hypothesis test
- A simple random sample of 1000 responses about cheating from husbands is taken. Check: This was stated in the problem. A simple random sample of 1200 responses about cheating from wives is taken. Check: This was stated in the problem.
- The samples are independent. Check: The samples are independent. This is true since the samples involved different genders.
- The properties of the binomial distribution are satisfied in both populations. Check: This is true since there are only two responses, there are a fixed number of trials, the probability of a success is the same, and the trials are independent.
- The sampling distributions of 𝑝̂1 and 𝑝̂2 can be approximated with a normal distribution. Check: 𝑛1 ∗ 𝑝̂1 = 231, 𝑛1 ∗ (1 − 𝑝̂1) = 769, 𝑛2 ∗ 𝑝̂2 = 176, and 𝑛2 ∗ (1 − 𝑝̂2) = 1024 are all greater than or equal to 5. So both sampling distributions of 𝑝̂1 and 𝑝̂2 can be approximated with a normal distribution.
- Find the sample statistics, test statistic, and p-value
On R use the command:
prop.test(c(231,176), c(1000, 1200))
##
##  2-sample test for equality of proportions with
##  continuity correction
##
## data:  c(231, 176) out of c(1000, 1200)
## X-squared = 25.173, df = 1, p-value = 5.241e-07
## alternative hypothesis: two.sided
## 95 percent confidence interval:
##  0.05050705 0.11815962
## sample estimates:
##    prop 1    prop 2
## 0.2310000 0.1466667
- Conclusion
Reject 𝐻𝑜, since the p-value is less than 5%.
- Interpretation
This is enough evidence to support that the proportion of husbands having affairs is different from the proportion of wives having affairs.
Example: Confidence Interval for Two Population Proportions
What is the difference between the proportion of husbands who cheat on their wives and the proportion of wives who cheat on their husbands (”Statistics brain,” 2013)? Suppose you take a group of 1000 randomly selected husbands and find that 231 had cheated on their wives. Suppose in a group of 1200 randomly selected wives, 176 cheated on their husbands. Estimate the difference in the proportion of husbands and wives who cheat on their spouses using a 95% confidence level.
Solution:
- State the random variables and the parameters in words. These were stated in example #9.1.1.
- State and check the assumptions for the confidence interval The assumptions were stated and checked in example #9.1.1.
- Find the sample statistics and the confidence interval
On R use the command:
prop.test(c(231,176), c(1000, 1200), conf.level = .95)
##
##  2-sample test for equality of proportions with
##  continuity correction
##
## data:  c(231, 176) out of c(1000, 1200)
## X-squared = 25.173, df = 1, p-value = 5.241e-07
## alternative hypothesis: two.sided
## 95 percent confidence interval:
##  0.05050705 0.11815962
## sample estimates:
##    prop 1    prop 2
## 0.2310000 0.1466667
- Statistical Interpretation: You are 95% confident that 0.05050705 < 𝑝1 −
𝑝2 < 0.11815962 contains the true difference in proportions.
- Real World Interpretation: The proportion of husbands who cheat on their wives is anywhere from 5.05% to 11.82% higher than the proportion of wives who cheat on their husbands.
Homework
In each problem show all steps of the hypothesis test or confidence interval. If some of the assumptions are not met, note that the results
of the test or interval may not be correct and then continue the process of the hypothesis test or confidence interval.
- Many high school students take the AP tests in different subject areas. In 2007, of the 144,796 students who took the biology exam 84,199 of them were female. In that same year, of the 211,693 students who took the calculus AB exam 102,598 of them were female (”AP exam scores,” 2013). Is there enough evidence to show that the proportion of female students taking the biology exam is different than the proportion of female students taking the calculus AB exam? Test at the 5% level.
- Many high school students take the AP tests in different subject areas. In 2007, of the 144,796 students who took the biology exam 84,199 of them were female. In that same year, of the 211,693 students who took the calculus AB exam 102,598 of them were female (”AP exam scores,” 2013). Estimate the difference in the proportion of female students taking the biology exam and female students taking the calculus AB exam using a 90% confidence level.
- Many high school students take the AP tests in different subject areas. In 2007, of the 211,693 students who took the calculus AB exam 102,598 of them were female and 109,095 of them were male (”AP exam scores,” 2013). Is there enough evidence to show that the proportion of female students taking the calculus AB exam is different from the proportion of male students taking the calculus AB exam? Test at the 5% level.
- Many high school students take the AP tests in different subject areas. In 2007, of the 211,693 students who took the calculus AB exam 102,598 of them were female and 109,095 of them were male (”AP exam scores,” 2013). Estimate using a 90% level the difference in proportion of female students taking the calculus AB exam versus male students taking the calculus AB exam.
- Are there more children diagnosed with Autism Spectrum Disorder (ASD) in states that have larger urban areas than in states that are mostly rural? In the state of Pennsylvania, a fairly urban state, there are 245 eight year olds diagnosed with ASD out of 18,440 eight year olds evaluated. In the state of Utah, a fairly rural state, there are 45 eight year olds diagnosed with ASD out of 2,123 eight year olds evaluated (”Autism and developmental,” 2008). Is there enough evidence to show that the proportion of children diagnosed with ASD in Pennsylvania is different from the proportion in Utah? Test at the 1% level.
- Are there more children diagnosed with Autism Spectrum Disorder (ASD) in states that have larger urban areas than in states that are mostly rural? In the state of Pennsylvania, a fairly urban state, there are 245 eight year olds diagnosed with ASD out of 18,440 eight year olds evaluated. In the state of Utah, a fairly rural state, there are 45 eight year olds diagnosed with ASD out of 2,123 eight year olds evaluated (”Autism and developmental,” 2008). Estimate the difference in proportion of children diagnosed with ASD between Pennsylvania and Utah. Use a 98% confidence level.
- A child dying from an accidental poisoning is a terrible incident. Is it more likely that a male child will get into poison than a female child? To find this out, data was collected that showed that out of 1830 children between the ages one and four who passed away from poisoning, 1031 were males and 799 were females (Flanagan, Rooney & Griffiths, 2005). Do the data show that there is a different proportion of male children dying of poisoning than female children? Test at the 1% level.
- A child dying from an accidental poisoning is a terrible incident. Is it more likely that a male child will get into poison than a female child? To find this out, data was collected that showed that out of 1830 children between the ages one and four who passed away from poisoning, 1031 were males and 799 were females (Flanagan, Rooney & Griffiths, 2005). Compute a 99% confidence interval for the difference in proportions of poisoning deaths of male and female children ages one to four.
Paired Samples for Two Means
Are two populations the same? Is the average height of men taller than the average height of women? Is the mean weight less after a diet than before?
You can compare populations by comparing their means. You take a sample from each population and compare the statistics.
Anytime you compare two populations you need to know if the samples are independent or dependent. The formulas you use are different for different types of samples.
If how you choose one sample has no effect on the way you choose the other sample, the two samples are independent. The way to think about it is that in independent samples, the individuals in one sample are, overall, different from the units of observation in the other sample. This means that sample one has no effect on sample two. The sample values from one sample are not related or paired with values from the other sample.
If you choose the samples so that a measurement in one sample is paired with a measurement from the other sample, the samples are dependent, matched, or paired (often a before and after situation). You want to make sure that there is a meaning for pairing data values from one sample with a specific data value from the other sample. One way to think about it is that in dependent samples, the units of observation in one sample are the same units of observation as in the other sample, though there can be other reasons to pair values. This makes the sample values from each sample paired.
In tidy data, remember, each row is a unit of observation and each column is a variable. In paired samples, you would have two variables that you are working with. In independent samples, you would have a variable that distinguishes one unit of observation from another. As an example, in the Pulse data frame, consider the variables pulse_before and pulse_after. Since they are measured on the same unit of observation, comparing the two variables would be a paired samples analysis. However, comparing pulse_after with whether a person smokes would be comparing the variable pulse_after against the variable smokes to see if smoking affects a person's pulse rate after exercise. In this case, the units of observation would be different based on smoking yes or smoking no. Consider the variable smokes to be the factor that one is interested in seeing how it affects pulse rate.
Table #9.2.1: Data frame Pulse
Pulse<-read.csv("https://krkozak.github.io/MAT160/pulse.csv")
head(Pulse)
##   height weight age gender smokes alcohol exercise ran
## 1    170     68  22   male    yes     yes moderate sat
## 2    182     75  26   male    yes     yes moderate sat
## 3    180     85  19   male    yes     yes moderate ran
## 4    182     85  20   male    yes     yes      low sat
## 5    167     70  22   male    yes     yes      low sat
## 6    178     86  21   male    yes     yes      low sat
##   pulse_before pulse_after year
## 1           70          71   93
## 2           80          76   93
## 3           68         125   95
## 4           70          68   95
## 5           92          84   96
## 6           76          80   98
Example: Independent or Dependent Samples
Determine if the following are dependent or independent samples.
- Randomly choose 5 men and 6 women and compare their heights
Solution:
Independent, since there is no reason that one value belongs to another. The units of observation are not the same for both samples; they are definitely different. A way to think about this is that the knowledge that a man is chosen in one sample does not give any information about any of the women chosen in the other sample.
- Choose 10 men and weigh them. Give them a new diet drug and later weigh them again.
Solution:
Dependent, since each person’s before weight can be matched with their after weight. The units of observations are the same for both samples. A way to think about this is that the knowledge that a person weighs 400 pounds at the beginning will tell you something about their weight after the diet drug.
- Take 10 people and measure the strength of their dominant arm and their non-dominant arm.
Solution: Dependent, since you can match the two arm strengths. The units of observation are the same for both samples. So the knowledge of one person's dominant arm strength will tell you something about the strength of their non-dominant arm.
To analyze data when there are matched or paired samples, called dependent samples, you conduct a paired t-test. Since the samples are matched, you can find the difference between the values of the two random variables.
Hypothesis Test for Two Sample Paired t-Test
- State the random variables and the parameters in words.
𝑥1 = random variable 1
𝑥2 = random variable 2
𝜇1 = mean of random variable 1
𝜇2 = mean of random variable 2
- State the null and alternative hypotheses and the level of significance The hypotheses would be
𝐻𝑜 ∶ 𝜇1 = 𝜇2 or 𝐻𝑜 ∶ 𝜇1 − 𝜇2 = 0
𝐻𝑎 ∶ 𝜇1 ≠ 𝜇2 or 𝐻𝑎 ∶ 𝜇1 − 𝜇2 ≠ 0
However, since you are finding the differences, then you can actually think of
𝜇1 − 𝜇2 = 𝜇𝑑.
So the hypotheses could become
𝐻𝑜 ∶ 𝜇𝑑 = 0
𝐻𝑎 ∶ 𝜇𝑑 ≠ 0 Remember, you can replace ≠ with < or >. Also, state your 𝛼 level here.
- State and check the assumptions for the hypothesis test
- A random sample of n pairs is taken.
- The population of the difference between random variables is normally distributed. In this case the population you are interested in has to do with the differences that you find. It does not matter if each random variable is normally distributed. It is only important if the differences you find are normally distributed. Just as before, the t-test is fairly robust to the assumption if the sample size is large. This means that if this assumption isn’t met, but your sample size is quite large, then the results of the t-test are valid.
- Find the sample statistic, test statistic, and p-value
Realize that a paired test is a one sample t-test on the difference between two variables. So you are running a one-sample t-test on a new variable known as the difference variable. You need to create this difference variable by creating a new data frame. This is done in R Studio with the following command (the following shows how to create the variable difference for pulse_after-pulse_before on the data frame Pulse; change the variables and the data frame to your own):
Table #9.2.2: Data frame Pulse with Difference
Pulse<-read.csv("https://krkozak.github.io/MAT160/pulse.csv")
Pulse_new<-Pulse%>%
  mutate(difference=pulse_after-pulse_before)
head(Pulse_new)
##   height weight age gender smokes alcohol exercise ran
## 1    170     68  22   male    yes     yes moderate sat
## 2    182     75  26   male    yes     yes moderate sat
## 3    180     85  19   male    yes     yes moderate ran
## 4    182     85  20   male    yes     yes      low sat
## 5    167     70  22   male    yes     yes      low sat
## 6    178     86  21   male    yes     yes      low sat
##   pulse_before pulse_after year difference
## 1           70          71   93          1
## 2           80          76   93         -4
## 3           68         125   95         57
## 4           70          68   95         -2
## 5           92          84   96         -8
## 6           76          80   98          4
Notice R Studio created a new data frame called Pulse_new and that data frame has the variable difference. Now to conduct a paired t-test use the R Studio command
t.test(~difference variable, data=New Data Frame)
Note: if the 𝐻𝑎 is <, then the command becomes
t.test(~difference variable, data=New Data Frame, alternative="less")
Similarly for > put alternative="greater"
- Conclusion
This is where you write reject 𝐻𝑜 or fail to reject 𝐻𝑜. The rule is: if the p-value
< 𝛼, then reject 𝐻𝑜. If the p-value ≥ 𝛼, then fail to reject 𝐻𝑜.
- Interpretation
This is where you interpret in real world terms the conclusion to the test. The conclusion for a hypothesis test is that you either have enough evidence to support 𝐻𝑎, or you do not have enough evidence to support 𝐻𝑎.
Confidence Interval for Difference in Means from Paired Samples (t-Interval)
The confidence interval for the difference in means has the same random variables and means and the same assumptions as the hypothesis test for two paired samples. If you have already completed the hypothesis test, then you do not need to state them again. If you haven't completed the hypothesis test, then state the random variables and means, and state and check the assumptions before completing the confidence interval step.
- Find the sample statistic and confidence interval
Again, you will need to create a new data frame with a difference variable. Then on R Studio the command is
t.test(~difference variable, data=Data Frame, conf.level=C as a decimal)
- Statistical Interpretation: In general this looks like, “You are C% confident that the statement contains the true mean difference.”
- Real World Interpretation: This is where you state what interval contains the true mean difference.
How to check the assumptions of t-test and confidence interval:
In order for the t-test or confidence interval to be valid, the assumptions of the test must be met. So whenever you run a t-test or confidence interval, you must make sure the assumptions are met. So you need to check them. Here is how you do this:
- For the assumption that the sample is a random sample, describe how you took the samples. Make sure your sampling technique is random and that the samples were dependent.
- For the assumption that the population of the differences is normal, remember the process of assessing normality from chapter 6. Make sure you use the difference variable that was created in the new data frame, as shown in the sketch below.
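For example, a minimal sketch of this check, assuming the new data frame is called New_frame and the new variable is called difference (both names are placeholders for your own):

library(ggformula)   # provides gf_density() and gf_qq()
gf_density(~difference, data=New_frame)
gf_qq(~difference, data=New_frame)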
Example: Hypothesis Test for Paired Samples
Is the pulse rate after exercise different from the pulse rate before exercise for a woman who drinks alcohol? Use the data frame Pulse (table #9.2.3). Test at the 5% level.
Table #9.2.3: Data frame Pulse
Pulse<-read.csv("https://krkozak.github.io/MAT160/pulse.csv")
head(Pulse)
##   height weight age gender smokes alcohol exercise ran
## 1    170     68  22   male    yes     yes moderate sat
## 2    182     75  26   male    yes     yes moderate sat
## 3    180     85  19   male    yes     yes moderate ran
## 4    182     85  20   male    yes     yes      low sat
## 5    167     70  22   male    yes     yes      low sat
## 6    178     86  21   male    yes     yes      low sat
##   pulse_before pulse_after year
## 1           70          71   93
## 2           80          76   93
## 3           68         125   95
## 4           70          68   95
## 5           92          84   96
## 6           76          80   98
Code book for data frame Pulse See problem #3.2.5 in section 3.2.
Solution:
- State the random variables and the parameters in words.
𝑥1 = pulse of a woman who drinks alcohol after exercise
𝑥2 = pulse of a woman who drinks alcohol before exercise
𝜇1 = mean pulse of women who drink alcohol after exercise
𝜇2 = mean pulse of women who drink alcohol before exercise
- State the null and alternative hypotheses and the level of significance
𝐻𝑜 ∶ 𝜇1 = 𝜇2
𝐻𝑎 ∶ 𝜇1 ≠ 𝜇2
level of significance, 𝛼 = 0.05
- State and check the assumptions for the hypothesis test
- A random sample of pairs of pulse rates after and before exercise was taken (the analysis uses the 27 pairs from women who drink alcohol). Check: The data frame says that the data was collected from students in classes over several years. Though this was not a random sample, it is probably a representative sample.
- The population of the difference in after and before pulse rates is normally distributed. Check: To see if this is true, look at the density plot and the normal quantile plot for the difference between after and before. This variable must be created before the density plot and normal quantile plot can be created. In addition, a new data frame must be created for females who drink alcohol.
Table #9.2.4: Data frame Pulse of Females Who Drink Alcohol
Pulse_female<- Pulse%>%
  filter(gender=="female", alcohol=="yes")
head(Pulse_female)
##   height weight age gender smokes alcohol exercise ran
## 1    165     60  19 female    yes     yes      low ran
## 2    163     47  23 female    yes     yes      low ran
## 3    173     57  18 female     no     yes moderate sat
## 4    179     58  19 female     no     yes moderate ran
## 5    167     62  18 female     no     yes     high ran
## 6    173     64  18 female     no     yes      low sat
##   pulse_before pulse_after year
## 1           88         120   98
## 2           71         125   98
## 3           86          88   93
## 4           82         150   93
## 5           96         176   93
## 6           90          88   93
Now mutate the Pulse_female data frame to include a difference variable
Table #9.2.5: Data frame Pulse of Females Who Drink Alcohol with Difference
Pulse_female_new<-Pulse_female%>%
  mutate(difference=pulse_after-pulse_before)
head(Pulse_female_new)
##   height weight age gender smokes alcohol exercise ran
## 1    165     60  19 female    yes     yes      low ran
## 2    163     47  23 female    yes     yes      low ran
## 3    173     57  18 female     no     yes moderate sat
## 4    179     58  19 female     no     yes moderate ran
## 5    167     62  18 female     no     yes     high ran
## 6    173     64  18 female     no     yes      low sat
##   pulse_before pulse_after year difference
## 1           88         120   98         32
## 2           71         125   98         54
## 3           86          88   93          2
## 4           82         150   93         68
## 5           96         176   93         80
## 6           90          88   93         -2
Now create a density plot and normal quantile plot on the difference variable.
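These plots use the same gf_density and gf_qq commands that appear later in this section, applied to the new data frame:

gf_density(~difference, data=Pulse_female_new)
gf_qq(~difference, data=Pulse_female_new)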
(Figure 9.1: Density Plot of Differences in Pulse Rates)
(Figure 9.2: Normal Quantile Plot of Differences in Pulse Rates)
The density plot is not symmetrical and the normal quantile plot on the differences is not linear. So you cannot assume that the distribution of the difference in pulse rates is normal. It is good that the t-test is fairly robust when the sample is reasonably large. The filtered sample here contains 27 pairs (women who drink alcohol), which is only moderately large, so the robustness of the t-test helps, but the results should be interpreted with some caution.
- Find the sample statistic, test statistic, and p-value
On R Studio, use the command:
t.test(~difference, data=Pulse_female_new)
##
##  One Sample t-test
##
## data:  difference
## t = 4.1353, df = 26, p-value = 0.0003283
## alternative hypothesis: true mean is not equal to 0
## 95 percent confidence interval:
##  11.51152 34.26625
## sample estimates:
## mean of x
##  22.88889
- Conclusion
Since the p-value < 0.05, reject 𝐻𝑜.
- Interpretation
There is enough evidence to support that there is a difference in pulse rates before and after exercise for women who drink alcohol.
Example: Hypothesis Test for Paired Samples
The New Zealand Air Force purchased a batch of flight helmets. They then found out that the helmets didn't fit. In order to make sure that they order the correct size helmets, they measured the head sizes of recruits. To save money, they wanted to use cardboard calipers, but were not sure if they would be accurate enough. So they took 18 recruits and measured their heads with the cardboard calipers and also with metal calipers. The data frame is in table #9.2.6 (Helmet
Sizes for New Zealand Airforce, 2019). Do the data provide enough evidence to show that there is a difference in measurements between the cardboard and metal calipers? Use a 5% level of significance.
Table #9.2.6: Data for Head Measurements
Helmet<-read.csv("https://krkozak.github.io/MAT160/helmet.csv")
head(Helmet)
##   Cardboard Metal
## 1       146   145
## 2       151   153
## 3       163   161
## 4       152   151
## 5       151   145
## 6       151   150
Code book for data frame Helmet
Description After purchasing a batch of flight helmets that did not fit the heads of many pilots, the NZ Airforce decided to measure the head sizes of all recruits. Before this was carried out, information was collected to determine the feasibility of using cheap cardboard calipers to make the measurements, instead of metal ones which were expensive and uncomfortable. The data lists the head diameters of 18 recruits measured once using cardboard calipers and again using metal calipers. One question is whether there is any systematic difference between the two sets of calipers. One might also ask whether there is more variability in the cardboard calipers measurement than that of the metal calipers.
This data frame contains the following columns:
Cardboard: measurement using cardboard calipers (cm)
Metal: measurement using metal calipers (cm)
Source Helmet Sizes for New Zealand Airforce. (n.d.). Retrieved July 20, 2019, from http://www.statsci.org/data/oz/nzhelmet.html
References Data courtesy of Dr Stephen Legg. Seber and Lee (1998). Page 545.
Solution:
- State the random variables and the parameters in words.
𝑥1 = head measurement of recruit using cardboard caliper
𝑥2 = head measurement of recruit using metal caliper
𝜇1= mean head measurement of recruit using cardboard caliper
𝜇2 = mean head measurement of recruit using metal caliper
- State the null and alternative hypotheses and the level of significance
𝐻𝑜 ∶ 𝜇1 = 𝜇2
𝐻𝑎 ∶ 𝜇1 ≠ 𝜇2
level of significance, 𝛼 = 0.05
- State and check the assumptions for the hypothesis test
- A random sample of 18 pairs of head measurements of recruits with cardboard and metal calipers was taken. Check: This was not stated, but probably could be safely assumed.
- The population of the difference in head measurements between cardboard and metal calipers is normally distributed. Check: First create the difference variable, then the density plot and normal quantile plot.
Table #9.2.7: Data for Head Measurements with Difference Variable
Helmet_new<-Helmet%>%
  mutate(difference=Cardboard-Metal)
head(Helmet_new)
##   Cardboard Metal difference
## 1       146   145          1
## 2       151   153         -2
## 3       163   161          2
## 4       152   151          1
## 5       151   145          6
## 6       151   150          1
gf_density(~difference, data=Helmet_new)
gf_qq(~difference, data=Helmet_new)
(Figure 9.3: Density Plot of Differences in Head Measurements)
(Figure 9.4: Normal Quantile Plot of Differences in Head Measurements)
This density plot looks somewhat bell shaped. The normal quantile plot on the differences looks somewhat linear. So you can assume that the distribution of the difference in head measurements is normal.
- Find the sample statistic, test statistic, and p-value
Using R Studio the command is
t.test(~difference, data=Helmet_new)
##
##  One Sample t-test
##
## data:  difference
## t = 3.1854, df = 17, p-value = 0.005415
## alternative hypothesis: true mean is not equal to 0
## 95 percent confidence interval:
##  0.5440163 2.6782060
## sample estimates:
## mean of x
##  1.611111
The sample statistic is 1.6111, the test statistic is 3.1854, and the p-value is 0.005415.
- Conclusion
Since the p-value < 0.05, reject 𝐻𝑜.
- Interpretation
There is enough evidence to support that the mean head measurements using the cardboard calipers are not the same as when using the metal calipers. So it looks like the New Zealand Air Force shouldn’t use the cardboard calipers.
Example: Confidence Interval for Paired Samples
The New Zealand Air Force purchased a batch of flight helmets. They then found out that the helmets didn't fit. In order to make sure that they order the correct size helmets, they measured the head sizes of recruits. To save money, they wanted to use cardboard calipers, but were not sure if they would be accurate enough. So they took 18 recruits and measured their heads with the cardboard calipers and also with metal calipers. The data frame is in table #9.2.6 (Helmet Sizes for New Zealand Airforce, 2019). Estimate the difference in measurements between the cardboard and metal calipers using a 95% confidence interval.
Table #9.2.8: Data for Head Measurements
Helmet<-read.csv("https://krkozak.github.io/MAT160/helmet.csv")
head(Helmet)
##   Cardboard Metal
## 1       146   145
## 2       151   153
## 3       163   161
## 4       152   151
## 5       151   145
## 6       151   150
Solution:
- State the random variables and the parameters in words. These were stated in example #9.2.2.
- State and check the assumptions for the confidence interval
The assumptions were stated and checked in example #9.2.2.
- Find the sample statistic and confidence interval
Using R Studio the command is
t.test(~difference, data=Helmet_new, conf.level=0.95)
##
##  One Sample t-test
##
## data:  difference
## t = 3.1854, df = 17, p-value = 0.005415
## alternative hypothesis: true mean is not equal to 0
## 95 percent confidence interval:
##  0.5440163 2.6782060
## sample estimates:
## mean of x
##  1.611111
- Statistical Interpretation: You are 95% confident that 0.5440163 < 𝜇1 − 𝜇2 < 2.6782060 contains the true mean difference in head measurement between using the cardboard and the metal calipers.
- Real World Interpretation: The mean head measurement using the cardboard calipers is anywhere from 0.54 cm to 2.68 cm more than the head measurement using the metal calipers.
Examples #9.2.2 and #9.2.4 use the same data set, but one conducts a hypothesis test and the other conducts a confidence interval. Notice that the hypothesis test's conclusion was to reject 𝐻𝑜 and say that there was a difference in the means, and the confidence interval does not contain the number 0. If the confidence interval did contain the number 0, then that would mean that the two means could be the same. Since the interval did not contain 0, you could say that the means are different, just as in the hypothesis test. This means that the hypothesis test and the confidence interval can produce the same interpretation. Do be careful, though: you can run a hypothesis test with a particular significance level and a confidence interval with a confidence level that is not compatible with your significance level. In that case the conclusion from the confidence interval would not be the same as with the hypothesis test. So if you want to estimate the mean difference, then conduct a confidence interval. If you want to show that the means are different, then conduct a hypothesis test. As a reminder, the American Statistical Association (ASA) suggests not conducting hypothesis tests and instead creating confidence intervals.
Homework
In each problem show all steps of the hypothesis test or confidence interval. If some of the assumptions are not met, note that the results
of the test or interval may not be correct and then continue the process of the hypothesis test or confidence interval.
- The cholesterol level of patients who had heart attacks was measured multiple times after the heart attack. The researchers want to see if the cholesterol level of patients who have heart attacks changes as the time since their heart attack increases. The data is in table #9.2.9. Do the data show that the mean cholesterol level of patients that have had a heart attack changes as the time increases since their heart attack? Use day2 and day4 variables to answer the question. Test at the 1% level.
Table #9.2.9: Cholesterol Levels of Patients After Heart Attack
Cholesterol<-read.csv("https://krkozak.github.io/MAT160/cholesterol.csv")
head(Cholesterol)
##   patient day2 day4 day14
## 1       1  270  218   156
## 2       2  236  234    NA
## 3       3  210  214   242
## 4       4  142  116    NA
## 5       5  280  200    NA
## 6       6  272  276   256
Code book for Data Frame Cholesterol See problem 3.1.1 in Section 3.1 homework.
- The cholesterol level of patients who had heart attacks was measured multiple times after the heart attack. The researchers want to see if the cholesterol level of patients who have heart attacks changes as the time since their heart attack increases. The data is in table #9.2.9. Calculate a 98% confidence interval for the mean difference in cholesterol levels from day two to day four.
- All Fresh Seafood is a wholesale fish company based on the east coast of the
U.S. Catalina Offshore Products is a wholesale fish company based on the west coast of the U.S. Table #9.2.10 contains prices from both companies for specific fish types (”Seafood online,” 2013) (”Buy sushi grade,” 2013). Do the data provide enough evidence to show that fish cost different amounts from the west coast wholesaler and the east coast wholesaler? Test at the 5% level.
Table #9.2.10: Wholesale Prices of Fish in Dollars
Price <- read.csv("https://krkozak.github.io/MAT160/price.csv")
head(Price)
##             fish  east  west
## 1            Cod 19.99 17.99
## 2         Tilapi  6.00 13.99
## 3  Farmed Salmon 19.99 22.99
## 4 Organic Salmon 24.99 24.99
## 5 Grouper Fillet 29.99 19.99
## 6           Tuna 28.99 31.99
Code book for data frame Price
Description Price of fish was collected from two websites. One for Catalina Offshore Products (west coast) and the other for All Fresh Seafood (east coast) in 2013.
This data frame contains the following columns:
fish: type of fish for sale
east: price of fish from east coast supplier ($)
west: price of fish from west coast supplier ($)
Source Seafood online. (2013, November 20). Retrieved from http://www. allfreshseafood.com/
Buy sushi grade fish online. (2013, November 20). Retrieved from http://www. catalinaop.com/
References Websites of Catalina Offshore Products and All Fresh Seafood
- All Fresh Seafood is a wholesale fish company based on the east coast of the
U.S. Catalina Offshore Products is a wholesale fish company based on the west coast of the U.S. Table #9.2.10 contains prices from both companies for specific fish types (”Seafood online,” 2013) (”Buy sushi grade,” 2013). Find a 95% confidence interval for the mean difference in wholesale price between the east coast and west coast suppliers.
- The British Department of Transportation studied whether people avoid driving or shopping, or have more accidents, on Friday the 13th. They collected data from different locations (Friday the 13th, 2019). The data for each location on the two different dates is in table #9.2.11. Do the data show that, on average, a different number of people are engaged in activities on Friday the 13th? Test at the 5% level.
Table #9.2.11: Traffic Count
Traffic <- read.csv("https://krkozak.github.io/MAT160/traffic.csv")
head(Traffic)
##    source            year   X6th  X13th location
## 1 traffic      1990, July 139246 138548   7 to 8
## 2 traffic      1990, July 134012 132908  9 to 10
## 3 traffic 1991, September 137055 136018   7 to 8
## 4 traffic 1991, September 133732 131843  9 to 10
## 5 traffic  1991, December 123552 121641   7 to 8
## 6 traffic  1991, December 121139 118723  9 to 10
Code book for data frame Traffic
Description This file consists of three separate data sets, all of which address the issues of how superstitions regarding Friday the 13th affect human behavior, and whether Friday the 13th is an unlucky day. Scanlon, et al. collected data on traffic and shopping patterns and accident frequency for Fridays the 6th and 13th between October of 1989 and November of 1992.
For the first data set, the researchers obtained information from the British Department of Transport regarding the traffic flows between junctions 7 to 8 and junctions 9 to 10 of the M25 motorway. They collected the numbers of shoppers in nine different supermarkets in southeast England for the second data set. The third data set contains numbers of emergency admissions to hospitals due to transport accidents.
We present the three data sets in a combined format, with the variable “Data set” as an identifier that may be used to separate them.
This data frame contains the following columns:
source: which data set the data were obtained from
year: the year the data was collected
month: the month that the Friday was in
x6th: Number of cars passing through junction (traffic data set), shoppers for each supermarket (shopping data set), or admissions due to transport accidents (accident data set) on Friday the 6th
x13th: Number of cars passing through junction (traffic data set), shoppers for each supermarket (shopping data set), or admissions due to transport accidents (accident data set) on Friday the 13th
location: Motorway junction (traffic data set), supermarket location (shopping data set) or hospital (accident data set) to which the data correspond
Source (n.d.). Retrieved from https://www3.nd.edu/~busiforc/handouts/Data and Stories/t test/Friday The Thirteenth/Friday The Thirteenth Data.html
References Scanlon, T.J., Luben, R.N., Scanlon, F.L., Singleton, N. (1993), “Is Friday the 13th Bad For Your Health?,” BMJ, 307, 1584-1586.
- The British Department of Transportation studied whether people avoid driving or shopping, or have more accidents, on Friday the 13th. They collected data from different locations (Friday the 13th, 2019). The data for each location on the two different dates is in table #9.2.11. Do the data show that, on average, a different number of people are engaged in activities on Friday the 13th? Estimate the mean difference in activity count between the 6th and the 13th using a 95% confidence level.
- To determine if Reiki is an effective method for treating pain, a pilot study was carried out where a certified second-degree Reiki therapist provided treatment on volunteers. Pain was measured using a visual analogue scale (VAS) and a likert scale immediately before and after the Reiki treatment (Olson & Hanson, 1997). The data is in table #9.2.12. Do the data show that Reiki treatment reduces pain? Test at the 5% level.
Table #9.2.12: Pain Measures Before and After Reiki Treatment
Reiki<- read.csv("https://krkozak.github.io/MAT160/reki.csv")
head(Reiki)
##   vas.before vas.after likert_before likert_after
## 1          6         3             2            1
## 2          2         1             2            1
## 3          2         0             3            0
## 4          9         1             3            1
## 5          3         0             2            0
## 6          3         2             2            2
Code book for data frame Reiki: see problem #3.2.7 in Section 3.2 Homework.
- To determine if Reiki is an effective method for treating pain, a pilot study was carried out where a certified second-degree Reiki therapist provided treatment on volunteers. Pain was measured using a visual analogue scale (VAS) and a likert scale immediately before and after the Reiki treatment (Olson & Hanson, 1997). The data is in table #9.2.12. Compute a 90% confidence level for the mean difference in VAS score from before and after Reiki treatment.
- The female labor force participation rates (FLFPR) of women in countries from 1990 to 2018 are in table #9.2.13 (Labor force participation rate, female (% of female population ages 15+) (modeled ILO estimate), 2019). Do the data show that the mean female labor force participation rate in 1990 is different from that in 2018? Use a 5% level of significance.
Table #9.2.13: Female Labor Force Participation Rates
Labor <- read.csv("https://krkozak.github.io/MAT160/labor.csv")
head(Labor)
##   Country.Name Country.Code                    Region
## 1        Aruba          ABW Latin America & Caribbean
## 2  Afghanistan          AFG                South Asia
## 3       Angola          AGO        Sub-Saharan Africa
## 4      Albania          ALB     Europe & Central Asia
## 5      Andorra          AND     Europe & Central Asia
## 6   Arab World          ARB
##           IncomeGroup    y1990    y1991    y1992    y1993
## 1         High income       NA       NA       NA       NA
## 2          Low income 43.11500 43.12400 43.12900 43.07200
## 3 Lower middle income 74.94500 74.87900 74.82600 74.78200
## 4 Upper middle income 53.77100 56.29600 56.68700 55.74700
## 5         High income       NA       NA       NA       NA
## 6                     19.18997 19.24094 19.13159 19.29515
##      y1994    y1995    y1996    y1997    y1998    y1999
## 1       NA       NA       NA       NA       NA       NA
## 2 43.00300 43.01700 42.77000 42.55400 42.41300 42.33400
## 3 74.77000 74.78400 74.78300 74.80600 74.84600 74.89400
## 4 54.90400 53.74600 53.07500 53.81200 53.15400 52.25400
## 5       NA       NA       NA       NA       NA       NA
## 6 19.64479 19.66156 19.51602 19.27293 19.07511 19.53510
##      y2000    y2001    y2002    y2003    y2004    y2005
## 1       NA       NA       NA       NA       NA       NA
## 2 42.27400 42.53900 42.89900 43.28600 43.66100 44.02500
## 3 74.94100 74.96200 74.98400 75.01100 75.04800 75.09400
## 4 51.76900 51.11000 50.67900 49.75900 48.87800 48.05100
## 5       NA       NA       NA       NA       NA       NA
## 6 19.59284 19.52237 19.08892 19.32403 19.44488 19.53444
##      y2006    y2007    y2008    y2009    y2010    y2011
## 1       NA       NA       NA       NA       NA       NA
## 2 43.59700 43.19200 42.87300 42.70900 42.73500 43.32800
## 3 75.12600 75.16500 75.20900 75.25600 75.30700 75.34400
## 4 47.38900 46.80300 46.26900 44.94500 45.69300 47.10400
## 5       NA       NA       NA       NA       NA       NA
## 6 19.68183 20.17107 19.84730 20.05784 20.17166 20.27703
##      y2012    y2013    y2014    y2015    y2016    y2017
## 1       NA       NA       NA       NA       NA       NA
## 2 44.11700 45.03900 46.01700 47.00100 47.76600 48.47400
## 3 75.38900 75.43300 75.46500 75.47900 75.47000 75.45100
## 4 48.80600 44.65000 44.78900 47.67600 47.45900 47.31200
## 5       NA       NA       NA       NA       NA       NA
## 6 20.46453 20.76731 20.70378 20.51515 20.61605 20.56842
##      y2018
## 1       NA
## 2 48.66000
## 3 75.41200
## 4 47.19100
## 5       NA
## 6 20.58152
Code book for data frame Labor
Description Labor force participation rate, female (% of female population ages 15+)
This data frame contains the following columns:
Country Name: The name of a country around the world
Country Code: The 3 letter country code
Region: The location of the country in the world
IncomeGroup: The World Bank's income classification
y1990-y2018: Labor force participation rate, female (% of female population ages 15+) for the years 1990-2018
Source Labor force participation rate, female (% of female population ages 15+) (modeled ILO estimate). (n.d.). Retrieved July 20, 2019, from https://data.worldbank.org/indicator/SL.TLF.CACT.FE.ZS
References International Labour Organization, ILOSTAT database. Data retrieved in April 2019.
- The female labor force participation rates (FLFPR) of women in countries from 1990 to 2018 are in table #9.2.13 (Labor force participation rate, female (% of female population ages 15+) (modeled ILO estimate), 2019). Estimate the mean difference in the female labor force participation rate from 1990 to 2018 using a 95% confidence level.
- Is the pulse rate after exercise different from the pulse rate before exercise for a man who drinks alcohol but doesn’t smoke? Use the data frame Pulse (table #9.2.3). Test at the 5% level.
Table #9.2.14: Pulse Rate Before and After Exercise
Pulse<-read.csv("https://krkozak.github.io/MAT160/pulse.csv")
head(Pulse)
##   height weight age gender smokes alcohol exercise ran
## 1    170     68  22   male    yes     yes moderate sat
## 2    182     75  26   male    yes     yes moderate sat
## 3    180     85  19   male    yes     yes moderate ran
## 4    182     85  20   male    yes     yes      low sat
## 5    167     70  22   male    yes     yes      low sat
## 6    178     86  21   male    yes     yes      low sat
##   pulse_before pulse_after year
## 1           70          71   93
## 2           80          76   93
## 3           68         125   95
## 4           70          68   95
## 5           92          84   96
## 6           76          80   98
Code book for data frame Pulse See problem 3.2.5 in section 3.2.
- Table #9.2.14 contains pulse rates. Compute a 95% confidence interval for the mean difference in pulse rates from before and after exercise for males who drink but do not smoke.
Independent Samples for Two Means
This section will look at how to analyze data when the two samples collected are independent. As with all other hypothesis tests and confidence intervals, the process is the same, though the formulas and assumptions are different.
Hypothesis Test for Independent t-Test (2-Sample t-Test)
- State the random variables and the parameters in words.
𝑥1 = random variable 1
𝑥2= random variable 2
𝜇1= mean of random variable 1
𝜇2= mean of random variable 2
- State the null and alternative hypotheses and the level of significance The hypotheses would be
𝐻𝑜 ∶ 𝜇1 = 𝜇2
𝐻𝑎 ∶ 𝜇1 ≠ 𝜇2, the ≠ can be replaced with < or >
Also, state your 𝛼 level here.
- State and check the assumptions for the hypothesis test
- A random sample of size 𝑛1 is taken from population 1. A random sample of size 𝑛2 is taken from population 2. Note: the samples do not need to be the same size, but the test is more robust if they are.
- The two samples are independent.
- Population 1 is normally distributed. Population 2 is normally distributed. Just as before, the t-test is fairly robust to the assumption if the sample size is large. This means that if this assumption isn’t met, but your sample sizes are quite large, then the results of the t-test are valid.
- The population variances are unknown and not assumed to be equal. The old assumption is that the variances are equal. However, this assumption is no longer an assumption that most statisticians use. This is because it isn’t really realistic to assume that the variances are equal. So just assume the assumption of the variances being unknown and not assumed to be equal is true, and it will not be checked.
- Find the sample statistic, test statistic, and p-value
The command using R is
t.test(variable~factor, data=Data Frame)
Note: if the 𝐻𝑎 is <, then the command becomes
t.test(variable~factor, data=Data Frame, alternative="less")
Similarly for > put alternative="greater"
- Conclusion
This is where you write reject or fail to reject 𝐻𝑜. The rule is: if the p-value < 𝛼, then reject 𝐻𝑜. If the p-value ≥ 𝛼, then fail to reject 𝐻𝑜.
- Interpretation
This is where you interpret in real world terms the conclusion to the test. The conclusion for a hypothesis test is that you either have enough evidence to support 𝐻𝑎, or you do not have enough evidence to support 𝐻𝑎.
Confidence Interval for the Difference in Means from Two Independent Samples (2 Samp T-Int)
The confidence interval for the difference in means has the same random variables and means and the same assumptions as the hypothesis test for independent samples. If you have already completed the hypothesis test, then you do not need to state them again. If you haven't completed the hypothesis test, then state the random variables and means and state and check the assumptions before completing the confidence interval step.
- Find the sample statistic and confidence interval
On R Studio, the command is
t.test(variable~factor, data=Data Frame, conf.level=C as a decimal)
- Statistical Interpretation: In general this looks like, “You are C% confident that the interval contains the true mean difference.”
- Real World Interpretation: This is where you state what interval contains the true difference in means, though often you state how much more (or less) the first mean is from the second mean.
How to check the assumptions of two sample t-test and confidence interval:
In order for the t-test or confidence interval to be valid, the assumptions of the test must be met. So whenever you run a t-test or confidence interval, you must make sure the assumptions are met, and you need to check them. Here is how you do this (a short sketch of the normality check follows this list):
- For the random sample assumption, describe how you took the two sam- ples. Make sure your sampling technique is random for both samples.
- For the independent assumption, describe how they are independent sam- ples.
- For the assumption about each population being normally distributed, remember the process of assessing normality from chapter 6. Make sure you assess each sample separately.
- You do not need to check the variance assumption since it is being assumed.
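Here is a short sketch of that normality check for independent samples, using placeholder names (variable, factor, and Data_Frame stand for your own variable, grouping factor, and data frame); it mirrors the NHANES example below:

library(ggformula)   # provides gf_density() and gf_qq()
# density plot and normal quantile plot for each level of the factor
gf_density(~variable|factor, data=Data_Frame)
gf_qq(~variable|factor, data=Data_Frame)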
Example: Hypothesis Test for Two Means
The cholesterol levels of people vary for many reasons. The question is: do people with diabetes have different cholesterol levels from people who do not have diabetes? Use the NHANES data frame. Test at the 5% level.
Table #9.3.1: NHANES Data Frame
names(NHANES) #displays the names of the variables in a data frame
##  [1] "ID"              "SurveyYr"
##  [3] "Gender"          "Age"
##  [5] "AgeDecade"       "AgeMonths"
##  [7] "Race1"           "Race3"
##  [9] "Education"       "MaritalStatus"
## [11] "HHIncome"        "HHIncomeMid"
## [13] "Poverty"         "HomeRooms"
## [15] "HomeOwn"         "Work"
## [17] "Weight"          "Length"
## [19] "HeadCirc"        "Height"
## [21] "BMI"             "BMICatUnder20yrs"
## [23] "BMI_WHO"         "Pulse"
## [25] "BPSysAve"        "BPDiaAve"
## [27] "BPSys1"          "BPDia1"
## [29] "BPSys2"          "BPDia2"
## [31] "BPSys3"          "BPDia3"
## [33] "Testosterone"    "DirectChol"
## [35] "TotChol"         "UrineVol1"
## [37] "UrineFlow1"      "UrineVol2"
## [39] "UrineFlow2"      "Diabetes"
## [41] "DiabetesAge"     "HealthGen"
## [43] "DaysPhysHlthBad" "DaysMentHlthBad"
## [45] "LittleInterest"  "Depressed"
## [47] "nPregnancies"    "nBabies"
## [49] "Age1stBaby"      "SleepHrsNight"
## [51] "SleepTrouble"    "PhysActive"
## [53] "PhysActiveDays"  "TVHrsDay"
## [55] "CompHrsDay"      "TVHrsDayChild"
## [57] "CompHrsDayChild" "Alcohol12PlusYr"
## [59] "AlcoholDay"      "AlcoholYear"
## [61] "SmokeNow"        "Smoke100"
## [63] "Smoke100n"       "SmokeAge"
## [65] "Marijuana"       "AgeFirstMarij"
## [67] "RegularMarij"    "AgeRegMarij"
## [69] "HardDrugs"       "SexEver"
## [71] "SexAge"          "SexNumPartnLife"
## [73] "SexNumPartYear"  "SameSex"
## [75] "SexOrientation"  "PregnantNow"
Code book for data frame NHANES: type help("NHANES") in the R Console.
Solution:
- State the random variables and the parameters in words.
𝑥1 = Cholesterol level of people with diabetes
𝑥2 = Cholesterol level of people without diabetes
𝜇1 = mean cholesterol level of people with diabetes
𝜇2 = mean cholesterol level of people without diabetes
- State the null and alternative hypotheses and the level of significance The hypotheses would be
𝐻𝑜 ∶ 𝜇1 = 𝜇2
𝐻𝑎 ∶ 𝜇1 ≠ 𝜇2
level of significance, 𝛼 = 0.05
- State and check the assumptions for the hypothesis test
- A random sample of cholesterol levels of people with diabetes is taken. A random sample of cholesterol levels of people without diabetes is taken. Check: The NHANES data frame uses cluster sampling, which incorporates random sampling, so the sample is probably representative. This assumption has been met.
- The two samples are independent. Check: This is true because each person either has diabetes or does not, so the two groups involve different people.
- Population of all cholesterol levels of people who have diabetes is normally distributed. Population of all cholesterol levels of people without diabetes is normally distributed. Check:
gf_density(~TotChol|Diabetes, data=NHANES)
(Figure 9.5: Density Plot of Cholesterol Levels of People with and without Diabetes)
Both the yes group and the no group look somewhat bell shaped.
gf_qq(~TotChol|Diabetes, data=NHANES)
(Figure 9.6: Normal Quantile Plot of Cholesterol Levels of People with and without Diabetes)
Both the yes group and the no group look somewhat linear.
The population of all cholesterol levels of people who have diabetes is probably normally distributed. The population of all cholesterol levels of people who do not have diabetes is probably normally distributed.
- Find the sample statistic, test statistic, and p-value
The variable is cholesterol (TotChol), and we separate based on whether a person has diabetes or not. So the factor is Diabetes. Using R Studio the command would be
t.test(TotChol~Diabetes, data=NHANES)
##
##  Welch Two Sample t-test
##
## data:  TotChol by Diabetes
## t = 2.4286, df = 809.7, p-value = 0.01537
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
##  0.02105115 0.19851114
## sample estimates:
##  mean in group No mean in group Yes
##          4.887936          4.778155
- Conclusion
Reject 𝐻𝑜 since the p-value < 𝛼.
- Interpretation
This is enough evidence to support that people who have diabetes have different cholesterol levels on average from people who do not have diabetes.
Example: Confidence Interval in Two Samples
Cholesterol levels vary for many reasons. The question is how different the mean cholesterol level of people with diabetes is from that of people who do not have diabetes. Use the NHANES data frame to compute a 95% confidence interval.
Solution:
- State the random variables and the parameters in words. These were stated in example #9.3.1
- State and check the assumptions for the hypothesis test
The assumptions were stated and checked in example #9.3.1.
- Find the sample statistic and confidence interval
The variable is cholesterol level (TotChol), and the data are separated by whether a person has diabetes, so the factor is Diabetes. Using R Studio, the command would be
t.test(TotChol~Diabetes, data=NHANES, conf.level=0.95)
##
##  Welch Two Sample t-test
##
## data:  TotChol by Diabetes
## t = 2.4286, df = 809.7, p-value = 0.01537
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
##  0.02105115 0.19851114
## sample estimates:
##  mean in group No mean in group Yes
##          4.887936          4.778155
- Statistical Interpretation: You are 95% confident that the interval 0.02105115 < 𝜇1 − 𝜇2 < 0.19851114 contains the true difference in means.
- Real World Interpretation: The mean cholesterol level for people with diabetes is anywhere from 0.021 mmol/L to 0.199 mmol/L more than the mean cholesterol level for people without diabetes.
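If you only need the interval itself, you can pull it out of the t.test output directly. The sketch below is optional; the 99% level is shown purely to illustrate how conf.level changes the interval.

# Sketch: extract just the confidence interval from the t.test result.
t.test(TotChol ~ Diabetes, data = NHANES, conf.level = 0.95)$conf.int
# A higher confidence level gives a wider interval (illustration only):
t.test(TotChol ~ Diabetes, data = NHANES, conf.level = 0.99)$conf.int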
Example: Hypothesis Test for Two Means
The amount of sodium in beef and poultry hot dogs was measured (”SOCR 012708 id,” 2013). The data is in table #9.3.2. Is there enough evidence to show that beef hot dogs have a different amount of sodium on average than poultry hot dogs? Use a 5% level of significance.
Table #9.3.2: Hotdog Data
Hotdog <- read.csv("https://krkozak.github.io/MAT160/hotdog_beef_poultry.csv")
head(Hotdog)
##   type calories sodium
## 1 Beef      186    495
## 2 Beef      181    477
## 3 Beef      176    425
## 4 Beef      149    322
## 5 Beef      184    482
## 6 Beef      190    587
Code book for data frame Hotdog
Description Results of a laboratory analysis of the calorie and sodium content of major hot dog brands. Researchers for Consumer Reports analyzed three types of hot dog: beef, poultry, and meat (mostly pork and beef, but up to 15% poultry meat). The meat type was left out of this data frame so that a two-sample t-test could be performed.
This data frame contains the following columns:
type: Type of hot dog (beef or poultry)
calories: Calories per hot dog
sodium: Milligrams of sodium per hot dog
Source SOCR 012708 id data hotdogs. (2013, November 13). Retrieved from http://wiki.stat.ucla.edu/socr/index.php/SOCR_012708_ID_Data_HotDogs
References SOCR Home page: http://www.socr.ucla.edu
Solution:
- State the random variables and the parameters in words.
𝑥1 = sodium level in beef hotdogs
𝑥2 = sodium level in poultry hotdogs
𝜇1 = mean sodium level in beef hotdogs
𝜇2 = mean sodium level in poultry hotdogs
- State the null and alternative hypotheses and the level of significance
The hypotheses would be
𝐻𝑜 ∶ 𝜇1 = 𝜇2
𝐻𝑎 ∶ 𝜇1 ≠ 𝜇2
level of significance: 𝛼 = 0.05
- State and check the assumptions for the hypothesis test
- A random sample of 20 sodium levels in beef hotdogs is taken, and a random sample of 20 sodium levels in poultry hotdogs is taken. Check: The data source does not state whether either sample was randomly selected, but since Consumer Reports collected the data, it is safe to assume both samples were random.
- The two samples are independent. Check: These are different types of hotdogs so this is true.
- Population of all sodium levels in beef hotdogs is normally distributed. Population of all sodium levels in poultry hotdogs is normally distributed. Check:
gf_density(~sodium|type, data=Hotdog)
Figure 9.7: Density Plot of Sodium Levels in Hotdogs
The density plot for beef hotdogs looks somewhat bell shaped, but the density plot for poultry hotdogs does not look bell shaped.
gf_qq(~sodium|type, data=Hotdog)
The normal quantile plot for the sodium level in beef hotdogs looks somewhat linear. The normal quantile plot for the sodium level in poultry hotdogs does not look linear.

Figure 9.8: Normal Quantile Plot of Sodium Levels in Hotdogs

The population of all sodium levels in beef hotdogs may be normally distributed, but the population of all sodium levels in poultry hotdogs is probably not normally distributed. The sample size is not very large, so the results of the test may not be valid. A larger sample would be a good idea.
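One way to see the group sample sizes (and spreads) behind that concern is favstats() from the mosaic package, the same family of packages that provides the gf_ plotting commands used above; treat this as an optional sketch.

# Sketch: per-group summary statistics, assuming the mosaic package is installed.
library(mosaic)
favstats(sodium ~ type, data = Hotdog)
# The n column shows how many hot dogs are in each group.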
- Find the sample statistic, test statistic, and p-value
The variable is sodium level (sodium), and the data are separated by type of hot dog, so the factor is type. Using R Studio, the command is
t.test(sodium~type, data=Hotdog)
##
##  Welch Two Sample t-test
##
## data:  sodium by type
## t = -1.8798, df = 34.983, p-value = 0.06848
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
##  -120.325706    4.625706
## sample estimates:
##   mean in group Beef mean in group Poultry
##               401.15                459.00
- Conclusion: Fail to reject 𝐻𝑜 since the p-value ≥ 𝛼.
- Interpretation
There is not enough evidence to support the claim that beef hotdogs have a different mean sodium level than poultry hotdogs. (Though realize that the normality assumption is not satisfied for poultry hotdogs, so this interpretation may not be valid.)
Example: Confidence Interval for Two Independent Samples
The amount of sodium in beef and poultry hotdogs was measured (”SOCR 012708 id,” 2013). The data is in table #9.3.2. Find a 95% confidence interval for the difference in mean sodium levels between beef and poultry hotdogs.
Solution:
- State the random variables and the parameters in words. These were stated in example #9.3.3.
- State and check the assumptions for the hypothesis test
The assumptions were stated and checked in example #9.3.3.
- Find the sample statistic and confidence interval
The variable is sodium level (sodium), and the data are separated by type of hot dog, so the factor is type. Using R Studio, the command is
t.test(sodium~type, data=Hotdog, conf.level=0.95)
##
##  Welch Two Sample t-test
##
## data:  sodium by type
## t = -1.8798, df = 34.983, p-value = 0.06848
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
##  -120.325706    4.625706
## sample estimates:
##   mean in group Beef mean in group Poultry
##               401.15                459.00
- Statistical Interpretation: You are 95% confident that the interval −120.325706 < 𝜇1 − 𝜇2 < 4.625706 contains the true difference in mean sodium level between beef and poultry hot dogs.
- Real World Interpretation: The mean sodium level of beef hotdogs is anywhere from 120.33 mg less than the mean sodium level of poultry hotdogs to 4.63 mg more. (A negative difference means the beef mean is less than the poultry mean, and a positive difference means it is greater. Since the interval contains both negative and positive values, and in particular 0, the data do not clearly show which mean is larger.)
Do realize that the normality assumption is not satisfied for poultry hotdogs, so this interpretation may not be valid.
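If you would rather report the difference in the other direction (poultry minus beef), one option is to make Poultry the reference level of type before running t.test. This is only an optional illustration using base R's factor() and relevel().

# Optional sketch: flip the direction of the comparison by making
# "Poultry" the first (reference) level of type.
Hotdog$type <- relevel(factor(Hotdog$type), ref = "Poultry")
t.test(sodium ~ type, data = Hotdog, conf.level = 0.95)
# The interval endpoints are the same numbers with the signs flipped.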
Homework
In each problem show all steps of the hypothesis test or confidence interval. If some of the assumptions are not met, note that the results of the test or interval may not be correct and then continue the process of the hypothesis test or confidence interval.
- The NHANES data contains many variables. One variable is the income of households, derived from the middle income of the different income categories; the variable is called HHIncomeMid. Is there enough evidence to show that the mean income of males is different from the mean income of females? Test at the 1% level.
Table #9.3.3: NHANES

names(NHANES)

##  [1] "ID"              "SurveyYr"
##  [3] "Gender"          "Age"
##  [5] "AgeDecade"       "AgeMonths"
##  [7] "Race1"           "Race3"
##  [9] "Education"       "MaritalStatus"
## [11] "HHIncome"        "HHIncomeMid"
## [13] "Poverty"         "HomeRooms"
## [15] "HomeOwn"         "Work"
## [17] "Weight"          "Length"
## [19] "HeadCirc"        "Height"
## [21] "BMI"             "BMICatUnder20yrs"
## [23] "BMI_WHO"         "Pulse"
## [25] "BPSysAve"        "BPDiaAve"
## [27] "BPSys1"          "BPDia1"
## [29] "BPSys2"          "BPDia2"
## [31] "BPSys3"          "BPDia3"
## [33] "Testosterone"    "DirectChol"
## [35] "TotChol"         "UrineVol1"
## [37] "UrineFlow1"      "UrineVol2"
## [39] "UrineFlow2"      "Diabetes"
## [41] "DiabetesAge"     "HealthGen"
## [43] "DaysPhysHlthBad" "DaysMentHlthBad"
## [45] "LittleInterest"  "Depressed"
## [47] "nPregnancies"    "nBabies"
## [49] "Age1stBaby"      "SleepHrsNight"
## [51] "SleepTrouble"    "PhysActive"
## [53] "PhysActiveDays"  "TVHrsDay"
## [55] "CompHrsDay"      "TVHrsDayChild"
## [57] "CompHrsDayChild" "Alcohol12PlusYr"
## [59] "AlcoholDay"      "AlcoholYear"
## [61] "SmokeNow"        "Smoke100"
## [63] "Smoke100n"       "SmokeAge"
## [65] "Marijuana"       "AgeFirstMarij"
## [67] "RegularMarij"    "AgeRegMarij"
## [69] "HardDrugs"       "SexEver"
## [71] "SexAge"          "SexNumPartnLife"
## [73] "SexNumPartYear"  "SameSex"
## [75] "SexOrientation"  "PregnantNow"
- The NHANES data contains many variables. One variable is the income of households derived from the middle income of different income categories. The variable is called HHIncomeMid. Estimate with 95% confidence the mean difference in incomes between males and females in the U.S.
- A study was conducted that measured the total brain volume (TBV) of patients who had schizophrenia and patients who do not have schizophrenia. Table #9.3.4 contains the TBV of all the patients (”SOCR data oct2009,” 2013). Is there enough evidence to show that patients with schizophrenia have a different TBV on average than patients without schizophrenia? Test at the 10% level.
Table #9.3.4: Total Brain Volume of Patients
Brain <- read.csv("https://krkozak.github.io/MAT160/brain.csv")
head(Brain)

##   type  volume
## 1    n 1663407
## 2    n 1583940
## 3    n 1299470
## 4    n 1535137
## 5    n 1431890
## 6    n 1578698
Code book for data frame Brain
Description A study to measure the total brain volume (TBV) (in 𝑚𝑚3) of patients who had schizophrenia and patients who do not have schizophrenia.
This data frame contains the following columns:
type: whether the patient had schizophrenia (s) or did not have schizophrenia (n)
volume: the total brain volume of a patient (𝑚𝑚3)
Source SOCR data Oct2009 id ni. (2013, November 16). Retrieved from http://wiki.stat.ucla.edu/socr/index.php/SOCR_Data_Oct2009_ID_NI
References ”SOCR data oct2009,” 2013
- A study was conducted that measured the total brain volume (TBV) of patients who had schizophrenia and patients who do not have schizophrenia. Table #9.3.4 contains the TBV of all the patients (”SOCR data oct2009,” 2013). Compute a 90% confidence interval for the difference in mean TBV between patients with schizophrenia and patients without schizophrenia.
- The lengths (in kilometers) of rivers on the South Island of New Zealand and the body of water they flow into are listed in table #9.3.5 (Lee, 1994). Do the data provide enough evidence to show that, on average, the rivers that flow into the Pacific Ocean have a different length than the rivers that flow into the Tasman Sea? Use a 5% level of significance.
Table #9.3.5: Lengths of Rivers (km) Flowing to Pacific Ocean
Length <- read.csv("https://krkozak.github.io/MAT160/length.csv")
head(Length)

##      river length flowsto
## 1 Clarence    209 Pacific
## 2   Conway     48 Pacific
## 3    Waiau    169 Pacific
## 4  Hurunui    138 Pacific
## 5  Waipara     64 Pacific
## 6   Ashley     97 Pacific
Code book for data frame Length: see problem 3.1.2 in the Section 3.1 homework.
- The lengths (in kilometers) of rivers on the South Island of New Zealand and what body of water they flow into are listed in table #9.3.5 (Lee, 1994). Estimate the difference in mean lengths of rivers between rivers in New Zealand that travel to the Pacific Ocean and ones that travel to the Tasman Sea. Use a 95% confidence level.
- A vitamin K shot is given to infants soon after birth. Nurses at Northbay Healthcare were involved in a study to see if the way they handle the infants could reduce the pain the infants feel (”SOCR data nips,” 2013). The data frame is in table #9.3.6. Is there enough evidence to show that infants cried a different amount on average when held by their mothers than when held using conventional methods? Test at the 5% level.
Table #9.3.6: Crying Time of Infants Given Shots Using New Meth- ods
Crying <- read.csv("https://krkozak.github.io/MAT160/crying.csv")
head(Crying)

##    method crying
## 1 convent     63
## 2 convent      0
## 3 convent      2
## 4 convent     46
## 5 convent     33
## 6 convent     33
Code book for data frame Crying
Description Nurses at Northbay Healthcare were involved in a study to see if the way they handle the infants could reduce the pain the infants feel. One of the measurements taken was how long, in seconds, the infant cried after being given the shot. A random sample was taken from the group that was given the shot using conventional methods, and a random sample was taken from the group where the mother held the infant prior to and during the shot.
This data frame contains the following columns:
method: whether the infant was given the conventional method (convent) or the new method (new) prior to being given the vitamin K shot.
crying: how long the infant cried after being given a vitamin K shot (seconds)
Source SOCR data nips infantvitK shotdata. (2013, November 16). Retrieved from http://wiki.stat.ucla.edu/socr/index.php/SOCR_Data_NIPS_InfantVitK_ShotData
References ”SOCR data nips,” 2013
- A vitamin K shot is given to infants soon after birth. Nurses at Northbay Healthcare were involved in a study to see if the way they handle the infants could reduce the pain the infants feel (”SOCR data nips,” 2013). The data frame is in table #9.3.6. Calculate a 95% confidence interval for the difference in mean crying time after being given a vitamin K shot between infants held using conventional methods and infants held by their mothers.
Which Analysis Should You Conduct?
One of the most important concepts you need to understand is deciding which analysis to conduct for a particular situation. To help you figure out which analysis to conduct, there is a series of questions you should ask yourself.
- Does the problem deal with mean or proportion?
Sometimes the problem explicitly states the words mean or proportion, but other times you have to figure it out from the information you are given. If you counted the number of individuals who responded in the affirmative to a question, then you are dealing with a proportion. If you measured something, then you are dealing with a mean.
- Does the problem have one or two samples?
Look to see whether data were collected from one group or from two groups, in other words, whether the problem describes a single group or compares two different groups.
- If you have two samples, then you need to determine if the samples are independent or dependent.
If the individuals are different in the two samples, then most likely the samples are independent. If you can't tell, determine whether a data value from the first sample influences a data value in the second sample. In other words, can you pair data values together so that the difference between them has meaning? If the answer is yes, then the samples are paired (dependent). Otherwise, the samples are independent.
- Does the situation involve a hypothesis test or a confidence interval?
If the problem talks about “do the data show”, “is there evidence of”, “test to see”, then you are doing a hypothesis test. If the problem talks about “find the value”, “estimate the” or “find the interval”, then you are doing a confidence interval.
So if you have a situation with two independent samples, involving means, and a hypothesis test, then you conduct a two-sample independent t-test. You then look up the assumptions and the technology process for that test. Every hypothesis test involves the same six steps, and you just have to use the correct assumptions and calculations. Every confidence interval has the same five steps, and again you just need the correct assumptions and calculations. This is why it is so important to figure out which analysis you should conduct.
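As a rough reference, the comment-only sketch below maps each situation from the questions above to the R command used in this book. The names x, n, x1, x2, n1, n2, variable, factor, first, second, df, and claimed_value are placeholders, not real data.

# A rough map from situation to R command (placeholders only).
# One proportion:        prop.test(x, n, p = claimed_value)
# Two proportions:       prop.test(c(x1, x2), c(n1, n2))
# One mean:              t.test(df$variable, mu = claimed_value)
# Two independent means: t.test(variable ~ factor, data = df)
# Two paired means:      t.test(df$first, df$second, paired = TRUE)
# For a confidence interval, add conf.level = 0.95 (or another level)
# and read the interval from the output instead of the p-value.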
Data Sources:
AP exam scores. (2013, November 20). Retrieved from http://wiki.stat.ucla.edu/socr/index.php/SOCR_Data_Dinov_030708_APExamScores

Center for Disease Control and Prevention, Prevalence of Autism Spectrum Disorders – Autism and Developmental Disabilities Monitoring Network. (2008). Autism and developmental disabilities monitoring network-2012. Retrieved from website: http://www.cdc.gov/ncbddd/autism/documents/ADDM-2012-Community-Report.pdf

Flanagan, R., Rooney, C., & Griffiths, C. (2005). Fatal poisoning in childhood, England & Wales 1968-2000. Forensic Science International, 148:121-129. Retrieved from http://www.cdc.gov/nchs/data/ice/fatal_poisoning_child.pdf

Gettler, L. T., McDade, T. W., Feranil, A. B., & Kuzawa, C. W. (2011). Longitudinal evidence that fatherhood decreases testosterone in human males. The Proceedings of the National Academy of Sciences, PNAS 2011, doi: 10.1073/pnas.1105403108

Population reference bureau. (2013, October 8). Retrieved from http://www.prb.org/DataFinder/Topic/Rankings.aspx?ind=25

Helmet Sizes for New Zealand Airforce. (n.d.). Retrieved July 20, 2019, from http://www.statsci.org/data/oz/nzhelmet.html

Seafood online. (2013, November 20). Retrieved from http://www.allfreshseafood.com/

Buy sushi grade fish online. (2013, November 20). Retrieved from http://www.catalinaop.com/

(n.d.). Retrieved from https://www3.nd.edu/~busiforc/handouts/Data and Stories/t test/Friday The Thirteenth/Friday The Thirteenth Data.html

Labor force participation rate, female (% of female population ages 15+) (modeled ILO estimate). (n.d.). Retrieved July 20, 2019, from https://data.worldbank.org/indicator/SL.TLF.CACT.FE.ZS

SOCR 012708 id data hotdogs. (2013, November 13). Retrieved from http://wiki.stat.ucla.edu/socr/index.php/SOCR_012708_ID_Data_HotDogs

SOCR data Oct2009 id ni. (2013, November 16). Retrieved from http://wiki.stat.ucla.edu/socr/index.php/SOCR_Data_Oct2009_ID_NI

SOCR data nips infantvitK shotdata. (2013, November 16). Retrieved from http://wiki.stat.ucla.edu/socr/index.php/SOCR_Data_NIPS_InfantVitK_ShotData