2. Suppose your organization is committed to the promotion of public health in Boston, and your department specifically focuses on teenagers. With the permission of several area middle and high schools, your organization fielded a survey to a large group of students. The questionnaire included several health behavior related questions, including whether the student smokes, and if so, what age they started and how many cigarettes/packs per week they typically smoke. For this problem, we will only be looking at a small portion of the data set - specifically the high school seniors from the sample who indicated that they smoke. The data set (attached as an Excel file) includes the age at which these high school students began smoking.
a) Using SPSS, calculate the mean, SD, and SE.
b) Using SPSS, create a histogram for the sample
c) Estimate the population mean using the 95% confidence interval
d) State your conclusion following the format we discussed in class (however, instead of using "population mean" substitute in what the population mean actually represents - in this case "the mean starting age of Boston teenagers who smoke")
e) Assuming that the true population mean (which in this case refers to the starting age of teenage smokers in the Boston area) is 15.5, create a sampling distribution (distribution of the sample means) and mark out 3 standard errors (SE) in each direction. (This can be drawn by hand).
f) Referring to your graph (and considering other possible sample means you could have obtained), explain why you are 95% confident that your confidence interval in part (d) "captures" the true population mean.
g) Is it possible that you could have a sample (no matter how well you designed the study) in which the mean and resulting confidence interval would not include the true population mean? If so, what is the probability of this occurring? Explain.
This posting contains solution to following problem on Statistics using SPSS.