Regression analysis

Life insurance companies are interested in predicting how long their customers will live, because their premiums and profitability depend on such numbers. An actuary for one insurance company gathered data on 100 recently deceased male customers; the data are contained in the file longevity.xls. He recorded each customer's age at death, the ages at death of his mother and father, the mean age at death of his grandmothers, the mean age at death of his grandfathers, and whether the deceased customer was a smoker.

The actual variables in the data set, longevity.xls, are:

Longevity = age of the male customer at death
Mother = mother's age at death
Father = father's age at death
Gmothers = mean age of grandmothers at death
Gfathers = mean age of grandfathers at death
Smoker = 1 if the customer was a smoker, 0 if the customer was not a smoker

Estimate a simple regression model showing the relationship between the customer's age at death and whether the customer was a smoker.

a. What percent of the variation in the customer's age at death is explained by whether the customer was a smoker?
b. What is your estimated value for beta1? Interpret what this says about the relationship between longevity and smoking.
c. Test the hypothesis that smoking has no influence on the customer's longevity against the alternative that it has a negative effect on longevity. Assume alpha = .05.
d. Develop a 95 percent prediction interval estimate of longevity for a particular male who smoked.
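
Parts a through d can be sketched in code. The following is a minimal illustration, not the posted solution: it fits a simple regression on a 0/1 dummy and reports the slope, R-squared, the t statistic for the slope, and a prediction interval. The data values are made up for illustration (they are not from longevity.xls), the function names `simple_ols` and `prediction_interval` are hypothetical, and the critical t value is assumed to come from a t table. With the real data (n = 100, 98 degrees of freedom) the one-tailed 5 percent critical value is about 1.66 and the two-sided 95 percent value is about 1.98.

```python
import math

def simple_ols(x, y):
    """Least-squares fit of y = b0 + b1*x; returns estimates and summary statistics."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    b1 = sxy / sxx
    b0 = y_bar - b1 * x_bar
    sse = sum((yi - (b0 + b1 * xi)) ** 2 for xi, yi in zip(x, y))
    sst = sum((yi - y_bar) ** 2 for yi in y)
    s = math.sqrt(sse / (n - 2))      # standard error of the regression
    se_b1 = s / math.sqrt(sxx)        # standard error of the slope
    return {"n": n, "b0": b0, "b1": b1, "r2": 1 - sse / sst, "s": s,
            "se_b1": se_b1, "t_b1": b1 / se_b1, "x_bar": x_bar, "sxx": sxx}

def prediction_interval(fit, x0, t_crit):
    """Interval for one new observation at x = x0 (not for the mean response)."""
    y_hat = fit["b0"] + fit["b1"] * x0
    half = t_crit * fit["s"] * math.sqrt(
        1 + 1 / fit["n"] + (x0 - fit["x_bar"]) ** 2 / fit["sxx"])
    return y_hat - half, y_hat + half

# Made-up illustration data, NOT the contents of longevity.xls.
smoker = [0, 0, 0, 0, 1, 1, 1, 1]
longevity = [80, 84, 78, 82, 70, 74, 68, 72]

fit = simple_ols(smoker, longevity)
print("b0, b1:", fit["b0"], fit["b1"])        # b1 is the smoker minus non-smoker mean difference
print("R-squared:", fit["r2"])                # part a: share of variation explained
print("t statistic for b1:", fit["t_b1"])     # part c: reject H0 if t < -t(0.05, n - 2)
# 2.447 is the two-sided 95% t value for n - 2 = 6 degrees of freedom (from a t table).
print("95% PI for a smoker:", prediction_interval(fit, 1, 2.447))
```

Because Smoker is a dummy variable, the intercept estimate equals the mean age at death of non-smokers and the slope estimate equals the difference between the smoker and non-smoker means.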

Now estimate the multiple regression model:

Longevity = beta0 + beta1*Smoker + beta2*Mother + beta3*Father + beta4*Gmothers + beta5*Gfathers

e. Did including Mother, Father, Gmothers, and Gfathers have an effect on the estimates of beta0 and beta1? If so, what were the effects? Why did these estimates change?
f. Explain and interpret what the coefficient estimates of beta2, beta3, beta4, and beta5 mean.
g. How has adding Mother, Father, Gmothers, and Gfathers into the model improved the explanatory power of the model? Explain.
h. Do the variables Gmothers and Gfathers have a significant effect on longevity? Explain.
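
For parts e through h, a multiple regression can be fitted by solving the normal equations, and one common way to judge Gmothers and Gfathers jointly is a partial F test that compares the error sum of squares of the model with them against a model without them. The sketch below assumes nothing beyond the Python standard library; the helper names `solve`, `ols`, and `partial_f` are hypothetical, and the demo data are made up from known coefficients purely as a sanity check (they are not from longevity.xls). Individual t tests on beta4 and beta5 are an alternative reading of part h.

```python
def solve(a, b):
    """Solve the square system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [bi] for row, bi in zip(a, b)]   # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):                 # back substitution
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x

def ols(columns, y):
    """Fit y on an intercept plus the given predictor columns; return (coefficients, SSE)."""
    rows = [[1.0] + list(vals) for vals in zip(*columns)]
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(k)]
    beta = solve(xtx, xty)
    sse = sum((yi - sum(b * v for b, v in zip(beta, r))) ** 2
              for r, yi in zip(rows, y))
    return beta, sse

def partial_f(sse_reduced, sse_full, q, n, k_full):
    """F statistic for H0: the q extra coefficients in the full model are all zero.
    k_full counts the predictors in the full model, excluding the intercept."""
    return ((sse_reduced - sse_full) / q) / (sse_full / (n - k_full - 1))

# Made-up sanity-check data, generated exactly from
# longevity = 20 - 8*smoker + 0.4*mother + 0.3*father (NOT from longevity.xls).
smoker = [0, 1, 0, 1, 0, 1, 0, 1]
mother = [70, 72, 80, 85, 90, 75, 65, 88]
father = [68, 75, 70, 80, 85, 60, 72, 78]
longevity = [20 - 8 * s + 0.4 * m + 0.3 * f
             for s, m, f in zip(smoker, mother, father)]

beta, sse = ols([smoker, mother, father], longevity)
print("estimated coefficients:", beta)   # should recover the generating coefficients
```

With the real data, `partial_f` would be called with the two error sums of squares, q = 2, n = 100, and k_full = 5, and the result compared with the F critical value for 2 and 94 degrees of freedom (roughly 3.1 at the 5 percent level).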

Reestimate the model, but this time do not include the variables Gmothers and Gfathers. In other words, estimate the following regression model:

Longevity = beta0 + beta1*Smoker + beta2*Mother + beta3*Father

i. How did removing Gmothers and Gfathers affect the model's goodness of fit?

j. Using this new model, develop a point estimate of the longevity of a male who smoked and whose mother passed away at the age of 78 and whose father died at the age of 90.
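
Parts i and j rest on two standard formulas, written out here as a sketch. The symbols are assumptions introduced for illustration: b_0 through b_3 denote the estimates of beta0 through beta3 from the three-predictor model, n is the sample size, k is the number of predictors, and SSE and SST are the error and total sums of squares. Adjusted R-squared is often preferred over plain R-squared when comparing models with different numbers of predictors, because plain R-squared cannot increase when variables are removed.

```latex
% Part i: adjusted R-squared penalizes extra predictors
\bar{R}^2 = 1 - \frac{SSE / (n - k - 1)}{SST / (n - 1)}

% Part j: point estimate for a smoker (Smoker = 1) with Mother = 78 and Father = 90
\hat{y} = b_0 + b_1(1) + b_2(78) + b_3(90)
```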

Solution Summary

The solution provides a step-by-step method for estimating the regression models. The formulas used in the calculations and interpretations of the results are also included.
