# Multiple Regression Analysis

[1] A consumer products company wants to measure the effectiveness of different types of advertising media in the promotion of its products. Specially, two types of advertising media are to be considered: radio and television advertising and newspaper advertising (including the cost of discount coupons). A sample of 22 cities with approximately equal population is selected for study during a test period of one month. Each city is allocated a specific expenditure level both for radio and television advertising and for newspaper advertising. The sales of the product (in thousands of dollars) and also the levels of media expenditure (in thousands of dollars) during the test month are recorded, with the data set ADVERTISE.xls. Using EXCEL or PHStat2, answer the following:

Determine whether there is a significant relationship between sales and the two independent variables (radio advertising and newspaper advertising) at the 0.05 level of significance.

Interpret the meaning of the F-value.

State the multiple regression equation.

Interpret the meaning of the slopes b1 and b2 in this problem.

Interpret the meaning of the regression coefficient b0 in this problem.

Predict the average sales for a city in which radio and television advertising is $20,000 and newspaper advertising is $20,000.

Set up a 95% confidence interval estimate for the average sales for cities in which radio and television advertising is $20,000 and newspaper advertising is $20,000.

Set up a 95% confidence interval estimate for the sales for an individual city in which radio and television advertising is $20,000 and newspaper advertising is $20,000.

Determine the coefficient of multiple determination, r2.

Determine the adjusted r2.

Perform a residual analysis on your results and determine the adequacy of the fit of the model.

Set up a 95% confidence interval estimate of the population slope between sales and radio and television advertising.

At the 0.05 level of significance, determine whether each explanatory variable makes a significant contribution to the regression model. On the basis of these results, indicate the independent variables that should be included in this model.

Compute the coefficients of partial determination, r2Y1.2.and r2Y2.1.and interpret their meaning.

[2] A collector of antique grandfather clocks believes that the price (in dollars) received for the clocks at an antique auction increases with the age of the clocks and with the number of bidders. Thus the model is hypothesized is

where y = auction price, x1 = age of clock (years) and x2 = number of bidders.

A sample of 32 auction prices of grandfather clocks, along with their ages and the number of bidders, is given below (clock_bidders.xls).

State the multiple regression equation.

Interpret the meaning of the slopes b1 and b2 in the model.

Interpret the meaning of the regression coefficient b0.

Test H0: 2 = 0 against H1: 2 ≠ 0. Interpret your finding.

Use a 95% confidence interval to estimate 2. Interpret the p-value corresponding to the estimate 2. Does the confidence interval support your interpretation in d)?

Determine the coefficient of multiple determination r2 and interpret its meaning.

Perform a residual analysis on your results and determine the adequacy of the fit of the model.

Plot the residuals against the prices. Is there evidence of a pattern in the residuals? Explain.

At = 0.05, is there evidence of positive autocorrelation in the residuals?

Suppose a collector, having observed many auctions, believes that the rate of increase of the auction price with age will be driven upward by a large number of bidders. In other words, the collector believes that the age of clock and the number of bidders should interact. At the 0.05 level of significance, determine whether the interaction term (x1x x2) makes a significant contribution to the model. If so, what is the multiple regression equation?

On the basis of the results of a) - j), what is the most appropriate model? Explain.

https://brainmass.com/statistics/correlation-and-regression-analysis/multiple-regression-analysis-613422

#### Solution Preview

[1] A consumer products company wants to measure the effectiveness of different types of advertising media in the promotion of its products. Specially, two types of advertising media are to be considered: radio and television advertising and newspaper advertising (including the cost of discount coupons). A sample of 22 cities with approximately equal population is selected for study during a test period of one month. Each city is allocated a specific expenditure level both for radio and television advertising and for newspaper advertising. The sales of the product (in thousands of dollars) and also the levels of media expenditure (in thousands of dollars) during the test month are recorded, with the data set ADVERTISE.xls. Using EXCEL or PHStat2, answer the following:

Answers

Determine whether there is a significant relationship between sales and the two independent variables (radio advertising and newspaper advertising) at the 0.05 level of significance.

The significance of the regression model is tested using F-test.

Null hypothesis:

H0: There is no significant relationship between sales and the two independent variables.

Alternative hypothesis:

H1: There is significant relationship between sales and the two independent variables.

F-statistic = 58.1533

P-value = 0.000

Clearly F-statistic is significant, as the p-value is less than the significance level 0.05. The data provides sufficient evidence to conclude that there is a significant relationship between sales and the two independent variables (radio advertising and newspaper advertising).

Details

ANOVA

df SS MS F Significance F

Regression 2 2155643.6028 1077821.8014 58.1533 0.0000

Residual 19 352148.9881 18534.1573

Total 21 2507792.5909

Interpret the meaning of the F-value.

F-statistic follows F distribution with df (2, 19). Since the F-statistic is significant with p-value less than 0.05, we can conclude that the regression model is significant in predicting sales.

State the multiple regression equation.

The estimated regression equation is given by,

Sales = 33.1304 + 13.2306 * Radio + 19.8378 * Newspaper

Details

Coefficients Standard Error t Stat P-value

Intercept 33.1304 117.0835 0.2830 0.7803

Radio 13.2306 1.5181 8.7151 0.0000

Newspaper 19.8378 2.7656 7.1731 0.0000

Interpret the meaning of the slopes b1 and b2 in this problem.

b1 = 13.2306

For an increase in the radio advertising expenditure by $1, the sales increase by $13.23.

b2 = 19.8378

For an increase in the newspaper advertising expenditure by $1, the sales increase by $19.84.

Interpret the meaning of the regression coefficient b0 in this problem.

b0 = 33.1304

In the absence of radio and newspaper advertisements, the average sales will be $33.13.

Predict the average sales for a city in which radio and television advertising is $20,000 and newspaper advertising is $20,000.

Predicted sales is given by,

Sales = 33.1304 + (13.2306 * 20) + (19.8378 * 20) = $694.50 in ('000)

Set up a 95% confidence interval estimate for the average sales for cities in which radio and television advertising is $20,000 and newspaper advertising is $20,000.

95% confidence interval estimate for the average sales for cities in which radio and television advertising is $20,000 and newspaper advertising is $20,000 is given by,

(574.85, 814.14)

Set up a 95% confidence interval estimate for the sales for an individual city in which radio and television advertising is $20,000 and newspaper advertising is $20,000.

95% confidence interval estimate for the sales for an individual city in which radio and television advertising is $20,000 and newspaper advertising is $20,000 is given by,

(385.45, 1003.54)

Details

Confidence Interval Estimate and Prediction Interval

Data

Confidence Level 95%

1

Radio given value 20

Newspaper ...

#### Solution Summary

The solution provides step by step method for the calculation of regression analysis. Formula for the calculation and Interpretations of the results are also included.