You work in the shipping and logistics department for Beast Buy, an American mail order company that specializes in pet food for VERY exotic pets. Beast Buy has four warehouses/distribution centers that provide product to each of four different regions in the US; Atlanta services the southeast, Boston services the northeast, Cleveland services the Midwest, and Denver services the west. Your manager has recently asked you to analyze the efficiency of these distribution centers.
The data tab contains data from each of the last 26 weeks for each of the four distribution centers. The first column simply denotes the week in which the data were collected. The second column indicates which warehouse the data are from (1=Atlanta, 2=Boston, 3=Cleveland, 4=Denver). The third column contains the distribution cost (in thousands of US dollars) associated with the particular warehouse in each week, and the final column contains data on the number of orders routed through each warehouse each week.
Use this data to answer the following questions. Problem 4.1 comes from week 9 material, 4.2 comes from week 10 material, and 4.3 relates to week 11 material. Because this is due before week 11, question 4.3 is extra credit. Should you encounter any difficulties with these problems, the optional problems below are very similar to the questions in this problem set, and the answers to the optional questions can be found in the back of the textbook. You can also request that the tutor work extensively with you on the optional problems.
"The first issue your boss has asked you to address is whether or not there are differences in distribution cost between each of the four warehouses. Use the 0.05 level of significance to:
a) Perform a one-way ANOVA to look for differences in distribution costs between warehouses.
b) If the results in (a) indicate that it is appropriate, use the Tukey-Kramer procedure to determine which distribution centers differ in mean distribution costs.
c) Briefly summarize (in plain English) your procedures and the results of (a) and (b) for your manager.
Excel Tips: When using the Data Analysis ToolPak, Excel requires that your data be formatted differently for ANOVA than for regression. The data as downloaded is formatted correctly for regression analysis, so you will have to transform your data prior to estimating the ANOVA.
"In addition to looking at differences between distribution centers, your manager also wants to know the relationship between the number of orders routed through each center and the distribution cost. Thus, the number of orders is your independent variable and the cost is your dependent variable.
a) Construct a scatter plot of the two variables.
b) Estimate a simple linear regression between these two variables.
c) Interpret the meaning of ?0 and ?1.
d) Predict the mean distribution costs of 500 orders, 1000 orders, and 1500 orders. Are these appropriate predictions?
e) Comment briefly on the predictive power/statistical significance of your estimates. "
EXTRA CREDIT. To answer this question, you will need to read the materials for week 11 of the class. Here, your task is to combine 4.1 and 4.2 into a single multiple regression model using dummy variables. To put the data in an Excel-friendly format, you will first need to create dummy variables for each of the different processing centers. a) Estimate a multiple regression model, again using cost as the dependent variable, however for your independent variables you will want to use orders AND your set of dummy variables (see technical note below).
b) Comment on the results from (a) in light of your results in 4.1 and 4.2. Does the coefficient on orders here match your results from 4.2? Do the coefficients on your dummy variables, and differences between the coefficients on your dummy variables, match up with your results from 4.1?
Technical note: For technical reasons owing to matrix calculus, you cannot estimate a regression with an exhaustive set of dummy variables. One of your dummies will have to be omitted from the model. For ease of interpretation when answering the question, I would suggest that you look at your results from the Tukey-Kramer test in 4.1: if your results there are such that one of the centers was significantly different from most/all of the rest, choose that one to omit in your regression. For example, if you find that 2 of the differences were significant, Boston/Atlanta and Boston/Cleveland, do not include the Boston dummy in your regression.
The solution provides step by step method for the calculation of regression analysis. Formula for the calculation and Interpretations of the results are also included. Levine, questions 11.14, 13.74 not included in response.