Share
Explore BrainMass

Regression analysis to predict hourly wage

This question refers to the estimated regressions in table 1 computed using data for 1988 from the U.S. Current Population Survey. The data set consists of information on 4000 full-time full-year workers. The highest educational achievement for each worker was either a high school diploma or a bachelor's degree. The worker's ages ranged from 25 to 34 years. The dataset also contained information on the region of the country where the person lived, marital status, and number of children. For the purposes of these exercises let
AHE = average hourly earnings (in 1998 dollars)
College = binary variable (1 if college, 0 if high school)
Female = binary variable (1 if female, 0 if male) Age = age (in years)
Ntheast = binary variable (1 if Region = Northeast, 0 otherwise)
Midwest = binary variable (1 if Region = Midwest, 0 otherwise)
South = binary variable (1 if Region = South, 0 otherwise)
West = binary variable (1 if Region = West, 0 otherwise)

a) Write a regression equation that uses these variables to predict average hourly earnings and be sure to define the Gender variable.

b) Explain how you avoid the dummy variable trap (Hint: why you omitted one regional variable and what happen if it is included).

c) Explain the important meanings of regional difference (compare to reference group).

Attachments

Solution Preview

See the attached file for data.

a) Write a regression equation that uses these variables to predict average hourly earnings and be sure to define the Gender variable.
Using the second regression (column two of the above table) we find:
Males: AHE=4.40 + 0.29 (X3)
Females: AHE= 4.40 -2.62 + 0.29 (X3) = 1.78 + 0.29 (X3)
This is the national average for wages for men and women, with age as a factor. To remove age as a factor, use the first regression. This gives you an average wage across all ...

Solution Summary

Using regression analysis to predict hourly wages for men and women in different regions of the country

$2.19