Purchase Solution

Regression model using MATLAB

Not what you're looking for?

Ask Custom Question

The table attached (see excel file) contains data related to performance and success statistics for LPGA golfers in 2009. The matrix X contains 11 predictor variables:
1. Average drive (yards)
2. Percent of fairways hit
3. Percent of greens reached in regulation
4. Average putts per round
5. Percent of sand saves (2 shots to hole)
6. Tournaments played in
7. Green in regulation putts per hole
8. Completed tournaments
9. Average percentile in tournaments (high is good)
10. Rounds completed
11. Average strokes per round
The column vector y contains the output variable, prize winnings ($1000s). For each variable in x and y.
1. Divide the data into training, test, and validation data sets and describe how you divided the data into these three sets and why it is appropriate.
2. Develop a linear regression model to predict the prize winnings ($1000s) using the 11 other variables. Then, select a subset of these variables and develop a competing model. Use the correlation coefficients to explain why this representation works. Remember to pad your inputs with a column of ones to allow for a non-zero y-intercept.
3. Find the single best predictor of prize winnings ($1000s) and the pair of predictors that performs best.
4. Identify or derive any additional inputs that may be good predictors of prize winnings ($1000s) based on nonlinear relationships of the available predictors. Use linear-in-parameters regression with the non-linear term(s) to evaluate any model improvement.
5. Compare the performance of all models using the root mean squared error (RMSE) of the test data set. Select the best model. Explain why it is the best.
6. Find the validation error of your best model.

Purchase this Solution

Solution Summary

The table attached (see excel file) contains data related to performance and success statistics for LPGA golfers in 2009.

Solution provided by:
Education
  • BSc, University of Bucharest
  • MSc, Ovidius
  • MSc, Stony Brook
  • PhD (IP), Stony Brook
Recent Feedback
  • "Thank you "
  • "Thank You Chris this draft really helped me understand correlation."
  • "Thanks for the prompt return. Going into the last meeting tonight before submission. "
  • "Thank you for your promptness and great work. This will serve as a great guideline to assist with the completion of our project."
  • "Thanks for the product. It is an excellent guideline for the group. "
Purchase this Solution


Free BrainMass Quizzes
Know Your Statistical Concepts

Each question is a choice-summary multiple choice question that presents you with a statistical concept and then 4 numbered statements. You must decide which (if any) of the numbered statements is/are true as they relate to the statistical concept.

Terms and Definitions for Statistics

This quiz covers basic terms and definitions of statistics.

Measures of Central Tendency

Tests knowledge of the three main measures of central tendency, including some simple calculation questions.

Measures of Central Tendency

This quiz evaluates the students understanding of the measures of central tendency seen in statistics. This quiz is specifically designed to incorporate the measures of central tendency as they relate to psychological research.