Purchase Solution

Partial Least Squares Regression

Not what you're looking for?

Ask Custom Question

TQ5: Partial Least Squares Regression
Investigate PLS models to predict the prize winnings ($1000s) given a variety of information about performance and success statistics for LPGA golfers in 2009. The table attached (see excel file) contains data related to performance and success statistics for LPGA golfers in 2009. The matrix X contains 11 predictor variables:
1. Average drive (yards)
2. Percent of fairways hit
3. Percent of greens reached in regulation
4. Average putts per round
5. Percent of sand saves (2 shots to hole)
6. Tournaments played in
7. Green in regulation putts per hole
8. Completed tournaments
9. Average percentile in tournaments (high is good)
10. Rounds completed
11. Average strokes per round
The column vector y contains the output variable, prize winnings ($1000s). For each variable in x and y. This assignment will look at the predictive ability of partial least squares regression (PLS) and compare it to the methods we've investigated previously in TQ3 and TQ4

1. Divide the data into training, test, and validation data sets. Use the same training, test, and validation data sets that you used in TQ3 and TQ4.
2. Use cross-validation to determine the appropriate number of latent variables for your PLS model. Be sure to describe the cross-validation method used
3. Analyze the loadings of the first few LVs. What do these tell you about the relationships between the inputs and between the inputs and the output?
4. Compare the LV loadings to the PC loadings from TQ4. What similarities and differences are there in the LVs and PCs? Explain any similarities or differences in the context of PLS and PCA (the correlation of PCs to the output may be helpful here!).
5. Compare the validation performance of your PLS model with that of your best PCR and regression models. Comment on the results.
6. Is there evidence that a nonlinear PLS model would outperform the linear PLS? Explain your reasoning (but you don't need to develop the nonlinear PLS model).

Purchase this Solution

Solution Summary

Using PLS models in MINITAB, the Solution predicts the prize winnings based on the performance and success statistics for LPGA golfers.

Solution provided by:
Education
  • BSc, University of Bucharest
  • MSc, Ovidius
  • MSc, Stony Brook
  • PhD (IP), Stony Brook
Recent Feedback
  • "Thank you "
  • "Thank You Chris this draft really helped me understand correlation."
  • "Thanks for the prompt return. Going into the last meeting tonight before submission. "
  • "Thank you for your promptness and great work. This will serve as a great guideline to assist with the completion of our project."
  • "Thanks for the product. It is an excellent guideline for the group. "
Purchase this Solution


Free BrainMass Quizzes
Measures of Central Tendency

Tests knowledge of the three main measures of central tendency, including some simple calculation questions.

Measures of Central Tendency

This quiz evaluates the students understanding of the measures of central tendency seen in statistics. This quiz is specifically designed to incorporate the measures of central tendency as they relate to psychological research.

Know Your Statistical Concepts

Each question is a choice-summary multiple choice question that presents you with a statistical concept and then 4 numbered statements. You must decide which (if any) of the numbered statements is/are true as they relate to the statistical concept.

Terms and Definitions for Statistics

This quiz covers basic terms and definitions of statistics.