# Descriptive Statistics & Correlation Coefficient

7. The figure below has six scatter diagrams for hypothetical data. The correlation coefficients, in scrambled order, are:

-0.85, -0.38, -1.00, 0.06, 0.97, 0.62

Match the scatter diagrams with the correlation coefficients.

9. Find the correlation coefficient for each of the three data sets shown below.

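| x | y | x | y | x | y |
| --- | --- | --- | --- | --- | --- |
| 1 | 5 | 1 | 1 | 1 | 2 |
| 1 | 3 | 1 | 2 | 1 | 2 |
| 1 | 5 | 1 | 1 | 1 | 2 |
| 1 | 7 | 1 | 3 | 1 | 2 |
| 2 | 3 | 2 | 1 | 2 | 4 |
| 2 | 3 | 2 | 4 | 2 | 4 |
| 2 | 1 | 2 | 1 | 2 | 4 |
| 3 | 1 | 3 | 2 | 3 | 6 |
| 3 | 1 | 3 | 2 | 3 | 6 |
| 4 | 1 | 4 | 3 | 4 | 8 |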
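As a starting point for the computation, here is a minimal Python sketch of the correlation coefficient, written as the average product of deviations divided by the product of the SDs. The function name `pearson_r` is an illustrative choice, not part of the exercise. It is demonstrated only on the third data set, where y = 2x exactly, so the result can serve as a sanity check before applying it to the other two.

```python
import math

def pearson_r(xs, ys):
    """Correlation coefficient: average product of deviations over the product of SDs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    # The factors of n cancel, leaving sxy / sqrt(sxx * syy)
    return sxy / math.sqrt(sxx * syy)

# The x values are the same in all three data sets
x = [1, 1, 1, 1, 2, 2, 2, 3, 3, 4]
y_third = [2, 2, 2, 2, 4, 4, 4, 6, 6, 8]  # third data set: y = 2x

r_third = pearson_r(x, y_third)
print(round(r_third, 4))  # a perfect increasing line should give 1.0
```

The same function can be called with the y columns of the first and second data sets to complete the exercise.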

1. The r.m.s. error of the regression line for predicting y from x is ______.

(i) SD of y

(ii) SD of x

(iii) r × SD of y

(iv) r × SD of x

(v) √(1 − r²) × SD of y

(vi) √(1 − r²) × SD of x
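One way to check a candidate answer is to compute the r.m.s. error directly, as the root-mean-square size of the residuals from the regression line, and compare it with expressions built from r and the two SDs. The sketch below does this for the first data set from question 9. The helper name `regression_rms_error` is hypothetical, and population SDs (dividing by n) are an assumption.

```python
import math

def regression_rms_error(xs, ys):
    """Fit the regression line of y on x; return (r, sd_x, sd_y, rms of residuals)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sd_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs) / n)
    sd_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys) / n)
    r = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n * sd_x * sd_y)
    slope = r * sd_y / sd_x              # regression slope for predicting y from x
    intercept = mean_y - slope * mean_x  # the line passes through the point of averages
    residuals = [y - (intercept + slope * x) for x, y in zip(xs, ys)]
    rms = math.sqrt(sum(e ** 2 for e in residuals) / n)
    return r, sd_x, sd_y, rms

# First data set from question 9
xs = [1, 1, 1, 1, 2, 2, 2, 3, 3, 4]
ys = [5, 3, 5, 7, 3, 3, 1, 1, 1, 1]

r, sd_x, sd_y, rms = regression_rms_error(xs, ys)
candidates = {
    "SD of y": sd_y,
    "SD of x": sd_x,
    "r * SD of y": r * sd_y,
    "r * SD of x": r * sd_x,
    "sqrt(1 - r^2) * SD of y": math.sqrt(1 - r ** 2) * sd_y,
    "sqrt(1 - r^2) * SD of x": math.sqrt(1 - r ** 2) * sd_x,
}
print("r.m.s. error of residuals:", round(rms, 4))
for label, value in candidates.items():
    print(f"{label}: {round(value, 4)}")
```

Comparing the printed residual r.m.s. with the candidate values shows which expression matches on this data set.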

Excel (Correlation)

The attached dataset scores.xls has two numeric variables:

- Midterm (scores)

- Final (scores)

Data set

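| midterm | final | gender |
| --- | --- | --- |
| 80 | 95 | F |
| 83 | 88 | M |
| 88 | 78 | F |
| 82 | 79 | F |
| 85 | 91 | M |
| 77 | 72 | M |
| 84 | 87 | F |
| 76 | 73 | M |
| 91 | 92 | M |
| 74 | 82 | F |
| 85 | 89 | M |
| 92 | 95 | F |
| 76 | 79 | M |
| 75 | 85 | M |
| 90 | 92 | M |
| 84 | 80 | F |
| 87 | 94 | F |
| 80 | 85 | F |
| 86 | 91 | M |
| 84 | 89 | F |
| 95 | 88 | F |
| 80 | 75 | M |
| 83 | 89 | M |
| 90 | 88 | M |
| 71 | 77 | F |
| 84 | 86 | M |
| 93 | 88 | F |
| 68 | 81 | F |
| 87 | 82 | F |
| 79 | 75 | M |
| 84 | 82 | M |
| 67 | 73 | F |
| 93 | 95 | M |
| 78 | 82 | F |
| 80 | 86 | M |
| 86 | 98 | M |
| 84 | 76 | F |
| 82 | 91 | F |
| 93 | 84 | M |
| 86 | 76 | M |
| 88 | 95 | M |
| 85 | 86 | F |
| 90 | 88 | M |
| 87 | 85 | F |
| 91 | 90 | F |
| 75 | 88 | F |
| 89 | 86 | F |
| 98 | 99 | F |
| 87 | 90 | M |
| 81 | 78 | F |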

1. Compute summary statistics for the two variables, plot the data, and comment on the relationship between midterm and final scores.

2. Compute the correlation coefficient, r, between midterm and final. Interpret your findings.
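The exercise is meant to be done in Excel. For readers who want to cross-check their spreadsheet results, the following is a rough Python sketch of the same two steps, with the 50 score pairs typed in from the data set above rather than read from scores.xls. The helper names `summarize` and `pearson_r` are illustrative, and the use of the population SD is an assumption (a spreadsheet's sample SD would differ slightly). Plotting is left out to keep the sketch dependency-free.

```python
import math
import statistics

# Scores transcribed from the data set, in row order
midterm = [80, 83, 88, 82, 85, 77, 84, 76, 91, 74,
           85, 92, 76, 75, 90, 84, 87, 80, 86, 84,
           95, 80, 83, 90, 71, 84, 93, 68, 87, 79,
           84, 67, 93, 78, 80, 86, 84, 82, 93, 86,
           88, 85, 90, 87, 91, 75, 89, 98, 87, 81]
final = [95, 88, 78, 79, 91, 72, 87, 73, 92, 82,
         89, 95, 79, 85, 92, 80, 94, 85, 91, 89,
         88, 75, 89, 88, 77, 86, 88, 81, 82, 75,
         82, 73, 95, 82, 86, 98, 76, 91, 84, 76,
         95, 86, 88, 85, 90, 88, 86, 99, 90, 78]

def summarize(values):
    """Basic summary statistics; SD here is the population SD (an assumption)."""
    return {
        "n": len(values),
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "sd": statistics.pstdev(values),
        "min": min(values),
        "max": max(values),
    }

def pearson_r(xs, ys):
    """Correlation coefficient from sums of deviation products."""
    mean_x, mean_y = statistics.mean(xs), statistics.mean(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

print("midterm:", summarize(midterm))
print("final:  ", summarize(final))
print("r =", round(pearson_r(midterm, final), 3))
```

The interpretation asked for in the exercise still has to come from the reader: the sign of r gives the direction of the association and its magnitude indicates how tightly the points cluster around a line.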

