13 Multiple Linear Regression

Let's say we want to make a prediction about the following four people who have these scores on the five OCEAN scales:

Openness Conscientousness Extraversion Agreeableness Neuroticism
Participant 1 9 8 6 6 5
Participant 2 8 9 5 5 6
Participant 3 5 6 8 8 9
Participant 4 5 6 9 9 8

And let's say we have this output from the multiple linear regression model we created using all of the five OCEAN scales as predictors:

Unstandardised Coefficient t-value p-value
Intercept 197.65 22.43 .0001
Openness -5.28 -7.68 .0001
Conscientiousness -5.68 -7.44 .0001
Extraversion -0.4 -0.65 .522
Agreeableness 0.64 0.69 .495
Neuroticism 1.89 2.56 .014
--- --- --- ---
Multiple R2 .7619 Adjusted R2 .7348
F-statistic 28.16 on 5 and 44 DF p-value .001

13.1 Making a Prediction

Now based on the information we have gained above we can actually use that to make predictions about a person we have not measured based on the formula:

\[\hat{Y} = b_{0} + b_{1}X_{1} + b_{2}X_{2} + b_{3}X_{3} + b_{4}X_{4} + b_{5}X_{5}\]

Well technically it is

\[\hat{Y} = b_{0} + b_{1}X_{1} + b_{2}X_{2} + b_{3}X_{3} + b_{4}X_{4} + b_{5}X_{5} + error\]

But we will disregard the error for now. Also you might have noticed the small hat about the \(Y\) making it \(\hat{Y}\) (pronounced Y-hat). This means we are making a prediction of Y (\(\hat{Y}\)) as opposed to an actually measured (i.e. observed) value (\(Y\)).

And to break that formula down a bit we can say:

  • \(b_{0}\) is the intercept of the model. In this case \(b_{0}\) = 197.65
  • \(b_{1}X_{1}\), for example, is read at the non-standardised coeffecient of the first predictor \(b_{1}\) multiplied by the measured value of the first predictor \(X_{1}\)
  • \(b_{1}\), \(b_{2}\), \(b_{3}\), \(b_{4}\), \(b_{5}\) are the non-standardised coefficient values of the different predictors in the model. There is one non-standardised coefficient for each predictor. For example, let's say Openness is our first predictor and as such \(b_{1}\) = -5.28
  • \(X_{1}\), \(X_{2}\), \(X_{3}\), \(X_{4}\), \(X_{5}\) are the measured values of the different predictors for a participant. For example, for Participant 1, again assuming Openness is our first predictor, then \(X_{1}\) = 9. If Conscientousness is our second predictor, Extraversion our third predictor, Agreeableness our fourth predictor, and Neuroticism our fifth predictor, then \(X_{2}\) = 8, \(X_{3}\) = 6, \(X_{4}\) = 6 and \(X_{5}\) = 5.

Then, using the information above, we know can start to fill in the information for Participant 1 as follows:

\[\hat{Y} = 197.65 + (-5.28 \times 9) + (-5.68 \times 8) + (-0.4 \times 6) + (0.64 \times 6) + (1.89 \times 5)\]

And if we start to work that through, dealing with the multiplications first, we see:

\[\hat{Y} = 197.65 + (-47.52) + (-45.44) + (-2.4) + (3.84) + (9.45)\]

Which then becomes:

\[\hat{Y} = 115.58\]

Giving a predicted value of \(\hat{Y}\) = 115.58, to two decimal places.

Likewise for Participant 2 we would have:

\[\hat{Y} = 197.65 + (-5.28 \times 8) + (-5.68 \times 9) + (-0.4 \times 5) + (0.64 \times 5) + (1.89 \times 6)\]

And if we start to work that through, dealing with the multiplications first, we see:

\[\hat{Y} = 197.65 + (-42.24) + (-51.12) + (-2) + (3.2) + (11.34)\]

Which then becomes:

\[\hat{Y} = 116.83\]

Giving a predicted value of \(\hat{Y}\) = 116.83, to two decimal places.

Likewise for Participant 3 we would have:

\[\hat{Y} = 197.65 + (-5.28 \times 5) + (-5.68 \times 6) + (-0.4 \times 8) + (0.64 \times 8) + (1.89 \times 9)\]

And if we start to work that through, dealing with the multiplications first, we see:

\[\hat{Y} = 197.65 + (-26.4) + (-34.08) + (-3.2) + (5.12) + (17.01)\]

Which then becomes:

\[\hat{Y} = 156.1\]

Giving a predicted value of \(\hat{Y}\) = 156.1, to two decimal places.

And finally for Participant 4 we would have:

\[\hat{Y} = 197.65 + (-5.28 \times 5) + (-5.68 \times 6) + (-0.4 \times 9) + (0.64 \times 9) + (1.89 \times 8)\]

And if we start to work that through, dealing with the multiplications first, we see:

\[\hat{Y} = 197.65 + (-26.4) + (-34.08) + (-3.6) + (5.76) + (15.12)\]

Which then becomes:

\[\hat{Y} = 154.45\]

Giving a predicted value of \(\hat{Y}\) = 154.45, to two decimal places.

And so in summary, based on our above model, and the above measured values, we would predict the following values:

  • Participant 1, \(\hat{Y}\) = 115.58
  • Participant 2, \(\hat{Y}\) = 116.83
  • Participant 3, \(\hat{Y}\) = 156.1
  • Participant 4, \(\hat{Y}\) = 154.45