15 One-Way Within-Subjects tables

Notes:

This chapter is currently under development but will eventually show how to complete ANOVAs and ANOVA tables by hand.
To show you how very similar a between groups and within groups ANOVA are, we have tweaked the study from the One-Way Between-Subjects Scenario previously introduced.
We use the term repeated measures and within-subjects interchangably here
My thanks to Dr Helena Paterson for building this chapter.

15.1 The ANOVA table

One-Way Within-Subjects Scenario: A research team are investigating the influence of mainstream media on life happiness. They record Life Happiness scores from 0 to 100 with higher scores meaning more happy with life from a large cohort of participants (n = 153) and record their life happiness scores at two time points; once before a happiness intervention and once after the intervention. The researchers want to check whether there is a difference in terms of Life Happiness due to the intervention. Calculate F from the below one-way ANOVA output table comparing mean Life Happiness scores across the two time points and state whether there is a significant difference between the pre- and post-intervention happiness scores. The Sums of Squares (\(SS\)) have been given to you.

Conditions	SS	df	Mean Square	F-value
Between	44.56
Within	2126.3
Total	2170.86

In order to complete the above table we need to:

determine the degrees of freedom (\(df\)) of the Between and the Within components
use the \(df\) to calculate the Mean Squares (\(MS\))
use the Mean Squares to calculate the F-value

Degrees of Freedom

In an ANOVA there are two degrees of freedom. The first is the degrees of freedom related to the number of groups/conditions/models and is usually called \(df_{Between}\), \(df_{Condition}\), or \(df_{Model}\). They all refer to the same idea. Regardless of what you call it, this df is calculated as:

\[df_{between} = k - 1\]

Where \(k\) is the number of conditions or groups.

The second degrees of freedom is related to the number of participants/observations/data-points and is usually called \(df_{Within}\) or \(df_{Model}\); again both referring to the same idea. This df is calculated as:

\[df_{within} = ((N \times k) - 1) - (N - 1) - (k - 1)\]

Where \(N\) is total number of participants in the whole experiment and \(k\) is again the number of conditions or times that you observed the participants. \(N \times k\) is essentially all the observations in your study because you collected \(k\) data points from \(N\) participants.

For example, if you had a total of 5 participants in your experiment (\(N\) = 5) and you measured them three times on the same dependent variable (\(k\) = 3) then you would do:

\[df_{within} = ((5 \times 3)-1) - (5-1) - (3-1) = 8\]

However, the \(df_{within}\) can also be phrased more simply as:

\[df_{within} = (N-1)(k-1)\]

This can be read as:

subtracting one from the total number of participants, \(N\),
subtracting one from the total number of conditions, \(k\),
and then multiplying those two values together.

Which would, assuming the five participants and three groups, look like this:

\[df_{within} = (N-1)(k-1) = (5 - 1) \times (3 - 1) = 4 \times 2 = 8\]

So, returning to the original study we were looking at, first we see each person's happiness has been measured twice (before and after intervention) - meaning that \(k\) = 2. As such:

\[df_{between} = (k-1) = 2 - 1 = 1\]

Next we see that we have a total number of participants of \(N\) = 153. And if we put that into the formula for the \(df_{Within}\) we see:

\[df_{Within} = (N-1)\times(k-1) = (153 - 1)\times(2 - 1) = 152\]

Meaning that we have \(df_{Between}\) = 1, and \(df_{Within}\) = 152, and we can fill those into our table as such:

Conditions	\(SS\)	\(df\)
Between	44.56	1
Within	2126.3	152
Total	2170.86

Mean Squares

Great! Now the next step is to calculate the Mean Square value for both the Between (i.e. Groups) and Within (i.e. Residuals) components. The Mean Square is really an abbreviation of the Mean Sums of Squares and is the Sums of Squares of a component divided by the degrees of freedom. Or in other words:

\[MS = \frac{SS}{df}\]

Looking at the table above we have all the information we need to calculate both the \(MS_{Between}\) and the \(MS_{Within}\). They would look like:

\[MS_{Between} = \frac{SS_{Between}}{df_{Between}} = \frac{44.56}{1} = 44.56\]

And

\[MS_{Within} = \frac{SS_{Within}}{df_{Within}} = \frac{2126.3}{152} = 13.9888158\]

Meaning that we have \(MS_{Between}\) = 44.56 and \(MS_{Within}\) = 13.9888158, which we can add in to our ANOVA table as such:

Conditions	\(SS\)	\(df\)	Mean Square
Between	44.56	1	44.56
Within	2126.3	152	13.9888158
Total	2170.86

The F-value

Awesome. Finally, we need to calculate our F-value which is the ratio between the \(MS_{Between}\) and \(MS_{Within}\), and can be written as:

\[F = \frac{MS_{Between}}{MS_{Within}}\]

And so if we put our numbers in from above we see:

\[F = \frac{MS_{Between}}{MS_{Within}} = \frac{44.56}{13.9888158} = 3.1854019\]

Giving us an F-value of F = 3.19 (to two decimal places), which we can now put into our table to complete our table as such:

Conditions	\(SS\)	\(df\)	Mean Square	F-value
Between	44.56	1	44.56	3.19
Within	2126.3	152	13.9888158
Total	2170.86

Effect size

There are a few effect sizes that people use for the ANOVA with one of the more common ones being what is called partial eta-squared with the symbol \(\eta_p^2\) (as \(\eta\) is the symbol for "eta", and is pronounced "eat-ah", and the \(_p^2\) indicating "partial" and "squared"). The formula for partial eta-squared is:

\[\eta_{p^2} = \frac{SS_{Between}}{SS_{Between}+SS_{Within}}\]

From above, we know that:

\(SS_{Between}\) = 44.56 and
\(SS_{Within}\) = 2126.3

And if we start to fill those into the formula we see:

\[\eta_p^2 = \frac{44.56}{44.56 + 2126.3} = \frac{44.56}{2170.86} = 0.0205264\]

Meaning that this analysis has an effect size of \(\eta_p^2\) = 0.02 (to two decimal places).

The Write-UP

The standard APA format for writing up an ANOVA is usually:

F(\(df_{Between}\), \(df_{Within}\)) = F-value, p = p-value, \(\eta_p^2\) = effectsize-value

From above we know the two dfs and we know the F-value and the effect size, so what we need now is to determine the significance.

Determining Significance

If we were to look at a critical values look-up table for \(df_{Between}\) = 1, \(df_{Within}\) = 152, and \(\alpha = .05\), we would see that the closest we have is for \(df_{Between}\) = 1, \(df_{Within}\) = 100, which has a critical value of \(F_{crit}\) = 3.94. Given that our F-value is smaller than our \(F_{crit}\) then we can say our result is not significant, and as such would be written up as F(1, 152) = 3.19, p > .05, \(\eta_p^2\) = 0.02.

Remember: If you were writing this up as a report, and analysed the data in R, then you would see the p-value was actually p = 0.0760836, and would be written up as p = 0.076

15.2 Look-Up table

Remembering that the \(F_{crit}\) value is the smallest F-value you need to find a significant effect, find the \(F_{crit}\) for your dfs, assuming \(\alpha = .05\). If the \(F\)-value you calculated is equal to or larger than \(F_{crit}\) then your test is significant. In this table, to fit it on to the page, \(df_{Between}\) is written as df1, and \(df_{Within}\) is written as df2.

dfs	df1 = 1	df1 = 2	df1 = 3	df1 = 4	df1 = 5
df2 = 1	161.448	199.5	215.707	224.583	230.162
df2 = 2	18.513	19	19.164	19.247	19.296
df2 = 3	10.128	9.552	9.277	9.117	9.013
df2 = 4	7.709	6.944	6.591	6.388	6.256
df2 = 5	6.608	5.786	5.409	5.192	5.05
df2 = 10	4.965	4.103	3.708	3.478	3.326
df2 = 15	4.543	3.682	3.287	3.056	2.901
df2 = 20	4.351	3.493	3.098	2.866	2.711
df2 = 25	4.242	3.385	2.991	2.759	2.603
df2 = 30	4.171	3.316	2.922	2.69	2.534
df2 = 40	4.085	3.232	2.839	2.606	2.449
df2 = 50	4.034	3.183	2.79	2.557	2.4
df2 = 60	4.001	3.15	2.758	2.525	2.368
df2 = 70	3.978	3.128	2.736	2.503	2.346
df2 = 80	3.96	3.111	2.719	2.486	2.329
df2 = 90	3.947	3.098	2.706	2.473	2.316
df2 = 100	3.936	3.087	2.696	2.463	2.305

14 One-Way Between-Subjects tables

16 Random formulas