This site is no longer maintained and has been left for archival purposes
Text and links may be out of date
Analysis of variance (ANOVA) for factorial combinations of treatments Elsewhere on this site we have dealt with ANOVA for simple comparisons of treatments. We can also use ANOVA for combinations of treatments, where two factors (e.g. pH and temperature) are applied in every possible combination. These are called factorial designs, and we can analyse them even if we do not have replicates. This type of analysis is called TWO-WAY ANOVA. Suppose that we have grown one bacterium in broth culture at 3 different pH levels at 4 different temperatures. We have 12 flasks in all, but no replicates. Growth was measured by optical density (O.D.). Construct a table as follows (O.D. is given in fictitious whole numbers here for convenience).
Then calculate the following (see the worked example and the output from Microsoft "Excel"). (a) S x, S x2, (S x)2 / n, and for each column in the table. (b) S x, S x2, (S x)2 / n, and for each row. (c) Find the grand total by adding all S x for columns (it should be the same for rows). Square this grand total and then divide by uv, where u is the number of data entries in each row, and v is number of data entries in each column. Call this value D; in our example it is (336)2 12 = 9408. (d) Find the sum of S x2 values for columns; call this A. It will be the same for S x2 of rows. In our example it is 11570. (e) Find the sum of S x2/n values for columns; call this B. In our example it is 11304. (f) Find the sum of S x2/n values for rows; call this C. In our example it is 9646. (g) Set out a table of analysis of variance as follows:
[* Where u is the number of data entries in each row, and v is the number of data entries in each column); note that the total df is always one fewer than the total number of entries in the table of data. *** Obtained by subtracting the between-columns and between-rows sums of squares from total sum of squares. Now do a variance ratio test to obtain F values: (1) For between columns (pH): F = Between columns mean square / Residual mean square
(2) For between rows (temperature) F = Between rows mean square / Residual mean square
In each case, consult a table of F (p = 0.05 or p = 0.01 or p = 0.001) where u is the between-treatments df (columns or rows, as appropriate) and v is residual df. If the calculated F value exceeds the tabulated value then the treatment effect (temperature or pH) is significant. In our example, for the effect of pH (u is 2 degrees of freedom, v is 6 df) the critical F value at p = 0.05 is 5.14. In fact, we have a significant effect of pH at p = 0.001. For the effect of temperature (u is 3 degrees of freedom, v is 6 df) the critical F value at p = 0.05 is 4.76. We find that the effect of temperature is significant at p = 0.01.
Below, we see a print-out of this analysis from "Excel". We select Anova: Two-Factor Without Replication from the analysis tools package. Note that the Anova table gives Source of Variation separately for Rows, Columns and Error (= Residual).
Of interest, another piece of information is revealed by this analysis - the effects of temperature do not interact with effects of pH. In other words, a change of temperature does not change the response to pH, and vice-versa.We can deduce this because the residual (error) mean square (MS) is small compared with the mean squares for temperature (columns) or pH (rows). [A low residual mean square tells us that most variation in the data is accounted for by the separate effects of temperature and pH]. But suppose that our data were as follows:
Here an increase of temperature increases growth at low pH but decreases growth at high pH. If we analysed these data we would probably find no significant effect of temperature or pH, because these factors interact to influence growth. The residual mean square would be very large. This type of result is not uncommon - for example, patients' age might affect their susceptibility to levels of stress. Inspection of our data strongly suggests that there is interaction. To analyse it, we would need to repeat the experiment with two replicates, then use a slightly more complex analysis of variance to test for (1) separate temperature effects, (2) separate pH effects, and (3) significant effects of interaction. As an example, below is shown a print-out from "Excel" of the following table, where I have assumed that we did the experiment above with replication.
The procedure in "Excel" is as follows. 1. Enter the replicates as separate rows. 2. From the analysis tools menu, choose Anova: Two-Factor with Replication. 3. Insert all the cells of the table in Input range (Anova assumes that column A and row 1 are used for headings). 4. Enter "2" (in our case) where asked for "Rows per sample". In the table displayed on the screen (see below) the analysis shows the means for each temperature and each pH. It also tells us the following (see the bottom rows of the table). (i) There is no significant difference between temperatures overall ("Excel" has called the temperature "Sample") because the calculated F value (3.148) is less than the critical F value (3.49). (ii) There is very highly significant (p = 0.0008) effect of pH ("Columns") overall. (iii) There is very highly significant interaction (p = 0.0000397) between temperature and pH. In other words, the response to pH depends on the temperature, or vice-versa. This might have been the purpose of doing the experiment - to see how the organism behaves when subjected to combinations of factors.
Note: Because there is so much interaction, it is difficult to analyse the separate effects of temperature and pH. We should repeat the analysis, using separate parts of the data. For example, ANOVA for all the pH treatments at 25oC , then at 30oC, then 35oC and 40oC. But we could assemble all the means (there are 12) in ranked order and do a multiple range test to find significant differences. |
This site is no longer maintained and has been left for archival purposes
Text and links may be out of date