Tldr
The total variability of a random variable can be split into the average variability within different conditions and the variability of the averages across those conditions.
The Law of Total Variance states that the total variance of a random variable can be expressed as:
- The average spread within groups: , it quantifies the average variance of when you consider specific conditions or groups defined by .
- The spread between the averages of the groups: , it measures how much the conditional means of (the average of for each group of ) vary from each other.
The power of the Law of Total Variance (LOTV) lies in its ability to decompose variances, offering a deeper understandings of the factors contributing to overall variability.
For instance, consider the variance of heights of all students in a school. Using LOTV, we can decompose this total variance into:
- The average variance of heights within each grade level.
- The variance between the average heights of each grade level.
This decomposition allows us to discern whether the overall variability in student heights is more influenced by difference between grades (e.g., seniors are generally taller than freshmen) or by the natural height variations within each grade (e.g., some freshmen are taller than others). This provides valuable insights into the primary drivers of variability.
(Murphy, 2022, pp. 42–43)