Simpson's paradox is a phenomenon in statistics where a trend that appears in several different groups of data reverses or disappears when the groups are combined. This paradox can lead to misleading conclusions if the data is not properly analyzed, as the overall relationship may not reflect the relationships within the individual groups. The key concept behind Simpson's paradox is that the aggregation of data can mask or confound relationships due to lurking variables or different underlying distributions.
New to topics? Read the docs here!