Equality of Outcome metrics
The idea of equality of outcome metrics, is to compare the rate of success in the privileged group with the rate of success in the unprivileged group. We define the success rate of a particular group as:
The idea is that an unbiased system would present roughly similar success rates across groups. There are two main ways of quantifying this across-groups comparison: one is by taking the ratio (disparate impact) and the other one is to take the difference (statistical parity). We will refer to the success rate for the unprivileged group as , and to the success rate for the privileged group as . Typically, these metrics are used in recruitment or in an academic context.
We list here the mathematical definitions for a few common metrics:
Disparate Impact: Measures the ratio of success rates. The ideal value is 1. The system is considered not biased if this quantity falls between 0.8 and 1.2.
Statistical Parity: Measures the difference in success rates. The ideal value is 0. A negative value means that the unprivileged group is unfavoured.
Cohen-D: this is essentially a standardised statistical parity. This value should ideally be small or negligible.
with being the number of individuals in the minority and majority groups respectively, and:
2-SD Rule: Another metric that normalises statistical parity. The ideal value is 0.
where is the total success rate (across all groups), and , with being the total number of individuals.
An example of how to measure bias in a binary classification problem in recruitment can be found in our notebook, which can be accessed here or downloaded as the following file:
Last updated