TY - GEN
T1 - Distribution-Specific Auditing for Subgroup Fairness
AU - Hsu, Daniel
AU - Huang, Jizhou
AU - Juba, Brendan
N1 - Publisher Copyright:
© Daniel Hsu, Jizhou Huang, and Brendan Juba.
PY - 2024/6
Y1 - 2024/6
N2 - We study the problem of auditing classifiers for statistical subgroup fairness. Kearns et al. [20] showed that the problem of auditing combinatorial subgroups fairness is as hard as agnostic learning. Essentially all work on remedying statistical measures of discrimination against subgroups assumes access to an oracle for this problem, despite the fact that no efficient algorithms are known for it. If we assume the data distribution is Gaussian, or even merely log-concave, then a recent line of work has discovered efficient agnostic learning algorithms for halfspaces. Unfortunately, the reduction of Kearns et al. was formulated in terms of weak, “distribution-free” learning, and thus did not establish a connection for families such as log-concave distributions. In this work, we give positive and negative results on auditing for Gaussian distributions: On the positive side, we present an alternative approach to leverage these advances in agnostic learning and thereby obtain the first polynomial-time approximation scheme (PTAS) for auditing nontrivial combinatorial subgroup fairness: we show how to audit statistical notions of fairness over homogeneous halfspace subgroups when the features are Gaussian. On the negative side, we find that under cryptographic assumptions, no polynomial-time algorithm can guarantee any nontrivial auditing, even under Gaussian feature distributions, for general halfspace subgroups.
AB - We study the problem of auditing classifiers for statistical subgroup fairness. Kearns et al. [20] showed that the problem of auditing combinatorial subgroups fairness is as hard as agnostic learning. Essentially all work on remedying statistical measures of discrimination against subgroups assumes access to an oracle for this problem, despite the fact that no efficient algorithms are known for it. If we assume the data distribution is Gaussian, or even merely log-concave, then a recent line of work has discovered efficient agnostic learning algorithms for halfspaces. Unfortunately, the reduction of Kearns et al. was formulated in terms of weak, “distribution-free” learning, and thus did not establish a connection for families such as log-concave distributions. In this work, we give positive and negative results on auditing for Gaussian distributions: On the positive side, we present an alternative approach to leverage these advances in agnostic learning and thereby obtain the first polynomial-time approximation scheme (PTAS) for auditing nontrivial combinatorial subgroup fairness: we show how to audit statistical notions of fairness over homogeneous halfspace subgroups when the features are Gaussian. On the negative side, we find that under cryptographic assumptions, no polynomial-time algorithm can guarantee any nontrivial auditing, even under Gaussian feature distributions, for general halfspace subgroups.
KW - agnostic learning
KW - Fairness auditing
KW - intractability
UR - http://www.scopus.com/inward/record.url?scp=85196090427&partnerID=8YFLogxK
U2 - 10.4230/LIPIcs.FORC.2024.5
DO - 10.4230/LIPIcs.FORC.2024.5
M3 - Conference contribution
AN - SCOPUS:85196090427
T3 - Leibniz International Proceedings in Informatics, LIPIcs
BT - 5th Symposium on Foundations of Responsible Computing, FORC 2024
A2 - Rothblum, Guy N.
PB - Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
T2 - 5th Symposium on Foundations of Responsible Computing, FORC 2024
Y2 - 12 June 2024 through 14 June 2024
ER -