TY - JOUR
T1 - SymmPI
T2 - Predictive inference for data with group symmetries
AU - Dobriban, Edgar
AU - Yu, Mengxin
N1 - Publisher Copyright:
© 2025 The Royal Statistical Society. All rights reserved. For commercial re-use, please contact [email protected] for reprints and translation rights for reprints. All other permissions can be obtained through our RightsLink service via the Permissions link on the article page on our site - for further information please contact [email protected].
PY - 2025/11/1
Y1 - 2025/11/1
N2 - Quantifying the uncertainty of predictions is a core problem in modern statistics. Methods for predictive inference have been developed under a variety of assumptions, often - for instance, in standard conformal prediction - relying on the invariance of the distribution of the data under special groups of transformations such as permutation groups. Moreover, many existing methods for predictive inference aim to predict unobserved outcomes in sequences of feature-outcome observations. Meanwhile, there is interest in predictive inference under more general observation models (e.g. for partially observed features) and for data satisfying more general distributional symmetries (e.g. rotationally invariant observations in physics). Here, we propose SymmPI, a methodology for predictive inference when data distributions have general group symmetries in arbitrary observation models. Our methods leverage the novel notion of distributionally equivariant transformations, which process the data while preserving their distributional invariances. We show that SymmPI has valid coverage under distributional invariance and characterize its performance under distribution shift, recovering recent results as special cases. We apply SymmPI to predict unobserved values associated with vertices in a network, where the distribution is unchanged under relabellings that keep the network structure unchanged. In several simulations in a two-layer hierarchical model, and in an empirical data analysis example, SymmPI performs favourably compared with existing methods.
AB - Quantifying the uncertainty of predictions is a core problem in modern statistics. Methods for predictive inference have been developed under a variety of assumptions, often - for instance, in standard conformal prediction - relying on the invariance of the distribution of the data under special groups of transformations such as permutation groups. Moreover, many existing methods for predictive inference aim to predict unobserved outcomes in sequences of feature-outcome observations. Meanwhile, there is interest in predictive inference under more general observation models (e.g. for partially observed features) and for data satisfying more general distributional symmetries (e.g. rotationally invariant observations in physics). Here, we propose SymmPI, a methodology for predictive inference when data distributions have general group symmetries in arbitrary observation models. Our methods leverage the novel notion of distributionally equivariant transformations, which process the data while preserving their distributional invariances. We show that SymmPI has valid coverage under distributional invariance and characterize its performance under distribution shift, recovering recent results as special cases. We apply SymmPI to predict unobserved values associated with vertices in a network, where the distribution is unchanged under relabellings that keep the network structure unchanged. In several simulations in a two-layer hierarchical model, and in an empirical data analysis example, SymmPI performs favourably compared with existing methods.
KW - conformal prediction
KW - group symmetry
KW - invariance
KW - predictive inference
UR - https://www.scopus.com/pages/publications/105021386328
U2 - 10.1093/jrsssb/qkaf022
DO - 10.1093/jrsssb/qkaf022
M3 - Article
AN - SCOPUS:105021386328
SN - 1369-7412
VL - 87
SP - 1353
EP - 1381
JO - Journal of the Royal Statistical Society. Series B: Statistical Methodology
JF - Journal of the Royal Statistical Society. Series B: Statistical Methodology
IS - 5
ER -