TY - GEN
T1 - A Scalable Shannon Entropy Estimator
AU - Golia, Priyanka
AU - Juba, Brendan
AU - Meel, Kuldeep S.
N1 - Publisher Copyright:
© 2022, The Author(s).
PY - 2022
Y1 - 2022
N2 - Quantified information flow (QIF) has emerged as a rigorous approach to quantitatively measure confidentiality; the information-theoretic underpinning of QIF allows the end-users to link the computed quantities with the computational effort required on the part of the adversary to gain access to desired confidential information. In this work, we focus on the estimation of Shannon entropy for a given program Π. As a first step, we focus on the case wherein a Boolean formula φ(X, Y) captures the relationship between inputs X and output Y of Π. Such formulas φ(X, Y) have the property that for every valuation to X, there exists exactly one valuation to Y such that φ is satisfied. The existing techniques require O(2 m) model counting queries, where m= | Y|. We propose the first efficient algorithmic technique, called EntropyEstimation to estimate the Shannon entropy of φ with PAC-style guarantees, i.e., the computed estimate is guaranteed to lie within a (1 ± ε) -factor of the ground truth with confidence at least 1 - δ. Furthermore, EntropyEstimation makes only O(min(m,n)ε2) counting and sampling queries, where m= | Y|, and n= | X|, thereby achieving a significant reduction in the number of model counting queries. We demonstrate the practical efficiency of our algorithmic framework via a detailed experimental evaluation. Our evaluation demonstrates that the proposed framework scales to the formulas beyond the reach of the previously known approaches.
AB - Quantified information flow (QIF) has emerged as a rigorous approach to quantitatively measure confidentiality; the information-theoretic underpinning of QIF allows the end-users to link the computed quantities with the computational effort required on the part of the adversary to gain access to desired confidential information. In this work, we focus on the estimation of Shannon entropy for a given program Π. As a first step, we focus on the case wherein a Boolean formula φ(X, Y) captures the relationship between inputs X and output Y of Π. Such formulas φ(X, Y) have the property that for every valuation to X, there exists exactly one valuation to Y such that φ is satisfied. The existing techniques require O(2 m) model counting queries, where m= | Y|. We propose the first efficient algorithmic technique, called EntropyEstimation to estimate the Shannon entropy of φ with PAC-style guarantees, i.e., the computed estimate is guaranteed to lie within a (1 ± ε) -factor of the ground truth with confidence at least 1 - δ. Furthermore, EntropyEstimation makes only O(min(m,n)ε2) counting and sampling queries, where m= | Y|, and n= | X|, thereby achieving a significant reduction in the number of model counting queries. We demonstrate the practical efficiency of our algorithmic framework via a detailed experimental evaluation. Our evaluation demonstrates that the proposed framework scales to the formulas beyond the reach of the previously known approaches.
UR - https://www.scopus.com/pages/publications/85135873143
U2 - 10.1007/978-3-031-13185-1_18
DO - 10.1007/978-3-031-13185-1_18
M3 - Conference contribution
AN - SCOPUS:85135873143
SN - 9783031131844
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 363
EP - 384
BT - Computer Aided Verification - 34th International Conference, CAV 2022, Proceedings
A2 - Shoham, Sharon
A2 - Vizel, Yakir
PB - Springer Science and Business Media Deutschland GmbH
T2 - 34th International Conference on Computer Aided Verification, CAV 2022
Y2 - 7 August 2022 through 10 August 2022
ER -