Conditional Mutual Information Estimation for Mixed, Discrete and Continuous Data

  • Octavio Cesar Mesner
  • , Cosma Rohilla Shalizi

    Research output: Contribution to journalArticlepeer-review

    Abstract

    Fields like public health, public policy, and social science often want to quantify the degree of dependence between variables whose relationships take on unknown functional forms. Typically, in fact, researchers in these fields are attempting to evaluate causal theories, and so want to quantify dependence after conditioning on other variables that might explain, mediate or confound causal relations. One reason conditional mutual information is not more widely used for these tasks is the lack of estimators which can handle combinations of continuous and discrete random variables, common in applications. This article develops a new method for estimating mutual and conditional mutual information for data samples containing a mix of discrete and continuous variables. We prove that this estimator is consistent and show, via simulation, that it is more accurate than similar estimators.

    Original languageEnglish
    Article number9201164
    Pages (from-to)464-484
    Number of pages21
    JournalIEEE Transactions on Information Theory
    Volume67
    Issue number1
    DOIs
    StatePublished - Jan 2021

    Keywords

    • Conditional mutual information
    • discrete and continuous data
    • nearest neighbors

    Fingerprint

    Dive into the research topics of 'Conditional Mutual Information Estimation for Mixed, Discrete and Continuous Data'. Together they form a unique fingerprint.

    Cite this