ROBUST HIGH-DIMENSIONAL TUNING FREE MULTIPLE TESTING

  • Jianqing Fan
  • , Zhipeng Lou
  • , Mengxin Yu

    Research output: Contribution to journalArticlepeer-review

    1 Scopus citations

    Abstract

    A stylized feature of high-dimensional data is that many variables have heavy tails, and robust statistical inference is critical for valid large-scale statistical inference. Yet, the existing developments such as Winsorization, Huberization and median of means require the bounded second moments and involve variable-dependent tuning parameters, which hamper their fidelity in applications to large-scale problems. To liberate these constraints, this paper revisits the celebrated Hodges-Lehmann (HL) estimator for estimating location parameters in both the one- and two-sample problems, from a nonasymptotic perspective. Our study develops Berry-Esseen inequality and Cramér-type moderate deviation for the HL estimator based on newly developed nonasymptotic Bahadur representation and builds data-driven confidence intervals via a weighted bootstrap approach. These results allow us to extend the HL estimator to large-scale studies and propose tuning-free and moment-free high-dimensional inference procedures for testing global null and for large-scale multiple testing with false discovery proportion control. It is convincingly shown that the resulting tuning-free and moment-free methods control false discovery proportion at a prescribed level. The simulation studies lend further support to our developed theory.

    Original languageEnglish
    Pages (from-to)2093-2115
    Number of pages23
    JournalAnnals of Statistics
    Volume51
    Issue number5
    DOIs
    StatePublished - Oct 2023

    Keywords

    • heavy-tailed data
    • largescale multiple testing
    • Robust statistical inference
    • tuning free
    • weighted bootstrap

    Fingerprint

    Dive into the research topics of 'ROBUST HIGH-DIMENSIONAL TUNING FREE MULTIPLE TESTING'. Together they form a unique fingerprint.

    Cite this