Reward Delay Attacks on Deep Reinforcement Learning

  • Anindya Sarkar
  • , Jiarui Feng
  • , Yevgeniy Vorobeychik
  • , Christopher Gill
  • , Ning Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Most reinforcement learning algorithms implicitly assume strong synchrony. We present novel attacks targeting Q-learning that exploit a vulnerability entailed by this assumption by delaying the reward signal for a limited time period. We consider two types of attack goals: targeted attacks, which aim to cause a target policy to be learned, and untargeted attacks, which simply aim to induce a policy with a low reward. We evaluate the efficacy of the proposed attacks through a series of experiments. Our first observation is that reward-delay attacks are extremely effective when the goal is simply to minimize reward. Indeed, we find that even naive baseline reward-delay attacks are also highly successful in minimizing the reward. Targeted attacks, on the other hand, are more challenging, although we nevertheless demonstrate that the proposed approaches remain highly effective at achieving the attacker’s targets. In addition, we introduce a second threat model that captures a minimal mitigation that ensures that rewards cannot be used out of sequence. We find that this mitigation remains insufficient to ensure robustness to attacks that delay, but preserve the order, of rewards.

Original languageEnglish
Title of host publicationDecision and Game Theory for Security - 13th International Conference, GameSec 2022, Proceedings
EditorsFei Fang, Haifeng Xu, Yezekael Hayel
PublisherSpringer Science and Business Media Deutschland GmbH
Pages212-230
Number of pages19
ISBN (Print)9783031263682
DOIs
StatePublished - 2023
Event13th International Conference on Decision and Game Theory for Security, GameSec 2022 - Pittsburgh, United States
Duration: Oct 26 2022Oct 28 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13727 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference13th International Conference on Decision and Game Theory for Security, GameSec 2022
Country/TerritoryUnited States
CityPittsburgh
Period10/26/2210/28/22

Keywords

  • Adversarial attack
  • Deep reinforcement learning
  • Reward delay attack

Fingerprint

Dive into the research topics of 'Reward Delay Attacks on Deep Reinforcement Learning'. Together they form a unique fingerprint.

Cite this