Polynomial Time Reinforcement Learning in Factored State MDPs with Linear Value Functions

Zihao Deng, Siddartha Devic, Brendan Juba

Research output: Contribution to journalConference articlepeer-review

1 Scopus citations

Fingerprint

Dive into the research topics of 'Polynomial Time Reinforcement Learning in Factored State MDPs with Linear Value Functions'. Together they form a unique fingerprint.

Keyphrases

Computer Science

Mathematics