Low-cost learning via active data procurement

Jacob Abernethy, Yiling Chen, Chien Ju Ho, Bo Waggoner

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

20 Scopus citations

Abstract

We design mechanisms for online procurement of data held by strategic agents for machine learning tasks. We study a model in which agents cannot fabricate data, but may lie about their cost of furnishing their data. The challenge is to use past data to actively price future data in order to obtain learning guarantees, even when agents' costs can depend arbitrarily on the data itself. We show how to convert a large class of no-regret algorithms into online posted-price and learning mechanisms. Our results parallel classic sample complexity guarantees, but with the key resource constraint being money rather than quantity of data available. With a budget constraint B, we give robust risk (predictive error) bounds on the order of 1/√B. In many cases our guarantees are significantly better due to an active-learning approach that leverages correlations between costs and data. Our algorithms and analysis go through a model of no-regret learning with T arriving pairs (cost, data) and a budget constraint of B, coupled with the "online to batch conversion". Our regret bounds for this model are on the order of T/√B and we give lower bounds on the same order.

Original languageEnglish
Title of host publicationEC 2015 - Proceedings of the 2015 ACM Conference on Economics and Computation
PublisherAssociation for Computing Machinery, Inc
Pages619-636
Number of pages18
ISBN (Electronic)9781450334105
DOIs
StatePublished - Jun 15 2015
Event16th ACM Conference on Economics and Computation, EC 2015 - Portland, United States
Duration: Jun 15 2015Jun 19 2015

Publication series

NameEC 2015 - Proceedings of the 2015 ACM Conference on Economics and Computation

Conference

Conference16th ACM Conference on Economics and Computation, EC 2015
Country/TerritoryUnited States
CityPortland
Period06/15/1506/19/15

Keywords

  • Data procurement
  • Machine learning
  • Mechanisms
  • Online learning

Fingerprint

Dive into the research topics of 'Low-cost learning via active data procurement'. Together they form a unique fingerprint.

Cite this