LADDER: A Human-Level Bidding Agent for Large-Scale Real-Time Online Auctions

Wang, Yu; Liu, Jiayi; Liu, Yuxiang; Hao, Jun; He, Yang; Hu, Jinghe; Yan, Weipeng P.; Li, Mantian

Computer Science > Machine Learning

arXiv:1708.05565 (cs)

[Submitted on 18 Aug 2017 (v1), last revised 1 Sep 2017 (this version, v2)]

Title:LADDER: A Human-Level Bidding Agent for Large-Scale Real-Time Online Auctions

Authors:Yu Wang, Jiayi Liu, Yuxiang Liu, Jun Hao, Yang He, Jinghe Hu, Weipeng P. Yan, Mantian Li

View PDF

Abstract:We present LADDER, the first deep reinforcement learning agent that can successfully learn control policies for large-scale real-world problems directly from raw inputs composed of high-level semantic information. The agent is based on an asynchronous stochastic variant of DQN (Deep Q Network) named DASQN. The inputs of the agent are plain-text descriptions of states of a game of incomplete information, i.e. real-time large scale online auctions, and the rewards are auction profits of very large scale. We apply the agent to an essential portion of JD's online RTB (real-time bidding) advertising business and find that it easily beats the former state-of-the-art bidding policy that had been carefully engineered and calibrated by human experts: during this http URL's June 18th anniversary sale, the agent increased the company's ads revenue from the portion by more than 50%, while the advertisers' ROI (return on investment) also improved significantly.

Comments:	8 pages, 12 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Science and Game Theory (cs.GT)
Cite as:	arXiv:1708.05565 [cs.LG]
	(or arXiv:1708.05565v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1708.05565

Submission history

From: Yu Wang [view email]
[v1] Fri, 18 Aug 2017 11:25:30 UTC (1,073 KB)
[v2] Fri, 1 Sep 2017 14:05:09 UTC (1,073 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2017-08

Change to browse by:

cs
cs.AI
cs.CL
cs.GT

References & Citations

DBLP - CS Bibliography

listing | bibtex

Yu Wang
Jiayi Liu
Yuxiang Liu
Jun Hao
Yang He

…

export BibTeX citation

Computer Science > Machine Learning

Title:LADDER: A Human-Level Bidding Agent for Large-Scale Real-Time Online Auctions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:LADDER: A Human-Level Bidding Agent for Large-Scale Real-Time Online Auctions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators