Google Scholar

Cycles in adversarial regularized learning

P Mertikopoulos, C Papadimitriou, G Piliouras - Proceedings of the twenty …, 2018 - SIAM

Regularized learning is a fundamental technique in online optimization, machine learning,
and many other fields of computer science. A natural question that arises in this context is …

Save Cite Cited by 359 Related articles All 12 versions

[PDF] arxiv.org

Optimistic mirror descent in saddle-point problems: Going the extra (gradient) mile

P Mertikopoulos, B Lecouat, H Zenati, CS Foo… - arXiv preprint arXiv …, 2018 - arxiv.org

Owing to their connection with generative adversarial networks (GANs), saddle-point
problems have recently attracted considerable interest in machine learning and beyond. By …

Save Cite Cited by 333 Related articles All 10 versions View as HTML

[PDF] arxiv.org

Learning in games via reinforcement and regularization

P Mertikopoulos, WH Sandholm - Mathematics of Operations …, 2016 - pubsonline.informs.org

We investigate a class of reinforcement learning dynamics where players adjust their
strategies based on their actions’ cumulative payoffs over time—specifically, by playing mixed …

Save Cite Cited by 182 Related articles All 25 versions

[PDF] arxiv.org

Learning in games with continuous action sets and unknown payoff functions

P Mertikopoulos, Z Zhou - Mathematical Programming, 2019 - Springer

This paper examines the convergence of no-regret learning in games with continuous action
sets. For concreteness, we focus on learning via “dual averaging”, a widely used class of no…

Save Cite Cited by 283 Related articles All 15 versions

[PDF] neurips.cc

On the almost sure convergence of stochastic gradient descent in non-convex problems

P Mertikopoulos, N Hallak, A Kavis… - Advances in Neural …, 2020 - proceedings.neurips.cc

In this paper, we analyze the trajectories of stochastic gradient descent (SGD) with the aim of
understanding their convergence properties in non-convex problems. We first show that the …

Save Cite Cited by 100 Related articles All 19 versions View as HTML

[PDF] neurips.cc

Bandit learning in concave N-person games

…, D Leslie, P Mertikopoulos - Advances in Neural …, 2018 - proceedings.neurips.cc

This paper examines the long-run behavior of learning with bandit feedback in non-cooperative
concave games. The bandit framework accounts for extremely low-information …

Save Cite Cited by 141 Related articles All 14 versions View as HTML

[PDF] neurips.cc

[PDF][PDF] No-regret learning and mixed Nash equilibria: They do not mix

…, T Lianeas, P Mertikopoulos… - arXiv preprint arXiv …, 2020 - proceedings.neurips.cc

Understanding the behavior of no-regret dynamics in general 𝑁-player games is a fundamental
question in online learning and game theory. A folk result in the field states that, in finite …

Save Cite Cited by 36 Related articles All 14 versions View as HTML

[PDF] sciencedirect.com

Riemannian game dynamics

P Mertikopoulos, WH Sandholm - Journal of Economic Theory, 2018 - Elsevier

We study a class of evolutionary game dynamics defined by balancing a gain determined
by the game's payoffs against a cost of motion that captures the difficulty with which the …

Save Cite Cited by 60 Related articles All 19 versions

[PDF] arxiv.org

On the convergence of gradient-like flows with noisy gradient input

P Mertikopoulos, M Staudigl - SIAM Journal on Optimization, 2018 - SIAM

In view of solving convex optimization problems with noisy gradient input, we analyze the
asymptotic behavior of gradient-like flows under stochastic disturbances. Specifically, we focus …

Save Cite Cited by 70 Related articles All 17 versions Library Search

[PDF] neurips.cc

Learning with bandit feedback in potential games

…, J Cohen, P Mertikopoulos - Advances in Neural …, 2017 - proceedings.neurips.cc

This paper examines the equilibrium convergence properties of no-regret learning with
exponential weights in potential games. To establish convergence with minimal information …

Save Cite Cited by 114 Related articles All 10 versions View as HTML

Create alert

Cite

Advanced search

Saved to My library

Cycles in adversarial regularized learning

Optimistic mirror descent in saddle-point problems: Going the extra (gradient) mile

Learning in games via reinforcement and regularization

Learning in games with continuous action sets and unknown payoff functions

On the almost sure convergence of stochastic gradient descent in non-convex problems

Bandit learning in concave N-person games

[PDF][PDF] No-regret learning and mixed Nash equilibria: They do not mix

Riemannian game dynamics

On the convergence of gradient-like flows with noisy gradient input

Learning with bandit feedback in potential games