Academic Journal

A unified stochastic approximation framework for learning in games.

Bibliographic Details
Title: A unified stochastic approximation framework for learning in games.
Authors: Mertikopoulos, Panayotis1 (AUTHOR) panayotis.mertikopoulos@imag.fr, Hsieh, Ya-Ping2 (AUTHOR), Cevher, Volkan3 (AUTHOR)
Superior Title: Mathematical Programming. Jan2024, Vol. 203 Issue 1/2, p559-609. 51p.
Subject Terms: *NASH equilibrium, STOCHASTIC approximation, EDUCATIONAL games, MACHINE learning, ROBBERS
Abstract: We develop a flexible stochastic approximation framework for analyzing the long-run behavior of learning in games (both continuous and finite). The proposed analysis template incorporates a wide array of popular learning algorithms, including gradient-based methods, the exponential/multiplicative weights algorithm for learning in finite games, optimistic and bandit variants of the above, etc. In addition to providing an integrated view of these algorithms, our framework further allows us to obtain several new convergence results, both asymptotic and in finite time, in both continuous and finite games. Specifically, we provide a range of criteria for identifying classes of Nash equilibria and sets of action profiles that are attracting with high probability, and we also introduce the notion of coherence, a game-theoretic property that includes strict and sharp equilibria, and which leads to convergence in finite time. Importantly, our analysis applies to both oracle-based and bandit, payoff-based methods—that is, when players only observe their realized payoffs. [ABSTRACT FROM AUTHOR]
Copyright of Mathematical Programming is the property of Springer Nature and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Business Source Premier
Description
Description not available.