Search Results - STOCHASTIC approximation

1

Conference

Riemannian stochastic optimization methods avoid strict saddle points

Authors: Hsieh, Ya-Ping, Karimi, Mohammad Reza, Krause, Andreas, Mertikopoulos, Panayotis

Contributors: Eidgenössische Technische Hochschule - Swiss Federal Institute of Technology Zürich (ETH Zürich), Performance analysis and optimization of LARge Infrastructures and Systems (POLARIS), Inria Grenoble - Rhône-Alpes, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire d'Informatique de Grenoble (LIG), Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA), ANR-19-P3IA-0003,MIAI,MIAI @ Grenoble Alpes(2019)

Superior Title: Riemannian stochastic optimization methods avoid strict saddle points ; NeurIPS 2023 - 37th Conference on Neural Information Processing Systems ; https://inria.hal.science/hal-04307876 ; NeurIPS 2023 - 37th Conference on Neural Information Processing Systems, Dec 2023, New Orleans (LA), United States. pp.1-27

Subject Terms: Optimization on manifolds, Stochastic approximation, Saddle-point avoidance, Riemannian Robbins-Monro algorithms, [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC]

Subject Geographic: New Orleans (LA), United States

Relation: info:eu-repo/semantics/altIdentifier/arxiv/2311.02374; hal-04307876; https://inria.hal.science/hal-04307876; https://inria.hal.science/hal-04307876/document; https://inria.hal.science/hal-04307876/file/Main.pdf; ARXIV: 2311.02374

Availability: https://inria.hal.science/hal-04307876
https://inria.hal.science/hal-04307876/document
https://inria.hal.science/hal-04307876/file/Main.pdf

2

Academic Journal

A unified stochastic approximation framework for learning in games

Authors: Mertikopoulos, Panayotis, Hsieh, Ya-Ping, Cevher, Volkan

Contributors: Performance analysis and optimization of LARge Infrastructures and Systems (POLARIS), Inria Grenoble - Rhône-Alpes, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire d'Informatique de Grenoble (LIG), Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA), Department of Computer Science ETH Zürich (D-INFK), Eidgenössische Technische Hochschule - Swiss Federal Institute of Technology Zürich (ETH Zürich), Ecole Polytechnique Fédérale de Lausanne (EPFL), ANR-19-CE48-0018,ALIAS,Apprentissage adaptatif multi-agent(2019), ANR-19-P3IA-0003,MIAI,MIAI @ Grenoble Alpes(2019), ANR-11-LABX-0025,PERSYVAL-lab,Systemes et Algorithmes Pervasifs au confluent des mondes physique et numérique(2011)

Superior Title: ISSN: 0025-5610.

Subject Terms: Nash equilibrium, Continuous games, Finite games, Variational stability, Stochastic approximation, [INFO.INFO-GT]Computer Science [cs]/Computer Science and Game Theory [cs.GT], [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC]

Relation: info:eu-repo/semantics/altIdentifier/arxiv/2206.03922; hal-03874012; https://hal.science/hal-03874012; https://hal.science/hal-03874012v2/document; https://hal.science/hal-03874012v2/file/Main.pdf; ARXIV: 2206.03922

Availability: https://doi.org/10.1007/s10107-023-02001-y
https://hal.science/hal-03874012
https://hal.science/hal-03874012v2/document
https://hal.science/hal-03874012v2/file/Main.pdf

Zugang prüfen (DOI)

3

Academic Journal

A unified stochastic approximation framework for learning in games

Authors: Mertikopoulos, Panayotis, Hsieh, Ya-Ping, Cevher, Volkan

Contributors: Performance analysis and optimization of LARge Infrastructures and Systems (POLARIS), Inria Grenoble - Rhône-Alpes, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire d'Informatique de Grenoble (LIG), Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA), Department of Computer Science ETH Zürich (D-INFK), Eidgenössische Technische Hochschule - Swiss Federal Institute of Technology Zürich (ETH Zürich), Ecole Polytechnique Fédérale de Lausanne (EPFL), ANR-19-CE48-0018,ALIAS,Apprentissage adaptatif multi-agent(2019), ANR-19-P3IA-0003,MIAI,MIAI @ Grenoble Alpes(2019), ANR-11-LABX-0025,PERSYVAL-lab,Systemes et Algorithmes Pervasifs au confluent des mondes physique et numérique(2011)

Superior Title: ISSN: 0025-5610.

Subject Terms: Nash equilibrium, Continuous games, Finite games, Variational stability, Stochastic approximation, [INFO.INFO-GT]Computer Science [cs]/Computer Science and Game Theory [cs.GT], [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC]

Relation: info:eu-repo/semantics/altIdentifier/arxiv/2206.03922; hal-03874012; https://hal.science/hal-03874012; https://hal.science/hal-03874012v2/document; https://hal.science/hal-03874012v2/file/Main.pdf; ARXIV: 2206.03922

Availability: https://doi.org/10.1007/s10107-023-02001-y
https://hal.science/hal-03874012
https://hal.science/hal-03874012v2/document
https://hal.science/hal-03874012v2/file/Main.pdf

Zugang prüfen (DOI)

4

Academic Journal

Stochastic Approximation Beyond Gradient for Signal Processing and Machine Learning

Authors: Dieuleveut, Aymeric, Fort, Gersende, Moulines, Eric, Wai, Hoi-To

Contributors: Centre de Mathématiques Appliquées - Ecole Polytechnique (CMAP), École polytechnique (X)-Centre National de la Recherche Scientifique (CNRS), Institut de Mathématiques de Toulouse UMR5219 (IMT), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Institut National des Sciences Appliquées - Toulouse (INSA Toulouse), Institut National des Sciences Appliquées (INSA)-Université de Toulouse (UT)-Institut National des Sciences Appliquées (INSA)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS), The Chinese University of Hong Kong Hong Kong (CUHK), Fondation Simone et Cino Del Duca Institut de France (author G. Fort), HKRGC Project 23203520 (author H-T. Wai), Hi!Paris FLAG project (authors A. Dieuleveut and E. Moulines)

Superior Title: ISSN: 1053-587X ; IEEE Transactions on Signal Processing ; https://hal.science/hal-03979922 ; IEEE Transactions on Signal Processing, 2023, 71, pp.3117-3148. ⟨10.1109/TSP.2023.3301121⟩.

Subject Terms: Stochastic approximation, Convergence Analysis, Compressed Stochastic Gradient, Expectation Maximization, TD-learning, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC], [MATH.MATH-ST]Mathematics [math]/Statistics [math.ST]

Relation: hal-03979922; https://hal.science/hal-03979922; https://hal.science/hal-03979922/document; https://hal.science/hal-03979922/file/main-v3-commited_HAL.pdf

Availability: https://doi.org/10.1109/TSP.2023.3301121
https://hal.science/hal-03979922
https://hal.science/hal-03979922/document
https://hal.science/hal-03979922/file/main-v3-commited_HAL.pdf

Zugang prüfen (DOI)

5

Academic Journal

Stochastic Approximation Beyond Gradient for Signal Processing and Machine Learning

Authors: Dieuleveut, Aymeric, Fort, Gersende, Moulines, Eric, Wai, Hoi-To

Contributors: Centre de Mathématiques Appliquées - Ecole Polytechnique (CMAP), École polytechnique (X)-Centre National de la Recherche Scientifique (CNRS), Institut de Mathématiques de Toulouse UMR5219 (IMT), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Institut National des Sciences Appliquées - Toulouse (INSA Toulouse), Institut National des Sciences Appliquées (INSA)-Université de Toulouse (UT)-Institut National des Sciences Appliquées (INSA)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS), The Chinese University of Hong Kong Hong Kong (CUHK), Fondation Simone et Cino Del Duca Institut de France (author G. Fort), HKRGC Project 23203520 (author H-T. Wai), Hi!Paris FLAG project (authors A. Dieuleveut and E. Moulines)

Superior Title: ISSN: 1053-587X ; IEEE Transactions on Signal Processing ; https://hal.science/hal-03979922 ; IEEE Transactions on Signal Processing, 2023, 71, pp.3117-3148. ⟨10.1109/TSP.2023.3301121⟩.

Subject Terms: Stochastic approximation, Convergence Analysis, Compressed Stochastic Gradient, Expectation Maximization, TD-learning, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC], [MATH.MATH-ST]Mathematics [math]/Statistics [math.ST]

Relation: hal-03979922; https://hal.science/hal-03979922; https://hal.science/hal-03979922/document; https://hal.science/hal-03979922/file/main-v3-commited_HAL.pdf

Availability: https://doi.org/10.1109/TSP.2023.3301121
https://hal.science/hal-03979922
https://hal.science/hal-03979922/document
https://hal.science/hal-03979922/file/main-v3-commited_HAL.pdf

Zugang prüfen (DOI)

6

Academic Journal

A unified stochastic approximation framework for learning in games

Authors: Mertikopoulos, Panayotis, Hsieh, Ya-Ping, Cevher, Volkan

Contributors: Performance analysis and optimization of LARge Infrastructures and Systems (POLARIS), Inria Grenoble - Rhône-Alpes, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire d'Informatique de Grenoble (LIG), Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA), Department of Computer Science ETH Zürich (D-INFK), Eidgenössische Technische Hochschule - Swiss Federal Institute of Technology Zürich (ETH Zürich), Ecole Polytechnique Fédérale de Lausanne (EPFL), ANR-19-CE48-0018,ALIAS,Apprentissage adaptatif multi-agent(2019), ANR-19-P3IA-0003,MIAI,MIAI @ Grenoble Alpes(2019), ANR-11-LABX-0025,PERSYVAL-lab,Systemes et Algorithmes Pervasifs au confluent des mondes physique et numérique(2011)

Superior Title: ISSN: 0025-5610.

Subject Terms: Nash equilibrium, Continuous games, Finite games, Variational stability, Stochastic approximation, [INFO.INFO-GT]Computer Science [cs]/Computer Science and Game Theory [cs.GT], [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC]

Relation: info:eu-repo/semantics/altIdentifier/arxiv/2206.03922; hal-03874012; https://hal.science/hal-03874012; https://hal.science/hal-03874012v2/document; https://hal.science/hal-03874012v2/file/Main.pdf; ARXIV: 2206.03922

Availability: https://doi.org/10.1007/s10107-023-02001-y
https://hal.science/hal-03874012
https://hal.science/hal-03874012v2/document
https://hal.science/hal-03874012v2/file/Main.pdf

Zugang prüfen (DOI)

7

Report

Score-Aware Policy-Gradient Methods and Performance Guarantees using Local Lyapunov Conditions ; Score-Aware Policy-Gradient Methods and Performance Guarantees using Local Lyapunov Conditions: Applications to Product-Form Stochastic Networks and Queueing Systems

Authors: Comte, Céline, Jonckheere, Matthieu, Sanders, Jaron, Senen-Cerda, Albert

Contributors: Centre National de la Recherche Scientifique (CNRS), Équipe Services et Architectures pour Réseaux Avancés (LAAS-SARA), Laboratoire d'analyse et d'architecture des systèmes (LAAS), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Institut National des Sciences Appliquées - Toulouse (INSA Toulouse), Institut National des Sciences Appliquées (INSA)-Université de Toulouse (UT)-Institut National des Sciences Appliquées (INSA)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT), Eindhoven University of Technology Eindhoven (TU/e), Réseaux, Mobiles, Embarqués, Sans fil, Satellites (IRIT-RMESS), Institut de recherche en informatique de Toulouse (IRIT), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université Toulouse III - Paul Sabatier (UT3)

Superior Title: https://hal.science/hal-04329790 ; 2023.

Subject Terms: reinforcement learning, policy-gradient method, exponential families, product-form stationary distribution, stochastic approximation, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], [INFO.INFO-PF]Computer Science [cs]/Performance [cs.PF], [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC], [MATH.MATH-PR]Mathematics [math]/Probability [math.PR]

Relation: info:eu-repo/semantics/altIdentifier/arxiv/2312.02804; hal-04329790; https://hal.science/hal-04329790; https://hal.science/hal-04329790/document; https://hal.science/hal-04329790/file/paper.pdf; ARXIV: 2312.02804

Availability: https://hal.science/hal-04329790
https://hal.science/hal-04329790/document
https://hal.science/hal-04329790/file/paper.pdf

8

Report

Score-Aware Policy-Gradient Methods and Performance Guarantees using Local Lyapunov Conditions ; Score-Aware Policy-Gradient Methods and Performance Guarantees using Local Lyapunov Conditions: Applications to Product-Form Stochastic Networks and Queueing Systems

Authors: Comte, Céline, Jonckheere, Matthieu, Sanders, Jaron, Senen-Cerda, Albert

Contributors: Centre National de la Recherche Scientifique (CNRS), Équipe Services et Architectures pour Réseaux Avancés (LAAS-SARA), Laboratoire d'analyse et d'architecture des systèmes (LAAS), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Institut National des Sciences Appliquées - Toulouse (INSA Toulouse), Institut National des Sciences Appliquées (INSA)-Université de Toulouse (UT)-Institut National des Sciences Appliquées (INSA)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT), Equipe Réseaux, Informatique, Systèmes de Confiance (LAAS-RISC), Eindhoven University of Technology Eindhoven (TU/e)

Superior Title: https://hal.science/hal-04329790 ; 2023.

Subject Terms: reinforcement learning, policy-gradient method, exponential families, product-form stationary distribution, stochastic approximation, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], [INFO.INFO-PF]Computer Science [cs]/Performance [cs.PF], [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC], [MATH.MATH-PR]Mathematics [math]/Probability [math.PR]

Relation: info:eu-repo/semantics/altIdentifier/arxiv/2312.02804; hal-04329790; https://hal.science/hal-04329790; https://hal.science/hal-04329790/document; https://hal.science/hal-04329790/file/paper.pdf; ARXIV: 2312.02804

Availability: https://hal.science/hal-04329790
https://hal.science/hal-04329790/document
https://hal.science/hal-04329790/file/paper.pdf

9

Report

Stochastic Approximation Beyond Gradient for Signal Processing and Machine Learning

Authors: Dieuleveut, Aymeric, Fort, Gersende, Moulines, Eric, Wai, Hoi-To

Contributors: Centre de Mathématiques Appliquées - Ecole Polytechnique (CMAP), École polytechnique (X)-Centre National de la Recherche Scientifique (CNRS), Institut de Mathématiques de Toulouse UMR5219 (IMT), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Institut National des Sciences Appliquées - Toulouse (INSA Toulouse), Institut National des Sciences Appliquées (INSA)-Université de Toulouse (UT)-Institut National des Sciences Appliquées (INSA)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS), The Chinese University of Hong Kong Hong Kong (CUHK), Fondation Simone et Cino Del Duca Institut de France (author G. Fort), HKRGC Project 23203520 (author H-T. Wai), Hi!Paris FLAG project (authors A. Dieuleveut and E. Moulines)

Superior Title: https://hal.science/hal-03979922 ; 2023.

Subject Terms: Stochastic approximation, Convergence Analysis, Compressed Stochastic Gradient, Expectation Maximization, TD-learning, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC], [MATH.MATH-ST]Mathematics [math]/Statistics [math.ST]

Relation: hal-03979922; https://hal.science/hal-03979922; https://hal.science/hal-03979922/document; https://hal.science/hal-03979922/file/main-v3-commited_HAL.pdf

Availability: https://hal.science/hal-03979922
https://hal.science/hal-03979922/document
https://hal.science/hal-03979922/file/main-v3-commited_HAL.pdf

10

Report

Stochastic Approximation Beyond Gradient for Signal Processing and Machine Learning

Authors: Dieuleveut, Aymeric, Fort, Gersende, Moulines, Eric, Wai, Hoi-To

Contributors: Centre de Mathématiques Appliquées - Ecole Polytechnique (CMAP), École polytechnique (X)-Centre National de la Recherche Scientifique (CNRS), Institut de Mathématiques de Toulouse UMR5219 (IMT), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Institut National des Sciences Appliquées - Toulouse (INSA Toulouse), Institut National des Sciences Appliquées (INSA)-Université de Toulouse (UT)-Institut National des Sciences Appliquées (INSA)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS), The Chinese University of Hong Kong Hong Kong (CUHK), Fondation Simone et Cino Del Duca Institut de France (author G. Fort), HKRGC Project 23203520 (author H-T. Wai), Hi!Paris FLAG project (authors A. Dieuleveut and E. Moulines)

Superior Title: https://hal.science/hal-03979922 ; 2023.

Subject Terms: Stochastic approximation, Convergence Analysis, Compressed Stochastic Gradient, Expectation Maximization, TD-learning, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC], [MATH.MATH-ST]Mathematics [math]/Statistics [math.ST]

Relation: hal-03979922; https://hal.science/hal-03979922; https://hal.science/hal-03979922/document; https://hal.science/hal-03979922/file/main-v3-commited_HAL.pdf

Availability: https://hal.science/hal-03979922
https://hal.science/hal-03979922/document
https://hal.science/hal-03979922/file/main-v3-commited_HAL.pdf

11

Report

Stochastic Approximation Beyond Gradient for Signal Processing and Machine Learning

Authors: Dieuleveut, Aymeric, Fort, Gersende, Moulines, Eric, Wai, Hoi-To

Contributors: Centre de Mathématiques Appliquées - Ecole Polytechnique (CMAP), École polytechnique (X)-Centre National de la Recherche Scientifique (CNRS), Institut de Mathématiques de Toulouse UMR5219 (IMT), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Institut National des Sciences Appliquées - Toulouse (INSA Toulouse), Institut National des Sciences Appliquées (INSA)-Université de Toulouse (UT)-Institut National des Sciences Appliquées (INSA)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS), The Chinese University of Hong Kong Hong Kong, Fondation Simone et Cino Del Duca Institut de France (author G. Fort), HKRGC Project 23203520 (author H-T. Wai), Hi!Paris FLAG project (authors A. Dieuleveut and E. Moulines)

Superior Title: https://hal.science/hal-03979922 ; 2023.

Subject Terms: Stochastic approximation, Convergence Analysis, Compressed Stochastic Gradient, Expectation Maximization, TD-learning, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI], [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing, [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC], [MATH.MATH-ST]Mathematics [math]/Statistics [math.ST]

Relation: hal-03979922; https://hal.science/hal-03979922; https://hal.science/hal-03979922/document; https://hal.science/hal-03979922/file/main-v3-commited_HAL.pdf

Availability: https://hal.science/hal-03979922
https://hal.science/hal-03979922/document
https://hal.science/hal-03979922/file/main-v3-commited_HAL.pdf

12

Report

Score-Aware Policy-Gradient Methods and Performance Guarantees using Local Lyapunov Conditions ; Score-Aware Policy-Gradient Methods and Performance Guarantees using Local Lyapunov Conditions: Applications to Product-Form Stochastic Networks and Queueing Systems

Authors: Comte, Céline, Jonckheere, Matthieu, Sanders, Jaron, Senen-Cerda, Albert

Contributors: Centre National de la Recherche Scientifique (CNRS), Équipe Services et Architectures pour Réseaux Avancés (LAAS-SARA), Laboratoire d'analyse et d'architecture des systèmes (LAAS), Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT)-Université de Toulouse (UT)-Institut National des Sciences Appliquées - Toulouse (INSA Toulouse), Institut National des Sciences Appliquées (INSA)-Université de Toulouse (UT)-Institut National des Sciences Appliquées (INSA)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Université Toulouse Capitole (UT Capitole), Université de Toulouse (UT), Equipe Réseaux, Informatique, Systèmes de Confiance (LAAS-RISC), Eindhoven University of Technology Eindhoven (TU/e), Réseaux, Mobiles, Embarqués, Sans fil, Satellites (IRIT-RMESS), Institut de recherche en informatique de Toulouse (IRIT), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Toulouse Mind & Brain Institut (TMBI), Université Toulouse - Jean Jaurès (UT2J), Université de Toulouse (UT)-Université de Toulouse (UT)-Université Toulouse III - Paul Sabatier (UT3), Université Toulouse III - Paul Sabatier (UT3)

Superior Title: https://hal.science/hal-04329790 ; 2023.

Subject Terms: reinforcement learning, policy-gradient method, exponential families, product-form stationary distribution, stochastic approximation, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], [INFO.INFO-PF]Computer Science [cs]/Performance [cs.PF], [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC], [MATH.MATH-PR]Mathematics [math]/Probability [math.PR]

Relation: info:eu-repo/semantics/altIdentifier/arxiv/2312.02804; hal-04329790; https://hal.science/hal-04329790; https://hal.science/hal-04329790/document; https://hal.science/hal-04329790/file/paper.pdf; ARXIV: 2312.02804

Availability: https://hal.science/hal-04329790
https://hal.science/hal-04329790/document
https://hal.science/hal-04329790/file/paper.pdf

13

Academic Journal

Distributed stochastic optimization with large delays

Authors: Zhou, Zhengyuan, Mertikopoulos, Panayotis, Bambos, Nicholas, Glynn, Peter, W, Ye, Yinyu

Contributors: New York University New York (NYU), NYU System (NYU), Performance analysis and optimization of LARge Infrastructures and Systems (POLARIS), Inria Grenoble - Rhône-Alpes, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire d'Informatique de Grenoble (LIG), Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA), Stanford University, ANR-16-CE33-0004,ORACLESS,Stratégies adaptatives d'allocation des ressources dans les réseaux sans fil dynamiques(2016), ANR-19-P3IA-0003,MIAI,MIAI @ Grenoble Alpes(2019), ANR-11-LABX-0025,PERSYVAL-lab,Systemes et Algorithmes Pervasifs au confluent des mondes physique et numérique(2011), ANR-19-CE48-0018,ALIAS,Apprentissage adaptatif multi-agent(2019), European Project: GAMENET

Superior Title: ISSN: 0364-765X.

Subject Terms: Distributed optimization, Delays, Stochastic gradient descent, Stochastic approximation, [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC]

Relation: info:eu-repo/semantics/altIdentifier/arxiv/2107.02919; hal-03342384; https://inria.hal.science/hal-03342384; https://inria.hal.science/hal-03342384/document; https://inria.hal.science/hal-03342384/file/Main.pdf; ARXIV: 2107.02919

Availability: https://doi.org/10.1287/moor.2021.1200
https://inria.hal.science/hal-03342384
https://inria.hal.science/hal-03342384/document
https://inria.hal.science/hal-03342384/file/Main.pdf

Zugang prüfen (DOI)

14

Academic Journal

Convergence of constant step stochastic gradient descent for non-smooth non-convex functions

Authors: Bianchi, Pascal, Hachem, Walid, Schechtman, Sholom

Contributors: Signal, Statistique et Apprentissage (S2A), Laboratoire Traitement et Communication de l'Information (LTCI), Institut Mines-Télécom Paris (IMT)-Télécom Paris-Institut Mines-Télécom Paris (IMT)-Télécom Paris, Département Images, Données, Signal (IDS), Télécom ParisTech, Institut Polytechnique de Paris (IP Paris), Laboratoire d'Informatique Gaspard-Monge (LIGM), École des Ponts ParisTech (ENPC)-Centre National de la Recherche Scientifique (CNRS)-Université Gustave Eiffel

Superior Title: ISSN: 1877-0533.

Subject Terms: Stochastic approximation, Non convex and non smooth optimization, Backpropagation algorithm, Differential inclusions, Clarke subdifferential, [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC], [MATH.MATH-NA]Mathematics [math]/Numerical Analysis [math.NA]

Relation: info:eu-repo/semantics/altIdentifier/arxiv/2005.08513; hal-02564349; https://hal.science/hal-02564349; https://hal.science/hal-02564349v3/document; https://hal.science/hal-02564349v3/file/clarke.pdf; ARXIV: 2005.08513

Availability: https://doi.org/10.1007/s11228-022-00638-z
https://hal.science/hal-02564349
https://hal.science/hal-02564349v3/document
https://hal.science/hal-02564349v3/file/clarke.pdf

Zugang prüfen (DOI)

15

Academic Journal

Convergence of constant step stochastic gradient descent for non-smooth non-convex functions

Authors: Bianchi, Pascal, Hachem, Walid, Schechtman, Sholom

Contributors: Signal, Statistique et Apprentissage (S2A), Laboratoire Traitement et Communication de l'Information (LTCI), Institut Mines-Télécom Paris (IMT)-Télécom Paris-Institut Mines-Télécom Paris (IMT)-Télécom Paris, Département Images, Données, Signal (IDS), Télécom ParisTech, Institut Polytechnique de Paris (IP Paris), Laboratoire d'Informatique Gaspard-Monge (LIGM), École des Ponts ParisTech (ENPC)-Centre National de la Recherche Scientifique (CNRS)-Université Gustave Eiffel

Superior Title: ISSN: 1877-0533.

Subject Terms: Stochastic approximation, Non convex and non smooth optimization, Backpropagation algorithm, Differential inclusions, Clarke subdifferential, [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC], [MATH.MATH-NA]Mathematics [math]/Numerical Analysis [math.NA]

Relation: info:eu-repo/semantics/altIdentifier/arxiv/2005.08513; hal-02564349; https://hal.science/hal-02564349; https://hal.science/hal-02564349v3/document; https://hal.science/hal-02564349v3/file/clarke.pdf; ARXIV: 2005.08513

Availability: https://doi.org/10.1007/s11228-022-00638-z
https://hal.science/hal-02564349
https://hal.science/hal-02564349v3/document
https://hal.science/hal-02564349v3/file/clarke.pdf

Zugang prüfen (DOI)

16

Academic Journal

Convergence of constant step stochastic gradient descent for non-smooth non-convex functions

Authors: Bianchi, Pascal, Hachem, Walid, Schechtman, Sholom

Contributors: Signal, Statistique et Apprentissage (S2A), Laboratoire Traitement et Communication de l'Information (LTCI), Institut Mines-Télécom Paris (IMT)-Télécom Paris-Institut Mines-Télécom Paris (IMT)-Télécom Paris, Département Images, Données, Signal (IDS), Télécom ParisTech, Institut Polytechnique de Paris (IP Paris), Laboratoire d'Informatique Gaspard-Monge (LIGM), École des Ponts ParisTech (ENPC)-Centre National de la Recherche Scientifique (CNRS)-Université Gustave Eiffel

Superior Title: ISSN: 1877-0533.

Subject Terms: Stochastic approximation, Non convex and non smooth optimization, Backpropagation algorithm, Differential inclusions, Clarke subdifferential, [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC], [MATH.MATH-NA]Mathematics [math]/Numerical Analysis [math.NA]

Relation: info:eu-repo/semantics/altIdentifier/arxiv/2005.08513; hal-02564349; https://hal.science/hal-02564349; https://hal.science/hal-02564349v3/document; https://hal.science/hal-02564349v3/file/clarke.pdf; ARXIV: 2005.08513

Availability: https://doi.org/10.1007/s11228-022-00638-z
https://hal.science/hal-02564349
https://hal.science/hal-02564349v3/document
https://hal.science/hal-02564349v3/file/clarke.pdf

Zugang prüfen (DOI)

17

Academic Journal

Convergence of constant step stochastic gradient descent for non-smooth non-convex functions

Authors: Bianchi, Pascal, Hachem, Walid, Schechtman, Sholom

Contributors: Signal, Statistique et Apprentissage (S2A), Laboratoire Traitement et Communication de l'Information (LTCI), Institut Mines-Télécom Paris (IMT)-Télécom Paris-Institut Mines-Télécom Paris (IMT)-Télécom Paris, Département Images, Données, Signal (IDS), Télécom ParisTech, Institut Polytechnique de Paris (IP Paris), Laboratoire d'Informatique Gaspard-Monge (LIGM), École des Ponts ParisTech (ENPC)-Centre National de la Recherche Scientifique (CNRS)-Université Gustave Eiffel

Superior Title: ISSN: 1877-0533.

Subject Terms: Stochastic approximation, Non convex and non smooth optimization, Backpropagation algorithm, Differential inclusions, Clarke subdifferential, [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC], [MATH.MATH-NA]Mathematics [math]/Numerical Analysis [math.NA]

Relation: info:eu-repo/semantics/altIdentifier/arxiv/2005.08513; hal-02564349; https://hal.science/hal-02564349; https://hal.science/hal-02564349v3/document; https://hal.science/hal-02564349v3/file/clarke.pdf; ARXIV: 2005.08513

Availability: https://doi.org/10.1007/s11228-022-00638-z
https://hal.science/hal-02564349
https://hal.science/hal-02564349v3/document
https://hal.science/hal-02564349v3/file/clarke.pdf

Zugang prüfen (DOI)

18

Academic Journal

Distributed stochastic optimization with large delays

Authors: Zhou, Zhengyuan, Mertikopoulos, Panayotis, Bambos, Nicholas, Glynn, Peter, Ye, Yinyu

Contributors: New York University New York (NYU), NYU System (NYU), Performance analysis and optimization of LARge Infrastructures and Systems (POLARIS), Inria Grenoble - Rhône-Alpes, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire d'Informatique de Grenoble (LIG), Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA), Stanford University, ANR-16-CE33-0004,ORACLESS,Stratégies adaptatives d'allocation des ressources dans les réseaux sans fil dynamiques(2016), ANR-19-P3IA-0003,MIAI,MIAI @ Grenoble Alpes(2019), ANR-11-LABX-0025,PERSYVAL-lab,Systemes et Algorithmes Pervasifs au confluent des mondes physique et numérique(2011), ANR-19-CE48-0018,ALIAS,Apprentissage adaptatif multi-agent(2019), European Project: GAMENET

Superior Title: ISSN: 0364-765X.

Subject Terms: Distributed optimization, Delays, Stochastic gradient descent, Stochastic approximation, [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC]

Relation: info:eu-repo/semantics/altIdentifier/arxiv/2107.02919; hal-03342384; https://inria.hal.science/hal-03342384; https://inria.hal.science/hal-03342384/document; https://inria.hal.science/hal-03342384/file/Main.pdf; ARXIV: 2107.02919

Availability: https://doi.org/10.1287/moor.2021.1200
https://inria.hal.science/hal-03342384
https://inria.hal.science/hal-03342384/document
https://inria.hal.science/hal-03342384/file/Main.pdf

Zugang prüfen (DOI)

19

Book

Full Gradient DQN Reinforcement Learning: A Provably Convergent Scheme

Authors: Avrachenkov, Konstantin, Borkar, Vivek, S, Dolhare, Harsh, P, Patil, Kishor

Contributors: Network Engineering and Operations (NEO ), Inria Sophia Antipolis - Méditerranée (CRISAM), Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria), Department of Electrical Engineering IIT-Bombay (EE-IIT), Indian Institute of Technology Kanpur (IIT Kanpur), Projet PIA - ANSWER - FSN2 (P159564-2661789\DOS0060094), Alexey Piunovskiy, Yi Zhang

Superior Title: Modern Trends in Controlled Stochastic Processes: Theory and Applications, V.III ; https://inria.hal.science/hal-03462350 ; Alexey Piunovskiy; Yi Zhang. Modern Trends in Controlled Stochastic Processes: Theory and Applications, V.III, 41, Springer International Publishing, pp.192-220, 2021, Emergence, Complexity and Computation, 978-3-030-76928-4. ⟨10.1007/978-3-030-76928-4_10⟩ ; https://link.springer.com/book/10.1007/978-3-030-76928-4

Subject Terms: Markov Decision Process (MDP), approximate dynamic programming, Deep Reinforcement Learning (DRL), stochastic approximation, Deep Q-Network (DQN), Full Gradient DQN, Bellman error minimization, AMS 2000 subject classification: 93E35, 68T05, 90C40, 93E35, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC], [MATH.MATH-PR]Mathematics [math]/Probability [math.PR]

Relation: hal-03462350; https://inria.hal.science/hal-03462350; https://inria.hal.science/hal-03462350/document; https://inria.hal.science/hal-03462350/file/Full_Gradient_main.pdf

Availability: https://doi.org/10.1007/978-3-030-76928-4_1010.1007/978-3-030-76928-4
https://inria.hal.science/hal-03462350
https://inria.hal.science/hal-03462350/document
https://inria.hal.science/hal-03462350/file/Full_Gradient_main.pdf

Zugang prüfen (DOI)

20

Report

Non-Asymptotic Analysis of Stochastic Approximation Algorithms for Streaming Data

Authors: Godichon-Baggioni, Antoine, Werge, Nicklas, Wintenberger, Olivier

Contributors: Laboratoire de Probabilités, Statistique et Modélisation (LPSM (UMR_8001)), Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Université Paris Cité (UPCité)

Superior Title: https://hal.science/hal-03343481 ; 2022.

Subject Terms: streaming data, stochastic optimization, stochastic approximation, large-scale, machine learning, mini-batch, [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC], [STAT.ML]Statistics [stat]/Machine Learning [stat.ML]

Relation: info:eu-repo/semantics/altIdentifier/arxiv/2109.07117; hal-03343481; https://hal.science/hal-03343481; https://hal.science/hal-03343481v3/document; https://hal.science/hal-03343481v3/file/main.pdf; ARXIV: 2109.07117

Availability: https://hal.science/hal-03343481
https://hal.science/hal-03343481v3/document
https://hal.science/hal-03343481v3/file/main.pdf

Narrow Search