Neural Network Training as an Optimal Control Problem: An Augmented Lagrangian Approach - 九大コレクション | 九州大学附属図書館

＜会議発表論文＞
Neural Network Training as an Optimal Control Problem: An Augmented Lagrangian Approach

作成者	著者識別子 0000-0002-2317-9395 作成者名 Evens, Brecht 所属機関所属機関名 Department of Electrical Engineering ESAT-STADIUS, KU Leuven
	著者識別子 0000-0002-7969-8565 作成者名 Latafat, Puya 所属機関所属機関名 Department of Electrical Engineering ESAT-STADIUS, KU Leuven
	著者識別子 100021522 50898749 0000-0002-6044-0169 作成者名 Themelis, Andreas セメリス, アンドレアス所属機関所属機関名 Faculty of Information Science and Electrical Engineering (ISEE), Kyushu University 九州大学大学院システム情報科学研究院
	著者識別子 0000-0002-8846-6352 作成者名 Suykens, Johan 所属機関所属機関名 Department of Electrical Engineering ESAT-STADIUS, KU Leuven
	著者識別子 0000-0003-4824-7697 作成者名 Patrinos, Panagiotis 所属機関所属機関名 Department of Electrical Engineering (ESAT-STADIUS), KU Leuven
本文言語	英語
出版者	Institute of Electrical and Electronics Engineers (IEEE)
発行日	2021
収録物名	IEEE Conference on Decision and Control (CDC)
開始ページ	5136
終了ページ	5143
会議情報	会議名 IEEE Conference on Decision and Control (CDC) 回次 60 主催機関 Institute of Electrical and Electronics Engineers (IEEE) 開催期間 2021.12.14-17 開催地 Austin, TX テキサス州オースティン開催国アメリカ合衆国
出版タイプ	Accepted Manuscript
アクセス権	open access
権利関係	© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
関連DOI	以下の異版 https://doi.org/10.1109/CDC45484.2021.9682842
関連DOI
概要	Training of neural networks amounts to nonconvex optimization problems that are typically solved by using backpropagation and (variants of) stochastic gradient descent. In this work we propose an alte...rnative approach by viewing the training task as a nonlinear optimal control problem. Under this lens, backpropagation amounts to the sequential approach (single shooting) to optimal control, where the states variables have been eliminated. It is well known that single shooting may lead to ill conditioning, and for this reason the simultaneous approach (multiple shooting) is typically preferred. Motivated by this hypothesis, an augmented Lagrangian algorithm is developed that only requires an approximate solution to the Lagrangian subproblems up to a user-defined accuracy. By applying this framework to the training of neural networks, it is shown that the inner Lagrangian subproblems are amenable to be solved using Gauss-Newton iterations. To fully exploit the structure of neural networks, the resulting linear least squares problems are addressed by employing an approach based on forward dynamic programming. Finally, the eectiveness of our method is showcased on regression datasets.続きを見る

本文ファイル

ファイル	ファイルタイプ	サイズ	閲覧回数	説明
4785491	pdf	236 KB	211

詳細

PISSN	0743-1546
EISSN	2576-2370
レコードID	4785491
主題	Neural networks
	augmented Lagrangian method
	Gauss-Newton method
	dynamic programming
助成情報	助成機関名 Research Foundation Flanders (FWO) 研究課題番号 G0A0920N 研究課題名 research projects
	助成機関名 Research Foundation Flanders (FWO) 研究課題番号 G086518N 研究課題名 research projects
	助成機関名 Research Foundation Flanders (FWO) 研究課題番号 G086318N 研究課題名 research projects
	助成機関名 Research Foundation Flanders (FWO) 研究課題番号 1196820N 研究課題名 PhD grant
	助成機関名 Research Council KU Leuven 研究課題番号 C14/18/068 研究課題名 C1 project
	助成機関名 Fonds de la Recherche Scientifique (FNRS and the Fonds Wetenschappelijk Onderzoek) 研究課題番号 30468160 (SeLMA) 研究課題名 EOS project
登録日	2022.06.01
更新日	2023.06.16