論文

査読有り 筆頭著者 責任著者
2017年10月

Resolution of Singularities Introduced by Hierarchical Structure in Deep Neural Networks

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
  • Tohru Nitta

28
10
開始ページ
2282
終了ページ
2293
記述言語
英語
掲載種別
研究論文(学術雑誌)
DOI
10.1109/TNNLS.2016.2580741
出版者・発行元
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

We present a theoretical analysis of singular points of artificial deep neural networks, resulting in providing deep neural network models having no critical points introduced by a hierarchical structure. It is considered that such deep neural network models have good nature for gradient-based optimization. First, we show that there exist a large number of critical points introduced by a hierarchical structure in deep neural networks as straight lines, depending on the number of hidden layers and the number of hidden neurons. Second, we derive a sufficient condition for deep neural networks having no critical points introduced by a hierarchical structure, which can be applied to general deep neural networks. It is also shown that the existence of critical points introduced by a hierarchical structure is determined by the rank and the regularity of weight matrices for a specific class of deep neural networks. Finally, two kinds of implementation methods of the sufficient conditions to have no critical points are provided. One is a learning algorithm that can avoid critical points introduced by the hierarchical structure during learning (called avoidant learning algorithm). The other is a neural network that does not have some critical points introduced by the hierarchical structure as an inherent property (called avoidant neural network).

リンク情報
DOI
https://doi.org/10.1109/TNNLS.2016.2580741
Web of Science
https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000411293200006&DestApp=WOS_CPL
ID情報
  • DOI : 10.1109/TNNLS.2016.2580741
  • ISSN : 2162-237X
  • eISSN : 2162-2388
  • Web of Science ID : WOS:000411293200006

エクスポート
BibTeX RIS