TRAINABILITY OF ReLU NETWORKS AND DATA-DEPENDENT INITIALIZATION

DC Field: Value (Language)
dc.contributor.author: Shin, Yeonjong (ko)
dc.contributor.author: Karniadakis, George Em (ko)
dc.date.accessioned: 2022-07-06T02:00:35Z
dc.date.available: 2022-07-06T02:00:35Z
dc.date.created: 2022-07-06
dc.date.issued: 2020
dc.identifier.citation: Journal of Machine Learning for Modeling and Computing, v.1, no.1, pp.39-74
dc.identifier.issn: 2689-3967
dc.identifier.uri: http://hdl.handle.net/10203/297251
dc.description.abstract: In this paper we study the trainability of rectified linear unit (ReLU) networks at initialization. A ReLU neuron is said to be dead if it only outputs a constant for any input. Two death states of neurons are introduced: tentative and permanent death. A network is then said to be trainable if the number of permanently dead neurons is sufficiently small for a learning task. We refer to the probability of a randomly initialized network being trainable as trainability. We show that a network being trainable is a necessary condition for successful training, and the trainability serves as an upper bound on training success rates. In order to quantify the trainability, we study the probability distribution of the number of active neurons at initialization. In many applications, overspecified or overparameterized neural networks are successfully employed and shown to be trained effectively. With the notion of trainability, we show that overparameterization is both a necessary and a sufficient condition for achieving a zero training loss. Furthermore, we propose a data-dependent initialization method in an overparameterized setting. Numerical examples are provided to demonstrate the effectiveness of the method and our theoretical findings.
dc.language: English
dc.publisher: BEGELL HOUSE Inc.
dc.title: TRAINABILITY OF ReLU NETWORKS AND DATA-DEPENDENT INITIALIZATION
dc.type: Article
dc.type.rims: ART
dc.citation.volume: 1
dc.citation.issue: 1
dc.citation.beginningpage: 39
dc.citation.endingpage: 74
dc.citation.publicationname: Journal of Machine Learning for Modeling and Computing
dc.identifier.doi: 10.1615/.2020034126
dc.contributor.localauthor: Shin, Yeonjong
dc.contributor.nonIdAuthor: Karniadakis, George Em
dc.description.isOpenAccess: N
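
The abstract above defines a ReLU neuron as dead if it outputs a constant for every input, and studies the number of active neurons at random initialization. As an illustration only (this is not the authors' code, and all names and sizes below are hypothetical), a minimal NumPy sketch of counting neurons in one hidden ReLU layer that never activate on any training input:

```python
# Hypothetical sketch: count "dead" ReLU neurons at random initialization.
# A hidden neuron is treated as dead here if its ReLU output is zero for
# every training input, i.e., it is constant over the training set.
import numpy as np

rng = np.random.default_rng(0)

n_in, n_hidden, n_samples = 2, 100, 500
X = rng.uniform(-1.0, 1.0, size=(n_samples, n_in))  # training inputs

# He-style random initialization with zero biases (a common choice).
W = rng.normal(0.0, np.sqrt(2.0 / n_in), size=(n_in, n_hidden))
b = np.zeros(n_hidden)

pre = X @ W + b                 # pre-activations, shape (n_samples, n_hidden)
active = (pre > 0).any(axis=0)  # neuron fires on at least one training input
n_dead = int((~active).sum())

print(f"dead neurons at initialization: {n_dead} / {n_hidden}")
```

A "tentatively" vs. "permanently" dead distinction, as introduced in the paper, would additionally depend on how training can revive a neuron; the check above only captures inactivity over the given data at initialization.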
Appears in Collection
MA-Journal Papers (Journal Papers)
Files in This Item
There are no files associated with this item.
