Random mask-based estimation of the distribution algorithm for stacked auto-encoder one-step pre-training. (August 2021)
- Record Type:
- Journal Article
- Title:
- Random mask-based estimation of the distribution algorithm for stacked auto-encoder one-step pre-training. (August 2021)
- Main Title:
- Random mask-based estimation of the distribution algorithm for stacked auto-encoder one-step pre-training
- Authors:
- Xu, Qingyang
Liu, Anbang
Yuan, Xianfeng
Song, Yong
Zhang, Chengjin
Li, Yibin - Abstract:
- Highlights: A stacked auto-encoder pre-training without gradient information. A Random Mask-based Estimation of Distribution Algorithm for large-scale optimization problem. A EDA based stacked auto-encoder one-step pre-training against layer-wise pretraining. Abstract: The deep learning techniques have received great achievements in computer vision, natural language processing, etc. The success of deep neural networks depends on the sufficient training of parameters. The traditional way of neural network training is a gradient-based algorithm, which suffers the disadvantage of gradient disappearing, especially for the deeper neural network. Recently, a heuristic algorithm has been proposed for deeper neural network optimization. In this paper, a random mask and elitism univariate continuous estimation of distribution algorithm based on the Gaussian model is proposed to pre-train staked auto-encoder, and then a Stochastic Gradient Descent (SGD) based fine-tuning process is carried out for local searching. In the improved estimation of the distribution algorithm, two individual update strategies are defined; one group of individuals is generated according to the constructed probabilistic model, and another is updated according to the statistics of advanced individuals that aim to reduce the probability of combination explosion and time consumption according to the mask information. In the simulations, different architectures, different mask ratios and different promisingHighlights: A stacked auto-encoder pre-training without gradient information. A Random Mask-based Estimation of Distribution Algorithm for large-scale optimization problem. A EDA based stacked auto-encoder one-step pre-training against layer-wise pretraining. Abstract: The deep learning techniques have received great achievements in computer vision, natural language processing, etc. The success of deep neural networks depends on the sufficient training of parameters. The traditional way of neural network training is a gradient-based algorithm, which suffers the disadvantage of gradient disappearing, especially for the deeper neural network. Recently, a heuristic algorithm has been proposed for deeper neural network optimization. In this paper, a random mask and elitism univariate continuous estimation of distribution algorithm based on the Gaussian model is proposed to pre-train staked auto-encoder, and then a Stochastic Gradient Descent (SGD) based fine-tuning process is carried out for local searching. In the improved estimation of the distribution algorithm, two individual update strategies are defined; one group of individuals is generated according to the constructed probabilistic model, and another is updated according to the statistics of advanced individuals that aim to reduce the probability of combination explosion and time consumption according to the mask information. In the simulations, different architectures, different mask ratios and different promising individual ratios are adopted to testify the effectiveness of the improved algorithm. According to simulation results, the estimation of thr distribution algorithm has a steady optimization ability for the shallow and stacked auto-encoder by one-step pre-training combining SGD based fine-tuning for the MNIST dataset. The proposed model will achieve a state-of-the-art performance on Fashion-MNIST. … (more)
- Is Part Of:
- Computers & industrial engineering. Volume 158(2021)
- Journal:
- Computers & industrial engineering
- Issue:
- Volume 158(2021)
- Issue Display:
- Volume 158, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 158
- Issue:
- 2021
- Issue Sort Value:
- 2021-0158-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-08
- Subjects:
- Deep neural network -- Stacked auto-encoder -- Estimation of distribution algorithm -- Random mask -- One-step pre-training
Engineering -- Data processing -- Periodicals
Industrial engineering -- Periodicals
620.00285 - Journal URLs:
- http://www.sciencedirect.com/science/journal/03608352 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.cie.2021.107400 ↗
- Languages:
- English
- ISSNs:
- 0360-8352
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.713000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 17323.xml