Record Type:
Journal Article
Title:
The committee machine: computational to statistical gaps in learning a two-layers neural network

Main Title:
The committee machine: computational to statistical gaps in learning a two-layers neural network

Note:
This is the original and extended version of: Aubin B, Maillard A, Barbier J, Krzakala F, Macris N and Zdeborová L 2018 The committee machine: Computational to statistical gaps in learning a two-layers neural network, Advances in Neural Information Processing Systems 31, ed S Bengio et al (Red Hook, NY: Curran Associates, Inc) pp 3223–34. (20th December 2019)
Abstract:
Heuristic tools from statistical physics have been used in the past to locate the phase transitions and compute the optimal learning and generalization errors in the teacher-student scenario in multi-layer neural networks. In this paper, we provide a rigorous justification of these approaches for a two-layers neural network model called the committee machine, under a technical assumption. We also introduce a version of the approximate message passing (AMP) algorithm for the committee machine that allows optimal learning in polynomial time for a large set of parameters. We find that there are regimes in which a low generalization error is information-theoretically achievable while the AMP algorithm fails to deliver it, strongly suggesting that no efficient algorithm exists for those cases and unveiling a large computational gap.
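For readers unfamiliar with the teacher-student scenario the abstract refers to, the snippet below is a minimal sketch of data generation by a committee-machine teacher: K perceptron-like hidden units whose sign outputs are combined by a majority vote. The dimensions, the Gaussian weight prior, and the choice of an odd K are illustrative assumptions, not the exact setup of the paper.

import numpy as np

# Hedged sketch: data from a committee-machine teacher,
#   y = sign( sum_{k=1}^K sign(w_k . x / sqrt(n)) ),
# i.e. K perceptrons whose +/-1 votes are combined by majority.
# Sizes and Gaussian priors below are illustrative choices.

rng = np.random.default_rng(0)
n, K, m = 1000, 3, 5000                 # input dim, hidden units, samples

W = rng.standard_normal((K, n))         # teacher weights, i.i.d. Gaussian
X = rng.standard_normal((m, n))         # i.i.d. Gaussian inputs
hidden = np.sign(X @ W.T / np.sqrt(n))  # +/-1 activation of each hidden unit
y = np.sign(hidden.sum(axis=1))         # majority vote; odd K avoids ties

# A "student" network of the same architecture then learns W from the
# pairs (X, y); the AMP algorithm studied in the paper is one such learner.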