Structure learning in inverse Ising problems using ℓ2-regularized linear estimator. (27th May 2021)
- Record Type:
- Journal Article
- Title:
- Structure learning in inverse Ising problems using ℓ2-regularized linear estimator. (27th May 2021)
- Main Title:
- Structure learning in inverse Ising problems using ℓ2-regularized linear estimator
- Authors:
- Meng, Xiangming
Obuchi, Tomoyuki
Kabashima, Yoshiyuki
- Abstract:
- The inference performance of the pseudolikelihood method is discussed in the framework of the inverse Ising problem when the ℓ2-regularized (ridge) linear regression is adopted. This setup is introduced for theoretically investigating the situation where the data generation model is different from the inference one, namely the model mismatch situation. In the teacher-student scenario under the assumption that the teacher couplings are sparse, the analysis is conducted using the replica and cavity methods, with a special focus on whether the presence/absence of teacher couplings is correctly inferred or not. The result indicates that despite the model mismatch, one can perfectly identify the network structure using naive linear regression without regularization when the number of spins N is smaller than the dataset size M, in the thermodynamic limit N → ∞. Further, to access the underdetermined region M < N, we examine the effect of the ℓ2 regularization, and find that biases appear in all the coupling estimates, preventing the perfect identification of the network structure. We, however, find that the biases are shown to decay exponentially fast as the distance from the center spin chosen in the pseudolikelihood method grows. Based on this finding, we propose a two-stage estimator: in the first stage, the ridge regression is used and the estimates are pruned by a relatively small threshold; in the second stage the naive linear regression is conducted only on the remaining couplings, and the resultant estimates are again pruned by another relatively large threshold. This estimator with the appropriate regularization coefficient and thresholds is shown to achieve the perfect identification of the network structure even in 0 < M / N < 1. Results of extensive numerical experiments support these findings.
- Is Part Of:
- Journal of statistical mechanics. (2021:May)
- Journal:
- Journal of statistical mechanics
- Issue:
- (2021:May)
- Issue Display:
- Volume 1000077 (2021)
- Year:
- 2021
- Volume:
- 1000077
- Issue Sort Value:
- 2021-1000077-0000-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-05-27
- Subjects:
- inference of graphical models -- learning theory -- machine learning -- network reconstruction
Statistical mechanics -- Periodicals
Mechanics -- Statistical methods -- Periodicals
530.1305
- Journal URLs:
- http://ioppublishing.org/
- DOI:
- 10.1088/1742-5468/abfa10
- Languages:
- English
- ISSNs:
- 1742-5468
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 16664.xml
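
The two-stage estimator summarised in the abstract (ridge regression and a small pruning threshold, then unregularized regression on the surviving couplings and a larger threshold) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `two_stage_support`, the 1/M normalization of the Gram matrix, and the values of `lam`, `tau1` and `tau2` are assumptions chosen for illustration, and the toy data are a deterministic sign table rather than samples drawn from an Ising model.

```python
import itertools
import numpy as np


def two_stage_support(samples, center, lam, tau1, tau2):
    """Estimate which spins are coupled to spin `center` (hypothetical helper).

    samples: (M, N) array of +/-1 spin configurations.
    Returns a boolean array of length N (always False at `center`).
    """
    samples = np.asarray(samples, dtype=float)
    m, n = samples.shape
    others = np.array([j for j in range(n) if j != center])
    X = samples[:, others]   # all spins except the center one
    y = samples[:, center]   # the center spin is the regression target

    # Stage 1: ridge regression, then prune with the smaller threshold tau1.
    gram = X.T @ X / m
    w_ridge = np.linalg.solve(gram + lam * np.eye(n - 1), X.T @ y / m)
    keep = np.abs(w_ridge) > tau1

    # Stage 2: unregularized least squares on the survivors only,
    # then prune with the larger threshold tau2.
    support = np.zeros(n, dtype=bool)
    if keep.any():
        w_ols, *_ = np.linalg.lstsq(X[:, keep], y, rcond=None)
        support[others[keep]] = np.abs(w_ols) > tau2
    return support


# Toy data (assumption): every +/-1 pattern of four spins, with spin 0
# set to an exact copy of spin 1, so spin 1 is the only true neighbour.
patterns = np.array(list(itertools.product([-1, 1], repeat=4)))
samples = np.column_stack([patterns[:, 0], patterns])
found = two_stage_support(samples, center=0, lam=0.1, tau1=0.05, tau2=0.5)
print(found)
```

On this toy input only spin 1 survives both pruning steps. Thresholds suited to real Ising samples would depend on the coupling strength and on the ratio M / N, which this sketch does not attempt to tune.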