This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1. Introduction
The nonconvex global consensus problem with regularization [1] has the following form:
In many practical applications,
Note that the problem (2) owns
The augmented Lagrangian function with multipliers
ADMM was initially introduced in the 1970s [3, 4], and its convergence properties for convex case have been extensively studied. However, ADMM or its directly extended version may not converge when there is a nonconvex function in the objective. Yang et al. [5] studied the convergence of the ADMM for the nonconvex optimization model which come from the background/foreground extraction. Hong et al. [6] analyzed the convergence of alternating direction method of multipliers for a family of nonconvex problems. Guo et al. [7] studied the convergence of ADMM for multiblock nonconvex separable optimization models.
Recently, some scholars studied the inertial type of ADMM for convex optimization. For example, Chen et al. [8] analyzed a class of inertial ADMM for linearly constrained separable convex optimization, and Moudafi and Elissabeth [9] extended the inertial technique to solve the maximal monotone operator inclusion problem. The research interests for the nonconvex cases are increasing in recent years; e.g., Chao et al. [10] proposed and analyzed an inertial proximal ADMM for a class of nonconvex optimization problems while all the above inertial ADMM algorithms were presented for solving only two-block optimization problem (not for multiple-block case). Whether the convergence of the inertial ADMM is assured when the involved number of blocks is more than two? It is an important problem to research.
The purpose of the present study is to examine the convergence of inertial ADMM with multiblocks for nonconvex consensus problem under the assumption that the potential function satisfies the Kurdyka–Lojasiewicz property. The preliminary numerical results show the effectiveness of the proposed algorithm.
The rest of this paper is organized as follows. In Section 2, some necessary preliminaries for further analysis are summarized. Section 3 proposes a multiblock nonconvex inertial ADMM algorithm and analyzes its convergence under some conditions. In Section 4, we prove the validity of the algorithm by the numerical experiment. Finally, some conclusions are drawn in Section 5.
2. Preliminaries
Let
For a set
The Lagrangian function of (2), with multiplier
Definition 1.
If
A very important technique to prove the convergence of the ADMM for nonconvex optimization problems relies on the assumption that the potential function satisfying the following Kurdyka–Lojasiewicz property (KL property) [11–14].
For notational simplicity, we use
(i)
(ii)
Definition 2.
(see [14]) (KL property). Let
3. Algorithm and Convergence Analysis
For convenience, we fix the following notations:
Algorithm 1.
Inertial ADMM (IADMM). Choose
From the optimality conditions of (8) (a) and (8) (b), we have
Remark 1.
Compared with the inertial ADMM in [10], each subproblem in our algorithm has the inertial term, and we handle multiblock case here.
Subsequently, we will discuss the convergence of Algorithm 1 under the following assumptions.
Assumption 1.
(i)
(ii)
Lemma 1.
For each
Proof.
Since
Thus,
Hence, the result is obtained.
Lemma 2.
Select
Proof.
By the definition of the augmented Lagrangian function, (8) (c) and (15), we have
From (8) (a) and (8) (b), we obtain
Therefore, we have
Adding up (17) and (20), by the Assumption 1 (ii), we have
Then, the results are obtained.
Remark 2.
From Assumption 1 (ii), we know that
If we take
From Lemma 2, we have
Lemma 3.
If the sequence
Proof.
Since the sequence
From Lemma 2, it yields
Hence,
Consequently,
Lemma 4.
There exists
Proof.
From the definition of
From Lemma 1 and the optimality conditions, we get
From (29) and (30), we obtain
Thus,
It follows from Assumption 1 and Lemma 1 that there exists
Lemma 5.
Let
And if
Proof.
In view of the definition of
Let
Since
Let
That is,
Thus,
Since
From above, we get
Together with the continuity of
Thus,
From (37) and Lemma 5, we have
Therefore, from (39) and the descent of
Thus,
Theorem 1.
Let
Proof.
By Lemma 5, we have
(i) If there exist an integer
Thus, for any
(ii) Assume that
Thus, when
In view of
From the concavity of
Since
Let
Since
From (48) and (49), we obtain
Summing up the above formula for
Notice that
Thus,
From Lemma 1, we get
By Lemma 5, we conclude that the sequences
4. Numerical Experiment
In this section, we present the results of a simple numerical example to verify the effectiveness of Algorithm 1. We consider the following compressive sense problem, which takes the following form:
Let
Simplifying the procedures (58), we obtain the closed-form iterative formulas:
The experimental data are generated as follows. We use distributed computing toolbox in MATLAB, and the purpose is to achieve simple distributed computing. Suppose the feature matrix
Table 1
Comparison among two algorithms under different parameters.
| Iter. | Time (s) | |||
| Case 1 (ADMM) | 191 | 22.725124 | 600 | |
| Case 2 (IADMM) | 137 | 16.41164 | ||
| Case 3 (IADMM) | 159 | 18.777039 | ||
| Case 4 (ADMM) | 191 | 22.926591 | 500 | |
| Case 5 (IADMM) | 163 | 19.139843 | ||
| Case 6 (IADMM) | 178 | 21.54476 |
The values of
[figure(s) omitted; refer to PDF]
where
From Table 1, and Figures 1 and 2, we can see that ADMM converges more slowly than IADMM since “Iter.” of ADMM bigger than that of IADMM under the same conditions. Finally, numerical results show that the algorithm is feasible and effective.
5. Conclusion
In this paper, inspired by the application of nonconvex global consensus problem with regularization, we propose multiblock inertial ADMM algorithm for solving certain nonconvex global consensus problems. We have proven its convergence under some suitable conditions, and it turns out that any cluster point of the sequence generated by the proposed algorithm is a critical point. Numerical experiment is conducted to illustrate the effectiveness of the multiblock inertial ADMM (IADMM) algorithm. Its potential of the flexible multiblock inertial ADMM to analyze and design other types of nonconvex case, as well as a more thorough computational study, are topics of our further research.
Acknowledgments
This work was supported by the National Natural Science Foundation of China (72071130; 71901145); National Social Science Fund Major Project of China (21&ZD200; 20&ZD199); Humanities and Social Sciences Research Project of the Ministry of Education (20YJC820030) and China Postdoctoral Science Foundation (2021M692047).
[1] S. Boyd, N. Parikh, E. Chu, B. Peleato, J. Eckstein, "Distributed optimization and statistical learning via the alternating direction method of multipliers," Foundations and Trends in Machine Learning, vol. 3,DOI: 10.1561/2200000016, 2010.
[2] G. Li, T. K. Pong, "Global convergence of splitting methods for nonconvex composite optimization," SIAM Journal on Optimization, vol. 25 no. 4, pp. 2434-2460, DOI: 10.1137/140998135, 2015.
[3] R. Glowinski, A. Marroco, "Sur l'approximation, par éléments finis d'ordre un, et la résolution, par pénalisation-dualité d'une classe de problèmes de Dirichlet non linéaires," Revue française d'automatique, informatique, recherche opérationnelle. Analyse numérique, vol. 9 no. 2, pp. 41-76, DOI: 10.1051/m2an/197509r200411, 1975.
[4] D. Gabay, B. Mercier, "A dual algorithm for the solution of nonlinear variational problems via finite element approximation," Computers & Mathematics with Applications, vol. 2 no. 1, pp. 17-40, DOI: 10.1016/0898-1221(76)90003-1, 1976.
[5] L. Yang, T. K. Pong, X. Chen, "Alternating direction method of multipliers for a class of nonconvex and nonsmooth problems with applications to background/foreground extraction," SIAM Journal on Imaging Sciences, vol. 10 no. 1, pp. 74-110, DOI: 10.1137/15m1027528, 2017.
[6] M. Hong, Z. Q. Luo, M. Razaviyayn, "Convergence analysis of alternating direction method of multipliers for a family of nonconvex problems," SIAM Journal on Optimization, vol. 26 no. 1, pp. 337-364, DOI: 10.1137/140990309, 2016.
[7] K. Guo, D. Han, D. Z. W. Wang, T. Wu, "Convergence of ADMM for multi-block nonconvex separable optimization models," Frontiers of Mathematics in China, vol. 12 no. 5, pp. 1139-1162, DOI: 10.1007/s11464-017-0631-6, 2017.
[8] C. H. Chen, R. H. Chan, S. Q. Ma, J. F. Yang, "Inertial proximal ADMM for linearly constrained separable convex optimization," SIAM Journal on Imaging Sciences, vol. 8 no. 4, pp. 2239-2267, DOI: 10.1137/15100463x, 2015.
[9] A. Moudafi, E. Elissabeth, "Approximate inertial proximal methods using the enlargement of maximal monotone operators," International Journal of Pure and Applied Mathematics, vol. 5 no. 3, pp. 283-299, 2003.
[10] M. Chao, Y. Zhang, J. Jian, "An inertial proximal alternating direction method of multipliers for nonconvex optimization," International Journal of Computer Mathematics, vol. 98, 2021.
[11] K. Kurdyka, "On gradients of functions definable in o-minimal structures," Annales de l'Institut Fourier, vol. 48 no. 3, pp. 769-783, DOI: 10.5802/aif.1638, 1998.
[12] S. Lojasiewicz, "Sur la geométrie semi-et sous-analytique," Annales de l'Institut Fourier, vol. 43 no. 5, pp. 1575-1595, DOI: 10.5802/aif.1384, 1993.
[13] S. Lojasiewicz, "Une propriété topologique des sous-ensembles analytiques réels," Les Équations Aux Dérivées Partielles, 1963.
[14] J. Bolte, A. Daniilidis, O. Ley, L. Mazet, "Characterizations of Łojasiewicz inequalities: subgradient flows, talweg, convexity," Transactions of the American Mathematical Society, vol. 362 no. 6, pp. 3319-3363, DOI: 10.1090/s0002-9947-09-05048-x, 2009.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright © 2023 Yang Liu and Yazheng Dang. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/
Abstract
The alternating direction method of multipliers (ADMM) is one of the most powerful and successful methods for solving various nonconvex consensus problem. The convergence of the conventional ADMM (i.e., 2-block) for convex objective functions has been stated for a long time. As an accelerated technique, the inertial effect was used by many authors to solve 2-block convex optimization problem. This paper combines the ADMM and the inertial effect to construct an inertial alternating direction method of multipliers (IADMM) to solve the multiblock nonconvex consensus problem and shows the convergence under some suitable conditions. Simulation experiment verifies the effectiveness and feasibility of the proposed method.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Details
; Dang, Yazheng 2
1 Department of Information Science and Technology, East China University of Political Science and Law, Shanghai 200237, China; China Institute for Smart Court, Shanghai Jiao Tong University, Shanghai 200030, China
2 School of Management, University of Shanghai for Science and Technology, Shanghai 200093, China





