Back To ADMM Home.

7.1 Global Variable Consensus

Corresponding to the distribution across samples in the next chapter.

Consider the problem with one single objective function, but can be distributed into several parts:

minimizef(x)=Ni=1fi(x)

We can always extend the definition of the function fi to be plus infinity if x is out of its domain. The advantage of ADMM is that each split of the objective function could be implemented into different processors, such that the algorithm could run in parallex.

Then we can rewrite the problem into a global consensus problem (since that the constraints are that all the local variables should agree) :

minimizeNi=1fi(xi)subject toxi=z, i=1,...,N

The augmented Lagrangian is :

L(x,z,y)=Ni=1fi(xi)+yTi(xiz)+(ρ/2)xiz22)

The updates of ADMM will be :

xk+1i:=argminxi(fi(xi)+ykTi(xizk)+(ρ/2)xizk22)zk+1:=argminzNi=1(ykTi(xk+1iz)+(ρ/2)xk+1iz22)yk+1i:=yki+ρ(xk+1izk+1)

The update of z could be further simplified by :

zk+1=argminzNi=1(ykTi(xk+1iz)+(ρ/2)xk+1iz22)=argminzNi=1(yki+ρxk+1i)Tz+(ρ/2)NzTz zk+1=(1/N)Ni=1(xk+1i+(1/ρ)yki)

we can further simpliy it by introcuding the notation of averages:

zk+1:=ˉxk+1+(1/ρ)ˉyk ˉyk+1:=ˉyki+ρ(ˉxk+1zk+1)=0

So we have :

zk+1:=ˉxk+1

Finally we have the ADMM updates:

xk+1i:=argminxi(fi(xi)+ykTi(xiˉxk)+(ρ/2)xiˉxk22)zk+1=ˉxk+1yk+1i:=yki+ρ(xk+1iˉxk+1)

7.2 Sharing

Corresponding to the distribution across features in the next chapter.

minimizeNi=1fi(xi)+g(Ni=1xi)

In the next chapter, we will have function f corresponding to the regularization term, and g corresponding to the objective function splitted by feature subsets.

The ADMM form the equivalent problem is, where each subset corresponding to a subset of features:

minimizeNi=1fi(xi)+g(Ni=1zi)subject toxi=zi, i=1,...,N

The scaled form of ADMM is :

xk+1i:=argminxi(fi(xi)+(ρ/2)xizki+uki22)zk+1:=argminz(g(Ni=1zi)+(ρ/2)Ni=1xk+1izi+uki22)uk+1i:=uki+xk+1izk+1i

We can further simplify the z update by the problem:

minimizeg(Nˉz)+(ρ/2)Ni=1ziai22subject toˉz=(1/N)Ni=1zi

Where ai=uki+xk+1i, minimize the objective function while fixing the ˉz, we have the lagrangian is:

L(zi,λ)=g(Nˉz)+(ρ/2)Ni=1ziai22+λT(ˉz(1/N)Ni=1zi)

From the first order optimal condition we have :

ρ(ziai)λ/N=0

With the dual function d being:

d(λ)=g(Nˉz)+Nρ/2λρN22+λT(ˉz(1/N)Ni=1(ai+λρN))

Using the first order optimal condition of the dual function :

(ˉzˉa)λρN=0

Finally, we have :

zi=ai+(ˉzˉa)

To solve ˉz, we solve :

minimizeg(Nˉz)+(ρ/2)Ni=1ˉzˉa22=g(Nˉz)+(Nρ/2)ˉzˉa22

And applying the update of zi into the udpate of x , we have the ADMM udpates expression:

xk+1i:=argminxi(fi(xi)+(ρ/2)xixki+ˉxkˉzk+ˉuk22)ˉzk+1:=argminˉz(g(Nˉz)+(Nρ/2)ˉzˉukˉxk+122)ˉuk+1:=ˉuk+ˉxk+1ˉzk+1

Back To ADMM Home.