Matrix Calculus

来源:互联网 发布:centos net snmp 安装 编辑:程序博客网 时间:2024/05/17 22:18

Matrix Calculus

  • Matrix Calculus
    • Derivative of the vector with respect to vector
    • Derivative of a Scalar with Respect to Vector
    • Derivative of Vector with Respect to Scalar
      • EXAMPLE
    • Chain rule for Vectors
    • Derivative of Scalar with Respect to Matrix
      • EXAMPLE
    • Product rules for matrix-functions
    • Derivatives of Matrices Vectors and Scalar Forms
    • Derivatives of Traces
      • Basics

Derivative of the vector with respect to vector

x=x1x2xn,y=y1y2ym

yx=y1x1y1x2y1xny2x1y2x2y2xnymx1ymx2ymxn

Derivative of a Scalar with Respect to Vector

yx=yx1yx2yxn

Derivative of Vector with Respect to Scalar

yx=[y1xy2xymx]

EXAMPLE

1、

y=Ax
where A is a square matrix of order n
y=Ax=[a1a2an]x1x2xn=i=1n(aixi)yxj=i=1n(aixi)xj=(ajxj)xj=aTj,yx=AT

2、

y=xTA

y=xTA=[xTa1xTa2xTan]=[i=1nxiai1i=1nxiai2i=1nxiain]yxj=(yTxj)T=[i=1nxiai1i=1nxiai2i=1nxiain]TxjT=((aj)T)T=ajyx=A

3、

y=xTx

y=xTx=[x1x2xn]x1x2xn=(i=1nxixi)yxj=2xjyx=2x

4、

y=xTAx
where A is a square matrix of order n

y=xTAx=[xTa1xTa2xTan]x=j=1n((xTaj)xj)=j=1n(i=1nxiaij)xj

yxk=j=1n(i=1nxiaij)xjxk=j=1n((i=1nxiaij)xj+(i=1nxiaij)xj)xk=j=1n(akjxj)+i=1nxiaik=aTkx+akxyx=aT1x+a1xaT2x+a2xaTnx+anx=ATx+Ax

Chain rule for Vectors

x=x1x2xn,y=y1y2yr,z=z1z2zm

(zx)T=z1x1z2x1zmx1z1x2z2x2zmx2z1xnz2xnzmxn,zixj=q=1rziyqyqxj(zx)T=q=1rz1yqyqx1q=1rz2yqyqx1q=1rzmyqyqx1q=1rz1yqyqx2q=1rz2yqyqx2q=1rzmyqyqx1q=1rz1yqyqxnq=1rz2yqyqxnq=1rzmyqyqxn=z1y1z2y1zmy1z1y2z2y2zmy2z1yrz2yrzmyry1x1y2x1yrx1y1x2y2x2yrx2y1xny2xnyrxn=(zy)T(yx)T=(yxzy)Tzx=yxzy

Derivative of Scalar with Respect to Matrix

X=x11x21xm1x12x22xm2x1nx2nxmn,yX=yx11yx21yxm1yx12yx22yxm2yx1nyx2nyxmn=[yXij]

EXAMPLE

1、

y=tr(X)=i=1nxii,yX=I

2、

|Y|X,|Y|xrs=ij|Y|yijyijxrs=ijYijyijxrs
where where Yi,j is the cofactor of the element yi,j in |Y|

3、

URn×k,VRm×k,YURn×m

J(U,V)=UVTY2F+λ2(U2F+V2F)=i,j(aUiaVjaYij)2+λ2i,aU2ia+j,aV2ja

JUia=2j(aUiaVjaYij)Vja+λUia=2j(UiVTjYij)Vja+λUia=2(UiVTYi)V.a+λUia

JUi=2(UiVTYi)V+λUi

JU=2(UVTY)V+λUr

JV=2(UVTY)TU+λV

Product rules for matrix-functions

X(f(X)Tg(X))=X(f(X))g(X)+X(g(X))f(X)

Derivatives of Matrices, Vectors and Scalar Forms

Alt text这里写图片描述
Alt text
Alt text
Alt text
Alt text

Derivatives of Traces

Basics

Tr(A)=iAiiTr(AB)=Tr(BA)Tr(A+B)=Tr(A)+Tr(B)Tr(ABC)=Tr(BCA)=Tr(CAB)aTa=Tr(aaT)

Alt text
Alt text

总结记忆:若转置矩阵(向量)对原始矩阵(向量)求偏导—左右两边系数改变顺序,不转置
**若原始矩阵(向量)对原始矩阵(向量)求偏导—左右两边系数转置,顺序不变

0 0
原创粉丝点击