ESL-ICA


  • Independent Component Analysis
    • ICA & Maximum Likelihood
      • Sigmoid function
    • FastICA
      • Whitening
      • Python Implementation

Independent Component Analysis

Suppose $x_{p\times 1}$ is the observed vector and $s_{p\times 1}$ is the source vector. Then:

$$x = A s \tag{1}$$

A key distinction here is the relationship between independence and uncorrelatedness:

$$p(x,y) = p(x)\,p(y) \;\Rightarrow\; E[xy] = E[x]\,E[y]$$

The left-hand side is the defining (necessary and sufficient) condition for independence; the right-hand side is the condition for uncorrelatedness. Independence implies uncorrelatedness, but not conversely.
The difference between ICA and PCA is that ICA requires the sources $s$ to be independent, whereas PCA only requires them to be uncorrelated. ICA additionally requires $s$ to follow a non-Gaussian distribution: for Gaussian sources, uncorrelatedness already gives independence, and any orthogonal rotation of independent Gaussian sources is again independent Gaussian, so the mixing matrix cannot be identified.
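
As a quick side illustration (my own sketch, not from the original post), the following numpy snippet builds two variables that are uncorrelated yet completely dependent, which is exactly why uncorrelatedness is a strictly weaker requirement than independence:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(-1, 1, 100_000)
y = x ** 2                      # y is a deterministic function of x

# Sample covariance is (numerically) zero, so x and y are uncorrelated ...
print(np.cov(x, y)[0, 1])       # ~0
# ... yet they are clearly not independent: knowing x fixes y exactly.
```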

ICA & Maximum Likelihood

To recover $s$ we need to solve for $A$, or equivalently for $W = A^{-1}$. Suppose $s$ has density $p_s(s)$; then the density of $x$ is:

$$p_x(x) = p_s(Wx)\,|W| \tag{2}$$

where $x = [x_1, x_2, \ldots, x_p]^T$ and $s = [s_1, s_2, \ldots, s_p]^T$.

Write:

$$W = \begin{bmatrix} w_1^T \\ w_2^T \\ \vdots \\ w_p^T \end{bmatrix}, \qquad s = Wx$$

Then:

$$s_i = w_i^T x, \quad i = 1, 2, \ldots, p, \qquad p_x(x) = |W|\,\prod_{i=1}^{p} p_s\bigl(w_i^T x\bigr)$$
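
To make equation (2) concrete, here is a small sketch (my own addition) that evaluates the factorized density for a toy unmixing matrix; the source density $p_s$ is assumed to be the logistic pdf introduced in the next section:

```python
import numpy as np

def p_s(s):
    # assumed source density: derivative of the logistic sigmoid (see next section)
    sig = 1.0 / (1.0 + np.exp(-s))
    return sig * (1.0 - sig)

W = np.array([[2.0, 0.5],
              [0.3, 1.5]])        # toy unmixing matrix (hypothetical values)
x = np.array([0.2, -0.4])         # one observation

# p_x(x) = |det W| * prod_i p_s(w_i^T x)
density = np.abs(np.linalg.det(W)) * np.prod(p_s(W @ x))
print(density)
```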

Sigmoid function

We do not know the true form of $p_s(s)$, so we posit a prior. It cannot be Gaussian; instead we take the sigmoid function as the cdf of $s$:

$$F_s(s) = \mathrm{sig}(s) = \frac{1}{1+e^{-s}}, \qquad p_s(s) = \mathrm{sig}'(s) = \mathrm{sig}(s)\bigl(1-\mathrm{sig}(s)\bigr)$$

[Figure: the sigmoid function $\mathrm{sig}(s)$]
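
A quick numerical sanity check (my own addition) that the pdf obtained this way really is $\mathrm{sig}(s)(1-\mathrm{sig}(s))$, by finite-differencing the sigmoid cdf:

```python
import numpy as np

def sig(s):
    return 1.0 / (1.0 + np.exp(-s))

s = np.linspace(-5, 5, 101)
eps = 1e-6
numeric = (sig(s + eps) - sig(s - eps)) / (2 * eps)   # d/ds of the cdf
analytic = sig(s) * (1 - sig(s))                      # claimed pdf
print(np.max(np.abs(numeric - analytic)))             # ~1e-11
```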
Given a data set $X_{n\times p}$, the maximum-likelihood derivation with respect to $W$ proceeds as follows:
$$L(W) = \prod_{i=1}^{n} p_x\bigl(x^{(i)}\bigr) = \prod_{i=1}^{n} \left( \prod_{j=1}^{p} p_s\bigl(w_j^T x^{(i)}\bigr) \right) |W| = \prod_{i=1}^{n} \left( \prod_{j=1}^{p} \mathrm{sig}'\bigl(w_j^T x^{(i)}\bigr) \right) |W|$$

$$\ell(W) = \sum_{i=1}^{n} \left( \sum_{j=1}^{p} \ln \mathrm{sig}'\bigl(w_j^T x^{(i)}\bigr) + \ln |W| \right)$$

$$\frac{\partial \ell}{\partial W} = \sum_{i=1}^{n} \left( \begin{bmatrix} 1 - 2\,\mathrm{sig}(w_1^T x^{(i)}) \\ \vdots \\ 1 - 2\,\mathrm{sig}(w_p^T x^{(i)}) \end{bmatrix} \bigl(x^{(i)}\bigr)^T + \bigl(W^T\bigr)^{-1} \right)$$

using $\dfrac{d}{dz}\ln\mathrm{sig}'(z) = \dfrac{\mathrm{sig}''(z)}{\mathrm{sig}'(z)} = 1 - 2\,\mathrm{sig}(z)$ and $\dfrac{\partial \ln|W|}{\partial W} = \bigl(W^{-1}\bigr)^T$.

The stochastic gradient ascent update, for a single training example $x^{(i)} \in \mathbb{R}^p$, is:

$$W_{\text{new}} = W_{\text{old}} + \alpha \left( \begin{bmatrix} 1 - 2\,\mathrm{sig}(w_1^T x^{(i)}) \\ 1 - 2\,\mathrm{sig}(w_2^T x^{(i)}) \\ \vdots \\ 1 - 2\,\mathrm{sig}(w_p^T x^{(i)}) \end{bmatrix} \bigl(x^{(i)}\bigr)^T + \bigl(W^T\bigr)^{-1} \right), \qquad \alpha > 0$$
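
Below is a minimal sketch (mine, separate from the FastICA implementation later in the post) of this gradient-ascent update in numpy; the function name `ml_ica_sgd` and the learning-rate/epoch defaults are just illustrative choices:

```python
import numpy as np

def sig(z):
    return 1.0 / (1.0 + np.exp(-z))

def ml_ica_sgd(X, alpha=0.001, n_epochs=50, seed=0):
    """X: n x p data matrix (rows are observations). Returns the unmixing matrix W."""
    n, p = X.shape
    rng = np.random.default_rng(seed)
    W = np.eye(p) + 0.1 * rng.standard_normal((p, p))   # start near identity
    for _ in range(n_epochs):
        for i in rng.permutation(n):
            x = X[i]                                     # one training example, shape (p,)
            grad = np.outer(1 - 2 * sig(W @ x), x) + np.linalg.inv(W.T)
            W += alpha * grad                            # gradient *ascent* on the log-likelihood
    return W
```

The recovered sources for each example are then $s^{(i)} = W x^{(i)}$; as with any ICA method, they are identified only up to permutation and scaling.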

FastICA

The algorithm proceeds as follows: whiten the data, then repeat the fixed-point update of $W$ followed by symmetric decorrelation until convergence. The full implementation is given below.

Whitening

Definition of whitening:
Let $x$ be a random vector. If a linear transform $V$ maps it to $z = Vx$ such that $E[zz^T] = I$, then $V$ is a whitening matrix.

The covariance of $x$ is

$$C_x = E[xx^T] = PDP^{-1}$$

With the eigenvectors in $P$ normalized to be orthonormal, this becomes

$$C_x = E[xx^T] = PDP^{T}$$

so we can take $V = D^{-1/2}P^T$.
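
A short check (my own sketch) that $V = D^{-1/2}P^T$ does whiten: apply it to correlated samples and confirm that the covariance of $z = Vx$ is close to the identity:

```python
import numpy as np

rng = np.random.default_rng(0)
# correlated 2-D data, rows = variables, columns = samples
A = np.array([[0.8, 0.2], [-0.3, -0.7]])
X = A @ rng.standard_normal((2, 10_000))
X -= X.mean(axis=1, keepdims=True)

C = X @ X.T / X.shape[1]            # sample covariance C_x
d, P = np.linalg.eigh(C)            # C = P diag(d) P^T, P orthonormal
V = np.diag(d ** -0.5) @ P.T        # whitening matrix V = D^{-1/2} P^T

Z = V @ X
print(np.round(Z @ Z.T / Z.shape[1], 3))   # ~ identity matrix
```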

Python Implementation

```python
#!/usr/bin/env python
# FastICA from the ICA book, table 8.4
import math
import random
import matplotlib.pyplot as plt
from numpy import *

n_components = 2

def f1(x, period=4):
    # sawtooth-like source signal
    return 0.5 * (x - math.floor(x / period) * period)

def create_data():
    # number of samples
    n = 500
    # time points
    T = [0.1 * xi for xi in range(0, n)]
    # source signals: a sinusoid and a sawtooth
    S = array([[sin(xi) for xi in T], [f1(xi) for xi in T]], float32)
    # mixing matrix
    A = array([[0.8, 0.2], [-0.3, -0.7]], float32)
    return T, S, dot(A, S)

def whiten(X):
    # zero mean
    X_mean = X.mean(axis=-1)
    X -= X_mean[:, newaxis]
    # whitening matrix V = D^{-1/2} E^T from the eigendecomposition of X X^T
    A = dot(X, X.transpose())
    D, E = linalg.eig(A)
    D2 = linalg.inv(array([[D[0], 0.0], [0.0, D[1]]], float32))
    D2[0, 0] = sqrt(D2[0, 0]); D2[1, 1] = sqrt(D2[1, 1])
    V = dot(D2, E.transpose())
    return dot(V, X), V
```

```python
def _logcosh(x, fun_args=None, alpha=1):
    # contrast function g(x) = tanh(alpha * x), computed in place
    gx = tanh(alpha * x, x)
    # g'(x) = alpha * (1 - tanh(alpha * x)^2)
    g_x = gx ** 2
    g_x -= 1.
    g_x *= -alpha
    return gx, g_x.mean(axis=-1)
```

This returns $g(x)$ and $E[g'(x)]$: for the log-cosh contrast, $g(x)=\tanh(\alpha x)$ and $g'(x)=\alpha\bigl(1-\tanh^2(\alpha x)\bigr)$.


```python
def do_decorrelation(W):
    # symmetric decorrelation: W <- (W W^T)^{-1/2} W
    s, u = linalg.eigh(dot(W, W.T))
    return dot(dot(u * (1. / sqrt(s)), u.T), W)
```

A trick is used here:

$$WW^T = U D^2 U^T$$

where the columns of $U$ are the eigenvectors of $WW^T$ and $D^2$ its eigenvalues. Then

$$[WW^T]^{-1} = U D^{-2} U^T, \qquad [WW^T]^{-1/2} = U D^{-1} U^T$$
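
As a quick check (my own sketch, mirroring the construction in `do_decorrelation`), symmetric decorrelation leaves the result with orthonormal rows, i.e. $W_1 W_1^T = I$:

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.standard_normal((3, 3))

# W1 = (W W^T)^{-1/2} W via the eigendecomposition W W^T = U D^2 U^T
s, u = np.linalg.eigh(W @ W.T)
W1 = (u * (1.0 / np.sqrt(s))) @ u.T @ W

print(np.round(W1 @ W1.T, 6))   # ~ identity matrix
```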


```python
def do_fastica(X):
    n, m = X.shape; p = float(m); g = _logcosh
    # rescale the whitened data
    X *= sqrt(X.shape[1])
    # initialize W
    W = ones((n, n), float32)
    for i in range(n):
        for j in range(i):
            W[i, j] = random.random()
    # fixed-point iteration for W
    maxIter = 200
    for ii in range(maxIter):
        gwtx, g_wtx = g(dot(W, X))
        W = dot(gwtx, X.T) / p - g_wtx[:, newaxis] * W
```

$$w_i \leftarrow E\bigl[x\, g(w_i^T x)\bigr] - E\bigl[g'(w_i^T x)\bigr]\, w_i$$

```python
        W1 = do_decorrelation(W)
```

$$W_1 = \bigl(WW^T\bigr)^{-1/2}\, W$$

```python
        # converged when the rows of W stop rotating
        lim = max(abs(abs(diag(dot(W1, W.T))) - 1))
        W = W1
        if lim < 0.0001:
            break
    return W

def show_data(T, S):
    plt.plot(T, [S[0, i] for i in range(S.shape[1])], marker="*")
    plt.plot(T, [S[1, i] for i in range(S.shape[1])], marker="o")
    plt.show()

def main():
    T, S, D = create_data()
    Dwhiten, K = whiten(D)
    W = do_fastica(Dwhiten)
    # Sr: reconstructed sources
    Sr = dot(dot(W, K), D)
    show_data(T, D)   # mixed observations
    show_data(T, S)   # original sources
    show_data(T, Sr)  # recovered sources

if __name__ == "__main__":
    main()
```

[Figures: the mixed observations $D$, the original sources $S$, and the recovered sources $S_r$]
