Dimensionality Reduction

Too many features can lead to worse performance: distance measures become less meaningful and the incidence of outliers increases. Often the data can be represented in a lower-dimensional space. Dimensionality can be reduced either by selecting a subset of the features (feature elimination) or by combining features through linear and non-linear transformations.
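As a minimal sketch of the feature-elimination route, assuming labelled data x_train, y_train, scikit-learn's SelectKBest keeps only the highest-scoring features:

from sklearn.feature_selection import SelectKBest, f_classif
selector = SelectKBest(score_func=f_classif, k=10)  # keep the 10 highest-scoring features
x_sel = selector.fit_transform(x_train, y_train)    # scores each feature against the labels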

PCA

Principal Component Analysis (PCA) is a dimensionality reduction technique. It is a linear transformation that projects the data onto a lower-dimensional space. Let direction $v_1$ with eigenvalue $\lambda_1$ (the variance along it) be the first principal component. $v_1$ is perpendicular to $v_2$, the second principal component, which has eigenvalue $\lambda_2 \le \lambda_1$.
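As an illustrative sketch (assuming x_train is a samples-by-features NumPy array), the directions $v_i$ and variances $\lambda_i$ can be read off the eigendecomposition of the covariance matrix:

import numpy as np

X = x_train - x_train.mean(axis=0)                    # center the data first
eigvals, eigvecs = np.linalg.eigh(np.cov(X, rowvar=False))
order = np.argsort(eigvals)[::-1]                     # eigh returns ascending order; sort descending
v1, v2 = eigvecs[:, order[0]], eigvecs[:, order[1]]   # first two principal directions
lam1, lam2 = eigvals[order[0]], eigvals[order[1]]     # variances along v1 and v2
# v1 and v2 are orthogonal (np.dot(v1, v2) is ~0), and lam1 >= lam2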

SVD

$A_{m\times n} = U_{m\times m} \Sigma_{m\times n} V_{n\times n}^T$

- Principal components are calculated from $V$.
- Truncated SVD is used for dimensionality reduction from $n$ features down to $k$, keeping only the top $k$ singular values (see the sketch below).
- Variance is sensitive to scaling, so standardize features before applying SVD/PCA.
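A minimal sketch of truncated SVD with NumPy, assuming x_train is a samples-by-features array and k is the target dimensionality:

import numpy as np

X = x_train - x_train.mean(axis=0)               # center (and standardize, if scales differ)
U, s, Vt = np.linalg.svd(X, full_matrices=False)
k = 2
x_reduced = X @ Vt[:k].T                         # rows of Vt are the principal components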

from sklearn.decomposition import PCA
PCAinst = PCA(n_components=2)             # create instance, keeping 2 components
x_trans = PCAinst.fit_transform(x_train)  # fit the instance on the data and project it
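After fitting, the variance captured by each component is available through the fitted instance:

print(PCAinst.explained_variance_ratio_)        # fraction of variance per component
print(PCAinst.explained_variance_ratio_.sum())  # total variance retained by the 2 components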

Non-linear

Kernel PCA

Use the kernel trick introduced with SVMs: implicitly map the data into a higher-dimensional feature space where the non-linear structure becomes linear, then apply PCA there.

from sklearn.decomposition import KernelPCA
kPCA = KernelPCA(n_components=2, kernel='rbf', gamma=0.04)  # RBF kernel
x_kpca = kPCA.fit_transform(x_train)

Multi-Dimensional Scaling (MDS)

MDS embeds the data in a low-dimensional space while preserving the pairwise distances between points.

from sklearn.manifold import MDS      # MDS lives in sklearn.manifold
mds = MDS(n_components=2)             # embed into 2 dimensions
x_mds = mds.fit_transform(x_train)

Others: Isomap, t-SNE
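A minimal sketch of both, using scikit-learn's manifold module (the parameter values here are illustrative, not tuned):

from sklearn.manifold import Isomap, TSNE

iso = Isomap(n_components=2, n_neighbors=5)   # geodesic distances on a neighborhood graph
x_iso = iso.fit_transform(x_train)

tsne = TSNE(n_components=2, perplexity=30.0)  # preserves local neighborhood structure
x_tsne = tsne.fit_transform(x_train)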

Non-negative Matrix Factorization

Decomposes a matrix containing only non-negative values, e.g. vectorized word counts or images. Let $V = W \times H$, so that for a term-document matrix $V$, $W$ holds term-to-topic weights and $H$ holds topic-to-document weights. For images, only the shaded (non-zero) values need to be represented, which gives compression.
Because the factors are non-negative, NMF can never undo the application of a latent feature once added, so it is much more careful about what it adds at each step. In some applications, this makes for more human-interpretable latent features.
Thus NMF factors are non-orthogonal.

from sklearn.decomposition import NMF
nmf = NMF(n_components=2, init='random')  # random non-negative initialization
x_nmf = nmf.fit_transform(x)              # x_nmf is W in V = W x H
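Here fit_transform returns $W$, and $H$ is stored on the fitted estimator, so the factorization can be inspected directly:

H = nmf.components_   # H: topic-to-document weights
approx = x_nmf @ H    # reconstruction: x is approximately W @ H, all entries non-negative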

Summary

Example of Dim Reduction
