Principal Components Analysis
PCA is used to reduce high dimensional data set to lower dimensions. PCA
can be used to identify patterns in data, express the data in a lower
dimensional space. That way, similarities and differences can be
highlighted. It is mostly used in face recognition and image compression.
There are several flaws one has to be aware of when working with PCA:
- Linearity assumption - data is assumed to be linear combinations of some
basis. There exist non-linear methods such as kernel PCA that alleviate
that problem.
- Principal components are assumed to be orthogonal. ICA tries to cope with
this limitation.
- Mean and covariance are assumed to be statistically important.
- Large variances are assumed to have important dynamics.
Parallelization strategy
Design of packages