Abstract
Principal components analysis (PCA) is of great use in representation of multi-dimensional data sets, often providing a useful compression mechanism. Sometimes, input data sets are drawn from disparate domains, such that components of the input are heterogeneous, making them difficult to compare in scale. When this occurs, it is possible for one component to dominate another in the PCA at the expense of the information content of the original data. We present an approach to balancing the contributions of different components that is constructive; it generalises to the case of the addition of several variables. Conjectures about improved approaches and more complex data sets are presented. The approach is demonstrated on two current research applications.
Original language | English |
---|---|
Pages (from-to) | 2047-2053 |
Number of pages | 7 |
Journal | Pattern Recognition |
Volume | 31 |
Issue number | 12 |
DOIs | |
Publication status | Published - Dec 1998 |
Externally published | Yes |
Keywords
- Dimension reduction
- Entropy
- Minimum variance
- Principal components
- Scale