Title: A PLS method for seeking canonical correlations in case of perfect multicollinearity
Authors: Michele Gallo - University of Naples Orientale (Italy) [presenting]
Violetta Simonacci - University of Napoli L (Italy)
Abstract: Canonical correlation analysis (CCA) is a useful tool for investigating the relationships between two sets of variables. When the dispersion matrices are invertible, the canonical variates with maximal correlation are generally identified by means of a singular value decomposition. However, when one or both sets of variables are compositional, this classical approach cannot be followed. Compositional data are positive values that carry relative information describing the parts of a whole. Consequently, they have a perfectly multicollinear structure and are characterized by singular dispersion matrices. Because this singularity rules out the standard approach, an alternative way of computing canonical variates is proposed. Data are first transformed into log-ratio coordinates, then the Partial Least Squares (PLS) approach is applied. This method provides a fast and easy way to deal with non-invertible dispersion matrices and, in addition, yields results that are easy to interpret. The proposed methodology is assessed in an experimental study, which also compares alternative PLS algorithms, namely NIPALS, SIMPLS and Kernel.
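The two steps described in the abstract can be illustrated with a minimal sketch; it is not the authors' implementation. It assumes simulated data, an isometric log-ratio (ilr) basis of Helmert type as one possible choice of log-ratio coordinates, and a plain NIPALS-type iteration for the first pair of PLS weight vectors. All function and variable names are hypothetical. Note that this iteration maximizes the covariance between the two score vectors, whereas CCA maximizes their correlation.

```python
import numpy as np

def closure(x):
    """Rescale each row to sum to 1, giving a composition."""
    x = np.asarray(x, dtype=float)
    return x / x.sum(axis=1, keepdims=True)

def clr(x):
    """Centered log-ratio: log parts minus the row mean of the logs."""
    logx = np.log(x)
    return logx - logx.mean(axis=1, keepdims=True)

def ilr_basis(d):
    """Orthonormal Helmert-type basis of the clr hyperplane, shape (d, d-1)."""
    v = np.zeros((d, d - 1))
    for j in range(1, d):
        v[:j, j - 1] = 1.0 / j
        v[j, j - 1] = -1.0
        v[:, j - 1] *= np.sqrt(j / (j + 1.0))
    return v

def ilr(x):
    """Isometric log-ratio coordinates (assumed choice of log-ratio transform)."""
    return clr(x) @ ilr_basis(x.shape[1])

def nipals_first_pair(X, Y, tol=1e-10, max_iter=500):
    """First pair of PLS weights and scores via a NIPALS-type iteration."""
    X = X - X.mean(axis=0)
    Y = Y - Y.mean(axis=0)
    u = Y[:, 0].copy()
    for _ in range(max_iter):
        w = X.T @ u
        w /= np.linalg.norm(w)      # X-side weight vector
        t = X @ w                   # X-side scores
        c = Y.T @ t
        c /= np.linalg.norm(c)      # Y-side weight vector
        u_new = Y @ c               # Y-side scores
        converged = np.linalg.norm(u_new - u) < tol * np.linalg.norm(u_new)
        u = u_new
        if converged:
            break
    return w, c, t, u

# Simulated example (illustrative data only, not from the study).
rng = np.random.default_rng(0)
n = 200
comp = closure(rng.lognormal(size=(n, 4)))        # 4-part compositions
Z = ilr(comp)                                     # 3 log-ratio coordinates
Y = np.column_stack([
    np.log(comp[:, 0] / comp[:, 1]) + 0.1 * rng.normal(size=n),
    rng.normal(size=n),
])

# The clr covariance is singular; the ilr covariance is not.
print(np.linalg.eigvalsh(np.cov(clr(comp), rowvar=False)).min())
print(np.linalg.eigvalsh(np.cov(Z, rowvar=False)).min())

w, c, t, u = nipals_first_pair(Z, Y)
print(np.corrcoef(t, u)[0, 1])                    # correlation of first score pair
```

The first printed eigenvalue shows the singularity that prevents inversion of the dispersion matrix in centered log-ratio form, while the second shows that the ilr coordinates remove it. No matrix inverse is needed at any point in the iteration, which is why a PLS-type procedure remains usable in this setting.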