Simplivariate models: uncovering the underlying biology in functional genomics data

One of the first steps in analyzing high-dimensional functional genomics data is an exploratory analysis of such data. Cluster Analysis and Principal Component Analysis are then usually the method of choice. Despite their versatility they also have a severe drawback: they do not always generate simple and interpretable solutions. On the basis of the observation that functional genomics data often contain both informative and non-informative variation, we propose a method that finds sets of variables containing informative variation. This informative variation is subsequently expressed in easily interpretable simplivariate components.We present a new implementation of the recently introduced simplivariate models. In this implementation, the informative variation is described by multiplicative models that can adequately represent the relations between functional genomics data. Both a simulated and two real-life metabolomics data sets show good performance of the method.

 

Authors: 
E. Saccenti E, J.A. Westerhuis, A.K. Smilde, M.J. van der Werf, J.A. Hageman, M.M.W.B. Hendriks
DOI: 
10.1371/journal.pone.0020747
Pages: 
2011; 6 (6): e20747
Published in: 
Plos One
Date of publication: 
June, 2011
Status of the publication: 
Published/accepted