Comparison of Regularized Regression Methods for ~Omics Data

Animesh Acharjee; Richard Finkers; Richard GF Visser; Chris Maliepaard

doi:10.4172/2153-0769.1000126

Abstract

Background: In this study, we compare methods that can be used to relate a phenotypic trait of interest to an ~omics data set in which the number of variables far exceeds the number of samples.

Methods: We apply univariate regression and different regularized multiple regression methods: ridge regression (RR), LASSO, elastic net (EN), principal components regression (PCR), partial least squares regression (PLS), sparse partial least squares regression (SPLS), support vector regression (SVR) and random forest regression (RF). These regression methods were applied to a data set from a potato mapping population, where we predict potato flesh colour from a metabolomics data set.
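For orientation, the three penalized least-squares methods in this list (RR, LASSO and EN) can be written as one objective with different penalty terms. The notation below is an assumption, not taken from the article: n samples, response vector y, predictor matrix X, coefficient vector β, penalty weight λ ≥ 0 and mixing parameter α in [0, 1]. The elastic net is shown in one common parameterization; others exist.

```latex
\hat{\beta} = \arg\min_{\beta} \; \frac{1}{2n}\,\lVert y - X\beta \rVert_2^2 + \lambda\, P(\beta)

% Penalty terms (assumed notation):
P_{\mathrm{RR}}(\beta)    = \tfrac{1}{2}\,\lVert \beta \rVert_2^2
P_{\mathrm{LASSO}}(\beta) = \lVert \beta \rVert_1
P_{\mathrm{EN}}(\beta)    = \alpha\,\lVert \beta \rVert_1 + \tfrac{1-\alpha}{2}\,\lVert \beta \rVert_2^2
```

In this form the elastic net reduces to LASSO at α = 1 and to ridge regression at α = 0. The L1 term is what sets some coefficients exactly to zero and so performs variable selection.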

Results: We compare the methods in terms of the mean square error of prediction of the trait, goodness of fit of the models, and the selection and ranking of the metabolites. In terms of prediction error, elastic net performed better than the other methods. The methods that allow variable selection selected different numbers of variables, but seven variables were common to LASSO, EN and SPLS. SPLS performed better than EN with respect to the selection of grouped correlated variables.

Conclusions: Four of the seven variables selected by LASSO, EN and SPLS were putatively identified as carotenoid-derived compounds; since the carotenoid pathway is important for the flesh colour of potato, this indicates that meaningful compounds are selected. We developed a web application that can perform all the described methods and that includes a double cross-validation for optimization of the methods and for proper estimation of the prediction error.
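The idea of double (nested) cross-validation can be sketched as follows: an inner loop chooses the tuning parameter using only the training part of each outer fold, and the outer loop estimates the prediction error on samples that played no role in that choice. This is a minimal illustration under stated assumptions, not the authors' web application. It uses ridge regression in its dual form on synthetic data, and all function names, the lambda grid, the fold counts and the data are hypothetical.

```python
import random

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(M[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (M[r][n] - s) / M[r][r]
    return x

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def ridge_fit_predict(X_tr, y_tr, X_te, lam):
    """Dual-form ridge: solve (X X^T + lam I) a = y - mean(y).
    Only an n x n system is solved, which suits data with far more
    variables than samples. Predictors are assumed already centred."""
    n = len(y_tr)
    y_mean = sum(y_tr) / n
    K = [[dot(X_tr[i], X_tr[j]) + (lam if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    a = solve(K, [v - y_mean for v in y_tr])
    return [y_mean + sum(a[i] * dot(X_tr[i], x) for i in range(n))
            for x in X_te]

def folds(indices, k):
    """Split a list of indices into k interleaved folds."""
    return [indices[i::k] for i in range(k)]

def cv_mse(X, y, idx, lam, k):
    """Inner loop: mean squared error of prediction over k folds of idx."""
    errs = []
    for test in folds(idx, k):
        train = [i for i in idx if i not in test]
        pred = ridge_fit_predict([X[i] for i in train], [y[i] for i in train],
                                 [X[i] for i in test], lam)
        errs.append(sum((y[i] - p) ** 2 for i, p in zip(test, pred)) / len(test))
    return sum(errs) / len(errs)

def double_cv(X, y, lambdas, outer_k=5, inner_k=4, seed=0):
    """Outer loop: tune lambda on each outer training set only, then
    score the tuned model on the held-out outer fold."""
    idx = list(range(len(y)))
    random.Random(seed).shuffle(idx)
    sq_errs, chosen = [], []
    for test in folds(idx, outer_k):
        train = [i for i in idx if i not in test]
        best = min(lambdas, key=lambda lam: cv_mse(X, y, train, lam, inner_k))
        pred = ridge_fit_predict([X[i] for i in train], [y[i] for i in train],
                                 [X[i] for i in test], best)
        sq_errs.extend((y[i] - p) ** 2 for i, p in zip(test, pred))
        chosen.append(best)
    return sum(sq_errs) / len(sq_errs), chosen

# Synthetic example (hypothetical): 30 samples, 100 variables,
# of which only the first two influence the response.
rng = random.Random(1)
X = [[rng.gauss(0, 1) for _ in range(100)] for _ in range(30)]
y = [2 * row[0] - row[1] + rng.gauss(0, 0.1) for row in X]
lambdas = [0.1, 1.0, 10.0, 100.0]
mse_est, chosen = double_cv(X, y, lambdas)
print("estimated prediction MSE:", mse_est)
print("lambda chosen per outer fold:", chosen)
```

Because the held-out outer fold is never used to pick lambda, the outer error is not biased downward by the tuning step, which is the sense in which the estimate is "proper". The same two-loop structure applies to any of the tuned methods. Only `ridge_fit_predict` and the parameter grid would change.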

Metabolomics:Open Access
