Article,

LASSO MODELING AS AN ALTERNATIVE TO PCA BASED MULTIVARIATE MODELS TO SYSTEM WITH HEAVY SPARSITY: “BIODIESEL QUALITY BY NIR SPECTROSCOPY”

, , and .
Applied Mathematics and Sciences: An International Journal (MathSJ), 7 (1): 01 - 12 (March 2020)

Abstract

Principal component analysis (PCA) is a widespread and widely used in various areas of science such as bioinformatics, econometrics, and chemometrics among others. Once that PCA is based in the eigenvalues and the eigenvectors which are a very weak approach to high dimension systems with degrees of sparsity and in these situations the PCA is no longer a recommended procedure. Sparsity is very common in near infrared spectroscopy due to the large number of spectra required and the water absorption broad bands what makes these spectra very similar and with heavy sparsity in matrix dataset, demoting the precision and accuracy, in the multivariate modeling and within projections of data matrix in smaller dimensions. To overcoming these shortcomings the LASSO, a not PCA based method, model was applied to a NIR spectra dataset from Biodiesel and its performance was, statistically, compared with traditional multivariate modeling such as PCR and PLSR.

Tags

Users

  • @journalmathsj

Comments and Reviews