Cargando…

Multiple Linear Regressions by Maximizing the Likelihood under Assumption of Generalized Gauss-Laplace Distribution of the Error

Multiple linear regression analysis is widely used to link an outcome with predictors for better understanding of the behaviour of the outcome of interest. Usually, under the assumption that the errors follow a normal distribution, the coefficients of the model are estimated by minimizing the sum of...

Descripción completa

Detalles Bibliográficos
Autores principales: Jäntschi, Lorentz, Bálint, Donatella, Bolboacă, Sorana D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5174750/
https://www.ncbi.nlm.nih.gov/pubmed/28090215
http://dx.doi.org/10.1155/2016/8578156
_version_ 1782484552514535424
author Jäntschi, Lorentz
Bálint, Donatella
Bolboacă, Sorana D.
author_facet Jäntschi, Lorentz
Bálint, Donatella
Bolboacă, Sorana D.
author_sort Jäntschi, Lorentz
collection PubMed
description Multiple linear regression analysis is widely used to link an outcome with predictors for better understanding of the behaviour of the outcome of interest. Usually, under the assumption that the errors follow a normal distribution, the coefficients of the model are estimated by minimizing the sum of squared deviations. A new approach based on maximum likelihood estimation is proposed for finding the coefficients on linear models with two predictors without any constrictive assumptions on the distribution of the errors. The algorithm was developed, implemented, and tested as proof-of-concept using fourteen sets of compounds by investigating the link between activity/property (as outcome) and structural feature information incorporated by molecular descriptors (as predictors). The results on real data demonstrated that in all investigated cases the power of the error is significantly different by the convenient value of two when the Gauss-Laplace distribution was used to relax the constrictive assumption of the normal distribution of the error. Therefore, the Gauss-Laplace distribution of the error could not be rejected while the hypothesis that the power of the error from Gauss-Laplace distribution is normal distributed also failed to be rejected.
format Online
Article
Text
id pubmed-5174750
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-51747502017-01-15 Multiple Linear Regressions by Maximizing the Likelihood under Assumption of Generalized Gauss-Laplace Distribution of the Error Jäntschi, Lorentz Bálint, Donatella Bolboacă, Sorana D. Comput Math Methods Med Research Article Multiple linear regression analysis is widely used to link an outcome with predictors for better understanding of the behaviour of the outcome of interest. Usually, under the assumption that the errors follow a normal distribution, the coefficients of the model are estimated by minimizing the sum of squared deviations. A new approach based on maximum likelihood estimation is proposed for finding the coefficients on linear models with two predictors without any constrictive assumptions on the distribution of the errors. The algorithm was developed, implemented, and tested as proof-of-concept using fourteen sets of compounds by investigating the link between activity/property (as outcome) and structural feature information incorporated by molecular descriptors (as predictors). The results on real data demonstrated that in all investigated cases the power of the error is significantly different by the convenient value of two when the Gauss-Laplace distribution was used to relax the constrictive assumption of the normal distribution of the error. Therefore, the Gauss-Laplace distribution of the error could not be rejected while the hypothesis that the power of the error from Gauss-Laplace distribution is normal distributed also failed to be rejected. Hindawi Publishing Corporation 2016 2016-12-07 /pmc/articles/PMC5174750/ /pubmed/28090215 http://dx.doi.org/10.1155/2016/8578156 Text en Copyright © 2016 Lorentz Jäntschi et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Jäntschi, Lorentz
Bálint, Donatella
Bolboacă, Sorana D.
Multiple Linear Regressions by Maximizing the Likelihood under Assumption of Generalized Gauss-Laplace Distribution of the Error
title Multiple Linear Regressions by Maximizing the Likelihood under Assumption of Generalized Gauss-Laplace Distribution of the Error
title_full Multiple Linear Regressions by Maximizing the Likelihood under Assumption of Generalized Gauss-Laplace Distribution of the Error
title_fullStr Multiple Linear Regressions by Maximizing the Likelihood under Assumption of Generalized Gauss-Laplace Distribution of the Error
title_full_unstemmed Multiple Linear Regressions by Maximizing the Likelihood under Assumption of Generalized Gauss-Laplace Distribution of the Error
title_short Multiple Linear Regressions by Maximizing the Likelihood under Assumption of Generalized Gauss-Laplace Distribution of the Error
title_sort multiple linear regressions by maximizing the likelihood under assumption of generalized gauss-laplace distribution of the error
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5174750/
https://www.ncbi.nlm.nih.gov/pubmed/28090215
http://dx.doi.org/10.1155/2016/8578156
work_keys_str_mv AT jantschilorentz multiplelinearregressionsbymaximizingthelikelihoodunderassumptionofgeneralizedgausslaplacedistributionoftheerror
AT balintdonatella multiplelinearregressionsbymaximizingthelikelihoodunderassumptionofgeneralizedgausslaplacedistributionoftheerror
AT bolboacasoranad multiplelinearregressionsbymaximizingthelikelihoodunderassumptionofgeneralizedgausslaplacedistributionoftheerror