Cargando…

Predicting the rental value of houses in household surveys in Tanzania, Uganda and Malawi: Evaluations of hedonic pricing and machine learning approaches

Housing value is a major component of the aggregate expenditure used in the analyses of welfare status of households in the development economics literature. Therefore, an accurate estimation of housing services is important to obtain the value of housing in household surveys. Data show that a signi...

Descripción completa

Detalles Bibliográficos
Autores principales: Embaye, Weldensie T., Zereyesus, Yacob Abrehe, Chen, Bowen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7877618/
https://www.ncbi.nlm.nih.gov/pubmed/33571198
http://dx.doi.org/10.1371/journal.pone.0244953
_version_ 1783650205662445568
author Embaye, Weldensie T.
Zereyesus, Yacob Abrehe
Chen, Bowen
author_facet Embaye, Weldensie T.
Zereyesus, Yacob Abrehe
Chen, Bowen
author_sort Embaye, Weldensie T.
collection PubMed
description Housing value is a major component of the aggregate expenditure used in the analyses of welfare status of households in the development economics literature. Therefore, an accurate estimation of housing services is important to obtain the value of housing in household surveys. Data show that a significant proportion of households in a typical Living Standard Measurement Survey (LSMS), adopted by the Word Bank and others, are self-owned. The standard approach to predict the housing value for such surveys is based on the rental cost of the house. A hedonic pricing applying an Ordinary Least Squares (OLS) method is normally used to predict rental values. The literature shows that Machine Learning (ML) methods, shown to uncover generalizable patterns based on a given data, have better predictive power over OLS applied in other valuation exercises. We examined whether or not a class of ML methods (e.g. Ridge, LASSO, Tree, Bagging, Random Forest, and Boosting) provided superior prediction of rental value of housing over OLS methods accounting for spatial autocorrelations using household level survey data from Uganda, Tanzania, and Malawi, across multiple years. Our results showed that the Machine Learning methods (Boosting, Bagging, Forest, Ridge and LASSO) are the best models in predicting house values using out-of-sample data set for all the countries and all the years. On the other hand, Tree regression underperformed relative to the various OLS models, over the same data sets. With the availability of abundant data and better computing power, ML methods provide viable alternative to predicting housing values in household surveys.
format Online
Article
Text
id pubmed-7877618
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-78776182021-02-19 Predicting the rental value of houses in household surveys in Tanzania, Uganda and Malawi: Evaluations of hedonic pricing and machine learning approaches Embaye, Weldensie T. Zereyesus, Yacob Abrehe Chen, Bowen PLoS One Research Article Housing value is a major component of the aggregate expenditure used in the analyses of welfare status of households in the development economics literature. Therefore, an accurate estimation of housing services is important to obtain the value of housing in household surveys. Data show that a significant proportion of households in a typical Living Standard Measurement Survey (LSMS), adopted by the Word Bank and others, are self-owned. The standard approach to predict the housing value for such surveys is based on the rental cost of the house. A hedonic pricing applying an Ordinary Least Squares (OLS) method is normally used to predict rental values. The literature shows that Machine Learning (ML) methods, shown to uncover generalizable patterns based on a given data, have better predictive power over OLS applied in other valuation exercises. We examined whether or not a class of ML methods (e.g. Ridge, LASSO, Tree, Bagging, Random Forest, and Boosting) provided superior prediction of rental value of housing over OLS methods accounting for spatial autocorrelations using household level survey data from Uganda, Tanzania, and Malawi, across multiple years. Our results showed that the Machine Learning methods (Boosting, Bagging, Forest, Ridge and LASSO) are the best models in predicting house values using out-of-sample data set for all the countries and all the years. On the other hand, Tree regression underperformed relative to the various OLS models, over the same data sets. With the availability of abundant data and better computing power, ML methods provide viable alternative to predicting housing values in household surveys. Public Library of Science 2021-02-11 /pmc/articles/PMC7877618/ /pubmed/33571198 http://dx.doi.org/10.1371/journal.pone.0244953 Text en © 2021 Embaye et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Embaye, Weldensie T.
Zereyesus, Yacob Abrehe
Chen, Bowen
Predicting the rental value of houses in household surveys in Tanzania, Uganda and Malawi: Evaluations of hedonic pricing and machine learning approaches
title Predicting the rental value of houses in household surveys in Tanzania, Uganda and Malawi: Evaluations of hedonic pricing and machine learning approaches
title_full Predicting the rental value of houses in household surveys in Tanzania, Uganda and Malawi: Evaluations of hedonic pricing and machine learning approaches
title_fullStr Predicting the rental value of houses in household surveys in Tanzania, Uganda and Malawi: Evaluations of hedonic pricing and machine learning approaches
title_full_unstemmed Predicting the rental value of houses in household surveys in Tanzania, Uganda and Malawi: Evaluations of hedonic pricing and machine learning approaches
title_short Predicting the rental value of houses in household surveys in Tanzania, Uganda and Malawi: Evaluations of hedonic pricing and machine learning approaches
title_sort predicting the rental value of houses in household surveys in tanzania, uganda and malawi: evaluations of hedonic pricing and machine learning approaches
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7877618/
https://www.ncbi.nlm.nih.gov/pubmed/33571198
http://dx.doi.org/10.1371/journal.pone.0244953
work_keys_str_mv AT embayeweldensiet predictingtherentalvalueofhousesinhouseholdsurveysintanzaniaugandaandmalawievaluationsofhedonicpricingandmachinelearningapproaches
AT zereyesusyacobabrehe predictingtherentalvalueofhousesinhouseholdsurveysintanzaniaugandaandmalawievaluationsofhedonicpricingandmachinelearningapproaches
AT chenbowen predictingtherentalvalueofhousesinhouseholdsurveysintanzaniaugandaandmalawievaluationsofhedonicpricingandmachinelearningapproaches