Cargando…

Case study data for joint modeling of insurance claims and lapsation

The dataset tracks 40,284 insurance clients over five years, between 2010 and 2015, who subscribed to both automobile and homeowners insurance. We have combined information on these customers. First, the characteristics including age, gender or driving experience, among others and dates of renewal f...

Descripción completa

Detalles Bibliográficos
Autores principales: Guillen, Montserrat, Bolancé, Catalina, Frees, Edward W., Valdez, Emiliano A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8640253/
https://www.ncbi.nlm.nih.gov/pubmed/34901350
http://dx.doi.org/10.1016/j.dib.2021.107639
_version_ 1784609304105451520
author Guillen, Montserrat
Bolancé, Catalina
Frees, Edward W.
Valdez, Emiliano A.
author_facet Guillen, Montserrat
Bolancé, Catalina
Frees, Edward W.
Valdez, Emiliano A.
author_sort Guillen, Montserrat
collection PubMed
description The dataset tracks 40,284 insurance clients over five years, between 2010 and 2015, who subscribed to both automobile and homeowners insurance. We have combined information on these customers. First, the characteristics including age, gender or driving experience, among others and dates of renewal for the two types of policies considered here. Note that we have only considered clients corresponding to persons and not commercial firms that can also underwrite home and motor insurance policies. Second, the policy data file for motor vehicle insurance consists of all vehicle insurance coverage including power, driving area or whether there is a second driver that drives the car occasionally. Third, the policy data file for homeowners insurance has information on the property such as value of the building (essentially the value of the home without any furniture, apparel and personal items), location and type of dwelling. Besides these three sources, we have access to data containing information on the number of claims and total cost of those claims per year and per policy type. So, for all policies that are in force, we finally have up to a five year record of the yearly cost of claims in the motor insurance and in the home coverage. If the customer does not renew one of those two policies or both, we do not have more information after this lapse occurs. After summarizing the data, we provide the usual marginal analysis, where we fit regression models using Tweedie distributions for claims and a logistic model for lapse. Data can be used for joint analysis of insurance policyholders with more than one product.
format Online
Article
Text
id pubmed-8640253
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-86402532021-12-09 Case study data for joint modeling of insurance claims and lapsation Guillen, Montserrat Bolancé, Catalina Frees, Edward W. Valdez, Emiliano A. Data Brief Data Article The dataset tracks 40,284 insurance clients over five years, between 2010 and 2015, who subscribed to both automobile and homeowners insurance. We have combined information on these customers. First, the characteristics including age, gender or driving experience, among others and dates of renewal for the two types of policies considered here. Note that we have only considered clients corresponding to persons and not commercial firms that can also underwrite home and motor insurance policies. Second, the policy data file for motor vehicle insurance consists of all vehicle insurance coverage including power, driving area or whether there is a second driver that drives the car occasionally. Third, the policy data file for homeowners insurance has information on the property such as value of the building (essentially the value of the home without any furniture, apparel and personal items), location and type of dwelling. Besides these three sources, we have access to data containing information on the number of claims and total cost of those claims per year and per policy type. So, for all policies that are in force, we finally have up to a five year record of the yearly cost of claims in the motor insurance and in the home coverage. If the customer does not renew one of those two policies or both, we do not have more information after this lapse occurs. After summarizing the data, we provide the usual marginal analysis, where we fit regression models using Tweedie distributions for claims and a logistic model for lapse. Data can be used for joint analysis of insurance policyholders with more than one product. Elsevier 2021-11-26 /pmc/articles/PMC8640253/ /pubmed/34901350 http://dx.doi.org/10.1016/j.dib.2021.107639 Text en © 2021 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Data Article
Guillen, Montserrat
Bolancé, Catalina
Frees, Edward W.
Valdez, Emiliano A.
Case study data for joint modeling of insurance claims and lapsation
title Case study data for joint modeling of insurance claims and lapsation
title_full Case study data for joint modeling of insurance claims and lapsation
title_fullStr Case study data for joint modeling of insurance claims and lapsation
title_full_unstemmed Case study data for joint modeling of insurance claims and lapsation
title_short Case study data for joint modeling of insurance claims and lapsation
title_sort case study data for joint modeling of insurance claims and lapsation
topic Data Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8640253/
https://www.ncbi.nlm.nih.gov/pubmed/34901350
http://dx.doi.org/10.1016/j.dib.2021.107639
work_keys_str_mv AT guillenmontserrat casestudydataforjointmodelingofinsuranceclaimsandlapsation
AT bolancecatalina casestudydataforjointmodelingofinsuranceclaimsandlapsation
AT freesedwardw casestudydataforjointmodelingofinsuranceclaimsandlapsation
AT valdezemilianoa casestudydataforjointmodelingofinsuranceclaimsandlapsation