Cargando…

A machine learning approach using partitioning around medoids clustering and random forest classification to model groups of farms in regard to production parameters and bulk tank milk antibody status of two major internal parasites in dairy cows

Fasciola hepatica and Ostertagia ostertagi are internal parasites of cattle compromising physiology, productivity, and well-being. Parasites are complex in their effect on hosts, sometimes making it difficult to identify clear directions of associations between infection and production parameters. T...

Descripción completa

Detalles Bibliográficos
Autores principales: Oehm, Andreas W., Springer, Andrea, Jordan, Daniela, Strube, Christina, Knubben-Schweizer, Gabriela, Jensen, Katharina Charlotte, Zablotski, Yury
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9273072/
https://www.ncbi.nlm.nih.gov/pubmed/35816512
http://dx.doi.org/10.1371/journal.pone.0271413
_version_ 1784744995892232192
author Oehm, Andreas W.
Springer, Andrea
Jordan, Daniela
Strube, Christina
Knubben-Schweizer, Gabriela
Jensen, Katharina Charlotte
Zablotski, Yury
author_facet Oehm, Andreas W.
Springer, Andrea
Jordan, Daniela
Strube, Christina
Knubben-Schweizer, Gabriela
Jensen, Katharina Charlotte
Zablotski, Yury
author_sort Oehm, Andreas W.
collection PubMed
description Fasciola hepatica and Ostertagia ostertagi are internal parasites of cattle compromising physiology, productivity, and well-being. Parasites are complex in their effect on hosts, sometimes making it difficult to identify clear directions of associations between infection and production parameters. Therefore, unsupervised approaches not assuming a structure reduce the risk of introducing bias to the analysis. They may provide insights which cannot be obtained with conventional, supervised methodology. An unsupervised, exploratory cluster analysis approach using the k–mode algorithm and partitioning around medoids detected two distinct clusters in a cross-sectional data set of milk yield, milk fat content, milk protein content as well as F. hepatica or O. ostertagi bulk tank milk antibody status from 606 dairy farms in three structurally different dairying regions in Germany. Parasite–positive farms grouped together with their respective production parameters to form separate clusters. A random forests algorithm characterised clusters with regard to external variables. Across all study regions, co–infections with F. hepatica or O. ostertagi, respectively, farming type, and pasture access appeared to be the most important factors discriminating clusters (i.e. farms). Furthermore, farm level lameness prevalence, herd size, BCS, stage of lactation, and somatic cell count were relevant criteria distinguishing clusters. This study is among the first to apply a cluster analysis approach in this context and potentially the first to implement a k–medoids algorithm and partitioning around medoids in the veterinary field. The results demonstrated that biologically relevant patterns of parasite status and milk parameters exist between farms positive for F. hepatica or O. ostertagi, respectively, and negative farms. Moreover, the machine learning approach confirmed results of previous work and shed further light on the complex setting of associations a between parasitic diseases, milk yield and milk constituents, and management practices.
format Online
Article
Text
id pubmed-9273072
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-92730722022-07-12 A machine learning approach using partitioning around medoids clustering and random forest classification to model groups of farms in regard to production parameters and bulk tank milk antibody status of two major internal parasites in dairy cows Oehm, Andreas W. Springer, Andrea Jordan, Daniela Strube, Christina Knubben-Schweizer, Gabriela Jensen, Katharina Charlotte Zablotski, Yury PLoS One Research Article Fasciola hepatica and Ostertagia ostertagi are internal parasites of cattle compromising physiology, productivity, and well-being. Parasites are complex in their effect on hosts, sometimes making it difficult to identify clear directions of associations between infection and production parameters. Therefore, unsupervised approaches not assuming a structure reduce the risk of introducing bias to the analysis. They may provide insights which cannot be obtained with conventional, supervised methodology. An unsupervised, exploratory cluster analysis approach using the k–mode algorithm and partitioning around medoids detected two distinct clusters in a cross-sectional data set of milk yield, milk fat content, milk protein content as well as F. hepatica or O. ostertagi bulk tank milk antibody status from 606 dairy farms in three structurally different dairying regions in Germany. Parasite–positive farms grouped together with their respective production parameters to form separate clusters. A random forests algorithm characterised clusters with regard to external variables. Across all study regions, co–infections with F. hepatica or O. ostertagi, respectively, farming type, and pasture access appeared to be the most important factors discriminating clusters (i.e. farms). Furthermore, farm level lameness prevalence, herd size, BCS, stage of lactation, and somatic cell count were relevant criteria distinguishing clusters. This study is among the first to apply a cluster analysis approach in this context and potentially the first to implement a k–medoids algorithm and partitioning around medoids in the veterinary field. The results demonstrated that biologically relevant patterns of parasite status and milk parameters exist between farms positive for F. hepatica or O. ostertagi, respectively, and negative farms. Moreover, the machine learning approach confirmed results of previous work and shed further light on the complex setting of associations a between parasitic diseases, milk yield and milk constituents, and management practices. Public Library of Science 2022-07-11 /pmc/articles/PMC9273072/ /pubmed/35816512 http://dx.doi.org/10.1371/journal.pone.0271413 Text en © 2022 Oehm et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Oehm, Andreas W.
Springer, Andrea
Jordan, Daniela
Strube, Christina
Knubben-Schweizer, Gabriela
Jensen, Katharina Charlotte
Zablotski, Yury
A machine learning approach using partitioning around medoids clustering and random forest classification to model groups of farms in regard to production parameters and bulk tank milk antibody status of two major internal parasites in dairy cows
title A machine learning approach using partitioning around medoids clustering and random forest classification to model groups of farms in regard to production parameters and bulk tank milk antibody status of two major internal parasites in dairy cows
title_full A machine learning approach using partitioning around medoids clustering and random forest classification to model groups of farms in regard to production parameters and bulk tank milk antibody status of two major internal parasites in dairy cows
title_fullStr A machine learning approach using partitioning around medoids clustering and random forest classification to model groups of farms in regard to production parameters and bulk tank milk antibody status of two major internal parasites in dairy cows
title_full_unstemmed A machine learning approach using partitioning around medoids clustering and random forest classification to model groups of farms in regard to production parameters and bulk tank milk antibody status of two major internal parasites in dairy cows
title_short A machine learning approach using partitioning around medoids clustering and random forest classification to model groups of farms in regard to production parameters and bulk tank milk antibody status of two major internal parasites in dairy cows
title_sort machine learning approach using partitioning around medoids clustering and random forest classification to model groups of farms in regard to production parameters and bulk tank milk antibody status of two major internal parasites in dairy cows
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9273072/
https://www.ncbi.nlm.nih.gov/pubmed/35816512
http://dx.doi.org/10.1371/journal.pone.0271413
work_keys_str_mv AT oehmandreasw amachinelearningapproachusingpartitioningaroundmedoidsclusteringandrandomforestclassificationtomodelgroupsoffarmsinregardtoproductionparametersandbulktankmilkantibodystatusoftwomajorinternalparasitesindairycows
AT springerandrea amachinelearningapproachusingpartitioningaroundmedoidsclusteringandrandomforestclassificationtomodelgroupsoffarmsinregardtoproductionparametersandbulktankmilkantibodystatusoftwomajorinternalparasitesindairycows
AT jordandaniela amachinelearningapproachusingpartitioningaroundmedoidsclusteringandrandomforestclassificationtomodelgroupsoffarmsinregardtoproductionparametersandbulktankmilkantibodystatusoftwomajorinternalparasitesindairycows
AT strubechristina amachinelearningapproachusingpartitioningaroundmedoidsclusteringandrandomforestclassificationtomodelgroupsoffarmsinregardtoproductionparametersandbulktankmilkantibodystatusoftwomajorinternalparasitesindairycows
AT knubbenschweizergabriela amachinelearningapproachusingpartitioningaroundmedoidsclusteringandrandomforestclassificationtomodelgroupsoffarmsinregardtoproductionparametersandbulktankmilkantibodystatusoftwomajorinternalparasitesindairycows
AT jensenkatharinacharlotte amachinelearningapproachusingpartitioningaroundmedoidsclusteringandrandomforestclassificationtomodelgroupsoffarmsinregardtoproductionparametersandbulktankmilkantibodystatusoftwomajorinternalparasitesindairycows
AT zablotskiyury amachinelearningapproachusingpartitioningaroundmedoidsclusteringandrandomforestclassificationtomodelgroupsoffarmsinregardtoproductionparametersandbulktankmilkantibodystatusoftwomajorinternalparasitesindairycows
AT oehmandreasw machinelearningapproachusingpartitioningaroundmedoidsclusteringandrandomforestclassificationtomodelgroupsoffarmsinregardtoproductionparametersandbulktankmilkantibodystatusoftwomajorinternalparasitesindairycows
AT springerandrea machinelearningapproachusingpartitioningaroundmedoidsclusteringandrandomforestclassificationtomodelgroupsoffarmsinregardtoproductionparametersandbulktankmilkantibodystatusoftwomajorinternalparasitesindairycows
AT jordandaniela machinelearningapproachusingpartitioningaroundmedoidsclusteringandrandomforestclassificationtomodelgroupsoffarmsinregardtoproductionparametersandbulktankmilkantibodystatusoftwomajorinternalparasitesindairycows
AT strubechristina machinelearningapproachusingpartitioningaroundmedoidsclusteringandrandomforestclassificationtomodelgroupsoffarmsinregardtoproductionparametersandbulktankmilkantibodystatusoftwomajorinternalparasitesindairycows
AT knubbenschweizergabriela machinelearningapproachusingpartitioningaroundmedoidsclusteringandrandomforestclassificationtomodelgroupsoffarmsinregardtoproductionparametersandbulktankmilkantibodystatusoftwomajorinternalparasitesindairycows
AT jensenkatharinacharlotte machinelearningapproachusingpartitioningaroundmedoidsclusteringandrandomforestclassificationtomodelgroupsoffarmsinregardtoproductionparametersandbulktankmilkantibodystatusoftwomajorinternalparasitesindairycows
AT zablotskiyury machinelearningapproachusingpartitioningaroundmedoidsclusteringandrandomforestclassificationtomodelgroupsoffarmsinregardtoproductionparametersandbulktankmilkantibodystatusoftwomajorinternalparasitesindairycows