Cargando…

Worldwide co-occurrence analysis of 17 species of the genus Brachypodium using data mining

The co-occurrence of plant species is a fundamental aspect of plant ecology that contributes to understanding ecological processes, including the establishment of ecological communities and its applications in biological conservation. A priori algorithms can be used to measure the co-occurrence of s...

Descripción completa

Detalles Bibliográficos
Autores principales: Orozco-Arias, Simon, Núñez-Rincón, Ana María, Tabares-Soto, Reinel, López-Álvarez, Diana
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6336012/
https://www.ncbi.nlm.nih.gov/pubmed/30656067
http://dx.doi.org/10.7717/peerj.6193
_version_ 1783387998183751680
author Orozco-Arias, Simon
Núñez-Rincón, Ana María
Tabares-Soto, Reinel
López-Álvarez, Diana
author_facet Orozco-Arias, Simon
Núñez-Rincón, Ana María
Tabares-Soto, Reinel
López-Álvarez, Diana
author_sort Orozco-Arias, Simon
collection PubMed
description The co-occurrence of plant species is a fundamental aspect of plant ecology that contributes to understanding ecological processes, including the establishment of ecological communities and its applications in biological conservation. A priori algorithms can be used to measure the co-occurrence of species in a spatial distribution given by coordinates. We used 17 species of the genus Brachypodium, downloaded from the Global Biodiversity Information Facility data repository or obtained from bibliographical sources, to test an algorithm with the spatial points process technique used by Silva et al. (2016), generating association rules for co-occurrence analysis. Brachypodium spp. has emerged as an effective model for monocot species, growing in different environments, latitudes, and elevations; thereby, representing a wide range of biotic and abiotic conditions that may be associated with adaptive natural genetic variation. We created seven datasets of two, three, four, six, seven, 15, and 17 species in order to test the algorithm with four different distances (1, 5, 10, and 20 km). Several measurements (support, confidence, lift, Chi-square, and p-value) were used to evaluate the quality of the results generated by the algorithm. No negative association rules were created in the datasets, while 95 positive co-occurrences rules were found for datasets with six, seven, 15, and 17 species. Using 20 km in the dataset with 17 species, we found 16 positive co-occurrences involving five species, suggesting that these species are coexisting. These findings are corroborated by the results obtained in the dataset with 15 species, where two species with broad range distributions present in the previous dataset are eliminated, obtaining seven positive co-occurrences. We found that B. sylvaticum has co-occurrence relations with several species, such as B. pinnatum, B. rupestre, B. retusum, and B. phoenicoides, due to its wide distribution in Europe, Asia, and north of Africa. We demonstrate the utility of the algorithm implemented for the analysis of co-occurrence of 17 species of the genus Brachypodium, agreeing with distributions existing in nature. Data mining has been applied in the field of biological sciences, where a great amount of complex and noisy data of unseen proportion has been generated in recent years. Particularly, ecological data analysis represents an opportunity to explore and comprehend biological systems with data mining and bioinformatics tools.
format Online
Article
Text
id pubmed-6336012
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-63360122019-01-17 Worldwide co-occurrence analysis of 17 species of the genus Brachypodium using data mining Orozco-Arias, Simon Núñez-Rincón, Ana María Tabares-Soto, Reinel López-Álvarez, Diana PeerJ Bioinformatics The co-occurrence of plant species is a fundamental aspect of plant ecology that contributes to understanding ecological processes, including the establishment of ecological communities and its applications in biological conservation. A priori algorithms can be used to measure the co-occurrence of species in a spatial distribution given by coordinates. We used 17 species of the genus Brachypodium, downloaded from the Global Biodiversity Information Facility data repository or obtained from bibliographical sources, to test an algorithm with the spatial points process technique used by Silva et al. (2016), generating association rules for co-occurrence analysis. Brachypodium spp. has emerged as an effective model for monocot species, growing in different environments, latitudes, and elevations; thereby, representing a wide range of biotic and abiotic conditions that may be associated with adaptive natural genetic variation. We created seven datasets of two, three, four, six, seven, 15, and 17 species in order to test the algorithm with four different distances (1, 5, 10, and 20 km). Several measurements (support, confidence, lift, Chi-square, and p-value) were used to evaluate the quality of the results generated by the algorithm. No negative association rules were created in the datasets, while 95 positive co-occurrences rules were found for datasets with six, seven, 15, and 17 species. Using 20 km in the dataset with 17 species, we found 16 positive co-occurrences involving five species, suggesting that these species are coexisting. These findings are corroborated by the results obtained in the dataset with 15 species, where two species with broad range distributions present in the previous dataset are eliminated, obtaining seven positive co-occurrences. We found that B. sylvaticum has co-occurrence relations with several species, such as B. pinnatum, B. rupestre, B. retusum, and B. phoenicoides, due to its wide distribution in Europe, Asia, and north of Africa. We demonstrate the utility of the algorithm implemented for the analysis of co-occurrence of 17 species of the genus Brachypodium, agreeing with distributions existing in nature. Data mining has been applied in the field of biological sciences, where a great amount of complex and noisy data of unseen proportion has been generated in recent years. Particularly, ecological data analysis represents an opportunity to explore and comprehend biological systems with data mining and bioinformatics tools. PeerJ Inc. 2019-01-14 /pmc/articles/PMC6336012/ /pubmed/30656067 http://dx.doi.org/10.7717/peerj.6193 Text en © 2019 Orozco-Arias et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Orozco-Arias, Simon
Núñez-Rincón, Ana María
Tabares-Soto, Reinel
López-Álvarez, Diana
Worldwide co-occurrence analysis of 17 species of the genus Brachypodium using data mining
title Worldwide co-occurrence analysis of 17 species of the genus Brachypodium using data mining
title_full Worldwide co-occurrence analysis of 17 species of the genus Brachypodium using data mining
title_fullStr Worldwide co-occurrence analysis of 17 species of the genus Brachypodium using data mining
title_full_unstemmed Worldwide co-occurrence analysis of 17 species of the genus Brachypodium using data mining
title_short Worldwide co-occurrence analysis of 17 species of the genus Brachypodium using data mining
title_sort worldwide co-occurrence analysis of 17 species of the genus brachypodium using data mining
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6336012/
https://www.ncbi.nlm.nih.gov/pubmed/30656067
http://dx.doi.org/10.7717/peerj.6193
work_keys_str_mv AT orozcoariassimon worldwidecooccurrenceanalysisof17speciesofthegenusbrachypodiumusingdatamining
AT nunezrinconanamaria worldwidecooccurrenceanalysisof17speciesofthegenusbrachypodiumusingdatamining
AT tabaressotoreinel worldwidecooccurrenceanalysisof17speciesofthegenusbrachypodiumusingdatamining
AT lopezalvarezdiana worldwidecooccurrenceanalysisof17speciesofthegenusbrachypodiumusingdatamining