Cargando…
Worldwide co-occurrence analysis of 17 species of the genus Brachypodium using data mining
The co-occurrence of plant species is a fundamental aspect of plant ecology that contributes to understanding ecological processes, including the establishment of ecological communities and its applications in biological conservation. A priori algorithms can be used to measure the co-occurrence of s...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6336012/ https://www.ncbi.nlm.nih.gov/pubmed/30656067 http://dx.doi.org/10.7717/peerj.6193 |
_version_ | 1783387998183751680 |
---|---|
author | Orozco-Arias, Simon Núñez-Rincón, Ana María Tabares-Soto, Reinel López-Álvarez, Diana |
author_facet | Orozco-Arias, Simon Núñez-Rincón, Ana María Tabares-Soto, Reinel López-Álvarez, Diana |
author_sort | Orozco-Arias, Simon |
collection | PubMed |
description | The co-occurrence of plant species is a fundamental aspect of plant ecology that contributes to understanding ecological processes, including the establishment of ecological communities and its applications in biological conservation. A priori algorithms can be used to measure the co-occurrence of species in a spatial distribution given by coordinates. We used 17 species of the genus Brachypodium, downloaded from the Global Biodiversity Information Facility data repository or obtained from bibliographical sources, to test an algorithm with the spatial points process technique used by Silva et al. (2016), generating association rules for co-occurrence analysis. Brachypodium spp. has emerged as an effective model for monocot species, growing in different environments, latitudes, and elevations; thereby, representing a wide range of biotic and abiotic conditions that may be associated with adaptive natural genetic variation. We created seven datasets of two, three, four, six, seven, 15, and 17 species in order to test the algorithm with four different distances (1, 5, 10, and 20 km). Several measurements (support, confidence, lift, Chi-square, and p-value) were used to evaluate the quality of the results generated by the algorithm. No negative association rules were created in the datasets, while 95 positive co-occurrences rules were found for datasets with six, seven, 15, and 17 species. Using 20 km in the dataset with 17 species, we found 16 positive co-occurrences involving five species, suggesting that these species are coexisting. These findings are corroborated by the results obtained in the dataset with 15 species, where two species with broad range distributions present in the previous dataset are eliminated, obtaining seven positive co-occurrences. We found that B. sylvaticum has co-occurrence relations with several species, such as B. pinnatum, B. rupestre, B. retusum, and B. phoenicoides, due to its wide distribution in Europe, Asia, and north of Africa. We demonstrate the utility of the algorithm implemented for the analysis of co-occurrence of 17 species of the genus Brachypodium, agreeing with distributions existing in nature. Data mining has been applied in the field of biological sciences, where a great amount of complex and noisy data of unseen proportion has been generated in recent years. Particularly, ecological data analysis represents an opportunity to explore and comprehend biological systems with data mining and bioinformatics tools. |
format | Online Article Text |
id | pubmed-6336012 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-63360122019-01-17 Worldwide co-occurrence analysis of 17 species of the genus Brachypodium using data mining Orozco-Arias, Simon Núñez-Rincón, Ana María Tabares-Soto, Reinel López-Álvarez, Diana PeerJ Bioinformatics The co-occurrence of plant species is a fundamental aspect of plant ecology that contributes to understanding ecological processes, including the establishment of ecological communities and its applications in biological conservation. A priori algorithms can be used to measure the co-occurrence of species in a spatial distribution given by coordinates. We used 17 species of the genus Brachypodium, downloaded from the Global Biodiversity Information Facility data repository or obtained from bibliographical sources, to test an algorithm with the spatial points process technique used by Silva et al. (2016), generating association rules for co-occurrence analysis. Brachypodium spp. has emerged as an effective model for monocot species, growing in different environments, latitudes, and elevations; thereby, representing a wide range of biotic and abiotic conditions that may be associated with adaptive natural genetic variation. We created seven datasets of two, three, four, six, seven, 15, and 17 species in order to test the algorithm with four different distances (1, 5, 10, and 20 km). Several measurements (support, confidence, lift, Chi-square, and p-value) were used to evaluate the quality of the results generated by the algorithm. No negative association rules were created in the datasets, while 95 positive co-occurrences rules were found for datasets with six, seven, 15, and 17 species. Using 20 km in the dataset with 17 species, we found 16 positive co-occurrences involving five species, suggesting that these species are coexisting. These findings are corroborated by the results obtained in the dataset with 15 species, where two species with broad range distributions present in the previous dataset are eliminated, obtaining seven positive co-occurrences. We found that B. sylvaticum has co-occurrence relations with several species, such as B. pinnatum, B. rupestre, B. retusum, and B. phoenicoides, due to its wide distribution in Europe, Asia, and north of Africa. We demonstrate the utility of the algorithm implemented for the analysis of co-occurrence of 17 species of the genus Brachypodium, agreeing with distributions existing in nature. Data mining has been applied in the field of biological sciences, where a great amount of complex and noisy data of unseen proportion has been generated in recent years. Particularly, ecological data analysis represents an opportunity to explore and comprehend biological systems with data mining and bioinformatics tools. PeerJ Inc. 2019-01-14 /pmc/articles/PMC6336012/ /pubmed/30656067 http://dx.doi.org/10.7717/peerj.6193 Text en © 2019 Orozco-Arias et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Bioinformatics Orozco-Arias, Simon Núñez-Rincón, Ana María Tabares-Soto, Reinel López-Álvarez, Diana Worldwide co-occurrence analysis of 17 species of the genus Brachypodium using data mining |
title | Worldwide co-occurrence analysis of 17 species of the genus Brachypodium using data mining |
title_full | Worldwide co-occurrence analysis of 17 species of the genus Brachypodium using data mining |
title_fullStr | Worldwide co-occurrence analysis of 17 species of the genus Brachypodium using data mining |
title_full_unstemmed | Worldwide co-occurrence analysis of 17 species of the genus Brachypodium using data mining |
title_short | Worldwide co-occurrence analysis of 17 species of the genus Brachypodium using data mining |
title_sort | worldwide co-occurrence analysis of 17 species of the genus brachypodium using data mining |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6336012/ https://www.ncbi.nlm.nih.gov/pubmed/30656067 http://dx.doi.org/10.7717/peerj.6193 |
work_keys_str_mv | AT orozcoariassimon worldwidecooccurrenceanalysisof17speciesofthegenusbrachypodiumusingdatamining AT nunezrinconanamaria worldwidecooccurrenceanalysisof17speciesofthegenusbrachypodiumusingdatamining AT tabaressotoreinel worldwidecooccurrenceanalysisof17speciesofthegenusbrachypodiumusingdatamining AT lopezalvarezdiana worldwidecooccurrenceanalysisof17speciesofthegenusbrachypodiumusingdatamining |