Cargando…

Coalbed Methane Production Model Based on Random Forests Optimized by a Genetic Algorithm

[Image: see text] It is of great significance to evaluate and predict coalbed methane (CBM) production for the exploitation and exploration of CBM. The flow characteristics of gas and water are very complicated and important in the process of CBM exploitation. In recent years, machine learning has b...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Jie, Zhao, Yuhan, Hu, Qiujia, Zhang, Yang, Shao, Tangsha, Fan, Bin, Jiang, Yaodong, Chen, Zhen, Zhao, Meng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2022
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9025998/
https://www.ncbi.nlm.nih.gov/pubmed/35474819
http://dx.doi.org/10.1021/acsomega.2c00519
_version_ 1784691015270006784
author Zhu, Jie
Zhao, Yuhan
Hu, Qiujia
Zhang, Yang
Shao, Tangsha
Fan, Bin
Jiang, Yaodong
Chen, Zhen
Zhao, Meng
author_facet Zhu, Jie
Zhao, Yuhan
Hu, Qiujia
Zhang, Yang
Shao, Tangsha
Fan, Bin
Jiang, Yaodong
Chen, Zhen
Zhao, Meng
author_sort Zhu, Jie
collection PubMed
description [Image: see text] It is of great significance to evaluate and predict coalbed methane (CBM) production for the exploitation and exploration of CBM. The flow characteristics of gas and water are very complicated and important in the process of CBM exploitation. In recent years, machine learning has been introduced to analyze CBM well production and its influence based on the historical production data. However, there are some problems with the determination of hyperparameters in machine learning algorithms. Some previous random forests (RF) models of CBM production prediction were suitable for individual CBM wells, but for different types of CBM wells, a large amount of time is needed to adjust the hyperparameters. Therefore, a genetic algorithm (GA) was applied to optimize RF, and a hybrid GA–RF algorithm was presented to solve this problem, which can automatically adjust two important hyperparameters, n(tree) and m(try), and adapt different types of CBM wells. Meanwhile, the Pearson method and RF were carried out in this work to analyze the data of CBM well production to avoid multicollinearity caused by the improper selection of the model’s independent variables. The importance and correlation analysis of drainage control parameters, including casing pressure (P(c)), bottom-hole pressure (P(b)), stroke frequency (f(s)), liquid column depth (D(L)), daily decline of bottom-hole pressure (P(bd)), and daily decline of casing pressure (P(cd)) were obtained. It was found that the casing pressure, bottom-hole pressure, and stroke frequency had more effects on the gas production of CBM wells than other drainage control parameters. Furthermore, the correlation and importance order of the influencing factors were: P(c) > P(b) > f(s) > P(bd) > P(cd) > D(L) and P(c) > P(b) > f(s) > D(L) > P(bd) > P(cd), respectively. A CBM production model based on the GA–RF algorithm was constructed to study and predict the gas production of CBM wells in Qinshui Basin, China. Compared with the production model based on RF, this model can automatically optimize its hyperparameters to adapt to different types of CBM wells, and the mean-square-error of the GA–RF algorithm can be reduced by 40–60% than that of RF. 93% of the training errors were less than 5%, and 89% of the prediction errors were less than 10%. The GA–RF model can spot promptly the main influencing factors of CBM production and has high accuracy for the production prediction of CBM wells.
format Online
Article
Text
id pubmed-9025998
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-90259982022-04-25 Coalbed Methane Production Model Based on Random Forests Optimized by a Genetic Algorithm Zhu, Jie Zhao, Yuhan Hu, Qiujia Zhang, Yang Shao, Tangsha Fan, Bin Jiang, Yaodong Chen, Zhen Zhao, Meng ACS Omega [Image: see text] It is of great significance to evaluate and predict coalbed methane (CBM) production for the exploitation and exploration of CBM. The flow characteristics of gas and water are very complicated and important in the process of CBM exploitation. In recent years, machine learning has been introduced to analyze CBM well production and its influence based on the historical production data. However, there are some problems with the determination of hyperparameters in machine learning algorithms. Some previous random forests (RF) models of CBM production prediction were suitable for individual CBM wells, but for different types of CBM wells, a large amount of time is needed to adjust the hyperparameters. Therefore, a genetic algorithm (GA) was applied to optimize RF, and a hybrid GA–RF algorithm was presented to solve this problem, which can automatically adjust two important hyperparameters, n(tree) and m(try), and adapt different types of CBM wells. Meanwhile, the Pearson method and RF were carried out in this work to analyze the data of CBM well production to avoid multicollinearity caused by the improper selection of the model’s independent variables. The importance and correlation analysis of drainage control parameters, including casing pressure (P(c)), bottom-hole pressure (P(b)), stroke frequency (f(s)), liquid column depth (D(L)), daily decline of bottom-hole pressure (P(bd)), and daily decline of casing pressure (P(cd)) were obtained. It was found that the casing pressure, bottom-hole pressure, and stroke frequency had more effects on the gas production of CBM wells than other drainage control parameters. Furthermore, the correlation and importance order of the influencing factors were: P(c) > P(b) > f(s) > P(bd) > P(cd) > D(L) and P(c) > P(b) > f(s) > D(L) > P(bd) > P(cd), respectively. A CBM production model based on the GA–RF algorithm was constructed to study and predict the gas production of CBM wells in Qinshui Basin, China. Compared with the production model based on RF, this model can automatically optimize its hyperparameters to adapt to different types of CBM wells, and the mean-square-error of the GA–RF algorithm can be reduced by 40–60% than that of RF. 93% of the training errors were less than 5%, and 89% of the prediction errors were less than 10%. The GA–RF model can spot promptly the main influencing factors of CBM production and has high accuracy for the production prediction of CBM wells. American Chemical Society 2022-04-07 /pmc/articles/PMC9025998/ /pubmed/35474819 http://dx.doi.org/10.1021/acsomega.2c00519 Text en © 2022 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Zhu, Jie
Zhao, Yuhan
Hu, Qiujia
Zhang, Yang
Shao, Tangsha
Fan, Bin
Jiang, Yaodong
Chen, Zhen
Zhao, Meng
Coalbed Methane Production Model Based on Random Forests Optimized by a Genetic Algorithm
title Coalbed Methane Production Model Based on Random Forests Optimized by a Genetic Algorithm
title_full Coalbed Methane Production Model Based on Random Forests Optimized by a Genetic Algorithm
title_fullStr Coalbed Methane Production Model Based on Random Forests Optimized by a Genetic Algorithm
title_full_unstemmed Coalbed Methane Production Model Based on Random Forests Optimized by a Genetic Algorithm
title_short Coalbed Methane Production Model Based on Random Forests Optimized by a Genetic Algorithm
title_sort coalbed methane production model based on random forests optimized by a genetic algorithm
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9025998/
https://www.ncbi.nlm.nih.gov/pubmed/35474819
http://dx.doi.org/10.1021/acsomega.2c00519
work_keys_str_mv AT zhujie coalbedmethaneproductionmodelbasedonrandomforestsoptimizedbyageneticalgorithm
AT zhaoyuhan coalbedmethaneproductionmodelbasedonrandomforestsoptimizedbyageneticalgorithm
AT huqiujia coalbedmethaneproductionmodelbasedonrandomforestsoptimizedbyageneticalgorithm
AT zhangyang coalbedmethaneproductionmodelbasedonrandomforestsoptimizedbyageneticalgorithm
AT shaotangsha coalbedmethaneproductionmodelbasedonrandomforestsoptimizedbyageneticalgorithm
AT fanbin coalbedmethaneproductionmodelbasedonrandomforestsoptimizedbyageneticalgorithm
AT jiangyaodong coalbedmethaneproductionmodelbasedonrandomforestsoptimizedbyageneticalgorithm
AT chenzhen coalbedmethaneproductionmodelbasedonrandomforestsoptimizedbyageneticalgorithm
AT zhaomeng coalbedmethaneproductionmodelbasedonrandomforestsoptimizedbyageneticalgorithm