Petroleum Science > DOI: https://doi.org/10.1016/j.petsci.2024.11.012
Effect of preprocessing on performances of machine learning-based mineral composition analysis on gas hydrate sediments, Ulleung Basin, East Sea (Open Access)
Article information
Authors: Hong-Keun Jin, Ju-Young Park, Sun-Young Park, Byeong-Kook Son, Bae-Hyun Min, Kyung-Book Lee
Affiliations:
Submitted:
Citation: Hong-Keun Jin, Ju-Young Park, Sun-Young Park, Byeong-Kook Son, Bae-Hyun Min, Kyung-Book Lee, Effect of preprocessing on performances of machine learning-based mineral composition analysis on gas hydrate sediments, Ulleung Basin, East Sea, Petroleum Science, 2024, https://doi.org/10.1016/j.petsci.2024.11.012.
Abstract
Gas hydrate (GH) is an unconventional resource with an estimated worldwide volume of 1,000−120,000 trillion m³. Research on GH is ongoing to determine its geological and flow characteristics for commercial production. After two large-scale drilling expeditions to study the GH-bearing zone in the Ulleung Basin, the mineral composition of 488 sediment samples was analyzed using X-ray diffraction (XRD). Because the analysis is costly and dependent on experts, a machine learning model was developed to predict the mineral composition using XRD intensity profiles as input data. However, the model's performance was limited by improper preprocessing of the intensity profile: because preprocessing was applied to each feature, the intensity trend within a profile was not preserved, even though this trend is the most important factor when analyzing mineral composition. In this study, the profile was preprocessed for each sample using min-max scaling because relative intensity is critical for mineral analysis. On the 49 test samples among the 488 samples, the convolutional neural network (CNN) model improved the average absolute error and coefficient of determination by 41% and 46%, respectively, compared with the CNN model with feature-based preprocessing. This study confirms that combining preprocessing for each sample with CNN is the most efficient approach for analyzing XRD data. The developed model can be used for the compositional analysis of sediment samples from the Ulleung Basin and the Korea Plateau. In addition, the overall procedure can be applied to any XRD data of sediments worldwide.
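The difference between the two preprocessing schemes described in the abstract can be illustrated with a minimal sketch. It assumes the XRD profiles are stored as a 2-D array with one row per sample and one column per diffraction angle; the function names, the `eps` guard, and the toy intensities are hypothetical and are not taken from the paper.

```python
import numpy as np

def minmax_per_sample(profiles, eps=1e-12):
    """Scale each row (one intensity profile) to [0, 1] on its own."""
    profiles = np.asarray(profiles, dtype=float)
    lo = profiles.min(axis=1, keepdims=True)
    hi = profiles.max(axis=1, keepdims=True)
    # eps (an assumption) avoids division by zero for a flat profile.
    return (profiles - lo) / np.maximum(hi - lo, eps)

def minmax_per_feature(profiles, eps=1e-12):
    """Scale each column (one diffraction angle) to [0, 1] across samples."""
    profiles = np.asarray(profiles, dtype=float)
    lo = profiles.min(axis=0, keepdims=True)
    hi = profiles.max(axis=0, keepdims=True)
    return (profiles - lo) / np.maximum(hi - lo, eps)

# Synthetic data: two profiles with the same peak pattern,
# the second measured at twice the overall intensity.
toy_profiles = np.array([
    [10.0, 50.0, 20.0, 100.0],
    [20.0, 100.0, 40.0, 200.0],
])

sample_scaled = minmax_per_sample(toy_profiles)
feature_scaled = minmax_per_feature(toy_profiles)

print(sample_scaled)   # both rows keep the same relative peak pattern
print(feature_scaled)  # each column is rescaled across samples instead
```

In this toy case the sample-based version maps both profiles to the same relative pattern, whereas the feature-based version flattens each profile to a constant row, which is one way to see why the intensity trend within a profile can be lost.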
Keywords
Sample-based preprocessing; X-ray diffraction (XRD); machine learning; mineral composition; gas hydrate (GH); Ulleung Basin