(Created blank page)
 
Line 1: Line 1:
 +
                               
 +
==Abstract==
  
 +
Non-linear soil behaviour adds complexity in accurate parameter selection for numerical modelling. One of these parameters is the small-strain shear stiffness. This parameter depends strongly on the soil mass density and the shear wave velocity; the latter can be determined through in-situ tests or laboratory tests. The paper focuses on training various machine learning models to predict shear wave velocity estimates based on raw data from cone penetration test soundings. Three decision tree algorithms are considered for the analysis: XGBRegressor, HistGradientRegressor, and RandomForest. Various data preprocessing approaches are investigated, including noise removal and outlier identification, to assess their impact on the model performance. The results indicate that different data preprocessing approaches yield significant differences in the model performances. When applied to unseen raw data from a sand site of the Norwegian GeoTest Site, the model demonstrates promising predictive capabilities and is in a good agreement with well-known correlations. This study underlines the importance of data quality and preprocessing for reliable machine learning models. To enhance transparency and reproducibility, a GitHub repository with all the used files is made available online.

Revision as of 15:42, 6 June 2024

Abstract

Non-linear soil behaviour adds complexity in accurate parameter selection for numerical modelling. One of these parameters is the small-strain shear stiffness. This parameter depends strongly on the soil mass density and the shear wave velocity; the latter can be determined through in-situ tests or laboratory tests. The paper focuses on training various machine learning models to predict shear wave velocity estimates based on raw data from cone penetration test soundings. Three decision tree algorithms are considered for the analysis: XGBRegressor, HistGradientRegressor, and RandomForest. Various data preprocessing approaches are investigated, including noise removal and outlier identification, to assess their impact on the model performance. The results indicate that different data preprocessing approaches yield significant differences in the model performances. When applied to unseen raw data from a sand site of the Norwegian GeoTest Site, the model demonstrates promising predictive capabilities and is in a good agreement with well-known correlations. This study underlines the importance of data quality and preprocessing for reliable machine learning models. To enhance transparency and reproducibility, a GitHub repository with all the used files is made available online.

Back to Top

Document information

Published on 06/06/24
Submitted on 06/06/24

Volume Data-driven site characterization, 2024
DOI: 10.23967/isc.2024.148
Licence: CC BY-NC-SA license

Document Score

0

Views 0
Recommendations 0

Share this document

claim authorship

Are you one of the authors of this document?