Cargando…

Clustering Analysis Methods for GNSS Observations: A Data‐Driven Approach to Identifying California's Major Faults

We present a data‐driven approach to clustering or grouping Global Navigation Satellite System (GNSS) stations according to observed velocities, displacements or other selected characteristics. Clustering GNSS stations provides useful scientific information, and is a necessary initial step in other...

Descripción completa

Detalles Bibliográficos
Autores principales: Granat, Robert, Donnellan, Andrea, Heflin, Michael, Lyzenga, Gregory, Glasscoe, Margaret, Parker, Jay, Pierce, Marlon, Wang, Jun, Rundle, John, Ludwig, Lisa G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8596415/
https://www.ncbi.nlm.nih.gov/pubmed/34820480
http://dx.doi.org/10.1029/2021EA001680
_version_ 1784600368154411008
author Granat, Robert
Donnellan, Andrea
Heflin, Michael
Lyzenga, Gregory
Glasscoe, Margaret
Parker, Jay
Pierce, Marlon
Wang, Jun
Rundle, John
Ludwig, Lisa G.
author_facet Granat, Robert
Donnellan, Andrea
Heflin, Michael
Lyzenga, Gregory
Glasscoe, Margaret
Parker, Jay
Pierce, Marlon
Wang, Jun
Rundle, John
Ludwig, Lisa G.
author_sort Granat, Robert
collection PubMed
description We present a data‐driven approach to clustering or grouping Global Navigation Satellite System (GNSS) stations according to observed velocities, displacements or other selected characteristics. Clustering GNSS stations provides useful scientific information, and is a necessary initial step in other analysis, such as detecting aseismic transient signals (Granat et al., 2013, https://doi.org/10.1785/0220130039). Desired features of the data can be selected for clustering, including some subset of displacement or velocity components, uncertainty estimates, station location, and other relevant information. Based on those selections, the clustering procedure autonomously groups the GNSS stations according to a selected clustering method. We have implemented this approach as a Python application, allowing us to draw upon the full range of open source clustering methods available in Python's scikit‐learn package (Pedregosa et al., 2011, https://doi.org/10.5555/1953048.2078195). The application returns the stations labeled by group as a table and color coded KML file and is designed to work with the GNSS information available from GeoGateway (Donnellan et al., 2021, https://doi.org/10.1007/s12145-020-00561-7; Heflin et al., 2020, https://doi.org/10.1029/2019ea000644) but is easily extensible. We demonstrate the methodology on California and western Nevada. The results show partitions that follow faults or geologic boundaries, including for recent large earthquakes and post‐seismic motion. The San Andreas fault system is most prominent, reflecting Pacific‐North American plate boundary motion. Deformation reflected as class boundaries is distributed north and south of the central California creeping section. For most models a cluster boundary connects the southernmost San Andreas fault with the Eastern California Shear Zone (ECSZ) rather than continuing through the San Gorgonio Pass.
format Online
Article
Text
id pubmed-8596415
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-85964152021-11-22 Clustering Analysis Methods for GNSS Observations: A Data‐Driven Approach to Identifying California's Major Faults Granat, Robert Donnellan, Andrea Heflin, Michael Lyzenga, Gregory Glasscoe, Margaret Parker, Jay Pierce, Marlon Wang, Jun Rundle, John Ludwig, Lisa G. Earth Space Sci Technical Reports: Methods We present a data‐driven approach to clustering or grouping Global Navigation Satellite System (GNSS) stations according to observed velocities, displacements or other selected characteristics. Clustering GNSS stations provides useful scientific information, and is a necessary initial step in other analysis, such as detecting aseismic transient signals (Granat et al., 2013, https://doi.org/10.1785/0220130039). Desired features of the data can be selected for clustering, including some subset of displacement or velocity components, uncertainty estimates, station location, and other relevant information. Based on those selections, the clustering procedure autonomously groups the GNSS stations according to a selected clustering method. We have implemented this approach as a Python application, allowing us to draw upon the full range of open source clustering methods available in Python's scikit‐learn package (Pedregosa et al., 2011, https://doi.org/10.5555/1953048.2078195). The application returns the stations labeled by group as a table and color coded KML file and is designed to work with the GNSS information available from GeoGateway (Donnellan et al., 2021, https://doi.org/10.1007/s12145-020-00561-7; Heflin et al., 2020, https://doi.org/10.1029/2019ea000644) but is easily extensible. We demonstrate the methodology on California and western Nevada. The results show partitions that follow faults or geologic boundaries, including for recent large earthquakes and post‐seismic motion. The San Andreas fault system is most prominent, reflecting Pacific‐North American plate boundary motion. Deformation reflected as class boundaries is distributed north and south of the central California creeping section. For most models a cluster boundary connects the southernmost San Andreas fault with the Eastern California Shear Zone (ECSZ) rather than continuing through the San Gorgonio Pass. John Wiley and Sons Inc. 2021-10-29 2021-11 /pmc/articles/PMC8596415/ /pubmed/34820480 http://dx.doi.org/10.1029/2021EA001680 Text en © 2021 Jet Propulsion Laboratory, California Institute of Technology. Government sponsorship acknowledged. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Technical Reports: Methods
Granat, Robert
Donnellan, Andrea
Heflin, Michael
Lyzenga, Gregory
Glasscoe, Margaret
Parker, Jay
Pierce, Marlon
Wang, Jun
Rundle, John
Ludwig, Lisa G.
Clustering Analysis Methods for GNSS Observations: A Data‐Driven Approach to Identifying California's Major Faults
title Clustering Analysis Methods for GNSS Observations: A Data‐Driven Approach to Identifying California's Major Faults
title_full Clustering Analysis Methods for GNSS Observations: A Data‐Driven Approach to Identifying California's Major Faults
title_fullStr Clustering Analysis Methods for GNSS Observations: A Data‐Driven Approach to Identifying California's Major Faults
title_full_unstemmed Clustering Analysis Methods for GNSS Observations: A Data‐Driven Approach to Identifying California's Major Faults
title_short Clustering Analysis Methods for GNSS Observations: A Data‐Driven Approach to Identifying California's Major Faults
title_sort clustering analysis methods for gnss observations: a data‐driven approach to identifying california's major faults
topic Technical Reports: Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8596415/
https://www.ncbi.nlm.nih.gov/pubmed/34820480
http://dx.doi.org/10.1029/2021EA001680
work_keys_str_mv AT granatrobert clusteringanalysismethodsforgnssobservationsadatadrivenapproachtoidentifyingcaliforniasmajorfaults
AT donnellanandrea clusteringanalysismethodsforgnssobservationsadatadrivenapproachtoidentifyingcaliforniasmajorfaults
AT heflinmichael clusteringanalysismethodsforgnssobservationsadatadrivenapproachtoidentifyingcaliforniasmajorfaults
AT lyzengagregory clusteringanalysismethodsforgnssobservationsadatadrivenapproachtoidentifyingcaliforniasmajorfaults
AT glasscoemargaret clusteringanalysismethodsforgnssobservationsadatadrivenapproachtoidentifyingcaliforniasmajorfaults
AT parkerjay clusteringanalysismethodsforgnssobservationsadatadrivenapproachtoidentifyingcaliforniasmajorfaults
AT piercemarlon clusteringanalysismethodsforgnssobservationsadatadrivenapproachtoidentifyingcaliforniasmajorfaults
AT wangjun clusteringanalysismethodsforgnssobservationsadatadrivenapproachtoidentifyingcaliforniasmajorfaults
AT rundlejohn clusteringanalysismethodsforgnssobservationsadatadrivenapproachtoidentifyingcaliforniasmajorfaults
AT ludwiglisag clusteringanalysismethodsforgnssobservationsadatadrivenapproachtoidentifyingcaliforniasmajorfaults