-
QuakeFlow: A Scalable Machine-learning-based Earthquake Monitoring Workflow with Cloud Computing
Authors:
Weiqiang Zhu,
Alvin Brian Hou,
Robert Yang,
Avoy Datta,
S. Mostafa Mousavi,
William L. Ellsworth,
Gregory C. Beroza
Abstract:
Earthquake monitoring workflows are designed to detect earthquake signals and to determine source characteristics from continuous waveform data. Recent developments in deep learning seismology have been used to improve tasks within earthquake monitoring workflows that allow the fast and accurate detection of up to orders of magnitude more small events than are present in conventional catalogs. To…
▽ More
Earthquake monitoring workflows are designed to detect earthquake signals and to determine source characteristics from continuous waveform data. Recent developments in deep learning seismology have been used to improve tasks within earthquake monitoring workflows that allow the fast and accurate detection of up to orders of magnitude more small events than are present in conventional catalogs. To facilitate the application of machine-learning algorithms to large-volume seismic records, we developed a cloud-based earthquake monitoring workflow, QuakeFlow, that applies multiple processing steps to generate earthquake catalogs from raw seismic data. QuakeFlow uses a deep learning model, PhaseNet, for picking P/S phases and a machine learning model, GaMMA, for phase association with approximate earthquake location and magnitude. Each component in QuakeFlow is containerized, allowing straightforward updates to the pipeline with new deep learning/machine learning models, as well as the ability to add new components, such as earthquake relocation algorithms. We built QuakeFlow in Kubernetes to make it auto-scale for large datasets and to make it easy to deploy on cloud platforms, which enables large-scale parallel processing. We used QuakeFlow to process three years of continuous archived data from Puerto Rico, and found more than a factor of ten more events that occurred on much the same structures as previously known seismicity. We applied Quakeflow to monitoring frequent earthquakes in Hawaii and found over an order of magnitude more events than are in the standard catalog, including many events that illuminate the deep structure of the magmatic system. We also added Kafka and Spark streaming to deliver real-time earthquake monitoring results. QuakeFlow is an effective and efficient approach both for improving realtime earthquake monitoring and for mining archived seismic data sets.
△ Less
Submitted 30 August, 2022;
originally announced August 2022.
-
Earthquake Phase Association using a Bayesian Gaussian Mixture Model
Authors:
Weiqiang Zhu,
Ian W. McBrearty,
S. Mostafa Mousavi,
William L. Ellsworth,
Gregory C. Beroza
Abstract:
Earthquake phase association algorithms aggregate picked seismic phases from a network of seismometers into individual earthquakes and play an important role in earthquake monitoring. Dense seismic networks and improved phase picking methods produce massive earthquake phase data sets, particularly for earthquake swarms and aftershocks occurring closely in time and space, making phase association a…
▽ More
Earthquake phase association algorithms aggregate picked seismic phases from a network of seismometers into individual earthquakes and play an important role in earthquake monitoring. Dense seismic networks and improved phase picking methods produce massive earthquake phase data sets, particularly for earthquake swarms and aftershocks occurring closely in time and space, making phase association a challenging problem. We present a new association method, the Gaussian Mixture Model Association (GaMMA), that combines the Gaussian mixture model for phase measurements (both time and amplitude), with earthquake location, origin time, and magnitude estimation. We treat earthquake phase association as an unsupervised clustering problem in a probabilistic framework, where each earthquake corresponds to a cluster of P and S phases with hyperbolic moveout of arrival times and a decay of amplitude with distance. We use a multivariate Gaussian distribution to model the collection of phase picks for an event, the mean of which is given by the predicted arrival time and amplitude from the causative event. We carry out the pick assignment for each earthquake and determine earthquake parameters (i.e., earthquake location, origin time, and magnitude) under the maximum likelihood criterion using the Expectation-Maximization (EM) algorithm. The GaMMA method does not require the typical association steps of other algorithms, such as grid-search or supervised training. The results on both synthetic test and the 2019 Ridgecrest earthquake sequence show that GaMMA effectively associates phases from a temporally and spatially dense earthquake sequence while producing useful estimates of earthquake location and magnitude.
△ Less
Submitted 18 September, 2021;
originally announced September 2021.
-
Low-magnitude Seismicity with a Downhole Distributed Acoustic Sensing Array -- examples from the FORGE Geothermal Experiment
Authors:
Ariel Lellouch,
Ryan Schultz,
Nathaniel J. Lindsey,
Biondo Biondi,
William L. Ellsworth
Abstract:
We show the capabilities of a downhole Distributed Acoustic Sensing (DAS) array in detecting, locating and characterizing low-magnitude earthquakes occurring in the vicinity of the Frontier Observatory for Research in Geothermal Energy (FORGE) site in Utah. 10.5 days of continuous data were acquired in a monitoring well at the FORGE geothermal site during the initial stimulation of an Enhanced Geo…
▽ More
We show the capabilities of a downhole Distributed Acoustic Sensing (DAS) array in detecting, locating and characterizing low-magnitude earthquakes occurring in the vicinity of the Frontier Observatory for Research in Geothermal Energy (FORGE) site in Utah. 10.5 days of continuous data were acquired in a monitoring well at the FORGE geothermal site during the initial stimulation of an Enhanced Geothermal System in April-May 2019. Earthquake activity beneath Mineral Mountains, Utah also occurred within 10 km of the FORGE monitoring well. During the experiment, four events from those areas were cataloged by the University of Utah Seismograph Stations. Our processing of DAS data, including template matching, finds 82 earthquakes during that period, of which 16 are visible on the regional network. The magnitude of completeness obtained by DAS processing is better by at least M=0.5 than the dense surface array around the FORGE site. While a single vertical DAS array is limited in terms of event location due to its azimuthal ambiguity, multiple DAS wells or a combination of a downhole array with surface stations or near-surface horizontal DAS could jointly resolve locations. All detected events probably originated from the two active source areas and can be clustered into several distinct families.
△ Less
Submitted 9 July, 2020; v1 submitted 26 June, 2020;
originally announced June 2020.