1. Earthquake detection through computationally efficient similarity search
- Author
-
Gregory C. Beroza, Karianne J. Bergen, Ossian O'Reilly, and C. E. Yoon
- Subjects
seismic monitoring ,Computer science ,Nearest neighbor search ,Poison control ,Locality-sensitive hashing ,Physics::Geophysics ,Waveform ,Mathematics::Metric Geometry ,earthquake detection ,earthquakes ,Simulation ,Research Articles ,Seismology ,Multidisciplinary ,business.industry ,Autocorrelation ,Fingerprint (computing) ,computational seismology ,InformationSystems_DATABASEMANAGEMENT ,SciAdv r-articles ,Pattern recognition ,similarity search ,Thresholding ,13. Climate action ,Scalability ,Artificial intelligence ,locality-sensitive hashing ,business ,Research Article - Abstract
Data mining techniques applied to continuous waveforms reveal many previously unknown small earthquakes., Seismology is experiencing rapid growth in the quantity of data, which has outpaced the development of processing algorithms. Earthquake detection—identification of seismic events in continuous data—is a fundamental operation for observational seismology. We developed an efficient method to detect earthquakes using waveform similarity that overcomes the disadvantages of existing detection methods. Our method, called Fingerprint And Similarity Thresholding (FAST), can analyze a week of continuous seismic waveform data in less than 2 hours, or 140 times faster than autocorrelation. FAST adapts a data mining algorithm, originally designed to identify similar audio clips within large databases; it first creates compact “fingerprints” of waveforms by extracting key discriminative features, then groups similar fingerprints together within a database to facilitate fast, scalable search for similar fingerprint pairs, and finally generates a list of earthquake detections. FAST detected most (21 of 24) cataloged earthquakes and 68 uncataloged earthquakes in 1 week of continuous data from a station located near the Calaveras Fault in central California, achieving detection performance comparable to that of autocorrelation, with some additional false detections. FAST is expected to realize its full potential when applied to extremely long duration data sets over a distributed network of seismic stations. The widespread application of FAST has the potential to aid in the discovery of unexpected seismic signals, improve seismic monitoring, and promote a greater understanding of a variety of earthquake processes.
- Published
- 2015