25. D. Eberhart, Ph., Investigation of crustal structure and active tectonic processes in the coast ranges, central California, PhD. thesis in Stanford University (1989).
26. Z. Wang and A. Nur, Seismic and Acoustic Velocities in Reservoir Rocks, Volume 2: SEG Geophysics Reprint Series 10, Society of Exploration Geophysicists (1992).
27. J. P. Dvorkin, and S. Alkhater, Pore fluid and porosity mapping from seismic, First Break, 22, 53–57 (2004).
28. I. Takahashi, Rock physics as a quantitative tool for seismic reservoir characterization, INPEX Corporation, p. 175 (2005).
29. P. Avseth, T. Mukerji, and G. Mavko, Quantitative Seismic Interpretation, Cambridge University Press, Cambridge (2005).
1 * Corresponding author: [email protected]
3
Anomaly Detection within Homogenous Geologic Area
Simon Katz1⋆, Fred Aminzadeh2, George Chilingar2 and Leonid Khilyuk1
1Russian Academy of Natural Sciences, US Branch, Los Angeles, California
2Petroleum Engineering Program, School of Engineering, University of Southern California
Abstract
This paper describes the development and investigation of a methodology and multi-step algorithms for the detection of geologic anomalies. The algorithms presented in this paper belong to the group of machine learning one-class classification techniques. The developed methodology includes the following steps: (a) compiling a single-class training set that includes data recorded in a known homogeneous area, (b) construction of universal anomaly detection classifiers (AD classifiers) and high-resolution classifiers specifically designed for detection of an anomaly of a certain type, (c) construction of adaptive AD classifiers, and (d) testing a hypothesis about an anomaly type using data from the part of the anomaly detected by the AD classifier and from the training set.
Three basis AD classifiers were suggested and tested for identification of anomalies: (a) distance-from-the-center of the training set, (b) sparsity of neighbors from the training set in the vicinity of a tested record, and (c) divergence from the center of the training set. Distance-from-the-center of the training set and sparsity of neighbors are universal and may be used for detection of anomalies with unknown properties and for optimization of adaptive classifiers. Divergence is a specialized classifier designed for detection of anomalies with known properties. New adaptive classifiers presented in the paper may be used for detection of anomalies of unknown type with efficiency superior to universal classifiers. The performed tests with the developed algorithms demonstrate the high efficiency of the developed one-class classifiers in the detection of gas-filled sands as anomalies even when moderate size training sets were utilized. A hypothesis test about anomaly type had statistically significant results in the wide range of test set sizes when an adaptive classifier was used for anomaly detection.
Keywords: Anomaly detection, one-class classification, posterior ROC-curve analysis, optimization, seismic velocities, rock density, brine, gas, shale
3.1 Introduction
An anomaly detection problem arises in geological research and reservoir characterization when, for example, there is a need to locate gas-filled sands in the vicinity of the area with identified brine-filled sands or shales, Katz et al., [8]. Other examples of anomaly detection problems include finding the location of an overpressure zone using data obtained from an area with normal gas pressure Dvorkin et al., [6], Gurevich et al., [7] and finding the location of fractured carbonates filled with gas, Chilingar et al., [5], Maity and Aminzadeh [12].
Detection of anomalous behavior in seismic and/or well log data, which can be attributed to the presence of hydrocarbon, has been the subject of many research activities. Seismic attributes introduced by Taner [16] have been widely used to detect seismic anomalies. Subsequently, many methods have been used to combine such attributes, either through clustering, Aminzadeh and Chaterjee [10], or neural networks Aminzadeh et al., [13]. Then, the meta attributes concept was introduced which combines machine and human intelligence for a targeted anomaly detection Aminzadeh [11‒15]. For a comprehensive overview of seismic attribute methodologies see Chopra and Murfurt [17].
The methodology for anomaly detection (AD) presented in this paper is different from multiclass classification where a training set is formed as a union of several subsets, and each subset contains records from a single class. Records from all classes are mixed in the training set. It is also different from the classical methodology for outlier detection. Outlier detection methodology is aimed at analysis of a dataset that may contain both regular and outlier records which are not identified prior to outlier analysis, so there is no training set (Barnett [2], Barnett and Lewis [3], Agrawal and Raghavan [1], Breunig et al. [4]).
In this paper writers tested new algorithms that belong to the group of one-class classification machine learning techniques. One-class classification is a comparatively new area of machine learning research (see for example Tax and Duin [18], Muñoz-Marí [19], Marti et al. [24]). Machine learning one-class classification methodology presented in this paper has several advantages compared to outlier detection. This methodology is model-free and all anomaly detection decisions are based on the use of a value of expected false discovery rate. The main advantage of this methodology is its ability to work efficiently with training sets of moderate size.
Three basis AD classifiers are tested in this paper for identification of geologic anomalies: (a) distance in multi-dimensional space from-the-center of the training set, (b) sparsity of neighbors from the training set in the vicinity of a tested record, and (c) Bregman type divergence from the center of the training set. Distance and sparsity classifiers are universal techniques. To apply them for anomaly detection one does not need prior information about properties of a possible anomaly. Divergence is a highly specialized AD classifier that takes into account differences in properties of petrophysical parameters in a regular geologic area and those within a potential anomaly. Universal classifiers were used in this paper for construction of adaptive aggregated classifiers which structure is automatically modified according to the estimated properties of the studied anomaly of unknown type.
The writers used data on compression (Vp), shear (Vs) waves velocities, and rock density ρ for various geologic conditions presented by Ramos and Castagna [9]. Their dataset contains a total of 75 measurements of Vp, Vs, and density ρ for gas-filled sands, brine-sands, and shales. There are a total of 75 triplets (Vp, Vs, ρ) in their dataset. Part of brine-filled sands and shales data was used in this paper for construction of multiple bootstrap-generated training sets, whereas test sets included a combination of records from gas-filled and brine-filled sands, and shales.
3.2 Anomaly Detection Methodology
The geologic anomaly detection methodology developed and tested in this paper relies on the use of a single-class training set that includes a fixed number of multidimensional records Y=(yk, 1, yk, 2 , …, yk, M). We constructed anomaly detection classifiers (AD classifiers) in such a way that they tend to have two main properties: (a) smaller values for the records within the training set and the records outside the training set that are of the same type as the training set records, (b) larger values for anomalous records outside the training set. AD classifiers are constructed as functions of one or several petrophysical parameters recorded at the location that is to be checked for anomaly status:
(3.1)