1.3.3.1. Superpixel-level spectral change representation
The original SCVs focus on the spectral variation representation from each individual pixel, thus ignoring spatial correlation with neighboring pixels and the local spectral homogeneity associated with real land-cover objects. This may lead to the commission and omission errors, and a decrease in the overall detection accuracy. Superpixel segmentation can capture image redundancy and generate convenient primitives to compute representative features, while reducing the complexity of the subsequent processing and analysis. In this work, we used the simple linear iterative clustering (SLIC) algorithm (Achanta et al. 2012) as the core algorithm for generating superpixel segments. Compared to the other popular segmentation methods, SLIC offers a better performance in boundary adherence and generates superpixels efficiently under the same hardware conditions. Moreover, SLIC is memory efficient, and only requires the storage of the distance from each pixel to its nearest cluster center. Most importantly, it can be smoothly integrated within the proposed method, especially from the pixel to superpixel-level spectral change representation and detection, which drives a proper algorithm utilization.
The general idea of the SLIC algorithm is to find small regional clusters by considering their local homogeneity (Achanta et al. 2012). The key step is to calculate the distance d that implements a measurement from the spectral–spatial point of view. Let dcolor and dxy be the spectral and spatial distances between two given pixels α and β, respectively, defined as:
[1.14]
[1.15]
Here, (L, A, B)T denotes the CIELAB color space values, with L being the color lightness and A and B representing color values along red-green and blue-yellow axes, respectively. (x, y)T denotes the coordinates of a given pixel. A final weighted distance measure dαβ can be defined as:
[1.16]
where s is the width of grids. It controls the size of created superpixels, i.e. the greater the s, the larger the superpixels. A roughly equal-sized grid interval can be defined as
In order to enhance the spectral variations due to the limited bands in multispectral images, principal component analysis (PCA) is applied to the original SCVs. This strengthens the change representation and extends the feature space. The first three principal components (i.e. PCs) are used in the SLIC algorithm to generate the segments (i.e. superpixels) with the identified boundaries. Then original SCVs are stacked with the PCs to create an enhanced feature set (denoted as SCVs-PC). Note that normalization is conducted on SCVs-PC bands to make data dynamic range consistent. Finally, a mean operation is applied on each segment in the SCVs-PC bands, where the mean vector is used to replace the original SCVs-PC vectors, in order to achieve the enhanced spectral change representation at the superpixel level. Note that, by doing this, the change information is concentrated with spectral–spatial coherence and the computational cost is largely reduced when compared with the original pixel-wise processing.
1.3.3.2. Determination of the optimal segmentation scale
As mentioned previously, the number of superpixels N and the compactness factor m need to be determined in the SLIC algorithm. Note that in practical implementations, the parameter m impacts less than N on the segmentation results. Therefore, after multiple trials, we fixed m = 30 in this work. The only focus is on the determination of the optimal segmentation scale parameter N . To this end, an unsupervised strategy is applied based on the analysis of the global entropy. Note that after the mean operation on each superpixel, the texture information in the segments will be relatively suppressed, which may have an influence on the following CD performance. The main idea of the used criterion is to evaluate the information maintained in the superpixel-level segmented image inherited from the original pixel-level image. Thus, the one-dimensional image entropy (Global Entropy, GE) (Han et al. 2008) is calculated based on multi-scale segmentation results, where the optimal segmentation scale is determined by analyzing the change of GE values:
[1.17]
where n denotes the gray level and p = (pk)k=1,2,...,n contains the histogram counts of the first three bands of Y'. It is worth noting that with the increasing N, GE values are expected to increase continuously and approach the value of the original image. The logarithmic function is then used to fit the GE results to estimate the threshold for the optimal segmentation scale. A detailed description of this step is provided in Table 1.1.
Table 1.1. Determination of the optimal segmentation scale based on GE analysis
Step 1: Initialize N, which is approximated as the smaller one in the rows and columns ofthe input image, and the segmentation scale interval is set approximately equal to (row +column)/20. |
Step 2: Calculate the GE value of each segmented image under different searching scales. |
Step 3: Fit the logarithmic function on the obtained GE results and calculate the gradient. |
Step 4: Estimate the optimal segmentation scale by analyzing the gradient of GE values, where the convergence threshold TGE is defined approximately equal to 100/(row + column) based on the input image. |
1.3.3.3. Decision fusion-based CD
The multiclass CD is implemented based on the change magnitude and direction variables (i.e. ρ and θ) constructed by S2CVA at the superpixel level. In paticular, a decision fusion-based CD step is proposed to enhance and optimize the binary CD output (see Figure 1.5), by analyzing the change magnitude of SCVs-PC bands. Three binary CD results are taken into account based on a majority voting process, to determine whether a given superpixel (segment) is changed or not. The first input is made from the pixel-level binary CD result constrained by the segmented boundaries, where a threshold Tρ is defined on ρ by using the EM algorithm under a Bayesian framework (denoted as Bayesian-EM) (Bruzzone and Prieto 2000a). Change and no-change pixels are counted within each superpixel, respectively, in order to determine the label of that superpixel with respect to the counting