.tiffpng.pngconvert #1 \OutputFile \AppendGraphicsExtensions.tiff
Texture Edge detection by Patch consensus (TEP)
Abstract
We propose Texture Edge detection using Patch consensus (TEP) which is a training-free method to detect the boundary of texture. We propose a new simple way to identify the texture edge location, using the consensus of segmented local patch information. While on the boundary, even using local patch information, the distinction between textures are typically not clear, but using neighbor consensus give a clear idea of the boundary. We utilize local patch, and its response against neighboring regions, to emphasize the similarities and the differences across different textures. The step of segmentation of response further emphasizes the edge location, and the neighborhood voting gives consensus and stabilize the edge detection. We analyze texture as a stationary process to give insight into the patch width parameter verses the quality of edge detection. We derive the necessary condition for textures to be distinguished, and analyze the patch width with respect to the scale of textures. Various experiments are presented to validate the proposed model.
1 Introduction
Texture has been explored for decades [32] and fruitful results are established for different type of textures: Markov random field [25] is widely used for texture synthesis. Its generative feature fits well with the randomness of certain types of textures (wood surface, sand); Lattice based method [18] is powerful in modeling highly symmetric and periodic texture, when the texton is relatively well defined (wall paper, honeycomb); Frequency/Wavelet analysis [20, 31] utilize spacial filters to vectorize textures and plays an important role in image compression [17] and texture classification and segmentation [33, 14]. We refer to [30, 25] for a comprehensive review of classical models. Textures remains to be a challenging topic, since there is no general nor precise definition of textures, and the boundaries of different textures are especially difficult to recognize.
We explore texture edge detection and segmentation. The classical Canny edge detection [5] detects edge locations by thinning the mask function, which is computed by applying thresholds to the magnitude of . One of the most well-known variational segmentation models is the Mumford-Shah functional [23],
(1) |
Here, and are positive parameters, is a bounded image domain, is the given image, and denotes one-dimensional Hausdorff measure of the object boundary . Chan and Vese [7] proposed using level set method, and it gives very effective piece-wise constant segmentation. For textured image segmentation, some texture descriptors, such as Gabor filter, can be used with these models. Gabor filter [22] can detect localized frequency response in varied orientations and scales. Figure 1 illustrates challenges of texture edge detection. For a comprehensive review of classical texture segmentation models, we refer to [13]. In real images, there are many different types of textures and it is typically difficult to find a proper filter bank that is suitable for all types of images. More recent network based approaches with data-adaptive property is capable of accomplishing high level tasks as semantic segmentation, e.g., [3, 29, 34, 10, 19], assuming the network is well-trained.
(a) | (b) | (c) |
---|---|---|
In this paper, we propose filter-free and training-free approach utilizing local patches response to capture the similarities and the differences between textures. Non-local filter [4] for image denoising averages pixel intensity values over similar image patches effectively. In [9], the authors extend this to texture synthesis algorithm, where the pixels are selected over similar image patches to regenerate textures. One of the difficulties in applying such nonlocal filter for texture edge detection is that near the boundary of texture, there may not be similar patches to give the clear boundary, different from denoising non textures edges.
We use local patch information utilizing similar responses within one texture, and different responses against different textures, and propose a Texture Edge detection method by Patch consensus (TEP). We propose a consensus based edge detection, which utilize the fact that away from the boundary often gives a clear idea about where the boundary of the texture should be located. We analyze the statistical condition for the texture edge to be detected by patch-wise similarity, and explore the relation between the patch width and the performance of the proposed method. The contributions of this paper are as follows:
-
•
We propose a simple training-free filter-free Texture Edge detection method using patch consensus (TEP).
-
•
We statistically analyze when the texture can be separated by the patch consensus.
-
•
Numerical results are presented to validate the proposed model.
The paper is outlined as follows: In Section 2, the details of texture edge detection with Patch consensus (TEP) is presented. Statistical analysis of the proposed model is provided in Section 3. In Section 4, we present the algorithms and numerical implementation details, and in Section 5 various experiments with comparisons and applications are presented.
2 The proposed model: Texture Edge detection by Patch consensus (TEP)
Let the discrete image domain be , and let the matrix denote the given image, where represents the intensity value at a pixel location . We consider a square neighborhood of with the width 2 to be
and we denote the vector version of the image patch of to be that . We refer to as the patch width parameter. We set the order of the entry in the vector to be column-wise from left to right, i.e., let the matrix be the image, the transformation matrix pair , where
which transforms a square patch in to a vector in . Here denotes Konecker product, and is dimensional identity matrix.
The main idea of the proposed method, Texture Edge detection by Path consensus (TEP), is as follows. From a local patch , a patch response is considered. We segment these patch responses to emphasize the similarities and the differences of patch responses. Then, we collect these segmentation boundaries and construct the edge function in .
[Step 1] For each patch , we define the patch response in a larger domain as
(2) |
Here with represents the half width of the comparison neighborhood. We refer to as the large comparison region width parameter. This measures the similarity of a patch at and a patch at . When the patches and are similar, it gives near zero value, and when they are very different, it gives a high value. For computational efficiency, we take from the neighborhood , but one may use .
[Step 2] To emphasize the texture differences and capture the edge information more clearly, we segment the response on using the following unsupervised multiphase segmentation model [26]:
(3) |
where is the indicator function of each phase which partitions , is the number of phases, denotes one-dimensional Hausdorff measure, is the set of all boundaries, and is the intensity average of the phase . Here the scale term is the perimeter over the area of each phase . This model (3) automatically finds the number of phases by a greedy algorithm. In this paper, we bound the number of phases to be , thus it finds either one or two phases within the response . We define the local edge function to be
(4) |
This represents the edge from the point of view of patch .
[Step 3] A local response for points on the boundary of texture doesn’t give a good edge information in general, thus we use consensus and collect the segmented patches to determine the edge function , by superposing for ;
(5) |
This becomes a non-binary edge function representing the ratio of ’s neighbors that voted as an edge pixel. This superposition gives consensus among patch responses. Even when the texture boundary is not very clear, points away from the boundary can still give a good information about the edge location.
The flowchart of the proposed model TEP is presented in Figure 2: for each pixel and its patch , the patch responses on a larger domain is computed as . We use unsupervised segmentation to segment the patch responses to emphasize the similarities and the differences among these patch responses. The gradient of phases is used to compute the local edges . Finally, the consensus is used to get the edge map . Since we use the observer patch as input, the proposed method is self-adaptive to the image without the need of training. This also reduces the number of hyper-parameters needed in filter based approaches. The parameters needed for TEP are the patch width parameter of , the large comparison region width parameter , and one regularity parameter for the unsupervised multi-phase segmentation.
3 Analytical properties of the proposed model
In this section, we statistically analyze when the texture can be separated by the patch consensus. In particular, we model the texture as random fields, derive the necessary conditions for our model to generate distinguishable patch responses for different textures in the sense of patch-wise Euclidean distance, and study the roles of the patch width parameter , and the large comparison region width parameter .
3.1 Texture as Stationary Random Field
Random field models the self-similarity property of the natural stochastic textures well that statistical approaches are proposed for structure-texture decomposition and image denoising [16, 36, 35]. In this paper, we model texture as a two dimensional Gaussian random field [1] defined on pixels , and study how the decay of correlation of the texture random field helps to identify texture boundaries from the patch responses for stochastic textures. In the context of discussing image patches as random vectors, we use calligraphic letter to denote random vector and lowercase letter to denote a concrete vector of the same size as . We start with introducing the definitions of Gaussian random field and its related properties.
Definition 1.
Let be the pixel index. The set of random variables is a Gaussian random field, if is a -dimensional Gaussian random vector for arbitrary choices of indices , where and denotes a Gaussian variable indexed by pixel location . The probability density of is given by
where is the expectation vector and is the nonnegative definite covariance matrix.
A Gaussian random field is completely determined by its first and the second moments, i.e., its mean and covariance , and Gaussian distribution is suitable for many natural stochastic textures [36]. In this paper, we assume fast decaying of the correlation with respect to pixelwise distance and choose a squared exponential covariance function such as
(6) |
which makes the random field stationary and isotropic, here is the magnitude parameter and is the decaying rate parameter. We remark that we choose the squared exponential decaying covariance (6) for the convenience of computation, and the derivations of this section can be generalized to decaying covariance functions of any order. For textures with spatially repetitive patterns, it is natural to assume the corresponding random field to be stationary [36].
Definition 2.
A Gaussian random field is called stationary, if for every and , the joint distribution of the Gaussian random vector is independent of .
Definition 3.
A stationary Gaussian random field is called isotropic, if its covariance function only depends on the relative distance of pixels and , i.e., .
An immediate consequence of Definition 2 is that the distribution of the -dimensional image patch is independent of the choice of the patch center, i.e. for all . The patch response involves the observation of two patches and . The mutual distribution of the two patches follows
(13) |
where the covariance matrix only depends on the relative distance of pixels , as a consequence of Definition 3. The entries of the covariance function is given by (6), i.e.,
where denotes the relative distance of ’th pixel in and ’th pixel in . Let as in section 2, and we assume , which guarantees that and do not overlap, hence for all . This leads to an upper bound of the Frobenius norm of the covariance matrix :
(14) |
Fixing , the cross term as , where is null matrix. Comparing (6) and (14), the decaying rate of correlation of the image patches is consistent with the rate of the pixel-wise covariance function .
The conditional distribution of with respect to is again multivariate Gaussian, which is fully determined by its mean and variance functions
(15) | ||||
(16) |
Combining with (14), and converge to and as , i.e,
3.2 Characteristics of the patch response
In the following, we provide the main results of the section. In order to compute the expectation of , we need the following lemma:
Lemma 1 (Expectation of quadratic form [27]).
Let be a random vector with mean and variance , and let be an symmetric matrix. Then
where is the trace operator.
Theorem 1.
The proof is presented in Appendix A. Theorem 1 describes the expectation of when the patch centered at location is drawn from .
When it is not, i.e. comparing two different textures, let be another random field independent from , where the covariance function is given as
for some . If the patch is observing , we simply have
since . The expectation of the patch response is given as
(18) |
Suppose there are two textures in while the patch in is drawn from . Texture edge can be detected if the quantities (17) and (18) differs, preferably significantly differs. This difference is described by
(19) |
Notice that this difference (19) is a function of . For larger , this separation is clearer, yet, more likely may encounter another texture in a real image when is far away from . It is rather important to search for edges in in the region that is large.
3.3 Stability of the patch response w.r.t. the patch width parameter
We explore how the value of the patch response concentrates to its expectation with respect to the patch width parameter . Since is a quadratic form of two Gaussian vectors, its distribution can be described by a variation of the distribution [2].
Theorem 2.
Proof.
Fix , and denote , then follows the Gaussian distribution , where according to (16), we have
Let be an orthogonal matrix that diagonalize , that is, , where are the eigenvalues of . Define a new random vector
here is standard Gaussian, i.e., . The observed patch response can be reformulated as
(20) |
where , and and denote the ’th element of vectors , and , respectively. The response (20) is a weighted sum of squares of independent Gaussian variables . Each follows noncentral chi-squared distribution , which is fully described by the degree of freedom and noncentrality parameter , and the mean and variance of such distribution is given by and . Specifically, we have . The density of the patch response (20) in general does not have a closed form [8]. With , its variance becomes
∎
In Figure 3, (a) shows a synthetic image consists of two textures (left) and (right) from Brodatz texture images set111The Brodatz texture image set is obtained from https://sipi.usc.edu/database/. Two patches and are marked with blue and yellow squares correspondingly. (b) and (c) show two patch responses and . The brightness is proportional to the value of the patch responses. This shows that with a suitable patch width parameter , the texture edge is clearly emphasized, which is consistent with the analysis in section 3.2. The contrast of two textured regions clearly indicates the edge location.
(a) | (b) | (c) |
---|---|---|
The edge function is given by the consensus of many patch responses. For accurate edge detection, these responses from different observer patches should be consistent, that is many patch responses should recognize there is an edge. This can be measured by the distribution of , the expected response from the perspective of patch . In Figure 4 (a), the histograms of the two textures in Figure 3 (a) are presented. The intensity values of the two textures are heavily overlapped which indicates the challenges of using intensity based method to detect the texture boundaries. By using the patch based consensus, Figure 4 (b) and (c) show that as the patch width parameter increases, the more concentrated the expectation becomes. This is consistent with Theorem 2, thus hel** to distinguish two textures. Figure 4 (b) and (c) show the estimated distribution of , (b) is assuming the pixel is from a random field , and (c) is assuming the pixel is from a random field . Note that is given from (17) in Theorem 1, if is equipped with , and from (18), if equipped with . Two sets of the distributions are concentrated around estimated expectations, which are computed from the pixel-wise means and variances of the texture images. We observe the concentration effect with a larger concentration rate, which is due to the variance difference of two textures. In particular, we have , and according to Theorem 2, the variance of is for Figure 4 (b) and for Figure 4 (c). Neither of the textures and are strictly stationary nor isotropic, yet our model well-describes the behavior of the patch response.
(a) | (b) | (c) |
---|---|---|
3.4 The patch width parameter and edge detection
The quality of edge detection depends on the intensity contrast of the patch response . This contrast is given by the responses observing or by the patch centered at , and the regularity of is related to the choice of .
For two texture separation, we first assume is chosen that contains two different textures and with a fixed pixel away from any texture boundary. We use the squared Hellinger distance [11] of two probability density functions to compare the two different patch responses:
(21) |
here denotes the inner product. Squared Hellinger distance (21) is a bounded metric that measures the similarity of the probability density functions in terms of the overlap. In Figure 5, the blue curve indicates the squared Hellinger distance of the patch responses of observing textures and from the perspective of patch and the red curve is from the perspective of patch . These curves represents the differences of the density functions shown in Figure 4 (b) and (c). As increases, two responses get better separated in Figure 4 (b) and (c), which is represented as the increasing value of squared Hellinger distance. The growth of two blue and red curves are different as increases, which is due to the difference in the variance of two textures and in Figure 4. The horizontal dash line in Figure 5 shows a wide range of which gives the separation of two textures.
In practice, the patch width parameter only needs to meet segmentation requirement of one of two adjacent textures.
4 Numerical Details
We summarize the proposed method in Algorithm 1 which includes following modifications for an efficient computation.
First, when computing the patch response , if two points and are very close, i.e. is inside for small, the patches and overlapped in most parts. This results in unwanted singularity around the center of . We remove this center singularity with a local average:
(22) |
The patch response in the subdomain is replaced by the average over its complement . In practice, we choose when the patch width parameter .
Secondly, when is close to, but not overlapped with, any texture edge, the patch centered at some pixel may still see the texture edge outside . This can cause the local edge function to report a false positive edge inside , and give thick and blurry edge on . We make the local edge function to only respond within , by the following modification
(23) |
where is the distance of pixel to the boundary of , i.e.,
Thirdly, we bound the number of phases to be in the segmentation step. When , the energy (3) reduces to the variance of the given image. The effect of the parameter can be interpreted as a threshold on the segmentation model to give one or two phases. We set = 0.01 to 0.05, when normalized patch response is used. When the given image range is and the patch response is not normalize, we use = 450 to 1,000. In Figure 5, the horizontal dashed line represents the distance threshold for the two textures to be separated, i.e. the segmentation model to choose . The controls the regularity for the local edge function , and efficiently reduce the unwanted edge detected. With fixed, textures requires different patch width parameters to find an edge (if there is one).
5 Numerical Experiments
In this section, we present numerical results exploring different aspects of the proposed model. First, Figure 6 represents the procedure of the proposed method. In the center figure, for each point , yellow boxes show the local patch with , and the blue boxes show the patch responses in . For each zoomed location, we present the yellow local patch , patch response and the local edge function . For , , and , two regions are identified and an edge is found between two textures. For , two edges are found separating the patch response to three regions, here two of these three regions represents the same textured region. Notice for and , although textures are changing and patch response shows some textures, they are identified to be the same textured regions and no edges are found.
5.1 Real images with texture
We represent the texture edge detection result for real textured images, and show comparison with Canny edge detection [5]. In Figure 7, TEP finds texture and object boundaries without finding edges within textures. Zoom-in of the red and the yellow boxes in (a) are presented in (d)-(g), where (d) and (f) shows how TEP only finds the boundary of the textures. In (d), TEP result considers the checkerboard texture as one region, and is able to detect the subtle transitions at the corner of the table. The train rail is considered as an entity, despite the track lines in (d), while, the Canny edge detection in (e) finds sharp gradient change as edges, and finds the edges of the checkerboard pattern also. In (f), notice that the shades caused by wrinkles are ignored by the proposed model, while it is captured by Canny edge detection in (g). For TEP, , and are used, while for the Canny edge detection, we used for hysteresis thresholding and for Gaussian blurring.
(a) | (b) | (c) |
---|---|---|
(d) | (e) | (f) | (g) |
---|---|---|---|
In Figure 8, first two rows (a)-(g), the worm details are understood as texture in (d). For TEP, , and are used, and for Canny edge detector, threshold parameters and for Gaussian filter are used. In Figure 8 last row, the details of the hair is understood as texture by TEP, while Canny edge detection finds the details. For TEP, , and are used, and for Canny edge detector, threshold parameters and for Gaussian filter are used. TEP consistently represents the region better even for textures with complicated and large scale patterns.
(a) | (b) | (c) |
---|---|---|
(d) | (e) | (f) | (g) |
---|---|---|---|
(h) | (i) | (j) |
---|---|---|
TEP is a training-free method for texture edge detection. Yet, in Figure 9, we present images from the Berkeley segmentation dataset BSDS500, and compare with the state-of-the-art machine learning model, Edge Detector with Transformer (EDTER) [24] as an example. Since the Berkeley Segmentation dataset for edge detection was published [21], it has been a benchmark for contour detection, especially in machine learning community [34, 19, 24, 10]. These methods are trained with color images with ground-truth data provided by human experts [21] that these methods aim at object detection. On the other hand, We apply TED only on gray scale images without any a priori knowledge of the image. TEP detects local texture edges, and this is not an object detection method. Even then, in Figure 9, TEP shows good edge detection and provides comparable results to the deep learning model. In the first row images, TEP and EDTER both finds large scale region with bricks (while Canny edge detection finds details of the bricks). TEP gives different strength to the edge, some parts are weaker edges than others, while EDTER gives the same strength, since it is object oriented contour detection. In the second and third rows, TEP edge is closer to the given image, grou** different texture correctly, and TEP finds irregular texture boundary. In the last row, while TEP finds more details of the dress, EDTER is simplified, and TEP sees the texture of the flood and finds the edge of the texture, while EDTER finds the edges in the floor tiles. With texture edge detection, TEP can give comparable good edge detection results.
(a) | (b) | (c) | (d) |
---|---|---|---|
5.2 The scale of the texture vs the patch width parameter
The patch width parameter can be adjusted to find different scales in the image. In Figure 10, we experiment with image (a) which has different scales of texture for each object. The background has the smallest texton - the smallest repeating unit in the texture. The triangle, the circle and the square all have different sizes of texton in increasing order. From image (b) to (d), the patch width parameter is increasing from to 8, and as increases, TEP sees bigger patches as a texture. In (b) with a small , each texton in the square is identifies as a separate region, since it understands the texture only in the small scale that each texton within the square is understood as a separate region. In the circle while the edge of the circle is identified, within the circle it also shows some texture details. In image (d), even the big texton is captured by a large , that all objects clearly shows the texture edge boundary. When is large enough, TEP ignores the fine details within the textured region.
(a) | (b) | (c) | (d) |
---|---|---|---|
Figure 11 shows real example in (a), using for (b) and for (c). In (b), the starfish shape is identified but with many details, since with a small , only very small scale texture is identified as one region. In (c), with a large , larger textures, e.g., inside the starfish, is identified as one textured region, and only boundary of the starfish is emphasized. As the patch width parameter increase, TEP focuses on large scale structure of the given image, while grou** small details as one region.
(a) | (b) | (c) |
---|---|---|
5.3 Robustness Against Noise and Multiple junctions
We test the robustness of TEP against different levels and types of noise. Figure 12 shows the TEP result against additive Gaussian noise with increasing variance, and the TEP result against increasing salt-and-pepper noise. In the first row (a), Gaussian noise with variance 0.02, 0.04, 0.06, 0.08 to 0.1 are added from the left to the right. In the second row (b), they are TEP results with parameters , and (using normalized patch response). In the third row (c), textured images with Salt and Pepper noise ranging from , to are shown from the left to the right. In the forth row (d), TEP results are shown with the same parameters , and (with normalized patch response). As more noise are added, some parts of edge strength gets weaker. Otherwise, TEP shows robustness against Gaussian and Salt and Pepper noise. In Figure 12 (d) row, clear edge is detected up to 40-50% of Salt and Pepper noise.
(a) | |||||
---|---|---|---|---|---|
(b) |
(c) | |||||
---|---|---|---|---|---|
(d) |
When detecting edges, finding sharp junctions can be challenging, e.g., due to multiple edges meeting at one point, it can easily get blurry. In Figure 13, we experiment with images with multiple junctions of various textures. TEP computes the non-binary edge function by collecting local segmentation results, that as long as the texture can be separated, multiple junction can also be identified. This is consistent with equation (19) in Section 3, the differences of average intensities help the proposed method to identify the texture boundaries. Figure 13 (b) and (d) show TEP results showing clear edges near the junction point. The experiment results show TEP well handles junction problem with and . In (d), the strength of the edge, value may be not as high for some points near the junction in the center, and for a very accurate edge detection, multiple junction will impose challenges. One can further improve the sharpness of edge detection with small modifications, which we further discussed in Appendix B.
(a) | (b) | (c) | (d) |
---|---|---|---|
5.4 Image segmentation using the edge function and image decomposition
Using the edge function , we can design a color segmentation method, using chromaticity and brightness model [6]. We separate the given color image to the brightness and the chromaticity :
We use the edge function, and propose the following segmentation functional for each chromaticity and brightness components, with ,
(24) | ||||
where is an edge indication function, , such that is strictly decreasing, and for . In order to utilize texture edge in an effective way, needs to control the smoothness of inversely proportional to the strength of within the range of . In application, since is generated through consensus, is far from zero at texture edge, we choose to enhance the convexity of , which creates wider region near thus properly control the smoothness of .
The functionals (24) are minimized by considering the Euler-Lagrange equations with gradient decent with time evolution:
using finite difference scheme. We only used the brightness of the image to compute . Figure 14 (a) is the given image, (b) two-phase clustering of image (a), (c) shows the segmentation result of (24) and (d) is two-phase clustering of image (c). Within each region, small scale details are removed, while the edge is kept very sharp.
(a) | (b) | (c) | (d) |
---|---|---|---|
We present more segmentation results in Figure 15. The brightness of the image is used to compute , and we used parameters , and , with . For , is used and for evolution. The first row, the grains are identified as one texture, as well as some textures on the ground as another texture. In the second row, branches with leafs, and grass regions are identifies as different textures. In the third row, grains on the rock are identified as one texture, and they are well separated from the fur of the animal, even when the colors are similar. In the forth row, the texture within the coral are identified as a texture, and detail of the oscillatory boundary are well identified.
(a) | (b) | (c) |
---|---|---|
This method can be naturally extended to image decomposition, and in Figure 16, we present the remainder after the segmentation showing the details of the image.
(a) | (b) | (c) |
---|---|---|
In Appendix, we further discuss more details of the proposed method, e.g. proof of Theorem 1, behavior of periodic texture, and junction refinement.
6 Concluding Remarks
We proposed a texture edge detection method which utilize patch based consensus. We use the local patch and its response, to emphasize the similarities and the differences among textures, and segmentation of the patch response helps to locate the edge location clearly. On the boundary of texture, local patch information and patch response is not as accurate to identify edge location, that we utilize the neighbor consensus to stabilize the process. We statistically analyze when the texture can be separated, and derive necessary conditions to distinguish textures. The proposed method has three parameters which are not very sensitive to choose, and show how patch width and the size of texton is related. This method is robust to different type of noise, and can handle multiple junctions. This method is training-free and filter-free approach. We provided numerical details and various experiments which illustrate the properties of the proposed model.
In the future, one may consider refining and thinning the edge thickness, which will also improve image decomposition application in Figure 16. We can also consider using multi-scale approach to further separate more object related edges, e.g., using different scales of patch responses. We may improve the performance of TEP via utilizing a scheme with adaptive patch size which can handle more complicated real images. Also, different types of kernels can be used instead of squared euclidean distance when comparing image patches, in order to enhance the sensitivity to certain types of textures.
References
- [1] Robert J Adler and Jonathan E Taylor. Random fields and geometry. Springer Science & Business Media, 2009.
- [2] Serge B. Provost A.M. Mathai. Quadratic forms in random variables: theory and applications. M. Dekker, 1992.
- [3] Gedas Bertasius, Jianbo Shi, and Lorenzo Torresani. Deepedge: A multi-scale bifurcated deep network for top-down contour detection. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 4380–4389, 2015.
- [4] Antoni Buades, Bartomeu Coll, and Jean-Michel Morel. Non-Local Means Denoising. Image Processing On Line, 1:208–212, 2011.
- [5] John Canny. A computational approach to edge detection. IEEE Transactions on pattern analysis and machine intelligence, PAMI-8(6):679–698, 1986.
- [6] Tony F Chan, Sung Ha Kang, and Jianhong Shen. Total variation denoising and enhancement of color images based on the cb and hsv color models. Journal of Visual Communication and Image Representation, 12(4):422–435, 2001.
- [7] Tony F Chan and Luminita A Vese. Active contours without edges. IEEE Transactions on image processing, 10(2):266–277, 2001.
- [8] Robert B Davies. Algorithm as 155: The distribution of a linear combination of random variables. Applied Statistics, pages 323–333, 1980.
- [9] Alexei A Efros and Thomas K Leung. Texture synthesis by non-parametric sampling. In Proceedings of the seventh IEEE international conference on computer vision, volume 2, pages 1033–1038. IEEE, 1999.
- [10] Jianzhong He, Shiliang Zhang, Ming Yang, Yanhu Shan, and Tiejun Huang. Bi-directional cascade network for perceptual edge detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3828–3837, 2019.
- [11] Ernst Hellinger. Neue begründung der theorie quadratischer formen von unendlichvielen veränderlichen. Journal für die reine und angewandte Mathematik, 1909(136):210–271, 1909.
- [12] Byung-Woo Hong, Stefano Soatto, Kangyu Ni, and Tony Chan. The scale of a texture and its application to segmentation. In 2008 IEEE Conference on Computer Vision and Pattern Recognition, pages 1–8. IEEE, 2008.
- [13] Dana E Ilea and Paul F Whelan. Image segmentation based on the integration of colour–texture descriptors—a review. Pattern Recognition, 44(10-11):2479–2501, 2011.
- [14] Anil K Jain and Farshid Farrokhnia. Unsupervised texture segmentation using gabor filters. Pattern recognition, 24(12):1167–1186, 1991.
- [15] Peter W Jones and Triet M Le. Local scales and multiscale image decompositions. Applied and Computational Harmonic Analysis, 26(3):371–394, 2009.
- [16] Samah Khawaled and Yehoshua Y Zeevi. On the self-similarity of natural stochastic textures. arXiv preprint arXiv:1906.06768, 2019.
- [17] Adrian S Lewis and G Knowles. Image compression using the 2-d wavelet transform. IEEE Transactions on image Processing, 1(2):244–250, 1992.
- [18] Yanxi Liu, Robert T Collins, and Yanghai Tsin. A computational model for periodic pattern perception based on frieze and wallpaper groups. IEEE transactions on pattern analysis and machine intelligence, 26(3):354–371, 2004.
- [19] Yun Liu, Ming-Ming Cheng, Xiaowei Hu, Kai Wang, and Xiang Bai. Richer convolutional features for edge detection. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 3000–3009, 2017.
- [20] S Livens, P Scheunders, G Van de Wouwer, and D Van Dyck. Wavelets for texture analysis, an overview. 1997.
- [21] D. Martin, C. Fowlkes, D. Tal, and J. Malik. A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In Proc. 8th Int’l Conf. Computer Vision, volume 2, pages 416–423, July 2001.
- [22] Rajiv Mehrotra, Kameswara Rao Namuduri, and Nagarajan Ranganathan. Gabor filter-based edge detection. Pattern recognition, 25(12):1479–1494, 1992.
- [23] David Bryant Mumford and Jayant Shah. Optimal approximations by piecewise smooth functions and associated variational problems. Communications on pure and applied mathematics, 1989.
- [24] Mengyang Pu, Ya** Huang, Yuming Liu, Qingji Guan, and Haibin Ling. Edter: Edge detection with transformer. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 1402–1412, 2022.
- [25] Lara Raad, Axel Davy, Agnès Desolneux, and Jean-Michel Morel. A survey of exemplar-based texture synthesis. Annals of Mathematical Sciences and Applications, 3(1):89–148, 2018.
- [26] Berta Sandberg, Sung Ha Kang, and Tony F Chan. Unsupervised multiphase segmentation: A phase balancing model. IEEE transactions on image processing, 19(1):119–130, 2009.
- [27] George AF Seber and Alan J Lee. Linear regression analysis, volume 330. John Wiley & Sons, 2003.
- [28] Jean Serra and Luc Vincent. An overview of morphological filtering. Circuits, Systems and Signal Processing, 11:47–108, 1992.
- [29] Xavier Soria, Edgar Riba, and Angel Sappa. Dense extreme inception network: Towards a robust cnn model for edge detection. In 2020 IEEE Winter Conference on Applications of Computer Vision (WACV), pages 1912–1921, 2020.
- [30] Mihran Tuceryan and Anil K Jain. Texture analysis. Handbook of pattern recognition and computer vision, pages 235–276, 1993.
- [31] Michael Unser. Texture classification and segmentation using wavelet frames. IEEE Transactions on image processing, 4(11):1549–1560, 1995.
- [32] Luc Van Gool, Piet Dewaele, and André Oosterlinck. Texture analysis anno 1983. Computer vision, graphics, and image processing, 29(3):336–357, 1985.
- [33] Li Wang and Dong-Chen He. Texture classification using texture spectrum. Pattern recognition, 23(8):905–910, 1990.
- [34] Saining ”Xie and Zhuowen” Tu. Holistically-nested edge detection. In Proceedings of IEEE International Conference on Computer Vision, 2015.
- [35] Ruotao Xu, Yong Xu, and Yuhui Quan. Structure-texture image decomposition using discriminative patch recurrence. IEEE Transactions on Image Processing, 30:1542–1555, 2020.
- [36] Ido Zachevsky and Yehoshua Y Josh Zeevi. Statistics of natural stochastic textures and their application in image denoising. IEEE Transactions on Image Processing, 25(5):2130–2145, 2016.
Appendix A Proof of Theorem 1.
Proof.
Note that is a quadratic function of two random vectors and , the expectation needs to be computed with double integral
To handle the conditional expectation , one needs the conditional distribution of , which is Gaussian with mean (15) and variance (16). Then
(25) | ||||
(26) |
where Lemma 1 is applied to the first term of the right hand side of (25). To compute the expectation of (26) with respect to , we have the following identities:
With substitution, the expectation of is then given as
∎
Appendix B Junction edge refinement
For the edges near a junction point, the strength of may be weaker than straight edges as seem in Figure 13. This is because the local patches at a location x which is near a junction point, may observed another textures which is a bit further from the two immediate two texture edge near x and confuse the segmentation. In Figure 17, (b) shows this effect. This can be improved by a simple dilation-erosion operation from mathematical morphology [28]. By using line shaped structuring element, where the orientation of the line should be parallel to the existing edge direction, the edge function is improved as in Figure 17 (c) and (d).
(a) | (b) | (c) | (d) |
---|---|---|---|
Appendix C Periodic texture and its patch response
Extending from the analysis in Section 3, we numerically present the cases for periodic texture. For periodic texture, it is interesting to notice that the variance of expectation is also strongly correlated to the period of the texture. In Figure 18, we show the distribution of with varying patch width parameter . The variance is near zero whenever the patch width parameter matches the periods of the texture, while the general decreasing effect discussed in section 3 still exists. This is consistent with the work by Hong, et al. [12], where the authors observed that for a periodic image, some statistical distance measurement of the image patch vs the entire image vanishes whenever the patch width parameter is a multiple of the texture period. In another work [15], authors measured the scale of the texture by applying time varying Gaussian kernel to the image, and observe when the averaging process has big jump, and use it to measure the scale.
In this paper, we choose relatively small while kee** the patch response consistent and stable. For periodic texture, we can use these estimations to help find the scale of the texture.
(a) | (b) |
---|---|