Surrogate-based cross-correlation for particle image velocimetry

Yong Lee School of Mechanical and Electronic Engineering, Wuhan University of Technology (WHUT), Wuhan 430070, China    Fuqiang Gu College of Computer Science, Chongqing University, Chongqing 400044, China    Zeyu Gong State Key Laboratory of Intelligent Manufacturing Equipment and Technology, School of Mechanical Science and Engineering, Huazhong University of Science and Technology (HUST), Wuhan 430074, China    Ding Pan    Wenhui Zeng [email protected] School of Mechanical and Electronic Engineering, Wuhan University of Technology (WHUT), Wuhan 430070, China
(May 19, 2024)
Abstract

This paper presents a novel surrogate-based cross-correlation (SBCC) framework to improve the correlation performance for practical particle image velocimetry (PIV). The basic idea is that an optimized surrogate filter/image, replacing one raw image, will produce a more accurate and robust correlation signal. Specifically, the surrogate image is encouraged to generate perfect Gaussian-shaped correlation map to tracking particles (PIV image pair) while producing zero responses to image noise (context images). And the problem is formularized with an objective function composed of surrogate loss and consistency loss. As a result, the closed-form solution provides an efficient multivariate operator that could consider other negative context images. Compared with the state-of-the-art baseline methods (background subtraction, robust phase correlation, etc.), our SBCC method exhibits significant performance improvement (accuracy and robustness) on the synthetic dataset and several challenging experimental PIV cases. Besides, our implementation with experimental details (https://github.com/yongleex/SBCC) is also available for interested researchers.

preprint: AIP/123-QED

I Introduction

Particle Image Velocimetry (PIV) is a popular non-intrusive instrument for flow field measurement in experimental fluid dynamicsAdrian (1984); Raffel et al. (2018); Lee and Mei (2022). PIV generates quantitative vector field by analyzing the consecutive particle recordings. However, the particle images are easily deteriorated in a practical measurement due to non-uniform light illumination, light reflections, background noise sources, camera dark noise, etc Honkanen and Nobach (2005); Deen et al. (2010); Sciacchitano and Scarano (2014). Therefore, the accuracy and robustness of PIV results could be significantly decreased, recognized as uncertainty Sciacchitano (2019), peak-locking Michaelis, Neal, and Wieneke (2016) and/or outliers Wang et al. (2015); Lee, Yang, and Yin (2017a). Thus, in this work, we focus on the challenging PIV estimation problem caused by the deteriorated particle recordings.

Over the past 40 years, the mainstream velocity estimation methods — cross-correlation Willert and Gharib (1991); Scarano (2001); Wang, He, and Wang (2020); Zhu et al. (2022); Gao et al. (2021), optical flow (OF) Corpetti et al. (2006); Zhong, Yang, and Yin (2017); Lu et al. (2021) and deep neural-network (DNN) regression Lee, Yang, and Yin (2017b); Cai et al. (2019a); Lagemann et al. (2021); Cao et al. (2024)— are not particularly designed for robust PIV estimation. 1). The vanilla standard cross-correlation (SCC) computes the image similarity (dot product) as a function of the relative displacement. And SCC is not robust to image noise (such as, additive background noise, non-uniform illumination) because the noise is also correlated Eckstein and Vlachos (2009). Therefore, the generalized cross-correlation (GCC) methods improve signal-to-noise ratio of PIV cross-correlation via post-processing the correlation coefficients with different spectral filters, including phase correlation (PC) Horner and Gianino (1984), symmetric phase-only filter (SPOF) Wernet (2005), robust phase correlation (RPC) Eckstein and Vlachos (2009) to name a few. As a result, the GCC methods have achieved acceptable performance and have been extensively equipped for the majority of PIV software. 2). The optical flow Corpetti et al. (2006) methods employ a preservation principle, namely that particle image brightness attribute does not change after a movement, to estimate the particle displacement. The risk of failure rises if the brightness preservation principle breaks, which often occurs with image noise in practical measurements. Thus, replacing the brightness attribute with other robust attributes (image gradient, image phase) could be a straightforward modification Zhong, Yang, and Yin (2017). Besides, a deliberated OF model with improved regularization term also contributes to accurate PIV estimation Bao, Yang, and ** (2014); Lu et al. (2021). Meanwhile, the complex OF models often come with heavy computation cost. 3). As efficient inference methods, DNN-based regression methodsLee, Yang, and Yin (2017b); Cai et al. (2019a); Zhang and Piggott (2020); Yu et al. (2021); Lagemann et al. (2021); Yu et al. (2023) have been attracting researchers’ interest due to the powerful model capacity. However, the generalization of DNN for PIV depends on the noise type of training dataset Lagemann et al. (2022). Totally speaking, the CC, OF, DNN can be treated as bivariate operators that only take in two particle image frames, regardless of the concrete noise signal of a measurement. Herein, we focus on the CC methods employed by most practitioners Kähler et al. (2016); Liberzon et al. (2016); Xie, Wang, and Xu (2022).

A straightforward alternative to achieve robust PIV analysis is to directly improve the image quality Dellenback, Macharivilakathu, and Pierce (2000); Shavit, Lowe, and Steinbuck (2007); Lee et al. (2022); Fan et al. (2023); Zhao et al. (2024). Among different image pre-processing, the background subtraction performs well given a good reference background— concrete noise signal Mejia-Alvarez and Christensen (2013); Mendez et al. (2017); Kähler et al. (2016); Wang et al. (2020). The background image can either be recorded in the absence of seeding, or, if this is not possible, through temporal or spatial analysis from raw PIV recordings Raffel et al. (2018). The background image could be the minimum intensity image from double-frame PIV images, and works reliably for nonstational flow with severe background noise Honkanen and Nobach (2005); Deen et al. (2010). The varying background image could be extracted via a temporal Butterworth filter from a large number of raw PIV recordings Sciacchitano and Scarano (2014). A customized background can be adaptive reconstructed through proper orthogonal decomposition (POD) Mendez et al. (2017). Without extra temporal information, the spatial low-pass filter (LPF) utilizes the blur image to approximate a background, including Gaussian filter, median filter Adrian and Westerweel (2011), anisotropic diffusion Adatrao and Sciacchitano (2019), etc. Due to effectiveness, background subtraction has become an essential step of standard PIV pipeline (Fig. 1(a)). However, using one background to model the complex noise signal is still challenging.

Correlation filters have achieved competitive success in object tracking by learning a discriminative linear tracker from several image templates Bolme et al. (2010); Henriques et al. (2012, 2015). It generates an optimal filter/tracker that maximizes the convolution/tracking performance from multiple templates, as detailed in Section II. As a result of closed-form solution, correlation filter algorithm (minimum output sum of squared error, MOSSE Bolme et al. (2010)) is not only easy to implement but significant faster. The extensive experiments exhibit that the learned filter outperforms the original image templates in both robustness and accuracy.

Refer to caption
(a) Standard PIV pipeline with background subtraction
Refer to caption
(b) Our new PIV pipeline with SBCC
Figure 1: Different illustrative pipelines for noisy PIV images. \raisebox{-.7pt} {E}⃝: background/contexts extraction; symmetric-difference\ominus: background subtraction; tensor-product\bigotimes: cross-correlation. \raisebox{-.1pt} {$\star$}⃝: surrogate-based cross-correlation.

Our insight is that taking more negative context images (the concrete noise signal) into account will obtain a more robust tracker, and the MOSSE algorithm can be easily adapted to negative contexts with little effort. Different from object tracking video, the images of PIV pair are templates for each other Wereley and Meinhart (2001). As a result, a novel surrogate-based cross correlation (SBCC) framework is proposed by combining forward surrogate tracking with backward surrogate tracking. And SBCC—a multivariate operator— enables us to reform PIV pipeline for accurate and robust measurement, rather than pursuing a perfect clean image (Fig. 1(b)). The main contributions are:

  1. 1.

    Inspired by the MOSSE filter, the SBCC is a new robust PIV analysis tool that employs several negative contexts, via robust surrogates and bi-directional consistency.

  2. 2.

    Based on MOSSE objective and bi-directional consistency formula, a concise closed-form solution to the problem is obtained. To our surprise, a set of widely-used generalized cross-correlation methods are special cases of the closed-form solution of SBCC framework.

  3. 3.

    The improvement of SBCC has been extensively verified on both synthetic and real PIV images.

The rest of this paper is arranged as follows. The related works are given in Section II. Section III describes our SBCC method from the problem formulation to optimization. And Section IV demonstrates the performance on synthetic datasets and real PIV images in comparison with baseline methods. Finally, the Section V remarks the paper with several concluding comments.

II Related works

Refer to caption
(a) SCC
Refer to caption
(b) GCC
Figure 2: Different cross-correlation methods. The tensor-product\bigotimes denotes the standard cross-correlation in Fourier frequency domain (a). The generalized cross-correlation methods (b), the \bigodot represents an element-wise multiplication.

II.1 Cross-correlations

The cross-correlation response r(𝐱)𝑟𝐱r(\mathbf{x})italic_r ( bold_x ) of image pair f1(𝐱)subscript𝑓1𝐱f_{1}(\mathbf{x})italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( bold_x ), f2(𝐱)subscript𝑓2𝐱f_{2}(\mathbf{x})italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( bold_x ) indicates the image displacement. Due to the Convolution Theorem, the computation of cross-correlation r(𝐱)𝑟𝐱r(\mathbf{x})italic_r ( bold_x ) becomes a fast element-wise multiplication in the Fourier frequency domain. That says, R(ω)=F1(ω)F2(ω)𝑅𝜔subscript𝐹1𝜔subscriptsuperscript𝐹2𝜔R(\mathbf{\omega})=F_{1}(\mathbf{\omega})F^{*}_{2}(\mathbf{\omega})italic_R ( italic_ω ) = italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_ω ) italic_F start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_ω ), where R(ω),F1(ω),F2(ω)𝑅𝜔subscript𝐹1𝜔subscript𝐹2𝜔R(\omega),F_{1}(\omega),F_{2}(\omega)italic_R ( italic_ω ) , italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_ω ) , italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_ω ) are the Fourier transform of r(𝐱)𝑟𝐱r(\mathbf{x})italic_r ( bold_x ), f1(𝐱)subscript𝑓1𝐱f_{1}(\mathbf{x})italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( bold_x ), f2(𝐱)subscript𝑓2𝐱f_{2}(\mathbf{x})italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( bold_x ) and the superscript ()(^{*})( start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) denotes complex conjugation. Hereafter, we will simplify the notations (F1(ω),F2(ω),subscript𝐹1𝜔subscript𝐹2𝜔F_{1}(\omega),F_{2}(\omega),...italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_ω ) , italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_ω ) , …) as (F1,F2,subscript𝐹1subscript𝐹2F_{1},F_{2},...italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , …) by omitting the frequency ω𝜔\omegaitalic_ω. Due to its wide adoption in PIV estimation, this vanilla cross-correlation method will be referred to as the standard cross-correlation (SCC) method, as shown in Fig. 2 (a).

RSCC=F1F2subscript𝑅𝑆𝐶𝐶subscript𝐹1subscriptsuperscript𝐹2R_{SCC}=F_{1}F^{*}_{2}italic_R start_POSTSUBSCRIPT italic_S italic_C italic_C end_POSTSUBSCRIPT = italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT (1)

To enhance the correlation signal, the generalized cross-correlation (GCC) methods amend the SCC correlation RSCCsubscript𝑅𝑆𝐶𝐶R_{SCC}italic_R start_POSTSUBSCRIPT italic_S italic_C italic_C end_POSTSUBSCRIPT with different spectral filters, i.e.,

RGCC=ψ(F1,F2)F1F2subscript𝑅𝐺𝐶𝐶𝜓subscript𝐹1subscript𝐹2subscript𝐹1superscriptsubscript𝐹2R_{GCC}=\psi(F_{1},F_{2})F_{1}F_{2}^{*}italic_R start_POSTSUBSCRIPT italic_G italic_C italic_C end_POSTSUBSCRIPT = italic_ψ ( italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT (2)

where ψ()𝜓\psi(\cdot)italic_ψ ( ⋅ ) denotes the modification operation (PHAT filter Horner and Gianino (1984), SPOF filter Wernet (2005), RPC filter Eckstein and Vlachos (2009), etc.). Compared with the RPREsubscript𝑅𝑃𝑅𝐸R_{PRE}italic_R start_POSTSUBSCRIPT italic_P italic_R italic_E end_POSTSUBSCRIPT (image pre-processing), the RGCCsubscript𝑅𝐺𝐶𝐶R_{GCC}italic_R start_POSTSUBSCRIPT italic_G italic_C italic_C end_POSTSUBSCRIPT can be viewed as a post-processing of F1F2subscript𝐹1superscriptsubscript𝐹2F_{1}F_{2}^{*}italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. Several GCC instances are listed in Table. 1. The filters of these GCC methods share a special type, ψ(F1,F2)=ϕ(|F1F2|)𝜓subscript𝐹1subscript𝐹2italic-ϕsubscript𝐹1superscriptsubscript𝐹2\psi(F_{1},F_{2})=\phi(|F_{1}F_{2}^{*}|)italic_ψ ( italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) = italic_ϕ ( | italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT | ), as demonstrated in Fig. 2 (b).

Table 1: Different cross-correlation methods.
Methods Equations Comments
SCCRaffel et al. (2018) RSCC=F1F2subscript𝑅𝑆𝐶𝐶subscript𝐹1superscriptsubscript𝐹2R_{SCC}=F_{1}F_{2}^{*}italic_R start_POSTSUBSCRIPT italic_S italic_C italic_C end_POSTSUBSCRIPT = italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT -
Pre-processing RPRE=(HF1)(HF2)subscript𝑅𝑃𝑅𝐸𝐻subscript𝐹1superscript𝐻subscript𝐹2R_{PRE}=(HF_{1})(HF_{2})^{*}italic_R start_POSTSUBSCRIPT italic_P italic_R italic_E end_POSTSUBSCRIPT = ( italic_H italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ( italic_H italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT Filter H𝐻Hitalic_H
Background subs RBGS=(F1B)(F2B)subscript𝑅𝐵𝐺𝑆subscript𝐹1𝐵superscriptsubscript𝐹2𝐵R_{BGS}=(F_{1}-B)(F_{2}-B)^{*}italic_R start_POSTSUBSCRIPT italic_B italic_G italic_S end_POSTSUBSCRIPT = ( italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_B ) ( italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_B ) start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT Background B𝐵Bitalic_B
PCHorner and Gianino (1984) RPC=F1F2|F1F2|subscript𝑅𝑃𝐶subscript𝐹1superscriptsubscript𝐹2subscript𝐹1superscriptsubscript𝐹2R_{PC}=\frac{F_{1}F_{2}^{*}}{|F_{1}F_{2}^{*}|}italic_R start_POSTSUBSCRIPT italic_P italic_C end_POSTSUBSCRIPT = divide start_ARG italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG | italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT | end_ARG GCC method
SPOFWernet (2005) RSPOF=F1F2|F1F2|subscript𝑅𝑆𝑃𝑂𝐹subscript𝐹1superscriptsubscript𝐹2subscript𝐹1superscriptsubscript𝐹2R_{SPOF}=\frac{F_{1}F_{2}^{*}}{\sqrt{|F_{1}F_{2}^{*}|}}italic_R start_POSTSUBSCRIPT italic_S italic_P italic_O italic_F end_POSTSUBSCRIPT = divide start_ARG italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG square-root start_ARG | italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT | end_ARG end_ARG GCC method
RPCEckstein, Charonko, and Vlachos (2008); Eckstein and Vlachos (2009) RRPC=GF1F2|F1F2|subscript𝑅𝑅𝑃𝐶𝐺subscript𝐹1superscriptsubscript𝐹2subscript𝐹1superscriptsubscript𝐹2R_{RPC}=\frac{GF_{1}F_{2}^{*}}{|F_{1}F_{2}^{*}|}italic_R start_POSTSUBSCRIPT italic_R italic_P italic_C end_POSTSUBSCRIPT = divide start_ARG italic_G italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG | italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT | end_ARG GCC method
ρ𝜌\rhoitalic_ρ-CSPC Shen and Liu (2009) RCSPC=F1F2|F1F2|ρ+ϵsubscript𝑅𝐶𝑆𝑃𝐶subscript𝐹1superscriptsubscript𝐹2superscriptsubscript𝐹1superscriptsubscript𝐹2𝜌italic-ϵR_{CSPC}=\frac{F_{1}F_{2}^{*}}{|F_{1}F_{2}^{*}|^{\rho}+\epsilon}italic_R start_POSTSUBSCRIPT italic_C italic_S italic_P italic_C end_POSTSUBSCRIPT = divide start_ARG italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG | italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT italic_ρ end_POSTSUPERSCRIPT + italic_ϵ end_ARG GCC method
SBCC (ours) RSBCC=2GF1F2F1F1+F2F2+2μΣPiPisubscript𝑅𝑆𝐵𝐶𝐶2𝐺subscript𝐹1superscriptsubscript𝐹2subscript𝐹1superscriptsubscript𝐹1subscript𝐹2superscriptsubscript𝐹22𝜇Σsubscript𝑃𝑖superscriptsubscript𝑃𝑖R_{SBCC}=\frac{2GF_{1}F_{2}^{*}}{F_{1}F_{1}^{*}+F_{2}F_{2}^{*}+2\mu\Sigma P_{i% }P_{i}^{*}}italic_R start_POSTSUBSCRIPT italic_S italic_B italic_C italic_C end_POSTSUBSCRIPT = divide start_ARG 2 italic_G italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT + italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT + 2 italic_μ roman_Σ italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG Contexts Pisubscript𝑃𝑖P_{i}italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT

II.2 Correlation filter

The correlation filter can be derived either from an objective function specifically formularized in the Fourier domain Bolme et al. (2010) or from ridge regression and circulant matrices Henriques et al. (2012). Slightly different from Bolme et al. (2010); Henriques et al. (2012), we provide our understanding of correlation filter as following. Given a set of aligned template images Ti,i{1,2,,n}subscript𝑇𝑖𝑖12𝑛T_{i},i\in\{1,2,...,n\}italic_T start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_i ∈ { 1 , 2 , … , italic_n }, the MOSSE method Bolme et al. (2010); Henriques et al. (2012) finds a surrogate filter/tracker S𝑆Sitalic_S that produces the best tracking performance, i.e., the cross-correlation response ri(x)=1(TiS)subscript𝑟𝑖xsuperscript1subscript𝑇𝑖superscript𝑆r_{i}(\textbf{x})=\mathcal{F}^{-1}(T_{i}S^{*})italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( x ) = caligraphic_F start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_S start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) is encouraged to be an isotropic Gaussian-shaped response g(𝐱)𝑔𝐱g(\mathbf{x})italic_g ( bold_x ), where 1()superscript1\mathcal{F}^{-1}(\cdot)caligraphic_F start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( ⋅ ) denotes inverse fourier transform. In addition, a regularization term |S|2superscript𝑆2|S|^{2}| italic_S | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT is employed to avoid the over fitting and gains the stability, similar to Wiener filtering or ridge regression Henriques et al. (2012). Hence, the minimum output sum of squared error (MOSSE) objective arrives,

JMOSSE(S)=Σi=1n|GTiS|2+μ|S|2subscript𝐽𝑀𝑂𝑆𝑆𝐸𝑆superscriptsubscriptΣ𝑖1𝑛superscript𝐺subscript𝑇𝑖superscript𝑆2𝜇superscript𝑆2\begin{split}{J}_{MOSSE}(S)=\mathop{\Sigma}_{i=1}^{n}|G-T_{i}S^{*}|^{2}+\mu|S|% ^{2}\end{split}start_ROW start_CELL italic_J start_POSTSUBSCRIPT italic_M italic_O italic_S italic_S italic_E end_POSTSUBSCRIPT ( italic_S ) = roman_Σ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT | italic_G - italic_T start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_S start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_μ | italic_S | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW (3)

where μ𝜇\muitalic_μ controls the amount of regularization, and it is recommended to be 0.10.10.10.1 Bolme et al. (2010). The n𝑛nitalic_n is the number of positive templates, and G𝐺Gitalic_G denotes the Fourier transform of a Gaussian function g(𝐱)𝑔𝐱g(\mathbf{x})italic_g ( bold_x ). A closed-form solution of MOSSE,

S^=ΣiGTiΣiTiTi+μsuperscript^𝑆subscriptΣ𝑖𝐺superscriptsubscript𝑇𝑖subscriptΣ𝑖subscript𝑇𝑖superscriptsubscript𝑇𝑖𝜇\hat{S}^{*}=\frac{\Sigma_{i}GT_{i}^{*}}{\Sigma_{i}T_{i}T_{i}^{*}+\mu}over^ start_ARG italic_S end_ARG start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = divide start_ARG roman_Σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_G italic_T start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG roman_Σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT + italic_μ end_ARG (4)

The μ𝜇\muitalic_μ plays an important role to make the denominator not equal to zero. Interestingly, the numerator is the correlation between the input and the desired output and the denominator is the energy spectrum of the input.

Our observation is that the regularization term of Eq.(3), |S|2=|01S|2superscript𝑆2superscript01superscript𝑆2|S|^{2}=|0-1\cdot S^{*}|^{2}| italic_S | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = | 0 - 1 ⋅ italic_S start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, could be regarded as a special term for a negative template (delta function, 1δ(𝐱)1𝛿𝐱1\Leftrightarrow\delta(\mathbf{x})1 ⇔ italic_δ ( bold_x )). That is to say, the MOSSE filter also expects the cross-correlation between S𝑆Sitalic_S and a negative Dirac delta function to be zero. Now, it is clear that the parameter μ𝜇\muitalic_μ controls the relative importance for this negative template response.

III Surrogate-based cross-correlation

III.1 Problem formulation

As mentioned in Section I, the standard PIV pipeline might fail due to one limited background image. We thus introduce the surrogate-based cross-correlation which utilizes multiple context background images to enhance particles correlation. Specifically, a surrogate filter/image (S1subscript𝑆1S_{1}italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT or S2subscript𝑆2S_{2}italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT) is assumed to have a better cross-correlation response under a well-designed surrogate objective Jsurrsubscript𝐽𝑠𝑢𝑟𝑟J_{surr}italic_J start_POSTSUBSCRIPT italic_s italic_u italic_r italic_r end_POSTSUBSCRIPT, which considers the tracking performance as well as robustness. Meanwhile, similar to ensemble correlation, the forward correlation response Rfsubscript𝑅𝑓R_{f}italic_R start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT (with surrogate S1subscript𝑆1S_{1}italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT) and backward response Rbsubscript𝑅𝑏R_{b}italic_R start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT (with surrogate S2subscript𝑆2S_{2}italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT) are combined via correlation consistency objective Jcorrsubscript𝐽𝑐𝑜𝑟𝑟{J}_{corr}italic_J start_POSTSUBSCRIPT italic_c italic_o italic_r italic_r end_POSTSUBSCRIPT. Thus, the robust PIV estimation problem is formulated with two objectives, as illustrated in Fig. 3,

Refer to caption
Figure 3: The bi-directional surrogate model, SBCC. The surrogate images S1,S2subscript𝑆1subscript𝑆2S_{1},S_{2}italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and correlation response R𝑅Ritalic_R are jointly optimized with surrogate objective Jsurrsubscript𝐽𝑠𝑢𝑟𝑟{J}_{surr}italic_J start_POSTSUBSCRIPT italic_s italic_u italic_r italic_r end_POSTSUBSCRIPT and correlation consistency objective Jcorrsubscript𝐽𝑐𝑜𝑟𝑟{J}_{corr}italic_J start_POSTSUBSCRIPT italic_c italic_o italic_r italic_r end_POSTSUBSCRIPT (blue dash arrows). The Rfsubscript𝑅𝑓R_{f}italic_R start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT and Rbsubscript𝑅𝑏R_{b}italic_R start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT represent the correlation with forward surrogate (S1subscript𝑆1S_{1}italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT) and backward surrogate (S2subscript𝑆2S_{2}italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT) respectively.
R^,S^1,S^2=argminR,S1,S2JSBCC(R,S1,S2;F1,F2)^𝑅subscript^𝑆1subscript^𝑆2subscript𝑅subscript𝑆1subscript𝑆2subscript𝐽𝑆𝐵𝐶𝐶𝑅subscript𝑆1subscript𝑆2subscript𝐹1subscript𝐹2\hat{R},\hat{S}_{1},\hat{S}_{2}=\mathop{\arg\min}_{R,S_{1},S_{2}}{J}_{SBCC}(R,% S_{1},S_{2};F_{1},F_{2})over^ start_ARG italic_R end_ARG , over^ start_ARG italic_S end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , over^ start_ARG italic_S end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = start_BIGOP roman_arg roman_min end_BIGOP start_POSTSUBSCRIPT italic_R , italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_S italic_B italic_C italic_C end_POSTSUBSCRIPT ( italic_R , italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ; italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) (5)

with

JSBCC(R,S1,S2;F1,F2)=Jsurr(S1;F1)+Jsurr(S2;F2)surrogate objective+Jcorr(Rf,R)+Jcorr(Rb,R)correlation consistency objectivesubscript𝐽𝑆𝐵𝐶𝐶𝑅subscript𝑆1subscript𝑆2subscript𝐹1subscript𝐹2subscriptsubscript𝐽𝑠𝑢𝑟𝑟subscript𝑆1subscript𝐹1subscript𝐽𝑠𝑢𝑟𝑟subscript𝑆2subscript𝐹2surrogate objectivesubscriptsubscript𝐽𝑐𝑜𝑟𝑟subscript𝑅𝑓𝑅subscript𝐽𝑐𝑜𝑟𝑟subscript𝑅𝑏𝑅correlation consistency objective\begin{split}{J}_{SBCC}&(R,S_{1},S_{2};F_{1},F_{2})\\ &=\underbrace{{J}_{surr}(S_{1};F_{1})+{J}_{surr}(S_{2};F_{2})}_{\textrm{% surrogate objective}}\\ &\quad+\underbrace{{J}_{corr}(R_{f},R)+{J}_{corr}(R_{b},R)}_{\textrm{% correlation consistency objective}}\end{split}start_ROW start_CELL italic_J start_POSTSUBSCRIPT italic_S italic_B italic_C italic_C end_POSTSUBSCRIPT end_CELL start_CELL ( italic_R , italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ; italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = under⏟ start_ARG italic_J start_POSTSUBSCRIPT italic_s italic_u italic_r italic_r end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ; italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) + italic_J start_POSTSUBSCRIPT italic_s italic_u italic_r italic_r end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ; italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG start_POSTSUBSCRIPT surrogate objective end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + under⏟ start_ARG italic_J start_POSTSUBSCRIPT italic_c italic_o italic_r italic_r end_POSTSUBSCRIPT ( italic_R start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_R ) + italic_J start_POSTSUBSCRIPT italic_c italic_o italic_r italic_r end_POSTSUBSCRIPT ( italic_R start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT , italic_R ) end_ARG start_POSTSUBSCRIPT correlation consistency objective end_POSTSUBSCRIPT end_CELL end_ROW (6)

where Rf=S1F2,Rb=F1S2formulae-sequencesubscript𝑅𝑓subscript𝑆1superscriptsubscript𝐹2subscript𝑅𝑏subscript𝐹1superscriptsubscript𝑆2R_{f}=S_{1}F_{2}^{*},R_{b}=F_{1}S_{2}^{*}italic_R start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT = italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_R start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT = italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT are the forward correlation response and backward correlation response. Note that the surrogates S1,S2subscript𝑆1subscript𝑆2S_{1},S_{2}italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT are no longer the processed results of image pre-processing due to the coupled SBCC structure.

Surrogate objective. To gain robustness of surrogate filter, a well-designed surrogate objective is thus constructed, which makes use of the positive template and other negative context images. The negative samples are proved to be useful for representation learning Chen, Lee, and Soh (2021). Similar to MOSSE Bolme et al. (2010), our surrogate objective Jsurrsubscript𝐽𝑠𝑢𝑟𝑟{J}_{surr}italic_J start_POSTSUBSCRIPT italic_s italic_u italic_r italic_r end_POSTSUBSCRIPT (Fig. 4) is given before a detail explanation,

Refer to caption
Figure 4: The motivation of surrogate objective is to find a surrogate S1subscript𝑆1S_{1}italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT that does not response to the negative context images P1,P2,,Pmsubscript𝑃1subscript𝑃2subscript𝑃𝑚P_{1},P_{2},...,P_{m}italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_P start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT.
Jsurr(S;F)=|GFS|2MOSSE term+μ1mΣi=1m|0PiS|2negative context term=|GFS|2+μ1mΣi=1m|PiS|2subscript𝐽𝑠𝑢𝑟𝑟𝑆𝐹subscriptsuperscript𝐺𝐹superscript𝑆2MOSSE termsubscript𝜇1𝑚superscriptsubscriptΣ𝑖1𝑚superscript0subscript𝑃𝑖superscript𝑆2negative context termsuperscript𝐺𝐹superscript𝑆2𝜇1𝑚superscriptsubscriptΣ𝑖1𝑚superscriptsubscript𝑃𝑖superscript𝑆2\begin{split}{J}_{surr}(S;F)&=\underbrace{|G-FS^{*}|^{2}}_{\textrm{MOSSE term}% }+\underbrace{\mu\frac{1}{m}\mathop{\Sigma}_{i=1}^{m}|0-P_{i}S^{*}|^{2}}_{% \textrm{negative context term}}\\ &=|G-FS^{*}|^{2}+\mu\frac{1}{m}\mathop{\Sigma}_{i=1}^{m}|P_{i}S^{*}|^{2}\\ \end{split}start_ROW start_CELL italic_J start_POSTSUBSCRIPT italic_s italic_u italic_r italic_r end_POSTSUBSCRIPT ( italic_S ; italic_F ) end_CELL start_CELL = under⏟ start_ARG | italic_G - italic_F italic_S start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_POSTSUBSCRIPT MOSSE term end_POSTSUBSCRIPT + under⏟ start_ARG italic_μ divide start_ARG 1 end_ARG start_ARG italic_m end_ARG roman_Σ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT | 0 - italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_S start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_POSTSUBSCRIPT negative context term end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = | italic_G - italic_F italic_S start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_μ divide start_ARG 1 end_ARG start_ARG italic_m end_ARG roman_Σ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT | italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_S start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW (7)

where Pi,i{1,2,,m}subscript𝑃𝑖𝑖12𝑚P_{i},i\in\{1,2,...,m\}italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_i ∈ { 1 , 2 , … , italic_m } are the negative context images. The G𝐺Gitalic_G denotes the Fourier transform of Gaussian functions g(𝐱)𝑔𝐱g(\mathbf{x})italic_g ( bold_x ). The μ𝜇\muitalic_μ is also the coefficient for the negative context term. The MOSSE term is preserved to encourage a Gaussian-shaped correlation map. Reducing the filter’s response to the background/noise could decrease the number of outliers, because most outliers occur when the images have similar image background or other noisy pattern. Recall that the regularization term |S|2superscript𝑆2|S|^{2}| italic_S | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT of MOSSE, it encourages the surrogate filter S𝑆Sitalic_S produce zero response to the special δ(𝐱)𝛿𝐱\delta(\mathbf{x})italic_δ ( bold_x ). However, the negative context term of Eq. (7) encourages the filter S𝑆Sitalic_S produce zero response to all context images. We choose the backgrounds from temporal minimum value (MIN bg) and spatial low-pass results (LPF bg) as the context images in this work. Obviously, other options are also supported. Compared to the δ(𝐱)𝛿𝐱\delta(\mathbf{x})italic_δ ( bold_x ), the context images are more likely to have a similar noisy pattern with PIV test images.

Correlation consistency objective. Different from object tracking, the paired images of PIV can be treated as templates to each other. Our SBCC framework (Fig. 3) models it as forward and backward correlation. To obtain a consistency result, a square error encourages a minimum distance between Rx{Rb,Rf}subscript𝑅𝑥subscript𝑅𝑏subscript𝑅𝑓R_{x}\in\{R_{b},R_{f}\}italic_R start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ∈ { italic_R start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT , italic_R start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT } to the final correlation map R𝑅Ritalic_R.

Jcorr(Rx,R)=|RRx|2subscript𝐽𝑐𝑜𝑟𝑟subscript𝑅𝑥𝑅superscript𝑅subscript𝑅𝑥2\begin{split}{J}_{corr}(R_{x},R)&=|R-R_{x}|^{2}\end{split}start_ROW start_CELL italic_J start_POSTSUBSCRIPT italic_c italic_o italic_r italic_r end_POSTSUBSCRIPT ( italic_R start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT , italic_R ) end_CELL start_CELL = | italic_R - italic_R start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW (8)

Recall Rf=S1F2,Rb=F1S2formulae-sequencesubscript𝑅𝑓subscript𝑆1superscriptsubscript𝐹2subscript𝑅𝑏subscript𝐹1superscriptsubscript𝑆2R_{f}=S_{1}F_{2}^{*},R_{b}=F_{1}S_{2}^{*}italic_R start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT = italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_R start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT = italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT are the forward correlation response and backward correlation response, and R𝑅Ritalic_R is the final cross-correlation response of SBCC.

Take the surrogate objective (Eq. (7)) and correlation consistency objective (Eq. (8)) back into the problem (Eq. (6)). The specific objective function of SBCC is arrived,

JSBCC(R,S1,S2;F1,F2)=|GF1S1|2+μ1mΣi=1m|PiS1|2+|GF2S2|2+μ1mΣi=1m|PiS2|2+|RS1F2|2+|RF1S2|2subscript𝐽𝑆𝐵𝐶𝐶𝑅subscript𝑆1subscript𝑆2subscript𝐹1subscript𝐹2superscript𝐺subscript𝐹1superscriptsubscript𝑆12𝜇1𝑚superscriptsubscriptΣ𝑖1𝑚superscriptsubscript𝑃𝑖superscriptsubscript𝑆12superscript𝐺subscript𝐹2superscriptsubscript𝑆22𝜇1𝑚superscriptsubscriptΣ𝑖1𝑚superscriptsubscript𝑃𝑖superscriptsubscript𝑆22superscript𝑅subscript𝑆1superscriptsubscript𝐹22superscript𝑅subscript𝐹1superscriptsubscript𝑆22\begin{split}{J}_{SBCC}&(R,S_{1},S_{2};F_{1},F_{2})\\ &=|G-F_{1}S_{1}^{*}|^{2}+\mu\frac{1}{m}\mathop{\Sigma}_{i=1}^{m}|P_{i}S_{1}^{*% }|^{2}\\ &\quad+|G-F_{2}S_{2}^{*}|^{2}+\mu\frac{1}{m}\mathop{\Sigma}_{i=1}^{m}|P_{i}S_{% 2}^{*}|^{2}\\ &\quad+|R-S_{1}F_{2}^{*}|^{2}+|R-F_{1}S_{2}^{*}|^{2}\\ \end{split}start_ROW start_CELL italic_J start_POSTSUBSCRIPT italic_S italic_B italic_C italic_C end_POSTSUBSCRIPT end_CELL start_CELL ( italic_R , italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ; italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = | italic_G - italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_μ divide start_ARG 1 end_ARG start_ARG italic_m end_ARG roman_Σ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT | italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + | italic_G - italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_μ divide start_ARG 1 end_ARG start_ARG italic_m end_ARG roman_Σ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT | italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + | italic_R - italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + | italic_R - italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW (9)

This objective is the sum of several squared errors with three unknown complex variables S1,S2,Rsubscript𝑆1subscript𝑆2𝑅S_{1},S_{2},Ritalic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_S start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_R. The parameter μ𝜇\muitalic_μ controls the relative importance of the negative context images.

III.2 Optimization of SBCC objective

The optimization of JSBCCsubscript𝐽𝑆𝐵𝐶𝐶{J}_{SBCC}italic_J start_POSTSUBSCRIPT italic_S italic_B italic_C italic_C end_POSTSUBSCRIPT (Eq. 9) is almost identical to the optimization problems in Bolme et al. (2010); Henriques et al. (2012). The difference is that SBCC objective is a quadratic convex function with three complex variables. The closed-form solution thus can be found by setting the partials to zeroes,

RSBCC:=R^=2GF1F2F1F1+F2F2+2μQS^1=F2R^+GF1F1F1+F2F2+μQS^2=F1R^+GF2F1F1+F2F2+μQassignsubscript𝑅𝑆𝐵𝐶𝐶^𝑅2𝐺subscript𝐹1superscriptsubscript𝐹2subscript𝐹1superscriptsubscript𝐹1subscript𝐹2superscriptsubscript𝐹22𝜇𝑄subscript^𝑆1subscript𝐹2^𝑅𝐺subscript𝐹1subscript𝐹1superscriptsubscript𝐹1subscript𝐹2superscriptsubscript𝐹2𝜇𝑄superscriptsubscript^𝑆2superscriptsubscript𝐹1^𝑅𝐺superscriptsubscript𝐹2subscript𝐹1superscriptsubscript𝐹1subscript𝐹2superscriptsubscript𝐹2𝜇𝑄\begin{split}R_{SBCC}:=\hat{R}&=\frac{2GF_{1}F_{2}^{*}}{F_{1}F_{1}^{*}+F_{2}F_% {2}^{*}+2\mu Q}\\ \hat{S}_{1}&=\frac{F_{2}\hat{R}+GF_{1}}{F_{1}F_{1}^{*}+F_{2}F_{2}^{*}+\mu Q}\\ \hat{S}_{2}^{*}&=\frac{F_{1}^{*}\hat{R}+GF_{2}^{*}}{F_{1}F_{1}^{*}+F_{2}F_{2}^% {*}+\mu Q}\\ \end{split}start_ROW start_CELL italic_R start_POSTSUBSCRIPT italic_S italic_B italic_C italic_C end_POSTSUBSCRIPT := over^ start_ARG italic_R end_ARG end_CELL start_CELL = divide start_ARG 2 italic_G italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT + italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT + 2 italic_μ italic_Q end_ARG end_CELL end_ROW start_ROW start_CELL over^ start_ARG italic_S end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL start_CELL = divide start_ARG italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG + italic_G italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT + italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT + italic_μ italic_Q end_ARG end_CELL end_ROW start_ROW start_CELL over^ start_ARG italic_S end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_CELL start_CELL = divide start_ARG italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT over^ start_ARG italic_R end_ARG + italic_G italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT + italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT + italic_μ italic_Q end_ARG end_CELL end_ROW (10)

where Q=1mΣi=1mPiPi𝑄1𝑚superscriptsubscriptΣ𝑖1𝑚subscript𝑃𝑖superscriptsubscript𝑃𝑖Q=\frac{1}{m}\Sigma_{i=1}^{m}P_{i}P_{i}^{*}italic_Q = divide start_ARG 1 end_ARG start_ARG italic_m end_ARG roman_Σ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is the average Fourier power spectrum of negative context images. The cross-correlation response, RSBCCsubscript𝑅𝑆𝐵𝐶𝐶R_{SBCC}italic_R start_POSTSUBSCRIPT italic_S italic_B italic_C italic_C end_POSTSUBSCRIPT, incorporates this Q𝑄Qitalic_Q component to obtain a robustness correlation by considering the noisy background in these negative context images. The terms in RSBCC(R_{SBCC}(italic_R start_POSTSUBSCRIPT italic_S italic_B italic_C italic_C end_POSTSUBSCRIPT (Eq. (10)) have clear interpretation. The numerator is the cross-correlation between F1subscript𝐹1F_{1}italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and F2subscript𝐹2F_{2}italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT with a Gaussian filter (G𝐺Gitalic_G), and the denominator is the power spectrum sum of F1subscript𝐹1F_{1}italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, F2subscript𝐹2F_{2}italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, and negative context images Pisubscript𝑃𝑖P_{i}italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Note that the S^1subscript^𝑆1\hat{S}_{1}over^ start_ARG italic_S end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT depends on F1subscript𝐹1F_{1}italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT as well as F2subscript𝐹2F_{2}italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT.

III.3 Short discussion

Due to the clear meaning of the objective function JSBCCsubscript𝐽𝑆𝐵𝐶𝐶{J}_{SBCC}italic_J start_POSTSUBSCRIPT italic_S italic_B italic_C italic_C end_POSTSUBSCRIPT, the solution RSBCCsubscript𝑅𝑆𝐵𝐶𝐶R_{SBCC}italic_R start_POSTSUBSCRIPT italic_S italic_B italic_C italic_C end_POSTSUBSCRIPT has good interpretation as well. Observing the SBCC solution, we found that a set of hand-crafted GCC methods are special cases of SBCC. That is to say, our SBCC framework provides a new perspective to understand existing GCC methods (Table. 1).

Firstly, we consider a simple but useful equation,

F1F1+F2F2=(|F1||F2|)2+2|F1F2|2|F1F2|subscript𝐹1superscriptsubscript𝐹1subscript𝐹2superscriptsubscript𝐹2superscriptsubscript𝐹1subscript𝐹222subscript𝐹1superscriptsubscript𝐹22subscript𝐹1superscriptsubscript𝐹2F_{1}F_{1}^{*}+F_{2}F_{2}^{*}=(|F_{1}|-|F_{2}|)^{2}+2|F_{1}F_{2}^{*}|\geq 2|F_% {1}F_{2}^{*}|italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT + italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = ( | italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | - | italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 2 | italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT | ≥ 2 | italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT | (11)

Thus, the mathematical representation of PC, RPC, 1-CSPC become special SBCC cases with an impractical noise-free assumption (|F1|=|F2|subscript𝐹1subscript𝐹2|F_{1}|=|F_{2}|| italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | = | italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT |). That says,

RPC=RSBCC|G=1,|F1|=|F2|,μ=0R1CSPC=RSBCC|G=1,|F1|=|F2|,μ=ϵ,Pi=1RRPC=RSBCC|Gg(x),|F1|=|F2|,μ=0subscript𝑅𝑃𝐶evaluated-atsubscript𝑅𝑆𝐵𝐶𝐶formulae-sequence𝐺1formulae-sequencesubscript𝐹1subscript𝐹2𝜇0subscript𝑅1𝐶𝑆𝑃𝐶evaluated-atsubscript𝑅𝑆𝐵𝐶𝐶formulae-sequence𝐺1formulae-sequencesubscript𝐹1subscript𝐹2formulae-sequence𝜇italic-ϵsubscript𝑃𝑖1subscript𝑅𝑅𝑃𝐶evaluated-atsubscript𝑅𝑆𝐵𝐶𝐶𝐺𝑔𝑥subscript𝐹1subscript𝐹2𝜇0\begin{split}R_{PC}\ \ &=R_{SBCC}|_{G=1,|F_{1}|=|F_{2}|,\mu=0}\\ R_{1-CSPC}&=R_{SBCC}|_{G=1,|F_{1}|=|F_{2}|,\mu=\epsilon,P_{i}=1}\\ R_{RPC}&=R_{SBCC}|_{G\Leftrightarrow g(x),|F_{1}|=|F_{2}|,\mu=0}\\ \end{split}start_ROW start_CELL italic_R start_POSTSUBSCRIPT italic_P italic_C end_POSTSUBSCRIPT end_CELL start_CELL = italic_R start_POSTSUBSCRIPT italic_S italic_B italic_C italic_C end_POSTSUBSCRIPT | start_POSTSUBSCRIPT italic_G = 1 , | italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | = | italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | , italic_μ = 0 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL italic_R start_POSTSUBSCRIPT 1 - italic_C italic_S italic_P italic_C end_POSTSUBSCRIPT end_CELL start_CELL = italic_R start_POSTSUBSCRIPT italic_S italic_B italic_C italic_C end_POSTSUBSCRIPT | start_POSTSUBSCRIPT italic_G = 1 , | italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | = | italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | , italic_μ = italic_ϵ , italic_P start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL italic_R start_POSTSUBSCRIPT italic_R italic_P italic_C end_POSTSUBSCRIPT end_CELL start_CELL = italic_R start_POSTSUBSCRIPT italic_S italic_B italic_C italic_C end_POSTSUBSCRIPT | start_POSTSUBSCRIPT italic_G ⇔ italic_g ( italic_x ) , | italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | = | italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | , italic_μ = 0 end_POSTSUBSCRIPT end_CELL end_ROW (12)

which means that, PC and 1-CSPC encourage a Dirac delta response (G=1)𝐺1(G=1)( italic_G = 1 ), while RPC method expects a Gaussian-shaped response. Only 1-CSPC method has considered a special context image δ(𝐱)𝛿𝐱\delta(\mathbf{x})italic_δ ( bold_x ) implicitly. However, all methods employ a noise-free assumption (|F1|=|F2|subscript𝐹1subscript𝐹2|F_{1}|=|F_{2}|| italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | = | italic_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT |).

The SPOF is beyond a brief understanding. However, the SPOF can be treated as an ensemble correlation Delnoij et al. (1999) due to the following relationship, RSPOFRSPOF=RSCCRPCsubscript𝑅𝑆𝑃𝑂𝐹subscript𝑅𝑆𝑃𝑂𝐹subscript𝑅𝑆𝐶𝐶subscript𝑅𝑃𝐶R_{SPOF}R_{SPOF}=R_{SCC}R_{PC}italic_R start_POSTSUBSCRIPT italic_S italic_P italic_O italic_F end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_S italic_P italic_O italic_F end_POSTSUBSCRIPT = italic_R start_POSTSUBSCRIPT italic_S italic_C italic_C end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_P italic_C end_POSTSUBSCRIPT. It might imply that multiple SBCC frameworks can provide more complex surrogate for SPOF and general ρlimit-from𝜌\rho-italic_ρ -CSPC methods. Note that none of the existing GCC methods explicitly take any negative context images into consideration.

IV Experiments

In this section, the performance of SBCC is extensively evaluated through visualizing the correlation map, analysing parameter sensitivity, conducting measurement on synthetic images and experimental PIV images. The detailed implementation and additional results are provided at the project repository, https://github.com/yongleex/SBCC.

Baseline methods. Several widely-accepted approaches are adopted to conduct a fair evaluation. And they are, standard cross-correlation (SCC) Raffel et al. (2018), symmetric phase-only filter (SPOF) Wernet (2005), robust phase correlation (RPC) Eckstein and Vlachos (2009). Regarding the background subtraction methods, we choose the minimum intensity image from double-frame PIV image Honkanen and Nobach (2005) and the spatial low-pass filter (LPF) Adrian and Westerweel (2011), resulting in SCC-MIN and SCC-LPF. To exclude the influence of other factors, single-pass cross-correlation without any post-processing is utilized for all testing methods.

Evaluation criteria. In addition to subjective visual judgement, three objective metrics are also employed to quantify the performance: 1) the root mean-square-error (RMSE) Raffel et al. (2018); Lee, Yang, and Yin (2017b); Lee and Mei (2022), 2) the average endpoint error (AEE) Lagemann et al. (2021), 3) the execution time for different image size.

RMSE=1NΣi=1N𝐯e,i𝐯t,i2AEE=1NΣi=1N𝐯e,i𝐯t,i𝑅𝑀𝑆𝐸1𝑁superscriptsubscriptΣ𝑖1𝑁superscriptnormsubscript𝐯𝑒𝑖subscript𝐯𝑡𝑖2𝐴𝐸𝐸1𝑁superscriptsubscriptΣ𝑖1𝑁delimited-∥∥subscript𝐯𝑒𝑖subscript𝐯𝑡𝑖\begin{split}RMSE&=\sqrt{\frac{1}{N}\mathop{\Sigma}_{i=1}^{N}\|\mathbf{v}_{e,i% }-\mathbf{v}_{t,i}\|^{2}}\\ AEE&=\frac{1}{N}\mathop{\Sigma}_{i=1}^{N}\|\mathbf{v}_{e,i}-\mathbf{v}_{t,i}\|% \end{split}start_ROW start_CELL italic_R italic_M italic_S italic_E end_CELL start_CELL = square-root start_ARG divide start_ARG 1 end_ARG start_ARG italic_N end_ARG roman_Σ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT ∥ bold_v start_POSTSUBSCRIPT italic_e , italic_i end_POSTSUBSCRIPT - bold_v start_POSTSUBSCRIPT italic_t , italic_i end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG end_CELL end_ROW start_ROW start_CELL italic_A italic_E italic_E end_CELL start_CELL = divide start_ARG 1 end_ARG start_ARG italic_N end_ARG roman_Σ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT ∥ bold_v start_POSTSUBSCRIPT italic_e , italic_i end_POSTSUBSCRIPT - bold_v start_POSTSUBSCRIPT italic_t , italic_i end_POSTSUBSCRIPT ∥ end_CELL end_ROW (13)

where 𝐯e,i=(vx,vy)subscript𝐯𝑒𝑖subscript𝑣𝑥subscript𝑣𝑦\mathbf{v}_{e,i}=(v_{x},v_{y})bold_v start_POSTSUBSCRIPT italic_e , italic_i end_POSTSUBSCRIPT = ( italic_v start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT ) is the ithsuperscript𝑖𝑡i^{th}italic_i start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT estimated vector out of N𝑁Nitalic_N points, while the 𝐯t,isubscript𝐯𝑡𝑖\mathbf{v}_{t,i}bold_v start_POSTSUBSCRIPT italic_t , italic_i end_POSTSUBSCRIPT denotes the ithsuperscript𝑖𝑡i^{th}italic_i start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT ground truth.

IV.1 On correlation coefficients

Refer to caption
(a) A test image pair (u=5.0pixel)𝑢5.0𝑝𝑖𝑥𝑒𝑙(u=-5.0pixel)( italic_u = - 5.0 italic_p italic_i italic_x italic_e italic_l ) and estimated backgrounds
Refer to caption
(b) The correlation maps
Refer to caption
(c) The correlation coefficient at Δy=0Δ𝑦0\Delta y=0roman_Δ italic_y = 0
Figure 5: A correlation responses on synthetic particle images with background. Best viewed in color.

Fig. 5 (a) gives a test pair of synthetic PIV interrogation window (particle displacement is 5.0pixel5.0𝑝𝑖𝑥𝑒𝑙-5.0pixel- 5.0 italic_p italic_i italic_x italic_e italic_l in horizontal), with a unrealistic strong additive background. Obviously, the minimum intensity image (MIN bg) recovers the still background, while the LPF (LPF bg) can not tell the background and particle image apart due to frequency aliasing.

Fig. 5 (b) and (c) provide the cross-correlation coefficients for this challenging synthetic case. The SCC, SPOF, RPC fail to obtain the correct response peak due to a lack of noise signal. Although the SCC-LPF method does not show the correct peak, the rough background also helps to increase the image similarity at the correct displacement. Without surprising, the SCC-MIN method is demonstrated with perfect correlation peak with good background estimation in this case. Compared to other methods, the SBCC has a correlation map with two distinct peaks, and correlation peak (maximum similarity) is correctly located at (5.0,0.0)5.00.0(-5.0,0.0)( - 5.0 , 0.0 ). Closing observing the curve around (5.0,0.0)5.00.0(-5.0,0.0)( - 5.0 , 0.0 ), the SBCC has a similar landscape with SCC-MIN. Thus, we can conclude that SBCC provides another feasible mechanism to perform robust cross-correlation with the help of background signals.

IV.2 On parameter sensitivity

The only parameter μ𝜇\muitalic_μ needs to be determined for our SBCC method working at the best condition. Similar to the background subtraction, it’s very difficult to obtain ideal context images. Hence, increasing μ𝜇\muitalic_μ will not always benefit the robustness or accuracy. We thus argue that there is an optimal parameter μ𝜇\muitalic_μ for the practical PIV measurement.

Refer to caption
Figure 6: Synthetic particle images with/without different backgrounds (uniform flow).

To study the effect of different μ𝜇\muitalic_μ values, we also employ the synthetic particle images from particle image generator (PIG). Here, we use a subset of a ready dataset (1000 uniform flow) Cai et al. (2019b, a). Different from the noise-free images, a random synthetic image is added to the clean synthetic image pair, to simulate the still background. The background images are from a bubble dataset (1219 images of class 1) Bai et al. (2021); Park et al. (2021), synthetic sinusoidal and square signal, as demonstrated in Fig. 6.

Refer to caption
(a) RMSE for a background-free image pair
Refer to caption
(b) RMSE for an image pair with bubble background
Figure 7: Effect of the parameter μ𝜇\muitalic_μ of SBCC. (u=0.85pixel,v=1.23pixelformulae-sequence𝑢0.85𝑝𝑖𝑥𝑒𝑙𝑣1.23𝑝𝑖𝑥𝑒𝑙u=0.85pixel,v=1.23pixelitalic_u = 0.85 italic_p italic_i italic_x italic_e italic_l , italic_v = 1.23 italic_p italic_i italic_x italic_e italic_l)

Fig. 7 illustrates the experimental results with both background- free and bubble background image pair (top row of Fig. 6) respectively, and the parameter μ𝜇\muitalic_μ varies from 0 to 16 with interval 0.10.10.10.1. In comparison with noise-free case, the RMSE of all methods are increased when additive bubble noise is added. It reflects the PIV challenge caused by the unwanted background. Interestingly, the sensitivity curve of SBCC have an optimal value that corresponds with the minimum RMSE value. The optimal value μ𝜇\muitalic_μ of complex background is larger than that of background-free situation. Meanwhile, an improvement of SBCC happens for a large range of μ𝜇\muitalic_μ, i.e., range [0,6]06[0,6][ 0 , 6 ] for background-free and [1,16]116[1,16][ 1 , 16 ] for bubble background. Thus, setting a proper μ𝜇\muitalic_μ could be an interesting problem for future work. Anyway, we set μ𝜇\muitalic_μ to 3.03.03.03.0 arbitrarily by taking all cases into consideration based on the results of this experiment. Note that, the fixed μ(3.0)𝜇3.0\mu(3.0)italic_μ ( 3.0 ) is not changed for different measurement cases, and the extensive results illustrate that this value could have a universal robust performance.

IV.3 On synthetic PIV images

Table 2: Synthetic experiment on one image pair. Performance measured by RMSE. The best in Bold.
Background Free Bubble Sinusoidal Square
SCC 0.3753 0.5054 0.7930 0.4644
SCC-MIN 0.4422 0.4422 0.4422 0.4422
SCC-LPF 0.3919 0.3487 0.3934 0.3244
SPOF 0.4060 0.4530 0.4725 0.4382
RPC 0.1883 0.6597 0.3724 0.2274
SBCC 0.2195 0.2765 0.1914 0.1985
Table 3: Synthetic experiment on one image pair. Performance measured by AEE. The best in Bold.
Background Free Bubble Sinusoidal Square
SCC 0.3422 0.4557 0.7813 0.4311
SCC-MIN 0.3827 0.3827 0.3827 0.3827
SCC-LPF 0.3438 0.3213 0.3721 0.3054
SPOF 0.3768 0.4181 0.4460 0.4099
RPC 0.1619 0.4193 0.3440 0.1983
SBCC 0.2050 0.2457 0.1741 0.1810

To quantitatively compare the performance, a synthetic image pair is sampled Cai et al. (2019a) with different backgrounds (Fig. 6). Table. 2 and 3 give the RMSE and AEE values for the processed results. For the background-free scenario, vanilla SCC yields satisfactory result, while the variations (SCC-MIN, SCC-LPF and SPOF) do not show a consistent improvement in accuracy. For the cases with background, all variants have a better performance than SCC. It implies that background subtraction (SCC-MIN, SCC-LPF) and spectral filters (SPOF, RPC) are all effective to address background problem. In this experiment, the RPC has the smallest measurement error on background-free images, while SBCC achieves significant improvement for the cases with background.

Refer to caption
Figure 8: Boxplot of the RMSE for the synthetic 1000 uniform cases with 4 types of backgrounds and 6 PIV CC estimators.
Refer to caption
Figure 9: Boxplot of the AEE for the synthetic 1000 uniform cases with 4 types of backgrounds and 6 PIV CC estimators.

In addition to case study, the Monte Carlo simulation is widely adopted in the assessment of PIV measurement uncertainty Raffel et al. (2018); Lee, Yang, and Yin (2017b). Recall that, the 1000 synthetic particle image pairs (256×256256256256\times 256256 × 256 pixel2) are synthesized with uniform flows ground truth Cai et al. (2019b), and the backgrounds are sampled from a bubble dataset Park et al. (2021) and two artificial signals (random sinusoidal wave and square wave). The boxplots in Fig. 8 and 9 present the statistical results of 1000 cases measured by RMSE and AEE. The one-pass SCC and RPC method have acceptable measurement error (RMSE 0.2similar-toabsent0.2\sim 0.2∼ 0.2) for background-free cases due to varying seeding density, particle diameters, illuminations in the image generator. However, they are not robust enough to backgrounds. Both SCC-MIN and SPOF have poor performance on all cases. The reasons might be the low-quality background of SCC-MIN, while peak-locking causes a significant error of SPOF. On the contrary, both SCC-LPF and SBCC performs well for all test cases. We speculate that the synthetic backgrounds might be well estimated by a LPF, resulting in the good performance of SCC-LPF. Note that, our SBCC is more accurate than SCC-LPF statistically.

IV.4 On real recorded PIV images

Refer to caption
(a) A lab-made PIV setup with an industrial camera (HIKROBOT, MV-CA016-10UM) and multiple semiconductor lasers (532nm𝑛𝑚nmitalic_n italic_m)
Refer to caption
(b) Left: interactive flow around L-shaped plate Liberzon et al. (2016); Middle: hypersonic flow over a step model Lu (2023); Right: rotational flow with background from (a).
Figure 10: A lab-made setup (a) and 3 practical test cases (b).
Refer to caption
Figure 11: Vectors from the interactive flow. The color background corresponds to velocity magnitude. Best viewed in color.
Refer to caption
Figure 12: Vectors from the hypersonic flow. The color background corresponds to velocity magnitude. Best viewed in color.
Refer to caption
Figure 13: Vectors from the rotational flow. The color background corresponds to velocity magnitude. Best viewed in color.

In addition to synthetic images, we also tested SBCC on three challenging recorded PIV image pairs (Fig. 10 (b)). The first case records an interactive flow around L-shaped plate in OpenPIV Liberzon et al. (2016). The second PIV case records a hypersonic flow (5Ma5𝑀𝑎5Ma5 italic_M italic_a) over a step model Lu (2023). The third image pair is from a lab-made PIV setup (Fig. 10 (a)), which represents a liquid column rotating with a still text leaflet background (similar to PIV Challenge 2014, Case F). These flows have particular structures with large displacement (10absent10\approx 10≈ 10 pixel), and the images have non-uniform illumination, out-of-plane effects and noise background. Thus, they are considered as the challenging test examples.

Fig. 11, 12 and 13 provide the raw vector field results, computed with different one-pass cross-correlation methods without any post-processing operation. The pseudo-color backgrounds represent the vector magnitudes. Overall, different methods output similar flow patterns, verifying that these widely-used methods do work in practice. Unfortunately, the accuracy for each method can not be exactly assessed due to unknown ground truth. Thus, we visually check the outliers for the problematic areas in white boxes to evaluate the robustness of each method. The left area of interactive flow image is under weak illumination, and thus full of random noise. The results indicate the spectral filters (SPOF, RPC and SBCC) can effectively cope with this problem, as reported in related works Eckstein and Vlachos (2009). The middle box of hypersonic flow, with strong out-of-plane movement, is full of uncorrelated particles. We argue that the MIN background could catch some content of the noise signal, resulting in the less outliers of SCC-MIN and SBCC. The complex background of rotational flow is obviously difficult to reconstruct with MIN or LPF approaches, which explains the bad performance of SCC-MIN and SCC-LPF. Besides, the spectral filter of RPC also does not work well for this case. However, combining multiple background contexts and spectral filter, our SBCC has an impressive performance with a few outliers for this strong background of advertising words. Based on the impressive results of comprehensive (synthetic and recorded) experiments, the overall effectiveness of our SBCC has been confirmed.

IV.5 On computing cost

Table 4: The running time (seconds) (averaged over 10 runs, with standard deviations).
Image size 128×128128128128\times 128128 × 128 256×256256256256\times 256256 × 256 512×512512512512\times 512512 × 512 1014×1024101410241014\times 10241014 × 1024
SCC 0.0022±0.0002plus-or-minus0.00220.00020.0022\pm 0.00020.0022 ± 0.0002 0.0055±0.0009plus-or-minus0.00550.00090.0055\pm 0.00090.0055 ± 0.0009 0.0233±0.0019plus-or-minus0.02330.00190.0233\pm 0.00190.0233 ± 0.0019 0.0792±0.0031plus-or-minus0.07920.00310.0792\pm 0.00310.0792 ± 0.0031
SCC-MIN 0.0020±0.0005plus-or-minus0.00200.00050.0020\pm 0.00050.0020 ± 0.0005 0.0055±0.0007plus-or-minus0.00550.00070.0055\pm 0.00070.0055 ± 0.0007 0.0220±0.0009plus-or-minus0.02200.00090.0220\pm 0.00090.0220 ± 0.0009 0.0785±0.0026plus-or-minus0.07850.00260.0785\pm 0.00260.0785 ± 0.0026
SCC-LPF 0.0070±0.0127plus-or-minus0.00700.01270.0070\pm 0.01270.0070 ± 0.0127 0.0081±0.0008plus-or-minus0.00810.00080.0081\pm 0.00080.0081 ± 0.0008 0.0277±0.0006plus-or-minus0.02770.00060.0277\pm 0.00060.0277 ± 0.0006 0.0838±0.0030plus-or-minus0.08380.00300.0838\pm 0.00300.0838 ± 0.0030
SPOF 0.0019±0.0001plus-or-minus0.00190.00010.0019\pm 0.00010.0019 ± 0.0001 0.0071±0.0004plus-or-minus0.00710.00040.0071\pm 0.00040.0071 ± 0.0004 0.0232±0.0005plus-or-minus0.02320.00050.0232\pm 0.00050.0232 ± 0.0005 0.0834±0.0032plus-or-minus0.08340.00320.0834\pm 0.00320.0834 ± 0.0032
RPC 0.0022±0.0002plus-or-minus0.00220.00020.0022\pm 0.00020.0022 ± 0.0002 0.0067±0.0002plus-or-minus0.00670.00020.0067\pm 0.00020.0067 ± 0.0002 0.0232±0.0007plus-or-minus0.02320.00070.0232\pm 0.00070.0232 ± 0.0007 0.0822±0.0024plus-or-minus0.08220.00240.0822\pm 0.00240.0822 ± 0.0024
SBCC 0.0070±0.0034plus-or-minus0.00700.00340.0070\pm 0.00340.0070 ± 0.0034 0.0187±0.0004plus-or-minus0.01870.00040.0187\pm 0.00040.0187 ± 0.0004 0.0576±0.0034plus-or-minus0.05760.00340.0576\pm 0.00340.0576 ± 0.0034 0.1995±0.0315plus-or-minus0.19950.03150.1995\pm 0.03150.1995 ± 0.0315

To demonstrate the efficiency of proposed SBCC, the baseline methods and SBCC were finally tested using Python 3.8 on a 2.70GHz i5-11400H laptop computer (HP OMEN 16) with RAM 16.00GB. Tested over 10 runs, the execution results (Table. 4) with varied image size demonstrated that the SBCC has an acceptable computational cost. Recall that an extra FFT operation of contexts is needed, and it is worthwhile to cost 23similar-to232\sim 32 ∼ 3 times computation for the accuracy improvement. For a 1024×1024102410241024\times 10241024 × 1024 case, the execution time of our SBCC is less than 0.20.20.20.2 second.

V Conclusion

Inspired by correlation filter, a novel SBCC framework is proposed to enhance the cross-correlation performance by incorporating multiple negative context images. As a multivariate operator, the SBCC is the closed-form solution of a well-designed optimization problem, which is formulated with both surrogate objective and correlation consistency objective. To our surprising, this framework also provides an alternative surrogate view for a set of generalized cross-correlation methods (PC, RPC, 1-CSPC, SPOF, etc). On the correlation response, the SBCC method is more likely to achieve a desired Gaussian-shaped correlation response, encouraged by the objective function. And an arbitrary parameter μ(3.0)𝜇3.0\mu(3.0)italic_μ ( 3.0 ) is fixed through parameter sensitivity analysis. Finally, the performance improvement of SBCC is verified with massive synthetic and real PIV image pairs. An interesting point is that SBCC paves a new way for robust PIV cross correlation analysis via employing negative context images. Moving forward, we plan to apply SBCC beyond PIV to other tasks including one-dimensional time series, digital image correlation, acoustic imaging, etc.

Acknowledgment

This work was supported by the National Natural Science Foundation of China (Grant No.: 52205575), Natural Science Foundation of Hubei Province (Grant Number: 2023AFB128) and Teaching Research Project of Wuhan University of Technology (Grant Number: W2022093). The authors would like to thank Dr. B. Wieneke and Zhenghao Cen for beneficial discussion.

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

References

  • Adrian (1984) R. J. Adrian, “Scattering particle characteristics and their effect on pulsed laser measurements of fluid flow: speckle velocimetry vs particle image velocimetry,” Applied optics 23, 1690–1691 (1984).
  • Raffel et al. (2018) M. Raffel, C. E. Willert, F. Scarano, C. J. Kähler, S. T. Wereley,  and J. Kompenhans, Particle image velocimetry: a practical guide (Springer, 2018).
  • Lee and Mei (2022) Y. Lee and S. Mei, “Diffeomorphic particle image velocimetry,” IEEE Transactions on Instrumentation and Measurement 71 (2022).
  • Honkanen and Nobach (2005) M. Honkanen and H. Nobach, “Background extraction from double-frame piv images,” Experiments in fluids 38, 348–362 (2005).
  • Deen et al. (2010) N. G. Deen, P. Willems, M. van Sint Annaland, J. Kuipers, R. G. Lammertink, A. J. Kemperman, M. Wessling,  and W. G. van der Meer, “On image pre-processing for piv of single-and two-phase flows over reflecting objects,” Experiments in fluids 49, 525–530 (2010).
  • Sciacchitano and Scarano (2014) A. Sciacchitano and F. Scarano, “Elimination of piv light reflections via a temporal high pass filter,” Measurement Science and Technology 25, 084009 (2014).
  • Sciacchitano (2019) A. Sciacchitano, “Uncertainty quantification in particle image velocimetry,” Measurement Science and Technology 30, 092001 (2019).
  • Michaelis, Neal, and Wieneke (2016) D. Michaelis, D. R. Neal,  and B. Wieneke, “Peak-locking reduction for particle image velocimetry,” Measurement Science and Technology 27, 104005 (2016).
  • Wang et al. (2015) H. Wang, Q. Gao, L. Feng, R. Wei,  and J. Wang, “Proper orthogonal decomposition based outlier correction for piv data,” Experiments in Fluids 56, 1–15 (2015).
  • Lee, Yang, and Yin (2017a) Y. Lee, H. Yang,  and Z. Yin, “Outlier detection for particle image velocimetry data using a locally estimated noise variance,” Measurement Science and Technology 28, 035301 (2017a).
  • Willert and Gharib (1991) C. E. Willert and M. Gharib, “Digital particle image velocimetry,” Experiments in fluids 10, 181–193 (1991).
  • Scarano (2001) F. Scarano, “Iterative image deformation methods in piv,” Measurement science and technology 13, R1 (2001).
  • Wang, He, and Wang (2020) H. Wang, G. He,  and S. Wang, “Globally optimized cross-correlation for particle image velocimetry,” Experiments in Fluids 61, 1–17 (2020).
  • Zhu et al. (2022) X. Zhu, C. Xu, M. M. Hossain, J. Li, B. Zhang,  and B. C. Khoo, “Approach to select optimal cross-correlation parameters for light field particle image velocimetry,” Physics of Fluids 34 (2022).
  • Gao et al. (2021) Q. Gao, H. Lin, H. Tu, H. Zhu, R. Wei, G. Zhang,  and X. Shao, “A robust single-pixel particle image velocimetry based on fully convolutional networks with cross-correlation embedded,” Physics of Fluids 33 (2021).
  • Corpetti et al. (2006) T. Corpetti, D. Heitz, G. Arroyo, E. Mémin,  and A. Santa-Cruz, “Fluid experimental flow estimation based on an optical-flow scheme,” Experiments in fluids 40, 80–97 (2006).
  • Zhong, Yang, and Yin (2017) Q. Zhong, H. Yang,  and Z. Yin, “An optical flow algorithm based on gradient constancy assumption for piv image processing,” Measurement Science and Technology 28, 055208 (2017).
  • Lu et al. (2021) J. Lu, H. Yang, Q. Zhang,  and Z. Yin, “An accurate optical flow estimation of piv using fluid velocity decomposition,” Experiments in Fluids 62, 1–16 (2021).
  • Lee, Yang, and Yin (2017b) Y. Lee, H. Yang,  and Z. Yin, “Piv-dcnn: cascaded deep convolutional neural networks for particle image velocimetry,” Experiments in Fluids 58, 171 (2017b).
  • Cai et al. (2019a) S. Cai, S. Zhou, C. Xu,  and Q. Gao, “Dense motion estimation of particle images via a convolutional neural network,” Experiments in Fluids 60, 1–16 (2019a).
  • Lagemann et al. (2021) C. Lagemann, K. Lagemann, S. Mukherjee,  and W. Schröder, “Deep recurrent optical flow learning for particle image velocimetry data,” Nature Machine Intelligence 3, 641–651 (2021).
  • Cao et al. (2024) L. Cao, M. M. Hossain, J. Li,  and C. Xu, “Three-dimensional particle image velocimetry measurement through three-dimensional u-net neural network,” Physics of Fluids 36 (2024).
  • Eckstein and Vlachos (2009) A. Eckstein and P. P. Vlachos, “Digital particle image velocimetry (dpiv) robust phase correlation,” Measurement Science and Technology 20, 055401 (2009).
  • Horner and Gianino (1984) J. L. Horner and P. D. Gianino, “Phase-only matched filtering,” Applied optics 23, 812–816 (1984).
  • Wernet (2005) M. P. Wernet, “Symmetric phase only filtering: a new paradigm for dpiv data processing,” Measurement Science and Technology 16, 601 (2005).
  • Bao, Yang, and ** (2014) L. Bao, Q. Yang,  and H. **, “Fast edge-preserving patchmatch for large displacement optical flow,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2014) pp. 3534–3541.
  • Zhang and Piggott (2020) M. Zhang and M. D. Piggott, “Unsupervised learning of particle image velocimetry,” in High Performance Computing: ISC High Performance 2020 International Workshops, Frankfurt, Germany (Springer, 2020) pp. 102–115.
  • Yu et al. (2021) C. Yu, X. Bi, Y. Fan, Y. Han,  and Y. Kuai, “Lightpivnet: An effective convolutional neural network for particle image velocimetry,” IEEE Transactions on Instrumentation and Measurement  (2021).
  • Yu et al. (2023) C. Yu, Y. Fan, X. Bi, Y. Kuai,  and Y. Chang, “Deep dual recurrence optical flow learning for time-resolved particle image velocimetry,” Physics of Fluids 35 (2023).
  • Lagemann et al. (2022) C. Lagemann, K. Lagemann, S. Mukherjee,  and W. Schröder, “Generalization of deep recurrent optical flow estimation for particle-image velocimetry data,” Measurement Science and Technology 33, 094003 (2022).
  • Kähler et al. (2016) C. J. Kähler, T. Astarita, P. P. Vlachos, J. Sakakibara, R. Hain, S. Discetti, R. La Foy,  and C. Cierpka, “Main results of the 4th international piv challenge,” Experiments in Fluids 57, 97 (2016).
  • Liberzon et al. (2016) A. Liberzon, D. Lasagna, M. Aubert, P. Bachant, J. Borg, et al., “Openpiv/openpiv-python: Updated pyprocess with extended area search method,”   (2016).
  • Xie, Wang, and Xu (2022) Z. Xie, H. Wang,  and D. Xu, “Spatiotemporal optimization on cross correlation for particle image velocimetry,” Physics of Fluids 34 (2022).
  • Dellenback, Macharivilakathu, and Pierce (2000) P. A. Dellenback, J. Macharivilakathu,  and S. R. Pierce, “Contrast-enhancement techniques for particle-image velocimetry,” Applied optics 39, 5978–5990 (2000).
  • Shavit, Lowe, and Steinbuck (2007) U. Shavit, R. J. Lowe,  and J. V. Steinbuck, “Intensity cap**: a simple method to improve cross-correlation piv results,” Experiments in Fluids 42, 225–240 (2007).
  • Lee et al. (2022) Y. Lee, S. Zhang, M. Li,  and X. He, “Blind inverse gamma correction with maximized differential entropy,” Signal Processing 193, 108427 (2022).
  • Fan et al. (2023) Y. Fan, C. Guo, Y. Han, W. Qiao, P. Xu,  and Y. Kuai, “Deep-learning-based image preprocessing for particle image velocimetry,” Applied Ocean Research 130, 103406 (2023).
  • Zhao et al. (2024) F. Zhao, Z. Zhou, D. Hung, X. Li,  and M. Xu, “Flow field reconstruction from spray imaging: A hybrid physics-based and machine learning approach based on two-phase fluorescence particle image velocimetry measurements,” Physics of Fluids 36 (2024).
  • Mejia-Alvarez and Christensen (2013) R. Mejia-Alvarez and K. Christensen, “Robust suppression of background reflections in piv images,” Measurement Science and Technology 24, 027003 (2013).
  • Mendez et al. (2017) M. A. Mendez, M. Raiola, A. Masullo, S. Discetti, A. Ianiro, R. Theunissen,  and J.-M. Buchlin, “Pod-based background removal for particle image velocimetry,” Experimental Thermal and Fluid Science 80, 181–192 (2017).
  • Wang et al. (2020) L. Wang, C. Pan, J. Liu,  and C. Cai, “Ratio-cut background removal method and its application in near-wall ptv measurement of a turbulent boundary layer,” Measurement Science and Technology 32, 025302 (2020).
  • Adrian and Westerweel (2011) R. J. Adrian and J. Westerweel, Particle image velocimetry, 30 (Cambridge university press, 2011).
  • Adatrao and Sciacchitano (2019) S. Adatrao and A. Sciacchitano, “Elimination of unsteady background reflections in piv images by anisotropic diffusion,” Measurement Science and Technology 30, 035204 (2019).
  • Bolme et al. (2010) D. S. Bolme, J. R. Beveridge, B. A. Draper,  and Y. M. Lui, “Visual object tracking using adaptive correlation filters,” in 2010 IEEE computer society conference on computer vision and pattern recognition (IEEE, 2010) pp. 2544–2550.
  • Henriques et al. (2012) J. F. Henriques, R. Caseiro, P. Martins,  and J. Batista, “Exploiting the circulant structure of tracking-by-detection with kernels,” in European conference on computer vision (Springer, 2012) pp. 702–715.
  • Henriques et al. (2015) J. F. Henriques, R. Caseiro, P. Martins,  and J. Batista, “High-speed tracking with kernelized correlation filters,” IEEE Transactions on Pattern Analysis and Machine Intelligence 37, 583–596 (2015).
  • Wereley and Meinhart (2001) S. T. Wereley and C. D. Meinhart, “Second-order accurate particle image velocimetry,” Experiments in fluids 31, 258–268 (2001).
  • Eckstein, Charonko, and Vlachos (2008) A. C. Eckstein, J. Charonko,  and P. Vlachos, “Phase correlation processing for dpiv measurements,” Experiments in Fluids 45, 485–500 (2008).
  • Shen and Liu (2009) M. Shen and H. Liu, “A modified cross power-spectrum phase method based on microphone array for acoustic source localization,” in 2009 IEEE International Conference on Systems, Man and Cybernetics (IEEE, 2009) pp. 1286–1291.
  • Chen, Lee, and Soh (2021) K. Chen, Y. Lee,  and H. Soh, “Multi-modal mutual information (mummi) training for robust self-supervised deep reinforcement learning,” in IEEE International Conference on Robotics and Automation (ICRA) (2021).
  • Delnoij et al. (1999) E. Delnoij, J. Westerweel, N. G. Deen, J. Kuipers,  and W. P. M. van Swaaij, “Ensemble correlation piv applied to bubble plumes rising in a bubble column,” Chemical Engineering Science 54, 5159–5171 (1999).
  • Cai et al. (2019b) S. Cai, J. Liang, Q. Gao, C. Xu,  and R. Wei, “Particle image velocimetry based on a deep learning motion estimator,” IEEE Transactions on Instrumentation and Measurement 69, 3538–3554 (2019b).
  • Bai et al. (2021) C. Bai, H. Park, C. Y. Ng,  and L. Wang, “Classification of gas dispersion states via deep learning based on images obtained from a bubble sampler,” Chemical Engineering Journal Advances 5, 100064 (2021).
  • Park et al. (2021) H. Park, C. Bai, C. Y. Ng,  and L. Wang, “Bubble image database,”  (2021).
  • Lu (2023) J. Lu, Research on Variational Optical Flow Particle Image Velocimetry in Hypersonic FlowsPh.D. thesis, Huazhong University of Science and Technology (2023).