Efficient Design and Implementation of Fast-Convolution-Based Variable-Bandwidth Filters
Abstract
This paper introduces an efficient design approach for a fast-convolution-based variable-bandwidth (VBW) filter. The proposed approach is based on a hybrid of frequency sampling and optimization (HFSO), that offers significant computational complexity reduction compared to existing solutions for a given performance. The paper provides a design procedure based on minimax optimization to obtain the minimum complexity of the overall filter. A design example includes a comparison of the proposed design-based VBW filter and time-domain designed VBW filters implemented in the time domain and in the frequency domain. It is shown that not only the implementation complexity can be reduced but also the design complexity by excluding any computations when the bandwidth of the filter is adjusted. Moreover, memory requirements are also decreased compared to the existing frequency-domain implementations.
Index Terms:
Variable bandwidth filter, fast convolution, frequency-domain design, time-varying systems, overlap-save, multirate filter banksI Introduction
Due to the increasing demand for reconfigurable systems in the contemporary world of technologies, variable digital filters (VDFs) are required in many digital signal processing applications, for example, in medical devices [1] and communication systems [2, 3]. The main advantage of these filters over regular digital filters is that they offer variability of the frequency response by adjusting only one or a few parameters without online filter design. Most recent papers focused on VDFs [1, 2, 3] have shown that this approach allows to significantly reduce implementation complexity and thereby hardware complexity compared to regular filters requiring online design for every new specification. Nevertheless, implementations of VDFs in the time domain may still cause a rather high computational complexity for stringent requirements.
Research works on finite-impulse-response (FIR) filters have shown that a filter can be implemented in the frequency domain using the fast convolution (FC) with much lower complexity than in the time domain [4, 5, 6, 7]. The most widely used techniques are the overlap-save (OLS) and overlap-add (OLA) methods which employ the discrete Fourier transform (DFT) and its inverse (IDFT) [8, 9]. The FC-based VDFs [3, 10] have shown significant reduction of the computational complexity compared to time-domain implementations.
Typically, regardless of the implementation, VDFs are designed by optimizing the impulse response values to satisfy specification requirements for the frequency response, hereafter this is also referred as time-domain design. In this case, the filter DFT coefficients are given by the DFT of the optimized impulse response. For frequency-domain implementations, this fact leads to less efficient implementation and update of variable coefficients in terms of computational complexity. Thus, this work is devoted to frequency-domain design of FC-based VDFs, specifically variable-bandwidth (VBW) filters, by instead optimizing the filter DFT coefficients. The OLS method is considered here due to somewhat less implementation complexity compared to the OLA method [7].
The main contributions of the paper are as follows.
-
1.
An approach to FC-based VBW filter design allowing to significantly reduce both implementation and design complexity is proposed. The core idea of the proposal is to design a filter based on a hybrid of frequency sampling and optimization (HFSO) with the aim of utilizing the same set of transition band values for different bandwidths in the implementation. The proposed HFSO method consists of designing a VBW filter by optimizing only one set of the transition band samples while the passband and stopband values are directly obtained by sampling the desired frequency response. This means that the magnitude of the filter DFT coefficients for various bandwidths are sets of the same values, i.e., ones, zeros, and the optimized transition band values. This approach leads to a very simple update of the filter DFT coefficients when the bandwidth is varied, which was the drawback of the variable-weight-based structure with the lowest implementation complexity proposed in [10] requiring computations for every new value of the variable parameter. Hence, our proposed approach allows to simultaneously lower the implementation complexity, by cutting the number of general multiplications per sample, and significantly reduce the design complexity when the bandwidth is adjusted. Moreover, the proposed approach allows to eliminate the dependence of the implementation and design complexities on the variable bandwidth range while it is one of the main limitations in the existing VBW filter implementations [11, 10].
-
2.
A systematic procedure for FC-based VBW filter design using the HFSO approach is provided. A similar approach based on the frequency sampling has been utilized for regular FIR filter design [12, 13, 14, 15]. However, the authors did not consider FIR filter implementations using the OLS or OLA methods, which together with the frequency sampling method make the filter a linear periodic time-varying (LPTV) system. Design of these systems has to include measuring the distortion and aliasing functions, which can be derived from a multirate filter bank representation, or a set of time-invariant impulse responses of the corresponding periodically time-varying impulse-response (PTVIR) representation. Although it was mentioned in [3] that a frequency-domain designed FC-based filter, causing cyclic distortions, can be modeled as an LPTV system, the authors did not provide details whether the corresponding time-invariant responses have been controlled during the design procedure. In our proposed approach, the overall filter is designed by ensuring that each of the corresponding time-invariant responses meets the specification using minimax optimization.
The rest of the paper is organized as follows. Section II gives a brief overview of the overlap-save method while Section III presents the proposed approach of an FC-based VBW filter design using the HFSO method. Section IV provides an example and computational-complexity analysis. Finally, Section V concludes the paper.
II Overlap-Save Method
The main concept of the FC using the OLS method is that the input signal is divided into overlap** segments , , where is a segment index and the overlap** part is . Then, for each segment, the following computations are carried out.
-
1.
The segment is transformed via an -point DFT.
-
2.
The DFT coefficients are multiplied by the filter DFT coefficients .
-
3.
An -point IDFT is performed.
-
4.
The first samples of the resulting block are discarded, so that the output segments of length are no longer overlap**.
-
5.
The output sequence is obtained by concatenating the resulting segments as .
In classical OLS filtering, the system is intended to be time-invariant, i.e., the aliasing error is zero. In this case, the DFT coefficients of an FIR filter correspond to the -point DFT of the impulse response of length , designed by any of the available methods [8]. This means for , where .
In this paper, the proposed HFSO design approach111The HFSO design approach will be discussed in details in Section III-B. restricts the frequency response to take on fixed values for certain frequencies. Even if the effective order of the underlying filter is , after an -point IDFT, this results in an -length impulse response (although the impulse response values are quite small for ) and thereby makes the overall system an -periodic time-varying system, i.e., the filter impulse response coefficients change from sample to sample [9]. In these systems, aliasing cannot be cancelled but can be suppressed to any desired level through a proper design, which is considered in this paper. An LPTV system can be represented using a PTVIR representation corresponding to a set of time-invariant impulse responses , , (or their corresponding frequency responses ) [9, 16, 7]. This representation can be a better indicator of the worst-case time-domain error of the overall system comparing to the multirate filter-bank representation in terms of distortion and aliasing [7].
III Proposed Approach to Design a VBW Filter Implemented Using the OLS method
III-A VBW Filter Implemented Using the OLS method
The desired frequency response of a VBW filter of effective length is considered here as
(1) |
where is the center of the transition band, whereas and represent variable passband and stopband edges, respectively. The transition width is assumed to be fixed.
The VBW filter frequency-domain implementation proposed in [10] is shown in Fig. 1. This structure utilizes variable DFT coefficients that have to be recomputed whenever is changed. As was mentioned in Section II, this implementation can be equivalently represented using the PTVIR representation, which is shown in Fig. 2. The output of the system can be written as
(2) |
where are modified responses from [7] by introducing the variable parameter . They can be expressed as
(3) |
with being the impulse responses of and circular versions of each other, i.e., . This allows to avoid computations of for every in the design. The last response corresponds to the IDFT of , i.e.,
(4) |
The DFT coefficients are obtained using the HFSO approach that will be described in the following subsection.
III-B Proposed HFSO Approach
The proposed HFSO approach consists of two parts: sampling of the passband and stopband of the desired frequency response in (1) and optimization of the samples belonging to the transition band, which is generally not specified in the desired response, in order to minimize the passband and stopband ripples of all responses , , for . Thus, the coefficients are given as
(5) |
with the magnitude response samples given by
(6) |
where , , and are the passband, transition band, and the stopband regions, respectively. Here, and are the first and the last sample indices of the transition band, correspondingly, given by
(7) |
where the values of and are assumed to be discretized so that and with and being the fixed transition width and variable parameter in terms of frequency bins, . It is assumed that the transition band samples can be utilized for different values of as it is illustrated in Fig. 3. Therefore, do not depend on and can be written as , with .
Considering that the VBW filter impulse response is real-valued, the responses are also real-valued. Then, in (4) is given by
(8) |
III-C Minimax Design
In order to design an LPTV system modeled as a set of time-invariant frequency responses, one needs to ensure that each of the responses approximates the desired response [7]. Therefore, for a VBW filter design, each has to approximate the desired frequency response , given by222The desired response in (9) differs from in (1) by an additional delay of samples due to the OLS implementation.
(9) |
In this paper, they are approximated in the minimax sense. This means that the overall filter is here designed by solving the following approximation problem. For given and , find transition band values of the DFT coefficients as well as to
(10) |
for , , and , where is the error function, given by
(11) |
with as in (3), as in (9). The filter will meet the specification if after the optimization satisfies , where is the maximum prespecified approximation error.
III-D Implementation and Design Complexity
Fixed multiplication rate | Variable multiplication rate | Addition rate | Memory |
---|---|---|---|
The overall complexity of the frequency-domain implementations consists of DFT/IDFT transforms and complex multiplications by . In this paper, we consider that the former is implemented using the split-radix fast Fourier transform (FFT) algorithm, the complex multiplication inside the transformation is implemented using 3 real fixed multiplications and 3 real additions, the segments and are real-valued and . Therefore, each FFT and its inverse (IFFT) require multiplications and additions [17]. The multiplications by can be implemented as two consecutive multiplications: a variable (general) multiplication by real and a fixed complex multiplication by . The former is variable and real, thus, it can be implemented using two real general multiplications. The latter is complex and fixed, therefore, it can be implemented using 3 real fixed multiplications and 3 real additions. Moreover, for particular combinations of and such that , , the exponents can be implemented even cheaper (e.g., for , , there are no computations required, and for , , only two real multiplications and two real additions can be used). Additionally, for zero-valued , there are no multiplications by the exponents needed. Thus, the number of fixed multiplications is varied depending on and the ratio . Here, we specify the maximum possible number for fair comparison. Considering that the DFT coefficients and are conjugate symmetric and ones and zeros in do not require any computations, the number of fixed multiplications, general multiplications and additions are , , and , respectively. Therefore, the total implementation complexity per sample is expressed as a fixed multiplication rate , variable multiplication rate , and addition rate summarized in Table I.
For the proposed approach, there are no computations when is altered. Therefore, the design complexity includes only memory to store values of .
III-E Design Procedure Minimizing the Overall Complexity
To minimize the overall complexity per sample, the values of , , and have to be determined. The value of is obtained based on the filter order as , where the initial can be estimated by [18]
(12) |
The optimal value of is estimated using [7] and rounded to the nearest . The value of is computed as .
Further, the approximation problem in (10) is a convex optimization problem which guarantees that the solution is globally optimal in the minimax sense that can be solved using any regular solver for such problems. In this paper, the optimization problem is solved using the optimization toolbox in MATLAB with discretized into grid points. After the optimization, if the approximation error , one should reduce the order and repeat the optimization. If the approximation error , the order must be increased, the values of , , recomputed and the optimization repeated until the .
IV Example
In this example, we apply the proposed design for a VBW filter and compare the so obtained OLS implementation with time-domain designed VBW filters implemented using the Farrow structure [11] and the OLS method [10].
The specification of the VBW filter is given as follows: , , . According to the estimation expressions in Section III-E, the estimated filter order , which gives , and we choose . Thus, the transition band and bandwidth in terms of bins are and , respectively. The passband and stopband regions are discretized to grid points.
After the optimization outlined in Section III-E, , , . Figure 4 plots the magnitude responses of the VBW filter, where the top plot shows responses for and (for better visibility) and corresponding transition band samples , the mid plot shows all for , and the bottom plot shows the error in (11). It is seen that the stopband level of all the responses is less than 60 dB.
The implementation and design complexity rates are listed in Tables II and III, respectively. Since in this example, the exponents in (5) can be implemented using 2 real multiplications and 2 additions only for every odd whereas there are no computations for every even . It is seen that the proposed approach shows significant savings in both complexities compared to other implementations. Moreover, the proposed approach allows to design and implement a VBW filter with the same complexity regardless the range of the variable parameter , while the filters in [11] and [10] cover only narrow range of variable parameter for a reasonable complexity. This means that for the entire variable band case, which is illustrated in Fig. 5, the complexity savings offered by the proposed method will be even much higher.
V Conclusions
This paper proposed an efficient approach to design an FC-based VBW filter using the HFSO. As shown through the design example, the proposed technique can significantly reduce both the implementation and design complexities by avoiding any arithmetic operations when the variable parameter is changed and by relaxing memory requirements.
References
- [1] I. Raghu, N. Haridas, and E. Elias, “High performance continuous variable bandwidth digital filter design for hearing aid application,” AEÜ - Int. J. Electron. Commun., vol. 92, pp. 36–53, 2018.
- [2] I. Raghu and E. Elias, “Low complexity spectrum sensing technique for cognitive radio using Farrow structure digital filters,” Eng. Science Tech., Int. Journal, vol. 22, no. 1, pp. 131–142, Feb. 2019.
- [3] M. Renfors, J. Yli-Kaakinen, and F. J. Harris, “Analysis and design of efficient and flexible fast-convolution based multirate filter banks,” IEEE Trans. Signal Process., vol. 62, no. 15, pp. 3768–3783, Aug. 2014.
- [4] J. J. Shynk, “Frequency-domain and multirate adaptive filtering,” IEEE Signal Process. Magazine, vol. 9, no. 1, pp. 14–37, Jan. 1992.
- [5] K. Ishihara, R. Kudo, T. Kobayashi, A. Sano, Y. Takatori, T. Nakagawa, and Y. Miyamoto, “Frequency-domain equalization for coherent optical transmission systems,” in Proc. Optical Fiber Commun. Conf. and Exposition and the National Fiber Optic Engineers Conf., Mar. 6-10, 2011, pp. 1–3.
- [6] A. Kovalev, O. Gustafsson, and M. Garrido, “Implementation approaches for 512-tap 60 gsa/s chromatic dispersion FIR filters,” in Proc. 51st Asilomar Conf. Signals, Systems, and Computers, Oct. 29–Nov. 1, 2017.
- [7] H. Johansson and O. Gustafsson, “On frequency-domain implementation of digital FIR filters using overlap-add and overlap-save techniques,” Feb. 2023, doi: 10.48550/ARXIV.2302.08845.
- [8] A. Oppenheim and R. Schafer, Discrete-Time Signal Processing, 3rd ed. Pearson, 2010.
- [9] A. Daher, E. H. Baghious, G. Burel, and E. Radoi, “Overlap-save and overlap-add filters: Optimal design and comparison,” IEEE Trans. Signal Processing, vol. 58, no. 6, pp. 3066–3075, Jun. 2010.
- [10] O. Moryakova and H. Johansson, “Frequency-domain implementations of variable digital FIR filters using the overlap-save technique,” in Proc. 2023 24th Int. Conf. Digital Signal Processing (DSP), Rhodes, Greece, Jun. 11–13, 2023, pp. 1–5.
- [11] P. Löwenborg and H. Johansson, “Minimax design of adjustable-bandwidth linear-phase FIR filters,” IEEE Trans. Circuits Syst. I: Reg. Papers, vol. 53, no. 2, pp. 431–439, Feb. 2006.
- [12] L. Rabiner, B. Gold, and C. McGonegal, “An approach to the approximation problem for nonrecursive digital filters,” IEEE Trans. Audio and Electroacoustics, vol. 18, no. 2, pp. 83–106, Jun. 1970.
- [13] S. Harris and E. Ifeachor, “Automatic design of frequency sampling filters by hybrid genetic algorithm techniques,” IEEE Trans. Signal Process., vol. 46, no. 12, pp. 3304–3314, Dec. 1998.
- [14] S. Salcedo-Sanz, F. Cruz-Roldan, C. Heneghan, and X. Yao, “Evolutionary design of digital filters with application to subband coding and data transmission,” IEEE Trans. Signal Process., vol. 55, no. 4, pp. 1193–1203, Apr. 2007.
- [15] R. Y. Belorutsky and I. S. Savinykh, “Modified technique of FIR filter design by the frequency sampling method,” in 2016 11th Int. Forum on Strategic Technology (IFOST), Jun. 01-03, 2016, pp. 259–262.
- [16] P. P. Vaidyanathan, Multirate Systems and Filter Banks. USA: Prentice-Hall, Inc., 1993.
- [17] H. Sorensen, D. Jones, M. Heideman, and C. Burrus, “Real-valued fast Fourier transform algorithms,” IEEE Trans. Acoustics, Speech, and Signal Process., vol. 35, no. 6, pp. 849–863, June 1987.
- [18] M. G. Bellanger, Digital Processing of Signals. John Wiley and Sons, 1984.