[W-]weak form
The weak form of the SDOF and MDOF equation of motion, part II: A numerical method for the SDOF problem
Abstract
A new, more efficient, numerical method for the SDOF problem is presented. Its construction is based on the weak form of the equation of motion, as obtained in [KK24], using piece-wise polynomial functions as interpolation functions. The approximation rate can be arbitrarily high, proportional to the degree of the interpolation functions, tempered only by numerical instability. Moreover, the mechanical energy of the system is conserved. Consequently, all significant drawbacks of existing algorithms, such as the limitations imposed by the Dahlqvist Barrier theorem [Dah56] and the need for introduction of numerical dam**, have been overcome.
Contents
- 1 Application of the weak formulation to piece-wise polynomial interpolation
- 2 Choice of functions
- 3 Some useful properties of Bernstein polynomials
- 4 From Legendre to Bernstein basis
- 5 From to , the space
- 6 The first step of the algorithm,
- 7 The remaining steps
- 8 The case
- 9 Error estimates
- 10 Conclusions
1 Application of the weak formulation to piece-wise polynomial interpolation
The algorithm presented herein, based on the weak formulation of the SDOF problem as obtained in [KK24], is constructed as follows. A partition of the interval into intervals of length is considered, and the approximate solution will be polynomial in each subinterval of the type
(1) |
The space formed by such functions will be denoted by .
The breakpoints , i.e. the points where the expression of a function in is allowed to change are equally spaced in , and the corresponding time-step is . The time-step is chosen so that
(2) |
where is the number of time-steps of the algorithm.
The functions , defined here below, for some fixed , will form a basis for the restriction of functions in in each subinterval formed by consecutive break-points. The weak formulation of the SDOF problem will be considered in each such interval . The algorithm thus takes automatically a time-step algorithm character and, even though it does not represent the full potential of the weak formulation, can be seen to be already more powerful than those in the literature.
Consequently, there is some change in notation with respect to [KK24], since still stands for the upper bound of the interval in which the solution is approximated, but the intervals on which the weak formulation is considered are precisely the , for where .
The authors are already working on the construction of a numerical method based on the weak formulation on the entire interval , which will also have a step-by-step nature.
2 Choice of functions
Fix . The space
(3) |
is chosen to be equal to the space of polynomial functions of a given degree , . It is of dimension . The space
(4) |
is thus equal to the space of polynomial functions of degree and of dimension .
The basis of given in the next equation is known in the literature as the Bernstein polynomials, see, e.g., [Kac38]. Call, for ,
(5) |
All functions are by their definition non-negative. The normalization factor
(6) |
is chosen so that
(7) |
i.e. so that the form a partition of unity on the interval (cf. [dB78], Β§IX).
3 Some useful properties of Bernstein polynomials
3.1 The functions and their derivatives
The Bernstein polynomials form a basis of polynomial functions of degree defined on . For this particular choice of basis, one can immediately verify that
(8) |
More precisely,
(9) |
The following lemma will be useful for the calculation of the derivatives of Bernstein polynomials. The derivative of a polynomial of degree is a polynomial of degree . The following lemma shows how to express a Bernstein polynomial of degree in terms of Bernstein polynomials of degree .
Lemma 3.1.
The following formula holds true for :
(10) |
Proof.
The proof is elementary and is based on the following observation:
(11) |
and the rest is a direct calculation involving the normalization factors. β
The following lemma is elementary. The proof is by immediate calculation and use of lemma 3.1.
Lemma 3.2.
The following formula holds true for , where for convenience, for all , :
(12) |
The following corollary follows from direct calculation.
Corollary 3.3.
The local maximum of for is attained at
Proof.
From the formula of the lemma it easily follows that
(13) |
whose only root inside is and the corollary follows. β
3.1.1 Integrals and products
The following lemma concerns the integral of a Bernstein polynomial. It will be useful for the calculation of scalar products.
Lemma 3.4.
The integral of a Bernstein polynomial over depends only on and :
(14) |
Proof.
The proof can proceed by integration by parts. Alternatively, lemma 3.2 implies that, for ,
(15) |
where is the Kronecker delta function, equal to if and otherwise. The conclusion of the lemma follows directly by using the fact that the Bernstein polynomials form a partition of unity, which implies that
(16) |
from which the conclusion follows easily. β
A formula for indefinite integrals of Bernstein polynomials can also be obtained. The proof is by integration by parts and application of lemma 3.9 for .
Lemma 3.5.
It holds true that
(17) |
Proof.
Direct calculation gives
(18) |
so that, all in all,
(19) |
It can be seen directly, either by calculation or by the standing convention that , that
(20) |
so that, for all ,
(21) |
In particular, lemma 3.4 can be obtained by this formula.
The last part follows from the fact that the Bernstein polynomials form a partition of unity. β
The product of two Bernstein polynomials satisfies the following simple rule, the proof of which is left to the reader.
Lemma 3.6.
It holds true that, for and and ,
(22) |
Combining lemmas 3.4 and 3.6 directly yields the following corollary. For notation, the reader is referred to eq. (22) of [KK24].
Corollary 3.7.
The scalar product of two Bernstein polynomials is given by
(23) |
The norm of a Bernstein polynomial is given by
(24) |
A numerical estimation of the behavior of products is given in the following lemma.
Lemma 3.8.
For it holds true that
(25) |
In particular, since
(26) |
one gets directly the following lemma, concerning the moments of the Bernstein polynomials. It will be useful in the estimation of scalar products, using the formula for the scalar product.
Lemma 3.9.
For and ,
(27) |
In the notation of the lemma,
(28) |
is the rising factorial and the following identity was used.
(29) |
The following lemma can now be proven.
Lemma 3.10.
Let and . Then, the scalar product of two Bernstein polynomials is given by
(30) |
The norm of a Bernstein polynomial is given by
(31) |
In statement of the lemma, Kummerβs confluent hypergeometric function is used, which is defined as
(32) |
where the rising factorial is defined in eq. (28), and the integral representation holds in the domain and , which is the case of interest for the lemma.
The proof of lemma uses the power-series expansion of the exponential,
(33) |
lemma 3.6 and its corollary. Both the lemma and its corollary are, of course, special cases of this lemma, obtained by posing (where the formal convention when is used).
Proof.
By definition and by lemmas 3.4, 3.6 and 3.9,
(34) |
The expression using Kummerβs confluent hypergeometric function follows from direct application of the power series definition, or the integral formula and the definition of the Bernstein polynomials. The series converges as it is bounded above by the exponential series. β
The proof of the previous lemma also provides a calculation for the product of an exponential with a Bernstein polynomial and its expression in the Bernstein polynomial basis. It is stated in the form of a lemma for convenience.
Lemma 3.11.
The following formula holds true
(35) |
It is reminded that the rising factorial is defined in eq. (28). The following corollary also follows directly.
Corollary 3.12.
The integral of a Bernstein polynomial multiplied by an exponential is given by
(36) |
and its definite integral by
(37) |
3.2 Polynomial approximation of functions in Sobolev spaces
In the following, theorems and from [CQ82] will be needed. For completeness, the statements and the context of the needed results are presented here below.
In the proof, the Legendre basis for the space of polynomials of degree is more useful than the Bernstein polynomials, due to their property of being orthogonal with respect to the -norm when restricted in the interval . The Legendre polynomials have a number of possible expressions, both implicit and explicit, one of them being
(38) |
The polynomial is of degree and satisfies
(39) |
so that
(40) |
form a complete orthonormal basis for .
For , the Legendre basis is given by
(41) |
and is thus equivalent to the Bernstein basis, obtained by after a transformation of the domain of the (which are defined on ), see [Far00] for a study of the basis transformations.
A function admits the Fourier-Legendre representation
(42) |
where and the orthogonal projection on the space of polynomials of degree is given by
(43) |
With this notation, the following statements hold. They consist in a simple rephrasing of the corresponding statements in the reference [CQ82]. To each statement a corollary is given, obtained just by transforming the interval to the interval , which is of interest in the present article. The proof of the corollaries is omitted, as it follows directly from the transformation of functions and integrals under composition with affine transformations.
Theorem 3.13 (Theorem of [CQ82]).
For any real , there exists a constant such that for any ,
(44) |
Corollary 3.14.
For any real , there exists a constant such that for any ,
(45) |
for all such that .
Theorem 3.15 (Theorem of [CQ82]).
For any real , there exists a constant such that for any ,
(46) |
where
(47) |
Corollary 3.16.
For any real , there exists a constant such that for any ,
(48) |
for all such that .
Finally, the following lemma (lem. of [CQ82]) on the growth of Sobolev norms of polynomials and its corollary will be needed.
Lemma 3.17.
For and a polynomial of degree , i.e. such that , the following holds
(49) |
Corollary 3.18.
For , and a polynomial of degree , i.e. such that , the following holds
(50) |
4 From Legendre to Bernstein basis
In this section, the Legendre polynomials defined on the interval will be used, given by
(51) |
The following formula from [Far00] is a restatement of the proposition of the paper using the notation established herein.
Proposition 4.1 ([Far00], Prop. ).
Let
(52) |
be the expression of a Bernstein polynomial as a linear combination of Lagrange polynomials, where the factor imposes normalization. Then
(53) |
where and .
Using the expression obtained in lem. 3.11, the following corollary is immediate.
Corollary 4.2.
For a Bernstein polynomial multiplied by an exponential, it holds that
(54) |
where
(55) |
Anticipating Β§9, a brief study on the higher-order truncation of Bernstein polynomials in the Legendre basis will be will be presented below. In [Far00], the author discusses the difficulty of simplifying formulas like the one in prop. 4.1 here above. The authors of the present work tried to obtain at least estimates for the decay of for , which would be useful in the error estimates, but no exploitable result was obtained. What was opted for instead was a numerical study covering the range of useful degrees of approximation , which does not exceed due to numerical stability issues.
The following graphs in figures 1, 2 and 3 feature the exponent satisfying
(56) |
so that
(57) |
for , where the homogeneous norm is obtained by lem. 3.6 using
(58) |
The growth is compared with even though the degree of is and therefore its norm growth is governed by powers of , purely for reasons of convenience. This quantity will be relevant in the error rate estimates. The case of a lower degree, is also presented for reasons of comparison, in order to show that lower degree Legendre polynomials do bear a larger part of the norm of the Bernstein polynomials, as is expected from general theory.
A linear model giving an upper bound for as a function of is also given, for illustration purposes. The linear models, fitting the actual data rather tightly, read
(59) |
It is clear from the graphs, as well as from the linear upper bounds, that the lower-order coefficient bears a more significant part of the weight than the higher order coefficients, and that its dependence on is milder, as should be expected from theory.
5 From to , the space
All the above properties are proved for the basis defined on the interval . Since the goal is to obtain an approximate solution defined on , one can translate the functions by for , in order to obtain the functions
(60) |
where . The functions
(61) |
form a basis of the piecewise polynomial functions of degree on , with break-points . In other words, if is the space of such functions, if, and only if, there exist such that
(62) |
It should be noted that the functions in are not well defined at break-points, but since the break-points form a set of measure this is not of concern. The ambiguity in the definition will be lifted rightaway when continuity conditions at break-points are imposed. The dimension of the space is equal to .
Piecewise polynomial functions are in iff they are in . The SDOF equation being a second order equation, the value of the first derivative is an independent variable. As a consequence, it has to be determined by the choice of functions at break-points and not by the algorithm itself. The space in which the solution should be placed is, therefore,
(63) |
A function is iff the function itself and its first derivative are continuous at break-points. In terms of the coefficients , this is equivalent to
(64) |
The space is consequently of dimension , since it is obtained by imposing linearly independent conditions on each break-point other than and on functions in .
Since the acceleration is a dependent variable, no continuity conditions are imposed at break-points. The authors actually tried the version of the algorithm where such conditions were imposed, and the method failed to converge.
6 The first step of the algorithm,
In what follows, is an arbitrary natural number, and so that .
The approximate solution restricted to the interval is thus assumed to have the form
(65) |
where are the unknown coefficients. The function bearing the boundary conditions is defined as
(66) |
Since only the first and last functions bear the boundary conditions, we immediately obtain
(67) |
where is unknown and treated as a parameter.
Consequently, the approximate solution to the weak problem, assumes the form
(68) |
so that
(69) |
Direct application of the method outlined in Β§9 of [KK24] gives the matrix with elements
(70) |
and the column vector with elements
(71) |
for , depending on the parameter .
The linear system
(72) |
where is obtained by drop** the first and last columns of matrix , solves the problem assuming known boundary conditions and thus treating as a parameter. Then, the equation
(73) |
allows to solve for and determine the rest of the coefficients .
In total, if one now treats and as unknown quantities and moves the terms in containing them to the lhs of eq. (72) and considers the full vector of coefficients , they obtain the following linear system
(74) |
where the matrix of known coefficients
(75) |
with formed by the first two columns of matrix , and by the remaining columns of the same matrix. The rhs vector reads
(76) |
or, in a more compact form,
(77) |
where
(78) |
The linear system can be efficiently solved in the following way:
-
1.
the first two equations are decoupled from the rest of the system and can be solved separately. They are already in (lower) triangular form.
-
2.
the first column of the vector multiplied by and its second column multiplied by are subtracted from the vector giving the vector
-
3.
the rest of the coefficients, , are obtained by
(79)
The first step of the algorithm is completed.
7 The remaining steps
One then can calculate the final displacement
(80) |
and final velocity
(81) |
and iterate the algorithm for the desired number of timesteps in the following way.
For the second step, , the approximate solution restricted to the interval and is thus assumed to have the form
(82) |
The initial displacement and velocity are and as calculated here above and the algorithm of the first step applies verbatim. The next steps are carried out in the same fashion.
Remark.
A direct way to parallelize the algorithm using two processors would be the following. The first processor calculates and inverts the matrix , obtaining the matrix that is used throughout the algorithm. The second processor calculates the rhs vector , appending the vector corresponding to the timestep at the end of the vector constructed at the end of the -th timestep, and feeds the result to the first processor, which applies the algorithm for each timestep.
There does not seem to be a way to use more processors for this problem. However, piecemeal computation by the second processor of the vector for each timestep , instead of construction of the entire vector of dimension is a direct improvement in terms of memory usage, and should become necessary when treating large-scale MDOF problems.
8 The case
The case where is the only case where a direct comparison can be made between the method proposed herein with the traditional step-wise methods, as it is the case of an approximation by a quadradic polynomial. In this case, the matrix of eq. (70) is a vector, and all calculations can be made explicit.
8.1 The undamped case
The expressions when , even though still explicit, become cumbersome. The case merits, thus, a separate study for clarity of exposition.
The matrix of eq. (74) reads
(83) |
or, equivalently, already in lower triangular form
(84) |
The rhs vector of the linear system as defined in eq. (76) reads
(85) |
Consequently, the linear system admits the unique solution
(86) |
which yields
(87) |
This results in the final displacement at time , being equal to as implied by eq. (80), i.e. by
(88) |
The final velocity, , as implied by eq. (81), is given by
(89) |
Using these formulas, the following theorem can be proved.
Theorem 8.1.
For , , and , the algorithm proposed in Β§6 boils down to the step-wise iteration of
(90) |
where and are given as initial conditions, and is the external excitation force.
The approximation rates are as follows:
-
1.
For ,
-
(a)
for and ,
(91) -
(b)
for and ,
(92)
-
(a)
-
2.
For and piece-wise constant in each interval :
(93) The constant in the big-O notation is proportional to .
In the notation of the theorem, represents the exact solution of the corresponding SDOF problem,
(94) |
is the mechanical energy of the approximate solution, and is the mechanical energy of the exact solution at time .
Proof.
The iteration is just a restatement of the calculations preceding the statement of the theorem.
Regarding the error rates, they follow from the calculation of the Taylor expansions of the iteration formulas for the corresponding cases.
More precisely, the Taylor expansion of the function
(95) |
around is
(96) |
Similarly, the expansion of
(97) |
is
(98) |
These Taylor expansions account for the good approximation of the displacement in a homogeneous problem, with excitation force and non-trivial initial conditions (displacement and velocity, respectively).
In the same fashion, the expansion of
(99) |
reads
(100) |
which, together with the expansion of eq. (96), accounts for the good approximation of the velocity in the same problem.
Regarding the conservation of Mechanical Energy, the Taylor expansion of
(101) |
is
(102) |
while that of
(103) |
is
(104) |
These approximations establish the good energy-conservation properties of the simplest case of the algorithm proposed herein.
Concerning the terms containing the external force, the following can be obtained by direct calculation. One gets, for ,
(105) |
Since is the Legendre polynomial of order , this last expression is the projection of onto linear polynomials, where it is reminded that is defined modulo a constant. Equivalently, is the projection of onto constants. Replacing in gives
(106) |
The solution of the undamped SDOF system for , homogeneous initial conditions and reads
(107) |
The Taylor development around of
(108) |
reads is
(109) |
which establishes that the method produces a th order approximation under the assumption that the external excitation is constant in . Concerning the velocity, the force term in eq. (89) for reads . This term gives rise to the same Taylor development as in eq. (99), resulting in a rd order approximation for the velocity.
The Taylor development for the Mechanical Energy of the system reads
(110) |
β
8.2 The damped case
In the case where , the matrix of eq. (74) reads
(111) |
where
(112) |
Since for all relevant values of , one obtains directly the formulas for the undamped case when is set to .
The rhs vector of the linear system as defined in eq. (76) reads
(113) |
This yields the solution
(114) |
Explicit expressions for the quantities , can be obtained, but they are cumbersome and not actually useful.
As in the previous paragraph, the displacement and velocity at time can be obtained by the following expressions
(115) |
The following theorem can be now be proved.
Theorem 8.2.
For , , and , the algorithm proposed in Β§6 boils down to the step-wise iteration of
(116) |
where and are given as initial conditions, and is the external excitation force and the are defined in eq. (112).
The approximation rates are as follows:
-
1.
For ,
-
(a)
for and ,
(117) -
(b)
for and ,
(118)
-
(a)
-
2.
For and piece-wise exponential of the form in each interval , with ,:
(119) The constant in the big-O notation depends on .
where represents the exact solution of the corresponding SDOF problem.
In the notation of the theorem,
(120) |
is the modified mechanical energy of the approximate solution, and is the modified mechanical energy of the exact solution at time . The modification is by the exponential factor appearing in the definition of the relevant quantities, see e.g. eq. (70).
Proof.
The iteration is just a restatement of the calculations preceding the statement of the theorem.
Regarding the error rates, they follow from the calculation of the Taylor expansions of the iteration formulas for the corresponding cases. Throughout the proof, has been set to for definiteness. The general case follows by rescaling accordingly. The notation shortcut will be used, while it is reminded that .
Firstly, consider the homogeneous case, .
More precisely, consider the function
(121) |
which is equal to the approximate minus the exact solution for and . Its Taylor expansion around is
(122) |
Similarly, consider the function
(123) |
which is the approximate displacement minus the exact one for and . The expansion of this function reads
(124) |
These Taylor expansions account for the good approximation of the displacement in a homogeneous problem, with excitation force and non-trivial initial conditions (displacement and velocity, respectively).
Similarly, consider the function
(125) |
which is the approximate velocity minus the exact one for and . The expansion of this function reads
(126) |
Finally, the function
(127) |
is the approximate velocity minus the exact one for and . The expansion of this function reads
(128) |
Regarding the conservation of Mechanical Energy for and , the Taylor expansion of the relevant function (which is too cumbersome to write down) is
(129) |
while that of the modified Mechanical Energy reads
(130) |
Turning to the conservation of Mechanical Energy for and , the Taylor expansion of the relevant function (again, too cumbersome to write down) is
(131) |
while that of the modified Mechanical Energy reads
(132) |
These approximations establish the good energy-conservation properties of the simplest case of the algorithm proposed here-in in the homogeneous case.
Concerning the terms containing the external force, the following can be obtained by direct calculation. One gets for , that , as in the undamped case, is projected onto linear polynomials, resulting in being of the form . Equivalently, is the projection of onto functions of the same form . Replacing in gives
(133) |
i.e. the same result as in the homogeneous case.
The solution of the undamped SDOF system for , homogeneous initial conditions and reads
(134) |
The approximate solution in this case yields
(135) |
The Taylor development around of the exact minus the approximate value for the displacement reads
(136) |
while for the velocity it reads
(137) |
Finally, the Taylor development of the difference in the Mechanical Energy reads
(138) |
while that of the modified Mechanical Energy reads
(139) |
β
9 Error estimates
The first step in establishing the estimates for the error of approximation is to obtain relevant density results for the spaces onto which the excitation force is projected.
9.1 Some approximation properties
Firstly, the spaces relevant to approximation, cf. eq.(103) of [KK24], need to be defined in the present context:
(140) |
where
(141) |
The algorithm proposed herein can be summed up as follows. Let a time-step , a polynomial degree of approximation , , and an interval with with be given. Then, the approximate solution of Β§9 of [KK24] is constructed in each subinterval , of length , and the final displacement and velocity at each step will be used as initial displacement and velocity for the next step. Therefore, in order to obtain convergence one can choose between letting , , or eventually both. As a consequence, two types of density results are needed.
From the StoneβWeierstrass approximation theorem (see [Rud76]), the following follows immediately.
Theorem 9.1.
As
(142) |
in the sense of pointwise convergence.
Proof.
The first assertion follows directly from the Stone-Weierstrass theorem, while the second from the fact that is the space of polynomials of degree with zero mean value, since they are derivatives of polynomials of degree satisfying homogeneous boundary conditions. β
Corollary 9.2.
For any fixed ,
(143) |
9.2 The error rate due to the projection error term
The term appears in corollary 9.6 of [KK24], and controls the convergence of the approximation. The following two propositions will allow the estimation of the approximation error. Under the weakest possible assumption that , which is the case if has Dirac- type forces, no rate of convergence can be obtained. Under the relatively week assumption that , an integral of , be , which covers the case where Heaviside excitation function, the following results can be proved. Naturally, additional regularity of improves the rate of convergence of the method.
Proposition 9.3.
Let be the derivative of , a function such that , where . Then,
(144) |
This proposition provides an estimate for the first term in the estimate of corollary 9.6 of [KK24], the error due to projection onto a finite dimensional subsbpace. This error is bounded by
(145) |
for a given excitation function as or . In the special case where is smooth, the rate of convergence is (theoretically) faster than any negative power of the degree of polynomial approximation. Naturally, when the degree is high enough (around in the experiments carried out be the authors), numerical instabilities appear so this is a purely theoretical result.
9.3 The error rate due to the misalignment of subspaces
In the following, the error rates due to the angles appearing in the statement of corollary 9.6 of [KK24] are obtained. The case of an undamped system is examined first, since the expressions are much simpler, and the arguments more transparent. In this section, the norms of the quantities will be understood as the homogeneous norms, equal to
(147) |
which is the relevant norm, since the are defined modulo an integration constant.
Since, by time-reparametrization, can be brought to , with and transforming according to the rule of eq. (42) of [KK24], will be suppressed for simplicity and the standing assumption will be .
The graphs presented in the following paragraphs are obtained using the data available in this json file. Reference to exact values will be avoided for readability, but the interested reader can access the data, stored in the form of a dictionary.
9.3.1 The undamped system, preliminary calculations
Firstly, concerning the integral of the excitation function in the undamped case when the displacement function is a Bernstein polynomial, the following holds.
Lemma 9.4.
Let and , , be given. Then, for ,
(148) |
Proof.
The first item follows directly from lemma 4.2 of [KK24] for and an integration by parts.
The third item follows from the same lemma, using the fact that has degree just like functions in , and functions in are considered modulo their mean value. β
The previous lemma has the following corollary, whose proof follows immediately from lem. 3.5, 3.6 and prop. 4.1.
Corollary 9.5.
It holds true that
(149) |
Consequently,
(150) |
The (symmetric, positive-semi-definite) matrix of scalar products
(151) |
is constructed, and its eigenvectors calculated. This is the matrix of scalar products of forces appearing in the construction of the approximate solution.
The first two eigenvectors correspond to , or very small, eigenvalues. Linear combinations of these eigenvectors construct the solutions to the homogeneous problem, with excitation function and non-trivial initial conditions. The first two co-ordinates of the null-eigenvectors are linearly independent, allowing for a solution of the problems
(152) |
where
(153) |
Then, following the corresponding part of (74), the eigenvectors are expressed in terms of linear combinations of initial conditions resulting in a basis of the nullspace and corresponding to the solutions with unit initial displacement and velocity; and initial displacement and unit velocity, respectively.
Following the numerical study carried out following prop. 4.1, the equivalent study of the projection of
(154) |
as in the corollary here above can now be presented.
9.3.2 The undamped system,
The study is carried out for in a time-scale at which , i.e. for where is the natural period of the system. The scaling properties with respect to will be studied afterwards.
It should be noted that in the undamped case the terms coming from the derivatives are in the kernel of the projection operator , while the integrals are not. Their projection on the orthogonal of the kernel, however, remains very small as the study will establish.
The quantity
(155) |
can be calculated, the subscript standing for βhomogeneousβ, so that
(156) |
The full signature of the function actually reads so that for brevity in notation. In other terms, the quantity of prop. 9.5 of [KK24] satisfies
(157) |
The graph in fig. 4 plots an estimate of . The estimate was obtained by randomly sampling initial conditions and kee** the maximum angle over these samples. All null eigenvalues were smaller than (in fact much smaller, but a threshold was set in order to assure smallness).
It can be seen that the point of diminishing returns is reached at around , where . For the corresponding value is . The rise in the exponent is arguably due to numerical instabilities and the diminishing returns point could be pushed further by using more sophisticated numerical analysis machinery. The broken-line model is given by
(158) |
and it is quite pessimistic in the first leg. In any case, for all relevant values of , .
This is the worst-case scenario for the convergence of the algorithm. A range of observed rates of convergence can also be visualized by plotting the exponent of the mean value of the projection angle over the samples, as well as the best-case scenario. The exponent corresponding to the mean value of the angle is denoted by , and the best-case scenario by . They are plotted in fig. 5, where it can be seen that the mean value closely tracks the worst-case exponent.
A significant spectral gap separates the two null eigenvectors with the rest of the eigenvectors, whose linear combinations construct the solutions to the non-homogeneous problem, with non-trivial excitation function. Subtraction of the corresponding multiples of from imposes initial conditions, and the eigenvectors obtained in this way are denoted by . It should be noted that is still a basis of eignvectors, only not orthogonal. Then, the quantity
(159) |
can be calculated, so that
(160) |
so that the angle of prop. 9.5 of [KK24] satisfies
(161) |
The sine of the angle, instead of the tangent, is calculated for purely practical reasons, as dividing with the hypotenuse is numerically more stable. In the limit , which is the interesting case, the quantities are equal for all practical reasons. For a given approximate solution in , the quantity expresses the proportion of the force that contributes to the error term.
The graph in fig. 6 plots an estimate of . The estimate was obtained by randomly sampling linear combinations of vectors in and getting the maximum angle over these samples. The spectral gap condition was that the minimal non-null eigenvalue be at least times greater than the biggest null eigenvalue. The spectral gap is actually a lot bigger than orders of magnitude, but, as before, a threshold was set in order to assure a sufficient spectral gap.
In the non-homogeneous problem, no point of diminishing returns is observed, which corroborates the hypothesis of not sufficiently strong numerical analysis machinery for treating the null eigenvalues that are (very close to) . The linear model is given by
(162) |
and it is also a bit pessimistic in the first leg. In any case, for all relevant values of , one still has that . It can be remarked that, already for the exponent is negative, unlike for the homogeneous problem. However, the factor is merely of the order of , not enough to produce an acceptable error rate.
As for the homogeneous problem, the best-case scenario angle and the exponent of the mean angle are plotted in fig. 7, showing that the worst-case scenario tends to be more pessimistic in the homogeneous than in the non-homogeneous case, with the exponent of the mean angle sitting farther than the worst case angle.
9.3.3 The undamped system, dependence on
In this section, the dependence of the angles of projection on the timestep parameter will be studied. The scaling of all quantities involved is highly non-trivial, since the derivative scales like , the integral like , and the products of derivatives and integrals are constant with respect to .
It should be expected that for small the dominant term come from the derivative, leading to small angles since . For intermediate values of , all terms should contribute roughly in par, while for larger the integral should start to dominate, leading to greater angles, since integrals are not in the kernel of the projection operator . The products of derivatives and integrals have complicated contribution to the norm of the force, since the derivative can have either sign, while the integrals are always .
As for , the same number of points are sampled, kee** the worst exponent The same thresholds for null eigenvalues an the spectral gap. Timesteps were sampled in the range from up to , i.e. in the range of to of the natural period of the system with and .
The function is plotted in figs. 8 and 9 for values of and small, resp. large values of , and in fig. 10 for values of and large values of . The split in was necessary in order to avoid division by at , and the split in for clarity.
The graph of fig. 8, featuring values of in the range , shows a very fast rate of convergence, as all exponents are consistently , which is an added factor of convergence on top of the factor , c.f. eq. (164). The graph of fig. 9, featuring , shows a more complicated behavior. While for the exponent is positive, leading to a faster convergence, smaller slows down convergence for . The latter remains fast, nonetheless, since, say
(165) |
The convergence for is rather indifferent to , if only slightly accelerated by small values of , with exponents in the range .
The graph of fig. 10 shows equally a non-monotonous behavior of the exponent for . The graph features only high values of , namely , since for low values of timesteps greater than the natural period of the system will not produce a small error rate. Already for convergence can remain fast for large values of , since for ,
(166) |
Convergence is faster for , while the dependence on gets milder for .
Concerning the non-homogeneous problem, following eq. (159), the quantity
(167) |
can be calculated, so that
(168) |
The function is plotted in figs. 11, 12, and 13, with the same parameters as for the homogeneous problem.
As show fig. 11, for low-order polynomial approximation and smaller than of the natural period, the smallness of the timestep contributes to the rate of convergence with an exponent ranging between linear and quadratic, and getting better as . As becomes comparable with the natural period of the system, the corresponding factor never eats too much on the factor produced by , but ceases to contribute.
The same picture holds for higher order polynomial approximation, if only for slightly larger values of , as can be seen in fig. 12. It is reminded that the factor is, in this case, orders of magnitute smaller, so that the effect of the negative exponent for close to is negligible.
Moving on to timesteps greater than the natural period of the system, it can be seen in fig. 13 that for the factor does contribute to the convergence rate, since the exponent is negative. Convergence becomes practically indifferent to the value of the timestep for since the exponent remains negative, but is very close to .
These graphs suggest that mixed approximation schemes, using different degrees of polynomial approximation for the homogeneous and non-homogeneous part, are conceivable, but are outside the scope of the present work.
9.3.4 The damped system, preliminary calculations
Concerning the integral of the excitation function in the damped case when the displacement function is a Bernstein polynomial, the following can be obtained.
Lemma 9.6.
Let and , , be given. Then, for ,
(169) |
The lemma just states the definitions of the relevant quantities. Closed forms involving the hypergeometric function can be obtained, using lemmas 3.10 and 3.11, and cor. 3.12. They are, however, very cumbersome and not easy to manipulate.
The statement corresponding to cor. 9.5 also becomes difficult to write down. What can be said about the projection is that multiplication by the exponential leaks mass from the derivative into the orthogonal of the kernel of the projection operator , weighed by powers of .
At the linear level, , and this results in
(170) |
being non-trivial, since is of order , c.f. also lem. 3.9. In the same way, the presence of the exponential factor leaks more mass of the integral from the kernel of the projection operator to its orthogonal. In both cases, mass is added both in the kernel and its orthogonal, and the trade-off seems impossible to analyze accurately.
The same procedure as in the undamped case can be followed in order to obtain numerical results concerning the matrix of the products of forces.
9.3.5 The effect of dam**,
The first part of the study is performed by kee** , constant, and studying how the curve of 4 changes as dam** is switched on. As explained in the previous paragraph, the dependence of the involved quantities on are very complicated, so, once again the authors are restricted to a numerical study of the phenomenon.
To this end and regarding the homogeneous problem, the quantity of eq. (163) is calculated, taking into account the dependence of on the dam** coefficient . The result is plotted in figs. 14, 15, 16, and 17. The graphs are split for and for for clarity.
The figures show that, in general, the projection angle deteriorates as grows. As in the undamped case, the part of the graph that is monotonously increasing for with held constant can probably be attributed to issues in the numerical calculation of null eigenvectors.
The dominant effect of dam** is to increase the mass in the kernel of the projection operator faster than in the orthogonal of the kernel. The effect becomes weaker as increases, leading to the inverse phenomenon for and small values of as can be seen in graph 16.
Overall, the introduction of dam** slows down the rate of convergence, since for all it holds that
(171) |
with the losses being bigger for small .
The corresponding quantity for the non-homogeneous problem, is also calculated, as defined in eq. (167), with switched on. The exponent is plotted in figs. 18, 19, 20, and 21, with the same split for and for , for the sake of clarity.
The figures show that the effect of very small dam** can be in the direction of increasing the rate of convergence, but the overall tendency is to decrease the rate. For all it holds that
(172) |
which implies that the approximation of the non-homogeneous problem is less severely affected by the introduction of dam** than the homogeneous one.
9.3.6 The effect of dam**, dependence on
Finally, the dependence on is studied, with dam** switched on. Once again, the scaling rule for the projection angle seems impossible to analyze with means other than a numerical study.
Thus, the exponent is plotted in the following figures, for a fixed value of in each one. It is the quantity of eq. (163), which now reads
(173) |
with all relevant parameters being variable, so that
(174) |
For , the exponent is plotted in fig. 22. Contrary to what was observed for , there is a clear monotonicity, with the exponent increasing sharply and steadily as increases. This effect counteracts sufficiently the loss of convergence due to the deterioration of as increases, since, for example,
(175) |
while
(176) |
where it is reminded that the value represents the timestep as a fraction of the natural period of the system. The same calculation for yields, respectively, for the undamped system, and for .
The corresponding graph for can be found in fig. 23, where the same phenomenon as for can be observed.
The pattern begins to change for higher order approximations, as can be seen in figs. 24 and 25, featuring the curves for . The monotonic gains are clear for timesteps up to of the natural period of the system, where a tip** point is reached and monotonicity breaks for stong dam** coefficients. The exponent of the timestep remains more favorable, nonetheless, for damped systems relative to the undamped case.
Finally, for and timesteps smaller than the natural period the exponent is plotted in fig. 26, while for timesteps ranging from up to , in fig. 27.
In the case of timesteps smaller than the natural period of the system, dam** leads in general to gains in convergence speed, while for timesteps greater than the period the picture is more complicated, as there is no clear monotonicity except for the higher part of the interval of dam** coefficients studied.
Turning to the non-homogeneous problem now, the quantity
(177) |
can be calculated, so that
(178) |
As before, the results are presented for fixed , varying and in figs. 28, 29, 30, 31.
In this case, the monotonicity is reversed, with convergence speed deteriorating for all relevant values of the timestep. The dependence of on becomes milder as increases for all relevant values of . This leads to the following, considerable, deterioration of the convergence rate when passing from to for and :
(179) |
while
(180) |
For a timestep the quantities read, respectively, and .
Finally, for and timesteps smaller than the natural period of the system, fig. 32 shows that in general dam** results in a slower congergence rate, which is practically indifferent to except for small values of and . Fig. 33 shows that in the case of timesteps greater than the natural period, convergence is also practically indifferent to , with all exponents close to .
9.3.7 The aggregate convergence factor
In this final section on the error due to the misalignment of spaces, the aggregate factor is calculated, i.e. the quantity
(181) |
for . This is necessary due to the convoluted dependence of the exponents on and , where it is reminded that the dependence on is the expected one, i.e. decreasing in a practically monotonous way. All graphs are logarithmic in order to accommodate the large range of values of the convergence factor.
9.4 The error at the end of the timestep
More precise estimates can be be obtained for the error of approximation at time , the timestep of the method, both for the homogeneous and the non-homogeneous problem, thanks to the estimates obtained in [KK24], in prop. 9.9 and 9.12 and the calculations in the proof of lem. 4.4, and more precisely eq. (38) and (39). These calculations have as a direct consequence the estimates of propositions 9.9 and 9.12, i.e.
(184) |
These expressions are significant because they provide better bounds for the error propagation from one timestep to the next. This is because the final displacement and velocity of one timestep are the initial conditions for the next one, and an error in these initial conditions propagates through the exact solution of the homogeneous SDOF problem.
Proposition 9.7.
The error of approximation for the displacement and the velocity at time satisfy the following estimates
(185) |
In the statement of the proposition, the functions and are defined in eq. (181) and have been studied in Β§9.3.7.
Proof.
By prop. 9.7 of [KK24], it holds that if is the integral of the force corresponding to the SDOF problem with homogeneous initial conditions and is the solution. Application of this inequality in the case of the force absorbing the initial conditions and the force corresponding to the error of approximation gives
(186) |
The remaining factors of the estimate on the final displacement are obtained by direct application of the definitions of the related quantities on eq. (184), using cor. 9.6 of [KK24].
Regarding the velocity, eq. (184) yields directly the estimate by substitution of the estimate for the displacement and the estimate for the force used in the estimate of the displacement. β
9.5 Error propagation
Proposition 9.7 concludes the error estimates for the algorithm proposed herein. This proposition produces the rate of the propagation of the error, since the final displacement and velocity of timestep are the initial conditions of timestep , and the error is accumulated by the error of approximation of the homogeneous problem. To this error, the error due to the projection of the external force, if the latter is non-zero, is added.
Thus, for initial conditions and , and external force , the error of displacement at time is given directly by the estimates of prop. 9.7. The errors for times is given by prop. 9.9 of [KK24], namely
(187) |
while for the velocity, by prop. 9.12
(188) |
If , are the displacement and velocity of the approximate solution at time , then is given by the same formula of prop. 9.7, plus a term
(189) |
due to the error propagation, similarly for the velocity.
Higher regularity for results in bounds for the velocity as detailed in thm. 9.16 of [KK24], which can be seen by taking one derivative of the equation of motion and applying the same estimates on the velocity instead of the displacement.
10 Conclusions
Even though direct comparison with traditional step-wise methods such as the Newmark method is not easy to make, due to the fundamental difference in the way the methods are built, the advantage of working with the weak formulation and then deriving a numerical method can already be seen in the study of the convergence of the method.
From the estimates it is already clear that, if the order of polynomial approximation is sufficiently high, timesteps comparable to the eigenperiod of the system can potentially give competitive results. This is known to be outside the scope of traditional timestep methods.
Naturally, increasing the order of polynomial approximation comes with additional computational overhead, but on the other hand the number of iterations may become smaller without hindering the precision of the method. The assessment of the trade-off, even though of crucial importance for applications, goes beyond the scope of the present work.
The numerical evidence illustrating the quality of the convergence of the method will be presented in part III of the paper.
References
- [dB78] C.Β deΒ Boor, A practical guide to splines, Applied Mathematical Sciences 27, Springer-Verlag, New York-Berlin, 1978. MRΒ 507062.
- [CQ82] C.Β Canuto and A.Β Quarteroni, Approximation results for orthogonal polynomials in sobolev spaces, Mathematics of Computation 38 (1982), 67β86.
- [Dah56] G.Β Dahlquist, Convergence and stability in the numerical integration of ordinary differential equations, MATHEMATICA SCANDINAVICA 4 (1956), 33β53. http://dx.doi.org/10.7146/math.scand.a-10454. Available at http://www.mscand.dk/article/view/10454.
- [Far00] R.Β T. Farouki, Legendreβbernstein basis transformations, Journal of Computational and Applied Mathematics 119 (2000), 145β160. http://dx.doi.org/https://doi.org/10.1016/S0377-0427(00)00376-9. Available at https://www.sciencedirect.com/science/article/pii/S0377042700003769.
- [Kac38] M.Β Kac, Une remarque sur les polynomes de ms bernstein, Studia Mathematica 7 (1938), 49β51.
- [KK24] N.Β Karaliolios and D.Β L. Karabalis, The weak form of the SDOF and MDOF equation of motion, part I: Theory, 2024. arXivΒ 2407.00441.
- [Rud76] W.Β Rudin, Principles of mathematical analysis, 3d ed. ed., McGraw-Hill New York, 1976 (English). Available at http://www.loc.gov/catdir/toc/mh031/75017903.html.