-
Polycatenated Architected Materials
Authors:
Wenjie Zhou,
Sujeeka Nadarajah,
Liuchi Li,
Anna G. Izard,
Aashutosh K. Prachet,
Payal Patel,
Hujie Yan,
Xiaoxing Xia,
Chiara Daraio
Abstract:
Architected materials derive their properties from the geometric arrangement of their internal structural elements, rather than solely from their chemical composition. They can display remarkable behaviors such as high strength while being lightweight, negative Poisson's ratios, and shear-normal coupling. However, architected materials so far have either exhibited solid-like or fluid-like behavior…
▽ More
Architected materials derive their properties from the geometric arrangement of their internal structural elements, rather than solely from their chemical composition. They can display remarkable behaviors such as high strength while being lightweight, negative Poisson's ratios, and shear-normal coupling. However, architected materials so far have either exhibited solid-like or fluid-like behavior, but not both. Here, we introduce a class of materials that consist of linked particles assembled in three-dimensional domains, forming polycatenated architected materials (PAMs). We propose a general framework for PAMs that translates arbitrary crystalline networks into particles' concatenations and design particles' geometry. The resulting materials are cohesive, yet the individual particles retain some kinematic freedom. In response to small external loads, PAMs behave like non-Newtonian fluids, showing both shear-thinning and shear-thickening responses. At larger strains, PAMs behave like solids, showing a nonlinear stress-strain relation, like lattices and foams. These responses are regulated by a jamming transition determined by the particles' arrangement and the direction of loading. PAMs are scalable, showing comparable mechanical responses at both millimeter- and micrometer-scales. However, micro-PAMs can change shape in response to electrostatic charges. PAM's properties are relevant for develo** stimuli-responsive materials, energy-absorbing systems and morphing architectures.
△ Less
Submitted 1 June, 2024;
originally announced June 2024.
-
Infinite Divisibility of the Product of Two Correlated Normal Random Variables and Exact Distribution of the Sample Mean
Authors:
Robert E. Gaunt,
Saralees Nadarajah,
Tibor K. Pogány
Abstract:
We prove that the distribution of the product of two correlated normal random variables with non-zero means and arbitrary variances is infinitely divisible. We also obtain exact formulas for the probability density function of the sum of independent copies of such random variables.
We prove that the distribution of the product of two correlated normal random variables with non-zero means and arbitrary variances is infinitely divisible. We also obtain exact formulas for the probability density function of the sum of independent copies of such random variables.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
Adjoint-based goal-oriented implicit shock tracking using full space mesh optimization
Authors:
Pranshul Thakur,
Siva Nadarajah
Abstract:
Solutions to the governing partial differential equations obtained from a discrete numerical scheme can have significant errors, especially near shocks when the discrete representation of the solution cannot fully capture the discontinuity in the solution. A recent approach to shock tracking [1, 2] has been to implicitly align the faces of mesh elements with the shock, yielding accurate solutions…
▽ More
Solutions to the governing partial differential equations obtained from a discrete numerical scheme can have significant errors, especially near shocks when the discrete representation of the solution cannot fully capture the discontinuity in the solution. A recent approach to shock tracking [1, 2] has been to implicitly align the faces of mesh elements with the shock, yielding accurate solutions on coarse meshes. In engineering applications, the solution field is often used to evaluate a scalar functional of interest, such as lift or drag over an airfoil. While functionals are sensitive to errors in the flow solution, certain regions in the domain are more important for accurate evaluation of the functional than the rest. Using this fact, we formulate a goal-oriented implicit shock tracking approach that captures a segment of the shock that is important for evaluating the functional. Shock tracking is achieved using Lagrange-Newton-Krylov-Schur (LNKS) full space optimizer, with the objective of minimizing the adjoint-weighted residual error indicator. We also present a method to evaluate the sensitivity and the Hessian of the functional error. Using available block preconditioners for LNKS [3, 4] makes the full space approach scalable. The method is applied to test cases of two-dimensional advection and inviscid compressible flows to demonstrate functional-dependent shock tracking. Tracking the entire shock without using artificial dissipation results in the error converging at the orders of $\mathcal{O}(h^{p+1})$.
△ Less
Submitted 1 May, 2024;
originally announced May 2024.
-
Discretely Nonlinearly Stable Weight-Adjusted Flux Reconstruction High-Order Method for Compressible Flows on Curvilinear Grids
Authors:
Alexander Cicchino,
Siva Nadarajah
Abstract:
Provable nonlinear stability bounds the discrete approximation and ensures that the discretization does not diverge. For high-order methods, discrete nonlinear stability and entropy stability, have been successfully implemented for discontinuous Galerkin (DG) and residual distribution schemes, where the stability proofs depend on properties of L2-norms. In this paper, nonlinearly stable flux recon…
▽ More
Provable nonlinear stability bounds the discrete approximation and ensures that the discretization does not diverge. For high-order methods, discrete nonlinear stability and entropy stability, have been successfully implemented for discontinuous Galerkin (DG) and residual distribution schemes, where the stability proofs depend on properties of L2-norms. In this paper, nonlinearly stable flux reconstruction (NSFR) schemes are developed for three-dimensional compressible flow in curvilinear coordinates. NSFR is derived by merging the energy stable FR (ESFR) framework with entropy stable DG schemes. NSFR is demonstrated to use larger time-steps than DG due to the ESFR correction functions. NSFR differs from ESFR schemes in the literature since it incorporates the FR correction functions on the volume terms through the use of a modified mass matrix. We also prove that discrete kinetic energy stability cannot be preserved to machine precision for quadrature rules where the surface quadrature is not a subset of the volume quadrature. This paper also presents the NSFR modified mass matrix in a weight-adjusted form. This form reduces the computational cost in curvilinear coordinates through sum-fcatorization and low-storage techniques. The nonlinear stability properties of the scheme are verified on a nonsymmetric curvilinear grid for the inviscid Taylor-Green vortex problem and the correct orders of convergence were obtained for a manufactured solution. Lastly, we perform a computational cost comparison between conservative DG, overintegrated DG, and our proposed entropy conserving NSFR scheme, and find that our proposed entropy conserving NSFR scheme is computationally competitive with the conservative DG scheme.
△ Less
Submitted 12 December, 2023;
originally announced December 2023.
-
On Pólya's random walk constants
Authors:
Robert E. Gaunt,
Saralees Nadarajah,
Tibor K. Pogány
Abstract:
A celebrated result in probability theory is that a simple symmetric random walk on the $d$-dimensional lattice $\mathbb{Z}^d$ is recurrent for $d=1,2$ and transient for $d\geq 3$. In this note, we derive a closed-form expression, in terms of the Lauricella function $F_C$, for the return probability for all $d\geq3$. Previously, a closed-form formula had only been available for $d=3$.
A celebrated result in probability theory is that a simple symmetric random walk on the $d$-dimensional lattice $\mathbb{Z}^d$ is recurrent for $d=1,2$ and transient for $d\geq 3$. In this note, we derive a closed-form expression, in terms of the Lauricella function $F_C$, for the return probability for all $d\geq3$. Previously, a closed-form formula had only been available for $d=3$.
△ Less
Submitted 6 March, 2024; v1 submitted 19 November, 2023;
originally announced November 2023.
-
An L2-Error Estimate of Energy Stable Flux Reconstruction Method
Authors:
Erwan Lambert,
Siva Nadarajah
Abstract:
Energy stable flux reconstruction (ESFR) is a high-order numerical method used for solving partial differential equations in computational fluid dynamics. This method is designed to preserve the energy stability of the underlying partial differential equation system with respect to a broken Sobolev norm. A class of one-parameter ESFR schemes has been identified to be stable for the one-dimensional…
▽ More
Energy stable flux reconstruction (ESFR) is a high-order numerical method used for solving partial differential equations in computational fluid dynamics. This method is designed to preserve the energy stability of the underlying partial differential equation system with respect to a broken Sobolev norm. A class of one-parameter ESFR schemes has been identified to be stable for the one-dimensional linear advection equation. This class includes some well-known high-order methods such as the discontinuous Galerkin method and spectral difference method. The main advantage of the energy stable flux reconstruction is to allow for an increase in the maximum admissible time step while retaining the stability and accuracy properties of the underlying scheme. However numerical experiments have shown that beyond a certain value of the parameter, the optimal order of accuracy is lost. This article develops an L2-error estimate for the energy stable flux reconstruction scheme applied to the one-dimensional advection equation and demonstrates the exact expression that contributes towards the loss of the optimal order.
△ Less
Submitted 7 September, 2023;
originally announced September 2023.
-
Scalable Evaluation of Hadamard Products with Tensor Product Basis for Entropy-Stable High-Order Methods
Authors:
Alexander Cicchino,
Siva Nadarajah
Abstract:
A sum-factorization form for the evaluation of Hadamard products with a tensor product basis is derived in this work. The proposed algorithm allows for Hadamard products to be computed in $\mathcal{O}\left(n^{d+1}\right)$ flops rather than $\mathcal{O}\left(n^{2d}\right)$, where $d$ is the dimension of the problem. With this improvement, entropy conserving and stable schemes, that require a dense…
▽ More
A sum-factorization form for the evaluation of Hadamard products with a tensor product basis is derived in this work. The proposed algorithm allows for Hadamard products to be computed in $\mathcal{O}\left(n^{d+1}\right)$ flops rather than $\mathcal{O}\left(n^{2d}\right)$, where $d$ is the dimension of the problem. With this improvement, entropy conserving and stable schemes, that require a dense Hadamard product in the general modal case, become computationally competitive with the modal discontinuous Galerkin (DG) scheme. We numerically demonstrate the application of the sum-factorized Hadamard product in our in-house partial differential equation solver PHiLiP based on the Nonlinearly Stable Flux Reconstruction scheme. We demonstrate that the entropy conserving flow solver scales at $\mathcal{O}\left(n^{d+1}\right)$ for three-dimensional compressible flow in curvilinear coordinates, along with a computational cost comparison with the modal DG and over-integrated DG schemes.
△ Less
Submitted 20 June, 2023;
originally announced June 2023.
-
Uniform convergence rates of skew-normal extremes
Authors:
Qian Xiong,
Zuoxiang Peng,
Saralees Nadarajah
Abstract:
Let $M_n=\max \left(X_1, X_2, \ldots, X_n \right)$ denote the partial maximum of an independent and identically distributed skew-normal random sequence. In this paper, the rate of uniform convergence of skew-normal extremes is derived. It is shown that with optimal normalizing constants the convergence rate of $\left(M_{n}-b_n\right)/a_n$ to its ultimate extreme value distribution is proportional…
▽ More
Let $M_n=\max \left(X_1, X_2, \ldots, X_n \right)$ denote the partial maximum of an independent and identically distributed skew-normal random sequence. In this paper, the rate of uniform convergence of skew-normal extremes is derived. It is shown that with optimal normalizing constants the convergence rate of $\left(M_{n}-b_n\right)/a_n$ to its ultimate extreme value distribution is proportional to $1/\log n$.
△ Less
Submitted 17 February, 2023;
originally announced February 2023.
-
Provably Stable Flux Reconstruction High-Order Methods on Curvilinear Elements
Authors:
Alexander Cicchino,
David C. Del Rey Fernández,
Siva Nadarajah,
Jesse Chan,
Mark H. Carpenter
Abstract:
Provably stable flux reconstruction (FR) schemes are derived for partial differential equations cast in curvilinear coordinates. Specifically, energy stable flux reconstruction (ESFR) schemes are considered as they allow for design flexibility as well as stability proofs for the linear advection problem on affine elements. Additionally, split forms are examined as they enable the development of en…
▽ More
Provably stable flux reconstruction (FR) schemes are derived for partial differential equations cast in curvilinear coordinates. Specifically, energy stable flux reconstruction (ESFR) schemes are considered as they allow for design flexibility as well as stability proofs for the linear advection problem on affine elements. Additionally, split forms are examined as they enable the development of energy stability proofs. The first critical step proves, that in curvilinear coordinates, the discontinuous Galerkin (DG) conservative and non-conservative forms are inherently different--even under exact integration and analytically exact metric terms. This analysis demonstrates that the split form is essential to develo** provably stable DG schemes on curvilinear coordinates and motivates the construction of metric dependent ESFR correction functions in each element. Furthermore, the provably stable FR schemes differ from schemes in the literature that only apply the ESFR correction functions to surface terms or on the conservative form, and instead incorporate the ESFR correction functions on the full split form of the equations. It is demonstrated that the scheme is divergent when the correction functions are only used for surface reconstruction in curvilinear coordinates. We numerically verify the stability claims for our proposed FR split forms and compare them to ESFR schemes in the literature. Lastly, the newly proposed provably stable FR schemes are shown to obtain optimal orders of convergence. The scheme loses the orders of accuracy at the equivalent correction parameter value c as that of the one-dimensional ESFR scheme.
△ Less
Submitted 23 September, 2021;
originally announced September 2021.
-
Offline-Online Reinforcement Learning for Energy Pricing in Office Demand Response: Lowering Energy and Data Costs
Authors:
Doseok Jang,
Lucas Spangher,
Manan Khattar,
Utkarsha Agwan,
Selvaprabuh Nadarajah,
Costas Spanos
Abstract:
Our team is proposing to run a full-scale energy demand response experiment in an office building. Although this is an exciting endeavor which will provide value to the community, collecting training data for the reinforcement learning agent is costly and will be limited. In this work, we examine how offline training can be leveraged to minimize data costs (accelerate convergence) and program impl…
▽ More
Our team is proposing to run a full-scale energy demand response experiment in an office building. Although this is an exciting endeavor which will provide value to the community, collecting training data for the reinforcement learning agent is costly and will be limited. In this work, we examine how offline training can be leveraged to minimize data costs (accelerate convergence) and program implementation costs. We present two approaches to doing so: pretraining our model to warm start the experiment with simulated tasks, and using a planning model trained to simulate the real world's rewards to the agent. We present results that demonstrate the utility of offline reinforcement learning to efficient price-setting in the energy demand response problem.
△ Less
Submitted 14 August, 2021;
originally announced August 2021.
-
Nonlinearly Stable Flux Reconstruction High-Order Methods in Split Form
Authors:
Alexander Cicchino,
Siva Nadarajah,
David C. Del Rey Fernández
Abstract:
The flux reconstruction (FR) method has gained popularity in the research community as it recovers promising high-order methods through modally filtered correction fields, such as the discontinuous Galerkin method, amongst others, on unstructured grids over complex geometries. Moreover, FR schemes, specifically energy stable FR (ESFR) schemes also known as Vincent-Castonguay-Jameson-Huynh schemes,…
▽ More
The flux reconstruction (FR) method has gained popularity in the research community as it recovers promising high-order methods through modally filtered correction fields, such as the discontinuous Galerkin method, amongst others, on unstructured grids over complex geometries. Moreover, FR schemes, specifically energy stable FR (ESFR) schemes also known as Vincent-Castonguay-Jameson-Huynh schemes, have proven attractive as they allow for design flexibility as well as stability proofs for the linear advection problem on affine elements. Additionally, split forms have recently seen a resurgence in research activity due to their resultant nonlinear (entropy) stability proofs. This paper derives for the first time nonlinearly stable ESFR schemes in split form that enable nonlinear stability proofs for, uncollocated, modal, ESFR split forms with different volume and surface cubature nodes. The critical enabling technology is applying the splitting to the discrete stiffness operator. This naturally leads to appropriate surface and numerical fluxes, enabling both entropy stability and conservation proofs. When these schemes are recast in strong form, they differ from schemes found in the ESFR literature as the ESFR correction functions are incorporated on the volume integral. Furthermore, numerical experiments are conducted verifying that the new class of proposed ESFR split forms is nonlinearly stable in contrast to the standard split form ESFR approach. Lastly, the new ESFR split form is shown to obtain the correct orders of accuracy.
△ Less
Submitted 3 March, 2021;
originally announced March 2021.
-
Full-Space Approach to Aerodynamic Shape Optimization
Authors:
Doug Shi-Dong,
Siva Nadarajah
Abstract:
Aerodynamic shape optimization (ASO) involves finding an optimal surface while constraining a set of nonlinear partial differential equations (PDE). The conventional approaches use quasi-Newton methods operating in the reduced-space, where the PDE constraints are eliminated at each design step by decoupling the flow solver from the optimizer. Conversely, the full-space Lagrange-Newton-Krylov-Schur…
▽ More
Aerodynamic shape optimization (ASO) involves finding an optimal surface while constraining a set of nonlinear partial differential equations (PDE). The conventional approaches use quasi-Newton methods operating in the reduced-space, where the PDE constraints are eliminated at each design step by decoupling the flow solver from the optimizer. Conversely, the full-space Lagrange-Newton-Krylov-Schur (LNKS) approach couples the design and flow iteration by simultaneously minimizing the objective function and improving feasibility of the PDE constraints, which requires less iterations of the forward problem. Additionally, the use of second-order information leads to a number of design iterations independent of the number of control variables. We discuss the necessary ingredients to build an efficient LNKS ASO framework as well as the intricacies of their implementation. The LNKS approach is then compared to reduced-space approaches on a benchmark two-dimensional test case using a high-order discontinuous Galerkin method to discretize the PDE constraint.
△ Less
Submitted 26 November, 2020;
originally announced November 2020.
-
Self-adapting Robustness in Demand Learning
Authors:
Boxiao Chen,
Selvaprabu Nadarajah,
Parshan Pakiman,
Stefanus Jasin
Abstract:
We study dynamic pricing over a finite number of periods in the presence of demand model ambiguity. Departing from the typical no-regret learning environment, where price changes are allowed at any time, pricing decisions are made at pre-specified points in time and each price can be applied to a large number of arrivals. In this environment, which arises in retailing, a pricing decision based on…
▽ More
We study dynamic pricing over a finite number of periods in the presence of demand model ambiguity. Departing from the typical no-regret learning environment, where price changes are allowed at any time, pricing decisions are made at pre-specified points in time and each price can be applied to a large number of arrivals. In this environment, which arises in retailing, a pricing decision based on an incorrect demand model can significantly impact cumulative revenue. We develop an adaptively-robust-learning (ARL) pricing policy that learns the true model parameters from the data while actively managing demand model ambiguity. It optimizes an objective that is robust with respect to a self-adapting set of demand models, where a given model is included in this set only if the sales data revealed from prior pricing decisions makes it "probable". As a result, it gracefully transitions from being robust when demand model ambiguity is high to minimizing regret when this ambiguity diminishes upon receiving more data. We characterize the stochastic behavior of ARL's self-adapting ambiguity sets and derive a regret bound that highlights the link between the scale of revenue loss and the customer arrival pattern. We also show that ARL, by being conscious of both model ambiguity and revenue, bridges the gap between a distributionally robust policy and a follow-the-leader policy, which focus on model ambiguity and revenue, respectively. We numerically find that the ARL policy, or its extension thereof, exhibits superior performance compared to distributionally robust, follow-the-leader, and upper-confidence-bound policies in terms of expected revenue and/or value at risk.
△ Less
Submitted 20 November, 2020;
originally announced November 2020.
-
Numerical Dissipation Based Error Estimators and Grid Adaptation for Large Eddy Simulation
Authors:
Yao Jiang,
Siva Nadarajah
Abstract:
Grid adaptation for implicit Large Eddy Simulation (LES) is a non-trivial challenge due to the inherent coupling of the modeling and numerical errors. An attempt to address the challenge first requires a comprehensive assessment and then the development of error estimators to highlight regions that require refinement. Following the work of Schranner et al., a novel approach to estimate the numeric…
▽ More
Grid adaptation for implicit Large Eddy Simulation (LES) is a non-trivial challenge due to the inherent coupling of the modeling and numerical errors. An attempt to address the challenge first requires a comprehensive assessment and then the development of error estimators to highlight regions that require refinement. Following the work of Schranner et al., a novel approach to estimate the numerical dissipation of the turbulent kinetic energy (TKE) equations is proposed. The presented approach allows the computation of the local numerical dissipation for arbitrary curvilinear grids through a post-processing procedure. This method, as well as empirical and kinetic-energy-based approaches, are employed to estimate the inherent numerical TKE. We incorporate the numerical TKE to evaluate an effective eddy viscosity, an effective Kolmogorov length scale, and an effective TKE to build a family of Index Quality (IQ) based error estimators. The proposed IQ based estimators are then assessed and utilized to show their effectiveness through an application of grid adaptation for the periodic hill test case and transitional flow over the SD 7003 airfoil. Numerical results are validated through a comparison against reference LES and experimental data. Flow over the adapted grids appear better abled to capture pertinent flow features and integrated functions, such as the lift and drag coefficients.
△ Less
Submitted 6 November, 2020;
originally announced November 2020.
-
A new robust class of skew elliptical distributions
Authors:
H. Kwong,
S. Nadarajah
Abstract:
A new robust class of multivariate skew distributions is introduced. Practical aspects such as parameter estimation method of the proposed class are discussed, we show that the proposed class can be fitted under a reasonable time frame. Our study shows that the class of distributions is capable to model multivariate skewness structure and does not suffer from the curse of dimensionality as heavily…
▽ More
A new robust class of multivariate skew distributions is introduced. Practical aspects such as parameter estimation method of the proposed class are discussed, we show that the proposed class can be fitted under a reasonable time frame. Our study shows that the class of distributions is capable to model multivariate skewness structure and does not suffer from the curse of dimensionality as heavily as other distributions of similar complexity do, such as the class of canonical skew distributions. We also derive a nested form of the proposed class which appears to be the most flexible class of multivariate skew distributions in literature that has a closed-form density function. Numerical examples on two data sets, i) a data set containing daily river flow data recorded in the UK; and ii) a data set containing biomedical variables of athletes collected by the Australian Institute of Sports (AIS), are demonstrated. These examples further support the practicality of the proposed class on moderate dimensional data sets.
△ Less
Submitted 16 November, 2020; v1 submitted 3 November, 2020;
originally announced November 2020.
-
A Parameter-free and Projection-free Restarting Level Set Method for Adaptive Constrained Convex Optimization Under the Error Bound Condition
Authors:
Qihang Lin,
Runchao Ma,
Selvaprabu Nadarajah,
Negar Soheili
Abstract:
Recent efforts to accelerate first-order methods have focused on convex optimization problems that satisfy a geometric property known as error-bound condition, which covers a broad class of problems, including piece-wise linear programs and strongly convex programs. Parameter-free first-order methods that employ projection-free updates have the potential to broaden the benefit of acceleration. Suc…
▽ More
Recent efforts to accelerate first-order methods have focused on convex optimization problems that satisfy a geometric property known as error-bound condition, which covers a broad class of problems, including piece-wise linear programs and strongly convex programs. Parameter-free first-order methods that employ projection-free updates have the potential to broaden the benefit of acceleration. Such a method has been developed for unconstrained convex optimization but is lacking for general constrained convex optimization. We propose a parameter-free level-set method for the latter constrained case based on projection-free subgradient decent that exhibits accelerated convergence for problems that satisfy an error-bound condition. Our method maintains a separate copy of the level-set sub-problem for each level parameter value and restarts the computation of these copies based on objective function progress. Applying such a restarting scheme in a level-set context is novel and results in an algorithm that dynamically adapts the precision of each copy. This property is key to extending prior restarting methods based on static precision that have been proposed for unconstrained convex optimization to handle constraints. We report promising numerical performance relative to benchmark methods.
△ Less
Submitted 29 September, 2022; v1 submitted 28 October, 2020;
originally announced October 2020.
-
A robust quasi-optimal test norm for a DPG discretization of the convection-diffusion equation
Authors:
Stephen Metcalfe,
Siva Nadarajah
Abstract:
In this work, we propose a new quasi-optimal test norm for a discontinuous Petrov-Galerkin (DPG) discretization of the ultra-weak formulation of the convection-diffusion equation. We prove theoretically that the proposed test norm leads to bounds between the target norm and the energy norm induced by the test norm which are robust with respect to the diffusion parameter in the solution and gradien…
▽ More
In this work, we propose a new quasi-optimal test norm for a discontinuous Petrov-Galerkin (DPG) discretization of the ultra-weak formulation of the convection-diffusion equation. We prove theoretically that the proposed test norm leads to bounds between the target norm and the energy norm induced by the test norm which are robust with respect to the diffusion parameter in the solution and gradient components and have favorable scalings in the trace components. We conclude with numerical experiments to confirm our theoretical results.
△ Less
Submitted 12 August, 2020;
originally announced August 2020.
-
Self-guided Approximate Linear Programs
Authors:
Parshan Pakiman,
Selvaprabu Nadarajah,
Negar Soheili,
Qihang Lin
Abstract:
Approximate linear programs (ALPs) are well-known models based on value function approximations (VFAs) to obtain policies and lower bounds on the optimal policy cost of discounted-cost Markov decision processes (MDPs). Formulating an ALP requires (i) basis functions, the linear combination of which defines the VFA, and (ii) a state-relevance distribution, which determines the relative importance o…
▽ More
Approximate linear programs (ALPs) are well-known models based on value function approximations (VFAs) to obtain policies and lower bounds on the optimal policy cost of discounted-cost Markov decision processes (MDPs). Formulating an ALP requires (i) basis functions, the linear combination of which defines the VFA, and (ii) a state-relevance distribution, which determines the relative importance of different states in the ALP objective for the purpose of minimizing VFA error. Both these choices are typically heuristic: basis function selection relies on domain knowledge while the state-relevance distribution is specified using the frequency of states visited by a heuristic policy. We propose a self-guided sequence of ALPs that embeds random basis functions obtained via inexpensive sampling and uses the known VFA from the previous iteration to guide VFA computation in the current iteration. Self-guided ALPs mitigate the need for domain knowledge during basis function selection as well as the impact of the initial choice of the state-relevance distribution, thus significantly reducing the ALP implementation burden. We establish high probability error bounds on the VFAs from this sequence and show that a worst-case measure of policy performance is improved. We find that these favorable implementation and theoretical properties translate to encouraging numerical results on perishable inventory control and options pricing applications, where self-guided ALP policies improve upon policies from problem-specific methods. More broadly, our research takes a meaningful step toward application-agnostic policies and bounds for MDPs.
△ Less
Submitted 12 October, 2021; v1 submitted 8 January, 2020;
originally announced January 2020.
-
Pathwise Optimization for Merchant Energy Production
Authors:
Bo Yang,
Selvaprabu Nadarajah,
Nicola Secomandi
Abstract:
We study merchant energy production modeled as a compound switching and timing option. The resulting Markov decision process is intractable. State-of-the-art approximate dynamic programming methods applied to realistic instances of this model yield policies with large optimality gaps that are attributed to a weak upper (dual) bound on the optimal policy value. We extend pathwise optimization from…
▽ More
We study merchant energy production modeled as a compound switching and timing option. The resulting Markov decision process is intractable. State-of-the-art approximate dynamic programming methods applied to realistic instances of this model yield policies with large optimality gaps that are attributed to a weak upper (dual) bound on the optimal policy value. We extend pathwise optimization from stop** models to merchant energy production to investigate this issue. We apply principal component analysis and block coordinate descent in novel ways to respectively precondition and solve the ensuing ill conditioned and large scale linear program, which even a cutting-edge commercial solver is unable to handle directly. Compared to standard methods, our approach leads to substantially tighter dual bounds and smaller optimality gaps at the expense of considerably larger computational effort. Specifically, we provide numerical evidence for the near optimality of the operating policies based on least squares Monte Carlo and compute slightly better ones using our approach on a set of existing benchmark ethanol production instances. These findings suggest that both these policies are effective for the class of models we investigate. Our research has potential relevance for other commodity merchant operations settings.
△ Less
Submitted 28 December, 2019;
originally announced December 2019.
-
An asynchronous incomplete block LU preconditioner for computational fluid dynamics on unstructured grids
Authors:
Aditya Kashi,
Siva Nadarajah
Abstract:
We present a study of the effectiveness of asynchronous incomplete LU factorization preconditioners for the time-implicit solution of compressible flow problems while exploiting thread-parallelism within a compute node. A block variant of the asynchronous fine-grain parallel preconditioner adapted to a finite volume discretization of the compressible Navier-Stokes equations on unstructured grids i…
▽ More
We present a study of the effectiveness of asynchronous incomplete LU factorization preconditioners for the time-implicit solution of compressible flow problems while exploiting thread-parallelism within a compute node. A block variant of the asynchronous fine-grain parallel preconditioner adapted to a finite volume discretization of the compressible Navier-Stokes equations on unstructured grids is presented, and convergence theory is extended to the new variant. Experimental (numerical) results on the performance of these preconditioners on inviscid and viscous laminar two-dimensional steady-state test cases are reported. It is found, for these compressible flow problems, that the block variant performs much better in terms of convergence, parallel scalability and reliability than the original scalar asynchronous ILU preconditioner. For viscous flow, it is found that the ordering of unknowns may determine the success or failure of asynchronous block-ILU preconditioning, and an ordering of grid cells suitable for solving viscous problems is presented.
△ Less
Submitted 4 October, 2020; v1 submitted 1 December, 2019;
originally announced December 2019.
-
A Data Efficient and Feasible Level Set Method for Stochastic Convex Optimization with Expectation Constraints
Authors:
Qihang Lin,
Selvaprabu Nadarajah,
Negar Soheili,
Tianbao Yang
Abstract:
Stochastic convex optimization problems with expectation constraints (SOECs) are encountered in statistics and machine learning, business, and engineering. In data-rich environments, the SOEC objective and constraints contain expectations defined with respect to large datasets. Therefore, efficient algorithms for solving such SOECs need to limit the fraction of data points that they use, which we…
▽ More
Stochastic convex optimization problems with expectation constraints (SOECs) are encountered in statistics and machine learning, business, and engineering. In data-rich environments, the SOEC objective and constraints contain expectations defined with respect to large datasets. Therefore, efficient algorithms for solving such SOECs need to limit the fraction of data points that they use, which we refer to as algorithmic data complexity. Recent stochastic first order methods exhibit low data complexity when handling SOECs but guarantee near-feasibility and near-optimality only at convergence. These methods may thus return highly infeasible solutions when heuristically terminated, as is often the case, due to theoretical convergence criteria being highly conservative. This issue limits the use of first order methods in several applications where the SOEC constraints encode implementation requirements. We design a stochastic feasible level set method (SFLS) for SOECs that has low data complexity and emphasizes feasibility before convergence. Specifically, our level-set method solves a root-finding problem by calling a novel first order oracle that computes a stochastic upper bound on the level-set function by extending mirror descent and online validation techniques. We establish that SFLS maintains a high-probability feasible solution at each root-finding iteration and exhibits favorable iteration complexity compared to state-of-the-art deterministic feasible level set and stochastic subgradient methods. Numerical experiments on three diverse applications validate the low data complexity of SFLS relative to the former approach and highlight how SFLS finds feasible solutions with small optimality gaps significantly faster than the latter method.
△ Less
Submitted 1 January, 2020; v1 submitted 7 August, 2019;
originally announced August 2019.
-
An n-dimensional Rosenbrock Distribution for MCMC Testing
Authors:
Filippo Pagani,
Martin Wiegand,
Saralees Nadarajah
Abstract:
The Rosenbrock function is an ubiquitous benchmark problem for numerical optimisation, and variants have been proposed to test the performance of Markov Chain Monte Carlo algorithms. In this work we discuss the two-dimensional Rosenbrock density, its current $n$-dimensional extensions, and their advantages and limitations. We then propose a new extension to arbitrary dimensions called the Hybrid R…
▽ More
The Rosenbrock function is an ubiquitous benchmark problem for numerical optimisation, and variants have been proposed to test the performance of Markov Chain Monte Carlo algorithms. In this work we discuss the two-dimensional Rosenbrock density, its current $n$-dimensional extensions, and their advantages and limitations. We then propose a new extension to arbitrary dimensions called the Hybrid Rosenbrock distribution, which is composed of conditional normal kernels arranged in such a way that preserves the key features of the original kernel. Moreover, due to its structure, the Hybrid Rosenbrock distribution is analytically tractable and possesses several desirable properties, which make it an excellent test model for computational algorithms.
△ Less
Submitted 7 May, 2020; v1 submitted 22 March, 2019;
originally announced March 2019.
-
alphastable: An R Package for Modelling Multivariate Stable and Mixture of Symmetric Stable Distributions
Authors:
Mahdi Teimouri,
Mahdi Torshizi,
Adel Mohammadpour,
Saralees Nadarajah
Abstract:
The family of stable distributions received extensive applications in many fields of studies since it incorporates both the skewness and heavy tails. In this paper, we introduce a package written in the R language called alphastable. The alphastable performs a variety of tasks including: 1- generating random numbers from univariate, truncated, and multivariate stable distributions. 2- computing th…
▽ More
The family of stable distributions received extensive applications in many fields of studies since it incorporates both the skewness and heavy tails. In this paper, we introduce a package written in the R language called alphastable. The alphastable performs a variety of tasks including: 1- generating random numbers from univariate, truncated, and multivariate stable distributions. 2- computing the probability density function of univariate and multivariate elliptically contoured stable distributions, 3- computing the distribution function of univariate stable distributions, 4- estimating the parameters of univariate symmetric stable, univariate Cauchy, mixture of Cauchy, mixture of univariate symmetric stable, multivariate elliptically contoured stable, and multivariate strictly stable distributions. This package, as it will be shown, is very useful for modelling data in univariate and multivariate cases that arise in the fields of finance and economics.
△ Less
Submitted 25 September, 2018;
originally announced September 2018.
-
Stability of Energy Stable Flux Reconstruction for the Diffusion Problem using the Interior Penalty and Bassi and Rebay II Numerical Fluxes for Linear Triangular Elements
Authors:
Samuel Quaegebeur,
Siva Nadarajah
Abstract:
The flux reconstruction (FR) method has gained popularity within the research community. The approach has been demonstrated to recover high-order methods such as the discontinuous Galerkin (DG) method. Stability analyses have been conducted for a linear advection problem leading to the energy stable flux reconstruction (ESFR) methods also named Vincent-Castonguay-Jameson-Huynh (VCJH) methods. ESFR…
▽ More
The flux reconstruction (FR) method has gained popularity within the research community. The approach has been demonstrated to recover high-order methods such as the discontinuous Galerkin (DG) method. Stability analyses have been conducted for a linear advection problem leading to the energy stable flux reconstruction (ESFR) methods also named Vincent-Castonguay-Jameson-Huynh (VCJH) methods. ESFR schemes can be viewed as DG schemes with modally filtered correction fields. Using this class of methods, the linear advection diffusion problem has been shown to be stable using the local discontinuous Galerkin scheme (LDG) to compute the viscous numerical flux. This stability proof has been extended for linear triangular and tetrahedra elements. Although the LDG scheme is commonly used, it requires, on particular meshes, a wide stencil, which raises the computational cost.
As a consequence, many prefer the compact interior penalty (IP) or the Bassi and Rebay II (BR2) numerical fluxes. This article, for the first time, derives, for both schemes, a condition on the penalty term to ensure stability. Moreover the article establishes that for both the IP and BR2 numerical fluxes, the stability of the ESFR scheme is independent of the auxiliary correction field. A von Neumann analysis is conducted to study the maximal time step of various ESFR methods.
△ Less
Submitted 13 May, 2018;
originally announced May 2018.
-
On the Geometric Conservation Law for the Non Linear Frequency Domain and Time-Spectral Methods
Authors:
Marc Benoit,
Siva Nadarajah
Abstract:
The aim of this paper is to present and validate two new procedures to enforce the Geometric Conservation Law (GCL) on a moving grid for an Arbitrary Lagrangian Eulerian (ALE) formulation of the Euler equations discretized in time for either the Non Linear Frequency Domain (NLFD) or Time-Spectral (TS) methods. The equations are spatially discretized by a structured finite-volume scheme on a hexahe…
▽ More
The aim of this paper is to present and validate two new procedures to enforce the Geometric Conservation Law (GCL) on a moving grid for an Arbitrary Lagrangian Eulerian (ALE) formulation of the Euler equations discretized in time for either the Non Linear Frequency Domain (NLFD) or Time-Spectral (TS) methods. The equations are spatially discretized by a structured finite-volume scheme on a hexahedral mesh. The derived methodologies follow a general approach where the positions and the velocities of the grid points are known at each time step. The integrated face mesh velocities are derived either from the Approximation of the Exact Volumetric Increments (AEVI) relative to the undeformed mesh or exactly computed based on a Trilinear Map** (TRI-MAP) between the physical space and the computational domain. The accuracy of the AEVI method highly depends on the computation of the volumetric increments and limits the temporal-order of accuracy of the deduced integrated face mesh velocities to between one and two. Thus defeating the purpose of the NLFD method which possesses spectral rate of convergence. However, the TRI-MAP method has proven to be more computationally efficient, ensuring the satisfaction of the GCL once the convergence of the time derivative of the cell volume is reached in Fourier space. The methods are validated numerically by verifying the conservation of uniform flow and by comparing the integrated face mesh velocities to the exact values derived from the map**.
△ Less
Submitted 10 April, 2018;
originally announced April 2018.
-
Stability of Energy Stable Flux Reconstruction for the Diffusion Problem using the Interior Penalty and Bassi and Rebay II Numerical Fluxes
Authors:
Samuel Quaegebeur,
Siva Nadarajah,
Farshad Navah,
Philip Zwanenburg
Abstract:
Recovering some prominent high-order approaches such as the discontinuous Galerkin (DG) or the spectral difference (SD) methods, the flux reconstruction (FR) approach has been adopted by many individuals in the research community and is now commonly used to solve problems on unstructured grids over complex geometries. This approach relies on the use of correction functions to obtain a differential…
▽ More
Recovering some prominent high-order approaches such as the discontinuous Galerkin (DG) or the spectral difference (SD) methods, the flux reconstruction (FR) approach has been adopted by many individuals in the research community and is now commonly used to solve problems on unstructured grids over complex geometries. This approach relies on the use of correction functions to obtain a differential form for the discrete problem. A class of correction functions, named energy stable flux reconstruction (ESFR) functions, has been proven stable for the linear advection problem. This proof has then been extended for the diffusion equation using the local discontinuous Galerkin (LDG) scheme to compute the numerical fluxes. Although the LDG scheme is commonly used, many prefer the interior penalty (IP), as well as the Bassi and Rebay II (BR2) schemes. Similarly to the LDG proof, this article provides a stability analysis for the IP and the BR2 numerical fluxes. In fact, we obtain a theoretical condition on the penalty term to ensure stability. This result is then verified through numerical simulations. To complete this study, a von Neumann analysis is conducted to provide a combination of parameters producing the maximal time step while converging at the correct order. All things considered, this article has for purpose to provide the community with a stability condition while using the IP and the BR2 schemes.
△ Less
Submitted 18 March, 2018;
originally announced March 2018.
-
An extension of Azzalini's method
Authors:
Filippo Domma,
Božidar V. Popović,
Saralees Nadarajah
Abstract:
The aim of this paper is to extend Azzalini's method. This extension is done in two stages: consider two dependent and non-identically distributed random variables say $X_1$ and $X_2$; model the dependence between $X_1$ and $X_2$ by a copula. To illustrate the new method, we assume $X_1$ and $X_2$ are exponential random variables. This assumption leads to a new distribution called the Generalized…
▽ More
The aim of this paper is to extend Azzalini's method. This extension is done in two stages: consider two dependent and non-identically distributed random variables say $X_1$ and $X_2$; model the dependence between $X_1$ and $X_2$ by a copula. To illustrate the new method, we assume $X_1$ and $X_2$ are exponential random variables. This assumption leads to a new distribution called the Generalized Weighted Exponential Distribution (GWED), a generalization of Gupta and Kundu (2009)'s Weighted Exponential Distribution (WED). Some mathematical properties of the GWED are derived, and its parameters estimated by maximum likelihood. The GWED is applied to biochemical data sets showing its good performance compared to the WED.
△ Less
Submitted 3 March, 2018;
originally announced March 2018.
-
On the verification of CFD solvers of all orders of accuracy on curved wall-bounded domains and for realistic RANS flows
Authors:
Farshad Navah,
Siva Nadarajah
Abstract:
This paper aims at extending the code verification methodology to high-order accurate scheme implementations on curved wall-bounded domains as well as for realistic turbulent flows such as the flat plate boundary layer modelled by the Reynolds-averaged Navier-Stokes (RANS) equations. Two new manufactured solutions (MSs) are introduced with demonstrated ability to verify the treatment of slip and n…
▽ More
This paper aims at extending the code verification methodology to high-order accurate scheme implementations on curved wall-bounded domains as well as for realistic turbulent flows such as the flat plate boundary layer modelled by the Reynolds-averaged Navier-Stokes (RANS) equations. Two new manufactured solutions (MSs) are introduced with demonstrated ability to verify the treatment of slip and no-slip boundary conditions in high-order frameworks on curved domains. These MSs serve as well to discuss the impact of the method of computation of boundary normals on the order of accuracy (OOA) of the solution at the wall. Furthermore, two turbulent boundary layer MSs from literature, devised to mimic the genuine features of RANS-modelled flows in the vicinity of wall, are compared in terms of their suitability to achieving high-order accuracy. A number of useful concepts in verification are explored through these cases such as the limit values of the Spalart-Allmaras (SA) turbulence model source terms at the wall, the verification of the modified vorticity term of the modified SA model, the grid sensitivity of wall-bounded turbulent flows, the inadequacy of substituting solution verification to code verification and the effect of non-dimensionalization of the solution on the minimization of iterative errors via residual convergence. In all cases, demonstrations are carried for orders of accuracy up to the sixth.
△ Less
Submitted 29 December, 2017;
originally announced January 2018.
-
A comprehensive high-order solver verification methodology for free fluid flows
Authors:
Farshad Navah,
Siva Nadarajah
Abstract:
The aim of this article is to present a comprehensive methodology for the verification of computational fluid dynamics (CFD) solvers with a special attention to aspects pertinent to discretizations with orders of accuracy (OOAs) higher than two. The method of manufactured solutions (MMS) is adopted and a series of manufactured solutions (MSs) is introduced that examines various components of CFD s…
▽ More
The aim of this article is to present a comprehensive methodology for the verification of computational fluid dynamics (CFD) solvers with a special attention to aspects pertinent to discretizations with orders of accuracy (OOAs) higher than two. The method of manufactured solutions (MMS) is adopted and a series of manufactured solutions (MSs) is introduced that examines various components of CFD solvers for free flows (not bounded by walls), including inviscid, laminar and turbulent problems when the latter are modelled by the Reynolds-averaged Navier-Stokes (RANS) equations. The treatment of curved elements is also examined. These MSs are furthermore conceived with demonstrated suitability for the verification of OOAs up to the sixth. Each MS is as well utilized to discuss salient aspects useful to the code verification methodology such as the relative qualities of the most useful norms in measuring the discretization error, the sensitivity analysis of the verification process to forcing function terms, the relation between residual minimization and discretization error convergence in iterative solutions and finally the sensitivity of high-order discretizations to grid stretching and self-similarity. Furthermore, scripts and code are provided as accompanying material to assist the interested reader in reproducing the verification results of each manufactured solution (MS).
△ Less
Submitted 26 December, 2017;
originally announced December 2017.
-
An alternative approach for compatibility of two discrete conditional distributions
Authors:
Indranil Ghosh,
Saralees Nadarajah
Abstract:
Conditional specification of distributions is a develo** area with increasing applications. In the finite discrete case, a variety of compatible conditions can be derived. In this paper, we propose an alternative approach to study the compatibility of two conditional probability distributions under the finite discrete setup. A technique based on rank-based criterion is shown to be particularly c…
▽ More
Conditional specification of distributions is a develo** area with increasing applications. In the finite discrete case, a variety of compatible conditions can be derived. In this paper, we propose an alternative approach to study the compatibility of two conditional probability distributions under the finite discrete setup. A technique based on rank-based criterion is shown to be particularly convenient for identifying compatible distributions corresponding to complete conditional specification including the case with zeros.The proposed methods are illustrated with several examples.
△ Less
Submitted 1 November, 2017;
originally announced November 2017.
-
On some further properties and application of Weibull-R family of distributions
Authors:
Indranil Ghosh,
Saralees Nadarajah
Abstract:
In this paper, we provide some new results for the Weibull-R family of distributions (Alzaghal, Ghosh and Alzaatreh (2016)). We derive some new structural properties of the Weibull-R family of distributions. We provide various characterizations of the family via conditional moments, some functions of order statistics and via record values.
In this paper, we provide some new results for the Weibull-R family of distributions (Alzaghal, Ghosh and Alzaatreh (2016)). We derive some new structural properties of the Weibull-R family of distributions. We provide various characterizations of the family via conditional moments, some functions of order statistics and via record values.
△ Less
Submitted 31 October, 2017;
originally announced November 2017.
-
On the Necessity of Superparametric Geometry Representation for Discontinuous Galerkin Methods on Domains with Curved Boundaries
Authors:
Philip Zwanenburg,
Siva Nadarajah
Abstract:
We provide numerical evidence demonstrating the necessity of employing a superparametric geometry representation in order to obtain optimal convergence orders on two-dimensional domains with curved boundaries when solving the Euler equations using Discontinuous Galerkin methods. However, concerning the obtention of optimal convergence orders for the Navier-Stokes equations, we demonstrate numerica…
▽ More
We provide numerical evidence demonstrating the necessity of employing a superparametric geometry representation in order to obtain optimal convergence orders on two-dimensional domains with curved boundaries when solving the Euler equations using Discontinuous Galerkin methods. However, concerning the obtention of optimal convergence orders for the Navier-Stokes equations, we demonstrate numerically that the use of isoparametric geometry representation is sufficient for the case considered here.
△ Less
Submitted 3 May, 2017;
originally announced May 2017.
-
Partial hyperplane activation for generalized intersection cuts
Authors:
Aleksandr M. Kazachkov,
Selvaprabu Nadarajah,
Egon Balas,
François Margot
Abstract:
The generalized intersection cut (GIC) paradigm is a recent framework for generating cutting planes in mixed integer programming with attractive theoretical properties. We investigate this computationally unexplored paradigm and observe that a key hyperplane activation procedure embedded in it is not computationally viable. To overcome this issue, we develop a novel replacement to this procedure c…
▽ More
The generalized intersection cut (GIC) paradigm is a recent framework for generating cutting planes in mixed integer programming with attractive theoretical properties. We investigate this computationally unexplored paradigm and observe that a key hyperplane activation procedure embedded in it is not computationally viable. To overcome this issue, we develop a novel replacement to this procedure called partial hyperplane activation (PHA), introduce a variant of PHA based on a notion of hyperplane tilting, and prove the validity of both algorithms. We propose several implementation strategies and parameter choices for our PHA algorithms and provide supporting theoretical results. We computationally evaluate these ideas in the COIN-OR framework on MIPLIB instances. Our findings shed light on the the strengths of the PHA approach as well as suggest properties related to strong cuts that can be targeted in the future.
△ Less
Submitted 16 December, 2018; v1 submitted 7 March, 2017;
originally announced March 2017.
-
Unbiased estimates for products of moments and cumulants for finite and infinite populations
Authors:
C. S. Withers,
S. Nadarajah
Abstract:
Let $F=F_N$ be the distribution of a finite real population of size $N$. Let $\widehat{F}=F_N$ be the empirical distribution of a sample of size $n$ drawn from the population without replacement. We prove the following remarkable {\it inversion principle} for obtaining unbiased estimates. Let $ T \left(F_N\right)$ be any product of the moments or cumulants of $F_N$. Let…
▽ More
Let $F=F_N$ be the distribution of a finite real population of size $N$. Let $\widehat{F}=F_N$ be the empirical distribution of a sample of size $n$ drawn from the population without replacement. We prove the following remarkable {\it inversion principle} for obtaining unbiased estimates. Let $ T \left(F_N\right)$ be any product of the moments or cumulants of $F_N$. Let $T_{n, N} \left( F_N \right) = E T \left( F_n \right)$. Then $E T_{N, n} \left( F_n \right) = T \left( F_N \right)$. We also obtain an explicit expression for $T_{n, N} \left(F_N\right)$ for all $ T \left( F_N \right)$ of order up to 6.
We also prove the following related result. If $F_n$ and $F_N$ are the sample and population distributions, the only functionals for which $E T \left( F_n \right) = λ_{n, N} T \left( F_N \right)$ are noncentral moments, and generalized second and third order central moments. For these three cases the eigenvalues are $λ_{n, N}=1$, $\left( 1 - n^{-1} \right) \left( 1 - N^{-1} \right)^{-1}$, and $\left( 1 - n^{-1} \right) \left( 1 - 2n^{-1} \right) \left( 1 - N^{-1} \right)^{-1} \left( 1 - 2N^{-1} \right)^{-1}$ respectively.
△ Less
Submitted 27 October, 2014;
originally announced October 2014.
-
The distribution of the maximum of an ARMA(1, 1) process
Authors:
C. S. Withers,
S. Nadarajah
Abstract:
We give the cumulative distribution function of $M_n$, the maximum of a sequence of $n$ observations from an ARMA(1, 1) process. Solutions are first given in terms of repeated integrals and then for the case, where the underlying random variables are absolutely continuous. The distribution of $M_n$ is then given as a weighted sum of the $n$th powers of the eigenvalues of a non-symmetric Fredholm k…
▽ More
We give the cumulative distribution function of $M_n$, the maximum of a sequence of $n$ observations from an ARMA(1, 1) process. Solutions are first given in terms of repeated integrals and then for the case, where the underlying random variables are absolutely continuous. The distribution of $M_n$ is then given as a weighted sum of the $n$th powers of the eigenvalues of a non-symmetric Fredholm kernel. The weights are given in terms of the left and right eigenfunctions of the kernel.
These results are large deviations expansions for estimates, since the maximum need not be standardized to have a limit. In fact, such a limit need not exist.
△ Less
Submitted 26 December, 2013;
originally announced December 2013.
-
Rates of convergence of extremes from skew normal samples
Authors:
Xin Liao,
Zuoxiang Peng,
Saralees Nadarajah,
Xiaoqian Wang
Abstract:
For a skew normal random sequence, convergence rates of the distribution of its partial maximum to the Gumbel extreme value distribution are derived. The asymptotic expansion of the distribution of the normalized maximum is given under an optimal choice of norming constants. We find that the optimal convergence rate of the normalized maximum to the Gumbel extreme value distribution is proportional…
▽ More
For a skew normal random sequence, convergence rates of the distribution of its partial maximum to the Gumbel extreme value distribution are derived. The asymptotic expansion of the distribution of the normalized maximum is given under an optimal choice of norming constants. We find that the optimal convergence rate of the normalized maximum to the Gumbel extreme value distribution is proportional to $1/\log n$.
△ Less
Submitted 5 December, 2012;
originally announced December 2012.
-
The chain rule for functionals with applications to functions of moments
Authors:
C. S. Withers,
S. Nadarajah
Abstract:
The chain rule for derivatives of a function of a function is extended to a function of a statistical functional, and applied to obtain approximations to the cumulants, distribution and quantiles of functions of sample moments, and so to obtain third order confidence intervals and estimates of reduced bias for functions of moments. As an example we give the distribution of the standardized skewnes…
▽ More
The chain rule for derivatives of a function of a function is extended to a function of a statistical functional, and applied to obtain approximations to the cumulants, distribution and quantiles of functions of sample moments, and so to obtain third order confidence intervals and estimates of reduced bias for functions of moments. As an example we give the distribution of the standardized skewness for a normal sample to magnitude $O(n^{-2})$, where $n$ is the sample size.
△ Less
Submitted 1 November, 2012;
originally announced November 2012.
-
Expansions about the gamma for the distribution and quantiles of a standard estimate
Authors:
C. S. Withers,
S. Nadarajah
Abstract:
We give expansions for the distribution, density, and quantiles of an estimate, building on results of Cornish, Fisher, Hill, Davis and the authors. The estimate is assumed to be non-lattice with the standard expansions for its cumulants. By expanding about a skew variable with matched skewness, one can drastically reduce the number of terms needed for a given level of accuracy. The building block…
▽ More
We give expansions for the distribution, density, and quantiles of an estimate, building on results of Cornish, Fisher, Hill, Davis and the authors. The estimate is assumed to be non-lattice with the standard expansions for its cumulants. By expanding about a skew variable with matched skewness, one can drastically reduce the number of terms needed for a given level of accuracy. The building blocks generalize the Hermite polynomials. We demonstrate with expansions about the gamma.
△ Less
Submitted 15 October, 2012;
originally announced October 2012.
-
Accurate inference for a one parameter distribution based on the mean of a transformed sample
Authors:
C. S. Withers,
S. Nadarajah
Abstract:
A great deal of inference in statistics is based on making the approximation that a statistic is normally distributed. The error in doing so is generally $O(n^{-1/2})$ and can be very considerable when the distribution is heavily biased or skew. This note shows how one may reduce this error to $O(n^{-(j+1)/2})$, where $j$ is a given integer. The case considered is when the statistic is the mean of…
▽ More
A great deal of inference in statistics is based on making the approximation that a statistic is normally distributed. The error in doing so is generally $O(n^{-1/2})$ and can be very considerable when the distribution is heavily biased or skew. This note shows how one may reduce this error to $O(n^{-(j+1)/2})$, where $j$ is a given integer. The case considered is when the statistic is the mean of the sample values from a continuous one-parameter distribution, after the sample has undergone an initial transformation.
△ Less
Submitted 11 September, 2010;
originally announced September 2010.
-
Nonparametric estimates of low bias
Authors:
C. S. Withers,
S. Nadarajah
Abstract:
We consider the problem of estimating an arbitrary smooth functional of $k \geq 1 $ distribution functions (d.f.s.) in terms of random samples from them. The natural estimate replaces the d.f.s by their empirical d.f.s. Its bias is generally $\sim n^{-1}$, where $n$ is the minimum sample size, with a {\it $p$th order} iterative estimate of bias $ \sim n^{-p}$ for any $p$. For $p \leq 4$, we give a…
▽ More
We consider the problem of estimating an arbitrary smooth functional of $k \geq 1 $ distribution functions (d.f.s.) in terms of random samples from them. The natural estimate replaces the d.f.s by their empirical d.f.s. Its bias is generally $\sim n^{-1}$, where $n$ is the minimum sample size, with a {\it $p$th order} iterative estimate of bias $ \sim n^{-p}$ for any $p$. For $p \leq 4$, we give an explicit estimate in terms of the first $2p - 2$ von Mises derivatives of the functional evaluated at the empirical d.f.s. These may be used to obtain {\it unbiased} estimates, where these exist and are of known form in terms of the sample sizes; our form for such unbiased estimates is much simpler than that obtained using polykays and tables of the symmetric functions. Examples include functions of a mean vector (such as the ratio of two means and the inverse of a mean), standard deviation, correlation, return times and exceedances. These $p$th order estimates require only $\sim n $ calculations. This is in sharp contrast with computationally intensive bias reduction methods such as the $p$th order bootstrap and jackknife, which require $\sim n^p $ calculations.
△ Less
Submitted 31 July, 2010;
originally announced August 2010.
-
The distribution and quantiles of functionals of weighted empirical distributions when observations have different distributions
Authors:
C. S. Withers,
S. Nadarajah
Abstract:
This paper extends Edgeworth-Cornish-Fisher expansions for the distribution and quantiles of nonparametric estimates in two ways. Firstly it allows observations to have different distributions. Secondly it allows the observations to be weighted in a predetermined way. The use of weighted estimates has a long history including applications to regression, rank statistics and Bayes theory. However,…
▽ More
This paper extends Edgeworth-Cornish-Fisher expansions for the distribution and quantiles of nonparametric estimates in two ways. Firstly it allows observations to have different distributions. Secondly it allows the observations to be weighted in a predetermined way. The use of weighted estimates has a long history including applications to regression, rank statistics and Bayes theory. However, asymptotic results have generally been only first order (the CLT and weak convergence). We give third order asymptotics for the distribution and percentiles of any smooth functional of a weighted empirical distribution, thus allowing a considerable increase in accuracy over earlier CLT results.
Consider independent non-identically distributed ({\it non-iid}) observations $X_{1n}, ..., X_{nn}$ in $R^s$. Let $\hat{F}(x)$ be their {\it weighted empirical distribution} with weights $w_{1n}, ..., w_{nn}$. We obtain cumulant expansions and hence Edgeworth-Cornish-Fisher expansions for $T(\hat{F})$ for any smooth functional $T(\cdot)$ by extending the concepts of von Mises derivatives to signed measures of total measure 1. As an example we give the cumulant coefficients needed for Edgeworth-Cornish-Fisher expansions to $O(n^{-3/2})$ for the sample variance when observations are non-iid.
△ Less
Submitted 23 February, 2010;
originally announced February 2010.
-
The distribution of the maximum of a second order autoregressive process: the continuous case
Authors:
C. S. Withers,
S. Nadarajah
Abstract:
We give the distribution function of $M_n$, the maximum of a sequence of $n$ observations from an autoregressive process of order 2. Solutions are first given in terms of repeated integrals and then for the case, where the underlying random variables are absolutely continuous. When the correlations are positive, P(M_n \leq x) =a_{n,x}, where a_{n,x}= \sum_{j=1}^\infty β_{jx} ν_{jx}^{n} = O (ν_{1…
▽ More
We give the distribution function of $M_n$, the maximum of a sequence of $n$ observations from an autoregressive process of order 2. Solutions are first given in terms of repeated integrals and then for the case, where the underlying random variables are absolutely continuous. When the correlations are positive, P(M_n \leq x) =a_{n,x}, where a_{n,x}= \sum_{j=1}^\infty β_{jx} ν_{jx}^{n} = O (ν_{1x}^{n}), where $\{ν_{jx}\}$ are the eigenvalues of a non-symmetric Fredholm kernel, and $ν_{1x}$ is the eigenvalue of maximum magnitude. The weights $β_{jx}$ depend on the $j$th left and right eigenfunctions of the kernel.
These results are large deviations expansions for estimates, since the maximum need not be standardized to have a limit. In fact such a limit need not exist.
△ Less
Submitted 1 February, 2010; v1 submitted 28 January, 2010;
originally announced January 2010.
-
Expansions for Quantiles and Multivariate Moments of Extremes for Distributions of Pareto Type
Authors:
Saralees Nadarajah,
Christopher S. Withers
Abstract:
Let $X_{nr}$ be the $r$th largest of a random sample of size $n$ from a distribution $F (x) = 1 - \sum_{i = 0}^\infty c_i x^{-α- i β}$ for $α> 0$ and $β> 0$. An inversion theorem is proved and used to derive an expansion for the quantile $F^{-1} (u)$ and powers of it. From this an expansion in powers of $(n^{-1}, n^{-β/α})$ is given for the multivariate moments of the extremes…
▽ More
Let $X_{nr}$ be the $r$th largest of a random sample of size $n$ from a distribution $F (x) = 1 - \sum_{i = 0}^\infty c_i x^{-α- i β}$ for $α> 0$ and $β> 0$. An inversion theorem is proved and used to derive an expansion for the quantile $F^{-1} (u)$ and powers of it. From this an expansion in powers of $(n^{-1}, n^{-β/α})$ is given for the multivariate moments of the extremes $\{X_{n, n - s_i}, 1 \leq i \leq k \}/n^{1/α}$ for fixed ${\bf s} = (s_1, ..., s_k)$, where $k \geq 1$. Examples include the Cauchy, Student $t$, $F$, second extreme distributions and stable laws of index $α< 1$.
△ Less
Submitted 25 March, 2009;
originally announced March 2009.
-
Analytic Bias Reduction for $k$-Sample Functionals
Authors:
Christopher S. Withers,
Saralees Nadarajah
Abstract:
We give analytic methods for nonparametric bias reduction that remove the need for computationally intensive methods like the bootstrap and the jackknife.
We call an estimate {\it $p$th order} if its bias has magnitude $n_0^{-p}$ as $n_0 \to \infty$, where $n_0$ is the sample size (or the minimum sample size if the estimate is a function of more than one sample). Most estimates are only first…
▽ More
We give analytic methods for nonparametric bias reduction that remove the need for computationally intensive methods like the bootstrap and the jackknife.
We call an estimate {\it $p$th order} if its bias has magnitude $n_0^{-p}$ as $n_0 \to \infty$, where $n_0$ is the sample size (or the minimum sample size if the estimate is a function of more than one sample). Most estimates are only first order and require O(N) calculations, where $N$ is the total sample size. The usual bootstrap and jackknife estimates are second order but they are computationally intensive, requiring $O(N^2)$ calculations for one sample. By contrast Jaeckel's infinitesimal jackknife is an analytic second order one sample estimate requiring only O(N) calculations. When $p$th order bootstrap and jackknife estimates are available, they require $O(N^p)$ calculations, and so become even more computationally intensive if one chooses $p>2$.
For general $p$ we provide analytic $p$th order nonparametric estimates that require only O(N) calculations. Our estimates are given in terms of the von Mises derivatives of the functional being estimated, evaluated at the empirical distribution.
For products of moments an unbiased estimate exists: our form for this "polykay" is much simpler than the usual form in terms of power sums.
△ Less
Submitted 16 March, 2009;
originally announced March 2009.
-
Explicit expressions for the variogram of first--order intrinsic autoregressions
Authors:
Tibor K. Pogány,
Saralees Nadarajah
Abstract:
Exact and explicit expressions for the variogram of first--order intrinsic autoregressions have not been known. Various asymptotic expansions and approximations have been used to compute the variogram. In this note, an exact and explicit expression applicable for all parameter values is derived. The expression involves Appell's hypergeometric function of the fourth kind. Various particular cases…
▽ More
Exact and explicit expressions for the variogram of first--order intrinsic autoregressions have not been known. Various asymptotic expansions and approximations have been used to compute the variogram. In this note, an exact and explicit expression applicable for all parameter values is derived. The expression involves Appell's hypergeometric function of the fourth kind. Various particular cases of the expression are also derived.
△ Less
Submitted 19 February, 2009;
originally announced February 2009.
-
Almost Sure Convergence of Extreme Order Statistics
Authors:
Zuoxiang Peng,
Jiaona Li,
Saralees Nadarajah
Abstract:
Let $M_n^{(k)}$ denote the $k$th largest maximum of a sample $(X_1,X_2,...,X_n)$ from parent $X$ with continuous distribution. Assume there exist normalizing constants $a_n>0$, $b_n\in \mathbb{R}$ and a nondegenerate distribution $G$ such that $a_n^{-1}(M_n^{(1)}-b_n)\stackrel{w}{\to}G$. Then for fixed $k\in \mathbb{N}$, the almost sure convergence of \[\frac{1}{D_N}\sum_{n=k}^Nd_n\mathbb{I}\{M_…
▽ More
Let $M_n^{(k)}$ denote the $k$th largest maximum of a sample $(X_1,X_2,...,X_n)$ from parent $X$ with continuous distribution. Assume there exist normalizing constants $a_n>0$, $b_n\in \mathbb{R}$ and a nondegenerate distribution $G$ such that $a_n^{-1}(M_n^{(1)}-b_n)\stackrel{w}{\to}G$. Then for fixed $k\in \mathbb{N}$, the almost sure convergence of \[\frac{1}{D_N}\sum_{n=k}^Nd_n\mathbb{I}\{M_n^{(1)}\le a_nx_1+b_n,M_n^{(2)}\le a_nx_2+b_n,...,M_n^{(k)}\le a_nx_k+b_n\}\] is derived if the positive weight sequence $(d_n)$ with $D_N=\sum_{n=1}^Nd_n$ satisfies conditions provided by Hörmann.
△ Less
Submitted 3 October, 2008;
originally announced October 2008.
-
A class of unbiased location invariant Hill-type estimators for heavy tailed distributions
Authors:
Jiaona Li,
Zuoxiang Peng,
Saralees Nadarajah
Abstract:
Based on the methods provided in Caeiro and Gomes (2002) and Fraga Alves (2001), a new class of location invariant Hill-type estimators is derived in this paper. Its asymptotic distributional representation and asymptotic normality are presented, and the optimal choice of sample fraction by Mean Squared Error is also discussed for some special cases. Finally comparison studies are provided for s…
▽ More
Based on the methods provided in Caeiro and Gomes (2002) and Fraga Alves (2001), a new class of location invariant Hill-type estimators is derived in this paper. Its asymptotic distributional representation and asymptotic normality are presented, and the optimal choice of sample fraction by Mean Squared Error is also discussed for some special cases. Finally comparison studies are provided for some familiar models by Monte Carlo simulations.
△ Less
Submitted 23 September, 2008;
originally announced September 2008.
-
Asymptotic tail properties of the distributions in the class of dispersion models
Authors:
Alexandre B. Simas,
Gauss M. Cordeiro,
Saralees Nadarajah
Abstract:
The class of dispersion models introduced by Jørgensen (1997b) covers many known distributions such as the normal, Student t, gamma, inverse Gaussian, hyperbola, von-Mises, among others. We study the small dispersion asymptotic (Jørgensen, 1987b) behavior of the probability density functions of dispersion models which satisfy the uniformly convergent saddlepoint approximation. Our results extend…
▽ More
The class of dispersion models introduced by Jørgensen (1997b) covers many known distributions such as the normal, Student t, gamma, inverse Gaussian, hyperbola, von-Mises, among others. We study the small dispersion asymptotic (Jørgensen, 1987b) behavior of the probability density functions of dispersion models which satisfy the uniformly convergent saddlepoint approximation. Our results extend those obtained by Finner et al. (2008).
△ Less
Submitted 10 September, 2008;
originally announced September 2008.
-
The distribution of the maximum of a first order moving average: the discrete case
Authors:
Christopher S. Withers,
Saralees Nadarajah
Abstract:
We give the distribution of $M_n$, the maximum of a sequence of $n$ observations from a moving average of order 1. Solutions are first given in terms of repeated integrals and then for the case where the underlying independent random variables are discrete. When the correlation is positive, $$ P(M_n \max^n_{i=1} X_i \leq x) = \sum_{j=1}^\infty β_{jx} ν_{jx}^{n} \approx B_{x} r{1x}^{n} $$ where…
▽ More
We give the distribution of $M_n$, the maximum of a sequence of $n$ observations from a moving average of order 1. Solutions are first given in terms of repeated integrals and then for the case where the underlying independent random variables are discrete. When the correlation is positive, $$ P(M_n \max^n_{i=1} X_i \leq x) = \sum_{j=1}^\infty β_{jx} ν_{jx}^{n} \approx B_{x} r{1x}^{n} $$ where $\{ν_{jx}\}$ are the eigenvalues of a certain matrix, $r_{1x}$ is the maximum magnitude of the eigenvalues, and $I$ depends on the number of possible values of the underlying random variables. The eigenvalues do not depend on $x$ only on its range.
△ Less
Submitted 6 April, 2009; v1 submitted 4 February, 2008;
originally announced February 2008.
-
The distribution of the maximum of a first order moving average: the continuous case
Authors:
Christopher S. Withers,
Saralees Nadarajah
Abstract:
We give the distribution of $M_n$, the maximum of a sequence of $n$ observations from a moving average of order 1. Solutions are first given in terms of repeated integrals and then for the case where the underlying independent random variables have an absolutely continuous density. When the correlation is positive,…
▽ More
We give the distribution of $M_n$, the maximum of a sequence of $n$ observations from a moving average of order 1. Solutions are first given in terms of repeated integrals and then for the case where the underlying independent random variables have an absolutely continuous density. When the correlation is positive, $$ P(M_n %\max^n_{i=1} X_i \leq x) =\ \sum_{j=1}^\infty β_{jx} ν_{jx}^{n} \approx B_{x} ν_{1x}^{n} $$ where %$\{X_i\}$ is a moving average of order 1 with positive correlation, and $\{ν_{jx}\}$ are the eigenvalues (singular values) of a Fredholm kernel and $ν_{1x}$ is the eigenvalue of maximum magnitude. A similar result is given when the correlation is negative. The result is analogous to large deviations expansions for estimates, since the maximum need not be standardized to have a limit. % there are more terms, and $$P(M_n <x) \approx B'_{x}\ (1+ν_{1x})^n.$$
For the continuous case the integral equations for the left and right eigenfunctions are converted to first order linear differential equations. The eigenvalues satisfy an equation of the form $$\sum_{i=1}^\infty w_i(λ-θ_i)^{-1}=λ-θ_0$$ for certain known weights $\{w_i\}$ and eigenvalues $\{θ_i\}$ of a given matrix. This can be solved by truncating the sum to an increasing number of terms.
△ Less
Submitted 6 September, 2009; v1 submitted 4 February, 2008;
originally announced February 2008.