Skip to main content

Showing 1–5 of 5 results for author: O'Reilly, E

Searching in archive stat. Search in all archives.
.
  1. arXiv:2407.02458  [pdf, other

    math.ST stat.ML

    Statistical Advantages of Oblique Randomized Decision Trees and Forests

    Authors: Eliza O'Reilly

    Abstract: This work studies the statistical advantages of using features comprised of general linear combinations of covariates to partition the data in randomized decision tree and forest regression algorithms. Using random tessellation theory in stochastic geometry, we provide a theoretical analysis of a class of efficiently generated random tree and forest estimators that allow for oblique splits along s… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 43 pages, 2 figures

    MSC Class: Primary 62G05; secondary 60D05

  2. arXiv:2212.13597  [pdf, other

    math.OC math.MG math.ST stat.ML

    Optimal Regularization for a Data Source

    Authors: Oscar Leong, Eliza O'Reilly, Yong Sheng Soh, Venkat Chandrasekaran

    Abstract: In optimization-based approaches to inverse problems and to statistical estimation, it is common to augment criteria that enforce data fidelity with a regularizer that promotes desired structural properties in the solution. The choice of a suitable regularizer is typically driven by a combination of prior domain information and computational considerations. Convex regularizers are attractive compu… ▽ More

    Submitted 5 February, 2024; v1 submitted 27 December, 2022; originally announced December 2022.

  3. arXiv:2110.14779  [pdf, other

    math.OC math.ST stat.ML

    Spectrahedral Regression

    Authors: Eliza O'Reilly, Venkat Chandrasekaran

    Abstract: Convex regression is the problem of fitting a convex function to a data set consisting of input-output pairs. We present a new approach to this problem called spectrahedral regression, in which we fit a spectrahedral function to the data, i.e. a function that is the maximum eigenvalue of an affine matrix expression of the input. This method represents a significant generalization of polyhedral (al… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

    MSC Class: Primary 90C22; 90C47; secondary 62G08; 62G20

  4. arXiv:2109.10541  [pdf, other

    math.ST math.PR stat.ML

    Minimax Rates for High-Dimensional Random Tessellation Forests

    Authors: Eliza O'Reilly, Ngoc Mai Tran

    Abstract: Random forests are a popular class of algorithms used for regression and classification. The algorithm introduced by Breiman in 2001 and many of its variants are ensembles of randomized decision trees built from axis-aligned partitions of the feature space. One such variant, called Mondrian forests, was proposed to handle the online setting and is the first class of random forests for which minima… ▽ More

    Submitted 29 October, 2023; v1 submitted 22 September, 2021; originally announced September 2021.

    Comments: 26 pages

    MSC Class: 60D05; 62G07

  5. arXiv:2002.00797  [pdf, other

    stat.ML cs.LG math.PR

    Stochastic geometry to generalize the Mondrian Process

    Authors: Eliza O'Reilly, Ngoc Tran

    Abstract: The stable under iterated tessellation (STIT) process is a stochastic process that produces a recursive partition of space with cut directions drawn independently from a distribution over the sphere. The case of random axis-aligned cuts is known as the Mondrian process. Random forests and Laplace kernel approximations built from the Mondrian process have led to efficient online learning methods an… ▽ More

    Submitted 13 September, 2021; v1 submitted 3 February, 2020; originally announced February 2020.

    MSC Class: 60D05; 62G07