
Best $L_p$ Isotonic Regressions, $p \in \{0, 1, \infty\}$

Abstract: Given a real-valued weighted function $f$ on a finite dag, the $L_p$ isotonic regression of $f$, $p \in [0,\infty]$, is unique except when $p \in [0,1] \cup \{\infty\}$. We are interested in determining a ``best'' isotonic regression for $p \in \{0, 1, \infty\}$, where by best we mean a regression satisfying stronger properties than merely having minimal norm. One approach is to use strict $L_p$ regression, which is the limit of the best $L_q$ approximation as $q$ approaches $p$, and another is lex regression, which is based on lexical ordering of regression errors. For $L_\infty$ the strict and lex regressions are unique and the same. For $L_1$, strict $q \scriptstyle\searrow 1$ is unique, but we show that $q \scriptstyle\nearrow 1$ may not be, and even when it is unique the two limits may not be the same. For $L_0$, in general neither of the strict and lex regressions are unique, nor do they always have the same set of optimal regressions, but by expanding the objectives of $L_p$ optimization to $p < 0$ we show $p{ \scriptstyle \nearrow} 0$ is the same as lex regression. We also give algorithms for computing the best $L_p$ isotonic regression in certain situations.

Submitted 31 May, 2023; originally announced June 2023.

arXiv:2212.02640

Low Power Mesh Algorithms for Image Problems

Authors: Quentin Stout

Abstract: We analyze a physically motivated fine-grained mesh-connected computer model, assuming that a word of information takes a fixed area and that it takes unit time and unit energy to move a word unit distance. This is a representation of computing on a chip with myriad tiny processors arranged as a mesh. While most mesh algorithms assume all processors are active at all times, we give algorithms that have only a few processors on at any one time, which reduces the power required. We apply this approach to basic problems involving images, showing that there can be dramatic reductions in the peak power with only small, if any, changes in the time required. We also show that these algorithms give a more efficient way to utilize power when more power is available.

Submitted 5 December, 2022; originally announced December 2022.

arXiv:2107.00251

$L_p$ Isotonic Regression Algorithms Using an $L_0$ Approach

Authors: Quentin F. Stout

Abstract: Significant advances in flow algorithms have changed the relative performance of various approaches to algorithms for $L_p$ isotonic regression. We show a simple plug-in method to systematically incorporate such advances, and advances in determining violator dags, with no assumptions about the algorithms' structures. The method is based on the standard algorithm for $L_0$ (Hamming distance) isotonic regression (by finding anti-chains in a violator dag), coupled with partitioning based on binary $L_1$ isotonic regression. For several important classes of graphs the algorithms are already faster (in O-notation) than previously published ones, close to or at the lower bound, and significantly faster than those implemented in statistical packages. We consider exact and approximate results for $L_p$ regressions, $p=0$ and $1 \leq p < \infty$, and a variety of orderings.

Submitted 29 June, 2023; v1 submitted 1 July, 2021; originally announced July 2021.

Comments: Removed mistake in abstract about unweighted points in $d$-dimensional space. Added references and revised paper to make it more readable. Revised material to make some sections clearer. Replaced Gao, Liu, Peng paper with van den Brand, Lee, Liu, Saranurak, Sidford, Song, Wang, D (i.e., did exactly what paper shows, namely plug and play, replace flow algorithm when better one appears)

arXiv:2106.00279

$L_0$ Isotonic Regression With Secondary Objectives

Authors: Quentin F. Stout

Abstract: We provide algorithms for isotonic regression minimizing $L_0$ error (Hamming distance). This is also known as monotonic relabeling, and is applicable when labels have a linear ordering but not necessarily a metric. There may be exponentially many optimal relabelings, so we look at secondary criteria to determine which are best. For arbitrary ordinal labels the criterion is maximizing the number of labels which are only changed to an adjacent label (and recursively apply this). For real-valued labels we minimize the $L_p$ error. For linearly ordered sets we also give algorithms which minimize the sum of the $L_p$ and weighted $L_0$ errors, a form of penalized (regularized) regression. We also examine $L_0$ isotonic regression on multidimensional coordinate-wise orderings. Previous algorithms took $Θ(n^3)$ time, but we reduce this to $o(n^{3/2})$.

Submitted 10 June, 2022; v1 submitted 1 June, 2021; originally announced June 2021.

Comments: Adds references, improves exposition
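
As a small illustration of monotonic relabeling, the sketch below handles only the simplest case: unweighted labels on a linear order. It assumes the standard observation that the labels left unchanged by any isotonic relabeling form a non-decreasing subsequence, so the minimum $L_0$ error is $n$ minus the length of a longest non-decreasing subsequence. The function name `min_relabelings` is hypothetical, and this is not the paper's algorithm, which treats weights, secondary objectives, and more general orderings.

```python
from bisect import bisect_right


def min_relabelings(labels):
    """Minimum number of labels to change so the sequence becomes non-decreasing.

    Assumes unweighted labels on a linear order (an illustrative special case).
    """
    # tails[k] is the smallest possible last value of a non-decreasing
    # subsequence of length k + 1 seen so far.
    tails = []
    for x in labels:
        pos = bisect_right(tails, x)  # bisect_right allows equal values to extend
        if pos == len(tails):
            tails.append(x)
        else:
            tails[pos] = x
    # Everything outside a longest non-decreasing subsequence must be relabeled.
    return len(labels) - len(tails)


if __name__ == "__main__":
    print(min_relabelings([1, 3, 2, 2, 5, 4]))
```

Because only comparisons are used, the same sketch applies to any ordinal labels that support `<`, not only numbers.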

arXiv:2102.03674

Generating Artificial Core Users for Interpretable Condensed Data

Authors: Amy Nesky, Quentin F. Stout

Abstract: Recent work has shown that in a dataset of user ratings on items there exists a group of Core Users who hold most of the information necessary for recommendation. This set of Core Users can be as small as 20 percent of the users. Core Users can be used to make predictions for out-of-sample users without much additional work. Since Core Users substantially shrink a ratings dataset without much loss of information, they can be used to improve recommendation efficiency. We propose a method, combining latent factor models, ensemble boosting and K-means clustering, to generate a small set of Artificial Core Users (ACUs) from real Core User data. Our ACUs have dense rating information, and improve the recommendation performance of real Core Users while remaining interpretable.

Submitted 6 February, 2021; originally announced February 2021.

Comments: 11 pages, 5 figures

arXiv:1507.02226

L infinity Isotonic Regression for Linear, Multidimensional, and Tree Orders

Authors: Quentin F. Stout

Abstract: Algorithms are given for determining $L_\infty$ isotonic regression of weighted data. For a linear order, grid in multidimensional space, or tree, of $n$ vertices, optimal algorithms are given, taking $Θ(n)$ time. These improve upon previous algorithms by a factor of $Ω(\log n)$. For vertices at arbitrary positions in $d$-dimensional space a $Θ(n \log^{d-1} n)$ algorithm employs iterative sorting to yield the functionality of a multidimensional structure while using only $Θ(n)$ space. The algorithms utilize a new non-constructive feasibility test on a rendezvous graph, with bounded error envelopes at each vertex.

Submitted 22 June, 2017; v1 submitted 8 July, 2015; originally announced July 2015.

Comments: updated references, minor modifications
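
For intuition about $L_\infty$ isotonic regression, the following sketch covers only the unweighted linear-order case, where one optimal regression is known to be the average of the running maximum from the left and the running minimum from the right. This is a simplifying assumption: the paper addresses weighted data, grids, trees, and points in $d$-dimensional space, none of which this sketch handles. The function name `linf_isotonic_unweighted` is hypothetical.

```python
from itertools import accumulate


def linf_isotonic_unweighted(values):
    """One optimal L_infinity isotonic regression of unweighted data on a line.

    Both the prefix maximum and the suffix minimum are non-decreasing,
    so their average is non-decreasing as well.
    """
    prefix_max = list(accumulate(values, max))
    suffix_min = list(accumulate(reversed(values), min))[::-1]
    return [(hi + lo) / 2 for hi, lo in zip(prefix_max, suffix_min)]


if __name__ == "__main__":
    print(linf_isotonic_unweighted([1, 3, 2, 4]))
```

The sketch runs in linear time and space for this special case; other optimal regressions generally exist, which is why stricter notions of a best regression are of interest.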

arXiv:1502.01435

Optimal component labeling algorithms for mesh-connected computers and VLSI

Authors: Quentin F. Stout

Abstract: Given an undirected graph $G$ of $n$ weighted edges, stored one edge per processor in a square mesh of $n$ processors, we show how to determine the connected components and a minimal spanning forest in $Θ(\sqrt{n})$ time. More generally, we show how to solve these problems in $Θ(n^{1/d})$ time when the mesh is a $d$-dimensional cube, where the implied constants depend upon $d$.

Submitted 5 February, 2015; originally announced February 2015.
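
To make the two outputs concrete (component labels and a minimal spanning forest), here is a sequential sketch using Kruskal's method with a union-find structure. It is a swapped-in serial technique for illustration only and says nothing about the mesh-connected algorithm or its running time. The function name `components_and_msf` and the `(weight, u, v)` edge format are assumptions.

```python
def components_and_msf(num_vertices, edges):
    """Return (labels, forest) for an undirected weighted graph.

    edges: iterable of (weight, u, v) tuples with 0 <= u, v < num_vertices.
    labels[x] is a representative vertex of x's connected component.
    forest lists the edges of a minimal spanning forest.
    """
    parent = list(range(num_vertices))

    def find(x):
        # Path halving: point each visited node at its grandparent.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    forest = []
    for weight, u, v in sorted(edges):  # lightest edges first
        root_u, root_v = find(u), find(v)
        if root_u != root_v:  # edge joins two different components
            parent[root_u] = root_v
            forest.append((weight, u, v))

    labels = [find(x) for x in range(num_vertices)]
    return labels, forest


if __name__ == "__main__":
    sample = [(1, 0, 1), (2, 1, 2), (3, 0, 2), (5, 3, 4)]
    print(components_and_msf(6, sample))
```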

arXiv:1412.2844

Optimal Reduced Isotonic Regression

Authors: Janis Hardwick, Quentin F. Stout

Abstract: Isotonic regression is a shape-constrained nonparametric regression in which the regression is an increasing step function. For $n$ data points, the number of steps in the isotonic regression may be as large as $n$. As a result, standard isotonic regression has been criticized as overfitting the data or making the representation too complicated. So-called "reduced" isotonic regression constrains the outcome to be a specified number of steps $b$, $b \leq n$. However, because the previous algorithms for finding the reduced $L_2$ regression took $Θ(n+bm^2)$ time, where $m$ is the number of steps of the unconstrained isotonic regression, researchers felt that the algorithms were too slow and instead used approximations. Other researchers had results that were approximations because they used a greedy top-down approach. Here we give an algorithm to find an exact solution in $Θ(n+bm)$ time, and a simpler algorithm taking $Θ(n+b m \log m)$ time. These algorithms also determine optimal $k$-means clustering of weighted 1-dimensional data.

Submitted 8 December, 2014; originally announced December 2014.
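
The unconstrained $L_2$ isotonic regression that serves as the starting point can be computed with the classical pool-adjacent-violators approach, sketched below for weighted data on a linear order. This is the standard textbook method, not the reduced-regression algorithm of the paper; the function name `pava` and its argument layout are assumptions made for illustration.

```python
def pava(values, weights=None):
    """Weighted L_2 isotonic regression on a linear order (pool adjacent violators)."""
    if weights is None:
        weights = [1.0] * len(values)

    # Each block holds [weighted mean, total weight, number of points].
    blocks = []
    for value, weight in zip(values, weights):
        blocks.append([float(value), float(weight), 1])
        # Merge while the last two blocks violate the ordering.
        while len(blocks) > 1 and blocks[-2][0] > blocks[-1][0]:
            mean2, weight2, count2 = blocks.pop()
            mean1, weight1, count1 = blocks.pop()
            total = weight1 + weight2
            merged_mean = (mean1 * weight1 + mean2 * weight2) / total
            blocks.append([merged_mean, total, count1 + count2])

    # Expand the blocks back into one fitted value per data point.
    fitted = []
    for mean, _, count in blocks:
        fitted.extend([mean] * count)
    return fitted


if __name__ == "__main__":
    print(pava([1, 3, 2, 4]))
```

A reduced regression would then limit how many distinct steps such a fit may use, which this sketch does not attempt.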

arXiv:1412.2379

An Algorithm for $L_\infty$ Approximation by Step Functions

Authors: Quentin F. Stout

Abstract: An algorithm is given for determining an optimal $b$-step approximation of weighted data, where the error is measured with respect to the $L_\infty$ norm. For data presorted by the independent variable the algorithm takes $Θ(n + \log n \cdot b(1+\log n/b))$ time and $Θ(n)$ space. This is $Θ(n \log n)$ in the worst case and $Θ(n)$ when $b = O(n/\log n \log\log n)$. A minor change determines an optimal reduced isotonic regression in the same time and space bounds, and the algorithm also solves the $k$-center problem for 1-dimensional weighted data.

Submitted 4 May, 2015; v1 submitted 7 December, 2014; originally announced December 2014.

arXiv:1101.3758

doi 10.1088/0067-0049/194/2/23

Crash: A Block-Adaptive-Mesh Code for Radiative Shock Hydrodynamics - Implementation and Verification

Authors: B. van der Holst, G. Toth, I. V. Sokolov, K. G. Powell, J. P. Holloway, E. S. Myra, Q. Stout, M. L. Adams, J. E. Morel, R. P. Drake

Abstract: We describe the CRASH (Center for Radiative Shock Hydrodynamics) code, a block adaptive mesh code for multi-material radiation hydrodynamics. The implementation solves the radiation diffusion model with the gray or multigroup method and uses a flux limited diffusion approximation to recover the free-streaming limit. The electrons and ions are allowed to have different temperatures and we include a flux limited electron heat conduction. The radiation hydrodynamic equations are solved in the Eulerian frame by means of a conservative finite volume discretization in either one, two, or three-dimensional slab geometry or in two-dimensional cylindrical symmetry. An operator split method is used to solve these equations in three substeps: (1) solve the hydrodynamic equations with shock-capturing schemes, (2) a linear advection of the radiation in frequency-logarithm space, and (3) an implicit solve of the stiff radiation diffusion, heat conduction, and energy exchange. We present a suite of verification test problems to demonstrate the accuracy and performance of the algorithms. The CRASH code is an extension of the Block-Adaptive Tree Solarwind Roe Upwind Scheme (BATS-R-US) code with this new radiation transfer and heat conduction library and equation-of-state and multigroup opacity solvers. Both CRASH and BATS-R-US are part of the publicly available Space Weather Modeling Framework (SWMF).

Submitted 19 January, 2011; originally announced January 2011.

Comments: 51 pages, 19 figures; submitted to Astrophysical Journal

Showing 1–10 of 10 results for author: Stout, Q