Skip to main content

Showing 1–7 of 7 results for author: Soen, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.18686  [pdf, other

    stat.ML cs.LG

    Rejection via Learning Density Ratios

    Authors: Alexander Soen, Hisham Husain, Philip Schulz, Vu Nguyen

    Abstract: Classification with rejection emerges as a learning paradigm which allows models to abstain from making predictions. The predominant approach is to alter the supervised learning pipeline by augmenting typical loss functions, letting model rejection incur a lower loss than an incorrect prediction. Instead, we propose a different distributional perspective, where we seek to find an idealized data di… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  2. arXiv:2402.05379  [pdf, other

    cs.LG stat.ML

    Tradeoffs of Diagonal Fisher Information Matrix Estimators

    Authors: Alexander Soen, Ke Sun

    Abstract: The Fisher information matrix characterizes the local geometry in the parameter space of neural networks. It elucidates insightful theories and useful tools to understand and optimize neural networks. Given its high computational cost, practitioners often use random estimators and evaluate only the diagonal entries. We examine two such estimators, whose accuracy and sample complexity depend on the… ▽ More

    Submitted 2 April, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  3. arXiv:2201.12947  [pdf, other

    stat.ML cs.LG

    Fair Wrap** for Black-box Predictions

    Authors: Alexander Soen, Ibrahim Alabdulmohsin, Sanmi Koyejo, Yishay Mansour, Nyalleng Moorosi, Richard Nock, Ke Sun, Lexing Xie

    Abstract: We introduce a new family of techniques to post-process ("wrap") a black-box classifier in order to reduce its bias. Our technique builds on the recent analysis of improper loss functions whose optimization can correct any twist in prediction, unfairness being treated as a twist. In the post-processing, we learn a wrapper function which we define as an $α$-tree, which modifies the prediction. We p… ▽ More

    Submitted 1 November, 2022; v1 submitted 30 January, 2022; originally announced January 2022.

    Comments: Published in Advances in Neural Information Processing Systems 35 (NeurIPS 2022)

  4. arXiv:2107.04205  [pdf, other

    cs.LG stat.ML

    On the Variance of the Fisher Information for Deep Learning

    Authors: Alexander Soen, Ke Sun

    Abstract: In the realm of deep learning, the Fisher information matrix (FIM) gives novel insights and useful tools to characterize the loss landscape, perform second-order optimization, and build geometric learning theories. The exact FIM is either unavailable in closed form or too expensive to compute. In practice, it is almost always estimated based on empirical samples. We investigate two such estimators… ▽ More

    Submitted 27 October, 2021; v1 submitted 9 July, 2021; originally announced July 2021.

    Comments: Published in Advances in Neural Information Processing Systems 34 (NeurIPS 2021)

  5. arXiv:2104.07932  [pdf, other

    cs.LG cs.CE stat.ML

    Interval-censored Hawkes processes

    Authors: Marian-Andrei Rizoiu, Alexander Soen, Shidi Li, Pio Calderon, Leanne Dong, Aditya Krishna Menon, Lexing Xie

    Abstract: Interval-censored data solely records the aggregated counts of events during specific time intervals - such as the number of patients admitted to the hospital or the volume of vehicles passing traffic loop detectors - and not the exact occurrence time of the events. It is currently not understood how to fit the Hawkes point processes to this kind of data. Its typical loss function (the point proce… ▽ More

    Submitted 25 November, 2022; v1 submitted 16 April, 2021; originally announced April 2021.

    Journal ref: Journal of Machine Learning Research, 23(338):1-84, 2022. https://jmlr.org/papers/v23/21-0917.html

  6. arXiv:2012.00188  [pdf, other

    stat.ML cs.LG

    Fair Densities via Boosting the Sufficient Statistics of Exponential Families

    Authors: Alexander Soen, Hisham Husain, Richard Nock

    Abstract: We introduce a boosting algorithm to pre-process data for fairness. Starting from an initial fair but inaccurate distribution, our approach shifts towards better data fitting while still ensuring a minimal fairness guarantee. To do so, it learns the sufficient statistics of an exponential family with boosting-compliant convergence. Importantly, we are able to theoretically prove that the learned d… ▽ More

    Submitted 15 August, 2023; v1 submitted 30 November, 2020; originally announced December 2020.

    Comments: Published in Proceedings of the 40th International Conference on Machine Learning (ICML2023)

  7. arXiv:2007.14082  [pdf, other

    cs.LG stat.ML

    UNIPoint: Universally Approximating Point Processes Intensities

    Authors: Alexander Soen, Alexander Mathews, Daniel Grixti-Cheng, Lexing Xie

    Abstract: Point processes are a useful mathematical tool for describing events over time, and so there are many recent approaches for representing and learning them. One notable open question is how to precisely describe the flexibility of point process models and whether there exists a general model that can represent all point processes. Our work bridges this gap. Focusing on the widely used event intensi… ▽ More

    Submitted 2 March, 2021; v1 submitted 28 July, 2020; originally announced July 2020.