Skip to main content

Showing 1–12 of 12 results for author: Niethammer, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2202.08070  [pdf, other

    cs.LG stat.ML

    On Measuring Excess Capacity in Neural Networks

    Authors: Florian Graf, Sebastian Zeng, Bastian Rieck, Marc Niethammer, Roland Kwitt

    Abstract: We study the excess capacity of deep networks in the context of supervised classification. That is, given a capacity measure of the underlying hypothesis class - in our case, empirical Rademacher complexity - to what extent can we (a priori) constrain this class while retaining an empirical error on a par with the unconstrained regime? To assess excess capacity in modern architectures (such as res… ▽ More

    Submitted 19 January, 2023; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: Updated to Neurips 2022 camera-ready version

  2. arXiv:2102.08817  [pdf, other

    stat.ML cs.LG

    Dissecting Supervised Contrastive Learning

    Authors: Florian Graf, Christoph D. Hofer, Marc Niethammer, Roland Kwitt

    Abstract: Minimizing cross-entropy over the softmax scores of a linear map composed with a high-capacity encoder is arguably the most popular choice for training neural networks on supervised learning tasks. However, recent works show that one can directly optimize the encoder instead, to obtain equally (or even more) discriminative representations via a supervised variant of a contrastive objective. In thi… ▽ More

    Submitted 2 March, 2023; v1 submitted 17 February, 2021; originally announced February 2021.

    Comments: v4 updates: - updated appendix section S1.3 - this includes fixing an oversight in the proofs (Lemma 1 missed an equality condition, which now appears in Lemma 2) - improved figure quality

    Journal ref: Proceedings of the 38th International Conference on Machine Learning, PMLR 139:3821-3830, 2021

  3. arXiv:2008.10797  [pdf, other

    cs.LG stat.ML

    The Fairness-Accuracy Pareto Front

    Authors: Susan Wei, Marc Niethammer

    Abstract: Algorithmic fairness seeks to identify and correct sources of bias in machine learning algorithms. Confoundingly, ensuring fairness often comes at the cost of accuracy. We provide formal tools in this work for reconciling this fundamental tension in algorithm fairness. Specifically, we put to use the concept of Pareto optimality from multi-objective optimization and seek the fairness-accuracy Pare… ▽ More

    Submitted 18 November, 2021; v1 submitted 24 August, 2020; originally announced August 2020.

    Comments: added toy figs to illustrate pareto optimality, some re-organization for clarity following reviewer comments

  4. arXiv:2006.04259  [pdf, other

    cs.LG stat.ML

    Deep Goal-Oriented Clustering

    Authors: Yifeng Shi, Christopher M. Bender, Junier B. Oliva, Marc Niethammer

    Abstract: Clustering and prediction are two primary tasks in the fields of unsupervised and supervised learning, respectively. Although much of the recent advances in machine learning have been centered around those two tasks, the interdependent, mutually beneficial relationship between them is rarely explored. One could reasonably expect appropriately clustering the data would aid the downstream prediction… ▽ More

    Submitted 15 June, 2020; v1 submitted 7 June, 2020; originally announced June 2020.

    Comments: 15 pages

  5. arXiv:2002.04805  [pdf, other

    cs.LG math.AT stat.ML

    Topologically Densified Distributions

    Authors: Christoph D. Hofer, Florian Graf, Marc Niethammer, Roland Kwitt

    Abstract: We study regularization in the context of small sample-size learning with over-parameterized neural networks. Specifically, we shift focus from architectural properties, such as norms on the network weights, to properties of the internal representations before a linear classifier. Specifically, we impose a topological constraint on samples drawn from the probability measure induced in that space.… ▽ More

    Submitted 17 May, 2021; v1 submitted 12 February, 2020; originally announced February 2020.

  6. arXiv:1912.00434  [pdf, other

    q-bio.QM eess.IV stat.AP

    Joint and individual analysis of breast cancer histologic images and genomic covariates

    Authors: Iain Carmichael, Benjamin C. Calhoun, Katherine A. Hoadley, Melissa A. Troester, Joseph Geradts, Heather D. Couture, Linnea Olsson, Charles M. Perou, Marc Niethammer, Jan Hannig, J. S. Marron

    Abstract: A key challenge in modern data analysis is understanding connections between complex and differing modalities of data. For example, two of the main approaches to the study of breast cancer are histopathology (analyzing visual characteristics of tumors) and genetics. While histopathology is the gold standard for diagnostics and there have been many recent breakthroughs in genetics, there is little… ▽ More

    Submitted 13 April, 2020; v1 submitted 1 December, 2019; originally announced December 2019.

  7. arXiv:1909.09877  [pdf, other

    cs.LG stat.ML

    Deep Message Passing on Sets

    Authors: Yifeng Shi, Junier Oliva, Marc Niethammer

    Abstract: Modern methods for learning over graph input data have shown the fruitfulness of accounting for relationships among elements in a collection. However, most methods that learn over set input data use only rudimentary approaches to exploit intra-collection relationships. In this work we introduce Deep Message Passing on Sets (DMPS), a novel method that incorporates relational learning for sets. DMPS… ▽ More

    Submitted 21 September, 2019; originally announced September 2019.

    Comments: 11 pages, 8 figures

  8. arXiv:1907.07739  [pdf, other

    cs.LG cs.CV stat.ML

    Deep Multi-View Learning via Task-Optimal CCA

    Authors: Heather D. Couture, Roland Kwitt, J. S. Marron, Melissa Troester, Charles M. Perou, Marc Niethammer

    Abstract: Canonical Correlation Analysis (CCA) is widely used for multimodal data analysis and, more recently, for discriminative tasks such as multi-view learning; however, it makes no use of class labels. Recent CCA methods have started to address this weakness but are limited in that they do not simultaneously optimize the CCA projection for discrimination and the CCA projection itself, or they are linea… ▽ More

    Submitted 17 July, 2019; originally announced July 2019.

  9. arXiv:1906.09003  [pdf, other

    cs.LG cs.CG math.AT stat.ML

    Connectivity-Optimized Representation Learning via Persistent Homology

    Authors: Christoph Hofer, Roland Kwitt, Mandar Dixit, Marc Niethammer

    Abstract: We study the problem of learning representations with controllable connectivity properties. This is beneficial in situations when the imposed structure can be leveraged upstream. In particular, we control the connectivity of an autoencoder's latent space via a novel type of loss, operating on information from persistent homology. Under mild conditions, this loss is differentiable and we present a… ▽ More

    Submitted 21 June, 2019; originally announced June 2019.

  10. arXiv:1905.10996  [pdf, other

    cs.LG math.AT stat.ML

    Graph Filtration Learning

    Authors: Christoph D. Hofer, Florian Graf, Bastian Rieck, Marc Niethammer, Roland Kwitt

    Abstract: We propose an approach to learning with graph-structured data in the problem domain of graph classification. In particular, we present a novel type of readout operation to aggregate node features into a graph-level representation. To this end, we leverage persistent homology computed via a real-valued, learnable, filter function. We establish the theoretical foundation for differentiating through… ▽ More

    Submitted 17 May, 2021; v1 submitted 27 May, 2019; originally announced May 2019.

  11. arXiv:1803.02726  [pdf, other

    cs.SI physics.soc-ph stat.ML

    Stochastic Block Models with Multiple Continuous Attributes

    Authors: Natalie Stanley, Thomas Bonacci, Roland Kwitt, Marc Niethammer, Peter J. Mucha

    Abstract: The stochastic block model (SBM) is a probabilistic model for community structure in networks. Typically, only the adjacency matrix is used to perform SBM parameter inference. In this paper, we consider circumstances in which nodes have an associated vector of continuous attributes that are also used to learn the node-to-community assignments and corresponding SBM parameters. While this assumption… ▽ More

    Submitted 7 March, 2018; originally announced March 2018.

  12. arXiv:1707.05961  [pdf

    cs.CV q-bio.NC stat.ML

    Multidimensional classification of hippocampal shape features discriminates Alzheimer's disease and mild cognitive impairment from normal aging

    Authors: Emilie Gerardin, Gaël Chételat, Marie Chupin, Rémi Cuingnet, Béatrice Desgranges, Ho-Sung Kim, Marc Niethammer, Bruno Dubois, Stéphane Lehéricy, Line Garnero, Francis Eustache, Olivier Colliot

    Abstract: We describe a new method to automatically discriminate between patients with Alzheimer's disease (AD) or mild cognitive impairment (MCI) and elderly controls, based on multidimensional classification of hippocampal shape features. This approach uses spherical harmonics (SPHARM) coefficients to model the shape of the hippocampi, which are segmented from magnetic resonance images (MRI) using a fully… ▽ More

    Submitted 19 July, 2017; originally announced July 2017.

    Comments: Data used in the preparation of this article were obtained from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database

    Journal ref: NeuroImage, 47 (4), pp.1476-86, 2009