Skip to main content

Showing 1–23 of 23 results for author: Dyer, E L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.11742  [pdf, other

    cs.LG stat.ML

    Balanced Data, Imbalanced Spectra: Unveiling Class Disparities with Spectral Imbalance

    Authors: Chiraag Kaushik, Ran Liu, Chi-Heng Lin, Amrit Khera, Matthew Y **, Wenrui Ma, Vidya Muthukumar, Eva L Dyer

    Abstract: Classification models are expected to perform equally well for different classes, yet in practice, there are often large gaps in their performance. This issue of class bias is widely studied in cases of datasets with sample imbalance, but is relatively overlooked in balanced datasets. In this work, we introduce the concept of spectral imbalance in features as a potential source for class dispariti… ▽ More

    Submitted 3 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: 25 pages, 9 figures

  2. arXiv:2310.16046  [pdf, other

    cs.LG q-bio.NC

    A Unified, Scalable Framework for Neural Population Decoding

    Authors: Mehdi Azabou, Vinam Arora, Venkataramana Ganesh, Ximeng Mao, Santosh Nachimuthu, Michael J. Mendelson, Blake Richards, Matthew G. Perich, Guillaume Lajoie, Eva L. Dyer

    Abstract: Our ability to use deep learning approaches to decipher neural activity would likely benefit from greater scale, in terms of both model size and datasets. However, the integration of many neural recordings into one unified model is challenging, as each recording contains the activity of different neurons from different individual animals. In this paper, we introduce a training framework and archit… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted at NeurIPS 2023

  3. arXiv:2308.14596  [pdf, other

    cs.CV cs.LG

    LatentDR: Improving Model Generalization Through Sample-Aware Latent Degradation and Restoration

    Authors: Ran Liu, Sahil Khose, **gyun Xiao, Lakshmi Sathidevi, Keerthan Ramnath, Zsolt Kira, Eva L. Dyer

    Abstract: Despite significant advances in deep learning, models often struggle to generalize well to new, unseen domains, especially when training data is limited. To address this challenge, we propose a novel approach for distribution-aware latent augmentation that leverages the relationships across samples to guide the augmentation procedure. Our approach first degrades the samples stochastically in the l… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

  4. arXiv:2308.09198  [pdf, other

    cs.LG cs.SI

    Half-Hop: A graph upsampling approach for slowing down message passing

    Authors: Mehdi Azabou, Venkataramana Ganesh, Shantanu Thakoor, Chi-Heng Lin, Lakshmi Sathidevi, Ran Liu, Michal Valko, Petar Veličković, Eva L. Dyer

    Abstract: Message passing neural networks have shown a lot of success on graph-structured data. However, there are many instances where message passing can lead to over-smoothing or fail when neighboring nodes belong to different classes. In this work, we introduce a simple yet general framework for improving learning in message passing neural networks. Our approach essentially upsamples edges in the origin… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: Published as a conference paper at ICML 2023

  5. arXiv:2303.08811  [pdf, other

    cs.LG cs.RO

    Relax, it doesn't matter how you get there: A new self-supervised approach for multi-timescale behavior analysis

    Authors: Mehdi Azabou, Michael Mendelson, Nauman Ahad, Maks Sorokin, Shantanu Thakoor, Carolina Urzay, Eva L. Dyer

    Abstract: Natural behavior consists of dynamics that are complex and unpredictable, especially when trying to predict many steps into the future. While some success has been found in building representations of behavior under constrained or simplified task-based conditions, many of these models cannot be applied to free and naturalistic settings where behavior becomes increasingly hard to model. In this wor… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: arXiv admin note: text overlap with arXiv:2206.07041

  6. arXiv:2302.11023  [pdf, other

    cs.LG q-bio.NC

    Learning signatures of decision making from many individuals playing the same game

    Authors: Michael J Mendelson, Mehdi Azabou, Suma Jacob, Nicola Grissom, David Darrow, Becket Ebitz, Alexander Herman, Eva L. Dyer

    Abstract: Human behavior is incredibly complex and the factors that drive decision making--from instinct, to strategy, to biases between individuals--often vary over multiple timescales. In this paper, we design a predictive framework that learns representations to encode an individual's 'behavioral style', i.e. long-term behavioral trends, while simultaneously predicting future actions and choices. The mod… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: 4 pages, 2 figures. To be published in IEEE NER

  7. arXiv:2301.00345  [pdf, other

    cs.CV cs.LG

    MTNeuro: A Benchmark for Evaluating Representations of Brain Structure Across Multiple Levels of Abstraction

    Authors: Jorge Quesada, Lakshmi Sathidevi, Ran Liu, Nauman Ahad, Joy M. Jackson, Mehdi Azabou, **gyun Xiao, Christopher Liding, Matthew **, Carolina Urzay, William Gray-Roncal, Erik C. Johnson, Eva L. Dyer

    Abstract: There are multiple scales of abstraction from which we can describe the same image, depending on whether we are focusing on fine-grained details or a more global attribute of the image. In brain map**, learning to automatically parse images to build representations of both small-scale features (e.g., the presence of cells or blood vessels) and global properties of an image (e.g., which brain reg… ▽ More

    Submitted 31 December, 2022; originally announced January 2023.

    Comments: 10 pages, 4 figures, Accepted at NeurIPS 2022

  8. arXiv:2210.05021  [pdf, other

    cs.LG stat.ML

    The good, the bad and the ugly sides of data augmentation: An implicit spectral regularization perspective

    Authors: Chi-Heng Lin, Chiraag Kaushik, Eva L. Dyer, Vidya Muthukumar

    Abstract: Data augmentation (DA) is a powerful workhorse for bolstering performance in modern machine learning. Specific augmentations like translations and scaling in computer vision are traditionally believed to improve generalization by generating new (artificial) data from the same distribution. However, this traditional viewpoint does not explain the success of prevalent augmentations in modern machine… ▽ More

    Submitted 27 February, 2024; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: 72 pages, 8 figures

  9. arXiv:2206.07041  [pdf, other

    cs.LG

    Learning Behavior Representations Through Multi-Timescale Bootstrap**

    Authors: Mehdi Azabou, Michael Mendelson, Maks Sorokin, Shantanu Thakoor, Nauman Ahad, Carolina Urzay, Eva L. Dyer

    Abstract: Natural behavior consists of dynamics that are both unpredictable, can switch suddenly, and unfold over many different timescales. While some success has been found in building representations of behavior under constrained or simplified task-based conditions, many of these models cannot be applied to free and naturalistic settings due to the fact that they assume a single scale of temporal dynamic… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

  10. arXiv:2206.06131  [pdf, other

    q-bio.NC cs.LG

    Seeing the forest and the tree: Building representations of both individual and collective dynamics with transformers

    Authors: Ran Liu, Mehdi Azabou, Max Dabagia, **gyun Xiao, Eva L. Dyer

    Abstract: Complex time-varying systems are often studied by abstracting away from the dynamics of individual components to build a model of the population-level dynamics from the start. However, when building a population-level description, it can be easy to lose sight of each individual and how they contribute to the larger picture. In this paper, we present a novel transformer architecture for learning fr… ▽ More

    Submitted 20 October, 2022; v1 submitted 10 June, 2022; originally announced June 2022.

    Comments: accepted by NeurIPS 2022

  11. arXiv:2202.04000  [pdf, other

    cs.LG

    Learning Sinkhorn divergences for supervised change point detection

    Authors: Nauman Ahad, Eva L. Dyer, Keith B. Hengen, Yao Xie, Mark A. Davenport

    Abstract: Many modern applications require detecting change points in complex sequential data. Most existing methods for change point detection are unsupervised and, as a consequence, lack any information regarding what kind of changes we want to detect or if some kinds of changes are safe to ignore. This often results in poor change detection performance. We present a novel change point detection framework… ▽ More

    Submitted 10 February, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

    Comments: 19 pages, 13 figures. Reorganized figures and text for improved readability

  12. arXiv:2111.02338  [pdf, other

    cs.LG

    Drop, Swap, and Generate: A Self-Supervised Approach for Generating Neural Activity

    Authors: Ran Liu, Mehdi Azabou, Max Dabagia, Chi-Heng Lin, Mohammad Gheshlaghi Azar, Keith B. Hengen, Michal Valko, Eva L. Dyer

    Abstract: Meaningful and simplified representations of neural activity can yield insights into how and what information is being processed within a neural circuit. However, without labels, finding representations that reveal the link between the brain and behavior can be challenging. Here, we introduce a novel unsupervised approach for learning disentangled representations of neural activity called Swap-VAE… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: To be published in Neurips 2021

  13. arXiv:2109.04463  [pdf, other

    cs.LG q-bio.NC

    Neural Latents Benchmark '21: Evaluating latent variable models of neural population activity

    Authors: Felix Pei, Joel Ye, David Zoltowski, Anqi Wu, Raeed H. Chowdhury, Hansem Sohn, Joseph E. O'Doherty, Krishna V. Shenoy, Matthew T. Kaufman, Mark Churchland, Mehrdad Jazayeri, Lee E. Miller, Jonathan Pillow, Il Memming Park, Eva L. Dyer, Chethan Pandarinath

    Abstract: Advances in neural recording present increasing opportunities to study neural activity in unprecedented detail. Latent variable models (LVMs) are promising tools for analyzing this rich activity across diverse neural systems and behaviors, as LVMs do not depend on known relationships between the activity and external experimental variables. However, progress with LVMs for neuronal population activ… ▽ More

    Submitted 17 January, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

  14. arXiv:2102.10106  [pdf, other

    cs.LG

    Mine Your Own vieW: Self-Supervised Learning Through Across-Sample Prediction

    Authors: Mehdi Azabou, Mohammad Gheshlaghi Azar, Ran Liu, Chi-Heng Lin, Erik C. Johnson, Kiran Bhaskaran-Nair, Max Dabagia, Bernardo Avila-Pires, Lindsey Kitchell, Keith B. Hengen, William Gray-Roncal, Michal Valko, Eva L. Dyer

    Abstract: State-of-the-art methods for self-supervised learning (SSL) build representations by maximizing the similarity between different transformed "views" of a sample. Without sufficient diversity in the transformations used to create views, however, it can be difficult to overcome nuisance variables in the data and build rich representations. This motivates the use of the dataset itself to find similar… ▽ More

    Submitted 13 December, 2021; v1 submitted 19 February, 2021; originally announced February 2021.

  15. arXiv:2102.06514  [pdf, other

    cs.LG cs.SI stat.ML

    Large-Scale Representation Learning on Graphs via Bootstrap**

    Authors: Shantanu Thakoor, Corentin Tallec, Mohammad Gheshlaghi Azar, Mehdi Azabou, Eva L. Dyer, Rémi Munos, Petar Veličković, Michal Valko

    Abstract: Self-supervised learning provides a promising path towards eliminating the need for costly label information in representation learning on graphs. However, to achieve state-of-the-art performance, methods often need large numbers of negative examples and rely on complex augmentations. This can be prohibitively expensive, especially for large graphs. To address these challenges, we introduce Bootst… ▽ More

    Submitted 20 February, 2023; v1 submitted 12 February, 2021; originally announced February 2021.

    Comments: Published as a conference paper at ICLR 2022

  16. arXiv:2012.11589  [pdf, other

    cs.LG

    Making transport more robust and interpretable by moving data through a small number of anchor points

    Authors: Chi-Heng Lin, Mehdi Azabou, Eva L. Dyer

    Abstract: Optimal transport (OT) is a widely used technique for distribution alignment, with applications throughout the machine learning, graphics, and vision communities. Without any additional structural assumptions on trans-port, however, OT can be fragile to outliers or noise, especially in high dimensions. Here, we introduce a new form of structured OT that simultaneously learns low-dimensional struct… ▽ More

    Submitted 17 July, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

    Journal ref: International Conference on Machine Learning (ICML) 2021

  17. arXiv:2006.02624  [pdf, other

    cs.LG stat.ML

    Bayesian optimization for modular black-box systems with switching costs

    Authors: Chi-Heng Lin, Joseph D. Miano, Eva L. Dyer

    Abstract: Most existing black-box optimization methods assume that all variables in the system being optimized have equal cost and can change freely at each iteration. However, in many real world systems, inputs are passed through a sequence of different operations or modules, making variables in earlier stages of processing more costly to update. Such structure imposes a cost on switching variables in earl… ▽ More

    Submitted 11 October, 2021; v1 submitted 3 June, 2020; originally announced June 2020.

  18. arXiv:1906.11768  [pdf, other

    stat.ML cs.LG

    Hierarchical Optimal Transport for Multimodal Distribution Alignment

    Authors: John Lee, Max Dabagia, Eva L. Dyer, Christopher J. Rozell

    Abstract: In many machine learning applications, it is necessary to meaningfully aggregate, through alignment, different but related datasets. Optimal transport (OT)-based approaches pose alignment as a divergence minimization problem: the aim is to transform a source dataset to match a target dataset using the Wasserstein distance as a divergence measure. We introduce a hierarchical formulation of OT which… ▽ More

    Submitted 3 November, 2019; v1 submitted 27 June, 2019; originally announced June 2019.

  19. arXiv:1604.03629  [pdf, other

    q-bio.QM cs.CV

    Quantifying mesoscale neuroanatomy using X-ray microtomography

    Authors: Eva L. Dyer, William Gray Roncal, Hugo L. Fernandes, Doga Gürsoy, Vincent De Andrade, Rafael Vescovi, Kamel Fezzaa, Xianghui Xiao, Joshua T. Vogelstein, Chris Jacobsen, Konrad P. Körding, Narayanan Kasthuri

    Abstract: Methods for resolving the 3D microstructure of the brain typically start by thinly slicing and staining the brain, and then imaging each individual section with visible light photons or electrons. In contrast, X-rays can be used to image thick samples, providing a rapid approach for producing large 3D brain maps without sectioning. Here we demonstrate the use of synchrotron X-ray microtomography (… ▽ More

    Submitted 26 July, 2016; v1 submitted 12 April, 2016; originally announced April 2016.

    Comments: 28 pages, 9 figures

  20. arXiv:1505.05208  [pdf, other

    stat.ML cs.LG

    oASIS: Adaptive Column Sampling for Kernel Matrix Approximation

    Authors: Raajen Patel, Thomas A. Goldstein, Eva L. Dyer, Azalia Mirhoseini, Richard G. Baraniuk

    Abstract: Kernel matrices (e.g. Gram or similarity matrices) are essential for many state-of-the-art approaches to classification, clustering, and dimensionality reduction. For large datasets, the cost of forming and factoring such kernel matrices becomes intractable. To address this challenge, we introduce a new adaptive sampling algorithm called Accelerated Sequential Incoherence Selection (oASIS) that sa… ▽ More

    Submitted 19 May, 2015; originally announced May 2015.

    ACM Class: G.1.0; G.4

  21. arXiv:1505.00824  [pdf, other

    cs.IT cs.CV cs.LG stat.ML

    Self-Expressive Decompositions for Matrix Approximation and Clustering

    Authors: Eva L. Dyer, Tom A. Goldstein, Raajen Patel, Konrad P. Kording, Richard G. Baraniuk

    Abstract: Data-aware methods for dimensionality reduction and matrix decomposition aim to find low-dimensional structure in a collection of data. Classical approaches discover such structure by learning a basis that can efficiently express the collection. Recently, "self expression", the idea of using a small subset of data vectors to represent the full collection, has been developed as an alternative to le… ▽ More

    Submitted 4 May, 2015; originally announced May 2015.

    Comments: 11 pages, 7 figures

  22. arXiv:1503.08169  [pdf, other

    cs.DC cs.LG

    RankMap: A Platform-Aware Framework for Distributed Learning from Dense Datasets

    Authors: Azalia Mirhoseini, Eva L. Dyer, Ebrahim. M. Songhori, Richard G. Baraniuk, Farinaz Koushanfar

    Abstract: This paper introduces RankMap, a platform-aware end-to-end framework for efficient execution of a broad class of iterative learning algorithms for massive and dense datasets. Our framework exploits data structure to factorize it into an ensemble of lower rank subspaces. The factorization creates sparse low-dimensional representations of the data, a property which is leveraged to devise effective m… ▽ More

    Submitted 27 October, 2016; v1 submitted 27 March, 2015; originally announced March 2015.

    Comments: 13 pages, 10 figures

  23. arXiv:1303.4778  [pdf, other

    cs.LG math.NA stat.ML

    Greedy Feature Selection for Subspace Clustering

    Authors: Eva L. Dyer, Aswin C. Sankaranarayanan, Richard G. Baraniuk

    Abstract: Unions of subspaces provide a powerful generalization to linear subspace models for collections of high-dimensional data. To learn a union of subspaces from a collection of data, sets of signals in the collection that belong to the same subspace must be identified in order to obtain accurate estimates of the subspace structures present in the data. Recently, sparse recovery methods have been shown… ▽ More

    Submitted 3 July, 2013; v1 submitted 19 March, 2013; originally announced March 2013.

    Comments: 32 pages, 7 figures, 1 table

    Journal ref: Journal of Machine Learning Research, Vol.14, Issue 1, pp. 2487-2517, January 2013