Skip to main content

Showing 1–23 of 23 results for author: Bruss, B

.
  1. arXiv:2406.11463  [pdf, other

    cs.LG stat.ML

    Just How Flexible are Neural Networks in Practice?

    Authors: Ravid Shwartz-Ziv, Micah Goldblum, Arpit Bansal, C. Bayan Bruss, Yann LeCun, Andrew Gordon Wilson

    Abstract: It is widely believed that a neural network can fit a training set containing at least as many samples as it has parameters, underpinning notions of overparameterized and underparameterized models. In practice, however, we only find solutions accessible via our training procedure, including the optimizer and regularizers, limiting flexibility. Moreover, the exact parameterization of the function c… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2312.02517  [pdf, other

    cs.LG cs.AI

    Simplifying Neural Network Training Under Class Imbalance

    Authors: Ravid Shwartz-Ziv, Micah Goldblum, Yucen Lily Li, C. Bayan Bruss, Andrew Gordon Wilson

    Abstract: Real-world datasets are often highly class-imbalanced, which can adversely impact the performance of deep learning models. The majority of research on training neural networks under class imbalance has focused on specialized loss functions, sampling techniques, or two-stage training procedures. Notably, we demonstrate that simply tuning existing components of standard deep learning pipelines, such… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023. Code available at https://github.com/ravidziv/SimplifyingImbalancedTraining

  3. arXiv:2311.05877  [pdf, other

    cs.LG cs.AI

    A Performance-Driven Benchmark for Feature Selection in Tabular Deep Learning

    Authors: Valeriia Cherepanova, Roman Levin, Gowthami Somepalli, Jonas Gei**, C. Bayan Bruss, Andrew Gordon Wilson, Tom Goldstein, Micah Goldblum

    Abstract: Academic tabular benchmarks often contain small sets of curated features. In contrast, data scientists typically collect as many features as possible into their datasets, and even engineer new features from existing ones. To prevent overfitting in subsequent downstream modeling, practitioners commonly use automated feature selection methods that identify a reduced subset of informative features. E… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

    Journal ref: Conference on Neural Information Processing Systems 2023

  4. arXiv:2309.03999  [pdf, other

    cs.CV cs.LG

    Adapting Self-Supervised Representations to Multi-Domain Setups

    Authors: Neha Kalibhat, Sam Sharpe, Jeremy Goodsitt, Bayan Bruss, Soheil Feizi

    Abstract: Current state-of-the-art self-supervised approaches, are effective when trained on individual domains but show limited generalization on unseen domains. We observe that these models poorly generalize even when trained on a mixture of domains, making them unsuitable to be deployed under diverse real-world setups. We therefore propose a general-purpose, lightweight Domain Disentanglement Module (DDM… ▽ More

    Submitted 12 December, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: Published at BMVC 2023

  5. arXiv:2307.10504  [pdf, other

    cs.CV cs.LG

    Identifying Interpretable Subspaces in Image Representations

    Authors: Neha Kalibhat, Shweta Bhardwaj, Bayan Bruss, Hamed Firooz, Maziar Sanjabi, Soheil Feizi

    Abstract: We propose Automatic Feature Explanation using Contrasting Concepts (FALCON), an interpretability framework to explain features of image representations. For a target feature, FALCON captions its highly activating cropped images using a large captioning dataset (like LAION-400m) and a pre-trained vision-language model like CLIP. Each word among the captions is scored and ranked leading to a small… ▽ More

    Submitted 7 September, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: Published at ICML 2023 Code: https://github.com/NehaKalibhat/falcon-explain

  6. arXiv:2304.03368  [pdf, other

    cs.LG cs.HC

    From Explanation to Action: An End-to-End Human-in-the-loop Framework for Anomaly Reasoning and Management

    Authors: Xueying Ding, Nikita Seleznev, Senthil Kumar, C. Bayan Bruss, Leman Akoglu

    Abstract: Anomalies are often indicators of malfunction or inefficiency in various systems such as manufacturing, healthcare, finance, surveillance, to name a few. While the literature is abundant in effective detection algorithms due to this practical relevance, autonomous anomaly detection is rarely used in real-world scenarios. Especially in high-stakes applications, a human-in-the-loop is often involved… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

  7. arXiv:2207.05566  [pdf, other

    cs.LG cs.AI

    BASED-XAI: Breaking Ablation Studies Down for Explainable Artificial Intelligence

    Authors: Isha Hameed, Samuel Sharpe, Daniel Barcklow, Justin Au-Yeung, Sahil Verma, Jocelyn Huang, Brian Barr, C. Bayan Bruss

    Abstract: Explainable artificial intelligence (XAI) methods lack ground truth. In its place, method developers have relied on axioms to determine desirable properties for their explanations' behavior. For high stakes uses of machine learning that require explainability, it is not sufficient to rely on axioms as the implementation, or its usage, can fail to live up to the ideal. As a result, there exists act… ▽ More

    Submitted 1 September, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: 6 pages, accepted by the KDD 2022 Workshop on Machine Learning for Finance (KDD MLF 2022)

  8. arXiv:2206.15306  [pdf, other

    cs.LG stat.ML

    Transfer Learning with Deep Tabular Models

    Authors: Roman Levin, Valeriia Cherepanova, Avi Schwarzschild, Arpit Bansal, C. Bayan Bruss, Tom Goldstein, Andrew Gordon Wilson, Micah Goldblum

    Abstract: Recent work on deep learning for tabular data demonstrates the strong performance of deep tabular models, often bridging the gap between gradient boosted decision trees and neural networks. Accuracy aside, a major advantage of neural models is that they learn reusable features and are easily fine-tuned in new domains. This property is often exploited in computer vision and natural language applica… ▽ More

    Submitted 7 August, 2023; v1 submitted 30 June, 2022; originally announced June 2022.

    Journal ref: International Conference on Learning Representations (ICLR), 2023

  9. arXiv:2204.00650  [pdf

    cs.DS

    Double-Hashing Algorithm for Frequency Estimation in Data Streams

    Authors: Nikita Seleznev, Senthil Kumar, C. Bayan Bruss

    Abstract: Frequency estimation of elements is an important task for summarizing data streams and machine learning applications. The problem is often addressed by using streaming algorithms with sublinear space data structures. These algorithms allow processing of large data while using limited data storage. Commonly used streaming algorithms, such as count-min sketch, have many advantages, but do not take i… ▽ More

    Submitted 1 April, 2022; originally announced April 2022.

    Comments: 9 pages, 10 figures

    ACM Class: E.2; E.4; F.2.2

  10. arXiv:2112.00890  [pdf, other

    cs.LG

    Counterfactual Explanations via Latent Space Projection and Interpolation

    Authors: Brian Barr, Matthew R. Harrington, Samuel Sharpe, C. Bayan Bruss

    Abstract: Counterfactual explanations represent the minimal change to a data sample that alters its predicted classification, typically from an unfavorable initial class to a desired target class. Counterfactuals help answer questions such as "what needs to change for this application to get accepted for a loan?". A number of recently proposed approaches to counterfactual generation give varying definitions… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

    Comments: 10 pages, 6 figures

  11. arXiv:2107.10597  [pdf

    eess.SY

    Application-driven Test and Evaluation Framework for Indoor Localization Systems in Warehouses

    Authors: Jakob Schyga, Johannes Hinckeldeyn, Benjamin Bruss, Christoph Bamberger, Jochen Kreutzfeldt

    Abstract: Despite their potential of increasing operational efficiency, transparency, and safety, the use of Localization and Tracking Systems (LTSs) in warehouse environments remains seldom. One reason is the lack of market transparency and stakeholder's trust in the systems' performance as a consequence of poor use of Test and Evaluation (T&E) methods and transferability of the obtained T&E results. The T… ▽ More

    Submitted 22 July, 2021; originally announced July 2021.

    Comments: Work-in-Progress Paper for the Eleventh International Conference on Indoor Positioning and Indoor Navigation (IPIN) 2021

  12. arXiv:2106.09643  [pdf, other

    cs.AI

    MetaBalance: High-Performance Neural Networks for Class-Imbalanced Data

    Authors: Arpit Bansal, Micah Goldblum, Valeriia Cherepanova, Avi Schwarzschild, C. Bayan Bruss, Tom Goldstein

    Abstract: Class-imbalanced data, in which some classes contain far more samples than others, is ubiquitous in real-world applications. Standard techniques for handling class-imbalance usually work by training on a re-weighted loss or on re-balanced data. Unfortunately, training overparameterized neural networks on such objectives causes rapid memorization of minority class data. To avoid this trap, we harne… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

  13. arXiv:2106.01342  [pdf, other

    cs.LG cs.AI stat.ML

    SAINT: Improved Neural Networks for Tabular Data via Row Attention and Contrastive Pre-Training

    Authors: Gowthami Somepalli, Micah Goldblum, Avi Schwarzschild, C. Bayan Bruss, Tom Goldstein

    Abstract: Tabular data underpins numerous high-impact applications of machine learning from fraud detection to genomics and healthcare. Classical approaches to solving tabular problems, such as gradient boosting and random forests, are widely used by practitioners. However, recent deep learning methods have achieved a degree of performance competitive with popular techniques. We devise a hybrid deep learnin… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

  14. arXiv:2012.09301  [pdf, other

    cs.LG

    Latent-CF: A Simple Baseline for Reverse Counterfactual Explanations

    Authors: Rachana Balasubramanian, Samuel Sharpe, Brian Barr, Jason Wittenbach, C. Bayan Bruss

    Abstract: In the environment of fair lending laws and the General Data Protection Regulation (GDPR), the ability to explain a model's prediction is of paramount importance. High quality explanations are the first step in assessing fairness. Counterfactuals are valuable tools for explainability. They provide actionable, comprehensible explanations for the individual who is subject to decisions made from the… ▽ More

    Submitted 22 June, 2021; v1 submitted 16 December, 2020; originally announced December 2020.

  15. arXiv:2010.01693  [pdf, other

    cs.CL cs.AI cs.HC cs.LG cs.NE

    DLGNet-Task: An End-to-end Neural Network Framework for Modeling Multi-turn Multi-domain Task-Oriented Dialogue

    Authors: Oluwatobi O. Olabiyi, Prarthana Bhattarai, C. Bayan Bruss, Zachary Kulis

    Abstract: Task oriented dialogue (TOD) requires the complex interleaving of a number of individually controllable components with strong guarantees for explainability and verifiability. This has made it difficult to adopt the multi-turn multi-domain dialogue generation capabilities of streamlined end-to-end open-domain dialogue systems. In this paper, we present a new framework, DLGNet-Task, a unified task-… ▽ More

    Submitted 6 October, 2020; v1 submitted 4 October, 2020; originally announced October 2020.

  16. arXiv:2009.05636  [pdf, other

    q-fin.ST cs.LG

    Machine Learning for Temporal Data in Finance: Challenges and Opportunities

    Authors: Jason Wittenbach, Brian d'Alessandro, C. Bayan Bruss

    Abstract: Temporal data are ubiquitous in the financial services (FS) industry -- traditional data like economic indicators, operational data such as bank account transactions, and modern data sources like website clickstreams -- all of these occur as a time-indexed sequence. But machine learning efforts in FS often fail to account for the temporal richness of these data, even in cases where domain knowledg… ▽ More

    Submitted 11 September, 2020; originally announced September 2020.

    Comments: KDD '20 ML in Finance Workshop

  17. arXiv:2007.10532  [pdf, other

    cs.LG stat.ML

    Towards Ground Truth Explainability on Tabular Data

    Authors: Brian Barr, Ke Xu, Claudio Silva, Enrico Bertini, Robert Reilly, C. Bayan Bruss, Jason D. Wittenbach

    Abstract: In data science, there is a long history of using synthetic data for method development, feature selection and feature engineering. Our current interest in synthetic data comes from recent work in explainability. Today's datasets are typically larger and more complex - requiring less interpretable models. In the setting of \textit{post hoc} explainability, there is no ground truth for explanations… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

    Comments: 6 pages, 5 figures

  18. arXiv:2007.00591  [pdf, other

    cs.LG cs.SI stat.ML

    Navigating the Dynamics of Financial Embeddings over Time

    Authors: Antonia Gogoglou, Brian Nguyen, Alan Salimov, Jonathan Rider, C. Bayan Bruss

    Abstract: Financial transactions constitute connections between entities and through these connections a large scale heterogeneous weighted graph is formulated. In this labyrinth of interactions that are continuously updated, there exists a variety of similarity-based patterns that can provide insights into the dynamics of the financial system. With the current work, we propose the application of Graph Repr… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

  19. arXiv:2006.10252  [pdf, other

    cs.LG stat.ML

    Quantifying Challenges in the Application of Graph Representation Learning

    Authors: Antonia Gogoglou, C. Bayan Bruss, Brian Nguyen, Reza Sarshogh, Keegan E. Hines

    Abstract: Graph Representation Learning (GRL) has experienced significant progress as a means to extract structural information in a meaningful way for subsequent learning tasks. Current approaches including shallow embeddings and Graph Neural Networks have mostly been tested with node classification and link prediction tasks. In this work, we provide an application oriented perspective to a set of popular… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

  20. arXiv:1910.03081  [pdf, ps, other

    cs.LG stat.ML

    On the Interpretability and Evaluation of Graph Representation Learning

    Authors: Antonia Gogoglou, C. Bayan Bruss, Keegan E. Hines

    Abstract: With the rising interest in graph representation learning, a variety of approaches have been proposed to effectively capture a graph's properties. While these approaches have improved performance in graph machine learning tasks compared to traditional graph techniques, they are still perceived as techniques with limited insight into the information encoded in these representations. In this work, w… ▽ More

    Submitted 7 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019 Graph Representation Learning workshop

  21. Towards Automated Machine Learning: Evaluation and Comparison of AutoML Approaches and Tools

    Authors: Anh Truong, Austin Walters, Jeremy Goodsitt, Keegan Hines, C. Bayan Bruss, Reza Farivar

    Abstract: There has been considerable growth and interest in industrial applications of machine learning (ML) in recent years. ML engineers, as a consequence, are in high demand across the industry, yet improving the efficiency of ML engineers remains a fundamental challenge. Automated machine learning (AutoML) has emerged as a way to save time and effort on repetitive tasks in ML pipelines, such as data pr… ▽ More

    Submitted 3 September, 2019; v1 submitted 15 August, 2019; originally announced August 2019.

  22. arXiv:1907.07225  [pdf, other

    cs.LG cs.SI stat.ML

    DeepTrax: Embedding Graphs of Financial Transactions

    Authors: C. Bayan Bruss, Anish Khazane, Jonathan Rider, Richard Serpe, Antonia Gogoglou, Keegan E. Hines

    Abstract: Financial transactions can be considered edges in a heterogeneous graph between entities sending money and entities receiving money. For financial institutions, such a graph is likely large (with millions or billions of edges) while also sparsely connected. It becomes challenging to apply machine learning to such large and sparse graphs. Graph representation learning seeks to embed the nodes of a… ▽ More

    Submitted 16 July, 2019; originally announced July 2019.

  23. arXiv:1907.01705  [pdf, other

    cs.LG stat.ML

    Graph Embeddings at Scale

    Authors: C. Bayan Bruss, Anish Khazane, Jonathan Rider, Richard Serpe, Saurabh Nagrecha, Keegan E. Hines

    Abstract: Graph embedding is a popular algorithmic approach for creating vector representations for individual vertices in networks. Training these algorithms at scale is important for creating embeddings that can be used for classification, ranking, recommendation and other common applications in industry. While industrial systems exist for training graph embeddings on large datasets, many of these distrib… ▽ More

    Submitted 2 July, 2019; originally announced July 2019.

    Comments: Workshop on Mining and Learning with Graphs 2019