Skip to main content

Showing 1–7 of 7 results for author: McCarter, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.05593  [pdf, other

    cs.LG stat.ML

    Unmasking Trees for Tabular Data

    Authors: Calvin McCarter

    Abstract: We herein describe UnmaskingTrees, a method and open-source software package for tabular data generation and, especially, imputation. Our experiments suggest that training gradient-boosted trees to incrementally unmask features offers a simple, strong baseline for imputation.

    Submitted 8 July, 2024; originally announced July 2024.

  2. arXiv:2404.04221  [pdf, other

    cs.CL

    How Lexical is Bilingual Lexicon Induction?

    Authors: Harsh Kohli, Helian Feng, Nicholas Dronen, Calvin McCarter, Sina Moeini, Ali Kebarighotbi

    Abstract: In contemporary machine learning approaches to bilingual lexicon induction (BLI), a model learns a map** between the embedding spaces of a language pair. Recently, retrieve-and-rank approach to BLI has achieved state of the art results on the task. However, the problem remains challenging in low-resource settings, due to the paucity of data. The task is complicated by factors such as lexical var… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: 8 pages, 4 figures. Paper accepted at NAACL Findings 2024

  3. arXiv:2310.18805  [pdf, other

    cs.LG

    Inverse distance weighting attention

    Authors: Calvin McCarter

    Abstract: We report the effects of replacing the scaled dot-product (within softmax) attention with the negative-log of Euclidean distance. This form of attention simplifies to inverse distance weighting interpolation. Used in simple one hidden layer networks and trained with vanilla cross-entropy loss on classification problems, it tends to produce a key matrix containing prototypes and a value matrix with… ▽ More

    Submitted 6 December, 2023; v1 submitted 28 October, 2023; originally announced October 2023.

    Comments: Associative Memory & Hopfield Networks Workshop at NeurIPS 2023

  4. arXiv:2309.10194  [pdf, other

    stat.ML cs.LG

    The Kernel Density Integral Transformation

    Authors: Calvin McCarter

    Abstract: Feature preprocessing continues to play a critical role when applying machine learning and statistical methods to tabular data. In this paper, we propose the use of the kernel density integral transformation as a feature preprocessing step. Our approach subsumes the two leading feature preprocessing methods as limiting cases: linear min-max scaling and quantile transformation. We demonstrate that,… ▽ More

    Submitted 19 October, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: Published in Transactions on Machine Learning Research (10/2023)

  5. arXiv:2207.05808  [pdf, other

    cs.LG cs.CV

    Look-ups are not (yet) all you need for deep learning inference

    Authors: Calvin McCarter, Nicholas Dronen

    Abstract: Fast approximations to matrix multiplication have the potential to dramatically reduce the cost of neural network inference. Recent work on approximate matrix multiplication proposed to replace costly multiplications with table-lookups by fitting a fast hash function from training data. In this work, we propose improvements to this previous work, targeted to the deep learning inference setting, wh… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

  6. arXiv:2205.06287  [pdf, other

    cs.LG cs.AR

    Adaptive Block Floating-Point for Analog Deep Learning Hardware

    Authors: Ayon Basumallik, Darius Bunandar, Nicholas Dronen, Nicholas Harris, Ludmila Levkova, Calvin McCarter, Lakshmi Nair, David Walter, David Widemann

    Abstract: Analog mixed-signal (AMS) devices promise faster, more energy-efficient deep neural network (DNN) inference than their digital counterparts. However, recent studies show that DNNs on AMS devices with fixed-point numbers can incur an accuracy penalty because of precision loss. To mitigate this penalty, we present a novel AMS-compatible adaptive block floating-point (ABFP) number representation. We… ▽ More

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: 13 pages including Appendix, 7 figures, under submission at IEEE Transactions on Neural Networks and Learning Systems (TNNLS)

  7. arXiv:2203.12720  [pdf, other

    stat.ML cs.LG

    Towards Backwards-Compatible Data with Confounded Domain Adaptation

    Authors: Calvin McCarter

    Abstract: Most current domain adaptation methods address either covariate shift or label shift, but are not applicable where they occur simultaneously and are confounded with each other. Domain adaptation approaches which do account for such confounding are designed to adapt covariates to optimally predict a particular label whose shift is confounded with covariate shift. In this paper, we instead seek to a… ▽ More

    Submitted 6 April, 2022; v1 submitted 23 March, 2022; originally announced March 2022.