Skip to main content

Showing 1–7 of 7 results for author: Halloran, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.00209  [pdf, other

    cs.LG cs.AI

    Mamba State-Space Models Can Be Strong Downstream Learners

    Authors: John T. Halloran, Manbir Gulati, Paul F. Roysdon

    Abstract: Mamba state-space models (SSMs) have recently outperformed state-of-the-art (SOTA) Transformer large language models (LLMs) in various tasks and been widely adapted. However, Mamba's downstream learning capabilities remain either unexplored$\unicode{x2013}$e.g., mixed-precision (MPFT) and parameter-efficient fine-tuning (PEFT)--or under-evaluated$\unicode{x2013}$e.g., in-context learning (ICL). Fo… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: 16 pages, 4 figures, 3 tables

  2. arXiv:2201.01240  [pdf, other

    cs.CY

    Feedback and Engagement on an Introductory Programming Module

    Authors: Beate Grawemeyer, John Halloran, Matthew England, David Croft

    Abstract: We ran a study on engagement and achievement for a first year undergraduate programming module which used an online learning environment containing tasks which generate automated feedback. Students could also access human feedback from traditional labs. We gathered quantitative data on engagement and achievement which allowed us to split the cohort into 6 groups. We then ran interviews with studen… ▽ More

    Submitted 4 January, 2022; originally announced January 2022.

    Comments: To appear in Proc. CEP 2022

    ACM Class: K.3.2

  3. arXiv:2008.03433  [pdf, other

    cs.LG cs.DC q-bio.QM stat.ML

    GPU-Accelerated Primal Learning for Extremely Fast Large-Scale Classification

    Authors: John T. Halloran, David M. Rocke

    Abstract: One of the most efficient methods to solve L2-regularized primal problems, such as logistic regression and linear support vector machine (SVM) classification, is the widely used trust region Newton algorithm, TRON. While TRON has recently been shown to enjoy substantial speedups on shared-memory multi-core systems, exploiting graphical processing units (GPUs) to speed up the method is significantl… ▽ More

    Submitted 14 October, 2020; v1 submitted 7 August, 2020; originally announced August 2020.

    Comments: 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada

  4. arXiv:1909.02136  [pdf, other

    q-bio.QM cs.LG stat.ML

    Learning Concave Conditional Likelihood Models for Improved Analysis of Tandem Mass Spectra

    Authors: John T. Halloran, David M. Rocke

    Abstract: The most widely used technology to identify the proteins present in a complex biological sample is tandem mass spectrometry, which quickly produces a large collection of spectra representative of the peptides (i.e., protein subsequences) present in the original sample. In this work, we greatly expand the parameter learning capabilities of a dynamic Bayesian network (DBN) peptide-scoring algorithm,… ▽ More

    Submitted 4 September, 2019; originally announced September 2019.

    Comments: 16 pages. A partitioned version of this appeared in NeurIPS 2018

  5. arXiv:1909.02093  [pdf, other

    q-bio.QM cs.LG stat.ML

    Gradients of Generative Models for Improved Discriminative Analysis of Tandem Mass Spectra

    Authors: John T. Halloran, David M. Rocke

    Abstract: Tandem mass spectrometry (MS/MS) is a high-throughput technology used toidentify the proteins in a complex biological sample, such as a drop of blood. A collection of spectra is generated at the output of the process, each spectrum of which is representative of a peptide (protein subsequence) present in the original complex sample. In this work, we leverage the log-likelihood gradients of generati… ▽ More

    Submitted 4 September, 2019; originally announced September 2019.

    Comments: 13 pages. A partitioned version of this appeared in NIPS 2017

  6. arXiv:1807.06574  [pdf, ps, other

    cs.LG math.OC stat.ML

    Jensen: An Easily-Extensible C++ Toolkit for Production-Level Machine Learning and Convex Optimization

    Authors: Rishabh Iyer, John T. Halloran, Kai Wei

    Abstract: This paper introduces Jensen, an easily extensible and scalable toolkit for production-level machine learning and convex optimization. Jensen implements a framework of convex (or loss) functions, convex optimization algorithms (including Gradient Descent, L-BFGS, Stochastic Gradient Descent, Conjugate Gradient, etc.), and a family of machine learning classifiers and regressors (Logistic Regression… ▽ More

    Submitted 17 July, 2018; originally announced July 2018.

  7. arXiv:1210.4904  [pdf

    cs.CE q-bio.QM

    Spectrum Identification using a Dynamic Bayesian Network Model of Tandem Mass Spectra

    Authors: Ajit P. Singh, John Halloran, Jeff A. Bilmes, Katrin Kirchoff, William S. Noble

    Abstract: Shotgun proteomics is a high-throughput technology used to identify unknown proteins in a complex mixture. At the heart of this process is a prediction task, the spectrum identification problem, in which each fragmentation spectrum produced by a shotgun proteomics experiment must be mapped to the peptide (protein subsequence) which generated the spectrum. We propose a new algorithm for spectrum id… ▽ More

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-775-785