
Showing 1–9 of 9 results for author: Willmott, D

  1. arXiv:2311.08479  [pdf, other]

    cs.LG cs.CV cs.DC

    Leveraging Foundation Models to Improve Lightweight Clients in Federated Learning

    Authors: Xidong Wu, Wan-Yi Lin, Devin Willmott, Filipe Condessa, Yufei Huang, Zhenzhen Li, Madan Ravi Ganesh

    Abstract: Federated Learning (FL) is a distributed training paradigm that enables clients scattered across the world to cooperatively learn a global model without divulging confidential data. However, FL faces a significant challenge in the form of heterogeneous data distributions among clients, which leads to a reduction in performance and robustness. A recent approach to mitigating the impact of heterogen…

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: 6 Pages + Appendices

  2. arXiv:2210.03651  [pdf, other]

    cs.CV cs.AI cs.LG

    Understanding the Covariance Structure of Convolutional Filters

    Authors: Asher Trockman, Devin Willmott, J. Zico Kolter

    Abstract: Neural network weights are typically initialized at random from univariate distributions, controlling just the variance of individual weights even in highly-structured operations like convolutions. Recent ViT-inspired convolutional networks such as ConvMixer and ConvNeXt use large-kernel depthwise convolutions whose learned filters have notable structure; this presents an opportunity to study thei…

    Submitted 7 October, 2022; originally announced October 2022.

  3. arXiv:2205.06133  [pdf, other]

    physics.comp-ph physics.chem-ph

    Orbital Mixer: Using Atomic Orbital Features for Basis Dependent Prediction of Molecular Wavefunctions

    Authors: Kirill Shmilovich, Devin Willmott, Ivan Batalov, Mordechai Kornbluth, Jonathan Mailoa, J. Zico Kolter

    Abstract: Leveraging ab initio data at scale has enabled the development of machine learning models capable of extremely accurate and fast molecular property prediction. A central paradigm of many previous works focuses on generating predictions for only a fixed set of properties. Recent lines of research instead aim to explicitly learn the electronic structure via molecular wavefunctions from which other q…

    Submitted 12 May, 2022; originally announced May 2022.

  4. arXiv:2102.00029  [pdf, other]

    cs.LG cs.CR

    You Only Query Once: Effective Black Box Adversarial Attacks with Minimal Repeated Queries

    Authors: Devin Willmott, Anit Kumar Sahu, Fatemeh Sheikholeslami, Filipe Condessa, Zico Kolter

    Abstract: Researchers have repeatedly shown that it is possible to craft adversarial attacks on deep classifiers (small perturbations that significantly change the class label), even in the "black-box" setting where one only has query access to the classifier. However, all prior work in the black-box setting attacks the classifier by repeatedly querying the same image with minor modifications, usually thous…

    Submitted 29 January, 2021; originally announced February 2021.

  5. arXiv:2008.05994  [pdf]

    physics.comp-ph cs.LG

    A community-powered search of machine learning strategy space to find NMR property prediction models

    Authors: Lars A. Bratholm, Will Gerrard, Brandon Anderson, Shaojie Bai, Sunghwan Choi, Lam Dang, Pavel Hanchar, Addison Howard, Guillaume Huard, Sanghoon Kim, Zico Kolter, Risi Kondor, Mordechai Kornbluth, Youhan Lee, Youngsoo Lee, Jonathan P. Mailoa, Thanh Tu Nguyen, Milos Popovic, Goran Rakocevic, Walter Reade, Wonho Song, Luka Stojanovic, Erik H. Thiede, Nebojsa Tijanic, Andres Torrubia, et al. (4 additional authors not shown)

    Abstract: The rise of machine learning (ML) has created an explosion in the potential strategies for using data to make scientific predictions. For physical scientists wishing to apply ML strategies to a particular domain, it can be difficult to assess in advance what strategy to adopt within a vast space of possibilities. Here we outline the results of an online community-powered effort to swarm search the…

    Submitted 13 August, 2020; originally announced August 2020.

  6. arXiv:2007.07210  [pdf, other]

    cs.LG stat.ML

    Simple and Efficient Hard Label Black-box Adversarial Attacks in Low Query Budget Regimes

    Authors: Satya Narayan Shukla, Anit Kumar Sahu, Devin Willmott, J. Zico Kolter

    Abstract: We focus on the problem of black-box adversarial attacks, where the aim is to generate adversarial examples for deep learning models solely based on information limited to output label (hard label) to a queried data input. We propose a simple and efficient Bayesian Optimization (BO) based approach for developing black-box adversarial attacks. Issues with BO's performance in high dimensions are avo…

    Submitted 11 June, 2021; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: Accepted at KDD 2021. arXiv admin note: substantial text overlap with arXiv:1909.13857

  7. arXiv:1909.13857  [pdf, other]

    cs.LG stat.ML

    Black-box Adversarial Attacks with Bayesian Optimization

    Authors: Satya Narayan Shukla, Anit Kumar Sahu, Devin Willmott, J. Zico Kolter

    Abstract: We focus on the problem of black-box adversarial attacks, where the aim is to generate adversarial examples using information limited to loss function evaluations of input-output pairs. We use Bayesian optimization (BO) to specifically cater to scenarios involving low query budgets to develop query efficient adversarial attacks. We alleviate the issues surrounding BO in regards to optimizing high…

    Submitted 30 September, 2019; originally announced September 2019.

  8. Improving RNA secondary structure prediction via state inference with deep recurrent neural networks

    Authors: Devin Willmott, David Murrugarra, Qiang Ye

    Abstract: The problem of determining which nucleotides of an RNA sequence are paired or unpaired in the secondary structure of an RNA, which we call RNA state inference, can be studied by different machine learning techniques. Successful state inference of RNA sequences can be used to generate auxiliary information for data-directed RNA secondary structure prediction. Bidirectional long short-term memory (L…

    Submitted 23 February, 2020; v1 submitted 25 June, 2019; originally announced June 2019.

    Comments: 15 pages, 3 figures, and 5 tables

    MSC Class: 92

    Journal ref: Computational and Mathematical Biophysics, 8(1), 36-50, 2020

  9. arXiv:1707.09520  [pdf, other]

    stat.ML cs.LG

    Orthogonal Recurrent Neural Networks with Scaled Cayley Transform

    Authors: Kyle Helfrich, Devin Willmott, Qiang Ye

    Abstract: Recurrent Neural Networks (RNNs) are designed to handle sequential data but suffer from vanishing or exploding gradients. Recent work on Unitary Recurrent Neural Networks (uRNNs) have been used to address this issue and in some cases, exceed the capabilities of Long Short-Term Memory networks (LSTMs). We propose a simpler and novel update scheme to maintain orthogonal recurrent weight matrices wit…

    Submitted 19 June, 2018; v1 submitted 29 July, 2017; originally announced July 2017.

    Comments: 12 pages