Skip to main content

Showing 1–25 of 25 results for author: Roberts, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.00894  [pdf, other

    cs.LG cs.AI cs.CL

    Pretrained Hybrids with MAD Skills

    Authors: Nicholas Roberts, Samuel Guo, Zhiqi Gao, Satya Sai Srinath Namburi GNVV, Sonia Cromp, Chengjun Wu, Chengyu Duan, Frederic Sala

    Abstract: While Transformers underpin modern large language models (LMs), there is a growing list of alternative architectures with new capabilities, promises, and tradeoffs. This makes choosing the right LM architecture challenging. Recently-proposed $\textit{hybrid architectures}$ seek a best-of-all-worlds approach that reaps the benefits of all architectures. Hybrid design is difficult for two reasons: i… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  2. arXiv:2406.00166  [pdf, other

    nlin.CG cs.ET

    On complexity of colloid cellular automata

    Authors: Andrew Adamatzky, Nic Roberts, Raphael Fortulan, Noushin Raeisi Kheirabadi, Panagiotis Mougkogiannis, Michail-Antisthenis Tsompanas, Genaro J. Martinez, Georgios Ch. Sirakoulis, Alessandro Chiolerio

    Abstract: The colloid cellular automata do not imitate the physical structure of colloids but are governed by logical functions derived from the colloids. We analyse the space-time complexity of Boolean circuits derived from the electrical responses of colloids: ZnO (zinc oxide, an inorganic compound also known as calamine or zinc white, which naturally occurs as the mineral zincite), proteinoids (microsphe… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

  3. arXiv:2307.14430  [pdf, other

    cs.CL cs.LG

    Skill-it! A Data-Driven Skills Framework for Understanding and Training Language Models

    Authors: Mayee F. Chen, Nicholas Roberts, Kush Bhatia, Jue Wang, Ce Zhang, Frederic Sala, Christopher Ré

    Abstract: The quality of training data impacts the performance of pre-trained large language models (LMs). Given a fixed budget of tokens, we study how to best select data that leads to good downstream model performance across tasks. We develop a new framework based on a simple hypothesis: just as humans acquire interdependent skills in a deliberate order, language models also follow a natural order when le… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

  4. arXiv:2307.12226  [pdf, other

    cs.LG cs.AI stat.ML

    Geometry-Aware Adaptation for Pretrained Models

    Authors: Nicholas Roberts, Xintong Li, Dyah Adila, Sonia Cromp, Tzu-Heng Huang, Jitian Zhao, Frederic Sala

    Abstract: Machine learning models -- including prominent zero-shot models -- are often trained on datasets whose labels are only a small proportion of a larger label space. Such spaces are commonly equipped with a metric that relates the labels via distances between them. We propose a simple approach to exploit this information to adapt the trained model to reliably predict new classes -- or, in the case of… ▽ More

    Submitted 27 November, 2023; v1 submitted 23 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023

  5. arXiv:2307.02664  [pdf, other

    cs.ET cond-mat.soft

    Logical circuits in colloids

    Authors: Nic Roberts, Noushin Raeisi Kheirabadi, Michail-Antisthenis Tsompanas, Alessandro Chiolerio, Marco Crepaldi, Andrew Adamatzky

    Abstract: Colloid-based computing devices offer remarkable fault tolerance and adaptability to varying environmental conditions due to their amorphous structure. An intriguing observation is that a colloidal suspension of ZnO nanoparticles in DMSO exhibits reconfiguration when exposed to electrical stimulation and produces spikes of electrical potential in response. This study presents a novel laboratory pr… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

  6. arXiv:2304.10675  [pdf, other

    cs.ET

    Propagation of electrical signals by fungi

    Authors: Richard Mayne, Nic Roberts, Neil Phillips, Roshan Weerasekera, Andrew Adamatzky

    Abstract: Living fungal mycelium networks are proven to have properties of memristors, capacitors and various sensors. To further progress our designs in fungal electronics we need to evaluate how electrical signals can be propagated through mycelium networks. We investigate the ability of mycelium-bound composites to convey electrical signals, thereby enabling the transmission of frequency-modulated inform… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

  7. arXiv:2211.13375  [pdf, other

    cs.LG cs.AI stat.ML

    Lifting Weak Supervision To Structured Prediction

    Authors: Harit Vishwakarma, Nicholas Roberts, Frederic Sala

    Abstract: Weak supervision (WS) is a rich set of techniques that produce pseudolabels by aggregating easily obtained but potentially noisy label estimates from a variety of sources. WS is theoretically well understood for binary classification, where simple approaches enable consistent estimation of pseudolabel noise rates. Using this result, it has been shown that downstream models trained on the pseudolab… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

  8. arXiv:2210.03324  [pdf, other

    cs.LG cs.AI stat.ML

    AutoML for Climate Change: A Call to Action

    Authors: Renbo Tu, Nicholas Roberts, Vishak Prasad, Sibasis Nayak, Paarth Jain, Frederic Sala, Ganesh Ramakrishnan, Ameet Talwalkar, Willie Neiswanger, Colin White

    Abstract: The challenge that climate change poses to humanity has spurred a rapidly develo** field of artificial intelligence research focused on climate change applications. The climate change AI (CCAI) community works on a diverse, challenging set of problems which often involve physics-constrained ML or heterogeneous spatiotemporal data. It would be desirable to use automated machine learning (AutoML)… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

  9. arXiv:2208.14362  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    AutoWS-Bench-101: Benchmarking Automated Weak Supervision with 100 Labels

    Authors: Nicholas Roberts, Xintong Li, Tzu-Heng Huang, Dyah Adila, Spencer Schoenberg, Cheng-Yu Liu, Lauren Pick, Haotian Ma, Aws Albarghouthi, Frederic Sala

    Abstract: Weak supervision (WS) is a powerful method to build labeled datasets for training supervised models in the face of little-to-no labeled data. It replaces hand-labeling data with aggregating multiple noisy-but-cheap label estimates expressed by labeling functions (LFs). While it has been used successfully in many domains, weak supervision's application scope is limited by the difficulty of construc… ▽ More

    Submitted 24 November, 2023; v1 submitted 30 August, 2022; originally announced August 2022.

    Comments: NeurIPS 2022 Datasets and Benchmarks Track

  10. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  11. arXiv:2203.12023  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Generative Modeling Helps Weak Supervision (and Vice Versa)

    Authors: Benedikt Boecking, Nicholas Roberts, Willie Neiswanger, Stefano Ermon, Frederic Sala, Artur Dubrawski

    Abstract: Many promising applications of supervised machine learning face hurdles in the acquisition of labeled data in sufficient quantity and quality, creating an expensive bottleneck. To overcome such limitations, techniques that do not depend on ground truth labels have been studied, including weak supervision and generative modeling. While these techniques would seem to be usable in concert, improving… ▽ More

    Submitted 11 March, 2023; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: Published as a conference paper at ICLR 2023

    ACM Class: I.2.0; I.4.m

  12. arXiv:2112.07236  [pdf, other

    cs.ET

    Logics in fungal mycelium networks

    Authors: Andrew Adamatzky, Phil Ayres, Alexander E. Beasley, Nic Roberts, Martin Tegelaar, Michail-Antisthenis Tsompanas, Han A. B. Wösten

    Abstract: The living mycelium networks are capable of efficient sensorial fusion over very large areas and distributed decision making. The information processing in the mycelium networks is implemented via propagation of electrical and chemical signals en pair with morphological changes in the mycelium structure. These information processing mechanisms are manifested in experimental laboratory findings tha… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: To be published in special issue of Logica Universalis --- "Logic, Spatial Algorithms and Visual Reasoning", edited by Andrew Schumann and Jerzy Król, 2022

  13. arXiv:2112.03865  [pdf, other

    cs.LG cs.AI

    Universalizing Weak Supervision

    Authors: Changho Shin, Winfred Li, Harit Vishwakarma, Nicholas Roberts, Frederic Sala

    Abstract: Weak supervision (WS) frameworks are a popular way to bypass hand-labeling large datasets for training data-hungry models. These approaches synthesize multiple noisy but cheaply-acquired estimates of labels into a set of high-quality pseudolabels for downstream training. However, the synthesis technique is specific to a particular kind of label, such as binary labels or sequences, and each new lab… ▽ More

    Submitted 29 November, 2023; v1 submitted 7 December, 2021; originally announced December 2021.

    Comments: ICLR 2022

  14. arXiv:2112.02721  [pdf, other

    cs.CL cs.AI cs.LG

    NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

    Authors: Kaustubh D. Dhole, Varun Gangal, Sebastian Gehrmann, Aadesh Gupta, Zhenhao Li, Saad Mahamood, Abinaya Mahendiran, Simon Mille, Ashish Shrivastava, Samson Tan, Tongshuang Wu, Jascha Sohl-Dickstein, **ho D. Choi, Eduard Hovy, Ondrej Dusek, Sebastian Ruder, Sajant Anand, Nagender Aneja, Rabin Banjade, Lisa Barthe, Hanna Behnke, Ian Berlot-Attwell, Connor Boyle, Caroline Brun, Marco Antonio Sobrevilla Cabezudo , et al. (101 additional authors not shown)

    Abstract: Data augmentation is an important component in the robustness evaluation of models in natural language processing (NLP) and in enhancing the diversity of the data they are trained on. In this paper, we present NL-Augmenter, a new participatory Python-based natural language augmentation framework which supports the creation of both transformations (modifications to the data) and filters (data split… ▽ More

    Submitted 11 October, 2022; v1 submitted 5 December, 2021; originally announced December 2021.

    Comments: 39 pages, repository at https://github.com/GEM-benchmark/NL-Augmenter

  15. arXiv:2111.11231  [pdf, other

    cs.ET

    Fungal electronics

    Authors: Andrew Adamatzky, Phil Ayres, Alexander E. Beasley, Alessandro Chiolerio, Mohammad M. Dehshibi, Antoni Gandia, Elena Albergati, Richard Mayne, Anna Nikolaidou, Nic Roberts, Martin Tegelaar, Michail-Antisthenis Tsompanas, Neil Phillips, Han A. B. Wösten

    Abstract: Fungal electronics is a family of living electronic devices made of mycelium bound composites or pure mycelium. Fungal electronic devices are capable of changing their impedance and generating spikes of electrical potential in response to external control parameters. Fungal electronics can be embedded into fungal materials and wearables or used as stand alone sensing and computing devices.

    Submitted 22 November, 2021; originally announced November 2021.

  16. arXiv:2110.05668  [pdf, other

    cs.CV cs.LG

    NAS-Bench-360: Benchmarking Neural Architecture Search on Diverse Tasks

    Authors: Renbo Tu, Nicholas Roberts, Mikhail Khodak, Junhong Shen, Frederic Sala, Ameet Talwalkar

    Abstract: Most existing neural architecture search (NAS) benchmarks and algorithms prioritize well-studied tasks, e.g. image classification on CIFAR or ImageNet. This makes the performance of NAS approaches in more diverse areas poorly understood. In this paper, we present NAS-Bench-360, a benchmark suite to evaluate methods on domains beyond those traditionally studied in architecture search, and use it to… ▽ More

    Submitted 19 January, 2023; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2022 Datasets and Benchmarks Track

  17. arXiv:2108.05336  [pdf, other

    cs.ET

    Mining logical circuits in fungi

    Authors: Nic Roberts, Andrew Adamatzky

    Abstract: Living substrates are capable for nontrivial map**s of electrical signals due to the substrate nonlinear electrical characteristics. This property can be used to realise Boolean functions. Input logical values are represented by amplitude or frequency of electrical stimuli. Output logical values are decoded from electrical responses of living substrates. We demonstrate how logical circuits can b… ▽ More

    Submitted 11 August, 2021; originally announced August 2021.

  18. arXiv:2103.15798  [pdf, other

    cs.LG cs.AI cs.CV math.NA stat.ML

    Rethinking Neural Operations for Diverse Tasks

    Authors: Nicholas Roberts, Mikhail Khodak, Tri Dao, Liam Li, Christopher Ré, Ameet Talwalkar

    Abstract: An important goal of AutoML is to automate-away the design of neural networks on new tasks in under-explored domains. Motivated by this goal, we study the problem of enabling users to discover the right neural operations given data from their specific domain. We introduce a search space of operations called XD-Operations that mimic the inductive bias of standard multi-channel convolutions while be… ▽ More

    Submitted 4 November, 2021; v1 submitted 29 March, 2021; originally announced March 2021.

    Comments: NeurIPS 2021

  19. arXiv:2011.13477  [pdf, other

    cs.CL cs.LG

    Decoding and Diversity in Machine Translation

    Authors: Nicholas Roberts, Davis Liang, Graham Neubig, Zachary C. Lipton

    Abstract: Neural Machine Translation (NMT) systems are typically evaluated using automated metrics that assess the agreement between generated translations and ground truth candidates. To improve systems with respect to these metrics, NLP researchers employ a variety of heuristic techniques, including searching for the conditional mode (vs. sampling) and incorporating various training heuristics (e.g., labe… ▽ More

    Submitted 26 November, 2020; originally announced November 2020.

    Comments: Presented at the Resistance AI Workshop, 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada

  20. arXiv:1912.08987  [pdf, other

    cs.LG cs.CR stat.ML

    Model Weight Theft With Just Noise Inputs: The Curious Case of the Petulant Attacker

    Authors: Nicholas Roberts, Vinay Uday Prabhu, Matthew McAteer

    Abstract: This paper explores the scenarios under which an attacker can claim that 'Noise and access to the softmax layer of the model is all you need' to steal the weights of a convolutional neural network whose architecture is already known. We were able to achieve 96% test accuracy using the stolen MNIST model and 82% accuracy using the stolen KMNIST model learned using only i.i.d. Bernoulli noise inputs… ▽ More

    Submitted 18 December, 2019; originally announced December 2019.

    Comments: Presented at the Security and Privacy of Machine Learning Workshop, 36th International Conference on Machine Learning (ICML 2019), Long Beach, California, USA

  21. arXiv:1912.08986  [pdf, other

    cs.LG cs.NE stat.ML

    Deep Connectomics Networks: Neural Network Architectures Inspired by Neuronal Networks

    Authors: Nicholas Roberts, Dian Ang Yap, Vinay Uday Prabhu

    Abstract: The interplay between inter-neuronal network topology and cognition has been studied deeply by connectomics researchers and network scientists, which is crucial towards understanding the remarkable efficacy of biological neural networks. Curiously, the deep learning revolution that revived neural networks has not paid much attention to topological aspects. The architectures of deep neural networks… ▽ More

    Submitted 18 December, 2019; originally announced December 2019.

    Comments: Presented at the Real Neurons & Hidden Units Workshop, 33rd Conference on Neural Information ProcessingSystems (NeurIPS 2019), Vancouver, Canada

  22. arXiv:1911.07418  [pdf, other

    cs.LG cs.IT stat.ML

    Grassmannian Packings in Neural Networks: Learning with Maximal Subspace Packings for Diversity and Anti-Sparsity

    Authors: Dian Ang Yap, Nicholas Roberts, Vinay Uday Prabhu

    Abstract: Kernel sparsity ("dying ReLUs") and lack of diversity are commonly observed in CNN kernels, which decreases model capacity. Drawing inspiration from information theory and wireless communications, we demonstrate the intersection of coding theory and deep learning through the Grassmannian subspace packing problem in CNNs. We propose Grassmannian packings for initial kernel layers to be initialized… ▽ More

    Submitted 17 November, 2019; originally announced November 2019.

    Comments: Presented at Bayesian Deep Learning and Workshop on Information Theory and Machine Learning, 33rd Conference on Neural Information ProcessingSystems (NeurIPS 2019), Vancouver, Canada

  23. Denoising Imaging Polarimetry by an Adapted BM3D Method

    Authors: Alexander B. Tibbs, Ilse M. Daly, Nicholas W. Roberts, David R. Bull

    Abstract: Imaging polarimetry allows more information to be extracted from a scene than conventional intensity or colour imaging. However, a major challenge of imaging polarimetry is image degradation due to noise. This paper investigates the mitigation of noise through denoising algorithms and compares existing denoising algorithms with a new method, based on BM3D. This algorithm, PBM3D, gives visual quali… ▽ More

    Submitted 16 November, 2017; v1 submitted 13 November, 2017; originally announced November 2017.

  24. arXiv:1608.02567  [pdf, other

    math.NA cs.CE

    A Geometric Multigrid Preconditioning Strategy for DPG System Matrices

    Authors: Nathan V. Roberts, Jesse Chan

    Abstract: The discontinuous Petrov-Galerkin (DPG) methodology of Demkowicz and Gopalakrishnan [15,17] guarantees the optimality of the solution in an energy norm, and provides several features facilitating adaptive schemes. A key question that has not yet been answered in general - though there are some results for Poisson, e.g. - is how best to precondition the DPG system matrix, so that iterative solvers… ▽ More

    Submitted 8 August, 2016; originally announced August 2016.

  25. arXiv:1306.1740  [pdf

    cs.CR cs.NI cs.PF

    HTTPI Based Web Service Security over SOAP

    Authors: Pankaj Choudhary, Rajendra Aaseri, Nirmal Roberts

    Abstract: Now a days, a new family of web applications open applications, are emerging (e.g., Social Networking, News and Blogging). Generally, these open applications are non-confidential. The security needs of these applications are only client/server authentication and data integrity. For securing these open applications, effectively and efficiently, HTTPI, a new transport protocol is proposed, which ens… ▽ More

    Submitted 7 June, 2013; originally announced June 2013.

    Comments: International Journal of Network Security & Its Applications (IJNSA), Vol.5, No.3, May 2013

    Journal ref: Choudhary, P., Aaseri, R., Roberts, N., (2013) "HTTPI Based Web Service Security over SOAP", IJNSA, Vol.5, No.3, on pp. 55-66