Skip to main content

Showing 1–50 of 129 results for author: Taylor, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13049  [pdf, other

    cs.CY cs.AI

    Assessing AI vs Human-Authored Spear Phishing SMS Attacks: An Empirical Study Using the TRAPD Method

    Authors: Jerson Francia, Derek Hansen, Ben Schooley, Matthew Taylor, Shydra Murray, Greg Snow

    Abstract: This paper explores the rising concern of utilizing Large Language Models (LLMs) in spear phishing message generation, and their performance compared to human-authored counterparts. Our pilot study compares the effectiveness of smishing (SMS phishing) messages created by GPT-4 and human authors, which have been personalized to willing targets. The targets assessed the messages in a modified ranked… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 18 pages, 5 figures, 1 table

  2. arXiv:2406.10535  [pdf

    cs.DL

    Evaluating Open Access Advantages for Citations and Altmetrics (2011-21): A Dynamic and Evolving Relationship

    Authors: Michael Taylor

    Abstract: Differences between the impacts of Open Access (OA) and non-OA research have been observed over a wide range of citation and altmetric indicators, usually finding an Open Access Advantage (OAA) within specific fields. However, science-wide analyses covering multiple years, indicators and disciplines are lacking. Using citation counts and six altmetrics for 38.7M articles published 2011-21, we comp… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  3. arXiv:2406.06495  [pdf, other

    cs.LG

    Boosting Robustness in Preference-Based Reinforcement Learning with Dynamic Sparsity

    Authors: Calarina Muslimani, Bram Grooten, Deepak Ranganatha Sastry Mamillapalli, Mykola Pechenizkiy, Decebal Constantin Mocanu, Matthew E. Taylor

    Abstract: For autonomous agents to successfully integrate into human-centered environments, agents should be able to learn from and adapt to humans in their native settings. Preference-based reinforcement learning (PbRL) is a promising approach that learns reward functions from human preferences. This enables RL agents to adapt their behavior based on human desires. However, humans live in a world full of d… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  4. arXiv:2405.19296  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Neural Isometries: Taming Transformations for Equivariant ML

    Authors: Thomas W. Mitchel, Michael Taylor, Vincent Sitzmann

    Abstract: Real-world geometry and 3D vision tasks are replete with challenging symmetries that defy tractable analytical expression. In this paper, we introduce Neural Isometries, an autoencoder framework which learns to map the observation space to a general-purpose latent space wherein encodings are related by isometries whenever their corresponding observations are geometrically related in world space. S… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  5. arXiv:2405.00746  [pdf, other

    cs.LG cs.AI cs.RO

    Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning

    Authors: Calarina Muslimani, Matthew E. Taylor

    Abstract: To create useful reinforcement learning (RL) agents, step zero is to design a suitable reward function that captures the nuances of the task. However, reward engineering can be a difficult and time-consuming process. Instead, human-in-the-loop (HitL) RL allows agents to learn reward functions from human feedback. Despite recent successes, many of the HitL RL methods still require numerous human in… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

  6. arXiv:2404.13142  [pdf, other

    eess.SY cs.AI cs.LG cs.MA

    Decentralized Coordination of Distributed Energy Resources through Local Energy Markets and Deep Reinforcement Learning

    Authors: Daniel May, Matthew Taylor, Petr Musilek

    Abstract: As the energy landscape evolves toward sustainability, the accelerating integration of distributed energy resources poses challenges to the operability and reliability of the electricity grid. One significant aspect of this issue is the notable increase in net load variability at the grid edge. Transactive energy, implemented through local energy markets, has recently garnered attention as a promi… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: preprint, submitted to Energy and AI

  7. arXiv:2404.13061  [pdf, other

    cs.AR cs.AI cs.LG

    FPGA Divide-and-Conquer Placement using Deep Reinforcement Learning

    Authors: Shang Wang, Deepak Ranganatha Sastry Mamillapalli, Tianpei Yang, Matthew E. Taylor

    Abstract: This paper introduces the problem of learning to place logic blocks in Field-Programmable Gate Arrays (FPGAs) and a learning-based method. In contrast to previous search-based placement algorithms, we instead employ Reinforcement Learning (RL) with the goal of minimizing wirelength. In addition to our preliminary learning results, we also evaluated a novel decomposition to address the nature of la… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: accepted by ISEDA2024

  8. arXiv:2402.06819  [pdf, other

    cs.LG

    Monitored Markov Decision Processes

    Authors: Simone Parisi, Montaser Mohammedalamen, Alireza Kazemipour, Matthew E. Taylor, Michael Bowling

    Abstract: In reinforcement learning (RL), an agent learns to perform a task by interacting with an environment and receiving feedback (a numerical reward) for its actions. However, the assumption that rewards are always observable is often not applicable in real-world problems. For example, the agent may need to ask a human to supervise its actions or activate a monitoring system to receive feedback. There… ▽ More

    Submitted 13 February, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: AAMAS 2024, Main Track

  9. arXiv:2401.02991  [pdf, other

    cs.CL cs.AI cs.LG

    GLIDE-RL: Grounded Language Instruction through DEmonstration in RL

    Authors: Chaitanya Kharyal, Sai Krishna Gottipati, Tanmay Kumar Sinha, Srijita Das, Matthew E. Taylor

    Abstract: One of the final frontiers in the development of complex human - AI collaborative systems is the ability of AI agents to comprehend the natural language and perform tasks accordingly. However, training efficient Reinforcement Learning (RL) agents grounded in natural language has been a long-standing challenge due to the complexity and ambiguity of the language and sparsity of the rewards, among ot… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: 12 pages, 6 figures, to be presented at AAMAS 2024

  10. arXiv:2401.00907  [pdf, other

    cs.LG cs.AI cs.CL

    LaFFi: Leveraging Hybrid Natural Language Feedback for Fine-tuning Language Models

    Authors: Qianxi Li, Yingyue Cao, Jikun Kang, Tianpei Yang, Xi Chen, Jun **, Matthew E. Taylor

    Abstract: Fine-tuning Large Language Models (LLMs) adapts a trained model to specific downstream tasks, significantly improving task-specific performance. Supervised Fine-Tuning (SFT) is a common approach, where an LLM is trained to produce desired answers. However, LLMs trained with SFT sometimes make simple mistakes and result in hallucinations on reasoning tasks such as question-answering. Without extern… ▽ More

    Submitted 31 December, 2023; originally announced January 2024.

    Comments: Paper accepted in Human-Centric Representation Learning workshop at AAAI 2024 (https://hcrl-workshop.github.io/2024/)

  11. arXiv:2312.15339  [pdf, other

    cs.LG cs.AI cs.CV cs.RO

    MaDi: Learning to Mask Distractions for Generalization in Visual Deep Reinforcement Learning

    Authors: Bram Grooten, Tristan Tomilin, Gautham Vasan, Matthew E. Taylor, A. Rupam Mahmood, Meng Fang, Mykola Pechenizkiy, Decebal Constantin Mocanu

    Abstract: The visual world provides an abundance of information, but many input pixels received by agents often contain distracting stimuli. Autonomous agents need the ability to distinguish useful information from task-irrelevant perceptions, enabling them to generalize to unseen environments with new distractions. Existing works approach this problem using data augmentation or large auxiliary networks wit… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

    Comments: Accepted as full-paper (oral) at AAMAS 2024. Code is available at https://github.com/bramgrooten/mask-distractions and see our 40-second video at https://youtu.be/2oImF0h1k48

  12. arXiv:2312.14322  [pdf, other

    cond-mat.mes-hall cs.DB cs.LG quant-ph

    Data Needs and Challenges of Quantum Dot Devices Automation: Workshop Report

    Authors: Justyna P. Zwolak, Jacob M. Taylor, Reed Andrews, Jared Benson, Garnett Bryant, Donovan Buterakos, Anasua Chatterjee, Sankar Das Sarma, Mark A. Eriksson, Eliška Greplová, Michael J. Gullans, Fabian Hader, Tyler J. Kovach, Pranav S. Mundada, Mick Ramsey, Torbjoern Rasmussen, Brandon Severin, Anthony Sigillito, Brennan Undseth, Brian Weber

    Abstract: Gate-defined quantum dots are a promising candidate system to realize scalable, coupled qubit systems and serve as a fundamental building block for quantum computers. However, present-day quantum dot devices suffer from imperfections that must be accounted for, which hinders the characterization, tuning, and operation process. Moreover, with an increasing number of quantum dot qubits, the relevant… ▽ More

    Submitted 12 May, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: White paper/overview based on a workshop held at the National Institute of Standards and Technology, Gaithersburg, MD. 13 pages

  13. arXiv:2312.11768  [pdf, other

    cs.AI cs.LG cs.MA

    Curriculum Learning for Cooperation in Multi-Agent Reinforcement Learning

    Authors: Rupali Bhati, Sai Krishna Gottipati, Clodéric Mars, Matthew E. Taylor

    Abstract: While there has been significant progress in curriculum learning and continuous learning for training agents to generalize across a wide variety of environments in the context of single-agent reinforcement learning, it is unclear if these algorithms would still be valid in a multi-agent setting. In a competitive setting, a learning agent can be trained by making it compete with a curriculum of inc… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 9 pages, 5 figures. Presented at Agent Learning in Open-Endedness Workshop at Neural Information Processing Systems (NeurIPS 2023)

  14. arXiv:2312.11718  [pdf, other

    cs.AI cs.HC cs.LG cs.MA stat.AP

    Human-Machine Teaming for UAVs: An Experimentation Platform

    Authors: Laila El Moujtahid, Sai Krishna Gottipati, Clodéric Mars, Matthew E. Taylor

    Abstract: Full automation is often not achievable or desirable in critical systems with high-stakes decisions. Instead, human-AI teams can achieve better results. To research, develop, evaluate, and validate algorithms suited for such teaming, lightweight experimentation platforms that enable interactions between humans and multiple AI agents are necessary. However, there are limited examples of such platfo… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 9 pages, 6 figures Presented at Conference on Artificial Intelligence for Defense (CAID) 2023

  15. arXiv:2311.14891  [pdf, other

    cs.CY

    Simpson's Paradox and Lagging Progress in Completion Trends of Underrepresented Students in Computer Science

    Authors: John Mason Taylor, Rebecca Drucker, Chris Alvin, Syed Fahad Sultan

    Abstract: It is imperative for the Computer Science (CS) community to ensure active participation and success of students from diverse backgrounds. This work compares CS to other areas of study with respect to success of students from three underrepresented groups: Women, Black and Hispanic or Latino. Using a data-driven approach, we show that trends of success over the years for underrepresented groups in… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

  16. arXiv:2311.00810  [pdf, other

    cs.CY cs.CV cs.HC

    A Call to Arms: AI Should be Critical for Social Media Analysis of Conflict Zones

    Authors: Afia Abedin, Abdul Bais, Cody Buntain, Laura Courchesne, Brian McQuinn, Matthew E. Taylor, Muhib Ullah

    Abstract: The massive proliferation of social media data represents a transformative moment in conflict studies. This data can provide unique insights into the spread and use of weaponry, but the scale and types of data are problematic for traditional open-source intelligence. This paper presents preliminary, transdisciplinary work using computer vision to identify specific weapon systems and the insignias… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  17. Cocoon: Static Information Flow Control in Rust

    Authors: Ada Lamba, Max Taylor, Vincent Beardsley, Jacob Bambeck, Michael D. Bond, Zhiqiang Lin

    Abstract: Information flow control (IFC) provides confidentiality by enforcing noninterference, which ensures that high-secrecy values cannot affect low-secrecy values. Prior work introduces fine-grained IFC approaches that modify the programming language and use nonstandard compilation tools, impose run-time overhead, or report false secrecy leaks -- all of which hinder adoption. This paper presents Coco… ▽ More

    Submitted 18 March, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

    Comments: Will be published in PACMPL(OOPSLA) in October 2024

  18. arXiv:2307.05603  [pdf, other

    cs.SE cs.LG cs.PL

    Can You Improve My Code? Optimizing Programs with Local Search

    Authors: Fatemeh Abdollahi, Saqib Ameen, Matthew E. Taylor, Levi H. S. Lelis

    Abstract: This paper introduces a local search method for improving an existing program with respect to a measurable objective. Program Optimization with Locally Improving Search (POLIS) exploits the structure of a program, defined by its lines. POLIS improves a single line of the program while kee** the remaining lines fixed, using existing brute-force synthesis algorithms, and continues iterating until… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: International Joint Conference on Artificial Intelligence (IJCAI) 2023

  19. arXiv:2307.02666  [pdf, other

    cs.AR

    Chiplet Cloud: Building AI Supercomputers for Serving Large Generative Language Models

    Authors: Huwan Peng, Scott Davidson, Richard Shi, Shuaiwen Leon Song, Michael Taylor

    Abstract: Large language models (LLMs) such as OpenAI's ChatGPT and Google's Gemini have demonstrated unprecedented capabilities of autoregressive AI models across multiple tasks triggering disruptive technology innovations around the world. However, as models continue to grow the cost to serve these models also continues to grow threatening the democratization of LLMs. To address this issue, we propose C… ▽ More

    Submitted 20 May, 2024; v1 submitted 5 July, 2023; originally announced July 2023.

  20. arXiv:2306.08754  [pdf, other

    cs.LG physics.ao-ph

    ClimSim: A large multi-scale dataset for hybrid physics-ML climate emulation

    Authors: Sungduk Yu, Walter Hannah, Liran Peng, Jerry Lin, Mohamed Aziz Bhouri, Ritwik Gupta, Björn Lütjens, Justus Christopher Will, Gunnar Behrens, Julius Busecke, Nora Loose, Charles I Stern, Tom Beucler, Bryce Harrop, Benjamin R Hillman, Andrea Jenney, Savannah Ferretti, Nana Liu, Anima Anandkumar, Noah D Brenowitz, Veronika Eyring, Nicholas Geneva, Pierre Gentine, Stephan Mandt, Jaideep Pathak , et al. (31 additional authors not shown)

    Abstract: Modern climate projections lack adequate spatial and temporal resolution due to computational constraints. A consequence is inaccurate and imprecise predictions of critical processes such as storms. Hybrid methods that combine physics with machine learning (ML) have introduced a new generation of higher fidelity climate simulators that can sidestep Moore's Law by outsourcing compute-hungry, short,… ▽ More

    Submitted 6 February, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 Outstanding Datasets and Benchmarks Track Paper

  21. arXiv:2304.01802  [pdf, other

    math.NA cs.LG

    Machine Learning Discovery of Optimal Quadrature Rules for Isogeometric Analysis

    Authors: Tomas Teijeiro, Jamie M. Taylor, Ali Hashemian, David Pardo

    Abstract: We propose the use of machine learning techniques to find optimal quadrature rules for the construction of stiffness and mass matrices in isogeometric analysis (IGA). We initially consider 1D spline spaces of arbitrary degree spanned over uniform and non-uniform knot sequences, and then the generated optimal rules are used for integration over higher-dimensional spaces using tensor product sense.… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: 18 pages, 14 figures

    MSC Class: 65D32

  22. arXiv:2303.06121  [pdf, other

    cs.LG cs.AI

    Ignorance is Bliss: Robust Control via Information Gating

    Authors: Manan Tomar, Riashat Islam, Matthew E. Taylor, Sergey Levine, Philip Bachman

    Abstract: Informational parsimony provides a useful inductive bias for learning representations that achieve better generalization by being robust to noise and spurious correlations. We propose \textit{information gating} as a way to learn parsimonious representations that identify the minimal information required for a task. When gating information, we can learn to reveal as little information as possible… ▽ More

    Submitted 8 December, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: NeurIPS 2023

  23. arXiv:2302.07104  [pdf, other

    cs.CR cs.AR

    RISE: RISC-V SoC for En/decryption Acceleration on the Edge for Homomorphic Encryption

    Authors: Zahra Azad, Guowei Yang, Rashmi Agrawal, Daniel Petrisko, Michael Taylor, Ajay Joshi

    Abstract: Today edge devices commonly connect to the cloud to use its storage and compute capabilities. This leads to security and privacy concerns about user data. Homomorphic Encryption (HE) is a promising solution to address the data privacy problem as it allows arbitrarily complex computations on encrypted data without ever needing to decrypt it. While there has been a lot of work on accelerating HE com… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

  24. arXiv:2302.06548  [pdf, other

    cs.LG cs.AI

    Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning

    Authors: Bram Grooten, Ghada Sokar, Shibhansh Dohare, Elena Mocanu, Matthew E. Taylor, Mykola Pechenizkiy, Decebal Constantin Mocanu

    Abstract: Tomorrow's robots will need to distinguish useful information from noise when performing different tasks. A household robot for instance may continuously receive a plethora of information about the home, but needs to focus on just a small subset to successfully execute its current chore. Filtering distracting inputs that contain irrelevant data has received little attention in the reinforcement le… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

    Comments: Accepted as full-paper at AAMAS 2023

  25. arXiv:2302.00345  [pdf

    cs.DL

    Slow, slow, quick, quick, slow: five altmetric sources observed over a decade show evolving trends, by research age, attention source maturity and open access status

    Authors: Michael Taylor

    Abstract: The study of temporal trends in altmetrics is under-developed, and this multi-year observation study addresses some of the deficits in our understanding of altmetric behaviour over time. The attention surrounding research outputs, as partially captured by altmetrics, or alternative metrics, constitutes many varied forms of data. Over the years 2008-2013, a set of 7739 papers were sampled on six oc… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

  26. arXiv:2301.11153  [pdf, other

    cs.LG cs.AI cs.MA

    Learning from Multiple Independent Advisors in Multi-agent Reinforcement Learning

    Authors: Sriram Ganapathi Subramanian, Matthew E. Taylor, Kate Larson, Mark Crowley

    Abstract: Multi-agent reinforcement learning typically suffers from the problem of sample inefficiency, where learning suitable policies involves the use of many data samples. Learning from external demonstrators is a possible solution that mitigates this problem. However, most prior approaches in this area assume the presence of a single demonstrator. Leveraging multiple knowledge sources (i.e., advisors)… ▽ More

    Submitted 2 March, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: Paper to appear in AAMAS 2023, London, UK

  27. arXiv:2212.08302  [pdf, other

    cs.LG cs.AI

    Safe Evaluation For Offline Learning: Are We Ready To Deploy?

    Authors: Hager Radi, Josiah P. Hanna, Peter Stone, Matthew E. Taylor

    Abstract: The world currently offers an abundance of data in multiple domains, from which we can learn reinforcement learning (RL) policies without further interaction with the environment. RL agents learning offline from such data is possible but deploying them while learning might be dangerous in domains where safety is critical. Therefore, it is essential to find a way to estimate how a newly-learned age… ▽ More

    Submitted 16 December, 2022; originally announced December 2022.

    Comments: NeurIPS 2021 Workshop on Deployable Decision Making in Embodied Systems [Spotlight]

  28. arXiv:2211.07545  [pdf, ps, other

    cs.RO cs.CV cs.LG

    NeurIPS 2022 Competition: Driving SMARTS

    Authors: Amir Rasouli, Randy Goebel, Matthew E. Taylor, Iuliia Kotseruba, Soheil Alizadeh, Tianpei Yang, Montgomery Alban, Florian Shkurti, Yuzheng Zhuang, Adam Scibior, Kasra Rezaee, Animesh Garg, David Meger, Jun Luo, Liam Paull, Weinan Zhang, Xinyu Wang, Xi Chen

    Abstract: Driving SMARTS is a regular competition designed to tackle problems caused by the distribution shift in dynamic interaction contexts that are prevalent in real-world autonomous driving (AD). The proposed competition supports methodologically diverse solutions, such as reinforcement learning (RL) and offline learning methods, trained on a combination of naturalistic AD data and open-source simulati… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: 10 pages, 8 figures

  29. arXiv:2211.06390  [pdf, other

    cs.AR

    The BlackParrot BedRock Cache Coherence System

    Authors: Mark Wyse, Daniel Petrisko, Farzam Gilani, Yuan-Mao Chueh, Paul Gao, Dai Cheol Jung, Sripathi Muralitharan, Shashank Vijaya Ranga, Mark Oskin, Michael Taylor

    Abstract: This paper presents BP-BedRock, the open-source cache coherence protocol and system implemented within the BlackParrot 64-bit RISC-V multicore processor. BP-BedRock implements the BedRock directory-based MOESIF cache coherence protocol and includes two different open-source coherence protocol engines, one FSM-based and the other microcode programmable. Both coherence engines support coherent uncac… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

  30. SA4U: Practical Static Analysis for Unit Type Error Detection

    Authors: Max Taylor, Johnathon Aurand, Feng Qin, Xiaorui Wang, Brandon Henry, Xiangyu Zhang

    Abstract: Unit type errors, where values with physical unit types (e.g., meters, hours) are used incorrectly in a computation, are common in today's unmanned aerial system (UAS) firmware. Recent studies show that unit type errors represent over 10% of bugs in UAS firmware. Moreover, the consequences of unit type errors are severe. Over 30% of unit type errors cause UAS crashes. This paper proposes SA4U: a p… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: ASE 2022

    Journal ref: 37th IEEE/ACM International Conference on Automated Software Engineering (ASE '22), October 10-14, 2022, Rochester, MI, USA

  31. arXiv:2210.06683  [pdf, other

    cs.LG cs.HC

    Augmenting Flight Training with AI to Efficiently Train Pilots

    Authors: Michael Guevarra, Srijita Das, Christabel Wayllace, Carrie Demmans Epp, Matthew E. Taylor, Alan Tay

    Abstract: We propose an AI-based pilot trainer to help students learn how to fly aircraft. First, an AI agent uses behavioral cloning to learn flying maneuvers from qualified flight instructors. Later, the system uses the agent's decisions to detect errors made by students and provide feedback to help students correct their errors. This paper presents an instantiation of the pilot trainer. We focus on teach… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: 3 pages, 3 figures, submitted to AAAI-23 Demonstration Program

  32. arXiv:2209.01789  [pdf, other

    cs.AR cs.CR

    ProcessorFuzz: Guiding Processor Fuzzing using Control and Status Registers

    Authors: Sadullah Canakci, Chathura Rajapaksha, Anoop Mysore Nataraja, Leila Delshadtehrani, Michael Taylor, Manuel Egele, Ajay Joshi

    Abstract: As the complexity of modern processors has increased over the years, develo** effective verification strategies to identify bugs prior to manufacturing has become critical. Undiscovered micro-architectural bugs in processors can manifest as severe security vulnerabilities in the form of side channels, functional bugs, etc. Inspired by software fuzzing, a technique commonly used for software test… ▽ More

    Submitted 5 September, 2022; originally announced September 2022.

  33. arXiv:2209.01054  [pdf, other

    cs.MA cs.LG

    Taming Multi-Agent Reinforcement Learning with Estimator Variance Reduction

    Authors: Taher Jafferjee, Juliusz Ziomek, Tianpei Yang, Zipeng Dai, Jianhong Wang, Matthew Taylor, Kun Shao, Jun Wang, David Mguni

    Abstract: Centralised training with decentralised execution (CT-DE) serves as the foundation of many leading multi-agent reinforcement learning (MARL) algorithms. Despite its popularity, it suffers from a critical drawback due to its reliance on learning from a single sample of the joint-action at a given state. As agents explore and update their policies during training, these single samples may poorly rep… ▽ More

    Submitted 22 June, 2023; v1 submitted 2 September, 2022; originally announced September 2022.

  34. arXiv:2208.11940  [pdf

    cs.LG cs.AI

    Rail break and derailment prediction using Probabilistic Graphical Modelling

    Authors: Rebecca M. C. Taylor, Johan A. du Preez

    Abstract: Rail breaks are one of the most common causes of derailments internationally. This is no different for the South African Iron Ore line. Many rail breaks occur as a heavy-haul train passes over a crack, large defect or defective weld. In such cases, it is usually too late for the train to slow down in time to prevent a de-railment. Knowing the risk of a rail break occurring associated with a train… ▽ More

    Submitted 25 August, 2022; originally announced August 2022.

    Comments: Proceedings of the 11'th International Heavy Haul Association Conference 2017

    Journal ref: Proceedings of the 11'th International Heavy Haul Association Conference (IHHA 2017), pages 799-805 Cape Town, South Africa

  35. arXiv:2208.09299  [pdf, other

    cs.LG cs.CL cs.IR stat.ML

    SimLDA: A tool for topic model evaluation

    Authors: Rebecca M. C. Taylor, Johan A. du Preez

    Abstract: Variational Bayes (VB) applied to latent Dirichlet allocation (LDA) has become the most popular algorithm for aspect modeling. While sufficiently successful in text topic extraction from large corpora, VB is less successful in identifying aspects in the presence of limited data. We present a novel variational message passing algorithm as applied to Latent Dirichlet Allocation (LDA) and compare it… ▽ More

    Submitted 19 August, 2022; originally announced August 2022.

    Comments: Conference Proceedings

  36. arXiv:2207.12496  [pdf, other

    cs.CV

    NeuriCam: Key-Frame Video Super-Resolution and Colorization for IoT Cameras

    Authors: Bandhav Veluri, Collin Pernu, Ali Saffari, Joshua Smith, Michael Taylor, Shyamnath Gollakota

    Abstract: We present NeuriCam, a novel deep learning-based system to achieve video capture from low-power dual-mode IoT camera systems. Our idea is to design a dual-mode camera system where the first mode is low-power (1.1 mW) but only outputs grey-scale, low resolution, and noisy video and the second mode consumes much higher power (100 mW) but outputs color and higher resolution images. To reduce total en… ▽ More

    Submitted 13 April, 2023; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: MobiCom 2023 camera-ready

  37. arXiv:2205.12332  [pdf, other

    cs.IT eess.SP

    Constant Curvature Curve Tube Codes for Low-Latency Analog Error Correction

    Authors: Anders M. Buvarp, Robert M. Taylor Jr., Kumar Vijay Mishra, Lamine M. Mili, Amir I. Zaghloul

    Abstract: Recent research in ultra-reliable and low latency communications (URLLC) for future wireless systems has spurred interest in short block-length codes. In this context, we analyze arbitrary harmonic bandwidth (BW) expansions for a class of high-dimension constant curvature curve codes for analog error correction of independent continuous-alphabet uniform sources. In particular, we employ the circum… ▽ More

    Submitted 2 August, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: 15 pages, 4 tables, 11 figures

  38. arXiv:2205.02879  [pdf

    cond-mat.mtrl-sci cs.LG physics.chem-ph

    Exploiting Ligand Additivity for Transferable Machine Learning of Multireference Character Across Known Transition Metal Complex Ligands

    Authors: Chenru Duan, Adriana J. Ladera, Julian C. -L. Liu, Michael G. Taylor, Isuru R. Ariyarathna, Heather J. Kulik

    Abstract: Accurate virtual high-throughput screening (VHTS) of transition metal complexes (TMCs) remains challenging due to the possibility of high multi-reference (MR) character that complicates property evaluation. We compute MR diagnostics for over 5,000 ligands present in previously synthesized transition metal complexes in the Cambridge Structural Database (CSD). To accomplish this task, we introduce a… ▽ More

    Submitted 5 May, 2022; originally announced May 2022.

  39. arXiv:2204.11897  [pdf, other

    cs.LG

    Reinforcement Teaching

    Authors: Alex Lewandowski, Calarina Muslimani, Dale Schuurmans, Matthew E. Taylor, Jun Luo

    Abstract: Meta-learning strives to learn about and improve a student's machine learning algorithm. However, existing meta-learning methods either only work with differentiable algorithms or are hand-crafted to improve one specific component of an algorithm. We develop a unifying meta-learning framework, called Reinforcement Teaching, to improve the learning process of any algorithm. Under Reinforcement Teac… ▽ More

    Submitted 22 May, 2022; v1 submitted 25 April, 2022; originally announced April 2022.

    Comments: First two authors contributed equally

  40. arXiv:2204.07254  [pdf, other

    cs.LG cs.AI cs.MA

    Methodical Advice Collection and Reuse in Deep Reinforcement Learning

    Authors: Sahir, Ercüment İlhan, Srijita Das, Matthew E. Taylor

    Abstract: Reinforcement learning (RL) has shown great success in solving many challenging tasks via use of deep neural networks. Although using deep learning for RL brings immense representational power, it also causes a well-known sample-inefficiency problem. This means that the algorithms are data-hungry and require millions of training samples to converge to an adequate policy. One way to combat this iss… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: To be published in ALA2022: Adaptive and Learning Agents Workshop 2022 at AAMAS

  41. arXiv:2203.08553  [pdf, other

    cs.MA cs.AI

    PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration

    Authors: Pengyi Li, Hongyao Tang, Tianpei Yang, Xiaotian Hao, Tong Sang, Yan Zheng, Jianye Hao, Matthew E. Taylor, Wenyuan Tao, Zhen Wang, Fazl Barez

    Abstract: Learning to collaborate is critical in Multi-Agent Reinforcement Learning (MARL). Previous works promote collaboration by maximizing the correlation of agents' behaviors, which is typically characterized by Mutual Information (MI) in different forms. However, we reveal sub-optimal collaborative behaviors also emerge with strong correlations, and simply maximizing the MI can, surprisingly, hinder t… ▽ More

    Submitted 21 February, 2023; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: The paper has been accepted by The Thirty-ninth International Conference on Machine Learning (ICML 2022) and the Cooperative AI Workshop at 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  42. arXiv:2203.05097  [pdf

    cs.DC

    A Framework for the Interoperability of Cloud Platforms: Towards FAIR Data in SAFE Environments

    Authors: Robert L. Grossman, Rebecca R. Boyles, Brandi N. Davis-Dusenbery, Amanda Haddock, Allison P. Heath, Brian D. O'Connor, Adam C. Resnick, Deanne M. Taylor, Stan Ahalt

    Abstract: As the number of cloud platforms supporting scientific research grows, there is an increasing need to support interoperability between two or more cloud platforms, as a growing amount of data is being hosted in cloud-based platforms. A well accepted core concept is to make data in cloud platforms Findable, Accessible, Interoperable and Reusable (FAIR). We introduce a companion concept that applies… ▽ More

    Submitted 15 February, 2024; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: 16 pages with 2 figures

    ACM Class: D.2.11; D.2.12; E.0

  43. arXiv:2201.03954  [pdf, other

    cs.LG cs.AI

    The Dataset Nutrition Label (2nd Gen): Leveraging Context to Mitigate Harms in Artificial Intelligence

    Authors: Kasia S. Chmielinski, Sarah Newman, Matt Taylor, Josh Joseph, Kemi Thomas, Jessica Yurkofsky, Yue Chelsea Qiu

    Abstract: As the production of and reliance on datasets to produce automated decision-making systems (ADS) increases, so does the need for processes for evaluating and interrogating the underlying data. After launching the Dataset Nutrition Label in 2018, the Data Nutrition Project has made significant updates to the design and purpose of the Label, and is launching an updated Label in late 2020, which is p… ▽ More

    Submitted 10 March, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

  44. arXiv:2112.09362  [pdf, other

    quant-ph cond-mat.mes-hall cs.CV cs.LG

    Colloquium: Advances in automation of quantum dot devices control

    Authors: Justyna P. Zwolak, Jacob M. Taylor

    Abstract: Arrays of quantum dots (QDs) are a promising candidate system to realize scalable, coupled qubit systems and serve as a fundamental building block for quantum computers. In such semiconductor quantum systems, devices now have tens of individual electrostatic and dynamical voltages that must be carefully set to localize the system into the single-electron regime and to realize good qubit operationa… ▽ More

    Submitted 25 May, 2023; v1 submitted 17 December, 2021; originally announced December 2021.

    Comments: 24 pages, 11 figures

    Journal ref: Rev. Mod. Phys. 95, 011006 (2023)

  45. arXiv:2112.09099  [pdf, other

    cs.MA

    Decentralized Mean Field Games

    Authors: Sriram Ganapathi Subramanian, Matthew E. Taylor, Mark Crowley, Pascal Poupart

    Abstract: Multiagent reinforcement learning algorithms have not been widely adopted in large scale environments with many agents as they often scale poorly with the number of agents. Using mean field theory to aggregate agents has been proposed as a solution to this problem. However, almost all previous methods in this area make a strong assumption of a centralized system where all the agents in the environ… ▽ More

    Submitted 13 April, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: This work is to appear in AAAI-22. Recent version has minor formatting changes and some typos corrected

  46. arXiv:2111.07775  [pdf, other

    cs.LG cs.AI cs.CV

    Learning Representations for Pixel-based Control: What Matters and Why?

    Authors: Manan Tomar, Utkarsh A. Mishra, Amy Zhang, Matthew E. Taylor

    Abstract: Learning representations for pixel-based control has garnered significant attention recently in reinforcement learning. A wide range of methods have been proposed to enable efficient learning, leading to sample complexities similar to those in the full state setting. However, moving beyond carefully curated pixel data sets (centered crop, appropriate lighting, clear background, etc.) remains chall… ▽ More

    Submitted 15 November, 2021; originally announced November 2021.

  47. arXiv:2111.01676  [pdf

    cs.CL

    Towards Text-based Phishing Detection

    Authors: Gilchan Park, Julia M. Taylor

    Abstract: This paper reports on an experiment into text-based phishing detection using readily available resources and without the use of semantics. The developed algorithm is a modified version of previously published work that works with the same tools. The results obtained in recognizing phishing emails are considerably better than the previously reported work; but the rate of text falsely identified as… ▽ More

    Submitted 3 November, 2021; v1 submitted 2 November, 2021; originally announced November 2021.

    Comments: Society for Design and Process Science (SDPS) 2013, pp.187-192. https://www.sdpsnet.org/sdps/documents/sdps-2013/SDPS_2013_proceedings.pdf

  48. arXiv:2111.01480  [pdf, other

    cs.LG stat.ML

    A derivation of variational message passing (VMP) for latent Dirichlet allocation (LDA)

    Authors: Rebecca M. C. Taylor, Dirko Coetsee, Johan A. du Preez

    Abstract: Latent Dirichlet Allocation (LDA) is a probabilistic model used to uncover latent topics in a corpus of documents. Inference is often performed using variational Bayes (VB) algorithms, which calculate a lower bound to the posterior distribution over the parameters. Deriving the variational update equations for new models requires considerable manual effort; variational message passing (VMP) has em… ▽ More

    Submitted 25 August, 2022; v1 submitted 2 November, 2021; originally announced November 2021.

    Comments: 24 pages,not yet submitted anywhere

    MSC Class: G.3

  49. arXiv:2111.00345  [pdf, other

    cs.AI cs.MA

    Multi-Agent Advisor Q-Learning

    Authors: Sriram Ganapathi Subramanian, Matthew E. Taylor, Kate Larson, Mark Crowley

    Abstract: In the last decade, there have been significant advances in multi-agent reinforcement learning (MARL) but there are still numerous challenges, such as high sample complexity and slow convergence to stable policies, that need to be overcome before wide-spread deployment is possible. However, many real-world environments already, in practice, deploy sub-optimal or heuristic approaches for generating… ▽ More

    Submitted 1 March, 2023; v1 submitted 25 October, 2021; originally announced November 2021.

    Comments: Paper has been accepted to Journal of Artificial Intelligence Research (JAIR). Please refer to https://jair.org/index.php/jair/article/view/13445 for JAIR version. The most recent version includes two illustrative figures that pictorially describes the settings of the two algorithms (i.e., ADMIRAL-DM and ADMIRAL-AE)

  50. arXiv:2110.00635  [pdf, other

    cs.LG cs.CL cs.IR stat.ML

    ALBU: An approximate Loopy Belief message passing algorithm for LDA to improve performance on small data sets

    Authors: Rebecca M. C. Taylor, Johan A. du Preez

    Abstract: Variational Bayes (VB) applied to latent Dirichlet allocation (LDA) has become the most popular algorithm for aspect modeling. While sufficiently successful in text topic extraction from large corpora, VB is less successful in identifying aspects in the presence of limited data. We present a novel variational message passing algorithm as applied to Latent Dirichlet Allocation (LDA) and compare it… ▽ More

    Submitted 19 August, 2022; v1 submitted 1 October, 2021; originally announced October 2021.

    Comments: Springer has accepted to publish the Proceedings of 2022 Computing Conference in the series "Lecture Notes in Networks and Systems"

    MSC Class: 62F15 ACM Class: G.3

    Journal ref: In Science and Information Conference (pp. 723-746). Springer, Cham (2022)