Skip to main content

Showing 1–14 of 14 results for author: Amani, S

.
  1. arXiv:2403.01538  [pdf

    cs.HC

    A Preliminary Exploration of the Disruption of a Generative AI Systems: Faculty/Staff and Student Perceptions of ChatGPT and its Capability of Completing Undergraduate Engineering Coursework

    Authors: Lance White, Trini Balart, Sara Amani, Dr. Kristi J. Shryock, Dr. Karan L. Watson

    Abstract: The authors of this study aim to assess the capabilities of the OpenAI ChatGPT tool to understand just how effective such a system might be for students to utilize in their studies as well as deepen understanding of faculty/staff and student perceptions about ChatGPT in general. The purpose of what is learned from the study is to continue the design of a model to facilitate the development of facu… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: 22 pages, 13 figures

  2. arXiv:2307.05834  [pdf, other

    cs.LG cs.AI

    Scaling Distributed Multi-task Reinforcement Learning with Experience Sharing

    Authors: Sanae Amani, Khushbu Pahwa, Vladimir Braverman, Lin F. Yang

    Abstract: Recently, DARPA launched the ShELL program, which aims to explore how experience sharing can benefit distributed lifelong learning agents in adapting to new challenges. In this paper, we address this issue by conducting both theoretical and empirical research on distributed multi-task reinforcement learning (RL), where a group of $N$ agents collaboratively solves $M$ tasks without prior knowledge… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

  3. arXiv:2304.14415  [pdf

    cs.HC cs.AI cs.CL cs.CY

    Generative AI Perceptions: A Survey to Measure the Perceptions of Faculty, Staff, and Students on Generative AI Tools in Academia

    Authors: Sara Amani, Lance White, Trini Balart, Laksha Arora, Dr. Kristi J. Shryock, Dr. Kelly Brumbelow, Dr. Karan L. Watson

    Abstract: ChatGPT is a natural language processing tool that can engage in human-like conversations and generate coherent and contextually relevant responses to various prompts. ChatGPT is capable of understanding natural text that is input by a user and generating appropriate responses in various forms. This tool represents a major step in how humans are interacting with technology. This paper specifically… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

    Comments: 17 pages, 3 figures

  4. arXiv:2206.00270  [pdf, ps, other

    cs.LG stat.ML

    Provably Efficient Lifelong Reinforcement Learning with Linear Function Approximation

    Authors: Sanae Amani, Lin F. Yang, Ching-An Cheng

    Abstract: We study lifelong reinforcement learning (RL) in a regret minimization setting of linear contextual Markov decision process (MDP), where the agent needs to learn a multi-task policy while solving a streaming sequence of tasks. We propose an algorithm, called UCB Lifelong Value Distillation (UCBlvd), that provably achieves sublinear regret for any sequence of tasks, which may be adaptively chosen b… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

  5. arXiv:2205.13170  [pdf, other

    cs.LG stat.ML

    Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost

    Authors: Sanae Amani, Tor Lattimore, András György, Lin F. Yang

    Abstract: We study distributed contextual linear bandits with stochastic contexts, where $N$ agents act cooperatively to solve a linear bandit-optimization problem with $d$-dimensional features over the course of $T$ rounds. For this problem, we derive the first ever information-theoretic lower bound $Ω(dN)$ on the communication cost of any algorithm that performs optimally in a regret minimization setup. W… ▽ More

    Submitted 7 December, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

  6. arXiv:2106.06239  [pdf, ps, other

    cs.LG stat.ML

    Safe Reinforcement Learning with Linear Function Approximation

    Authors: Sanae Amani, Christos Thrampoulidis, Lin F. Yang

    Abstract: Safety in reinforcement learning has become increasingly important in recent years. Yet, existing solutions either fail to strictly avoid choosing unsafe actions, which may lead to catastrophic results in safety-critical systems, or fail to provide regret guarantees for settings where safety constraints need to be learned. In this paper, we address both problems by first modeling safety as an unkn… ▽ More

    Submitted 11 June, 2021; originally announced June 2021.

  7. arXiv:2103.11489  [pdf, ps, other

    cs.LG stat.ML

    UCB-based Algorithms for Multinomial Logistic Regression Bandits

    Authors: Sanae Amani, Christos Thrampoulidis

    Abstract: Out of the rich family of generalized linear bandits, perhaps the most well studied ones are logisitc bandits that are used in problems with binary rewards: for instance, when the learner/agent tries to maximize the profit over a user that can select one of two possible outcomes (e.g., `click' vs `no-click'). Despite remarkable recent progress and improved algorithms for logistic bandits, existing… ▽ More

    Submitted 21 March, 2021; originally announced March 2021.

    Comments: 27 pages, 5 figures

  8. arXiv:2012.00314  [pdf, ps, other

    cs.LG stat.ML

    Decentralized Multi-Agent Linear Bandits with Safety Constraints

    Authors: Sanae Amani, Christos Thrampoulidis

    Abstract: We study decentralized stochastic linear bandits, where a network of $N$ agents acts cooperatively to efficiently solve a linear bandit-optimization problem over a $d$-dimensional space. For this problem, we propose DLUCB: a fully decentralized algorithm that minimizes the cumulative regret over the entire network. At each round of the algorithm each agent chooses its actions following an upper co… ▽ More

    Submitted 1 December, 2020; originally announced December 2020.

  9. arXiv:2005.01936  [pdf, ps, other

    cs.LG stat.ML

    Regret Bounds for Safe Gaussian Process Bandit Optimization

    Authors: Sanae Amani, Mahnoosh Alizadeh, Christos Thrampoulidis

    Abstract: Many applications require a learner to make sequential decisions given uncertainty regarding both the system's payoff function and safety constraints. In safety-critical systems, it is paramount that the learner's actions do not violate the safety constraints at any stage of the learning process. In this paper, we study a stochastic bandit optimization problem where the unknown payoff and constrai… ▽ More

    Submitted 4 May, 2020; originally announced May 2020.

    Comments: 22 pages, 17 figures

  10. arXiv:1911.02156  [pdf, other

    cs.LG stat.ML

    Safe Linear Thompson Sampling with Side Information

    Authors: Ahmadreza Moradipari, Sanae Amani, Mahnoosh Alizadeh, Christos Thrampoulidis

    Abstract: The design and performance analysis of bandit algorithms in the presence of stage-wise safety or reliability constraints has recently garnered significant interest. In this work, we consider the linear stochastic bandit problem under additional \textit{linear safety constraints} that need to be satisfied at each round. We provide a new safe algorithm based on linear Thompson Sampling (TS) for this… ▽ More

    Submitted 29 February, 2020; v1 submitted 5 November, 2019; originally announced November 2019.

    Comments: Comparing with safe versions of linear UCB algorithms, Providing more intuition for proof sketch

  11. arXiv:1908.05814  [pdf, other

    cs.LG stat.ML

    Linear Stochastic Bandits Under Safety Constraints

    Authors: Sanae Amani, Mahnoosh Alizadeh, Christos Thrampoulidis

    Abstract: Bandit algorithms have various application in safety-critical systems, where it is important to respect the system constraints that rely on the bandit's unknown parameters at every round. In this paper, we formulate a linear stochastic multi-armed bandit problem with safety constraints that depend (linearly) on an unknown parameter vector. As such, the learner is unable to identify all safe action… ▽ More

    Submitted 15 August, 2019; originally announced August 2019.

    Comments: 23 pages, 7 figures

  12. arXiv:1601.05520  [pdf, other

    cs.PL cs.LO

    COGENT: Certified Compilation for a Functional Systems Language

    Authors: Liam O'Connor, Christine Rizkallah, Zilin Chen, Sidney Amani, Japheth Lim, Yutaka Nagashima, Thomas Sewell, Alex Hixon, Gabriele Keller, Toby Murray, Gerwin Klein

    Abstract: We present a self-certifying compiler for the COGENT systems language. COGENT is a restricted, polymorphic, higher-order, and purely functional language with linear types and without the need for a trusted runtime or garbage collector. It compiles to efficient C code that is designed to interoperate with existing C functions. The language is suited for layered systems code with minimal sharing suc… ▽ More

    Submitted 21 January, 2016; originally announced January 2016.

  13. Specifying a Realistic File System

    Authors: Sidney Amani, Toby Murray

    Abstract: We present the most interesting elements of the correctness specification of BilbyFs, a performant Linux flash file system. The BilbyFs specification supports asynchronous writes, a feature that has been overlooked by several file system verification projects, and has been used to verify the correctness of BilbyFs's fsync() C implementation. It makes use of nondeterminism to be concise and is shal… ▽ More

    Submitted 13 November, 2015; originally announced November 2015.

    Comments: In Proceedings MARS 2015, arXiv:1511.02528

    Journal ref: EPTCS 196, 2015, pp. 1-9

  14. Automatic Verification of Message-Based Device Drivers

    Authors: Sidney Amani, Peter Chubb, Alastair F. Donaldson, Alexander Legg, Leonid Ryzhyk, Yan** Zhu

    Abstract: We develop a practical solution to the problem of automatic verification of the interface between device drivers and the OS. Our solution relies on a combination of improved driver architecture and verification tools. It supports drivers written in C and can be implemented in any existing OS, which sets it apart from previous proposals for verification-friendly drivers. Our Linux-based evaluati… ▽ More

    Submitted 26 November, 2012; originally announced November 2012.

    Comments: In Proceedings SSV 2012, arXiv:1211.5873

    ACM Class: D.4.4; B.4.2; D.2.4

    Journal ref: EPTCS 102, 2012, pp. 4-17