Skip to main content

Showing 1–50 of 104 results for author: Seshia, S

.
  1. arXiv:2406.06253  [pdf, other

    eess.SY cs.PL

    PretVM: Predictable, Efficient Virtual Machine for Real-Time Concurrency

    Authors: Shaokai Lin, Erling Jellum, Mirco Theile, Tassilo Tanneberger, Binqi Sun, Chadlia Jerad, Ruomu Xu, Guangyu Feng, Christian Menard, Marten Lohstroh, Jeronimo Castrillon, Sanjit Seshia, Edward Lee

    Abstract: This paper introduces the Precision-Timed Virtual Machine (PretVM), an intermediate platform facilitating the execution of quasi-static schedules compiled from a subset of programs written in the Lingua Franca (LF) coordination language. The subset consists of those programs that in principle should have statically verifiable and predictable timing behavior. The PretVM provides a schedule with wel… ▽ More

    Submitted 25 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  2. arXiv:2406.05403  [pdf, other

    cs.CR cs.AR

    SemPat: Using Hyperproperty-based Semantic Analysis to Generate Microarchitectural Attack Patterns

    Authors: Adwait Godbole, Yatin A. Manerkar, Sanjit A. Seshia

    Abstract: Microarchitectural security verification of software has seen the emergence of two broad classes of approaches. The first is based on semantic security properties (e.g., non-interference) which are verified for a given program and a specified abstract model of the hardware microarchitecture. The second is based on attack patterns, which, if found in a program execution, indicates the presence of a… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: 16 pages, 14 figures, under review

  3. arXiv:2406.03636  [pdf, other

    cs.PL cs.LG

    Synthetic Programming Elicitation and Repair for Text-to-Code in Very Low-Resource Programming Languages

    Authors: Federico Mora, Justin Wong, Haley Lepe, Sahil Bhatia, Karim Elmaaroufi, George Varghese, Joseph E. Gonzalez, Elizabeth Polgreen, Sanjit A. Seshia

    Abstract: Recent advances in large language models (LLMs) for code applications have demonstrated remarkable zero-shot fluency and instruction following on challenging code related tasks ranging from test case generation to self-repair. Unsurprisingly, however, models struggle to compose syntactically valid programs in programming languages unrepresented in pre-training, referred to as very low-resource Pro… ▽ More

    Submitted 29 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: 15 pages, 6 figures, 1 table

  4. arXiv:2406.03003  [pdf, other

    cs.PL

    Verified Code Transpilation with LLMs

    Authors: Sahil Bhatia, Jie Qiu, Niranjan Hasabnis, Sanjit A. Seshia, Alvin Cheung

    Abstract: Domain-specific languages (DSLs) are integral to various software workflows. Such languages offer domain-specific optimizations and abstractions that improve code readability and maintainability. However, leveraging these languages requires developers to rewrite existing code using the specific DSL's API. While large language models (LLMs) have shown some success in automatic code transpilation, n… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  5. arXiv:2405.06624  [pdf, other

    cs.AI

    Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems

    Authors: David "davidad" Dalrymple, Joar Skalse, Yoshua Bengio, Stuart Russell, Max Tegmark, Sanjit Seshia, Steve Omohundro, Christian Szegedy, Ben Goldhaber, Nora Ammann, Alessandro Abate, Joe Halpern, Clark Barrett, Ding Zhao, Tan Zhi-Xuan, Jeannette Wing, Joshua Tenenbaum

    Abstract: Ensuring that AI systems reliably and robustly avoid harmful or dangerous behaviours is a crucial challenge, especially for AI systems with a high degree of autonomy and general intelligence, or systems used in safety-critical contexts. In this paper, we will introduce and define a family of approaches to AI safety, which we will refer to as guaranteed safe (GS) AI. The core feature of these appro… ▽ More

    Submitted 8 July, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

  6. arXiv:2405.03709  [pdf, other

    cs.SE cs.AI cs.LG cs.PL

    Generating Probabilistic Scenario Programs from Natural Language

    Authors: Karim Elmaaroufi, Devan Shanker, Ana Cismaru, Marcell Vazquez-Chanlatte, Alberto Sangiovanni-Vincentelli, Matei Zaharia, Sanjit A. Seshia

    Abstract: For cyber-physical systems (CPS), including robotics and autonomous vehicles, mass deployment has been hindered by fatal errors that occur when operating in rare events. To replicate rare events such as vehicle crashes, many companies have created logging systems and employed crash reconstruction experts to meticulously recreate these valuable events in simulation. However, in these methods, "what… ▽ More

    Submitted 14 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

    Comments: 17 pages, 2 figures

  7. arXiv:2404.18249  [pdf, other

    cs.PL

    Tenspiler: A Verified Lifting-Based Compiler for Tensor Operations

    Authors: Jie Qiu, Colin Cai, Sahil Bhatia, Niranjan Hasabnis, Sanjit A. Seshia, Alvin Cheung

    Abstract: Tensor processing infrastructures such as deep learning frameworks and specialized hardware accelerators have revolutionized how computationally intensive code from domains such as deep learning and image processing is executed and optimized. These infrastructures provide powerful and expressive abstractions while ensuring high performance. However, to utilize them, code must be written specifical… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  8. arXiv:2404.11578  [pdf, other

    cs.LG cs.AI cs.FL

    LTL-Constrained Policy Optimization with Cycle Experience Replay

    Authors: Ameesh Shah, Cameron Voloshin, Chenxi Yang, Abhinav Verma, Swarat Chaudhuri, Sanjit A. Seshia

    Abstract: Linear Temporal Logic (LTL) offers a precise means for constraining the behavior of reinforcement learning agents. However, in many tasks, LTL is insufficient for task specification; LTL-constrained policy optimization, where the goal is to optimize a scalar reward under LTL constraints, is needed. Prior methods for this constrained problem are restricted to finite state spaces. In this work, we p… ▽ More

    Submitted 24 May, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: preprint, 9 pages in main text

  9. arXiv:2403.11737  [pdf, other

    cs.RO eess.SY

    SMT-Based Dynamic Multi-Robot Task Allocation

    Authors: Victoria Marie Tuck, Pei-Wei Chen, Georgios Fainekos, Bardh Hoxha, Hideki Okamoto, S. Shankar Sastry, Sanjit A. Seshia

    Abstract: Multi-Robot Task Allocation (MRTA) is a problem that arises in many application domains including package delivery, warehouse robotics, and healthcare. In this work, we consider the problem of MRTA for a dynamic stream of tasks with task deadlines and capacitated agents (capacity for more than one simultaneous task). Previous work commonly focuses on the static case, uses specialized algorithms fo… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 26 pages, 6 figures, to be published in NASA Formal Methods Symposium 2024

  10. arXiv:2402.07051  [pdf, other

    cs.LG cs.AI cs.CL cs.FL

    $L^*LM$: Learning Automata from Examples using Natural Language Oracles

    Authors: Marcell Vazquez-Chanlatte, Karim Elmaaroufi, Stefan J. Witwicki, Sanjit A. Seshia

    Abstract: Expert demonstrations have proven an easy way to indirectly specify complex tasks. Recent algorithms even support extracting unambiguous formal specifications, e.g. deterministic finite automata (DFA), from demonstrations. Unfortunately, these techniques are generally not sample efficient. In this work, we introduce $L^*LM$, an algorithm for learning DFAs from both demonstrations and natural langu… ▽ More

    Submitted 10 February, 2024; originally announced February 2024.

  11. arXiv:2310.12234  [pdf, other

    cs.LO cs.AI cs.PL

    An Eager Satisfiability Modulo Theories Solver for Algebraic Datatypes

    Authors: Amar Shah, Federico Mora, Sanjit A. Seshia

    Abstract: Algebraic data types (ADTs) are a construct classically found in functional programming languages that capture data structures like enumerated types, lists, and trees. In recent years, interest in ADTs has increased. For example, popular programming languages, like Python, have added support for ADTs. Automated reasoning about ADTs can be done using satisfiability modulo theories (SMT) solving, an… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

  12. arXiv:2307.10434  [pdf, other

    cs.FL cs.AI cs.LG

    Learning Formal Specifications from Membership and Preference Queries

    Authors: Ameesh Shah, Marcell Vazquez-Chanlatte, Sebastian Junges, Sanjit A. Seshia

    Abstract: Active learning is a well-studied approach to learning formal specifications, such as automata. In this work, we extend active specification learning by proposing a novel framework that strategically requests a combination of membership labels and pair-wise preferences, a popular alternative to membership labels. The combination of pair-wise preferences and membership labels allows for a more flex… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: 6 pages, Presented at ICML 2023 Workshop on The Many Facets of Preference-Based Learning

  13. arXiv:2307.03325  [pdf, other

    cs.PL

    3D Environment Modeling for Falsification and Beyond with Scenic 3.0

    Authors: Eric Vin, Shun Kashiwa, Matthew Rhea, Daniel J. Fremont, Edward Kim, Tommaso Dreossi, Shromona Ghosh, Xiangyu Yue, Alberto L. Sangiovanni-Vincentelli, Sanjit A. Seshia

    Abstract: We present a major new version of Scenic, a probabilistic programming language for writing formal models of the environments of cyber-physical systems. Scenic has been successfully used for the design and analysis of CPS in a variety of domains, but earlier versions are limited to environments which are essentially two-dimensional. In this paper, we extend Scenic with native support for 3D geometr… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: 13 pages, 6 figures. Full version of a CAV 2023 tool paper, to appear in the Springer Lecture Notes in Computer Science series

  14. arXiv:2306.00521  [pdf

    cs.SE cs.PL

    Genetic Algorithms for Searching a Matrix of Metagrammars for Synthesis

    Authors: Yixuan Li, Federico Mora, Elizabeth Polgreen, Sanjit A. Seshia

    Abstract: Syntax-guided synthesis is a paradigm in program synthesis in which the search space of candidate solutions is constrained by a syntactic template in the form of a grammar. These syntactic constraints serve two purposes: constraining the language to the space the user desires, but also rendering the search space tractable for the synthesizer. Given a well-written syntactic template, this is an ext… ▽ More

    Submitted 4 June, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: SYNT 2023 of CAV 2023, 4 Pages

  15. arXiv:2305.17596  [pdf, other

    cs.LO eess.SY

    Context-Aided Variable Elimination for Requirement Engineering

    Authors: Inigo Incer, Albert Benveniste, Richard M. Murray, Alberto Sangiovanni-Vincentelli, Sanjit A. Seshia

    Abstract: Deriving system-level specifications from component specifications usually involves the elimination of variables that are not part of the interface of the top-level system. This paper presents algorithms for eliminating variables from formulas by computing refinements or relaxations of these formulas in a context. We discuss a connection between this problem and optimization and give efficient alg… ▽ More

    Submitted 27 May, 2023; originally announced May 2023.

  16. arXiv:2303.17751  [pdf, other

    cs.LO eess.SY

    Pacti: Scaling Assume-Guarantee Reasoning for System Analysis and Design

    Authors: Inigo Incer, Apurva Badithela, Josefine Graebener, Piergiuseppe Mallozzi, Ayush Pandey, Sheng-Jung Yu, Albert Benveniste, Benoit Caillaud, Richard M. Murray, Alberto Sangiovanni-Vincentelli, Sanjit A. Seshia

    Abstract: Contract-based design is a method to facilitate modular system design. While there has been substantial progress on the theory of contracts, there has been less progress on scalable algorithms for the algebraic operations in this theory. In this paper, we present: 1) principles to implement a contract-based design tool at scale and 2) Pacti, a tool that can efficiently compute these operations. We… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

  17. arXiv:2303.17010  [pdf, other

    cs.LG cs.FL cs.RO

    Specification-Guided Data Aggregation for Semantically Aware Imitation Learning

    Authors: Ameesh Shah, Jonathan DeCastro, John Gideon, Beyazit Yalcinkaya, Guy Rosman, Sanjit A. Seshia

    Abstract: Advancements in simulation and formal methods-guided environment sampling have enabled the rigorous evaluation of machine learning models in a number of safety-critical scenarios, such as autonomous driving. Application of these environment sampling techniques towards improving the learned models themselves has yet to be fully exploited. In this work, we introduce a novel method for improving imit… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: 8 pages, under review

  18. arXiv:2302.13980  [pdf, other

    cs.RO cs.AI cs.FL

    A Grammar for the Representation of Unmanned Aerial Vehicles with 3D Topologies

    Authors: Piergiuseppe Mallozzi, Hussein Sibai, Inigo Incer, Sanjit A. Seshia, Alberto Sangiovanni-Vincentelli

    Abstract: We propose a context-sensitive grammar for the systematic exploration of the design space of the topology of 3D robots, particularly unmanned aerial vehicles. It defines production rules for adding components to an incomplete design topology modeled over a 3D grid. The rules are local. The grammar is simple, yet capable of modeling most existing UAVs as well as novel ones. It can be easily general… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

  19. arXiv:2212.02795  [pdf

    cs.CY

    Emerging Technology and Policy Co-Design Considerations for the Safe and Transparent Use of Small Unmanned Aerial Systems

    Authors: Ritwik Gupta, Alexander Bayen, Sarah Rohrschneider, Adrienne Fulk, Andrew Reddie, Sanjit A. Seshia, Shankar Sastry, Janet Napolitano

    Abstract: The rapid technological growth observed in the sUAS sector over the past decade has been unprecedented and has left gaps in policies and regulations to adequately provide for a safe and trusted environment in which to operate these devices. The Center for Security in Politics at UC Berkeley, via a two-day workshop, analyzed these gaps by addressing the entire sUAS vertical. From human factors to a… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

  20. arXiv:2211.02179  [pdf, other

    cs.CR

    Verifying RISC-V Physical Memory Protection

    Authors: Kevin Cheang, Cameron Rasmussen, Dayeol Lee, David W. Kohlbrenner, Krste Asanović, Sanjit A. Seshia

    Abstract: We formally verify an open-source hardware implementation of physical memory protection (PMP) in RISC-V, which is a standard feature used for memory isolation in security critical systems such as the Keystone trusted execution environment. PMP provides per-hardware-thread machine-mode control registers that specify the access privileges for physical memory regions. We first formalize the functiona… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: SECRISC-V 2019 Workshop

  21. Cerberus: A Formal Approach to Secure and Efficient Enclave Memory Sharing

    Authors: Dayeol Lee, Kevin Cheang, Alexander Thomas, Catherine Lu, Pranav Gaddamadugu, Anjo Vahldiek-Oberwagner, Mona Vij, Dawn Song, Sanjit A. Seshia, Krste Asanović

    Abstract: Hardware enclaves rely on a disjoint memory model, which maps each physical address to an enclave to achieve strong memory isolation. However, this severely limits the performance and programmability of enclave programs. While some prior work proposes enclave memory sharing, it does not provide a formal model or verification of their designs. This paper presents Cerberus, a formal approach to secu… ▽ More

    Submitted 14 November, 2022; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: ACM CCS 2022

  22. arXiv:2208.06733  [pdf, other

    cs.LO cs.AR

    Automated Conversion of Axiomatic to Operational Models: Theory and Practice

    Authors: Adwait Godbole, Yatin A. Manerkar, Sanjit A. Seshia

    Abstract: A system may be modelled as an operational model (which has explicit notions of state and transitions between states) or an axiomatic model (which is specified entirely as a set of invariants). Most formal methods techniques (e.g., IC3, invariant synthesis, etc) are designed for operational models and are largely inaccessible to axiomatic models. Furthermore, no prior method exists to automaticall… ▽ More

    Submitted 13 August, 2022; originally announced August 2022.

    Comments: 16 pages, 14 pages

    ACM Class: F.1.1; C.1.2; F.3.1

  23. arXiv:2208.03699  [pdf, other

    cs.LO

    UCLID5: Multi-Modal Formal Modeling, Verification, and Synthesis

    Authors: Elizabeth Polgreen, Kevin Cheang, Pranav Gaddamadugu, Adwait Godbole, Kevin Laeufer, Shaokai Lin, Yatin A. Manerkar, Federico Mora, Sanjit A. Seshia

    Abstract: UCLID5 is a tool for the multi-modal formal modeling, verification, and synthesis of systems. It enables one to tackle verification problems for heterogeneous systems such as combinations of hardware and software, or those that have multiple, varied specifications, or systems that require hybrid modes of modeling. A novel aspect of \uclid is an emphasis on the use of syntax-guided and inductive sy… ▽ More

    Submitted 7 August, 2022; originally announced August 2022.

    Comments: 12 pages plus appendix. Published at CAV 2022

  24. arXiv:2205.13013  [pdf, other

    cs.FL

    Learning Deterministic Finite Automata Decompositions from Examples and Demonstrations

    Authors: Niklas Lauffer, Beyazit Yalcinkaya, Marcell Vazquez-Chanlatte, Ameesh Shah, Sanjit A. Seshia

    Abstract: The identification of a deterministic finite automaton (DFA) from labeled examples is a well-studied problem in the literature; however, prior work focuses on the identification of monolithic DFAs. Although monolithic DFAs provide accurate descriptions of systems' behavior, they lack simplicity and interpretability; moreover, they fail to capture sub-tasks realized by the system and introduce indu… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: Preprint, under review

  25. DEC-LOS-RRT: Decentralized Path Planning for Multi-robot Systems with Line-of-sight Constrained Communication

    Authors: Victoria Tuck, Yash Vardhan Pant, Sanjit A. Seshia, S. Shankar Sastry

    Abstract: Decentralized planning for multi-agent systems, such as fleets of robots in a search-and-rescue operation, is often constrained by limitations on how agents can communicate with each other. One such limitation is the case when agents can communicate with each other only when they are in line-of-sight (LOS). Develo** decentralized planning methods that guarantee safety is difficult in this case,… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: 8 pages, 8 figures, Presented at CCTA 2022

    Journal ref: CCTA (2021) 103-110

  26. arXiv:2112.10807  [pdf, other

    cs.AI cs.FL

    Demonstration Informed Specification Search

    Authors: Marcell Vazquez-Chanlatte, Ameesh Shah, Gil Lederman, Sanjit A. Seshia

    Abstract: This paper considers the problem of learning temporal task specifications, e.g. automata and temporal logic, from expert demonstrations. Task specifications are a class of sparse memory augmented rewards with explicit support for temporal and Boolean composition. Three features make learning temporal task specifications difficult: (1) the (countably) infinite number of tasks under consideration; (… ▽ More

    Submitted 24 April, 2023; v1 submitted 20 December, 2021; originally announced December 2021.

  27. arXiv:2112.00206  [pdf, other

    cs.CV cs.AI cs.PL cs.RO

    Querying Labelled Data with Scenario Programs for Sim-to-Real Validation

    Authors: Edward Kim, Jay Shenoy, Sebastian Junges, Daniel Fremont, Alberto Sangiovanni-Vincentelli, Sanjit Seshia

    Abstract: Simulation-based testing of autonomous vehicles (AVs) has become an essential complement to road testing to ensure safety. Consequently, substantial research has focused on searching for failure scenarios in simulation. However, a fundamental question remains: are AV failure scenarios identified in simulation meaningful in reality, i.e., are they reproducible on the real system? Due to the sim-to-… ▽ More

    Submitted 30 November, 2021; originally announced December 2021.

    Comments: pre-print

  28. arXiv:2110.14870  [pdf, other

    cs.AI

    A Scenario-Based Platform for Testing Autonomous Vehicle Behavior Prediction Models in Simulation

    Authors: Francis Indaheng, Edward Kim, Kesav Viswanadha, Jay Shenoy, **kyu Kim, Daniel J. Fremont, Sanjit A. Seshia

    Abstract: Behavior prediction remains one of the most challenging tasks in the autonomous vehicle (AV) software stack. Forecasting the future trajectories of nearby agents plays a critical role in ensuring road safety, as it equips AVs with the necessary information to plan safe routes of travel. However, these prediction models are data-driven and trained on data collected in real life that may not represe… ▽ More

    Submitted 13 November, 2021; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: Accepted to the NeurIPS 2021 Workshop on Machine Learning for Autonomous Driving

  29. arXiv:2108.13796  [pdf, other

    cs.SE cs.AI

    Addressing the IEEE AV Test Challenge with Scenic and VerifAI

    Authors: Kesav Viswanadha, Francis Indaheng, Justin Wong, Edward Kim, Ellen Kalvan, Yash Pant, Daniel J. Fremont, Sanjit A. Seshia

    Abstract: This paper summarizes our formal approach to testing autonomous vehicles (AVs) in simulation for the IEEE AV Test Challenge. We demonstrate a systematic testing framework leveraging our previous work on formally-driven simulation for intelligent cyber-physical systems. First, to model and generate interactive scenarios involving multiple agents, we used Scenic, a probabilistic programming language… ▽ More

    Submitted 20 August, 2021; originally announced August 2021.

    Comments: Accepted to the IEEE AITest Conference 2021

  30. arXiv:2108.07307  [pdf, ps, other

    cs.LG cs.AI cs.LO

    Synthesizing Pareto-Optimal Interpretations for Black-Box Models

    Authors: Hazem Torfah, Shetal Shah, Supratik Chakraborty, S. Akshay, Sanjit A. Seshia

    Abstract: We present a new multi-objective optimization approach for synthesizing interpretations that "explain" the behavior of black-box machine learning models. Constructing human-understandable interpretations for black-box models often requires balancing conflicting objectives. A simple interpretation may be easier to understand for humans while being less precise in its predictions vis-a-vis a complex… ▽ More

    Submitted 16 August, 2021; originally announced August 2021.

    Comments: Long version of conference paper accepted at FMCAD'21

  31. arXiv:2107.13477  [pdf, other

    cs.LO cs.LG cs.PL

    Satisfiability and Synthesis Modulo Oracles

    Authors: Elizabeth Polgreen, Andrew Reynolds, Sanjit A. Seshia

    Abstract: In classic program synthesis algorithms, such as counterexample-guided inductive synthesis (CEGIS), the algorithms alternate between a synthesis phase and an oracle (verification) phase. Many synthesis algorithms use a white-box oracle based on satisfiability modulo theory (SMT) solvers to provide counterexamples. But what if a white-box oracle is either not available or not easy to work with? We… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

    Comments: 12 pages, 8 Figures

  32. arXiv:2107.04164  [pdf, other

    cs.AI cs.SE

    Parallel and Multi-Objective Falsification with Scenic and VerifAI

    Authors: Kesav Viswanadha, Edward Kim, Francis Indaheng, Daniel J. Fremont, Sanjit A. Seshia

    Abstract: Falsification has emerged as an important tool for simulation-based verification of autonomous systems. In this paper, we present extensions to the Scenic scenario specification language and VerifAI toolkit that improve the scalability of sampling-based falsification methods by using parallelism and extend falsification to multi-objective specifications. We first present a parallelized framework t… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

  33. arXiv:2106.10365  [pdf, other

    cs.LG cs.AI

    Scenic4RL: Programmatic Modeling and Generation of Reinforcement Learning Environments

    Authors: Abdus Salam Azad, Edward Kim, Qiancheng Wu, Kimin Lee, Ion Stoica, Pieter Abbeel, Sanjit A. Seshia

    Abstract: The capability of a reinforcement learning (RL) agent heavily depends on the diversity of the learning scenarios generated by the environment. Generation of diverse realistic scenarios is challenging for real-time strategy (RTS) environments. The RTS environments are characterized by intelligent entities/non-RL agents cooperating and competing with the RL agents with large state and action spaces… ▽ More

    Submitted 28 March, 2023; v1 submitted 18 June, 2021; originally announced June 2021.

    Comments: First two authors contributed equally. The final version of this paper is accepted at Proceedings of the AAAI Conference on Artificial Intelligence, 36(6), 6028-6036. https://doi.org/10.1609/aaai.v36i6.20549

  34. Hypercontracts

    Authors: Inigo Incer, Albert Benveniste, Alberto Sangiovanni-Vincentelli, Sanjit A. Seshia

    Abstract: Contract theories have been proposed to formally support distributed and decentralized system design while ensuring safe system integration. In this paper we propose hypercontracts, a generic model with a richer structure for its underlying model of components, subsuming simulation preorders. While this new model remains generic, it provides a much more elegant and richer algebra for its key notio… ▽ More

    Submitted 8 October, 2021; v1 submitted 26 May, 2021; originally announced June 2021.

    Journal ref: NFM 2022. Lecture Notes in Computer Science, vol 13260

  35. arXiv:2105.12326  [pdf, other

    cs.LO

    Model Checking Finite-Horizon Markov Chains with Probabilistic Inference

    Authors: Steven Holtzen, Sebastian Junges, Marcell Vazquez-Chanlatte, Todd Millstein, Sanjit A. Seshia, Guy Van Den Broeck

    Abstract: We revisit the symbolic verification of Markov chains with respect to finite horizon reachability properties. The prevalent approach iteratively computes step-bounded state reachability probabilities. By contrast, recent advances in probabilistic inference suggest symbolically representing all horizon-length paths through the Markov chain. We ask whether this perspective advances the state-of-the-… ▽ More

    Submitted 30 June, 2021; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: Technical Report. Accepted at CAV 2021

  36. arXiv:2105.12322  [pdf, other

    cs.LO cs.RO

    Runtime Monitoring for Markov Decision Processes

    Authors: Sebastian Junges, Hazem Torfah, Sanjit A. Seshia

    Abstract: We investigate the problem of monitoring partially observable systems with nondeterministic and probabilistic dynamics. In such systems, every state may be associated with a risk, e.g., the probability of an imminent crash. During runtime, we obtain partial information about the system state in form of observations. The monitor uses this information to estimate the risk of the (unobservable) curre… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

    Comments: Technical report with appendix. Accepted at CAV

  37. arXiv:2103.05672  [pdf, other

    cs.RO cs.FL

    Entropy-Guided Control Improvisation

    Authors: Marcell Vazquez-Chanlatte, Sebastian Junges, Daniel J. Fremont, Sanjit Seshia

    Abstract: High level declarative constraints provide a powerful (and popular) way to define and construct control policies; however, most synthesis algorithms do not support specifying the degree of randomness (unpredictability) of the resulting controller. In many contexts, e.g., patrolling, testing, behavior prediction,and planning on idealized models, predictable or biased controllers are undesirable. To… ▽ More

    Submitted 28 June, 2021; v1 submitted 9 March, 2021; originally announced March 2021.

    Comments: RSS 21

  38. arXiv:2011.14551  [pdf, other

    cs.AI cs.RO

    A Customizable Dynamic Scenario Modeling and Data Generation Platform for Autonomous Driving

    Authors: Jay Shenoy, Edward Kim, Xiangyu Yue, Taesung Park, Daniel Fremont, Alberto Sangiovanni-Vincentelli, Sanjit Seshia

    Abstract: Safely interacting with humans is a significant challenge for autonomous driving. The performance of this interaction depends on machine learning-based modules of an autopilot, such as perception, behavior prediction, and planning. These modules require training datasets with high-quality labels and a diverse range of realistic dynamic behaviors. Consequently, training such modules to handle rare… ▽ More

    Submitted 30 November, 2020; originally announced November 2020.

  39. arXiv:2010.06580  [pdf, other

    cs.PL cs.CV cs.LG

    Scenic: A Language for Scenario Specification and Data Generation

    Authors: Daniel J. Fremont, Edward Kim, Tommaso Dreossi, Shromona Ghosh, Xiangyu Yue, Alberto L. Sangiovanni-Vincentelli, Sanjit A. Seshia

    Abstract: We propose a new probabilistic programming language for the design and analysis of cyber-physical systems, especially those based on machine learning. Specifically, we consider the problems of training a system to be robust to rare events, testing its performance under different conditions, and debugging failures. We show how a probabilistic programming language can help address these problems by… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: Supercedes arXiv:1809.09310

  40. arXiv:2009.14363  [pdf, other

    eess.SY cs.MA cs.RO

    Co-design of Control and Planning for Multi-rotor UAVs with Signal Temporal Logic Specifications

    Authors: Yash Vardhan Pant, He Yin, Murat Arcak, Sanjit A. Seshia

    Abstract: Urban Air Mobility (UAM), or the scenario where multiple manned and Unmanned Aerial Vehicles (UAVs) carry out various tasks over urban airspaces, is a transportation concept of the future that is gaining prominence. UAM missions with complex spatial, temporal and reactive requirements can be succinctly represented using Signal Temporal Logic (STL), a behavioral specification language. However, pla… ▽ More

    Submitted 29 September, 2020; originally announced September 2020.

  41. arXiv:2009.00155  [pdf, other

    cs.CV cs.LG eess.IV

    A Review of Single-Source Deep Unsupervised Visual Domain Adaptation

    Authors: Sicheng Zhao, Xiangyu Yue, Shanghang Zhang, Bo Li, Han Zhao, Bichen Wu, Ravi Krishna, Joseph E. Gonzalez, Alberto L. Sangiovanni-Vincentelli, Sanjit A. Seshia, Kurt Keutzer

    Abstract: Large-scale labeled training datasets have enabled deep neural networks to excel across a wide range of benchmark vision tasks. However, in many applications, it is prohibitively expensive and time-consuming to obtain large quantities of labeled data. To cope with limited labeled training data, many have attempted to directly apply models trained on a large-scale labeled source domain to another s… ▽ More

    Submitted 18 September, 2020; v1 submitted 31 August, 2020; originally announced September 2020.

  42. arXiv:2008.09707  [pdf, other

    cs.RO cs.AI cs.PL

    SOTER on ROS: A Run-Time Assurance Framework on the Robot Operating System

    Authors: Sumukh Shivakumar, Hazem Torfah, Ankush Desai, Sanjit A. Seshia

    Abstract: We present an implementation of SOTER, a run-time assurance framework for building safe distributed mobile robotic (DMR) systems, on top of the Robot Operating System (ROS). The safety of DMR systems cannot always be guaranteed at design time, especially when complex, off-the-shelf components are used that cannot be verified easily. SOTER addresses this by providing a language-based approach for r… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

    Comments: 20th International Conference on Runtime Verification

  43. arXiv:2007.10519  [pdf, ps, other

    cs.LO

    SynRG: Syntax Guided Synthesis of Expressions with Alternating Quantifiers

    Authors: Elizabeth Polgreen, Sanjit A. Seshia

    Abstract: Program synthesis is the task of automatically generating expressions that satisfy a given specification. Program synthesis techniques have been used to automate the generation of loop invariants in code, synthesize function summaries, and to assist programmers via program sketching. Syntax-guided synthesis has been a successful paradigm in this area, however, one area where the state-of-the-art s… ▽ More

    Submitted 12 October, 2020; v1 submitted 20 July, 2020; originally announced July 2020.

    Comments: 20 pages

  44. arXiv:2007.06760  [pdf, ps, other

    cs.PL

    Synthesis in Uclid5

    Authors: Federico Mora, Kevin Cheang, Elizabeth Polgreen, Sanjit A. Seshia

    Abstract: We describe an integration of program synthesis into Uclid5, a formal modelling and verification tool. To the best of our knowledge, the new version of Uclid5 is the only tool that supports program synthesis with bounded model checking, k-induction, sequential program verification, and hyperproperty verification. We use the integration to generate 25 program synthesis benchmarks with simple, known… ▽ More

    Submitted 16 July, 2020; v1 submitted 13 July, 2020; originally announced July 2020.

  45. arXiv:2007.06677  [pdf, ps, other

    cs.SE cs.AI cs.LG cs.PL

    Gradient Descent over Metagrammars for Syntax-Guided Synthesis

    Authors: Nicolas Chan, Elizabeth Polgreen, Sanjit A. Seshia

    Abstract: The performance of a syntax-guided synthesis algorithm is highly dependent on the provision of a good syntactic template, or grammar. Provision of such a template is often left to the user to do manually, though in the absence of such a grammar, state-of-the-art solvers will provide their own default grammar, which is dependent on the signature of the target program to be sythesized. In this work,… ▽ More

    Submitted 16 July, 2020; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: 5 pages, SYNT 2020

  46. arXiv:2007.03204  [pdf, other

    cs.LG cs.AI cs.LO stat.ML

    Learning Branching Heuristics for Propositional Model Counting

    Authors: Pashootan Vaezipoor, Gil Lederman, Yuhuai Wu, Chris J. Maddison, Roger Grosse, Sanjit A. Seshia, Fahiem Bacchus

    Abstract: Propositional model counting, or #SAT, is the problem of computing the number of satisfying assignments of a Boolean formula. Many problems from different application areas, including many discrete probabilistic inference problems, can be translated into model counting problems to be solved by #SAT solvers. Exact #SAT solvers, however, are often not scalable to industrial size instances. In this p… ▽ More

    Submitted 8 September, 2022; v1 submitted 7 July, 2020; originally announced July 2020.

    Journal ref: 35(14), 2021, 12427-12435

  47. arXiv:2007.00085  [pdf, other

    cs.AI cs.RO eess.SY

    Enforcing Almost-Sure Reachability in POMDPs

    Authors: Sebastian Junges, Nils Jansen, Sanjit A. Seshia

    Abstract: Partially-Observable Markov Decision Processes (POMDPs) are a well-known stochastic model for sequential decision making under limited information. We consider the EXPTIME-hard problem of synthesising policies that almost-surely reach some goal state without ever visiting a bad state. In particular, we are interested in computing the winning region, that is, the set of system configurations from w… ▽ More

    Submitted 18 March, 2021; v1 submitted 30 June, 2020; originally announced July 2020.

  48. arXiv:2005.07173  [pdf, other

    cs.LG cs.PL eess.SY stat.ML

    Formal Analysis and Redesign of a Neural Network-Based Aircraft Taxiing System with VerifAI

    Authors: Daniel J. Fremont, Johnathan Chiu, Dragos D. Margineantu, Denis Osipychev, Sanjit A. Seshia

    Abstract: We demonstrate a unified approach to rigorous design of safety-critical autonomous systems using the VerifAI toolkit for formal analysis of AI-based systems. VerifAI provides an integrated toolchain for tasks spanning the design process, including modeling, falsification, debugging, and ML component retraining. We evaluate all of these applications in an industrial case study on an experimental au… ▽ More

    Submitted 14 May, 2020; originally announced May 2020.

    Comments: Full version of a CAV 2020 paper

  49. arXiv:2003.07739  [pdf, other

    eess.SY cs.LG cs.LO cs.SE

    Formal Scenario-Based Testing of Autonomous Vehicles: From Simulation to the Real World

    Authors: Daniel J. Fremont, Edward Kim, Yash Vardhan Pant, Sanjit A. Seshia, Atul Acharya, Xantha Bruso, Paul Wells, Steve Lemke, Qiang Lu, Shalin Mehta

    Abstract: We present a new approach to automated scenario-based testing of the safety of autonomous vehicles, especially those using advanced artificial intelligence-based components, spanning both simulation-based evaluation as well as testing in the real world. Our approach is based on formal methods, combining formal specification of scenarios and safety properties, algorithmic test case generation using… ▽ More

    Submitted 12 July, 2020; v1 submitted 17 March, 2020; originally announced March 2020.

    Comments: 9 pages, 6 figures. Full version of an ITSC 2020 paper

    ACM Class: I.2.9; D.2.4; D.2.5

  50. arXiv:1912.00289  [pdf, other

    cs.CV

    A Programmatic and Semantic Approach to Explaining and DebuggingNeural Network Based Object Detectors

    Authors: Edward Kim, Divya Gopinath, Corina Pasareanu, Sanjit Seshia

    Abstract: Even as deep neural networks have become very effective for tasks in vision and perception, it remains difficult to explain and debug their behavior. In this paper, we present a programmatic and semantic approach to explaining, understanding, and debugging the correct and incorrect behaviors of a neural network-based perception system. Our approach is semantic in that it employs a high-level repre… ▽ More

    Submitted 16 June, 2020; v1 submitted 30 November, 2019; originally announced December 2019.

    Journal ref: CVPR (2020)