Skip to main content

Showing 1–50 of 60 results for author: Solar-Lezama, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01567  [pdf, other

    cs.RO cs.LG

    MeMo: Meaningful, Modular Controllers via Noise Injection

    Authors: Megan Tjandrasuwita, Jie Xu, Armando Solar-Lezama, Wojciech Matusik

    Abstract: Robots are often built from standardized assemblies, (e.g. arms, legs, or fingers), but each robot must be trained from scratch to control all the actuators of all the parts together. In this paper we demonstrate a new approach that takes a single robot and its controller as input and produces a set of modular controllers for each of these assemblies such that when a new robot is built from the sa… ▽ More

    Submitted 24 May, 2024; originally announced July 2024.

  2. arXiv:2406.07897  [pdf, other

    cs.LG cs.AI

    When Do Skills Help Reinforcement Learning? A Theoretical Analysis of Temporal Abstractions

    Authors: Zhening Li, Gabriel Poesia, Armando Solar-Lezama

    Abstract: Skills are temporal abstractions that are intended to improve reinforcement learning (RL) performance through hierarchical RL. Despite our intuition about the properties of an environment that make skills useful, a precise characterization has been absent. We provide the first such characterization, focusing on the utility of deterministic skills in deterministic sparse-reward environments with fi… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 29 pages, 1 figure. Accepted to ICML 2024

  3. arXiv:2403.07974  [pdf, other

    cs.SE cs.CL cs.LG

    LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

    Authors: Naman Jain, King Han, Alex Gu, Wen-Ding Li, Fanjia Yan, Tianjun Zhang, Sida Wang, Armando Solar-Lezama, Koushik Sen, Ion Stoica

    Abstract: Large Language Models (LLMs) applied to code-related applications have emerged as a prominent field, attracting significant interest from both academia and industry. However, as new and improved LLMs are developed, existing evaluation benchmarks (e.g., HumanEval, MBPP) are no longer sufficient for assessing their capabilities. In this work, we propose LiveCodeBench, a comprehensive and contaminati… ▽ More

    Submitted 6 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Website - https://livecodebench.github.io/

  4. arXiv:2402.19475  [pdf, other

    cs.SE cs.AI cs.LG

    The Counterfeit Conundrum: Can Code Language Models Grasp the Nuances of Their Incorrect Generations?

    Authors: Alex Gu, Wen-Ding Li, Naman Jain, Theo X. Olausson, Celine Lee, Koushik Sen, Armando Solar-Lezama

    Abstract: While language models are increasingly more proficient at code generation, they still frequently generate incorrect programs. Many of these programs are obviously wrong, but others are more subtle and pass weaker correctness checks such as being able to compile. In this work, we focus on these counterfeit samples: programs sampled from a language model that 1) have a high enough log-probability to… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 54 pages, 25 figures

  5. arXiv:2401.03065  [pdf, other

    cs.SE cs.AI cs.LG

    CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution

    Authors: Alex Gu, Baptiste Rozière, Hugh Leather, Armando Solar-Lezama, Gabriel Synnaeve, Sida I. Wang

    Abstract: We present CRUXEval (Code Reasoning, Understanding, and eXecution Evaluation), a benchmark consisting of 800 Python functions (3-13 lines). Each function comes with an input-output pair, leading to two natural tasks: input prediction and output prediction. First, we propose a generic recipe for generating our execution benchmark which can be used to create future variation of the benchmark. Second… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 71 pages, 29 figures

  6. LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers

    Authors: Theo X. Olausson, Alex Gu, Benjamin Lipkin, Cedegao E. Zhang, Armando Solar-Lezama, Joshua B. Tenenbaum, Roger Levy

    Abstract: Logical reasoning, i.e., deductively inferring the truth value of a conclusion from a set of premises, is an important task for artificial intelligence with wide potential impacts on science, mathematics, and society. While many prompting-based strategies have been proposed to enable Large Language Models (LLMs) to do such reasoning more effectively, they still appear unsatisfactory, often failing… ▽ More

    Submitted 14 February, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP Main 2023 (Outstanding Paper Award)

    Journal ref: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 5153-5176, Singapore. Association for Computational Linguistics

  7. arXiv:2310.11614  [pdf, other

    cs.AI

    Learning a Hierarchical Planner from Humans in Multiple Generations

    Authors: Leonardo Hernandez Cano, Yewen Pu, Robert D. Hawkins, Josh Tenenbaum, Armando Solar-Lezama

    Abstract: A typical way in which a machine acquires knowledge from humans is by programming. Compared to learning from demonstrations or experiences, programmatic learning allows the machine to acquire a novel skill as soon as the program is written, and, by building a library of programs, a machine can quickly learn how to perform complex tasks. However, as programs often take their execution contexts for… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: First two authors contributed equally

  8. arXiv:2306.09896  [pdf, other

    cs.CL cs.AI cs.PL cs.SE

    Is Self-Repair a Silver Bullet for Code Generation?

    Authors: Theo X. Olausson, Jeevana Priya Inala, Chenglong Wang, Jianfeng Gao, Armando Solar-Lezama

    Abstract: Large language models have shown remarkable aptitude in code generation, but still struggle to perform complex tasks. Self-repair -- in which the model debugs and repairs its own code -- has recently become a popular way to boost performance in these settings. However, despite its increasing popularity, existing studies of self-repair have been limited in scope; in many settings, its efficacy thus… ▽ More

    Submitted 2 February, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: Accepted to ICLR 2024. Added additional Code Llama experiments and fixed a data processing error harming Code Llama's reported self-repair performance on HumanEval

  9. arXiv:2306.08997   

    cs.CL cs.AI cs.LG

    Exploring the MIT Mathematics and EECS Curriculum Using Large Language Models

    Authors: Sarah J. Zhang, Samuel Florin, Ariel N. Lee, Eamon Niknafs, Andrei Marginean, Annie Wang, Keith Tyser, Zad Chin, Yann Hicke, Nikhil Singh, Madeleine Udell, Yoon Kim, Tonio Buonassisi, Armando Solar-Lezama, Iddo Drori

    Abstract: We curate a comprehensive dataset of 4,550 questions and solutions from problem sets, midterm exams, and final exams across all MIT Mathematics and Electrical Engineering and Computer Science (EECS) courses required for obtaining a degree. We evaluate the ability of large language models to fulfill the graduation requirements for any MIT major in Mathematics and EECS. Our results demonstrate that… ▽ More

    Submitted 24 June, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Did not receive permission to release the data or model fine-tuned on the data

  10. arXiv:2302.01976  [pdf, other

    cs.LG

    SPARLING: Learning Latent Representations with Extremely Sparse Activations

    Authors: Kavi Gupta, Osbert Bastani, Armando Solar-Lezama

    Abstract: Real-world processes often contain intermediate state that can be modeled as an extremely sparse tensor. We introduce Sparling, a technique that allows you to learn models with intermediate layers that match this state from only end-to-end labeled examples (i.e., no supervision on the intermediate state). Sparling uses a new kind of informational bottleneck that enforces levels of activation spars… ▽ More

    Submitted 20 October, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: 10 pages, 6 figures

  11. arXiv:2211.16605  [pdf, other

    cs.PL cs.AI

    Top-Down Synthesis for Library Learning

    Authors: Matthew Bowers, Theo X. Olausson, Lionel Wong, Gabriel Grand, Joshua B. Tenenbaum, Kevin Ellis, Armando Solar-Lezama

    Abstract: This paper introduces corpus-guided top-down synthesis as a mechanism for synthesizing library functions that capture common functionality from a corpus of programs in a domain specific language (DSL). The algorithm builds abstractions directly from initial DSL primitives, using syntactic pattern matching of intermediate abstractions to intelligently prune the search space and guide the algorithm… ▽ More

    Submitted 15 January, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: Published at POPL 2023

    Journal ref: Proc. ACM Program. Lang. 7, POPL, Article 41 (January 2023), pp 1182-1213

  12. arXiv:2211.12112  [pdf, other

    cs.CV cs.AI cs.LG

    Human Evaluation of Text-to-Image Models on a Multi-Task Benchmark

    Authors: Vitali Petsiuk, Alexander E. Siemenn, Saisamrit Surbehera, Zad Chin, Keith Tyser, Gregory Hunter, Arvind Raghavan, Yann Hicke, Bryan A. Plummer, Ori Kerret, Tonio Buonassisi, Kate Saenko, Armando Solar-Lezama, Iddo Drori

    Abstract: We provide a new multi-task benchmark for evaluating text-to-image models. We perform a human evaluation comparing the most common open-source (Stable Diffusion) and commercial (DALL-E 2) models. Twenty computer science AI graduate students evaluated the two models, on three tasks, at three difficulty levels, across ten prompts each, providing 3,600 ratings. Text-to-image generation has seen rapid… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022 Workshop on Human Evaluation of Generative Models (HEGM)

  13. arXiv:2211.08671  [pdf, other

    cs.AI

    LEMMA: Bootstrap** High-Level Mathematical Reasoning with Learned Symbolic Abstractions

    Authors: Zhening Li, Gabriel Poesia, Omar Costilla-Reyes, Noah Goodman, Armando Solar-Lezama

    Abstract: Humans tame the complexity of mathematical reasoning by develo** hierarchies of abstractions. With proper abstractions, solutions to hard problems can be expressed concisely, thus making them more likely to be found. In this paper, we propose Learning Mathematical Abstractions (LEMMA): an algorithm that implements this idea for reinforcement learning agents in mathematical domains. LEMMA augment… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: 10 pages, 2 figures; to appear in 2nd MATH-AI Workshop at NeurIPS'22

  14. arXiv:2210.11468  [pdf, other

    cs.SE cs.HC cs.LG

    ObSynth: An Interactive Synthesis System for Generating Object Models from Natural Language Specifications

    Authors: Alex Gu, Tamara Mitrovska, Daniela Velez, Jacob Andreas, Armando Solar-Lezama

    Abstract: We introduce ObSynth, an interactive system leveraging the domain knowledge embedded in large language models (LLMs) to help users design object models from high level natural language prompts. This is an example of specification reification, the process of taking a high-level, potentially vague specification and reifying it into a more concrete form. We evaluate ObSynth via a user study, leading… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: 25 pages, 15 figures

  15. arXiv:2210.05050  [pdf, other

    cs.AI

    Neurosymbolic Programming for Science

    Authors: Jennifer J. Sun, Megan Tjandrasuwita, Atharva Sehgal, Armando Solar-Lezama, Swarat Chaudhuri, Yisong Yue, Omar Costilla-Reyes

    Abstract: Neurosymbolic Programming (NP) techniques have the potential to accelerate scientific discovery. These models combine neural and symbolic components to learn complex patterns and representations from data, using high-level concepts or known constraints. NP techniques can interface with symbolic domain knowledge from scientists, such as prior knowledge and experimental context, to produce interpret… ▽ More

    Submitted 7 November, 2022; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: Neural Information Processing Systems 2022 - AI for science workshop

  16. arXiv:2206.07828  [pdf, other

    cs.PL

    Searching Entangled Program Spaces

    Authors: James Koppel, Zheng Guo, Edsko de Vries, Armando Solar-Lezama, Nadia Polikarpova

    Abstract: Many problem domains, including program synthesis and rewrite-based optimization, require searching astronomically large spaces of programs. Existing approaches often rely on building specialized data structures -- version-space algebras, finite tree automata, or e-graphs -- to compactly represent these programs. To find a compact representation, existing data structures exploit independence of su… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

  17. arXiv:2206.06164  [pdf, other

    cs.PL

    Metric Program Synthesis

    Authors: John Feser, Isil Dillig, Armando Solar-Lezama

    Abstract: We present a new domain-agnostic synthesis technique for generating programs from input-output examples. Our method, called metric program synthesis, relaxes the well-known observational equivalence idea (used widely in bottom-up enumerative synthesis) into a weaker notion of observational similarity, with the goal of reducing the search space that the synthesizer needs to explore. Our method clus… ▽ More

    Submitted 10 October, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

  18. arXiv:2111.12772  [pdf, other

    cs.LG cs.CV cs.GR

    JoinABLe: Learning Bottom-up Assembly of Parametric CAD Joints

    Authors: Karl D. D. Willis, Pradeep Kumar Jayaraman, Hang Chu, Yunsheng Tian, Yifei Li, Daniele Grandi, Aditya Sanghi, Linh Tran, Joseph G. Lambourne, Armando Solar-Lezama, Wojciech Matusik

    Abstract: Physical products are often complex assemblies combining a multitude of 3D parts modeled in computer-aided design (CAD) software. CAD designers build up these assemblies by aligning individual parts to one another using constraints called joints. In this paper we introduce JoinABLe, a learning-based method that assembles parts together to form joints. JoinABLe uses the weak supervision available i… ▽ More

    Submitted 22 April, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

    Comments: CVPR 2022; code available at https://github.com/AutodeskAILab/JoinABLe and data available at https://github.com/AutodeskAILab/Fusion360GalleryDataset

  19. arXiv:2110.05440  [pdf, other

    cs.RO

    Safe Human-Interactive Control via Shielding

    Authors: Jeevana Priya Inala, Yecheng Jason Ma, Osbert Bastani, Xin Zhang, Armando Solar-Lezama

    Abstract: Ensuring safety for human-interactive robotics is important due to the potential for human injury. The key challenge is defining safety in a way that accounts for the complex range of human behaviors without modeling the human as an unconstrained adversary. We propose a novel approach to ensuring safety in these settings. Our approach focuses on defining backup actions that we believe human always… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

  20. arXiv:2102.11137  [pdf, other

    cs.AI

    Program Synthesis Guided Reinforcement Learning for Partially Observed Environments

    Authors: Yichen David Yang, Jeevana Priya Inala, Osbert Bastani, Yewen Pu, Armando Solar-Lezama, Martin Rinard

    Abstract: A key challenge for reinforcement learning is solving long-horizon planning problems. Recent work has leveraged programs to guide reinforcement learning in these settings. However, these approaches impose a high manual burden on the user since they must provide a guiding program for every new task. Partially observed environments further complicate the programming task because the program must imp… ▽ More

    Submitted 1 November, 2021; v1 submitted 22 February, 2021; originally announced February 2021.

    Journal ref: NeurIPS 2021

  21. arXiv:2101.03238  [pdf, other

    cs.MA cs.LG cs.PL

    Neurosymbolic Transformers for Multi-Agent Communication

    Authors: Jeevana Priya Inala, Yichen Yang, James Paulos, Yewen Pu, Osbert Bastani, Vijay Kumar, Martin Rinard, Armando Solar-Lezama

    Abstract: We study the problem of inferring communication structures that can solve cooperative multi-agent planning problems while minimizing the amount of communication. We quantify the amount of communication as the maximum degree of the communication graph; this metric captures settings where agents have limited bandwidth. Minimizing communication is challenging due to the combinatorial nature of both t… ▽ More

    Submitted 4 January, 2021; originally announced January 2021.

    Journal ref: NeurIPS 2020

  22. arXiv:2012.12964  [pdf, other

    cs.PL cs.AI cs.LG

    Representing Partial Programs with Blended Abstract Semantics

    Authors: Maxwell Nye, Yewen Pu, Matthew Bowers, Jacob Andreas, Joshua B. Tenenbaum, Armando Solar-Lezama

    Abstract: Synthesizing programs from examples requires searching over a vast, combinatorial space of possible programs. In this search process, a key challenge is representing the behavior of a partially written program before it can be executed, to judge if it is on the right track and predict where to search next. We introduce a general technique for representing partially written programs in a program sy… ▽ More

    Submitted 19 April, 2021; v1 submitted 23 December, 2020; originally announced December 2020.

    Comments: ICLR 2021

  23. Automatically Deriving Control-Flow Graph Generators from Operational Semantics

    Authors: James Koppel, Jackson Kearl, Armando Solar-Lezama

    Abstract: We develop the first theory of control-flow graphs from first principles, and use it to create an algorithm for automatically synthesizing many variants of control-flow graph generators from a language's operational semantics. Our approach first introduces a new algorithm for converting a large class of small-step operational semantics to an abstract machine. It next uses a technique called "abstr… ▽ More

    Submitted 22 July, 2022; v1 submitted 10 October, 2020; originally announced October 2020.

  24. arXiv:2010.02392  [pdf, other

    cs.LG cs.CV cs.GR

    Fusion 360 Gallery: A Dataset and Environment for Programmatic CAD Construction from Human Design Sequences

    Authors: Karl D. D. Willis, Yewen Pu, Jieliang Luo, Hang Chu, Tao Du, Joseph G. Lambourne, Armando Solar-Lezama, Wojciech Matusik

    Abstract: Parametric computer-aided design (CAD) is a standard paradigm used to design manufactured objects, where a 3D shape is represented as a program supported by the CAD software. Despite the pervasiveness of parametric CAD and a growing interest from the research community, currently there does not exist a dataset of realistic CAD models in a concise programmatic form. In this paper we present the Fus… ▽ More

    Submitted 16 May, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: Accepted to SIGGRAPH 2021; data/code available at https://github.com/AutodeskAILab/Fusion360GalleryDataset

  25. arXiv:2007.05060  [pdf, other

    cs.AI cs.SE

    Program Synthesis with Pragmatic Communication

    Authors: Yewen Pu, Kevin Ellis, Marta Kryven, Josh Tenenbaum, Armando Solar-Lezama

    Abstract: Program synthesis techniques construct or infer programs from user-provided specifications, such as input-output examples. Yet most specifications, especially those given by end-users, leave the synthesis problem radically ill-posed, because many programs may simultaneously satisfy the specification. Prior work resolves this ambiguity by using various inductive biases, such as a preference for sim… ▽ More

    Submitted 20 October, 2020; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: The second author and the third author contributed equally to this work

    ACM Class: I.2.2; D.3.0

  26. arXiv:2007.03704  [pdf

    cs.CY cs.HC

    Computer-Aided Personalized Education

    Authors: Rajeev Alur, Richard Baraniuk, Rastislav Bodik, Ann Drobnis, Sumit Gulwani, Bjoern Hartmann, Yasmin Kafai, Jeff Karpicke, Ran Libeskind-Hadas, Debra Richardson, Armando Solar-Lezama, Candace Thille, Moshe Vardi

    Abstract: The shortage of people trained in STEM fields is becoming acute, and universities and colleges are straining to satisfy this demand. In the case of computer science, for instance, the number of US students taking introductory courses has grown three-fold in the past decade. Recently, massive open online courses (MOOCs) have been promoted as a way to ease this strain. This at best provides access t… ▽ More

    Submitted 7 July, 2020; originally announced July 2020.

    Comments: A Computing Community Consortium (CCC) workshop report, 12 pages

    Report number: ccc2016report_6

  27. arXiv:2007.01223  [pdf, other

    cs.AI cs.LG cs.LO

    Verifiably Safe Exploration for End-to-End Reinforcement Learning

    Authors: Nathan Hunt, Nathan Fulton, Sara Magliacane, Nghia Hoang, Subhro Das, Armando Solar-Lezama

    Abstract: Deploying deep reinforcement learning in safety-critical settings requires develo** algorithms that obey hard constraints during exploration. This paper contributes a first approach toward enforcing formal safety constraints on end-to-end policies with visual inputs. Our approach draws on recent advances in object detection and automated reasoning for hybrid dynamical systems. The approach is ev… ▽ More

    Submitted 2 July, 2020; originally announced July 2020.

    ACM Class: F.3.1; I.2.8

  28. arXiv:2006.08381  [pdf, other

    cs.AI cs.LG

    DreamCoder: Growing generalizable, interpretable knowledge with wake-sleep Bayesian program learning

    Authors: Kevin Ellis, Catherine Wong, Maxwell Nye, Mathias Sable-Meyer, Luc Cary, Lucas Morales, Luke Hewitt, Armando Solar-Lezama, Joshua B. Tenenbaum

    Abstract: Expert problem-solving is driven by powerful languages for thinking about problems and their solutions. Acquiring expertise means learning these languages -- systems of concepts, alongside the skills to use them. We present DreamCoder, a system that learns to solve problems by writing programs. It builds expertise by creating programming languages for expressing domain concepts, together with neur… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

  29. arXiv:2003.05562  [pdf, other

    cs.AI cs.LG

    Learning Compositional Rules via Neural Program Synthesis

    Authors: Maxwell I. Nye, Armando Solar-Lezama, Joshua B. Tenenbaum, Brenden M. Lake

    Abstract: Many aspects of human reasoning, including language, require learning rules from very little data. Humans can do this, often learning systematic rules from very few examples, and combining these rules to form compositional rule-based systems. Current neural architectures, on the other hand, often fail to generalize in a compositional manner, especially when evaluated in ways that vary systematical… ▽ More

    Submitted 22 October, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

    Comments: NeurIPS 2020. Code can be found at https://github.com/mtensor/rulesynthesis

  30. arXiv:1912.12659  [pdf, other

    cs.PL

    Synthesizing Queries via Interactive Sketching

    Authors: Osbert Bastani, Xin Zhang, Armando Solar-Lezama

    Abstract: We propose a novel approach to program synthesis, focusing on synthesizing database queries. At a high level, our proposed algorithm takes as input a sketch with soft constraints encoding user intent, and then iteratively interacts with the user to refine the sketch. At each step, our algorithm proposes a candidate refinement of the sketch, which the user can either accept or reject. By leveraging… ▽ More

    Submitted 11 October, 2021; v1 submitted 29 December, 2019; originally announced December 2019.

  31. arXiv:1906.04604  [pdf, other

    cs.PL cs.AI cs.LG cs.SE

    Write, Execute, Assess: Program Synthesis with a REPL

    Authors: Kevin Ellis, Maxwell Nye, Yewen Pu, Felix Sosa, Josh Tenenbaum, Armando Solar-Lezama

    Abstract: We present a neural program synthesis approach integrating components which write, execute, and assess code to navigate the search space of possible programs. We equip the search process with an interpreter or a read-eval-print-loop (REPL), which immediately executes partially written programs, exposing their semantics. The REPL addresses a basic challenge of program synthesis: tiny changes in syn… ▽ More

    Submitted 9 June, 2019; originally announced June 2019.

    Comments: The first four authors contributed equally to this work

  32. arXiv:1903.03229  [pdf, other

    cs.PL cs.DB

    Deductive Optimization of Relational Data Storage

    Authors: John K. Feser, Samuel Madden, Nan Tang, Armando Solar-Lezama

    Abstract: Optimizing the physical data storage and retrieval of data are two key database management problems. In this paper, we propose a language that can express a wide range of physical database layouts, going well beyond the row- and column-based methods that are widely used in database management systems. We use deductive synthesis to turn a high-level relational representation of a database query int… ▽ More

    Submitted 5 February, 2020; v1 submitted 7 March, 2019; originally announced March 2019.

  33. arXiv:1902.06349  [pdf, other

    cs.AI cs.LG

    Learning to Infer Program Sketches

    Authors: Maxwell Nye, Luke Hewitt, Joshua Tenenbaum, Armando Solar-Lezama

    Abstract: Our goal is to build systems which write code automatically from the kinds of specifications humans can most easily provide, such as examples and natural language instruction. The key idea of this work is that a flexible combination of pattern recognition and explicit reasoning can be used to solve these complex programming problems. We propose a method for dynamically integrating these types of i… ▽ More

    Submitted 4 June, 2019; v1 submitted 17 February, 2019; originally announced February 2019.

    Comments: Accepted to ICML 2019

  34. arXiv:1812.02573  [pdf, other

    cs.AI

    Probabilistic Verification of Fairness Properties via Concentration

    Authors: Osbert Bastani, Xin Zhang, Armando Solar-Lezama

    Abstract: As machine learning systems are increasingly used to make real world legal and financial decisions, it is of paramount importance that we develop algorithms to verify that these systems do not discriminate against minorities. We design a scalable algorithm for verifying fairness specifications. Our algorithm obtains strong correctness guarantees based on adaptive concentration inequalities; such i… ▽ More

    Submitted 30 December, 2019; v1 submitted 2 December, 2018; originally announced December 2018.

  35. arXiv:1807.08137  [pdf, other

    cs.LO

    Delta-Decision Procedures for Exists-Forall Problems over the Reals

    Authors: Soonho Kong, Armando Solar-Lezama, Sicun Gao

    Abstract: Solving nonlinear SMT problems over real numbers has wide applications in robotics and AI. While significant progress is made in solving quantifier-free SMT formulas in the domain, quantified formulas have been much less investigated. We propose the first delta-complete algorithm for solving satisfiability of nonlinear SMT over real numbers with universal quantification and a wide range of nonline… ▽ More

    Submitted 21 July, 2018; originally announced July 2018.

  36. arXiv:1805.08328  [pdf, other

    cs.LG stat.ML

    Verifiable Reinforcement Learning via Policy Extraction

    Authors: Osbert Bastani, Yewen Pu, Armando Solar-Lezama

    Abstract: While deep reinforcement learning has successfully solved many challenging control tasks, its real-world applicability has been limited by the inability to ensure the safety of learned policies. We propose an approach to verifiable reinforcement learning by training decision tree policies, which can represent complex policies (since they are nonparametric), yet can be efficiently verified using ex… ▽ More

    Submitted 23 January, 2019; v1 submitted 21 May, 2018; originally announced May 2018.

  37. arXiv:1803.07244  [pdf, other

    cs.AI cs.PL cs.SE

    The Three Pillars of Machine Programming

    Authors: Justin Gottschlich, Armando Solar-Lezama, Nesime Tatbul, Michael Carbin, Martin Rinard, Regina Barzilay, Saman Amarasinghe, Joshua B Tenenbaum, Tim Mattson

    Abstract: In this position paper, we describe our vision of the future of machine programming through a categorical examination of three pillars of research. Those pillars are: (i) intention, (ii) invention, and(iii) adaptation. Intention emphasizes advancements in the human-to-computer and computer-to-machine-learning interfaces. Invention emphasizes the creation or refinement of algorithms or core hardwar… ▽ More

    Submitted 26 June, 2021; v1 submitted 19 March, 2018; originally announced March 2018.

  38. arXiv:1802.07384  [pdf, other

    cs.LG cs.AI stat.ML

    Interpreting Neural Network Judgments via Minimal, Stable, and Symbolic Corrections

    Authors: Xin Zhang, Armando Solar-Lezama, Rishabh Singh

    Abstract: We present a new algorithm to generate minimal, stable, and symbolic corrections to an input that will cause a neural network with ReLU activations to change its output. We argue that such a correction is a useful way to provide feedback to a user when the network's output is different from a desired output. Our algorithm generates such a correction by solving a series of linear constraint satisfa… ▽ More

    Submitted 30 August, 2018; v1 submitted 20 February, 2018; originally announced February 2018.

    Comments: 24 pages

    MSC Class: 68T01

  39. arXiv:1802.04408  [pdf, other

    cs.PL cs.AI

    REAS: Combining Numerical Optimization with SAT Solving

    Authors: Jeevana Priya Inala, Sicun Gao, Soonho Kong, Armando Solar-Lezama

    Abstract: In this paper, we present ReaS, a technique that combines numerical optimization with SAT solving to synthesize unknowns in a program that involves discrete and floating point computation. ReaS makes the program end-to-end differentiable by smoothing any Boolean expression that introduces discontinuity such as conditionals and relaxing the Boolean unknowns so that numerical optimization can be per… ▽ More

    Submitted 12 February, 2018; originally announced February 2018.

  40. arXiv:1711.11438  [pdf, other

    cs.SE cs.LG cs.PL

    SyGuS-Comp 2017: Results and Analysis

    Authors: Rajeev Alur, Dana Fisman, Rishabh Singh, Armando Solar-Lezama

    Abstract: Syntax-Guided Synthesis (SyGuS) is the computational problem of finding an implementation f that meets both a semantic constraint given by a logical formula phi in a background theory T, and a syntactic constraint given by a grammar G, which specifies the allowed set of candidate implementations. Such a synthesis problem can be formally defined in SyGuS-IF, a language that is built on top of SMT-L… ▽ More

    Submitted 28 November, 2017; originally announced November 2017.

    Comments: In Proceedings SYNT 2017, arXiv:1711.10224. arXiv admin note: text overlap with arXiv:1611.07627, arXiv:1602.01170

    Journal ref: EPTCS 260, 2017, pp. 97-115

  41. arXiv:1711.03243  [pdf, other

    cs.AI

    Selecting Representative Examples for Program Synthesis

    Authors: Yewen Pu, Zachery Miranda, Armando Solar-Lezama, Leslie Pack Kaelbling

    Abstract: Program synthesis is a class of regression problems where one seeks a solution, in the form of a source-code program, map** the inputs to their corresponding outputs exactly. Due to its precise and combinatorial nature, program synthesis is commonly formulated as a constraint satisfaction problem, where input-output examples are encoded as constraints and solved with a constraint solver. A key c… ▽ More

    Submitted 7 June, 2018; v1 submitted 8 November, 2017; originally announced November 2017.

    ACM Class: I.2.2

  42. arXiv:1710.10385  [pdf, other

    cs.PL

    Capturing the Future by Replaying the Past

    Authors: James Koppel, Gabriel Scherer, Armando Solar-Lezama

    Abstract: Delimited continuations are the mother of all monads! So goes the slogan inspired by Filinski's 1994 paper, which showed that delimited continuations can implement any monadic effect, letting the programmer use an effect as easily as if it was built into the language. It's a shame that not many languages have delimited continuations. Luckily, exceptions and state are also the mother of all monad… ▽ More

    Submitted 5 July, 2018; v1 submitted 28 October, 2017; originally announced October 2017.

  43. arXiv:1707.09627  [pdf, other

    cs.AI

    Learning to Infer Graphics Programs from Hand-Drawn Images

    Authors: Kevin Ellis, Daniel Ritchie, Armando Solar-Lezama, Joshua B. Tenenbaum

    Abstract: We introduce a model that learns to convert simple hand drawings into graphics programs written in a subset of \LaTeX. The model combines techniques from deep learning and program synthesis. We learn a convolutional neural network that proposes plausible drawing primitives that explain an image. These drawing primitives are like a trace of the set of primitive commands issued by a graphics program… ▽ More

    Submitted 26 October, 2018; v1 submitted 30 July, 2017; originally announced July 2017.

    MSC Class: 68T05

  44. One Tool, Many Languages: Language-Parametric Transformation with Incremental Parametric Syntax

    Authors: James Koppel, Varot Premtoon, Armando Solar-Lezama

    Abstract: We present a new approach for building source-to-source transformations that can run on multiple programming languages, based on a new way of representing programs called incremental parametric syntax. We implement this approach in Haskell in our Cubix system, and construct incremental parametric syntaxes for C, Java, JavaScript, Lua, and Python. We demonstrate a whole-program refactoring tool tha… ▽ More

    Submitted 1 October, 2018; v1 submitted 14 July, 2017; originally announced July 2017.

    ACM Class: D.3.4; D.3.1

    Journal ref: Proc. ACM Program. Lang., Vol. 2, No. OOPSLA, Article 122. Publication date: November 2018

  45. arXiv:1705.07478  [pdf, other

    cs.DC

    Report of the HPC Correctness Summit, Jan 25--26, 2017, Washington, DC

    Authors: Ganesh Gopalakrishnan, Paul D. Hovland, Costin Iancu, Sriram Krishnamoorthy, Ignacio Laguna, Richard A. Lethin, Koushik Sen, Stephen F. Siegel, Armando Solar-Lezama

    Abstract: Maintaining leadership in HPC requires the ability to support simulations at large scales and fidelity. In this study, we detail one of the most significant productivity challenges in achieving this goal, namely the increasing proclivity to bugs, especially in the face of growing hardware and software heterogeneity and sheer system scale. We identify key areas where timely new research must be pro… ▽ More

    Submitted 21 May, 2017; originally announced May 2017.

    Comments: 57 pages

  46. arXiv:1704.06131  [pdf, other

    cs.AI cs.LG stat.ML

    Learning to Acquire Information

    Authors: Yewen Pu, Leslie P Kaelbling, Armando Solar-Lezama

    Abstract: We consider the problem of diagnosis where a set of simple observations are used to infer a potentially complex hidden hypothesis. Finding the optimal subset of observations is intractable in general, thus we focus on the problem of active diagnosis, where the agent selects the next most-informative observation based on the results of previous observations. We show that under the assumption of uni… ▽ More

    Submitted 11 July, 2017; v1 submitted 20 April, 2017; originally announced April 2017.

  47. arXiv:1611.07627  [pdf, other

    cs.SE cs.LG cs.LO

    SyGuS-Comp 2016: Results and Analysis

    Authors: Rajeev Alur, Dana Fisman, Rishabh Singh, Armando Solar-Lezama

    Abstract: Syntax-Guided Synthesis (SyGuS) is the computational problem of finding an implementation f that meets both a semantic constraint given by a logical formula $\varphi$ in a background theory T, and a syntactic constraint given by a grammar G, which specifies the allowed set of candidate implementations. Such a synthesis problem can be formally defined in SyGuS-IF, a language that is built on top of… ▽ More

    Submitted 22 November, 2016; originally announced November 2016.

    Comments: In Proceedings SYNT 2016, arXiv:1611.07178. arXiv admin note: text overlap with arXiv:1602.01170

    ACM Class: I.2.2; D.1.2; D.2.2; D.2.4;

    Journal ref: EPTCS 229, 2016, pp. 178-202

  48. arXiv:1607.03445  [pdf, other

    cs.PL

    Liquid Information Flow Control

    Authors: Nadia Polikarpova, Deian Stefan, Jean Yang, Shachar Itzhaky, Travis Hance, Armando Solar-Lezama

    Abstract: We present Lifty, a domain-specific language for data-centric applications that manipulate sensitive data. A Lifty programmer annotates the sources of sensitive data with declarative security policies, and the language statically and automatically verifies that the application handles the data according to the policies. Moreover, if verification fails, Lifty suggests a provably correct repair, the… ▽ More

    Submitted 30 June, 2020; v1 submitted 12 July, 2016; originally announced July 2016.

  49. arXiv:1607.02902  [pdf, other

    cs.PL cs.AI

    sk_p: a neural program corrector for MOOCs

    Authors: Yewen Pu, Karthik Narasimhan, Armando Solar-Lezama, Regina Barzilay

    Abstract: We present a novel technique for automatic program correction in MOOCs, capable of fixing both syntactic and semantic errors without manual, problem specific correction strategies. Given an incorrect student program, it generates candidate programs from a distribution of likely corrections, and checks each candidate for correctness against a test suite. The key observation is that in MOOCs many… ▽ More

    Submitted 11 July, 2016; originally announced July 2016.

  50. arXiv:1602.07285  [pdf, other

    cs.PL

    Automatic Generation of Formula Simplifiers based on Conditional Rewrite Rules

    Authors: Rohit Singh, Armando Solar-Lezama

    Abstract: This paper addresses the problem of creating simplifiers for logic formulas based on conditional term rewriting. In particular, the paper focuses on a program synthesis application where formula simplifications have been shown to have a significant impact. We show that by combining machine learning techniques with constraint-based synthesis, it is possible to synthesize a formula simplifier fully… ▽ More

    Submitted 23 February, 2016; originally announced February 2016.

    Comments: Submitted for peer reviewed conference