Skip to main content

Showing 1–15 of 15 results for author: Crouse, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00121  [pdf, other

    cs.LG cs.AI cs.CL

    Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks

    Authors: Ibrahim Abdelaziz, Kinjal Basu, Mayank Agarwal, Sadhana Kumaravel, Matthew Stallone, Rameswar Panda, Yara Rizk, GP Bhargav, Maxwell Crouse, Chulaka Gunasekara, Shajith Ikbal, Sachin Joshi, Hima Karanam, Vineet Kumar, Asim Munawar, Sumit Neelam, Dinesh Raghu, Udit Sharma, Adriana Meza Soria, Dheeraj Sreedhar, Praveen Venkateswaran, Merve Unuvar, David Cox, Salim Roukos, Luis Lastras , et al. (1 additional authors not shown)

    Abstract: Large language models (LLMs) have recently shown tremendous promise in serving as the backbone to agentic systems, as demonstrated by their performance in multi-faceted, challenging benchmarks like SWE-Bench and Agent-Bench. However, to realize the true potential of LLMs as autonomous agents, they must learn to identify, call, and interact with external tools and application program interfaces (AP… ▽ More

    Submitted 27 June, 2024; originally announced July 2024.

  2. arXiv:2405.04324  [pdf, other

    cs.AI cs.CL cs.SE

    Granite Code Models: A Family of Open Foundation Models for Code Intelligence

    Authors: Mayank Mishra, Matt Stallone, Gaoyuan Zhang, Yikang Shen, Aditya Prasad, Adriana Meza Soria, Michele Merler, Parameswaran Selvam, Saptha Surendran, Shivdeep Singh, Manish Sethi, Xuan-Hong Dang, Pengyuan Li, Kun-Lung Wu, Syed Zawad, Andrew Coleman, Matthew White, Mark Lewis, Raju Pavuluri, Yan Koyfman, Boris Lublinsky, Maximilien de Bayser, Ibrahim Abdelaziz, Kinjal Basu, Mayank Agarwal , et al. (21 additional authors not shown)

    Abstract: Large Language Models (LLMs) trained on code are revolutionizing the software development process. Increasingly, code LLMs are being integrated into software development environments to improve the productivity of human programmers, and LLM-based agents are beginning to show promise for handling complex tasks autonomously. Realizing the full potential of code LLMs requires a wide range of capabili… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Corresponding Authors: Rameswar Panda, Ruchir Puri; Equal Contributors: Mayank Mishra, Matt Stallone, Gaoyuan Zhang

  3. arXiv:2402.15491  [pdf, other

    cs.CL cs.AI

    API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs

    Authors: Kinjal Basu, Ibrahim Abdelaziz, Subhajit Chaudhury, Soham Dan, Maxwell Crouse, Asim Munawar, Sadhana Kumaravel, Vinod Muthusamy, Pavan Kapanipathi, Luis A. Lastras

    Abstract: There is a growing need for Large Language Models (LLMs) to effectively use tools and external Application Programming Interfaces (APIs) to plan and complete tasks. As such, there is tremendous interest in methods that can acquire sufficient quantities of train and test data that involve calls to tools / APIs. Two lines of research have emerged as the predominant strategies for addressing this cha… ▽ More

    Submitted 20 May, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: Accepted at ACL'24-main conference

  4. arXiv:2310.08535  [pdf, other

    cs.AI cs.CL

    Formally Specifying the High-Level Behavior of LLM-Based Agents

    Authors: Maxwell Crouse, Ibrahim Abdelaziz, Ramon Astudillo, Kinjal Basu, Soham Dan, Sadhana Kumaravel, Achille Fokoue, Pavan Kapanipathi, Salim Roukos, Luis Lastras

    Abstract: Autonomous, goal-driven agents powered by LLMs have recently emerged as promising tools for solving challenging problems without the need for task-specific finetuned models that can be expensive to procure. Currently, the design and implementation of such agents is ad hoc, as the wide variety of tasks that LLM-based agents may be applied to naturally means there can be no one-size-fits-all approac… ▽ More

    Submitted 24 January, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: Preprint under review

  5. arXiv:2309.16467  [pdf, other

    cs.LG

    Compositional Program Generation for Few-Shot Systematic Generalization

    Authors: Tim Klinger, Luke Liu, Soham Dan, Maxwell Crouse, Parikshit Ram, Alexander Gray

    Abstract: Compositional generalization is a key ability of humans that enables us to learn new concepts from only a handful examples. Neural machine learning models, including the now ubiquitous Transformers, struggle to generalize in this way, and typically require thousands of examples of a concept during training in order to generalize meaningfully. This difference in ability between humans and artificia… ▽ More

    Submitted 18 January, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: 7 pages of text with 1 page of references

  6. arXiv:2306.10452  [pdf, other

    cs.CL

    MISMATCH: Fine-grained Evaluation of Machine-generated Text with Mismatch Error Types

    Authors: Keerthiram Murugesan, Sarathkrishna Swaminathan, Soham Dan, Subhajit Chaudhury, Chulaka Gunasekara, Maxwell Crouse, Diwakar Mahajan, Ibrahim Abdelaziz, Achille Fokoue, Pavan Kapanipathi, Salim Roukos, Alexander Gray

    Abstract: With the growing interest in large language models, the need for evaluating the quality of machine text compared to reference (typically human-generated) text has become focal attention. Most recent works focus either on task-specific evaluation metrics or study the properties of machine-generated text captured by the existing metrics. In this work, we propose a new evaluation scheme to model huma… ▽ More

    Submitted 17 June, 2023; originally announced June 2023.

    Comments: Accepted at ACL 2023 (ACL Findings Long)

  7. arXiv:2305.20018  [pdf, other

    cs.CL cs.AI

    Scalable Learning of Latent Language Structure With Logical Offline Cycle Consistency

    Authors: Maxwell Crouse, Ramon Astudillo, Tahira Naseem, Subhajit Chaudhury, Pavan Kapanipathi, Salim Roukos, Alexander Gray

    Abstract: We introduce Logical Offline Cycle Consistency Optimization (LOCCO), a scalable, semi-supervised method for training a neural semantic parser. Conceptually, LOCCO can be viewed as a form of self-learning where the semantic parser being trained is used to generate annotations for unlabeled text that are then used as new supervision. To increase the quality of annotations, our method utilizes a coun… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

  8. arXiv:2305.08676  [pdf, other

    cs.AI cs.LO

    An Ensemble Approach for Automated Theorem Proving Based on Efficient Name Invariant Graph Neural Representations

    Authors: Achille Fokoue, Ibrahim Abdelaziz, Maxwell Crouse, Shajith Ikbal, Akihiro Kishimoto, Guilherme Lima, Ndivhuwo Makondo, Radu Marinescu

    Abstract: Using reinforcement learning for automated theorem proving has recently received much attention. Current approaches use representations of logical statements that often rely on the names used in these statements and, as a result, the models are generally not transferable from one domain to another. The size of these representations and whether to include the whole theory or part of it are other im… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: Accepted to IJCAI 2023

  9. arXiv:2305.04346  [pdf, other

    cs.CL cs.AI

    Laziness Is a Virtue When It Comes to Compositionality in Neural Semantic Parsing

    Authors: Maxwell Crouse, Pavan Kapanipathi, Subhajit Chaudhury, Tahira Naseem, Ramon Astudillo, Achille Fokoue, Tim Klinger

    Abstract: Nearly all general-purpose neural semantic parsers generate logical forms in a strictly top-down autoregressive fashion. Though such systems have achieved impressive results across a variety of datasets and domains, recent works have called into question whether they are ultimately limited in their ability to compositionally generalize. In this work, we approach semantic parsing from, quite litera… ▽ More

    Submitted 7 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL main conference

  10. arXiv:2106.03906  [pdf, other

    cs.AI cs.LO

    Learning to Guide a Saturation-Based Theorem Prover

    Authors: Ibrahim Abdelaziz, Maxwell Crouse, Bassem Makni, Vernon Austil, Cristina Cornelio, Shajith Ikbal, Pavan Kapanipathi, Ndivhuwo Makondo, Kavitha Srinivas, Michael Witbrock, Achille Fokoue

    Abstract: Traditional automated theorem provers have relied on manually tuned heuristics to guide how they perform proof search. Recently, however, there has been a surge of interest in the design of learning mechanisms that can be integrated into theorem provers to improve their performance automatically. In this work, we introduce TRAIL, a deep learning-based approach to theorem proving that characterizes… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

  11. arXiv:2004.03573  [pdf, other

    cs.AI cs.NE cs.SC

    Neural Analogical Matching

    Authors: Maxwell Crouse, Constantine Nakos, Ibrahim Abdelaziz, Kenneth Forbus

    Abstract: Analogy is core to human cognition. It allows us to solve problems based on prior experience, it governs the way we conceptualize new information, and it even influences our visual perception. The importance of analogy to humans has made it an active area of research in the broader field of artificial intelligence, resulting in data-efficient models that learn and reason in human-like ways. While… ▽ More

    Submitted 15 December, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: AAAI version

  12. arXiv:2002.00423  [pdf, other

    cs.AI cs.LG cs.LO

    An Experimental Study of Formula Embeddings for Automated Theorem Proving in First-Order Logic

    Authors: Ibrahim Abdelaziz, Veronika Thost, Maxwell Crouse, Achille Fokoue

    Abstract: Automated theorem proving in first-order logic is an active research area which is successfully supported by machine learning. While there have been various proposals for encoding logical formulas into numerical vectors -- from simple strings to more involved graph-based embeddings -- little is known about how these different encodings compare. In this paper, we study and experimentally compare pa… ▽ More

    Submitted 15 March, 2020; v1 submitted 2 February, 2020; originally announced February 2020.

    Comments: 7 pages, preprint, under review

  13. arXiv:1911.06904  [pdf, other

    cs.AI cs.LG cs.LO cs.SC

    Improving Graph Neural Network Representations of Logical Formulae with Subgraph Pooling

    Authors: Maxwell Crouse, Ibrahim Abdelaziz, Cristina Cornelio, Veronika Thost, Lingfei Wu, Kenneth Forbus, Achille Fokoue

    Abstract: Recent advances in the integration of deep learning with automated theorem proving have centered around the representation of logical formulae as inputs to deep learning systems. In particular, there has been a growing interest in adapting structure-aware neural methods to work with the underlying graph representations of logical expressions. While more effective than character and token-level app… ▽ More

    Submitted 5 June, 2020; v1 submitted 15 November, 2019; originally announced November 2019.

  14. arXiv:1911.02065  [pdf, other

    cs.AI cs.LG cs.LO

    A Deep Reinforcement Learning Approach to First-Order Logic Theorem Proving

    Authors: Maxwell Crouse, Ibrahim Abdelaziz, Bassem Makni, Spencer Whitehead, Cristina Cornelio, Pavan Kapanipathi, Kavitha Srinivas, Veronika Thost, Michael Witbrock, Achille Fokoue

    Abstract: Automated theorem provers have traditionally relied on manually tuned heuristics to guide how they perform proof search. Deep reinforcement learning has been proposed as a way to obviate the need for such heuristics, however, its deployment in automated theorem proving remains a challenge. In this paper we introduce TRAIL, a system that applies deep reinforcement learning to saturation-based theor… ▽ More

    Submitted 15 September, 2020; v1 submitted 5 November, 2019; originally announced November 2019.

  15. arXiv:1901.02565  [pdf, other

    cs.AI

    High-Fidelity Vector Space Models of Structured Data

    Authors: Maxwell Crouse, Achille Fokoue, Maria Chang, Pavan Kapanipathi, Ryan Musa, Constantine Nakos, Lingfei Wu, Kenneth Forbus, Michael Witbrock

    Abstract: Machine learning systems regularly deal with structured data in real-world applications. Unfortunately, such data has been difficult to faithfully represent in a way that most machine learning techniques would expect, i.e. as a real-valued vector of a fixed, pre-specified size. In this work, we introduce a novel approach that compiles structured data into a satisfiability problem which has in its… ▽ More

    Submitted 15 January, 2019; v1 submitted 8 January, 2019; originally announced January 2019.

    Comments: updated to reflect conference submission, new experiment added