Skip to main content

Showing 1–23 of 23 results for author: Fokoue, A

.
  1. arXiv:2310.08535  [pdf, other

    cs.AI cs.CL

    Formally Specifying the High-Level Behavior of LLM-Based Agents

    Authors: Maxwell Crouse, Ibrahim Abdelaziz, Ramon Astudillo, Kinjal Basu, Soham Dan, Sadhana Kumaravel, Achille Fokoue, Pavan Kapanipathi, Salim Roukos, Luis Lastras

    Abstract: Autonomous, goal-driven agents powered by LLMs have recently emerged as promising tools for solving challenging problems without the need for task-specific finetuned models that can be expensive to procure. Currently, the design and implementation of such agents is ad hoc, as the wide variety of tasks that LLM-based agents may be applied to naturally means there can be no one-size-fits-all approac… ▽ More

    Submitted 24 January, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: Preprint under review

  2. arXiv:2307.02689  [pdf, other

    cs.CL

    Learning Symbolic Rules over Abstract Meaning Representations for Textual Reinforcement Learning

    Authors: Subhajit Chaudhury, Sarathkrishna Swaminathan, Daiki Kimura, Prithviraj Sen, Keerthiram Murugesan, Rosario Uceda-Sosa, Michiaki Tatsubori, Achille Fokoue, Pavan Kapanipathi, Asim Munawar, Alexander Gray

    Abstract: Text-based reinforcement learning agents have predominantly been neural network-based models with embeddings-based representation, learning uninterpretable policies that often do not generalize well to unseen games. On the other hand, neuro-symbolic methods, specifically those that leverage an intermediate formal representation, are gaining significant attention in language understanding tasks. Th… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: ACL 2023

  3. arXiv:2306.10452  [pdf, other

    cs.CL

    MISMATCH: Fine-grained Evaluation of Machine-generated Text with Mismatch Error Types

    Authors: Keerthiram Murugesan, Sarathkrishna Swaminathan, Soham Dan, Subhajit Chaudhury, Chulaka Gunasekara, Maxwell Crouse, Diwakar Mahajan, Ibrahim Abdelaziz, Achille Fokoue, Pavan Kapanipathi, Salim Roukos, Alexander Gray

    Abstract: With the growing interest in large language models, the need for evaluating the quality of machine text compared to reference (typically human-generated) text has become focal attention. Most recent works focus either on task-specific evaluation metrics or study the properties of machine-generated text captured by the existing metrics. In this work, we propose a new evaluation scheme to model huma… ▽ More

    Submitted 17 June, 2023; originally announced June 2023.

    Comments: Accepted at ACL 2023 (ACL Findings Long)

  4. arXiv:2305.08676  [pdf, other

    cs.AI cs.LO

    An Ensemble Approach for Automated Theorem Proving Based on Efficient Name Invariant Graph Neural Representations

    Authors: Achille Fokoue, Ibrahim Abdelaziz, Maxwell Crouse, Shajith Ikbal, Akihiro Kishimoto, Guilherme Lima, Ndivhuwo Makondo, Radu Marinescu

    Abstract: Using reinforcement learning for automated theorem proving has recently received much attention. Current approaches use representations of logical statements that often rely on the names used in these statements and, as a result, the models are generally not transferable from one domain to another. The size of these representations and whether to include the whole theory or part of it are other im… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: Accepted to IJCAI 2023

  5. arXiv:2305.04346  [pdf, other

    cs.CL cs.AI

    Laziness Is a Virtue When It Comes to Compositionality in Neural Semantic Parsing

    Authors: Maxwell Crouse, Pavan Kapanipathi, Subhajit Chaudhury, Tahira Naseem, Ramon Astudillo, Achille Fokoue, Tim Klinger

    Abstract: Nearly all general-purpose neural semantic parsers generate logical forms in a strictly top-down autoregressive fashion. Though such systems have achieved impressive results across a variety of datasets and domains, recent works have called into question whether they are ultimately limited in their ability to compositionally generalize. In this work, we approach semantic parsing from, quite litera… ▽ More

    Submitted 7 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL main conference

  6. arXiv:2209.09114  [pdf

    cs.LG cs.AI

    Neuro-symbolic Models for Interpretable Time Series Classification using Temporal Logic Description

    Authors: Ruixuan Yan, Tengfei Ma, Achille Fokoue, Maria Chang, Agung Julius

    Abstract: Most existing Time series classification (TSC) models lack interpretability and are difficult to inspect. Interpretable machine learning models can aid in discovering patterns in data as well as give easy-to-understand insights to domain specialists. In this study, we present Neuro-Symbolic Time Series Classification (NSTSC), a neuro-symbolic model that leverages signal temporal logic (STL) and ne… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

  7. arXiv:2209.05828  [pdf, other

    cs.AI cs.DB

    Expressive Reasoning Graph Store: A Unified Framework for Managing RDF and Property Graph Databases

    Authors: Sumit Neelam, Udit Sharma, Sumit Bhatia, Hima Karanam, Ankita Likhyani, Ibrahim Abdelaziz, Achille Fokoue, L. V. Subramaniam

    Abstract: Resource Description Framework (RDF) and Property Graph (PG) are the two most commonly used data models for representing, storing, and querying graph data. We present Expressive Reasoning Graph Store (ERGS) -- a graph store built on top of JanusGraph (a Property Graph store) that also allows storing and querying of RDF datasets. First, we describe how RDF data can be translated into a Property Gra… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 16 pages, 3 figures, 9 tables

  8. arXiv:2204.08554  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    CBR-iKB: A Case-Based Reasoning Approach for Question Answering over Incomplete Knowledge Bases

    Authors: Dung Thai, Srinivas Ravishankar, Ibrahim Abdelaziz, Mudit Chaudhary, Nandana Mihindukulasooriya, Tahira Naseem, Rajarshi Das, Pavan Kapanipathi, Achille Fokoue, Andrew McCallum

    Abstract: Knowledge bases (KBs) are often incomplete and constantly changing in practice. Yet, in many question answering applications coupled with knowledge bases, the sparse nature of KBs is often overlooked. To this end, we propose a case-based reasoning approach, CBR-iKB, for knowledge base question answering (KBQA) with incomplete-KB as our main focus. Our method ensembles decisions from multiple reaso… ▽ More

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: 8 pages, 3 figurs, 4 tables

  9. arXiv:2201.05793  [pdf, other

    cs.CL cs.AI

    A Benchmark for Generalizable and Interpretable Temporal Question Answering over Knowledge Bases

    Authors: Sumit Neelam, Udit Sharma, Hima Karanam, Shajith Ikbal, Pavan Kapanipathi, Ibrahim Abdelaziz, Nandana Mihindukulasooriya, Young-Suk Lee, Santosh Srivastava, Cezar Pendus, Saswati Dana, Dinesh Garg, Achille Fokoue, G P Shrivatsa Bhargav, Dinesh Khandelwal, Srinivas Ravishankar, Sairam Gurajada, Maria Chang, Rosario Uceda-Sosa, Salim Roukos, Alexander Gray, Guilherme Lima, Ryan Riegel, Francois Luus, L Venkata Subramaniam

    Abstract: Knowledge Base Question Answering (KBQA) tasks that involve complex reasoning are emerging as an important research direction. However, most existing KBQA datasets focus primarily on generic multi-hop reasoning over explicit facts, largely ignoring other reasoning types such as temporal, spatial, and taxonomic reasoning. In this paper, we present a benchmark dataset for temporal reasoning, TempQA-… ▽ More

    Submitted 15 January, 2022; originally announced January 2022.

    Comments: 7 pages, 2 figures, 7 tables. arXiv admin note: substantial text overlap with arXiv:2109.13430

  10. arXiv:2111.05825  [pdf, other

    cs.CL cs.AI

    A Two-Stage Approach towards Generalization in Knowledge Base Question Answering

    Authors: Srinivas Ravishankar, June Thai, Ibrahim Abdelaziz, Nandana Mihidukulasooriya, Tahira Naseem, Pavan Kapanipathi, Gaetano Rossiello, Achille Fokoue

    Abstract: Most existing approaches for Knowledge Base Question Answering (KBQA) focus on a specific underlying knowledge base either because of inherent assumptions in the approach, or because evaluating it on a different knowledge base requires non-trivial changes. However, many popular knowledge bases share similarities in their underlying schemas that can be leveraged to facilitate generalization across… ▽ More

    Submitted 17 November, 2021; v1 submitted 10 November, 2021; originally announced November 2021.

  11. arXiv:2109.13430  [pdf, other

    cs.CL cs.AI

    SYGMA: System for Generalizable Modular Question Answering OverKnowledge Bases

    Authors: Sumit Neelam, Udit Sharma, Hima Karanam, Shajith Ikbal, Pavan Kapanipathi, Ibrahim Abdelaziz, Nandana Mihindukulasooriya, Young-Suk Lee, Santosh Srivastava, Cezar Pendus, Saswati Dana, Dinesh Garg, Achille Fokoue, G P Shrivatsa Bhargav, Dinesh Khandelwal, Srinivas Ravishankar, Sairam Gurajada, Maria Chang, Rosario Uceda-Sosa, Salim Roukos, Alexander Gray, Guilherme LimaRyan Riegel, Francois Luus, L Venkata Subramaniam

    Abstract: Knowledge Base Question Answering (KBQA) tasks that in-volve complex reasoning are emerging as an important re-search direction. However, most KBQA systems struggle withgeneralizability, particularly on two dimensions: (a) acrossmultiple reasoning types where both datasets and systems haveprimarily focused on multi-hop reasoning, and (b) across mul-tiple knowledge bases, where KBQA approaches are… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

  12. arXiv:2106.03906  [pdf, other

    cs.AI cs.LO

    Learning to Guide a Saturation-Based Theorem Prover

    Authors: Ibrahim Abdelaziz, Maxwell Crouse, Bassem Makni, Vernon Austil, Cristina Cornelio, Shajith Ikbal, Pavan Kapanipathi, Ndivhuwo Makondo, Kavitha Srinivas, Michael Witbrock, Achille Fokoue

    Abstract: Traditional automated theorem provers have relied on manually tuned heuristics to guide how they perform proof search. Recently, however, there has been a surge of interest in the design of learning mechanisms that can be integrated into theorem provers to improve their performance automatically. In this work, we introduce TRAIL, a deep learning-based approach to theorem proving that characterizes… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

  13. arXiv:2012.01707  [pdf, other

    cs.CL cs.AI

    Leveraging Abstract Meaning Representation for Knowledge Base Question Answering

    Authors: Pavan Kapanipathi, Ibrahim Abdelaziz, Srinivas Ravishankar, Salim Roukos, Alexander Gray, Ramon Astudillo, Maria Chang, Cristina Cornelio, Saswati Dana, Achille Fokoue, Dinesh Garg, Alfio Gliozzo, Sairam Gurajada, Hima Karanam, Naweed Khan, Dinesh Khandelwal, Young-Suk Lee, Yunyao Li, Francois Luus, Ndivhuwo Makondo, Nandana Mihindukulasooriya, Tahira Naseem, Sumit Neelam, Lucian Popa, Revanth Reddy , et al. (5 additional authors not shown)

    Abstract: Knowledge base question answering (KBQA)is an important task in Natural Language Processing. Existing approaches face significant challenges including complex question understanding, necessity for reasoning, and lack of large end-to-end training datasets. In this work, we propose Neuro-Symbolic Question Answering (NSQA), a modular KBQA system, that leverages (1) Abstract Meaning Representation (AM… ▽ More

    Submitted 2 June, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: Accepted to Findings of ACL

  14. arXiv:2002.00423  [pdf, other

    cs.AI cs.LG cs.LO

    An Experimental Study of Formula Embeddings for Automated Theorem Proving in First-Order Logic

    Authors: Ibrahim Abdelaziz, Veronika Thost, Maxwell Crouse, Achille Fokoue

    Abstract: Automated theorem proving in first-order logic is an active research area which is successfully supported by machine learning. While there have been various proposals for encoding logical formulas into numerical vectors -- from simple strings to more involved graph-based embeddings -- little is known about how these different encodings compare. In this paper, we study and experimentally compare pa… ▽ More

    Submitted 15 March, 2020; v1 submitted 2 February, 2020; originally announced February 2020.

    Comments: 7 pages, preprint, under review

  15. arXiv:1911.06904  [pdf, other

    cs.AI cs.LG cs.LO cs.SC

    Improving Graph Neural Network Representations of Logical Formulae with Subgraph Pooling

    Authors: Maxwell Crouse, Ibrahim Abdelaziz, Cristina Cornelio, Veronika Thost, Lingfei Wu, Kenneth Forbus, Achille Fokoue

    Abstract: Recent advances in the integration of deep learning with automated theorem proving have centered around the representation of logical formulae as inputs to deep learning systems. In particular, there has been a growing interest in adapting structure-aware neural methods to work with the underlying graph representations of logical expressions. While more effective than character and token-level app… ▽ More

    Submitted 5 June, 2020; v1 submitted 15 November, 2019; originally announced November 2019.

  16. arXiv:1911.02085  [pdf, other

    cs.AI cs.CL

    Path-Based Contextualization of Knowledge Graphs for Textual Entailment

    Authors: Kshitij Fadnis, Kartik Talamadupula, Pavan Kapanipathi, Haque Ishfaq, Salim Roukos, Achille Fokoue

    Abstract: In this paper, we introduce the problem of knowledge graph contextualization -- that is, given a specific NLP task, the problem of extracting meaningful and relevant sub-graphs from a given knowledge graph. The task in the case of this paper is the textual entailment problem, and the context is a relevant sub-graph for an instance of the textual entailment problem -- where given two sentences p an… ▽ More

    Submitted 3 February, 2020; v1 submitted 5 November, 2019; originally announced November 2019.

  17. arXiv:1911.02065  [pdf, other

    cs.AI cs.LG cs.LO

    A Deep Reinforcement Learning Approach to First-Order Logic Theorem Proving

    Authors: Maxwell Crouse, Ibrahim Abdelaziz, Bassem Makni, Spencer Whitehead, Cristina Cornelio, Pavan Kapanipathi, Kavitha Srinivas, Veronika Thost, Michael Witbrock, Achille Fokoue

    Abstract: Automated theorem provers have traditionally relied on manually tuned heuristics to guide how they perform proof search. Deep reinforcement learning has been proposed as a way to obviate the need for such heuristics, however, its deployment in automated theorem proving remains a challenge. In this paper we introduce TRAIL, a system that applies deep reinforcement learning to saturation-based theor… ▽ More

    Submitted 15 September, 2020; v1 submitted 5 November, 2019; originally announced November 2019.

  18. arXiv:1911.02060  [pdf, other

    cs.CL cs.AI

    Infusing Knowledge into the Textual Entailment Task Using Graph Convolutional Networks

    Authors: Pavan Kapanipathi, Veronika Thost, Siva Sankalp Patel, Spencer Whitehead, Ibrahim Abdelaziz, Avinash Balakrishnan, Maria Chang, Kshitij Fadnis, Chulaka Gunasekara, Bassem Makni, Nicholas Mattei, Kartik Talamadupula, Achille Fokoue

    Abstract: Textual entailment is a fundamental task in natural language processing. Most approaches for solving the problem use only the textual content present in training data. A few approaches have shown that information from external knowledge sources like knowledge graphs (KGs) can add value, in addition to the textual content, by providing background knowledge that may be critical for a task. However,… ▽ More

    Submitted 21 November, 2019; v1 submitted 5 November, 2019; originally announced November 2019.

  19. arXiv:1901.02565  [pdf, other

    cs.AI

    High-Fidelity Vector Space Models of Structured Data

    Authors: Maxwell Crouse, Achille Fokoue, Maria Chang, Pavan Kapanipathi, Ryan Musa, Constantine Nakos, Lingfei Wu, Kenneth Forbus, Michael Witbrock

    Abstract: Machine learning systems regularly deal with structured data in real-world applications. Unfortunately, such data has been difficult to faithfully represent in a way that most machine learning techniques would expect, i.e. as a real-valued vector of a fixed, pre-specified size. In this work, we introduce a novel approach that compiles structured data into a satisfiability problem which has in its… ▽ More

    Submitted 15 January, 2019; v1 submitted 8 January, 2019; originally announced January 2019.

    Comments: updated to reflect conference submission, new experiment added

  20. arXiv:1809.05726  [pdf, other

    cs.AI cs.CL

    Answering Science Exam Questions Using Query Rewriting with Background Knowledge

    Authors: Ryan Musa, Xiaoyan Wang, Achille Fokoue, Nicholas Mattei, Maria Chang, Pavan Kapanipathi, Bassem Makni, Kartik Talamadupula, Michael Witbrock

    Abstract: Open-domain question answering (QA) is an important problem in AI and NLP that is emerging as a bellwether for progress on the generalizability of AI methods and techniques. Much of the progress in open-domain QA systems has been realized through advances in information retrieval methods and corpus construction. In this paper, we focus on the recently introduced ARC Challenge dataset, which contai… ▽ More

    Submitted 5 April, 2019; v1 submitted 15 September, 2018; originally announced September 2018.

  21. arXiv:1809.05724  [pdf, other

    cs.AI cs.CL cs.LG

    Improving Natural Language Inference Using External Knowledge in the Science Questions Domain

    Authors: Xiaoyan Wang, Pavan Kapanipathi, Ryan Musa, Mo Yu, Kartik Talamadupula, Ibrahim Abdelaziz, Maria Chang, Achille Fokoue, Bassem Makni, Nicholas Mattei, Michael Witbrock

    Abstract: Natural Language Inference (NLI) is fundamental to many Natural Language Processing (NLP) applications including semantic search and question answering. The NLI problem has gained significant attention thanks to the release of large scale, challenging datasets. Present approaches to the problem largely focus on learning-based methods that use only textual information in order to classify whether a… ▽ More

    Submitted 20 November, 2018; v1 submitted 15 September, 2018; originally announced September 2018.

    Comments: 9 pages, 3 figures, 5 tables

  22. arXiv:1705.08520  [pdf

    cs.AI cs.LG cs.NE

    An effective algorithm for hyperparameter optimization of neural networks

    Authors: Gonzalo Diaz, Achille Fokoue, Giacomo Nannicini, Horst Samulowitz

    Abstract: A major challenge in designing neural network (NN) systems is to determine the best structure and parameters for the network given the data for the machine learning problem at hand. Examples of parameters are the number of layers and nodes, the learning rates, and the dropout rates. Typically, these parameters are chosen based on heuristic rules and manually fine-tuned, which may be very time-cons… ▽ More

    Submitted 23 May, 2017; originally announced May 2017.

  23. arXiv:1308.5703  [pdf, other

    cs.DB

    A Principled Approach to Bridging the Gap between Graph Data and their Schemas

    Authors: Marcelo Arenas, Gonzalo I. Diaz, Achille Fokoue, Anastasios Kementsietsidis, Kavitha Srinivas

    Abstract: Although RDF graphs have schema information associated with them, in practice it is very common to find cases in which data do not fully conform to their schema. A prominent example of this is DBpedia, which is RDF data extracted from Wikipedia, a publicly editable source of information. In such situations, it becomes interesting to study the structural properties of the actual data, because the s… ▽ More

    Submitted 4 March, 2014; v1 submitted 26 August, 2013; originally announced August 2013.

    Comments: 18 pages, 8 figures. To be published in PVLDB Vol. 8, No. 9