Showing 1–42 of 42 results for author: Witbrock, M

  1. arXiv:2406.13002  [pdf, other]

    cs.CV

    Recurrence over Video Frames (RoVF) for the Re-identification of Meerkats

    Authors: Mitchell Rogers, Kobe Knowles, Gaël Gendron, Shahrokh Heidari, David Arturo Soriano Valdez, Mihailo Azhar, Padriac O'Leary, Simon Eyre, Michael Witbrock, Patrice Delmas

    Abstract: Deep learning approaches for animal re-identification have had a major impact on conservation, significantly reducing the time required for many downstream tasks, such as well-being monitoring. We propose a method called Recurrence over Video Frames (RoVF), which uses a recurrent head based on the Perceiver architecture to iteratively construct an embedding from a video clip. RoVF is trained using…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Presented as a poster at the CV4Animals Workshop, CVPR 2024

  2. Efficient size-prescribed $k$-core search

    Authors: Yiping Liu, Bo Yan, Bo Zhao, Hongyi Su, Yang Chen, Michael Witbrock

    Abstract: $k$-core is a subgraph where every node has at least $k$ neighbors within the subgraph. The $k$-core subgraphs have been employed in large platforms like Network Repository to comprehend the underlying structures and dynamics of the network. Existing studies have primarily focused on finding $k…

    Submitted 14 March, 2024; originally announced March 2024.

  3. arXiv:2402.02636  [pdf, other]

    cs.CL cs.AI cs.IT cs.LG

    Can Large Language Models Learn Independent Causal Mechanisms?

    Authors: Gaël Gendron, Bao Trung Nguyen, Alex Yuxuan Peng, Michael Witbrock, Gillian Dobbie

    Abstract: Despite impressive performance on language modelling and complex reasoning tasks, Large Language Models (LLMs) fall short on the same tasks in uncommon settings or with distribution shifts, exhibiting some lack of generalisation ability. This issue has usually been alleviated by feeding more training data into the LLM. However, this method is brittle, as the scope of tasks may not be readily predi…

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: 17 pages, 8 pages for the main paper and 9 pages for references and appendices, 12 figures

    ACM Class: I.2.3; I.2.6; I.2.7; G.3

  4. arXiv:2312.14333  [pdf, other]

    cs.MA cs.LG stat.ME

    Behaviour Modelling of Social Animals via Causal Structure Discovery and Graph Neural Networks

    Authors: Gaël Gendron, Yang Chen, Mitchell Rogers, Yiping Liu, Mihailo Azhar, Shahrokh Heidari, David Arturo Soriano Valdez, Kobe Knowles, Padriac O'Leary, Simon Eyre, Michael Witbrock, Gillian Dobbie, Jiamou Liu, Patrice Delmas

    Abstract: Better understanding the natural world is a crucial task with a wide range of applications. In environments with close proximity between humans and animals, such as zoos, it is essential to better understand the causes behind animal behaviour and what interventions are responsible for changes in their behaviours. This can help to predict unusual behaviours, mitigate detrimental effects and increas…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 9 pages, 7 figures, accepted as an extended abstract and poster at AAMAS 2024

    ACM Class: I.2.6; I.5.1; I.6.3; J.4

  5. arXiv:2311.12337  [pdf, other]

    cs.CL cs.AI

    Do Smaller Language Models Answer Contextualised Questions Through Memorisation Or Generalisation?

    Authors: Tim Hartill, Joshua Bensemann, Michael Witbrock, Patricia J. Riddle

    Abstract: A distinction is often drawn between a model's ability to predict a label for an evaluation sample that is directly memorised from highly similar training samples versus an ability to predict the label via some method of generalisation. In the context of using Language Models for question-answering, discussion continues to occur as to the extent to which questions are answered through memorisation…

    Submitted 20 November, 2023; originally announced November 2023.

  6. arXiv:2310.09430  [pdf, ps, other]

    cs.CL cs.AI

    Assessing and Enhancing the Robustness of Large Language Models with Task Structure Variations for Logical Reasoning

    Authors: Qiming Bao, Gael Gendron, Alex Yuxuan Peng, Wanjun Zhong, Neset Tan, Yang Chen, Michael Witbrock, Jiamou Liu

    Abstract: Large language models (LLMs), such as LLaMA, Alpaca, Vicuna, GPT-3.5 and GPT-4, have advanced the performance of AI systems on various natural language processing tasks to human-like levels. However, their generalisation and robustness when performing logical reasoning has not been sufficiently assessed. To comprehensively evaluate this ability, we develop three new logical reasoning datasets name…

    Submitted 30 March, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: The short version (v3) was accepted for oral presentation at the first LLM@IJCAI 2023 non-archival symposium; the full version is under review

  7. arXiv:2309.10444  [pdf, other]

    cs.AI cs.CL

    Exploring Iterative Enhancement for Improving Learnersourced Multiple-Choice Question Explanations with Large Language Models

    Authors: Qiming Bao, Juho Leinonen, Alex Yuxuan Peng, Wanjun Zhong, Gaël Gendron, Timothy Pistotti, Alice Huang, Paul Denny, Michael Witbrock, Jiamou Liu

    Abstract: Large language models exhibit superior capabilities in processing and understanding language, yet their applications in educational contexts remain underexplored. Learnersourcing enhances learning by engaging students in creating their own educational content. When learnersourcing multiple-choice questions, creating explanations for the solution of a question is a crucial step; it helps other stud…

    Submitted 10 March, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: The short version (v4) was accepted as a non-archival workshop paper at AGI@ICLR 2024; the full version is under review

  8. arXiv:2308.04711  [pdf, other]

    cs.CL

    Answering Unseen Questions With Smaller Language Models Using Rationale Generation and Dense Retrieval

    Authors: Tim Hartill, Diana Benavides-Prado, Michael Witbrock, Patricia J. Riddle

    Abstract: When provided with sufficient explanatory context, smaller Language Models have been shown to exhibit strong reasoning ability on challenging short-answer question-answering tasks where the questions are unseen in training. We evaluate two methods for further improvement in this setting. Both methods focus on combining rationales generated by a larger Language Model with longer contexts created fr…

    Submitted 12 October, 2023; v1 submitted 9 August, 2023; originally announced August 2023.

  9. arXiv:2308.00946  [pdf, other]

    cs.CL cs.AI

    Teaching Smaller Language Models To Generalise To Unseen Compositional Questions

    Authors: Tim Hartill, Neset Tan, Michael Witbrock, Patricia J. Riddle

    Abstract: We equip a smaller Language Model to generalise to answering challenging compositional questions that have not been seen in training. To do so we propose a combination of multitask supervised pretraining on up to 93 tasks designed to instill diverse reasoning abilities, and a dense retrieval system that aims to retrieve a set of evidential paragraph fragments. Recent progress in question-answering…

    Submitted 20 August, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

  10. arXiv:2306.11326  [pdf, other]

    cs.CV

    Meerkat Behaviour Recognition Dataset

    Authors: Mitchell Rogers, Gaël Gendron, David Arturo Soriano Valdez, Mihailo Azhar, Yang Chen, Shahrokh Heidari, Caleb Perelini, Padriac O'Leary, Kobe Knowles, Izak Tait, Simon Eyre, Michael Witbrock, Patrice Delmas

    Abstract: Recording animal behaviour is an important step in evaluating the well-being of animals and further understanding the natural world. Current methods for documenting animal behaviour within a zoo setting, such as scan sampling, require excessive human effort, are unfit for around-the-clock monitoring, and may produce human-biased results. Several animal datasets already exist that focus predominant…

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: Presented as a poster for the CV4Animals Workshop, CVPR 2023. For associated dataset see: https://meerkat-dataset.github.io/

  11. arXiv:2305.19555  [pdf, ps, other]

    cs.CL cs.LG

    Large Language Models Are Not Strong Abstract Reasoners

    Authors: Gaël Gendron, Qiming Bao, Michael Witbrock, Gillian Dobbie

    Abstract: Large Language Models have shown tremendous performance on a large variety of natural language processing tasks, ranging from text comprehension to common sense reasoning. However, the mechanisms responsible for this success remain opaque, and it is unclear whether LLMs can achieve human-like cognitive capabilities or whether these models are still fundamentally circumscribed. Abstract reasoning i…

    Submitted 2 January, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: 50 pages, 14 pages for the main paper and 36 pages for the supplement, 35 figures, 17 tables. V3: performed additional experiments

    ACM Class: I.2.2; I.2.3; I.2.7; I.5.1

  12. arXiv:2305.12599  [pdf, other]

    cs.CL cs.AI

    Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical Reasoning

    Authors: Qiming Bao, Alex Yuxuan Peng, Zhenyun Deng, Wanjun Zhong, Gael Gendron, Timothy Pistotti, Neset Tan, Nathan Young, Yang Chen, Yonghua Zhu, Paul Denny, Michael Witbrock, Jiamou Liu

    Abstract: Combining large language models with logical reasoning enhances their capacity to address problems in a robust and reliable manner. Nevertheless, the intricate nature of logical reasoning poses challenges when gathering reliable data from the web to build comprehensive training datasets, subsequently affecting performance on downstream tasks. To address this, we introduce a novel logic-driven data…

    Submitted 6 June, 2024; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: 21 pages, 8 figures, the Findings of ACL 2024

  13. arXiv:2305.03232  [pdf, other]

    cs.CL

    Neuromodulation Gated Transformer

    Authors: Kobe Knowles, Joshua Bensemann, Diana Benavides-Prado, Vithya Yogarajan, Michael Witbrock, Gillian Dobbie, Yang Chen

    Abstract: We introduce a novel architecture, the Neuromodulation Gated Transformer (NGT), which is a simple implementation of neuromodulation in transformers via a multiplicative effect. We compare it to baselines and show that it results in the best average performance on the SuperGLUE benchmark validation sets.

    Submitted 11 May, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: 8 pages, 1 figure, 4 tables, ICLR 2023 Tiny Papers

  14. arXiv:2303.07585  [pdf, other]

    cs.CL

    Input-length-shortening and text generation via attention values

    Authors: Neşet Özkan Tan, Alex Yuxuan Peng, Joshua Bensemann, Qiming Bao, Tim Hartill, Mark Gahegan, Michael Witbrock

    Abstract: Identifying words that impact a task's performance more than others is a challenge in natural language processing. Transformer models have recently addressed this issue by incorporating an attention mechanism that assigns greater attention (i.e., relevance) scores to some words than others. Because of the attention mechanism's high computational cost, transformer models usually have an input-leng…

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: 7 pages, 4 figures. AAAI23-EMC2

  15. arXiv:2302.08001  [pdf, other]

    cs.AI cs.MA

    Learning Density-Based Correlated Equilibria for Markov Games

    Authors: Libo Zhang, Yang Chen, Toru Takisaka, Bakh Khoussainov, Michael Witbrock, Jiamou Liu

    Abstract: Correlated Equilibrium (CE) is a well-established solution concept that captures coordination among agents and enjoys good algorithmic properties. In real-world multi-agent systems, in addition to being in an equilibrium, agents' policies are often expected to meet requirements with respect to safety and fairness. Such additional requirements can often be expressed in terms of the state density w…

    Submitted 15 February, 2023; originally announced February 2023.

  16. arXiv:2302.00869  [pdf, other]

    cs.LG cs.CV cs.DM stat.ME

    Disentanglement of Latent Representations via Causal Interventions

    Authors: Gaël Gendron, Michael Witbrock, Gillian Dobbie

    Abstract: The process of generating data such as images is controlled by independent and unknown factors of variation. The retrieval of these variables has been studied extensively in the disentanglement, causal representation learning, and independent component analysis fields. Recently, approaches merging these domains together have shown great success. Instead of directly representing the factors of vari…

    Submitted 22 September, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: 16 pages, 10 pages for the main paper and 6 pages for the supplement, 14 figures, accepted to IJCAI 2023. V3: content matches the IJCAI version

    ACM Class: I.2.6; I.4.7; I.4.10; G.3

    Journal ref: Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence. IJCAI 2023. Main Track. Pages 3239-3247

  17. arXiv:2302.00293  [pdf, other]

    cs.LG stat.ME

    A Survey of Methods, Challenges and Perspectives in Causality

    Authors: Gaël Gendron, Michael Witbrock, Gillian Dobbie

    Abstract: Deep Learning models have shown success in a large variety of tasks by extracting correlation patterns from high-dimensional data but still struggle when generalizing out of their initial distribution. As causal engines aim to learn mechanisms independent from a data distribution, combining Deep Learning with Causality can have a great impact on the two fields. In this paper, we further motivate t…

    Submitted 31 December, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: 40 pages, 37 pages for the main paper and 3 pages for the supplement, 8 figures, submitted to ACM Computing Surveys

    ACM Class: I.2.6; G.3

  18. arXiv:2211.08978  [pdf]

    cs.SD cs.AI eess.AS

    Rapid Connectionist Speaker Adaptation

    Authors: Michael Witbrock, Patrick Haffner

    Abstract: We present SVCnet, a system for modelling speaker variability. Encoder Neural Networks specialized for each speech sound produce low dimensionality models of acoustical variation, and these models are further combined into an overall model of voice variability. A training procedure is described which minimizes the dependence of this model on which sounds have been uttered. Using the trained model…

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: 6 Figures, Two Tables, ICASSP-92

    Journal ref: ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing, 1992, pp. 453-456 vol.1

  19. arXiv:2209.06923  [pdf, other]

    cs.CL

    Prompt-based Conservation Learning for Multi-hop Question Answering

    Authors: Zhenyun Deng, Yonghua Zhu, Yang Chen, Qianqian Qi, Michael Witbrock, Patricia Riddle

    Abstract: Multi-hop question answering (QA) requires reasoning over multiple documents to answer a complex question and provide interpretable supporting evidence. However, providing supporting evidence is not enough to demonstrate that a model has performed the desired reasoning to reach the correct answer. Most existing multi-hop QA methods fail to answer a large fraction of sub-questions, even if their pa…

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: Accepted to COLING 2022

  20. arXiv:2207.14000  [pdf, other]

    cs.CL cs.AI cs.LG cs.LO

    Multi-Step Deductive Reasoning Over Natural Language: An Empirical Study on Out-of-Distribution Generalisation

    Authors: Qiming Bao, Alex Yuxuan Peng, Tim Hartill, Neset Tan, Zhenyun Deng, Michael Witbrock, Jiamou Liu

    Abstract: Combining deep learning with symbolic logic reasoning aims to capitalize on the success of both fields and is drawing increasing attention. Inspired by DeepLogic, an end-to-end model trained to perform inference on logic programs, we introduce IMA-GloVe-GA, an iterative neural inference network for multi-step reasoning expressed in natural language. In our model, reasoning is performed using an it…

    Submitted 30 March, 2024; v1 submitted 28 July, 2022; originally announced July 2022.

    Comments: 10 pages, 3 figures, The 2nd International Joint Conference on Learning & Reasoning and 16th International Workshop on Neural-Symbolic Learning and Reasoning (IJCLR-NeSy 2022)

  21. arXiv:2206.08486  [pdf, ps, other]

    cs.CL

    Interpretable AMR-Based Question Decomposition for Multi-hop Question Answering

    Authors: Zhenyun Deng, Yonghua Zhu, Yang Chen, Michael Witbrock, Patricia Riddle

    Abstract: Effective multi-hop question answering (QA) requires reasoning over multiple scattered paragraphs and providing explanations for answers. Most existing approaches cannot provide an interpretable reasoning process to illustrate how these models arrive at an answer. In this paper, we propose a Question Decomposition method based on Abstract Meaning Representation (QDAMR) for multi-hop QA, which achi…

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: Accepted by IJCAI 2022

    Journal ref: https://www.ijcai.org/proceedings/2022/0568.pdf

  22. arXiv:2203.12186  [pdf, other]

    cs.CL

    AbductionRules: Training Transformers to Explain Unexpected Inputs

    Authors: Nathan Young, Qiming Bao, Joshua Bensemann, Michael Witbrock

    Abstract: Transformers have recently been shown to be capable of reliably performing logical reasoning over facts and rules expressed in natural language, but abductive reasoning - inference to the best explanation of an unexpected observation - has been underexplored despite significant applications to scientific discovery, common-sense reasoning, and model interpretability. We present AbductionRules, a…

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: Findings of ACL 2022

  23. Relating Blindsight and AI: A Review

    Authors: Joshua Bensemann, Qiming Bao, Gaël Gendron, Tim Hartill, Michael Witbrock

    Abstract: Processes occurring in brains, a.k.a. biological neural networks, can and have been modeled within artificial neural network architectures. Due to this, we have conducted a review of research on the phenomenon of blindsight in an attempt to generate ideas for artificial intelligence models. Blindsight can be considered as a diminished form of visual experience. If we assume that artificial network…

    Submitted 8 December, 2021; originally announced January 2022.

    Comments: Preprint of an article published in Journal of Artificial Intelligence and Consciousness, 2021 doi.org/10.1142/S2705078521500156 © copyright World Scientific Publishing Company www.worldscientific.com/worldscinet/jaic

    Journal ref: Journal of Artificial Intelligence and Consciousness, 1-15 (2021)

  24. Semantic Construction Grammar: Bridging the NL / Logic Divide

    Authors: Dave Schneider, Michael Witbrock

    Abstract: In this paper, we discuss Semantic Construction Grammar (SCG), a system developed over the past several years to facilitate translation between natural language and logical representations. Crucially, SCG is designed to support a variety of different methods of representation, ranging from those that are fairly close to the NL structure (e.g. so-called 'logical forms'), to those that are quite dif…

    Submitted 9 December, 2021; originally announced December 2021.

  25. arXiv:2111.10058  [pdf, other]

    cs.CL cs.AI cs.LG

    DeepQR: Neural-based Quality Ratings for Learnersourced Multiple-Choice Questions

    Authors: Lin Ni, Qiming Bao, Xiaoxuan Li, Qianqian Qi, Paul Denny, Jim Warren, Michael Witbrock, Jiamou Liu

    Abstract: Automated question quality rating (AQQR) aims to evaluate question quality through computational means, thereby addressing emerging challenges in online learnersourced question repositories. Existing methods for AQQR rely solely on explicitly-defined criteria such as readability and word count, while not fully utilising the power of state-of-the-art deep-learning techniques. We propose DeepQR, a n…

    Submitted 19 November, 2021; originally announced November 2021.

    Comments: EAAI 22

  26. arXiv:2106.03906  [pdf, other]

    cs.AI cs.LO

    Learning to Guide a Saturation-Based Theorem Prover

    Authors: Ibrahim Abdelaziz, Maxwell Crouse, Bassem Makni, Vernon Austil, Cristina Cornelio, Shajith Ikbal, Pavan Kapanipathi, Ndivhuwo Makondo, Kavitha Srinivas, Michael Witbrock, Achille Fokoue

    Abstract: Traditional automated theorem provers have relied on manually tuned heuristics to guide how they perform proof search. Recently, however, there has been a surge of interest in the design of learning mechanisms that can be integrated into theorem provers to improve their performance automatically. In this work, we introduce TRAIL, a deep learning-based approach to theorem proving that characterizes…

    Submitted 7 June, 2021; originally announced June 2021.

  27. arXiv:2104.14654  [pdf, other]

    cs.LG

    Adversarial Inverse Reinforcement Learning for Mean Field Games

    Authors: Yang Chen, Libo Zhang, Jiamou Liu, Michael Witbrock

    Abstract: Mean field games (MFGs) provide a mathematically tractable framework for modelling large-scale multi-agent systems by leveraging mean field theory to simplify interactions among agents. It enables applying inverse reinforcement learning (IRL) to predict behaviours of large populations by recovering reward signals from demonstrated behaviours. However, existing IRL methods for MFGs are powerless to…

    Submitted 17 April, 2023; v1 submitted 29 April, 2021; originally announced April 2021.

    Comments: Accepted to AAMAS 2023

  28. arXiv:1911.02065  [pdf, other]

    cs.AI cs.LG cs.LO

    A Deep Reinforcement Learning Approach to First-Order Logic Theorem Proving

    Authors: Maxwell Crouse, Ibrahim Abdelaziz, Bassem Makni, Spencer Whitehead, Cristina Cornelio, Pavan Kapanipathi, Kavitha Srinivas, Veronika Thost, Michael Witbrock, Achille Fokoue

    Abstract: Automated theorem provers have traditionally relied on manually tuned heuristics to guide how they perform proof search. Deep reinforcement learning has been proposed as a way to obviate the need for such heuristics, however, its deployment in automated theorem proving remains a challenge. In this paper we introduce TRAIL, a system that applies deep reinforcement learning to saturation-based theor…

    Submitted 15 September, 2020; v1 submitted 5 November, 2019; originally announced November 2019.

  29. arXiv:1903.05153  [pdf, other]

    cs.LG cs.AI stat.ML

    A Sequential Set Generation Method for Predicting Set-Valued Outputs

    Authors: Tian Gao, Jie Chen, Vijil Chenthamarakshan, Michael Witbrock

    Abstract: Consider a general machine learning setting where the output is a set of labels or sequences. This output set is unordered and its size varies with the input. Whereas multi-label classification methods seem a natural first resort, they are not readily applicable to set-valued outputs because of the growth rate of the output space; and because conventional sequence generation doesn't reflect sets'…

    Submitted 12 March, 2019; originally announced March 2019.

    Comments: Published at AAAI 2019

  30. arXiv:1901.02565  [pdf, other]

    cs.AI

    High-Fidelity Vector Space Models of Structured Data

    Authors: Maxwell Crouse, Achille Fokoue, Maria Chang, Pavan Kapanipathi, Ryan Musa, Constantine Nakos, Lingfei Wu, Kenneth Forbus, Michael Witbrock

    Abstract: Machine learning systems regularly deal with structured data in real-world applications. Unfortunately, such data has been difficult to faithfully represent in a way that most machine learning techniques would expect, i.e. as a real-valued vector of a fixed, pre-specified size. In this work, we introduce a novel approach that compiles structured data into a satisfiability problem which has in its…

    Submitted 15 January, 2019; v1 submitted 8 January, 2019; originally announced January 2019.

    Comments: updated to reflect conference submission, new experiment added

  31. arXiv:1812.00151  [pdf, other]

    cs.LG cs.CR math.OC stat.ML

    Discrete Adversarial Attacks and Submodular Optimization with Applications to Text Classification

    Authors: Qi Lei, Lingfei Wu, Pin-Yu Chen, Alexandros G. Dimakis, Inderjit S. Dhillon, Michael Witbrock

    Abstract: Adversarial examples are carefully constructed modifications to an input that completely change the output of a classifier but are imperceptible to humans. Despite these successful attacks for continuous data (such as image and audio samples), generating adversarial examples for discrete structures such as text has proven significantly more challenging. In this paper we formulate the attacks with…

    Submitted 4 April, 2019; v1 submitted 1 December, 2018; originally announced December 2018.

    Comments: In SysML 2019

  32. arXiv:1811.01713  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    Word Mover's Embedding: From Word2Vec to Document Embedding

    Authors: Lingfei Wu, Ian E. H. Yen, Kun Xu, Fangli Xu, Avinash Balakrishnan, Pin-Yu Chen, Pradeep Ravikumar, Michael J. Witbrock

    Abstract: While the celebrated Word2Vec technique yields semantically rich representations for individual words, there has been relatively less success in extending to generate unsupervised sentences or documents embeddings. Recent work has demonstrated that a distance measure between documents called Word Mover's Distance (WMD) that aligns semantically similar words, yields unprecedented KNN classif…

    Submitted 30 October, 2018; originally announced November 2018.

    Comments: EMNLP'18 Camera-Ready Version

  33. arXiv:1809.05726  [pdf, other]

    cs.AI cs.CL

    Answering Science Exam Questions Using Query Rewriting with Background Knowledge

    Authors: Ryan Musa, Xiaoyan Wang, Achille Fokoue, Nicholas Mattei, Maria Chang, Pavan Kapanipathi, Bassem Makni, Kartik Talamadupula, Michael Witbrock

    Abstract: Open-domain question answering (QA) is an important problem in AI and NLP that is emerging as a bellwether for progress on the generalizability of AI methods and techniques. Much of the progress in open-domain QA systems has been realized through advances in information retrieval methods and corpus construction. In this paper, we focus on the recently introduced ARC Challenge dataset, which contai…

    Submitted 5 April, 2019; v1 submitted 15 September, 2018; originally announced September 2018.

  34. arXiv:1809.05724  [pdf, other]

    cs.AI cs.CL cs.LG

    Improving Natural Language Inference Using External Knowledge in the Science Questions Domain

    Authors: Xiaoyan Wang, Pavan Kapanipathi, Ryan Musa, Mo Yu, Kartik Talamadupula, Ibrahim Abdelaziz, Maria Chang, Achille Fokoue, Bassem Makni, Nicholas Mattei, Michael Witbrock

    Abstract: Natural Language Inference (NLI) is fundamental to many Natural Language Processing (NLP) applications including semantic search and question answering. The NLI problem has gained significant attention thanks to the release of large scale, challenging datasets. Present approaches to the problem largely focus on learning-based methods that use only textual information in order to classify whether a…

    Submitted 20 November, 2018; v1 submitted 15 September, 2018; originally announced September 2018.

    Comments: 9 pages, 3 figures, 5 tables

  35. arXiv:1809.05259  [pdf, other]

    cs.LG stat.ML

    Random Warping Series: A Random Features Method for Time-Series Embedding

    Authors: Lingfei Wu, Ian En-Hsu Yen, Jinfeng Yi, Fangli Xu, Qi Lei, Michael Witbrock

    Abstract: Time series data analytics has been a problem of substantial interest for decades, and Dynamic Time Warping (DTW) has been the most widely adopted technique to measure dissimilarity between time series. A number of global-alignment kernels have since been proposed in the spirit of DTW to extend its use to kernel-based estimation methods such as support vector machines. However, those kernels suffer…

    Submitted 14 September, 2018; originally announced September 2018.

    Comments: AIStats18, Oral Paper, Add code link for generating RWS

  36. arXiv:1806.00358  [pdf, other]

    cs.AI cs.CL cs.IR

    A Systematic Classification of Knowledge, Reasoning, and Context within the ARC Dataset

    Authors: Michael Boratko, Harshit Padigela, Divyendra Mikkilineni, Pritish Yuvraj, Rajarshi Das, Andrew McCallum, Maria Chang, Achille Fokoue-Nkoutche, Pavan Kapanipathi, Nicholas Mattei, Ryan Musa, Kartik Talamadupula, Michael Witbrock

    Abstract: The recent work of Clark et al. introduces the AI2 Reasoning Challenge (ARC) and the associated ARC dataset that partitions open domain, complex science questions into an Easy Set and a Challenge Set. That paper includes an analysis of 100 questions with respect to the types of knowledge and reasoning required to answer them; however, it does not include clear definitions of these types, nor does…

    Submitted 4 February, 2019; v1 submitted 1 June, 2018; originally announced June 2018.

    Comments: Presented at the Machine Reading for Question Answering (MRQA 2018) Workshop at the 55th Annual Meeting of the Association for Computational Linguistics (ACL 2018). 11 pages, 5 tables, 4 figures. Added missing citations in the latest draft

  37. arXiv:1805.02704  [pdf, other]

    cs.CV

    Image Super-Resolution via Dual-State Recurrent Networks

    Authors: Wei Han, Shiyu Chang, Ding Liu, Mo Yu, Michael Witbrock, Thomas S. Huang

    Abstract: Advances in image super-resolution (SR) have recently benefited significantly from rapid developments in deep neural networks. Inspired by these recent discoveries, we note that many state-of-the-art deep SR architectures can be reformulated as a single-state recurrent neural network (RNN) with finite unfoldings. In this paper, we explore new structures for SR based on this compact RNN view, leadi…

    Submitted 7 May, 2018; originally announced May 2018.

  38. arXiv:1804.00823  [pdf, other]

    cs.AI cs.CL cs.LG stat.ML

    Graph2Seq: Graph to Sequence Learning with Attention-based Neural Networks

    Authors: Kun Xu, Lingfei Wu, Zhiguo Wang, Yansong Feng, Michael Witbrock, Vadim Sheinin

    Abstract: The celebrated Sequence to Sequence learning (Seq2Seq) technique and its numerous variants achieve excellent performance on many tasks. However, many machine learning tasks have inputs naturally represented as graphs; existing Seq2Seq models face a significant challenge in achieving accurate conversion from graph form to the appropriate sequence. To address this challenge, we introduce a novel gen…

    Submitted 3 December, 2018; v1 submitted 3 April, 2018; originally announced April 2018.

    Comments: 16 pages, 3 figures, 4 tables

  39. arXiv:1802.04956  [pdf, ps, other]

    stat.ML cs.LG

    D2KE: From Distance to Kernel and Embedding

    Authors: Lingfei Wu, Ian En-Hsu Yen, Fangli Xu, Pradeep Ravikumar, Michael Witbrock

    Abstract: For many machine learning problem settings, particularly with structured inputs such as sequences or sets of objects, a distance measure between inputs can be specified more naturally than a feature representation. However, most standard machine models are designed for inputs with a vector feature representation. In this work, we consider the estimation of a function…

    Submitted 25 May, 2018; v1 submitted 13 February, 2018; originally announced February 2018.

    Comments: 15 pages, 4 tables

  40. arXiv:1801.01554  [pdf]

    cs.LG cs.DC cs.NE

    An Implementation of Back-Propagation Learning on GF11, a Large SIMD Parallel Computer

    Authors: Michael Witbrock, Marco Zagha

    Abstract: Current connectionist simulations require huge computational resources. We describe a neural network simulator for the IBM GF11, an experimental SIMD machine with 566 processors and a peak arithmetic performance of 11 Gigaflops. We present our parallel implementation of the backpropagation learning algorithm, techniques for increasing efficiency, performance measurements on the NetTalk text-to-spe…

    Submitted 4 January, 2018; originally announced January 2018.

    Report number: CMU-CS-89-208

    Journal ref: Witbrock, M., and Zagha, M. (1989). "An Implementation of Back-Propagation Learning on GF11, a Large SIMD Parallel Computer." School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, Technical Report CMU-CS-89-208

  41. arXiv:1710.02224  [pdf, other]

    cs.AI cs.LG

    Dilated Recurrent Neural Networks

    Authors: Shiyu Chang, Yang Zhang, Wei Han, Mo Yu, Xiaoxiao Guo, Wei Tan, Xiaodong Cui, Michael Witbrock, Mark Hasegawa-Johnson, Thomas S. Huang

    Abstract: Learning with recurrent neural networks (RNNs) on long sequences is a notoriously difficult task. There are three major challenges: 1) complex dependencies, 2) vanishing and exploding gradients, and 3) efficient parallelization. In this paper, we introduce a simple yet effective RNN connection structure, the DilatedRNN, which simultaneously tackles all of these challenges. The proposed architectur…

    Submitted 1 November, 2017; v1 submitted 5 October, 2017; originally announced October 2017.

    Comments: Accepted by NIPS 2017

  42. arXiv:1603.04402  [pdf]

    cs.AI

    Controlling Search in Very large Commonsense Knowledge Bases: A Machine Learning Approach

    Authors: Abhishek Sharma, Michael Witbrock, Keith Goolsbey

    Abstract: Very large commonsense knowledge bases (KBs) often have thousands to millions of axioms, of which relatively few are relevant for answering any given query. A large number of irrelevant axioms can easily overwhelm resolution-based theorem provers. Therefore, methods that help the reasoner identify useful inference paths form an essential part of large-scale reasoning systems. In this paper, we des…

    Submitted 14 March, 2016; originally announced March 2016.

    Comments: 6 pages