Showing 1–31 of 31 results for author: Kelleher, J D

  1. arXiv:2405.19934  [pdf, other]

    cs.CY

    Estimating Population Burden of Stroke with an Agent-Based Model

    Authors: Elizabeth Hunter, John D. Kelleher

    Abstract: Stroke is one of the leading causes of death and disability worldwide but it is believed to be highly preventable. The majority of stroke prevention focuses on targeting high-risk individuals but it is important to understand how the targeting of high-risk individuals might impact the overall societal burden of stroke. We propose using an agent-based model that follows agents through their pre-st…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 18th Social Simulation Conference

  2. arXiv:2404.05870  [pdf, other]

    cs.RO

    CoBT: Collaborative Programming of Behaviour Trees from One Demonstration for Robot Manipulation

    Authors: Aayush Jain, Philip Long, Valeria Villani, John D. Kelleher, Maria Chiara Leva

    Abstract: Mass customization and shorter manufacturing cycles are becoming more important among small and medium-sized companies. However, classical industrial robots struggle to cope with product variation and dynamic environments. In this paper, we present CoBT, a collaborative programming by demonstration framework for generating reactive and modular behavior trees. CoBT relies on a single demonstration…

    Submitted 10 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted for presentation at IEEE ICRA 2024

  3. arXiv:2403.02009  [pdf, other]

    cs.CL

    Topic Aware Probing: From Sentence Length Prediction to Idiom Identification how reliant are Neural Language Models on Topic?

    Authors: Vasudevan Nedumpozhimana, John D. Kelleher

    Abstract: Transformer-based Neural Language Models achieve state-of-the-art performance on various natural language processing tasks. However, an open question is the extent to which these models rely on word-order/syntactic or word co-occurrence/topic-based information when processing natural language. This work contributes to this debate by addressing the question of whether these models primarily use top…

    Submitted 4 March, 2024; originally announced March 2024.

  4. arXiv:2402.13219  [pdf, other]

    cs.AI cs.HC cs.LG cs.MA eess.SY

    Analyzing Operator States and the Impact of AI-Enhanced Decision Support in Control Rooms: A Human-in-the-Loop Specialized Reinforcement Learning Framework for Intervention Strategies

    Authors: Ammar N. Abbas, Chidera W. Amazu, Joseph Mietkiewicz, Houda Briwa, Andres Alonzo Perez, Gabriele Baldissone, Micaela Demichela, Georgios G. Chasparis, John D. Kelleher, Maria Chiara Leva

    Abstract: In complex industrial and chemical process control rooms, effective decision-making is crucial for safety and efficiency. The experiments in this paper evaluate the impact and applications of an AI-based decision support system integrated into an improved human-machine interface, using dynamic influence diagrams, a hidden Markov model, and deep reinforcement learning. The enhanced support system a…

    Submitted 20 February, 2024; originally announced February 2024.

  5. arXiv:2402.06097  [pdf, other]

    cs.AI cs.LG

    TWIG: Towards pre-hoc Hyperparameter Optimisation and Cross-Graph Generalisation via Simulated KGE Models

    Authors: Jeffrey Sardina, John D. Kelleher, Declan O'Sullivan

    Abstract: In this paper we introduce TWIG (Topologically-Weighted Intelligence Generation), a novel, embedding-free paradigm for simulating the output of KGEs that uses a tiny fraction of the parameters. TWIG learns weights from inputs that consist of topological features of the graph data, with no coding for latent representations of entities or edges. Our experiments on the UMLS dataset show that a single…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: This article was accepted for publication at IEEE ICSC 2024

    MSC Class: 68R10

  6. arXiv:2310.18811  [pdf]

    cs.AI cs.LG eess.SY

    Hierarchical Framework for Interpretable and Probabilistic Model-Based Safe Reinforcement Learning

    Authors: Ammar N. Abbas, Georgios C. Chasparis, John D. Kelleher

    Abstract: The difficulty of identifying the physical model of complex systems has led to exploring methods that do not rely on such complex modeling of the systems. Deep reinforcement learning has been the pioneer for solving this problem without the need for relying on the physical model of complex systems by just interacting with it. However, it uses a black-box learning approach that makes it difficult t…

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: text overlap with arXiv:2206.13433

    Journal ref: Data & Knowledge Engineering, 2023

  7. arXiv:2310.14788  [pdf]

    cs.LG cs.AI eess.SY

    Specialized Deep Residual Policy Safe Reinforcement Learning-Based Controller for Complex and Continuous State-Action Spaces

    Authors: Ammar N. Abbas, Georgios C. Chasparis, John D. Kelleher

    Abstract: Traditional controllers have limitations as they rely on prior knowledge about the physics of the problem, require modeling of dynamics, and struggle to adapt to abnormal situations. Deep reinforcement learning has the potential to address these problems by learning optimal control policies through exploration in an environment. For safety-critical environments, it is impractical to explore random…

    Submitted 15 October, 2023; originally announced October 2023.

  8. arXiv:2310.14451  [pdf, other]

    cs.CL

    Domain Terminology Integration into Machine Translation: Leveraging Large Language Models

    Authors: Yasmin Moslem, Gianfranco Romani, Mahdi Molaei, Rejwanul Haque, John D. Kelleher, Andy Way

    Abstract: This paper discusses the methods that we used for our submissions to the WMT 2023 Terminology Shared Task for German-to-English (DE-EN), English-to-Czech (EN-CS), and Chinese-to-English (ZH-EN) language pairs. The task aims to advance machine translation (MT) by challenging participants to develop systems that accurately translate technical terms, ultimately enhancing communication and understandi…

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: WMT 2023

  9. arXiv:2305.01633  [pdf, other]

    cs.CL

    Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP

    Authors: Anya Belz, Craig Thomson, Ehud Reiter, Gavin Abercrombie, Jose M. Alonso-Moral, Mohammad Arvan, Anouck Braggaar, Mark Cieliebak, Elizabeth Clark, Kees van Deemter, Tanvi Dinkar, Ondřej Dušek, Steffen Eger, Qixiang Fang, Mingqi Gao, Albert Gatt, Dimitra Gkatzia, Javier González-Corbelle, Dirk Hovy, Manuela Hürlimann, Takumi Ito, John D. Kelleher, Filip Klubicka, Emiel Krahmer, Huiyuan Lai, et al. (17 additional authors not shown)

    Abstract: We report our efforts in identifying a set of previous human evaluations in NLP that would be suitable for a coordinated study examining what makes human evaluations in NLP more/less reproducible. We present our results and findings, which include that just 13% of papers had (i) sufficiently low barriers to reproduction, and (ii) enough obtainable information, to be considered for reproduction, a…

    Submitted 7 August, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: 5 pages plus appendix, 4 tables, 1 figure. To appear at "Workshop on Insights from Negative Results in NLP" (co-located with EACL2023). Updated author list and acknowledgements

    MSC Class: 68

    ACM Class: I.2.7

  10. arXiv:2304.14333  [pdf, other]

    cs.CL cs.AI cs.LG

    Idioms, Probing and Dangerous Things: Towards Structural Probing for Idiomaticity in Vector Space

    Authors: Filip Klubička, Vasudevan Nedumpozhimana, John D. Kelleher

    Abstract: The goal of this paper is to learn more about how idiomatic information is structurally encoded in embeddings, using a structural probing method. We repurpose an existing English verbal multi-word expression (MWE) dataset to suit the probing framework and perform a comparative probing study of static (GloVe) and contextual (BERT) embeddings. Our experiments indicate that both encode some idiomatic…

    Submitted 27 April, 2023; originally announced April 2023.

    Comments: 9 pages, 5 tables, In proceedings of the 19th Workshop on Multiword Expressions @ EACL2023

    MSC Class: 68T50

  11. arXiv:2301.13294  [pdf, other]

    cs.CL

    Adaptive Machine Translation with Large Language Models

    Authors: Yasmin Moslem, Rejwanul Haque, John D. Kelleher, Andy Way

    Abstract: Consistency is a key requirement of high-quality translation. It is especially important to adhere to pre-approved terminology and adapt to corrected translations in domain-specific projects. Machine translation (MT) has achieved significant progress in the area of domain adaptation. However, real-time adaptation remains challenging. Large-scale language models (LLMs) have recently shown interesti…

    Submitted 9 May, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: EAMT 2023 - Research: technical

  12. arXiv:2301.10656  [pdf, other]

    cs.CL cs.AI cs.LG

    Probing Taxonomic and Thematic Embeddings for Taxonomic Information

    Authors: Filip Klubička, John D. Kelleher

    Abstract: Modelling taxonomic and thematic relatedness is important for building AI with comprehensive natural language understanding. The goal of this paper is to learn more about how taxonomic information is structurally encoded in embeddings. To do this, we design a new hypernym-hyponym probing task and perform a comparative probing study of taxonomic and thematic SGNS and GloVe embeddings. Our experimen…

    Submitted 25 January, 2023; originally announced January 2023.

    Comments: 9 pages, 1 figure, 4 tables, In proceedings of the 12th International Global Wordnet Conference

    MSC Class: 68T30

  13. arXiv:2210.12206  [pdf, other]

    cs.CL cs.AI cs.LG

    Probing with Noise: Unpicking the Warp and Weft of Embeddings

    Authors: Filip Klubička, John D. Kelleher

    Abstract: Improving our understanding of how information is encoded in vector space can yield valuable interpretability insights. Alongside vector dimensions, we argue that it is possible for the vector norm to also carry linguistic information. We develop a method to test this: an extension of the probing framework which allows for relative intrinsic interpretations of probing results. It relies on introdu…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: 10 pages, 3 tables, Workshop on analyzing and interpreting neural networks for NLP

    MSC Class: 68Uxx

  14. arXiv:2208.05909  [pdf]

    cs.CL

    Domain-Specific Text Generation for Machine Translation

    Authors: Yasmin Moslem, Rejwanul Haque, John D. Kelleher, Andy Way

    Abstract: Preservation of domain knowledge from the source to target is crucial in any translation workflow. It is common in the translation industry to receive highly specialized projects, where there is hardly any parallel in-domain data. In such scenarios where there is insufficient in-domain data to fine-tune Machine Translation (MT) models, producing translations that are consistent with the relevant c…

    Submitted 11 August, 2022; originally announced August 2022.

    Comments: AMTA 2022 - MT Research Track

    Report number: 2022.amta-research.2

    Journal ref: Proceedings of the 15th biennial conference of the Association for Machine Translation in the Americas (2022) Volume 1: Research Track, pages 14-30, Orlando, USA. Association for Machine Translation in the Americas

  15. Interpretable Hidden Markov Model-Based Deep Reinforcement Learning Hierarchical Framework for Predictive Maintenance of Turbofan Engines

    Authors: Ammar N. Abbas, Georgios Chasparis, John D. Kelleher

    Abstract: An open research question in deep reinforcement learning is how to focus the policy learning of key decisions within a sparse domain. This paper emphasizes combining the advantages of input-output hidden Markov models and reinforcement learning towards interpretable maintenance decisions. We propose a novel hierarchical-modeling methodology that, at a high level, detects and interprets the root cau…

    Submitted 11 January, 2023; v1 submitted 27 June, 2022; originally announced June 2022.

    Journal ref: Preprint: International Conference on Big Data Analytics and Knowledge Discovery Proceedings, 2022

  16. arXiv:2206.02436  [pdf, other]

    cs.HC cs.AI cs.CY cs.LG

    Detecting Interlocutor Confusion in Situated Human-Avatar Dialogue: A Pilot Study

    Authors: Na Li, John D. Kelleher, Robert Ross

    Abstract: In order to enhance levels of engagement with conversational systems, our long term research goal seeks to monitor the confusion state of a user and adapt dialogue policies in response to such user confusion states. To this end, in this paper, we present our initial research centred on a user-avatar dialogue scenario that we have developed to study the manifestation of confusion and in the long te…

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: 8 figures, 10 pages including 2 pages of references. Conference: https://semdial2021.ling.uni-potsdam.de/, Paper link: https://semdial2021.ling.uni-potsdam.de/assets/semdial2021_potsdial_full_proceedings.pdf

    MSC Class: 62-11

    ACM Class: C.2; G.3

  17. Mutual Information Decay Curves and Hyper-Parameter Grid Search Design for Recurrent Neural Architectures

    Authors: Abhijit Mahalunkar, John D. Kelleher

    Abstract: We present an approach to design the grid searches for hyper-parameter optimization for recurrent neural architectures. The basis for this approach is the use of mutual information to analyze long distance dependencies (LDDs) within a dataset. We also report a set of experiments that demonstrate how using this approach, we obtain state-of-the-art results for DilatedRNNs across a range of benchmark…

    Submitted 8 December, 2020; originally announced December 2020.

    Comments: Published at the 27th International Conference on Neural Information Processing, ICONIP 2020, Bangkok, Thailand, November 18-22, 2020. arXiv admin note: text overlap with arXiv:1810.02966

  18. arXiv:2011.14901  [pdf, other]

    cs.CL cs.CV cs.LG cs.NE

    Language-Driven Region Pointer Advancement for Controllable Image Captioning

    Authors: Annika Lindh, Robert J. Ross, John D. Kelleher

    Abstract: Controllable Image Captioning is a recent sub-field in the multi-modal task of Image Captioning wherein constraints are placed on which regions in an image should be described in the generated natural language caption. This puts a stronger focus on producing more detailed descriptions, and opens the door for more end-user control over results. A vital component of the Controllable Image Captioning…

    Submitted 30 November, 2020; originally announced November 2020.

    Comments: Accepted to COLING 2020

    MSC Class: 68T07; 68T45; 68T50

    ACM Class: I.2.7; I.2.10; I.5.1

  19. arXiv:2002.06235  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    Semantic Relatedness and Taxonomic Word Embeddings

    Authors: Magdalena Kacmajor, John D. Kelleher, Filip Klubicka, Alfredo Maldonado

    Abstract: This paper connects a series of papers dealing with taxonomic word embeddings. It begins by noting that there are different types of semantic relatedness and that different lexical representations encode different forms of relatedness. A particularly important distinction within semantic relatedness is that of thematic versus taxonomic relatedness. Next, we present a number of experiments that ana…

    Submitted 14 February, 2020; originally announced February 2020.

    Comments: 7 pages, 0 figures

  20. arXiv:1907.06048  [pdf, other]

    cs.LG cs.FL stat.ML

    Multi-Element Long Distance Dependencies: Using SPk Languages to Explore the Characteristics of Long-Distance Dependencies

    Authors: Abhijit Mahalunkar, John D. Kelleher

    Abstract: In order to successfully model Long Distance Dependencies (LDDs) it is necessary to understand the full-range of the characteristics of the LDDs exhibited in a target dataset. In this paper, we use Strictly k-Piecewise languages to generate datasets with various properties. We then compute the characteristics of the LDDs in these datasets using mutual information and analyze the impact of factors…

    Submitted 13 July, 2019; originally announced July 2019.

    Comments: To appear in ACL 2019 workshop on Deep Learning and Formal Languages: Building Bridges. arXiv admin note: substantial text overlap with arXiv:1810.02966

  21. arXiv:1903.09866  [pdf, other]

    cs.HC

    Referring to the recently seen: reference and perceptual memory in situated dialog

    Authors: John D. Kelleher, Simon Dobnik

    Abstract: From theoretical linguistic and cognitive perspectives, situated dialog systems are interesting as they provide ideal test-beds for investigating the interaction between language and perception. At the same time there are a growing number of practical applications, for example robotic systems and driver-less cars, where spoken interfaces, capable of situated dialog, promise many advantages. To dat…

    Submitted 23 March, 2019; originally announced March 2019.

    Comments: 18 Pages, 4 Figures

  22. arXiv:1812.09541  [pdf, other]

    cs.IR

    TEST: A Terminology Extraction System for Technology Related Terms

    Authors: Murhaf Hossari, Soumyabrata Dev, John D. Kelleher

    Abstract: Tracking developments in the highly dynamic data-technology landscape is vital to keeping up with novel technologies and tools in the various areas of Artificial Intelligence (AI). However, it is difficult to keep track of all the relevant technology keywords. In this paper, we propose a novel system that addresses this problem. This tool is used to automatically detect the existence of new tech…

    Submitted 7 March, 2019; v1 submitted 22 December, 2018; originally announced December 2018.

    Comments: Published in 11th International Conference on Computer and Automation Engineering (ICCAE 2019)

  23. Generating Diverse and Meaningful Captions

    Authors: Annika Lindh, Robert J. Ross, Abhijit Mahalunkar, Giancarlo Salton, John D. Kelleher

    Abstract: Image Captioning is a task that requires models to acquire a multi-modal understanding of the world and to express this understanding in natural language text. While the state-of-the-art for this task has rapidly improved in terms of n-gram metrics, these models tend to output the same generic captions for similar images. In this work, we address this limitation and train a model that generates mo…

    Submitted 19 December, 2018; originally announced December 2018.

    Comments: Accepted for presentation at The 27th International Conference on Artificial Neural Networks (ICANN 2018)

    Journal ref: Artificial Neural Networks and Machine Learning - ICANN 2018 (pp. 176-187). Springer International Publishing

  24. arXiv:1810.06695  [pdf, other]

    cs.CL cs.LG stat.ML

    Exploring the Use of Attention within an Neural Machine Translation Decoder States to Translate Idioms

    Authors: Giancarlo D. Salton, Robert J. Ross, John D. Kelleher

    Abstract: Idioms pose problems to almost all Machine Translation systems. This type of language is very frequent in day-to-day language use and cannot be simply ignored. The recent interest in memory augmented models in the field of Language Modelling has aided the systems to achieve good results by bridging long-distance dependencies. In this paper we explore the use of such techniques into a Neural Machin…

    Submitted 10 October, 2018; originally announced October 2018.

  25. arXiv:1810.04437  [pdf, other]

    cs.LG stat.ML

    Persistence pays off: Paying Attention to What the LSTM Gating Mechanism Persists

    Authors: Giancarlo D. Salton, John D. Kelleher

    Abstract: Language Models (LMs) are important components in several Natural Language Processing systems. Recurrent Neural Network LMs composed of LSTM units, especially those augmented with an external memory, have achieved state-of-the-art results. However, these models still struggle to process long sequences which are more likely to contain long-distance dependencies because of information fading and a b…

    Submitted 10 October, 2018; originally announced October 2018.

  26. arXiv:1810.02966  [pdf, other]

    cs.LG stat.ML

    Understanding Recurrent Neural Architectures by Analyzing and Synthesizing Long Distance Dependencies in Benchmark Sequential Datasets

    Authors: Abhijit Mahalunkar, John D. Kelleher

    Abstract: In order to build efficient deep recurrent neural architectures, it is essential to analyze the complexity of long distance dependencies (LDDs) of the dataset being modeled. In this paper, we present detailed analysis of the dependency decay curve exhibited by various datasets. The datasets sampled from a similar process (e.g. natural language, sequential MNIST, Strictly k-Piecewise languages, etc.) dis…

    Submitted 8 December, 2020; v1 submitted 6 October, 2018; originally announced October 2018.

  27. Using Regular Languages to Explore the Representational Capacity of Recurrent Neural Architectures

    Authors: Abhijit Mahalunkar, John D. Kelleher

    Abstract: The presence of Long Distance Dependencies (LDDs) in sequential data poses significant challenges for computational models. Various recurrent neural architectures have been designed to mitigate this issue. In order to test these state-of-the-art architectures, there is growing need for rich benchmarking datasets. However, one of the drawbacks of existing datasets is the lack of experimental contro…

    Submitted 15 August, 2018; originally announced August 2018.

    Comments: International Conference of Artificial Neural Networks (ICANN) 2018

  28. arXiv:1807.11386  [pdf, other]

    cs.IT physics.soc-ph

    On the Inability of Markov Models to Capture Criticality in Human Mobility

    Authors: Vaibhav Kulkarni, Abhijit Mahalunkar, Benoit Garbinato, John D. Kelleher

    Abstract: We examine the non-Markovian nature of human mobility by exposing the inability of Markov models to capture criticality in human mobility. In particular, the assumed Markovian nature of mobility was used to establish a theoretical upper bound on the predictability of human mobility (expressed as a minimum error probability limit), based on temporally correlated entropy. Since its inception, this b…

    Submitted 27 July, 2018; originally announced July 2018.

  29. arXiv:1807.09844  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE stat.ML

    Modular Mechanistic Networks: On Bridging Mechanistic and Phenomenological Models with Deep Neural Networks in Natural Language Processing

    Authors: Simon Dobnik, John D. Kelleher

    Abstract: Natural language processing (NLP) can be done using either top-down (theory driven) or bottom-up (data driven) approaches, which we call mechanistic and phenomenological respectively. The approaches are frequently considered to stand in opposition to each other. Examining some recent approaches in deep learning we argue that deep neural networks incorporate both perspectives and, furthermore, tha…

    Submitted 23 March, 2019; v1 submitted 21 July, 2018; originally announced July 2018.

    Comments: 18 pages, 1 figure, Appears in CLASP Papers in Computational Linguistics Vol. 1: Proceedings of the Conference on Logic and Machine Learning in Natural Language (LaML 2017)

    Journal ref: CLASP Papers in Computational Linguistics Vol. 1: Proceedings of the Conference on Logic and Machine Learning in Natural Language (LaML 2017). ISSN: 2002-9764. URI: http://hdl.handle.net/2077/54911

  30. arXiv:1807.08133  [pdf, other]

    cs.LG cs.AI cs.CL cs.NE stat.ML

    What is not where: the challenge of integrating spatial representations into deep learning architectures

    Authors: John D. Kelleher, Simon Dobnik

    Abstract: This paper examines to what degree current deep learning architectures for image caption generation capture spatial language. On the basis of the evaluation of examples of generated captions from the literature we argue that systems capture what objects are in the image data but not where these objects are located: the captions generated by these systems are the output of a language model conditio…

    Submitted 21 July, 2018; originally announced July 2018.

    Comments: 15 pages, 10 figures, Appears in CLASP Papers in Computational Linguistics Vol 1: Proceedings of the Conference on Logic and Machine Learning in Natural Language (LaML 2017), pp. 41-52

  31. arXiv:1807.06998  [pdf, other]

    cs.CL

    Is it worth it? Budget-related evaluation metrics for model selection

    Authors: Filip Klubička, Giancarlo D. Salton, John D. Kelleher

    Abstract: Creating a linguistic resource is often done by using a machine learning model that filters the content that goes through to a human annotator, before going into the final resource. However, budgets are often limited, and the amount of available data exceeds the amount of affordable annotation. In order to optimize the benefit from the invested human work, we argue that deciding on which model one…

    Submitted 18 July, 2018; originally announced July 2018.

    Comments: 7 pages, 1 figure, 5 tables, In proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)