Skip to main content

Showing 1–20 of 20 results for author: FitzGerald, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.04670  [pdf, other

    cs.CL cs.AI

    MATTER: Memory-Augmented Transformer Using Heterogeneous Knowledge Sources

    Authors: Dongkyu Lee, Chandana Satya Prakash, Jack FitzGerald, Jens Lehmann

    Abstract: Leveraging external knowledge is crucial for achieving high performance in knowledge-intensive tasks, such as question answering. The retrieve-and-read approach is widely adopted for integrating external knowledge into a language model. However, this approach suffers from increased computational cost and latency due to the long context length, which grows proportionally with the number of retrieve… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: ACL2024-Findings

  2. arXiv:2402.16619  [pdf

    eess.IV cs.CV physics.med-ph

    Magnetic resonance delta radiomics to track radiation response in lung tumors receiving stereotactic MRI-guided radiotherapy

    Authors: Yining Zha, Benjamin H. Kann, Zezhong Ye, Anna Zapaishchykova, John He, Shu-Hui Hsu, Jonathan E. Leeman, Kelly J. Fitzgerald, David E. Kozono, Raymond H. Mak, Hugo J. W. L. Aerts

    Abstract: Introduction: Lung cancer is a leading cause of cancer-related mortality, and stereotactic body radiotherapy (SBRT) has become a standard treatment for early-stage lung cancer. However, the heterogeneous response to radiation at the tumor level poses challenges. Currently, standardized dosage regimens lack adaptation based on individual patient or tumor characteristics. Thus, we explore the potent… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  3. arXiv:2305.11759  [pdf, other

    cs.CL cs.AI

    Controlling the Extraction of Memorized Data from Large Language Models via Prompt-Tuning

    Authors: Mustafa Safa Ozdayi, Charith Peris, Jack FitzGerald, Christophe Dupuy, Jimit Majmudar, Haidar Khan, Rahil Parikh, Rahul Gupta

    Abstract: Large Language Models (LLMs) are known to memorize significant portions of their training data. Parts of this memorized content have been shown to be extractable by simply querying the model, which poses a privacy risk. We present a novel approach which uses prompt-tuning to control the extraction rates of memorized content in LLMs. We present two prompt training strategies to increase and decreas… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: 5 pages, 3 Figures, ACL 2023

  4. arXiv:2212.06346  [pdf, other

    cs.CL

    The Massively Multilingual Natural Language Understanding 2022 (MMNLU-22) Workshop and Competition

    Authors: Christopher Hench, Charith Peris, Jack FitzGerald, Kay Rottmann

    Abstract: Despite recent progress in Natural Language Understanding (NLU), the creation of multilingual NLU systems remains a challenge. It is common to have NLU systems limited to a subset of languages due to lack of available data. They also often vary widely in performance. We launch a three-phase approach to address the limitations in NLU and help propel NLU technology to new heights. We release a 52 la… ▽ More

    Submitted 12 December, 2022; originally announced December 2022.

    Comments: 5 pages

    Journal ref: Proceedings of the Massively Multilingual Natural Language Understanding Workshop (MMNLU-22), pages 83 - 87 December 7, 2022, copyright 2022 Association for Computational Linguistics

  5. arXiv:2208.01448  [pdf, other

    cs.CL cs.LG

    AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model

    Authors: Saleh Soltan, Shankar Ananthakrishnan, Jack FitzGerald, Rahul Gupta, Wael Hamza, Haidar Khan, Charith Peris, Stephen Rawls, Andy Rosenbaum, Anna Rumshisky, Chandana Satya Prakash, Mukund Sridhar, Fabian Triefenbach, Apurv Verma, Gokhan Tur, Prem Natarajan

    Abstract: In this work, we demonstrate that multilingual large-scale sequence-to-sequence (seq2seq) models, pre-trained on a mixture of denoising and Causal Language Modeling (CLM) tasks, are more efficient few-shot learners than decoder-only models on various tasks. In particular, we train a 20 billion parameter multilingual seq2seq model called Alexa Teacher Model (AlexaTM 20B) and show that it achieves s… ▽ More

    Submitted 3 August, 2022; v1 submitted 2 August, 2022; originally announced August 2022.

  6. arXiv:2206.07808  [pdf, other

    cs.CL cs.AI cs.LG

    Alexa Teacher Model: Pretraining and Distilling Multi-Billion-Parameter Encoders for Natural Language Understanding Systems

    Authors: Jack FitzGerald, Shankar Ananthakrishnan, Konstantine Arkoudas, Davide Bernardi, Abhishek Bhagia, Claudio Delli Bovi, ** Cao, Rakesh Chada, Amit Chauhan, Luoxin Chen, Anurag Dwarakanath, Satyam Dwivedi, Turan Gojayev, Karthik Gopalakrishnan, Thomas Gueudre, Dilek Hakkani-Tur, Wael Hamza, Jonathan Hueser, Kevin Martin Jose, Haidar Khan, Beiye Liu, Jianhua Lu, Alessandro Manzotti, Pradeep Natarajan, Karolina Owczarzak , et al. (16 additional authors not shown)

    Abstract: We present results from a large-scale experiment on pretraining encoders with non-embedding parameter counts ranging from 700M to 9.3B, their subsequent distillation into smaller models ranging from 17M-170M parameters, and their application to the Natural Language Understanding (NLU) component of a virtual assistant system. Though we train using 70% spoken-form data, our teacher models perform co… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

    Comments: KDD 2022

    ACM Class: I.2.7

    Journal ref: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '22), August 14-18, 2022, Washington, DC, USA

  7. arXiv:2204.08582  [pdf, other

    cs.CL cs.AI cs.LG

    MASSIVE: A 1M-Example Multilingual Natural Language Understanding Dataset with 51 Typologically-Diverse Languages

    Authors: Jack FitzGerald, Christopher Hench, Charith Peris, Scott Mackie, Kay Rottmann, Ana Sanchez, Aaron Nash, Liam Urbach, Vishesh Kakarala, Richa Singh, Swetha Ranganath, Laurie Crist, Misha Britan, Wouter Leeuwis, Gokhan Tur, Prem Natarajan

    Abstract: We present the MASSIVE dataset--Multilingual Amazon Slu resource package (SLURP) for Slot-filling, Intent classification, and Virtual assistant Evaluation. MASSIVE contains 1M realistic, parallel, labeled virtual assistant utterances spanning 51 languages, 18 domains, 60 intents, and 55 slots. MASSIVE was created by tasking professional translators to localize the English-only SLURP dataset into 5… ▽ More

    Submitted 17 June, 2022; v1 submitted 18 April, 2022; originally announced April 2022.

    Comments: Preprint; 8 pages

  8. arXiv:2105.06824  [pdf

    cs.NE q-bio.NC

    Multi-Objective Optimisation of Cortical Spiking Neural Networks With Genetic Algorithms

    Authors: James Fitzgerald, KongFatt Wong-Lin

    Abstract: Spiking neural networks (SNNs) communicate through the all-or-none spiking activity of neurons. However, fitting the large number of SNN model parameters to observed neural activity patterns, for example, in biological experiments, remains a challenge. Previous work using genetic algorithm (GA) optimisation on a specific efficient SNN model, using the Izhikevich neuronal model, was limited to a si… ▽ More

    Submitted 14 May, 2021; originally announced May 2021.

    Comments: In: 32nd Irish Signals and Systems Conference (ISSC) 2021

  9. Is academia becoming more localised? The growth of regional knowledge networks within international research collaboration

    Authors: John Fitzgerald, Sanna Ojanperä, Neave O'Clery

    Abstract: It is well-established that the process of learning and capability building is core to economic development and structural transformation. Since knowledge is `sticky', a key component of this process is learning-by-doing, which can be achieved via a variety of mechanisms including international research collaboration. Uncovering significant inter-country research ties using Scopus co-authorship da… ▽ More

    Submitted 1 June, 2021; v1 submitted 23 January, 2021; originally announced January 2021.

    Comments: 28 pages, 7 figures, accepted to Applied Network Science

    Journal ref: Appl Netw Sci 6, 38 (2021)

  10. arXiv:2101.07261  [pdf, other

    cs.SE

    Proceedings of the 18th International Overture Workshop

    Authors: John Fitzgerald, Tomohiro Oda, Hugo Daniel Macedo

    Abstract: This volume contains the papers presented at the 18th International Overture Workshop, held online on 7th December 2020. This event was the latest in a series of workshops around the Vienna Development Method (VDM), the open-source project Overture, and related tools and formalisms. VDM is one of the longest established formal methods for systems development. A lively community of researchers and… ▽ More

    Submitted 19 January, 2021; originally announced January 2021.

  11. arXiv:2010.02600  [pdf, other

    cs.CL cs.AI

    Converting the Point of View of Messages Spoken to Virtual Assistants

    Authors: Isabelle G. Lee, Vera Zu, Sai Srujana Buddi, Dennis Liang, Purva Kulkarni, Jack G. M. Fitzgerald

    Abstract: Virtual Assistants can be quite literal at times. If the user says "tell Bob I love him," most virtual assistants will extract the message "I love him" and send it to the user's contact named Bob, rather than properly converting the message to "I love you." We designed a system to allow virtual assistants to take a voice message from one user, convert the point of view of the message, and then del… ▽ More

    Submitted 7 October, 2020; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: 10 pages, 11 figures, Findings of EMNLP 2020

  12. arXiv:2010.00760  [pdf, other

    cs.CL

    STIL -- Simultaneous Slot Filling, Translation, Intent Classification, and Language Identification: Initial Results using mBART on MultiATIS++

    Authors: Jack G. M. FitzGerald

    Abstract: Slot-filling, Translation, Intent classification, and Language identification, or STIL, is a newly-proposed task for multilingual Natural Language Understanding (NLU). By performing simultaneous slot filling and translation into a single output language (English in this case), some portion of downstream system components can be monolingual, reducing development and maintenance cost. Results are gi… ▽ More

    Submitted 1 October, 2020; originally announced October 2020.

    Comments: 4 pages; To be published at AACL 2020; For code, see: https://github.com/jgmfitz/stil-mbart-multiatispp-aacl2020

    Journal ref: Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing (2020) 576-581

  13. arXiv:2007.00741  [pdf

    physics.optics cs.CV eess.IV physics.app-ph

    Deep learning-based holographic polarization microscopy

    Authors: Tairan Liu, Kevin de Haan, Bijie Bai, Yair Rivenson, Yi Luo, Hongda Wang, David Karalli, Hongxiang Fu, Yibo Zhang, John FitzGerald, Aydogan Ozcan

    Abstract: Polarized light microscopy provides high contrast to birefringent specimen and is widely used as a diagnostic tool in pathology. However, polarization microscopy systems typically operate by analyzing images collected from two or more light paths in different states of polarization, which lead to relatively complex optical designs, high system costs or experienced technicians being required. Here,… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

    Comments: 20 pages, 8 figures

    Journal ref: ACS Photonics (2020)

  14. A Cloud-Based Collaboration Platform for Model-Based Design of Cyber-Physical Systems

    Authors: Peter Gorm Larsen, Hugo Daniel Macedo, John Fitzgerald, Holger Pfeifer, Martin Benedikt, Stefano Tonetta, Angelo Marguglio, Sergio Gusmeroli, George Suciu Jr

    Abstract: Businesses, particularly small and medium-sized enterprises, aiming to start up in Model-Based Design (MBD) face difficult choices from a wide range of methods, notations and tools before making the significant investments in planning, procurement and training necessary to deploy new approaches successfully. In the development of Cyber-Physical Systems (CPSs) this is exacerbated by the diversity o… ▽ More

    Submitted 5 May, 2020; originally announced May 2020.

  15. arXiv:1909.02893  [pdf, other

    cs.CR cs.FL math.CT

    Map** finite state machines to zk-SNARKS Using Category Theory

    Authors: Fabrizio Genovese, Andre Knispel, Joshua Fitzgerald

    Abstract: We provide a categorical procedure to turn graphs corresponding to state spaces of finite state machines into boolean circuits, leveraging on the fact that boolean circuits can be easily turned into zk-SNARKS. Our circuits verify that a given sequence of edges and nodes is indeed a path in the graph they represent. We then generalize to circuits verifying paths in arbitrary graphs. We prove that a… ▽ More

    Submitted 14 September, 2019; v1 submitted 6 September, 2019; originally announced September 2019.

    Comments: 18 pages total, 10 pages body, 2 pages addendum, 5 pages appendix, 36 figures, 6 tables

  16. Modelling System of Systems Interface Contract Behaviour

    Authors: Oldrich Faldik, Richard Payne, John Fitzgerald, Barbora Buhnova

    Abstract: A key challenge in System of Systems (SoS) engineering is the analysis and maintenance of global properties under SoS evolution, and the integration of new constituent elements. There is a need to model the constituent systems composing a SoS in order to allow the analysis of emergent behaviours at the SoS boundary. The Contract pattern allows the engineer to specify constrained behaviours to whic… ▽ More

    Submitted 20 March, 2017; originally announced March 2017.

    Comments: In Proceedings FESCA 2017, arXiv:1703.06590

    Journal ref: EPTCS 245, 2017, pp. 1-15

  17. arXiv:1404.7778  [pdf, other

    cs.SE

    SoS Fault Modelling at the Architectural Level in an Emergency Response Case Study

    Authors: Claire Ingram, Steve Riddle, John Fitzgerald, Sakina A. H. J. Al-Lawati, Afra Alrbaiyan

    Abstract: Systems of systems (SoSs) are particularly vulnerable to faults and other threats to their dependability, but frequently inhabit domains that demand high levels of dependability. For this reason fault tolerance analysis is important in SoS engineering. The COMPASS project has previously proposed a Fault Tolerance Architecture Framework (FMAF), consisting of a collection of viewpoints that support… ▽ More

    Submitted 7 May, 2014; v1 submitted 30 April, 2014; originally announced April 2014.

    Comments: EDCC-2014, EDSoS-2014

  18. Proceedings Third Workshop on Formal Aspects of Virtual Organisations

    Authors: Jeremy Bryans, John Fitzgerald

    Abstract: This volume contains the proceedings of the 3rd International Workshop on Formal Aspects of Virtual Organisations (FAVO 2011). The workshop was held in Sao Paulo, Brazil on October 18th, 2011 as a satellite event to the 12th IFIP Working Conference on Virtual Enterprises (PRO-VE'11). The FAVO workshop aims to provide a forum for researchers interested in the application of formal techniques in… ▽ More

    Submitted 25 April, 2012; originally announced April 2012.

    Journal ref: EPTCS 83, 2012

  19. Proceedings Second Workshop on Formal Aspects of Virtual Organisations

    Authors: Jeremy Bryans, John Fitzgerald

    Abstract: FAVO2009 was the second workshop on Formal Aspects of Virtual Organisations. The purpose of the FAVO workshops is to encourage an active community of researchers and practitioners using formal methods in the research and development of Virtual Organisations.

    Submitted 28 January, 2010; originally announced January 2010.

    Journal ref: EPTCS 16, 2010

  20. Common Representation of Information Flows for Dynamic Coalitions

    Authors: Igor Mozolevsky, John Fitzgerald

    Abstract: We propose a formal foundation for reasoning about access control policies within a Dynamic Coalition, defining an abstraction over existing access control models and providing mechanisms for translation of those models into information-flow domain. The abstracted information-flow domain model, called a Common Representation, can then be used for defining a way to control the evolution of Dynami… ▽ More

    Submitted 25 January, 2010; originally announced January 2010.

    Journal ref: EPTCS 16, 2010, pp. 15-25