Showing 1–50 of 76 results for author: Frank, A

Searching in archive cs.
  1. arXiv:2404.18624  [pdf, other]

    cs.CL cs.AI cs.CV cs.LG

    Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Explanations?

    Authors: Letitia Parcalabescu, Anette Frank

    Abstract: Vision and language model (VLM) decoders are currently the best-performing architectures on multimodal tasks. Next to predictions, they can also produce explanations, either in post-hoc or CoT settings. However, it is not clear how much they use the vision and text modalities when generating predictions or explanations. In this work, we investigate if VLMs rely on modalities differently when they…

    Submitted 10 June, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: 25 pages, 8 figures, 11 tables

    MSC Class: 68Txx ACM Class: I.2.7; I.2.10

  2. arXiv:2404.10570  [pdf, other]

    cs.CY

    PAKT: Perspectivized Argumentation Knowledge Graph and Tool for Deliberation Analysis (with Supplementary Materials)

    Authors: Moritz Plenz, Philipp Heinisch, Anette Frank, Philipp Cimiano

    Abstract: Deliberative processes play a vital role in shaping opinions, decisions and policies in our society. In contrast to persuasive debates, deliberation aims to foster understanding of conflicting perspectives among interested parties. The exchange of arguments in deliberation serves to elucidate viewpoints, to raise awareness of conflicting interests, and to finally converge on a resolution. To bette…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Accepted at the 1st International Conference on Robust Argumentation Machines (RATIO24); 18 pages and 13 pages supplementary materials

    ACM Class: E.1; E.2; H.3; I.2; J.4

  3. arXiv:2403.13369  [pdf, other]

    cs.CL cs.AI cs.LG

    Clinical information extraction for Low-resource languages with Few-shot learning using Pre-trained language models and Prompting

    Authors: Phillip Richter-Pechanski, Philipp Wiesenbach, Dominic M. Schwab, Christina Kiriakou, Nicolas Geis, Christoph Dieterich, Anette Frank

    Abstract: Automatic extraction of medical information from clinical documents poses several challenges: high costs of required clinical expertise, limited interpretability of model predictions, restricted computational resources and privacy regulations. Recent advances in domain-adaptation and prompting methods showed promising results with minimal training data using lightweight masked language models, whi…

    Submitted 20 March, 2024; originally announced March 2024.

  4. arXiv:2403.04400  [pdf, other]

    cs.CL

    Exploring Continual Learning of Compositional Generalization in NLI

    Authors: Xiyan Fu, Anette Frank

    Abstract: Compositional Natural Language Inference has been explored to assess the true abilities of neural models to perform NLI. Yet, current evaluations assume models to have full access to all primitive inferences in advance, in contrast to humans that continuously acquire inference knowledge. In this paper, we introduce the Continual Compositional Generalization in Inference (C2Gen NLI) challenge, wher…

    Submitted 7 March, 2024; originally announced March 2024.

  5. arXiv:2401.07105  [pdf, other]

    cs.CL cs.AI cs.LG

    Graph Language Models

    Authors: Moritz Plenz, Anette Frank

    Abstract: While Language Models (LMs) are the workhorses of NLP, their interplay with structured knowledge graphs (KGs) is still actively researched. Current methods for encoding such graphs typically either (i) linearize them for embedding with LMs -- which underutilize structural information, or (ii) use Graph Neural Networks (GNNs) to preserve the graph structure -- but GNNs cannot represent text feature…

    Submitted 3 June, 2024; v1 submitted 13 January, 2024; originally announced January 2024.

    Comments: Accepted at ACL 2024. 9 pages, 10 figures, 9 tables

    ACM Class: I.2.0; I.2.4; I.2.7

  6. arXiv:2311.07466  [pdf, other]

    cs.CL cs.AI cs.LG

    On Measuring Faithfulness or Self-consistency of Natural Language Explanations

    Authors: Letitia Parcalabescu, Anette Frank

    Abstract: Large language models (LLMs) can explain their predictions through post-hoc or Chain-of-Thought (CoT) explanations. But an LLM could make up reasonably sounding explanations that are unfaithful to its underlying reasoning. Recent work has designed tests that aim to judge the faithfulness of post-hoc or CoT explanations. In this work we argue that these faithfulness tests do not measure faithfulnes…

    Submitted 1 June, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Paper accepted for publication at ACL 2024 Main (Bangkok, Thailand); 10 main paper pages, 30 appendix pages

    MSC Class: 68Txx ACM Class: I.2.7; I.2.10

  7. arXiv:2311.07022  [pdf, other]

    cs.CL cs.AI cs.CV

    ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models

    Authors: Ilker Kesen, Andrea Pedrotti, Mustafa Dogan, Michele Cafagna, Emre Can Acikgoz, Letitia Parcalabescu, Iacer Calixto, Anette Frank, Albert Gatt, Aykut Erdem, Erkut Erdem

    Abstract: With the ever-increasing popularity of pretrained Video-Language Models (VidLMs), there is a pressing need to develop robust evaluation methodologies that delve deeper into their visio-linguistic capabilities. To address this challenge, we present ViLMA (Video Language Model Assessment), a task-agnostic benchmark that places the assessment of fine-grained capabilities of these models on a firm foo…

    Submitted 12 November, 2023; originally announced November 2023.

    Comments: Preprint. 48 pages, 22 figures, 10 tables

  8. arXiv:2309.07624  [pdf, other]

    cs.CL

    Dynamic MOdularized Reasoning for Compositional Structured Explanation Generation

    Authors: Xiyan Fu, Anette Frank

    Abstract: Despite the success of neural models in solving reasoning tasks, their compositional generalization capabilities remain unclear. In this work, we propose a new setting of the structured explanation generation task to facilitate compositional reasoning research. Previous works found that symbolic methods achieve superior compositionality by using pre-defined inference rules for iterative reasoning.…

    Submitted 14 September, 2023; originally announced September 2023.

  9. arXiv:2308.12008  [pdf, other]

    cs.CL

    Graecia capta ferum victorem cepit. Detecting Latin Allusions to Ancient Greek Literature

    Authors: Frederick Riemenschneider, Anette Frank

    Abstract: Intertextual allusions hold a pivotal role in Classical Philology, with Latin authors frequently referencing Ancient Greek texts. Until now, the automatic identification of these intertextual references has been constrained to monolingual approaches, seeking parallels solely within Latin or Greek texts. In this study, we introduce SPhilBERTa, a trilingual Sentence-RoBERTa model tailored for Classi…

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: Paper accepted for publication at the First Workshop on Ancient Language Processing (ALP) 2023; 9 pages, 5 tables

    ACM Class: I.2.7

  10. arXiv:2306.00936  [pdf, other]

    cs.CL cs.IR

    AMR4NLI: Interpretable and robust NLI measures from semantic graphs

    Authors: Juri Opitz, Shira Wein, Julius Steen, Anette Frank, Nathan Schneider

    Abstract: The task of natural language inference (NLI) asks whether a given premise (expressed in NL) entails a given NL hypothesis. NLI benchmarks contain human ratings of entailment, but the meaning relationships driving these ratings are not formalized. Can the underlying sentence pair relationships be made more explicit in an interpretable yet robust fashion? We compare semantic structures to represent…

    Submitted 5 September, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: International Conference on Computational Semantics (IWCS 2023); v2 fixes an imprecise sentence below Eq. 5

  11. arXiv:2305.16819  [pdf, other]

    cs.CL

    With a Little Push, NLI Models can Robustly and Efficiently Predict Faithfulness

    Authors: Julius Steen, Juri Opitz, Anette Frank, Katja Markert

    Abstract: Conditional language models still generate unfaithful output that is not supported by their input. These unfaithful generations jeopardize trust in real-world applications such as summarization or human-machine interaction, motivating a need for automatic faithfulness metrics. To implement such metrics, NLI models seem attractive, since they solve a strongly related task that comes with a wealth o…

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: ACL 2023 (short paper)

  12. arXiv:2305.15045  [pdf, other]

    cs.CL

    SETI: Systematicity Evaluation of Textual Inference

    Authors: Xiyan Fu, Anette Frank

    Abstract: We propose SETI (Systematicity Evaluation of Textual Inference), a novel and comprehensive benchmark designed for evaluating pre-trained language models (PLMs) for their systematicity capabilities in the domain of textual inference. Specifically, SETI offers three different NLI tasks and corresponding datasets to evaluate various types of systematicity in reasoning processes. In order to solve the…

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted to Findings of ACL2023

  13. arXiv:2305.13698  [pdf, other]

    cs.CL

    Exploring Large Language Models for Classical Philology

    Authors: Frederick Riemenschneider, Anette Frank

    Abstract: Recent advances in NLP have led to the creation of powerful language models for many languages including Ancient Greek and Latin. While prior work on Classical languages unanimously uses BERT, in this work we create four language models for Ancient Greek that vary along two dimensions to study their versatility for tasks of interest for Classical languages: we explore (i) encoder-only and encoder-…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: Paper accepted for publication at ACL 2023 Main; 10 pages, 7 appendix pages, 4 figures, 13 tables

    ACM Class: I.2.7

  14. arXiv:2305.08495  [pdf, other]

    cs.CL cs.DB

    Similarity-weighted Construction of Contextualized Commonsense Knowledge Graphs for Knowledge-intense Argumentation Tasks

    Authors: Moritz Plenz, Juri Opitz, Philipp Heinisch, Philipp Cimiano, Anette Frank

    Abstract: Arguments often do not make explicit how a conclusion follows from its premises. To compensate for this lack, we enrich arguments with structured background knowledge to support knowledge-intense argumentation tasks. We present a new unsupervised method for constructing Contextualized Commonsense Knowledge Graphs (CCKGs) that selects contextually relevant knowledge from large knowledge graphs (KGs…

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023

  15. arXiv:2304.03286  [pdf, other]

    cond-mat.stat-mech cs.IT nlin.CG physics.bio-ph q-bio.QM

    Semantic Information in a model of Resource Gathering Agents

    Authors: Damian R Sowinski, Jonathan Carroll-Nellenback, Robert N Markwick, Jordi Piñero, Marcelo Gleiser, Artemy Kolchinsky, Gourab Ghoshal, Adam Frank

    Abstract: We explore the application of a new theory of Semantic Information to the well-motivated problem of a resource foraging agent. Semantic information is defined as the subset of correlations, measured via the transfer entropy, between agent $A$ and environment $E$ that is necessary for the agent to maintain its viability $V$. Viability, in turn, is endogenously defined as opposed to the use of exoge…

    Submitted 17 October, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

    Comments: 17 pages, 10 figures, 5 appendices

    Journal ref: PRX Life 1, 023003 (2023)

  16. MM-SHAP: A Performance-agnostic Metric for Measuring Multimodal Contributions in Vision and Language Models & Tasks

    Authors: Letitia Parcalabescu, Anette Frank

    Abstract: Vision and language models (VL) are known to exploit unrobust indicators in individual modalities (e.g., introduced by distributional biases) instead of focusing on relevant information in each modality. That a unimodal model achieves similar accuracy on a VL task to a multimodal one, indicates that so-called unimodal collapse occurred. However, accuracy-based tests fail to detect e.g., when the m…

    Submitted 23 May, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: Paper accepted for publication at ACL 2023 Main (Toronto); 10 pages, 14 appendix pages, 11 figures, 3 tables

    MSC Class: 68Txx ACM Class: I.2.7; I.2.10

  17. What Pronouns for Pepper? A Critical Review of Gender/ing in Research

    Authors: Katie Seaborn, Alexa Frank

    Abstract: Gender/ing guides how we view ourselves, the world around us, and each other--including non-humans. Critical voices have raised the alarm about stereotyped gendering in the design of socially embodied artificial agents like voice assistants, conversational agents, and robots. Yet, little is known about how this plays out in research and to what extent. As a first step, we critically reviewed the c…

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: Accepted at CHI '22

    Journal ref: Proc. SIGCHI Conf. Hum. Factor Comput. Syst. (2022), Article No. 239, 1-15

  18. arXiv:2210.06461  [pdf, other]

    cs.CL cs.AI

    Better Smatch = Better Parser? AMR evaluation is not so simple anymore

    Authors: Juri Opitz, Anette Frank

    Abstract: Recently, astonishing advances have been observed in AMR parsing, as measured by the structural Smatch metric. In fact, today's systems achieve performance levels that seem to surpass estimates of human inter annotator agreement (IAA). Therefore, it is unclear how well Smatch (still) relates to human estimates of parse quality, as in this situation potentially fine-grained errors of similar weight…

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: accepted at "Evaluation and Comparison of NLP Systems" Workshop (Eval4NLP 2022)

  19. Automatic differentiation and the optimization of differential equation models in biology

    Authors: Steven A. Frank

    Abstract: A computational revolution unleashed the power of artificial neural networks. At the heart of that revolution is automatic differentiation, which calculates the derivative of a performance measure relative to a large number of parameters. Differentiation enhances the discovery of improved performance in large models, an achievement that was previously difficult or impossible. Recently, a second co…

    Submitted 11 October, 2022; v1 submitted 10 July, 2022; originally announced July 2022.

  20. arXiv:2206.07023  [pdf, other]

    cs.CL cs.AI

    SBERT studies Meaning Representations: Decomposing Sentence Embeddings into Explainable Semantic Features

    Authors: Juri Opitz, Anette Frank

    Abstract: Models based on large-pretrained language models, such as S(entence)BERT, provide effective and efficient sentence embeddings that show high correlation to human similarity ratings, but lack interpretability. On the other hand, graph metrics for graph-based meaning representations (e.g., Abstract Meaning Representation, AMR) can make explicit the semantic aspects in which two sentences are similar…

    Submitted 28 October, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: to appear in AACL 2022 (main)

  21. arXiv:2205.12176  [pdf, other]

    cs.CL

    A Dynamic, Interpreted CheckList for Meaning-oriented NLG Metric Evaluation -- through the Lens of Semantic Similarity Rating

    Authors: Laura Zeidler, Juri Opitz, Anette Frank

    Abstract: Evaluating the quality of generated text is difficult, since traditional NLG evaluation metrics, focusing more on surface form than meaning, often fail to assign appropriate scores. This is especially problematic for AMR-to-text evaluation, given the abstract nature of AMR. Our work aims to support the development and improvement of NLG evaluation metrics that focus on meaning, by developing a dyn…

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: to appear in *SEM 2022

  22. arXiv:2204.07833  [pdf, other]

    q-bio.QM cs.LG

    Optimizing differential equations to fit data and predict outcomes

    Authors: Steven A. Frank

    Abstract: Many scientific problems focus on observed patterns of change or on how to design a system to achieve particular dynamics. Those problems often require fitting differential equation models to target trajectories. Fitting such models can be difficult because each evaluation of the fit must calculate the distance between the model and target patterns at numerous points along a trajectory. The gradie…

    Submitted 16 April, 2022; originally announced April 2022.

  23. arXiv:2203.13226  [pdf, other]

    cs.CL

    SMARAGD: Learning SMatch for Accurate and Rapid Approximate Graph Distance

    Authors: Juri Opitz, Philipp Meier, Anette Frank

    Abstract: The similarity of graph structures, such as Meaning Representations (MRs), is often assessed via structural matching algorithms, such as Smatch (Cai and Knight, 2013). However, Smatch involves a combinatorial problem that suffers from NP-completeness, making large-scale applications, e.g., graph clustering or search, infeasible. To alleviate this issue, we learn SMARAGD: Semantic Match for Accurat…

    Submitted 1 June, 2023; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: to appear at 15th International Conference on Computational Semantics (IWCS 2023)

  24. arXiv:2203.10899  [pdf, other]

    astro-ph.EP astro-ph.IM cs.CR physics.pop-ph

    The Case for Technosignatures: Why They May Be Abundant, Long-lived, Highly Detectable, and Unambiguous

    Authors: Jason T. Wright, Jacob Haqq-Misra, Adam Frank, Ravi Kopparapu, Manasvi Lingam, Sofia Z. Sheikh

    Abstract: The intuition suggested by the Drake equation implies that technology should be less prevalent than biology in the galaxy. However, it has been appreciated for decades in the SETI community that technosignatures could be more abundant, longer-lived, more detectable, and less ambiguous than biosignatures. We collect the arguments for and against technosignatures' ubiquity and discuss the implicatio…

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: Published in ApJ Letters

    Journal ref: 2022 ApJL 927 L30

  25. arXiv:2202.00254  [pdf, other]

    cs.CL cs.LG

    Active Learning Over Multiple Domains in Natural Language Tasks

    Authors: Shayne Longpre, Julia Reisler, Edward Greg Huang, Yi Lu, Andrew Frank, Nikhil Ramesh, Chris DuBois

    Abstract: Studies of active learning traditionally assume the target and source data stem from a single domain. However, in realistic applications, practitioners often require active learning with multiple sources of out-of-distribution data, where it is unclear a priori which data sources will help or hurt the target domain. We survey a wide variety of techniques in active learning (AL), domain shift detec…

    Submitted 8 February, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

  26. arXiv:2201.12237  [pdf, other]

    physics.chem-ph cs.DC

    Experiences with managing data parallel computational workflows for High-throughput Fragment Molecular Orbital (FMO) Calculations

    Authors: Dimuthu Wannipurage, Indrajit Deb, Eroma Abeysinghe, Sudhakar Pamidighantam, Suresh Marru, Marlon Pierce, Aaron T. Frank

    Abstract: Fragment Molecular Orbital (FMO) calculations provide a framework to speed up quantum mechanical calculations and so can be used to explore structure-energy relationships in large and complex biomolecular systems. These calculations are still onerous, especially when applied to large sets of molecules. Therefore, cyberinfrastructure that provides mechanisms and user interfaces that manage job subm…

    Submitted 28 January, 2022; originally announced January 2022.

  27. arXiv:2201.04642  [pdf, other]

    physics.soc-ph cs.IT physics.hist-ph q-bio.NC

    Consensus between Epistemic Agents is Difficult

    Authors: Damian R. Sowinski, Jonathan Carroll-Nellenback, Jeremy M. DeSilva, Adam Frank, Gourab Ghoshal, Marcelo Gleiser, Hari Seldon

    Abstract: We introduce an epistemic information measure between two data streams, that we term $influence$. Closely related to transfer entropy, the measure must be estimated by epistemic agents with finite memory resources via sampling accessible data streams. We show that even under ideal conditions, epistemic agents using slightly different sampling strategies might not achieve consensus in their conclus…

    Submitted 12 January, 2022; originally announced January 2022.

    Comments: 5 figures

  28. VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena

    Authors: Letitia Parcalabescu, Michele Cafagna, Lilitta Muradjan, Anette Frank, Iacer Calixto, Albert Gatt

    Abstract: We propose VALSE (Vision And Language Structured Evaluation), a novel benchmark designed for testing general-purpose pretrained vision and language (V&L) models for their visio-linguistic grounding capabilities on specific linguistic phenomena. VALSE offers a suite of six tests covering various linguistic constructs. Solving these requires models to ground linguistic phenomena in the visual modali…

    Submitted 14 March, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: Paper accepted for publication at ACL 2022 Main; 28 pages, 4 figures, 11 tables

    MSC Class: 68Txx ACM Class: I.2.7; I.2.10

  29. arXiv:2112.05253  [pdf, other]

    cs.CV cs.CL

    MAGMA -- Multimodal Augmentation of Generative Models through Adapter-based Finetuning

    Authors: Constantin Eichenberg, Sidney Black, Samuel Weinbach, Letitia Parcalabescu, Anette Frank

    Abstract: Large-scale pretraining is fast becoming the norm in Vision-Language (VL) modeling. However, prevailing VL approaches are limited by the requirement for labeled data and the use of complex multi-step pretraining objectives. We present MAGMA - a simple method for augmenting generative language models with additional modalities using adapter-based finetuning. Building on Frozen, we train a series of…

    Submitted 24 October, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: 13 pages, 6 figures, 2 tables. Minor improvements. Accepted at EMNLP 2022

    ACM Class: I.2.7; I.4.8; I.5.1

  30. arXiv:2108.11949  [pdf, other]

    cs.CL cs.AI

    Weisfeiler-Leman in the BAMBOO: Novel AMR Graph Metrics and a Benchmark for AMR Graph Similarity

    Authors: Juri Opitz, Angel Daza, Anette Frank

    Abstract: Several metrics have been proposed for assessing the similarity of (abstract) meaning representations (AMRs), but little is known about how they relate to human similarity ratings. Moreover, the current metrics have complementary strengths and weaknesses: some emphasize speed, while others make the alignment of graph structures explicit, at the price of a costly alignment step. In this work we p…

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: to appear in TACL, this is a pre-MIT Press publication version

  31. arXiv:2106.04565  [pdf, other]

    cs.CL cs.AI

    Translate, then Parse! A strong baseline for Cross-Lingual AMR Parsing

    Authors: Sarah Uhrig, Yoalli Rezepka Garcia, Juri Opitz, Anette Frank

    Abstract: In cross-lingual Abstract Meaning Representation (AMR) parsing, researchers develop models that project sentences from various languages onto their AMRs to capture their essential semantic structures: given a sentence in any language, we aim to capture its core semantic content through concepts connected by manifold types of semantic relations. Methods typically leverage large silver training data…

    Submitted 8 June, 2021; originally announced June 2021.

    Comments: IWPT 2021

  32. arXiv:2106.03973  [pdf, other]

    cs.CL cs.AI

    Generating Hypothetical Events for Abductive Inference

    Authors: Debjit Paul, Anette Frank

    Abstract: Abductive reasoning starts from some observations and aims at finding the most plausible explanation for these observations. To perform abduction, humans often make use of temporal and causal inferences, and knowledge about how some hypothetical situation can result in different outcomes. This work offers the first study of how such knowledge impacts the Abductive NLI task -- which consists in cho…

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: Proceedings of The Tenth Joint Conference on Lexical and Computational Semantics (STARSEM 2021)

  33. arXiv:2106.02497  [pdf, other]

    cs.CL cs.AI

    COINS: Dynamically Generating COntextualized Inference Rules for Narrative Story Completion

    Authors: Debjit Paul, Anette Frank

    Abstract: Despite recent successes of large pre-trained language models in solving reasoning tasks, their inference capabilities remain opaque. We posit that such models can be made more interpretable by explicitly generating interim inference rules, and using them to guide the generation of task-specific textual outputs. In this paper we present COINS, a recursive inference framework that i) iteratively re…

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: ACL 2021

  34. arXiv:2105.03157  [pdf, other]

    cs.CL

    CO-NNECT: A Framework for Revealing Commonsense Knowledge Paths as Explicitations of Implicit Knowledge in Texts

    Authors: Maria Becker, Katharina Korfhage, Debjit Paul, Anette Frank

    Abstract: In this work we leverage commonsense knowledge in form of knowledge paths to establish connections between sentences, as a form of explicitation of implicit knowledge. Such connections can be direct (singlehop paths) or require intermediate concepts (multihop paths). To construct such paths we combine two model types in a joint framework we call Co-nnect: a relation classifier that predicts direct…

    Submitted 7 May, 2021; originally announced May 2021.

    Comments: Accepted at IWCS 2021

  35. arXiv:2103.06304  [pdf, other]

    cs.AI cs.CL cs.CV

    What is Multimodality?

    Authors: Letitia Parcalabescu, Nils Trost, Anette Frank

    Abstract: The last years have shown rapid developments in the field of multimodal machine learning, combining e.g., vision, text or speech. In this position paper we explain how the field uses outdated definitions of multimodality that prove unfit for the machine learning era. We propose a new task-relative definition of (multi)modality in the context of multimodal machine learning that focuses on represent…

    Submitted 10 June, 2021; v1 submitted 10 March, 2021; originally announced March 2021.

    Comments: Paper accepted for publication at MMSR 2021; 10 pages, 5 figures

    MSC Class: 68Txx ACM Class: I.2.0; I.2.7; I.2.10

    Journal ref: Proceedings of the 1st Workshop on Multimodal Semantic Representations (MMSR), 2021, Groningen, Netherlands (Online), Association for Computational Linguistics, p. 1--10

  36. arXiv:2012.14094  [pdf, other]

    cs.CL

    Pivot Through English: Reliably Answering Multilingual Questions without Document Retrieval

    Authors: Ivan Montero, Shayne Longpre, Ni Lao, Andrew J. Frank, Christopher DuBois

    Abstract: Existing methods for open-retrieval question answering in lower resource languages (LRLs) lag significantly behind English. They not only suffer from the shortcomings of non-English document retrieval, but are reliant on language-specific supervision for either the task or translation. We formulate a task setup more realistic to available resources, that circumvents document retrieval to reliably…

    Submitted 15 July, 2021; v1 submitted 27 December, 2020; originally announced December 2020.

  37. arXiv:2012.12352  [pdf, other]

    cs.CV cs.CL

    Seeing past words: Testing the cross-modal capabilities of pretrained V&L models on counting tasks

    Authors: Letitia Parcalabescu, Albert Gatt, Anette Frank, Iacer Calixto

    Abstract: We investigate the reasoning ability of pretrained vision and language (V&L) models in two tasks that require multimodal integration: (1) discriminating a correct image-sentence pair from an incorrect one, and (2) counting entities in an image. We evaluate three pretrained V&L models on these tasks: ViLBERT, ViLBERT 12-in-1 and LXMERT, in zero-shot and finetuned settings. Our results show that mod…

    Submitted 17 June, 2021; v1 submitted 22 December, 2020; originally announced December 2020.

    Comments: Paper accepted for publication at MMSR 2021; 13 pages, 3 figures, 7 Tables

    MSC Class: 68Txx ACM Class: I.2.7; I.2.10

    Journal ref: Proceedings of the 1st Workshop on Multimodal Semantic Representations (MMSR), 2021, Groningen, Netherlands (Online), Association for Computational Linguistics, p. 32--44

  38. arXiv:2010.14544  [pdf, other]

    q-bio.PE cs.IT

    The fundamental equations of change in statistical ensembles and biological populations

    Authors: Steven A. Frank, Frank J. Bruggeman

    Abstract: A recent article in Nature Physics unified key results from thermodynamics, statistics, and information theory. The unification arose from a general equation for the rate of change in the information content of a system. The general equation describes the change in the moments of an observable quantity over a probability distribution. One term in the equation describes the change in the probabilit…

    Submitted 27 October, 2020; originally announced October 2020.

  39. arXiv:2010.05587  [pdf, other]

    cs.CL

    Social Commonsense Reasoning with Multi-Head Knowledge Attention

    Authors: Debjit Paul, Anette Frank

    Abstract: Social Commonsense Reasoning requires understanding of text, knowledge about social events and their pragmatic implications, as well as commonsense reasoning skills. In this work we propose a novel multi-head knowledge attention model that encodes semi-structured commonsense inference rules and learns to incorporate them in a transformer-based reasoning cell. We assess the model's performance on t…

    Submitted 12 October, 2020; originally announced October 2020.

    Comments: Findings of EMNLP 2020

  40. arXiv:2010.01998  [pdf, other]

    cs.CL

    X-SRL: A Parallel Cross-Lingual Semantic Role Labeling Dataset

    Authors: Angel Daza, Anette Frank

    Abstract: Even though SRL is researched for many languages, major improvements have mostly been obtained for English, for which more resources are available. In fact, existing multilingual SRL datasets contain disparate annotation styles or come from different domains, hampering generalization in multilingual learning. In this work, we propose a method to automatically construct an SRL corpus that is parall…

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: To be presented at the EMNLP 2020 Conference

  41. arXiv:2008.08896  [pdf, other]

    cs.CL

    Towards a Decomposable Metric for Explainable Evaluation of Text Generation from AMR

    Authors: Juri Opitz, Anette Frank

    Abstract: Systems that generate natural language text from abstract meaning representations such as AMR are typically evaluated using automatic surface matching metrics that compare the generated texts to reference texts from which the input meaning representations were constructed. We show that besides well-known issues from which such metrics suffer, an additional problem arises when applying these metric…

    Submitted 26 January, 2021; v1 submitted 20 August, 2020; originally announced August 2020.

    Comments: EACL 2021

  42. arXiv:2005.10600  [pdf]

    cs.CV cs.AI

    A Neural Network Looks at Leonardo's(?) Salvator Mundi

    Authors: Steven J. Frank, Andrea M. Frank

    Abstract: We use convolutional neural networks (CNNs) to analyze authorship questions surrounding the works of Leonardo da Vinci -- in particular, Salvator Mundi, the world's most expensive painting and among the most controversial. Trained on the works of an artist under study and visually comparable works of other artists, our system can identify likely forgeries and shed light on attribution controversie…

    Submitted 21 May, 2020; originally announced May 2020.

    Comments: This is the author's final version. The article has been accepted for publication in Leonardo (MIT Press)

  43. arXiv:2005.04132  [pdf, other]

    eess.AS cs.SD

    Asteroid: the PyTorch-based audio source separation toolkit for researchers

    Authors: Manuel Pariente, Samuele Cornell, Joris Cosentino, Sunit Sivasankaran, Efthymios Tzinis, Jens Heitkaemper, Michel Olvera, Fabian-Robert Stöter, Mathieu Hu, Juan M. Martín-Doñas, David Ditter, Ariel Frank, Antoine Deleforge, Emmanuel Vincent

    Abstract: This paper describes Asteroid, the PyTorch-based audio source separation toolkit for researchers. Inspired by the most successful neural source separation systems, it provides all neural building blocks required to build such a system. To improve reproducibility, Kaldi-style recipes on common audio source separation datasets are also provided. This paper describes the software architecture of Aste…

    Submitted 8 May, 2020; originally announced May 2020.

    Comments: Submitted to Interspeech 2020

  44. arXiv:2002.05107  [pdf]

    cs.CV

    Analysis of Dutch Master Paintings with Convolutional Neural Networks

    Authors: Steven J. Frank, Andrea M. Frank

    Abstract: Trained on the works of an artist under study and visually comparable works of other artists, convolutional neural networks can identify forgeries and provide attributions. They can also assign classification probabilities within a painting, revealing mixed authorship and identifying regions painted by different hands.

    Submitted 16 August, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

  45. AMR Similarity Metrics from Principles

    Authors: Juri Opitz, Letitia Parcalabescu, Anette Frank

    Abstract: Different metrics have been proposed to compare Abstract Meaning Representation (AMR) graphs. The canonical Smatch metric (Cai and Knight, 2013) aligns the variables of two graphs and assesses triple matches. The recent SemBleu metric (Song and Gildea, 2019) is based on the machine-translation metric Bleu (Papineni et al., 2002) and increases computational efficiency by ablating the variable-align…

    Submitted 17 September, 2020; v1 submitted 29 January, 2020; originally announced January 2020.

    Comments: TACL 2020 https://doi.org/10.1162/tacl_a_00329

  46. arXiv:2001.08398  [pdf, ps, other]

    cs.RO cs.AI

    Socially intelligent task and motion planning for human-robot interaction

    Authors: Andrea Frank, Laurel Riek

    Abstract: As social beings, much human behavior is predicated on social context - the ambient social state that includes cultural norms, social signals, individual preferences, etc. In this paper, we propose a socially-aware task and motion planning algorithm that considers social context to generate appropriate and effective plans in human social environments (HSEs). The key strength of our proposed approa…

    Submitted 23 January, 2020; originally announced January 2020.

    Comments: 2 pages plus references, no figures. Presented at RSS 2019 Workshop on Robust Task and Motion Planning

  47. arXiv:1912.10161  [pdf, other]

    cs.CL

    Implicit Knowledge in Argumentative Texts: An Annotated Corpus

    Authors: Maria Becker, Katharina Korfhage, Anette Frank

    Abstract: When speaking or writing, people omit information that seems clear and evident, such that only part of the message is expressed in words. Especially in argumentative texts it is very common that (important) parts of the argument are implied and omitted. We hypothesize that for argument analysis it will be beneficial to reconstruct this implied information. As a starting point for filling such know…

    Submitted 4 December, 2019; originally announced December 2019.

  48. arXiv:1908.11326  [pdf, other]

    cs.CL cs.LG

    Translate and Label! An Encoder-Decoder Approach for Cross-lingual Semantic Role Labeling

    Authors: Angel Daza, Anette Frank

    Abstract: We propose a Cross-lingual Encoder-Decoder model that simultaneously translates and generates sentences with Semantic Role Labeling annotations in a resource-poor target language. Unlike annotation projection techniques, our model does not need parallel data during inference time. Our approach can be applied in monolingual, multilingual and cross-lingual settings and is able to produce dependency-…

    Submitted 29 August, 2019; originally announced August 2019.

  49. arXiv:1908.10721  [pdf, other]

    cs.CL cs.LG

    Discourse-Aware Semantic Self-Attention for Narrative Reading Comprehension

    Authors: Todor Mihaylov, Anette Frank

    Abstract: In this work, we propose to use linguistic annotations as a basis for a \textit{Discourse-Aware Semantic Self-Attention} encoder that we employ for reading comprehension on long narrative texts. We extract relations between discourse units, events and their arguments as well as coreferring mentions, using available annotation tools. Our empirical evaluation shows that the investigated structures i…

    Submitted 28 August, 2019; originally announced August 2019.

    Comments: Accepted as a long conference paper to EMNLP-IJCNLP 2019

  50. arXiv:1907.12436  [pdf]

    cs.CV cs.LG eess.IV

    Salient Slices: Improved Neural Network Training and Performance with Image Entropy

    Authors: Steven J. Frank, Andrea M. Frank

    Abstract: As a training and analysis strategy for convolutional neural networks (CNNs), we slice images into tiled segments and use, for training and prediction, segments that both satisfy a criterion of information diversity and contain sufficient content to support classification. In particular, we utilize image entropy as the diversity criterion. This ensures that each tile carries as much information di…

    Submitted 4 May, 2020; v1 submitted 29 July, 2019; originally announced July 2019.

    Comments: Final version; article will be published in Neural Computation 32, 1222-1237 (June 2020)