Skip to main content

Showing 1–50 of 53 results for author: Pfeiffer, J

.
  1. arXiv:2407.01091  [pdf, other

    cs.CL

    M2QA: Multi-domain Multilingual Question Answering

    Authors: Leon Engländer, Hannah Sterz, Clifton Poth, Jonas Pfeiffer, Ilia Kuznetsov, Iryna Gurevych

    Abstract: Generalization and robustness to input variation are core desiderata of machine learning research. Language varies along several axes, most importantly, language instance (e.g. French) and domain (e.g. news). While adapting NLP models to new languages within a single domain, or to new domains within a single language, is widely studied, research in joint adaptation is hampered by the lack of evalu… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2403.14549  [pdf

    astro-ph.SR

    Resonant Ion Radiation Scattering and the Integrated Atomic Cross-Section as applied to Binary Star Shock Fronts

    Authors: Raymond J. Pfeiffer

    Abstract: The current literature is rather vague regarding how to calculate the exact numerical value of the resonant ion scattering cross-section that should be used for a specific bandpass of finite width. Such a value was needed in order to calculate the ion and mass densities in the shock fronts of hot, close binary star systems. This was done based on a modeling of ultraviolet wind-line profiles, using… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 11 pages, 4 figures, 4 tables, 16 numbered equations; Derivation of ion scattered stellar flux from binary star shock-fronts; IUE UV light curve fitting; Binary star modelling

  3. arXiv:2311.11077  [pdf, other

    cs.CL cs.AI cs.LG

    Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning

    Authors: Clifton Poth, Hannah Sterz, Indraneil Paul, Sukannya Purkayastha, Leon Engländer, Timo Imhof, Ivan Vulić, Sebastian Ruder, Iryna Gurevych, Jonas Pfeiffer

    Abstract: We introduce Adapters, an open-source library that unifies parameter-efficient and modular transfer learning in large language models. By integrating 10 diverse adapter methods into a unified interface, Adapters offers ease of use and flexible configuration. Our library allows researchers and practitioners to leverage adapter modularity through composition blocks, enabling the design of complex ad… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023: Systems Demonstrations

  4. arXiv:2311.10861  [pdf

    econ.GN

    First, Do No Harm: Algorithms, AI, and Digital Product Liability

    Authors: Marc J. Pfeiffer

    Abstract: The ethical imperative for technology should be first, do no harm. But digital innovations like AI and social media increasingly enable societal harms, from bias to misinformation. As these technologies grow ubiquitous, we need solutions to address unintended consequences. This report proposes a model to incentivize developers to prevent foreseeable algorithmic harms. It does this by expanding neg… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: 22 pages

  5. arXiv:2311.09344  [pdf, other

    cs.CL

    Language and Task Arithmetic with Parameter-Efficient Layers for Zero-Shot Summarization

    Authors: Alexandra Chronopoulou, Jonas Pfeiffer, Joshua Maynez, Xinyi Wang, Sebastian Ruder, Priyanka Agrawal

    Abstract: Parameter-efficient fine-tuning (PEFT) using labeled task data can significantly improve the performance of large language models (LLMs) on the downstream task. However, there are 7000 languages in the world and many of these languages lack labeled data for real-world language generation tasks. In this paper, we propose to improve zero-shot cross-lingual transfer by composing language or task spec… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  6. arXiv:2305.18893  [pdf, other

    cs.CL

    Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation

    Authors: Benjamin Minixhofer, Jonas Pfeiffer, Ivan Vulić

    Abstract: Many NLP pipelines split text into sentences as one of the crucial preprocessing steps. Prior sentence segmentation tools either rely on punctuation or require a considerable amount of sentence-segmented training data: both central assumptions might fail when porting sentence segmenters to diverse languages on a massive scale. In this work, we thus introduce a multilingual punctuation-agnostic sen… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  7. arXiv:2305.14224  [pdf, other

    cs.CL

    mmT5: Modular Multilingual Pre-Training Solves Source Language Hallucinations

    Authors: Jonas Pfeiffer, Francesco Piccinno, Massimo Nicosia, Xinyi Wang, Machel Reid, Sebastian Ruder

    Abstract: Multilingual sequence-to-sequence models perform poorly with increased language coverage and fail to consistently generate text in the correct target language in few-shot settings. To address these challenges, we propose mmT5, a modular multilingual sequence-to-sequence model. mmT5 utilizes language-specific modules during pre-training, which disentangle language-specific information from language… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  8. arXiv:2305.14214  [pdf, other

    cs.CL

    CompoundPiece: Evaluating and Improving Decompounding Performance of Language Models

    Authors: Benjamin Minixhofer, Jonas Pfeiffer, Ivan Vulić

    Abstract: While many languages possess processes of joining two or more words to create compound words, previous studies have been typically limited only to languages with excessively productive compound formation (e.g., German, Dutch) and there is no public dataset containing compound and non-compound words across a large number of languages. In this work, we systematically study decompounding, the task of… ▽ More

    Submitted 23 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  9. arXiv:2304.08865  [pdf, other

    cs.CL cs.LG

    Romanization-based Large-scale Adaptation of Multilingual Language Models

    Authors: Sukannya Purkayastha, Sebastian Ruder, Jonas Pfeiffer, Iryna Gurevych, Ivan Vulić

    Abstract: Large multilingual pretrained language models (mPLMs) have become the de facto state of the art for cross-lingual transfer in NLP. However, their large-scale deployment to many languages, besides pretraining data scarcity, is also hindered by the increase in vocabulary size and limitations in their parameter budget. In order to boost the capacity of mPLMs to deal with low-resource and unseen langu… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: 9 pages, 5 figures

  10. arXiv:2302.11529  [pdf, other

    cs.LG

    Modular Deep Learning

    Authors: Jonas Pfeiffer, Sebastian Ruder, Ivan Vulić, Edoardo Maria Ponti

    Abstract: Transfer learning has recently become the dominant paradigm of machine learning. Pre-trained models fine-tuned for downstream tasks achieve better performance with fewer labelled examples. Nonetheless, it remains unclear how to develop models that specialise towards multiple tasks without incurring negative interference and that generalise systematically to non-identically distributed tasks. Modul… ▽ More

    Submitted 27 January, 2024; v1 submitted 22 February, 2023; originally announced February 2023.

  11. arXiv:2301.05487  [pdf, other

    cs.CL

    FUN with Fisher: Improving Generalization of Adapter-Based Cross-lingual Transfer with Scheduled Unfreezing

    Authors: Chen Cecilia Liu, Jonas Pfeiffer, Ivan Vulić, Iryna Gurevych

    Abstract: Standard fine-tuning of language models typically performs well on in-distribution data, but suffers with generalization to distribution shifts. In this work, we aim to improve the generalization of adapter-based cross-lingual task transfer where such cross-language distribution shifts are imminent. We investigate scheduled unfreezing algorithms -- originally proposed to mitigate catastrophic forg… ▽ More

    Submitted 4 April, 2024; v1 submitted 13 January, 2023; originally announced January 2023.

  12. arXiv:2210.06379  [pdf, other

    cs.CV cs.CL

    One does not fit all! On the Complementarity of Vision Encoders for Vision and Language Tasks

    Authors: Gregor Geigle, Chen Cecilia Liu, Jonas Pfeiffer, Iryna Gurevych

    Abstract: Current multimodal models, aimed at solving Vision and Language (V+L) tasks, predominantly repurpose Vision Encoders (VE) as feature extractors. While many VEs -- of different architectures, trained on different data and objectives -- are publicly available, they are not designed for the downstream V+L tasks. Nonetheless, most current work assumes that a \textit{single} pre-trained VE can serve as… ▽ More

    Submitted 8 June, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: Repl4NLP 2023

  13. arXiv:2205.06266  [pdf, other

    cs.CL

    Lifting the Curse of Multilinguality by Pre-training Modular Transformers

    Authors: Jonas Pfeiffer, Naman Goyal, Xi Victoria Lin, Xian Li, James Cross, Sebastian Riedel, Mikel Artetxe

    Abstract: Multilingual pre-trained models are known to suffer from the curse of multilinguality, which causes per-language performance to drop as they cover more languages. We address this issue by introducing language-specific modules, which allows us to grow the total capacity of the model, while kee** the total number of trainable parameters per language constant. In contrast with prior work that learn… ▽ More

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: NAACL 2022

  14. arXiv:2203.13693  [pdf, other

    cs.CL cs.IR

    UKP-SQUARE: An Online Platform for Question Answering Research

    Authors: Tim Baumgärtner, Kexin Wang, Rachneet Sachdeva, Max Eichler, Gregor Geigle, Clifton Poth, Hannah Sterz, Haritz Puerto, Leonardo F. R. Ribeiro, Jonas Pfeiffer, Nils Reimers, Gözde Gül Şahin, Iryna Gurevych

    Abstract: Recent advances in NLP and information retrieval have given rise to a diverse set of question answering tasks that are of different formats (e.g., extractive, abstractive), require different model architectures (e.g., generative, discriminative), and setups (e.g., with or without retrieval). Despite having a large number of powerful, specialized QA pipelines (which we refer to as Skills) that cons… ▽ More

    Submitted 28 March, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: Accepted at ACL 2022 Demo Track

  15. arXiv:2202.07630  [pdf, other

    cs.CL

    Delving Deeper into Cross-lingual Visual Question Answering

    Authors: Chen Liu, Jonas Pfeiffer, Anna Korhonen, Ivan Vulić, Iryna Gurevych

    Abstract: Visual question answering (VQA) is one of the crucial vision-and-language tasks. Yet, existing VQA research has mostly focused on the English language, due to a lack of suitable evaluation resources. Previous work on cross-lingual VQA has reported poor zero-shot transfer performance of current multilingual multimodal Transformers with large gaps to monolingual performance, without any deeper analy… ▽ More

    Submitted 8 June, 2023; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: Findings of EACL 2023

  16. arXiv:2201.11732  [pdf, other

    cs.CL cs.CV

    IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages

    Authors: Emanuele Bugliarello, Fangyu Liu, Jonas Pfeiffer, Siva Reddy, Desmond Elliott, Edoardo Maria Ponti, Ivan Vulić

    Abstract: Reliable evaluation benchmarks designed for replicability and comprehensiveness have driven progress in machine learning. Due to the lack of a multilingual benchmark, however, vision-and-language research has mostly focused on English language tasks. To fill this gap, we introduce the Image-Grounded Language Understanding Evaluation benchmark. IGLUE brings together - by both aggregating pre-existi… ▽ More

    Submitted 17 July, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: ICML 2022

  17. arXiv:2110.14794  [pdf, other

    cs.CR cs.LG stat.ML

    Masked LARk: Masked Learning, Aggregation and Reporting worKflow

    Authors: Joseph J. Pfeiffer III, Denis Charles, Davis Gilton, Young Hun Jung, Mehul Parsana, Erik Anderson

    Abstract: Today, many web advertising data flows involve passive cross-site tracking of users. Enabling such a mechanism through the usage of third party tracking cookies (3PC) exposes sensitive user data to a large number of parties, with little oversight on how that data can be used. Thus, most browsers are moving towards removal of 3PC in subsequent browser iterations. In order to substantially improve e… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: Microsoft Journal of Applied Research (MSJAR Volume 16)

    MSC Class: 68T07

  18. arXiv:2109.06082  [pdf, other

    cs.CL

    xGQA: Cross-Lingual Visual Question Answering

    Authors: Jonas Pfeiffer, Gregor Geigle, Aishwarya Kamath, Jan-Martin O. Steitz, Stefan Roth, Ivan Vulić, Iryna Gurevych

    Abstract: Recent advances in multimodal vision and language modeling have predominantly focused on the English language, mostly due to the lack of multilingual multimodal datasets to steer modeling efforts. In this work, we address this gap and provide xGQA, a new multilingual evaluation benchmark for the visual question answering task. We extend the established English GQA dataset to 7 typologically divers… ▽ More

    Submitted 17 March, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: Findings of ACL 2022

  19. arXiv:2109.04422  [pdf, other

    cs.CV cs.CL

    TxT: Crossmodal End-to-End Learning with Transformers

    Authors: Jan-Martin O. Steitz, Jonas Pfeiffer, Iryna Gurevych, Stefan Roth

    Abstract: Reasoning over multiple modalities, e.g. in Visual Question Answering (VQA), requires an alignment of semantic concepts across domains. Despite the widespread success of end-to-end learning, today's multimodal pipelines by and large leverage pre-extracted, fixed features from object detectors, typically Faster R-CNN, as representations of the visual world. The obvious downside is that the visual r… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: To appear at the 43rd DAGM German Conference on Pattern Recognition (GCPR) 2021

  20. arXiv:2109.03808  [pdf, other

    cs.CL

    Smelting Gold and Silver for Improved Multilingual AMR-to-Text Generation

    Authors: Leonardo F. R. Ribeiro, Jonas Pfeiffer, Yue Zhang, Iryna Gurevych

    Abstract: Recent work on multilingual AMR-to-text generation has exclusively focused on data augmentation strategies that utilize silver AMR. However, this assumes a high quality of generated AMRs, potentially limiting the transferability to the target task. In this paper, we investigate different techniques for automatically generating AMR annotations, where we aim to study which source of information yiel… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted as a conference paper to EMNLP 2021

  21. arXiv:2108.08103  [pdf, other

    cs.CL

    AdapterHub Playground: Simple and Flexible Few-Shot Learning with Adapters

    Authors: Tilman Beck, Bela Bohlender, Christina Viehmann, Vincent Hane, Yanik Adamson, Jaber Khuri, Jonas Brossmann, Jonas Pfeiffer, Iryna Gurevych

    Abstract: The open-access dissemination of pretrained language models through online repositories has led to a democratization of state-of-the-art natural language processing (NLP) research. This also allows people outside of NLP to use such models and adapt them to specific use-cases. However, a certain amount of technical proficiency is still required which is an entry barrier for users who want to apply… ▽ More

    Submitted 19 April, 2022; v1 submitted 18 August, 2021; originally announced August 2021.

    Comments: ACL 2022 System Demonstrations

  22. arXiv:2104.08247  [pdf, other

    cs.CL

    What to Pre-Train on? Efficient Intermediate Task Selection

    Authors: Clifton Poth, Jonas Pfeiffer, Andreas Rücklé, Iryna Gurevych

    Abstract: Intermediate task fine-tuning has been shown to culminate in large transfer gains across many NLP tasks. With an abundance of candidate datasets as well as pre-trained language models, it has become infeasible to run the cross-product of all combinations to find the best transfer setting. In this work we first establish that similar sequential fine-tuning gains can be achieved in adapter settings,… ▽ More

    Submitted 10 September, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: EMNLP 2021

  23. arXiv:2103.11920  [pdf, other

    cs.CV cs.CL

    Retrieve Fast, Rerank Smart: Cooperative and Joint Approaches for Improved Cross-Modal Retrieval

    Authors: Gregor Geigle, Jonas Pfeiffer, Nils Reimers, Ivan Vulić, Iryna Gurevych

    Abstract: Current state-of-the-art approaches to cross-modal retrieval process text and visual input jointly, relying on Transformer-based architectures with cross-attention mechanisms that attend over all words and objects in an image. While offering unmatched retrieval performance, such models: 1) are typically pretrained from scratch and thus less scalable, 2) suffer from huge retrieval latency and ineff… ▽ More

    Submitted 18 February, 2022; v1 submitted 22 March, 2021; originally announced March 2021.

    Comments: TACL 2022

  24. arXiv:2012.15613  [pdf, other

    cs.CL

    How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models

    Authors: Phillip Rust, Jonas Pfeiffer, Ivan Vulić, Sebastian Ruder, Iryna Gurevych

    Abstract: In this work, we provide a systematic and comprehensive empirical comparison of pretrained multilingual language models versus their monolingual counterparts with regard to their monolingual task performance. We study a set of nine typologically diverse languages with readily available pretrained monolingual models on a set of five diverse monolingual downstream tasks. We first aim to establish, v… ▽ More

    Submitted 1 June, 2021; v1 submitted 31 December, 2020; originally announced December 2020.

    Comments: ACL 2021

  25. arXiv:2012.15562  [pdf, other

    cs.CL

    UNKs Everywhere: Adapting Multilingual Language Models to New Scripts

    Authors: Jonas Pfeiffer, Ivan Vulić, Iryna Gurevych, Sebastian Ruder

    Abstract: Massively multilingual language models such as multilingual BERT offer state-of-the-art cross-lingual transfer performance on a range of NLP tasks. However, due to limited capacity and large differences in pretraining data sizes, there is a profound performance gap between resource-rich and resource-poor target languages. The ultimate challenge is dealing with under-resourced languages not covered… ▽ More

    Submitted 10 September, 2021; v1 submitted 31 December, 2020; originally announced December 2020.

    Comments: EMNLP 2021

  26. arXiv:2010.11918  [pdf, other

    cs.LG cs.CL

    AdapterDrop: On the Efficiency of Adapters in Transformers

    Authors: Andreas Rücklé, Gregor Geigle, Max Glockner, Tilman Beck, Jonas Pfeiffer, Nils Reimers, Iryna Gurevych

    Abstract: Massively pre-trained transformer models are computationally expensive to fine-tune, slow for inference, and have large storage requirements. Recent approaches tackle these shortcomings by training smaller models, dynamically reducing the model size, and by training light-weight adapters. In this paper, we propose AdapterDrop, removing adapters from lower transformer layers during training and inf… ▽ More

    Submitted 5 October, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: EMNLP 2021

  27. arXiv:2010.08710  [pdf, other

    cs.LG stat.ML

    Causal Transfer Random Forest: Combining Logged Data and Randomized Experiments for Robust Prediction

    Authors: Shuxi Zeng, Murat Ali Bayir, Joesph J. Pfeiffer III, Denis Charles, Emre Kiciman

    Abstract: It is often critical for prediction models to be robust to distributional shifts between training and testing data. From a causal perspective, the challenge is to distinguish the stable causal relationships from the unstable spurious correlations across shifts. We describe a causal transfer random forest (CTRF) that combines existing training data with a small amount of data from a randomized expe… ▽ More

    Submitted 14 January, 2021; v1 submitted 16 October, 2020; originally announced October 2020.

    Comments: 9 pages, 7 figures, 2 tables, accepted to WSDM 2021

  28. Causal Inference in the Presence of Interference in Sponsored Search Advertising

    Authors: Razieh Nabi, Joel Pfeiffer, Murat Ali Bayir, Denis Charles, Emre Kıcıman

    Abstract: In classical causal inference, inferring cause-effect relations from data relies on the assumption that units are independent and identically distributed. This assumption is violated in settings where units are related through a network of dependencies. An example of such a setting is ad placement in sponsored search advertising, where the clickability of a particular ad is potentially influenced… ▽ More

    Submitted 14 October, 2020; originally announced October 2020.

    Journal ref: Special issue on Causal Inference and Machine Learning with Network Data, Frontiers in Big Data, 2022

  29. arXiv:2010.00980  [pdf, other

    cs.CL cs.IR

    MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale

    Authors: Andreas Rücklé, Jonas Pfeiffer, Iryna Gurevych

    Abstract: We study the zero-shot transfer capabilities of text matching models on a massive scale, by self-supervised training on 140 source domains from community question answering forums in English. We investigate the model performances on nine benchmarks of answer selection and question similarity tasks, and show that all 140 models transfer surprisingly well, where the large majority of models substant… ▽ More

    Submitted 2 October, 2020; originally announced October 2020.

    Comments: EMNLP-2020

  30. arXiv:2007.07779  [pdf, other

    cs.CL

    AdapterHub: A Framework for Adapting Transformers

    Authors: Jonas Pfeiffer, Andreas Rücklé, Clifton Poth, Aishwarya Kamath, Ivan Vulić, Sebastian Ruder, Kyunghyun Cho, Iryna Gurevych

    Abstract: The current modus operandi in NLP involves downloading and fine-tuning pre-trained models consisting of millions or billions of parameters. Storing and sharing such large trained models is expensive, slow, and time-consuming, which impedes progress towards more general and versatile NLP methods that learn from and for many tasks. Adapters -- small learnt bottleneck layers inserted within each laye… ▽ More

    Submitted 6 October, 2020; v1 submitted 15 July, 2020; originally announced July 2020.

    Comments: EMNLP 2020: Systems Demonstrations

  31. arXiv:2006.08004  [pdf, other

    q-fin.MF math.PR

    The Gauss2++ Model -- A Comparison of Different Measure Change Specifications for a Consistent Risk Neutral and Real World Calibration

    Authors: Christoph Berninger, Julian Pfeiffer

    Abstract: Especially in the insurance industry interest rate models play a crucial role e.g. to calculate the insurance company's liabilities, performance scenarios or risk measures. A prominant candidate is the 2-Additive-Factor Gaussian Model (Gauss2++) - in a different representation also known as the 2-Factor Hull-White model. In this paper, we propose a framework to estimate the model such that it can… ▽ More

    Submitted 14 June, 2020; originally announced June 2020.

    MSC Class: 62P05; 97M30; 91G30 ACM Class: G.3

  32. arXiv:2005.00250  [pdf, other

    cs.CL

    Low Resource Multi-Task Sequence Tagging -- Revisiting Dynamic Conditional Random Fields

    Authors: Jonas Pfeiffer, Edwin Simpson, Iryna Gurevych

    Abstract: We compare different models for low resource multi-task sequence tagging that leverage dependencies between label sequences for different tasks. Our analysis is aimed at datasets where each example has labels for multiple tasks. Current approaches use either a separate model for each task or standard multi-task learning to learn shared feature representations. However, these approaches ignore corr… ▽ More

    Submitted 1 May, 2020; originally announced May 2020.

  33. arXiv:2005.00247  [pdf, other

    cs.CL

    AdapterFusion: Non-Destructive Task Composition for Transfer Learning

    Authors: Jonas Pfeiffer, Aishwarya Kamath, Andreas Rücklé, Kyunghyun Cho, Iryna Gurevych

    Abstract: Sequential fine-tuning and multi-task learning are methods aiming to incorporate knowledge from multiple tasks; however, they suffer from catastrophic forgetting and difficulties in dataset balancing. To address these shortcomings, we propose AdapterFusion, a new two stage learning algorithm that leverages knowledge from multiple tasks. First, in the knowledge extraction stage we learn task specif… ▽ More

    Submitted 26 January, 2021; v1 submitted 1 May, 2020; originally announced May 2020.

    Journal ref: Proceedings of EACL 2021

  34. arXiv:2005.00052  [pdf, other

    cs.CL

    MAD-X: An Adapter-Based Framework for Multi-Task Cross-Lingual Transfer

    Authors: Jonas Pfeiffer, Ivan Vulić, Iryna Gurevych, Sebastian Ruder

    Abstract: The main goal behind state-of-the-art pre-trained multilingual models such as multilingual BERT and XLM-R is enabling and bootstrap** NLP applications in low-resource languages through zero-shot or few-shot cross-lingual transfer. However, due to limited model capacity, their transfer performance is the weakest exactly on such low-resource languages and languages unseen during pre-training. We p… ▽ More

    Submitted 6 October, 2020; v1 submitted 30 April, 2020; originally announced May 2020.

    Comments: EMNLP 2020

  35. Fine-Tuned Neural Models for Propaganda Detection at the Sentence and Fragment levels

    Authors: Tariq Alhindi, Jonas Pfeiffer, Smaranda Muresan

    Abstract: This paper presents the CUNLP submission for the NLP4IF 2019 shared-task on FineGrained Propaganda Detection. Our system finished 5th out of 26 teams on the sentence-level classification task and 5th out of 11 teams on the fragment-level classification task based on our scores on the blind test set. We present our models, a discussion of our ablation studies and experiments, and an analysis of our… ▽ More

    Submitted 21 October, 2019; originally announced October 2019.

    Comments: Accepted to the 2nd Workshop on NLP for Internet Freedom (NLP4IF): Censorship, Disinformation, and Propaganda

  36. arXiv:1909.04547  [pdf, other

    cs.CL

    What do Deep Networks Like to Read?

    Authors: Jonas Pfeiffer, Aishwarya Kamath, Iryna Gurevych, Sebastian Ruder

    Abstract: Recent research towards understanding neural networks probes models in a top-down manner, but is only able to identify model tendencies that are known a priori. We propose Susceptibility Identification through Fine-Tuning (SIFT), a novel abstractive method that uncovers a model's preferences without imposing any prior. By fine-tuning an autoencoder with the gradients from a fixed classifier, we ar… ▽ More

    Submitted 10 September, 2019; originally announced September 2019.

  37. arXiv:1908.11254  [pdf, other

    cs.CL

    FAMULUS: Interactive Annotation and Feedback Generation for Teaching Diagnostic Reasoning

    Authors: Jonas Pfeiffer, Christian M. Meyer, Claudia Schulz, Jan Kiesewetter, Jan Zottmann, Michael Sailer, Elisabeth Bauer, Frank Fischer, Martin R. Fischer, Iryna Gurevych

    Abstract: Our proposed system FAMULUS helps students learn to diagnose based on automatic feedback in virtual patient simulations, and it supports instructors in labeling training data. Diagnosing is an exceptionally difficult skill to obtain but vital for many different professions (e.g., medical doctors, teachers). Previous case simulation systems are limited to multiple-choice questions and thus cann… ▽ More

    Submitted 29 August, 2019; originally announced August 2019.

    Comments: EMNLP 2019 - Demo

    Journal ref: EMNLP-IJCNLP 2019: System Demonstrations

  38. arXiv:1806.02794  [pdf, other

    cs.LG cs.AI stat.ML

    Unbiased Estimation of the Value of an Optimized Policy

    Authors: Elon Portugaly, Joseph J. Pfeiffer III

    Abstract: Randomized trials, also known as A/B tests, are used to select between two policies: a control and a treatment. Given a corresponding set of features, we can ideally learn an optimized policy P that maps the A/B test data features to action space and optimizes reward. However, although A/B testing provides an unbiased estimator for the value of deploying B (i.e., switching from policy A to B), dir… ▽ More

    Submitted 7 June, 2018; originally announced June 2018.

  39. arXiv:1804.06909  [pdf, other

    cs.LG stat.ML

    Modeling and Simultaneously Removing Bias via Adversarial Neural Networks

    Authors: John Moore, Joel Pfeiffer, Kai Wei, Rishabh Iyer, Denis Charles, Ran Gilad-Bachrach, Levi Boyles, Eren Manavoglu

    Abstract: In real world systems, the predictions of deployed Machine Learned models affect the training data available to build subsequent models. This introduces a bias in the training data that needs to be addressed. Existing solutions to this problem attempt to resolve the problem by either casting this in the reinforcement learning framework or by quantifying the bias and re-weighting the loss functions… ▽ More

    Submitted 18 April, 2018; originally announced April 2018.

  40. arXiv:1804.05885  [pdf, other

    cs.IT

    Area Rate Evaluation based on Spatial Clustering of massive MIMO Channel Measurements

    Authors: Maximilian Arnold, Johannes Pfeiffer, Stephan ten Brink

    Abstract: Channel models for massive MIMO are typically based on matrices with complex Gaussian entries, extended by the Kronecker and Weichselberger model. One reason for observing a gap between modeled and actual channel behavior is the absence of spatial consistency in many such models, that is, spatial correlations over an area in the x, y-dimensions are not accounted for, making it difficult to study,… ▽ More

    Submitted 16 April, 2018; originally announced April 2018.

    Comments: Submitted to WSA2018

  41. Evidence of a two-stage melting of Wigner solids in two dimensions

    Authors: Jian Huang, Talbot Knighton, Zhe Wu, Alessandro Serafin, J. S. Xia L. N. Pfeiffer, K. W. West

    Abstract: Two-dimensional (2D) solid-liquid transition (SLT)~\cite{Mermin1966Absence,Mermin1968Crystalline,Kosterlitz1972Long} concerns fundamental concepts of long-range correlations vital to magnetism, superconductivity, superfluidity, and topological matters. A long sought-after example is the melting of a Wigner Crystal (WC)~\cite{Wigner1934Interaction} of electrons. Detection efforts have targeted dist… ▽ More

    Submitted 6 June, 2017; originally announced June 2017.

    Journal ref: Phys. Rev. B 97, 085135 (2018)

  42. arXiv:1402.4199  [pdf, ps, other

    math.AG math.OC

    Sums of Squares on the Hypercube

    Authors: Grigoriy Blekherman, João Gouveia, James Pfeiffer

    Abstract: Let X be a finite set of points in R^n. A polynomial p nonnegative on X can be written as a sum of squares of rational functions modulo the vanishing ideal I(X). From the point of view of applications, such as polynomial optimization, we are interested in rational function representations of small degree. We derive a general upper bound in terms of the Hilbert function of X, and we show that this… ▽ More

    Submitted 17 February, 2014; originally announced February 2014.

  43. arXiv:1401.7751  [pdf, ps, other

    math.RT math.CO

    The representation theory of generalized hyperoctahedral groups

    Authors: William McGovern, James Pfeiffer

    Abstract: We give an explicit decomposition of $\hbox{Ind}(1)_{B_n}^{S_{2n}}$, following Barbasch and Vogan [1]. We define two natural generalizations of $B_n$, and extend the proof in [1] to recursively compute these decompositions. Although the decompositions do not appear to follow a simple pattern, we prove enough of their structure to show that they are almost never multiplicity-free.

    Submitted 30 January, 2014; originally announced January 2014.

  44. arXiv:1211.0039  [pdf, ps, other

    math.OC math.CO

    A Semidefinite Approach to the $K_i$ Cover Problem

    Authors: João Gouveia, James Pfeiffer

    Abstract: We apply theta body relaxations to the $K_i$-cover problem and show polynomial time solvability for certain classes of graphs. In particular, we give an effective relaxation where all $K_i$-$p$-hole facets are valid, and study its relation to an open question of Conforti et al. For the triangle free problem, we show for $K_n$ that the theta body relaxations do not converge by $(n-2)/4$ steps; we a… ▽ More

    Submitted 1 February, 2014; v1 submitted 31 October, 2012; originally announced November 2012.

  45. arXiv:1208.4897  [pdf, other

    physics.atom-ph quant-ph

    High-Resolution Imaging and Optical Control of Bose-Einstein Condensates in an Atom Chip Magnetic Trap

    Authors: Evan A. Salim, Seth C. Caliga, Jonathan B. Pfeiffer, Dana Z. Anderson

    Abstract: A high-resolution projection and imaging system for ultracold atoms is implemented using a compound silicon and glass atom chip. The atom chip is metalized to enable magnetic trap** while glass regions enable high numerical aperture optical access to atoms residing in the magnetic trap about 100 microns below the chip surface. The atom chip serves as a wall of the vacuum system, which enables th… ▽ More

    Submitted 23 August, 2012; originally announced August 2012.

    Comments: 4 pages, 5 figures, 12 references

  46. Bootstrap percolation on the Hamming torus

    Authors: Janko Gravner, Christopher Hoffman, James Pfeiffer, David Sivakoff

    Abstract: The Hamming torus of dimension $d$ is the graph with vertices $\{1,\dots,n\}^d$ and an edge between any two vertices that differ in a single coordinate. Bootstrap percolation with threshold $θ$ starts with a random set of open vertices, to which every vertex belongs independently with probability $p$, and at each time step the open set grows by adjoining every vertex with at least $θ$ open neighbo… ▽ More

    Submitted 23 January, 2015; v1 submitted 23 February, 2012; originally announced February 2012.

    Comments: Published in at http://dx.doi.org/10.1214/13-AAP996 the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AAP-AAP996

    Journal ref: Annals of Applied Probability 2015, Vol. 25, No. 1, 287-323

  47. arXiv:1202.4805  [pdf, other

    cs.SI physics.soc-ph

    Fast Generation of Large Scale Social Networks with Clustering

    Authors: Joseph J. Pfeiffer III, Timothy La Fond, Sebastian Moreno, Jennifer Neville

    Abstract: A key challenge within the social network literature is the problem of network generation - that is, how can we create synthetic networks that match characteristics traditionally found in most real world networks? Important characteristics that are present in social networks include a power law degree distribution, small diameter and large amounts of clustering; however, most current network gener… ▽ More

    Submitted 21 February, 2012; originally announced February 2012.

    Comments: 11 pages

    ACM Class: G.2.2; G.3

  48. arXiv:1104.0319  [pdf, ps, other

    cs.SI physics.soc-ph

    Methods to Determine Node Centrality and Clustering in Graphs with Uncertain Structure

    Authors: Joseph J. Pfeiffer III, Jennifer Neville

    Abstract: Much of the past work in network analysis has focused on analyzing discrete graphs, where binary edges represent the "presence" or "absence" of a relationship. Since traditional network measures (e.g., betweenness centrality) utilize a discrete link structure, complex systems must be transformed to this representation in order to investigate network properties. However, in many domains there may b… ▽ More

    Submitted 2 April, 2011; originally announced April 2011.

    Comments: Longer version of paper appearing in Fifth International AAAI Conference on Weblogs and Social Media. 9 pages, 4 Figures

    MSC Class: 91D30 ACM Class: H.3.4

  49. Critical current diffraction pattern of SIFS Josephson junctions with step-like F-layer

    Authors: M. Weides, U. Peralagu, H. Kohlstedt, J. Pfeiffer, M. Kemmler, C. Gürlich, E. Goldobin, D. Koelle, R. Kleiner

    Abstract: We present the latest generation of superconductor-insulator-ferromagnet-superconductor Josephson tunnel junctions with a step-like thickness of the ferromagnetic (F) layer. The F-layer thicknesses $d_1$ and $d_2$ in both halves were varied to obtain different combinations of positive and negative critical current densities $j_{c,1}$ and $j_{c,2}$. The measured dependences of the critical current… ▽ More

    Submitted 15 July, 2010; v1 submitted 26 June, 2010; originally announced June 2010.

    Comments: 7 pages, 3 figures

    Journal ref: Supercond. Sci. Technol. 23, 095007 (2010)

  50. arXiv:0903.1046  [pdf, ps, other

    cond-mat.supr-con

    Escape Rate Measurements and Microwave Spectroscopy of 0, pi, and 0-pi ferromagnetic Josephson Tunnel Junctions

    Authors: J. Pfeiffer, T. Gaber, D. Koelle, R. Kleiner, E. Goldobin, M. Weides, H. Kohlstedt, J. Lisenfeld, A. K. Feofanov, A. V. Ustinov

    Abstract: We present experimental studies of high quality underdamped 0, pi, and 0-pi ferromagnetic Josephson tunnel junctions of intermediate length L (lambda_J < L < 5 lambda_J, where lambda_J is the Josephson penetration depth). The junctions are fabricated as Nb/Al_2O_3/Cu_40Ni_60/Nb Superconductor-Insulator-Ferromagnet-Superconductor heterostructures. Using microwave spectroscopy, we have investigate… ▽ More

    Submitted 5 March, 2009; originally announced March 2009.

    Comments: submitted to RPB