Skip to main content

Showing 1–3 of 3 results for author: Braune, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2206.01444  [pdf, other

    cs.LG cs.PF

    XPASC: Measuring Generalization in Weak Supervision by Explainability and Association

    Authors: Luisa März, Ehsaneddin Asgari, Fabienne Braune, Franziska Zimmermann, Benjamin Roth

    Abstract: Weak supervision is leveraged in a wide range of domains and tasks due to its ability to create massive amounts of labeled data, requiring only little manual effort. Standard approaches use labeling functions to specify signals that are relevant for the labeling. It has been conjectured that weakly supervised models over-rely on those signals and as a result suffer from overfitting. To verify this… ▽ More

    Submitted 22 November, 2022; v1 submitted 3 June, 2022; originally announced June 2022.

    Comments: 26 pages, 20 Figures, 5 Tables

  2. arXiv:2109.07994  [pdf, other

    cs.LG cs.CL

    KnowMAN: Weakly Supervised Multinomial Adversarial Networks

    Authors: Luisa März, Ehsaneddin Asgari, Fabienne Braune, Franziska Zimmermann, Benjamin Roth

    Abstract: The absence of labeled data for training neural models is often addressed by leveraging knowledge about the specific task, resulting in heuristic but noisy labels. The knowledge is captured in labeling functions, which detect certain regularities or patterns in the training samples and annotate corresponding labels for training. This process of weakly supervised training may result in an over-reli… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: 9 pages, 3 figures, 2 tables, accepted to EMNLP 2021

  3. arXiv:1904.09678  [pdf, other

    cs.CL

    UniSent: Universal Adaptable Sentiment Lexica for 1000+ Languages

    Authors: Ehsaneddin Asgari, Fabienne Braune, Benjamin Roth, Christoph Ringlstetter, Mohammad R. K. Mofrad

    Abstract: In this paper, we introduce UniSent universal sentiment lexica for $1000+$ languages. Sentiment lexica are vital for sentiment analysis in absence of document-level annotations, a very common scenario for low-resource languages. To the best of our knowledge, UniSent is the largest sentiment resource to date in terms of the number of covered languages, including many low resource ones. In this work… ▽ More

    Submitted 28 November, 2019; v1 submitted 21 April, 2019; originally announced April 2019.