Skip to main content

Showing 1–4 of 4 results for author: Sosnowski, W

.
  1. arXiv:2211.15202  [pdf, other

    cs.CL cs.AI cs.LG

    Revisiting Distance Metric Learning for Few-Shot Natural Language Classification

    Authors: Witold Sosnowski, Anna Wróblewska, Karolina Seweryn, Piotr Gawrysiak

    Abstract: Distance Metric Learning (DML) has attracted much attention in image processing in recent years. This paper analyzes its impact on supervised fine-tuning language models for Natural Language Processing (NLP) classification tasks under few-shot learning settings. We investigated several DML loss functions in training RoBERTa language models on known SentEval Transfer Tasks datasets. We also analyze… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

  2. arXiv:2211.15195  [pdf, other

    cs.CL cs.AI cs.LG

    Distance Metric Learning Loss Functions in Few-Shot Scenarios of Supervised Language Models Fine-Tuning

    Authors: Witold Sosnowski, Karolina Seweryn, Anna Wróblewska, Piotr Gawrysiak

    Abstract: This paper presents an analysis regarding an influence of the Distance Metric Learning (DML) loss functions on the supervised fine-tuning of the language models for classification tasks. We experimented with known datasets from SentEval Transfer Tasks. Our experiments show that applying the DML loss function can increase performance on downstream classification tasks of RoBERTa-large models in f… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

  3. arXiv:2204.07775  [pdf, other

    cs.CL cs.AI cs.LG

    TASTEset -- Recipe Dataset and Food Entities Recognition Benchmark

    Authors: Ania Wróblewska, Agnieszka Kaliska, Maciej Pawłowski, Dawid Wiśniewski, Witold Sosnowski, Agnieszka Ławrynowicz

    Abstract: Food Computing is currently a fast-growing field of research. Natural language processing (NLP) is also increasingly essential in this field, especially for recognising food entities. However, there are still only a few well-defined tasks that serve as benchmarks for solutions in this area. We introduce a new dataset -- called \textit{TASTEset} -- to bridge this gap. In this dataset, Named Entity… ▽ More

    Submitted 16 April, 2022; originally announced April 2022.

  4. Applying SoftTriple Loss for Supervised Language Model Fine Tuning

    Authors: Witold Sosnowski, Anna Wroblewska, Piotr Gawrysiak

    Abstract: We introduce a new loss function TripleEntropy, to improve classification performance for fine-tuning general knowledge pre-trained language models based on cross-entropy and SoftTriple loss. This loss function can improve the robust RoBERTa baseline model fine-tuned with cross-entropy loss by about (0.02% - 2.29%). Thorough tests on popular datasets indicate a steady gain. The fewer samples in th… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

    Journal ref: 17th Conference on Computer Science and Intelligence Systems 2022. Series: ACSIS Annals of Computer Science and Information Systems