Showing 1–10 of 10 results for author: Brun, C

Searching in archive cs.
  1. arXiv:2406.17566  [pdf, other]

    cs.CL

    FrenchToxicityPrompts: a Large Benchmark for Evaluating and Mitigating Toxicity in French Texts

    Authors: Caroline Brun, Vassilina Nikoulina

    Abstract: Large language models (LLMs) are increasingly popular but are also prone to generating biased, toxic or harmful language, which can have detrimental effects on individuals and communities. Although much effort is put into assessing and mitigating toxicity in generated content, it is primarily concentrated on English, while it is essential to consider other languages as well. To address this issue, we c…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: TRAC-2024, Fourth Workshop on Threat, Aggression and Cyberbullying. 20 May 2024

  2. An Adapter-Based Unified Model for Multiple Spoken Language Processing Tasks

    Authors: Varsha Suresh, Salah Aït-Mokhtar, Caroline Brun, Ioan Calapodescu

    Abstract: Self-supervised learning models have revolutionized the field of speech processing. However, the process of fine-tuning these models on downstream tasks requires substantial computational resources, particularly when dealing with multiple speech-processing tasks. In this paper, we explore the potential of adapter-based fine-tuning in developing a unified model capable of effectively handling multi…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: ICASSP 2024

  3. arXiv:2311.01070  [pdf, other]

    cs.CL cs.SD eess.AS

    Multilingual DistilWhisper: Efficient Distillation of Multi-task Speech Models via Language-Specific Experts

    Authors: Thomas Palmeira Ferraz, Marcely Zanon Boito, Caroline Brun, Vassilina Nikoulina

    Abstract: Whisper is a multitask and multilingual speech model covering 99 languages. It yields commendable automatic speech recognition (ASR) results in a subset of its covered languages, but the model still underperforms on a non-negligible number of under-represented languages, a problem exacerbated in smaller model versions. In this work, we propose DistilWhisper, an approach able to bridge the performa…

    Submitted 12 March, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted to IEEE ICASSP 2024

  4. arXiv:2210.11621  [pdf, other]

    cs.CL cs.AI cs.LG

    SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages

    Authors: Alireza Mohammadshahi, Vassilina Nikoulina, Alexandre Berard, Caroline Brun, James Henderson, Laurent Besacier

    Abstract: In recent years, multilingual machine translation models have achieved promising performance on low-resource language pairs by sharing information between similar languages, thus enabling zero-shot translation. To overcome the "curse of multilinguality", these models often opt for scaling up the number of parameters, which makes their use in resource-constrained environments challenging. We introd…

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022

    Journal ref: https://aclanthology.org/2022.emnlp-main.571

  5. arXiv:2205.10828  [pdf, other]

    cs.CL cs.AI cs.LG

    What Do Compressed Multilingual Machine Translation Models Forget?

    Authors: Alireza Mohammadshahi, Vassilina Nikoulina, Alexandre Berard, Caroline Brun, James Henderson, Laurent Besacier

    Abstract: Recently, very large pre-trained models have achieved state-of-the-art results in various natural language processing (NLP) tasks, but their size makes it more challenging to apply them in resource-constrained environments. Compression techniques make it possible to drastically reduce the size of the models, and therefore their inference time, with negligible impact on top-tier metrics. However, the general performa…

    Submitted 27 June, 2023; v1 submitted 22 May, 2022; originally announced May 2022.

    Comments: Accepted to Findings of EMNLP 2022, presented at WMT 2022

    Journal ref: https://aclanthology.org/2022.findings-emnlp.317/

  6. arXiv:2112.02721  [pdf, other]

    cs.CL cs.AI cs.LG

    NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation

    Authors: Kaustubh D. Dhole, Varun Gangal, Sebastian Gehrmann, Aadesh Gupta, Zhenhao Li, Saad Mahamood, Abinaya Mahendiran, Simon Mille, Ashish Shrivastava, Samson Tan, Tongshuang Wu, Jascha Sohl-Dickstein, Jinho D. Choi, Eduard Hovy, Ondrej Dusek, Sebastian Ruder, Sajant Anand, Nagender Aneja, Rabin Banjade, Lisa Barthe, Hanna Behnke, Ian Berlot-Attwell, Connor Boyle, Caroline Brun, Marco Antonio Sobrevilla Cabezudo, et al. (101 additional authors not shown)

    Abstract: Data augmentation is an important component in the robustness evaluation of models in natural language processing (NLP) and in enhancing the diversity of the data they are trained on. In this paper, we present NL-Augmenter, a new participatory Python-based natural language augmentation framework which supports the creation of both transformations (modifications to the data) and filters (data split…

    Submitted 11 October, 2022; v1 submitted 5 December, 2021; originally announced December 2021.

    Comments: 39 pages, repository at https://github.com/GEM-benchmark/NL-Augmenter

  7. arXiv:1610.01910  [pdf, ps, other]

    cs.CL

    Toward Automatic Understanding of the Function of Affective Language in Support Groups

    Authors: Amit Navindgi, Caroline Brun, Cécile Boulard Masson, Scott Nowson

    Abstract: Understanding expressions of emotions in support forums has considerable value and NLP methods are key to automating this. Many approaches understandably use subjective categories which are more fine-grained than a straightforward polarity-based spectrum. However, the definition of such categories is non-trivial and, in fact, we argue for a need to incorporate communicative elements even beyond su…

    Submitted 6 October, 2016; originally announced October 2016.

    Comments: 9 pages, 1 figure, conference workshop

  8. arXiv:1604.05377  [pdf]

    stat.ML cs.LG cs.NE

    Churn analysis using deep convolutional neural networks and autoencoders

    Authors: Artit Wangperawong, Cyrille Brun, Olav Laudy, Rujikorn Pavasuthipaisit

    Abstract: Customer temporal behavioral data was represented as images in order to perform churn prediction by leveraging deep learning architectures prominent in image classification. Supervised learning was performed on labeled data of over 6 million customers using deep convolutional neural networks, which achieved an AUC of 0.743 on the test dataset using no more than 12 temporal features for each custom…

    Submitted 18 April, 2016; originally announced April 2016.

  9. arXiv:cs/0506049  [pdf, ps, other]

    cs.DL

    Exploitation de dictionnaires électroniques pour la désambiguïsation sémantique lexicale

    Authors: Caroline Brun, Bernard Jacquemin, Frédérique Segond

    Abstract: This paper presents a lexical disambiguation system, initially developed for English and now adapted to French. This system associates a word with its meaning in a given context using electronic dictionaries as semantically annotated corpora in order to extract semantic disambiguation rules. We describe the rule extraction and application process as well as the evaluation of the system. The resu…

    Submitted 12 June, 2005; originally announced June 2005.

    Comments: 25 pp

    ACM Class: H.3; H.4; H.5

    Journal ref: Traitement Automatique des Langues (TAL) 42, no. 3 (2001) pp. 667-690

  10. arXiv:cs/0506048  [pdf, ps, other]

    cs.IR

    Enriching a Text by Semantic Disambiguation for Information Extraction

    Authors: Bernard Jacquemin, Caroline Brun, Claude Roux

    Abstract: External linguistic resources have been used for a very long time in information extraction. These methods enrich a document with data that are semantically equivalent, in order to improve recall. For instance, some of these methods use synonym dictionaries. These dictionaries enrich a sentence with words that have a similar meaning. However, these methods present some serious drawbacks, since w…

    Submitted 12 June, 2005; originally announced June 2005.

    Comments: 7 pp

    ACM Class: H.3; H.4; H.5

    Journal ref: LREC 2002 Workshop Proceedings "Using semantics for information retrieval and filtering" (2002) 45-51