Skip to main content

Showing 1–17 of 17 results for author: Montariol, S

.
  1. Course Recommender Systems Need to Consider the Job Market

    Authors: Jibril Frej, Anna Dai, Syrielle Montariol, Antoine Bosselut, Tanja Käser

    Abstract: Current course recommender systems primarily leverage learner-course interactions, course content, learner preferences, and supplementary course details like instructor, institution, ratings, and reviews, to make their recommendation. However, these systems often overlook a critical aspect: the evolving skill demand of the job market. This paper focuses on the perspective of academic researchers,… ▽ More

    Submitted 1 May, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: accepted at SIGIR 2024 as a perspective paper. Camera Ready will come soon

    ACM Class: H.3.3

  2. arXiv:2404.05281  [pdf, ps, other

    cs.CL

    Multi-Task Learning for Features Extraction in Financial Annual Reports

    Authors: Syrielle Montariol, Matej Martinc, Andraž Pelicon, Senja Pollak, Boshko Koloski, Igor Lončarski, Aljoša Valentinčič

    Abstract: For assessing various performance indicators of companies, the focus is shifting from strictly financial (quantitative) publicly disclosed information to qualitative (textual) information. This textual data can provide valuable weak signals, for example through stylistic features, which can complement the quantitative data on financial performance or on Environmental, Social and Governance (ESG) c… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted at MIDAS Workshop at ECML-PKDD 2022

  3. arXiv:2403.13965  [pdf, other

    cs.CV

    ConGeo: Robust Cross-view Geo-localization across Ground View Variations

    Authors: Li Mi, Chang Xu, Javiera Castillo-Navarro, Syrielle Montariol, Wen Yang, Antoine Bosselut, Devis Tuia

    Abstract: Cross-view geo-localization aims at localizing a ground-level query image by matching it to its corresponding geo-referenced aerial view. In real-world scenarios, the task requires accommodating diverse ground images captured by users with varying orientations and reduced field of views (FoVs). However, existing learning pipelines are orientation-specific or FoV-specific, demanding separate model… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Project page at https://chasel-tsui.github.io/ConGeo/

  4. arXiv:2403.00180  [pdf, other

    cs.CL

    "Flex Tape Can't Fix That": Bias and Misinformation in Edited Language Models

    Authors: Karina Halevy, Anna Sotnikova, Badr AlKhamissi, Syrielle Montariol, Antoine Bosselut

    Abstract: Model editing has emerged as a cost-effective strategy to update knowledge stored in language models. However, model editing can have unintended consequences after edits are applied: information unrelated to the edits can also be changed, and other general behaviors of the model can be wrongly altered. In this work, we investigate how model editing methods unexpectedly amplify model biases post-ed… ▽ More

    Submitted 16 June, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

    Comments: 8 pages, 4 figures

  5. arXiv:2402.12846  [pdf, other

    cs.CV cs.AI

    ConVQG: Contrastive Visual Question Generation with Multimodal Guidance

    Authors: Li Mi, Syrielle Montariol, Javiera Castillo-Navarro, Xianjie Dai, Antoine Bosselut, Devis Tuia

    Abstract: Asking questions about visual environments is a crucial way for intelligent agents to understand rich multi-faceted scenes, raising the importance of Visual Question Generation (VQG) systems. Apart from being grounded to the image, existing VQG systems can use textual constraints, such as expected answers or knowledge triplets, to generate focused questions. These constraints allow VQG systems to… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: AAAI 2024. Project page at https://limirs.github.io/ConVQG

  6. arXiv:2402.03832  [pdf, other

    cs.CL

    Rethinking Skill Extraction in the Job Market Domain using Large Language Models

    Authors: Khanh Cao Nguyen, Mike Zhang, Syrielle Montariol, Antoine Bosselut

    Abstract: Skill Extraction involves identifying skills and qualifications mentioned in documents such as job postings and resumes. The task is commonly tackled by training supervised models using a sequence labeling approach with BIO tags. However, the reliance on manually annotated data limits the generalizability of such approaches. Moreover, the common BIO setting limits the ability of the models to capt… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: Published at NLP4HR 2024 (EACL Workshop)

  7. arXiv:2402.03242  [pdf, other

    cs.CL

    JOBSKAPE: A Framework for Generating Synthetic Job Postings to Enhance Skill Matching

    Authors: Antoine Magron, Anna Dai, Mike Zhang, Syrielle Montariol, Antoine Bosselut

    Abstract: Recent approaches in skill matching, employing synthetic training data for classification or similarity model training, have shown promising results, reducing the need for time-consuming and expensive annotations. However, previous synthetic datasets have limitations, such as featuring only one skill per sentence and generally comprising short sentences. In this paper, we introduce JobSkape, a fra… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Published at NLP4HR 2024 (EACL Workshop)

  8. arXiv:2402.02933  [pdf, other

    cs.LG cs.CY cs.HC

    InterpretCC: Intrinsic User-Centric Interpretability through Global Mixture of Experts

    Authors: Vinitra Swamy, Syrielle Montariol, Julian Blackwell, Jibril Frej, Martin Jaggi, Tanja Käser

    Abstract: Interpretability for neural networks is a trade-off between three key requirements: 1) faithfulness of the explanation (i.e., how perfectly it explains the prediction), 2) understandability of the explanation by humans, and 3) model performance. Most existing methods compromise one or more of these requirements; e.g., post-hoc approaches provide limited faithfulness, automatically identified featu… ▽ More

    Submitted 29 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  9. arXiv:2312.00575  [pdf, other

    cs.CL

    Instruction-tuning Aligns LLMs to the Human Brain

    Authors: Khai Loong Aw, Syrielle Montariol, Badr AlKhamissi, Martin Schrimpf, Antoine Bosselut

    Abstract: Instruction-tuning is a widely adopted method of finetuning that enables large language models (LLMs) to generate output that more closely resembles human responses to natural language queries, in many cases leading to human-level performance on diverse testbeds. However, it remains unclear whether instruction-tuning truly makes LLMs more similar to how humans process language. We investigate the… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  10. arXiv:2311.16079  [pdf, other

    cs.CL cs.AI cs.LG

    MEDITRON-70B: Scaling Medical Pretraining for Large Language Models

    Authors: Zeming Chen, Alejandro Hernández Cano, Angelika Romanou, Antoine Bonnet, Kyle Matoba, Francesco Salvi, Matteo Pagliardini, Simin Fan, Andreas Köpf, Amirkeivan Mohtashami, Alexandre Sallinen, Alireza Sakhaeirad, Vinitra Swamy, Igor Krawczuk, Deniz Bayazit, Axel Marmet, Syrielle Montariol, Mary-Anne Hartley, Martin Jaggi, Antoine Bosselut

    Abstract: Large language models (LLMs) can potentially democratize access to medical knowledge. While many efforts have been made to harness and improve LLMs' medical knowledge and reasoning capacities, the resulting models are either closed-source (e.g., PaLM, GPT-4) or limited in scale (<= 13B parameters), which restricts their abilities. In this work, we improve access to large-scale medical LLMs by rele… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  11. arXiv:2311.04284  [pdf, other

    cs.CL cs.AI

    CRAB: Assessing the Strength of Causal Relationships Between Real-world Events

    Authors: Angelika Romanou, Syrielle Montariol, Debjit Paul, Leo Laugier, Karl Aberer, Antoine Bosselut

    Abstract: Understanding narratives requires reasoning about the cause-and-effect relationships between events mentioned in the text. While existing foundation models yield impressive results in many NLP tasks requiring reasoning, it is unclear whether they understand the complexity of the underlying network of causal relationships of events in narratives. In this work, we present CRAB, a new Causal Reasonin… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  12. arXiv:2310.15239  [pdf, other

    cs.CL cs.AI

    CRoW: Benchmarking Commonsense Reasoning in Real-World Tasks

    Authors: Mete Ismayilzada, Debjit Paul, Syrielle Montariol, Mor Geva, Antoine Bosselut

    Abstract: Recent efforts in natural language processing (NLP) commonsense reasoning research have yielded a considerable number of new datasets and benchmarks. However, most of these datasets formulate commonsense reasoning challenges in artificial scenarios that are not reflective of the tasks which real-world NLP systems are designed to solve. In this work, we present CRoW, a manually-curated, multi-task… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: 37 pages, camera-ready for EMNLP 2023

  13. arXiv:2210.13029  [pdf, other

    cs.CL

    Multilingual Auxiliary Tasks Training: Bridging the Gap between Languages for Zero-Shot Transfer of Hate Speech Detection Models

    Authors: Syrielle Montariol, Arij Riabi, Djamé Seddah

    Abstract: Zero-shot cross-lingual transfer learning has been shown to be highly challenging for tasks involving a lot of linguistic specificities or when a cultural gap is present between languages, such as in hate speech detection. In this paper, we highlight this limitation for hate speech detection in several domains and languages using strict experimental settings. Then, we propose to train on multiling… ▽ More

    Submitted 25 October, 2022; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted to Findings of AACL-IJCNLP 2022

  14. Capturing Evolution in Word Usage: Just Add More Clusters?

    Authors: Matej Martinc, Syrielle Montariol, Elaine Zosa, Lidia Pivovarova

    Abstract: The way the words are used evolves through time, mirroring cultural or technological evolution of society. Semantic change detection is the task of detecting and analysing word evolution in textual data, even in short periods of time. In this paper we focus on a new set of methods relying on contextualised embeddings, a type of semantic modelling that revolutionised the NLP field recently. We leve… ▽ More

    Submitted 23 January, 2020; v1 submitted 18 January, 2020; originally announced January 2020.

    Journal ref: WWW 20 Companion Proceedings of the Web Conference 2020 (April 2020) p. 343-349

  15. arXiv:1909.01863  [pdf, other

    cs.CL

    Empirical Study of Diachronic Word Embeddings for Scarce Data

    Authors: Syrielle Montariol, Alexandre Allauzen

    Abstract: Word meaning change can be inferred from drifts of time-varying word embeddings. However, temporal data may be too sparse to build robust word embeddings and to discriminate significant drifts from noise. In this paper, we compare three models to learn diachronic word embeddings on scarce data: incremental updating of a Skip-Gram from Kim et al. (2014), dynamic filtering from Bamler and Mandt (201… ▽ More

    Submitted 4 September, 2019; originally announced September 2019.

    Comments: 7 pages

    Journal ref: RANLP 2019

  16. arXiv:1907.09169  [pdf, other

    cs.CL cs.LG

    Learning dynamic word embeddings with drift regularisation

    Authors: Syrielle Montariol, Alexandre Allauzen

    Abstract: Word usage, meaning and connotation change throughout time. Diachronic word embeddings are used to grasp these changes in an unsupervised way. In this paper, we use variants of the Dynamic Bernoulli Embeddings model to learn dynamic word embeddings, in order to identify notable properties of the model. The comparison is made on the New York Times Annotated Corpus in English and a set of articles f… ▽ More

    Submitted 22 July, 2019; originally announced July 2019.

    Comments: Published at TALN 2019. in French

  17. arXiv:1907.08469  [pdf, ps, other

    cs.CL

    Exploring sentence informativeness

    Authors: Syrielle Montariol, Aina Garí Soler, Alexandre Allauzen

    Abstract: This study is a preliminary exploration of the concept of informativeness -how much information a sentence gives about a word it contains- and its potential benefits to building quality word representations from scarce data. We propose several sentence-level classifiers to predict informativeness, and we perform a manual annotation on a set of sentences. We conclude that these two measures corresp… ▽ More

    Submitted 22 July, 2019; v1 submitted 19 July, 2019; originally announced July 2019.

    Comments: Published at TALN 2019