Skip to main content

Showing 1–10 of 10 results for author: Schmidt, F D

.
  1. arXiv:2407.02310  [pdf, other

    cs.CL

    Evaluating the Ability of LLMs to Solve Semantics-Aware Process Mining Tasks

    Authors: Adrian Rebmann, Fabian David Schmidt, Goran Glavaš, Han van der Aa

    Abstract: The process mining community has recently recognized the potential of large language models (LLMs) for tackling various process mining tasks. Initial studies report the capability of LLMs to support process analysis and even, to some extent, that they are able to reason about how processes work. This latter property suggests that LLMs could also be used to tackle process mining tasks that benefit… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Submitted to ICPM

  2. arXiv:2406.12739  [pdf, other

    cs.CL

    Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages

    Authors: Fabian David Schmidt, Philipp Borchert, Ivan Vulić, Goran Glavaš

    Abstract: LLMs have become a go-to solution not just for text generation, but also for natural language understanding (NLU) tasks. Acquiring extensive knowledge through language modeling on web-scale corpora, they excel on English NLU, yet struggle to extend their NLU capabilities to underrepresented languages. In contrast, machine translation models (MT) produce excellent multilingual representations, resu… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  3. arXiv:2406.12634  [pdf, other

    cs.IR cs.AI

    News Without Borders: Domain Adaptation of Multilingual Sentence Embeddings for Cross-lingual News Recommendation

    Authors: Andreea Iana, Fabian David Schmidt, Goran Glavaš, Heiko Paulheim

    Abstract: Rapidly growing numbers of multilingual news consumers pose an increasing challenge to news recommender systems in terms of providing customized recommendations. First, existing neural news recommenders, even when powered by multilingual language models (LMs), suffer substantial performance losses in zero-shot cross-lingual transfer (ZS-XLT). Second, the current paradigm of fine-tuning the backbon… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    ACM Class: I.2.7; H.3.3

  4. arXiv:2404.19319  [pdf, other

    cs.CL

    Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget

    Authors: Minh Duc Bui, Fabian David Schmidt, Goran Glavaš, Katharina von der Wense

    Abstract: Compared to standard language model (LM) pretraining (i.e., from scratch), Knowledge Distillation (KD) entails an additional forward pass through a teacher model that is typically substantially larger than the target student model. As such, KD in LM pretraining materially slows down throughput of pretraining instances vis-a-vis pretraining from scratch. Scaling laws of LM pretraining suggest that… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: Accepted to the 5th Workshop on Insights from Negative Results in NLP at NAACL 2024

  5. arXiv:2310.10532  [pdf, other

    cs.CL

    One For All & All For One: Bypassing Hyperparameter Tuning with Model Averaging For Cross-Lingual Transfer

    Authors: Fabian David Schmidt, Ivan Vulić, Goran Glavaš

    Abstract: Multilingual language models enable zero-shot cross-lingual transfer (ZS-XLT): fine-tuned on sizable source-language task data, they perform the task in target languages without labeled instances. The effectiveness of ZS-XLT hinges on the linguistic proximity between languages and the amount of pretraining data for a language. Because of this, model selection based on source-language validation is… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted to findings of EMNLP 2023

  6. arXiv:2305.16834  [pdf, other

    cs.CL

    Free Lunch: Robust Cross-Lingual Transfer via Model Checkpoint Averaging

    Authors: Fabian David Schmidt, Ivan Vulić, Goran Glavaš

    Abstract: Massively multilingual language models have displayed strong performance in zero-shot (ZS-XLT) and few-shot (FS-XLT) cross-lingual transfer setups, where models fine-tuned on task data in a source language are transferred without any or with only a few annotated instances to the target language(s). However, current work typically overestimates model performance as fine-tuned models are frequently… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted To Appear In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics

  7. arXiv:2210.06763  [pdf, other

    astro-ph.SR astro-ph.GA astro-ph.HE

    Dust survival rates in clumps passing through the Cas A reverse shock -- II. The impact of magnetic fields

    Authors: Florian Kirchschlager, Franziska D. Schmidt, M. J. Barlow, Ilse De Looze, Nina S. Sartorio

    Abstract: Dust grains form in the clumpy ejecta of core-collapse supernovae where they are subject to the reverse shock, which is able to disrupt the clumps and destroy the grains. Important dust destruction processes include thermal and kinetic sputtering as well as fragmentation and grain vaporization. In the present study, we focus on the effect of magnetic fields on the destruction processes. We have pe… ▽ More

    Submitted 16 February, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted by MNRAS. Author accepted manuscript. Accepted on 23/01/2023. 24 pages, 21 Figures

  8. arXiv:2003.03380  [pdf, other

    astro-ph.SR astro-ph.GA astro-ph.HE

    Silicate grain growth due to ion trap** in oxygen-rich supernova remnants like Cassiopeia A

    Authors: Florian Kirchschlager, M. J. Barlow, Franziska D. Schmidt

    Abstract: Core-collapse supernovae can condense large masses of dust post-explosion. However, sputtering and grain-grain collisions during the subsequent passage of the dust through the reverse shock can potentially destroy a significant fraction of the newly formed dust before it can reach the interstellar medium. Here we show that in oxygen-rich supernova remnants like Cassiopeia A the penetration and tra… ▽ More

    Submitted 10 March, 2020; v1 submitted 6 March, 2020; originally announced March 2020.

    Comments: Accepted by ApJ. Author accepted manuscript. Accepted on 06/03/2020. Deposited on 06/03/2020. 11 pages

  9. arXiv:1909.09068  [pdf, other

    astro-ph.GA astro-ph.SR

    Dust destruction by the reverse shock in the clumpy supernova remnant Cassiopeia A based on hydrodynamic simulations

    Authors: Florian Kirchschlager, Franziska D. Schmidt, M. J. Barlow, Erica L. Fogerty, Antonia Bevan, Felix D. Priestley

    Abstract: Observations of the ejecta of core-collapse supernovae have shown that dust grains form in over-dense gas clumps in the expanding ejecta. The clumps are later subject to the passage of the reverse shock and a significant amount of the newly formed dust material can be destroyed due to the high temperatures and high velocities in the post-shock gas. To determine dust survival rates, we have perform… ▽ More

    Submitted 19 September, 2019; originally announced September 2019.

    Comments: Conference proceeding

  10. arXiv:1908.10875  [pdf, other

    astro-ph.SR astro-ph.GA astro-ph.HE

    Dust survival rates in clumps passing through the Cas A reverse shock I: results for a range of clump densities

    Authors: Florian Kirchschlager, Franziska D. Schmidt, M. J. Barlow, Erica L. Fogerty, Antonia Bevan, Felix D. Priestley

    Abstract: The reverse shock in the ejecta of core-collapse supernovae is potentially able to destroy newly formed dust material. In order to determine dust survival rates, we have performed a set of hydrodynamic simulations using the grid-based code AstroBEAR in order to model a shock wave interacting with clumpy supernova ejecta. Dust motions and destruction rates were computed using our newly developed ex… ▽ More

    Submitted 28 August, 2019; originally announced August 2019.

    Comments: Accepted by MNRAS. Author accepted manuscript. Accepted on 28/08/2019. Deposited on 28/08/19. 34 pages