-
Hamming Distance Oracle
Authors:
Itai Boneh,
Dvir Fried,
Shay Golan,
Matan Kraus
Abstract:
In this paper, we present and study the \emph{Hamming distance oracle problem}. In this problem, the task is to preprocess two strings $S$ and $T$ of lengths $n$ and $m$, respectively, to obtain a data-structure that is able to answer queries regarding the Hamming distance between a substring of $S$ and a substring of $T$.
For strings over a constant-size alphabet, we show that for every $x\le nm$ there is a data structure with $\tilde{O}(nm/x)$ preprocessing time and $O(x)$ query time. We also provide a combinatorial conditional lower bound, showing that for every $\varepsilon > 0$ and $x \le nm$ there is no data structure with query time $O(x)$ and preprocessing time $O((\frac{nm}{x})^{1-\varepsilon})$ unless combinatorial fast matrix multiplication is possible.
For strings over a general alphabet, we present a data structure with $\tilde{O}(nm/\sqrt{x})$ preprocessing time and $O(x)$ query time for every $x \le nm$.
Submitted 7 July, 2024;
originally announced July 2024.
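The query interface described in the abstract above can be illustrated with a short Python sketch. This is not the data structure from the paper: it only shows two simple baselines at the ends of the trade-off, a direct scan with no preprocessing and a full diagonal prefix table with $O(nm)$ preprocessing and $O(1)$ queries. The names `naive_query` and `DiagonalPrefixOracle`, and the 0-based `(i, j, length)` query format, are assumptions made for illustration.

```python
def _check_range(S, T, i, j, length):
    """Reject queries whose substrings fall outside S or T."""
    if i < 0 or j < 0 or length < 0 or i + length > len(S) or j + length > len(T):
        raise ValueError("substring out of range")


def naive_query(S, T, i, j, length):
    """Hamming distance between S[i:i+length] and T[j:j+length] by direct scan.

    No preprocessing; the query costs time proportional to `length`.
    """
    _check_range(S, T, i, j, length)
    return sum(1 for k in range(length) if S[i + k] != T[j + k])


class DiagonalPrefixOracle:
    """Baseline oracle (hypothetical name): O(nm) preprocessing, O(1) query."""

    def __init__(self, S, T):
        self.S, self.T = S, T
        n, m = len(S), len(T)
        # mism[i][j] = number of mismatches when S[i:] and T[j:] are compared
        # position by position until one of the two strings ends.
        self.mism = [[0] * (m + 1) for _ in range(n + 1)]
        for i in range(n - 1, -1, -1):
            for j in range(m - 1, -1, -1):
                self.mism[i][j] = self.mism[i + 1][j + 1] + (S[i] != T[j])

    def query(self, i, j, length):
        """Hamming distance between S[i:i+length] and T[j:j+length]."""
        _check_range(self.S, self.T, i, j, length)
        # Mismatches along the diagonal starting at (i, j), minus those
        # that lie beyond the queried window.
        return self.mism[i][j] - self.mism[i + length][j + length]
```

Both baselines answer the same queries; the results in the abstract concern the intermediate points between them, where preprocessing and query time are traded against each other.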
-
Multi-Agent Reinforcement Learning Meets Leaf Sequencing in Radiotherapy
Authors:
Riqiang Gao,
Florin C. Ghesu,
Simon Arberet,
Shahab Basiri,
Esa Kuusela,
Martin Kraus,
Dorin Comaniciu,
Ali Kamen
Abstract:
In contemporary radiotherapy planning (RTP), leaf sequencing, a key module, is predominantly addressed by optimization-based approaches. In this paper, we propose a novel deep reinforcement learning (DRL) model, termed Reinforced Leaf Sequencer (RLS), in a multi-agent framework for leaf sequencing. The RLS model offers improvements to time-consuming iterative optimization steps via large-scale training and can control movement patterns through the design of reward mechanisms. We have conducted experiments on four datasets with four metrics and compared our model with a leading optimization sequencer. Our findings reveal that the proposed RLS model can achieve reduced fluence reconstruction errors and potentially faster convergence when integrated into an optimization planner. Additionally, RLS has shown promising results in a full artificial intelligence RTP pipeline. We hope this pioneering multi-agent RL leaf sequencer can foster future research on machine learning for RTP.
Submitted 3 June, 2024;
originally announced June 2024.
-
A machine learning framework for interpretable predictions in patient pathways: The case of predicting ICU admission for patients with symptoms of sepsis
Authors:
Sandra Zilker,
Sven Weinzierl,
Mathias Kraus,
Patrick Zschech,
Martin Matzner
Abstract:
Proactive analysis of patient pathways helps healthcare providers anticipate treatment-related risks, identify outcomes, and allocate resources. Machine learning (ML) can leverage a patient's complete health history to make informed decisions about future events. However, previous work has mostly relied on so-called black-box models, which are unintelligible to humans, making it difficult for clinicians to apply such models. Our work introduces PatWay-Net, an ML framework designed for interpretable predictions of admission to the intensive care unit (ICU) for patients with symptoms of sepsis. We propose a novel type of recurrent neural network and combine it with multi-layer perceptrons to process the patient pathways and produce predictive yet interpretable results. We demonstrate its utility through a comprehensive dashboard that visualizes patient health trajectories, predictive outcomes, and associated risks. Our evaluation includes both predictive performance - where PatWay-Net outperforms standard models such as decision trees, random forests, and gradient-boosted decision trees - and clinical utility, validated through structured interviews with clinicians. By providing improved predictive accuracy along with interpretable and actionable insights, PatWay-Net serves as a valuable tool for healthcare decision support in the critical case of patients with symptoms of sepsis.
Submitted 21 May, 2024;
originally announced May 2024.
-
Structure-preserving particle methods for the Landau collision operator using the metriplectic framework
Authors:
Sandra Jeyakumar,
Michael Kraus,
Matthew J. Hole,
David Pfefferlé
Abstract:
We present a novel family of particle discretisation methods for the nonlinear Landau collision operator. We exploit the metriplectic structure underlying the Vlasov-Maxwell-Landau system in order to obtain discretisation schemes that automatically preserve mass, momentum, and energy, warrant monotonic dissipation of entropy, and are thus guaranteed to respect the laws of thermodynamics. In contrast to recent works that used radial basis functions and similar methods for regularisation, here we use an auxiliary spline or finite element representation of the distribution function to this end. Discrete gradient methods are employed to guarantee the aforementioned properties in the time discrete domain as well.
Submitted 29 April, 2024;
originally announced April 2024.
-
Hairpin Completion Distance Lower Bound
Authors:
Itai Boneh,
Dvir Fried,
Shay Golan,
Matan Kraus
Abstract:
Hairpin completion, derived from the hairpin formation observed in DNA biochemistry, is an operation applied to strings, particularly useful in DNA computing. Conceptually, a right hairpin completion operation transforms a string $S$ into $S\cdot S'$ where $S'$ is the reverse complement of a prefix of $S$. Similarly, a left hairpin completion operation transforms a string $S$ into $S'\cdot S$ where $S'$ is the reverse complement of a suffix of $S$. The hairpin completion distance from $S$ to $T$ is the minimum number of hairpin completion operations needed to transform $S$ into $T$. Recently Boneh et al. showed an $O(n^2)$ time algorithm for finding the hairpin completion distance between two strings of length at most $n$. In this paper we show that for any $\varepsilon>0$ there is no $O(n^{2-\varepsilon})$-time algorithm for the hairpin completion distance problem unless the Strong Exponential Time Hypothesis (SETH) is false. Thus, under SETH, the time complexity of the hairpin completion distance problem is quadratic, up to sub-polynomial factors.
Submitted 17 April, 2024;
originally announced April 2024.
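The two operations described conceptually in the abstract above can be sketched in Python as follows. This is an illustration only, under stated assumptions: the DNA alphabet with Watson-Crick complement pairs (A with T, C with G), an explicit length parameter `k` with `1 <= k <= len(s)`, and the function names are all choices made here, and any further conditions the paper places on when an operation is allowed are not modelled.

```python
# Assumed complement map over the DNA alphabet (Watson-Crick pairs).
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def reverse_complement(s):
    """Reverse the string and complement every symbol."""
    return "".join(COMPLEMENT[c] for c in reversed(s))


def right_hairpin_completion(s, k):
    """Return S . S', where S' is the reverse complement of the length-k prefix of S."""
    if not 1 <= k <= len(s):
        raise ValueError("k must satisfy 1 <= k <= len(s)")
    return s + reverse_complement(s[:k])


def left_hairpin_completion(s, k):
    """Return S' . S, where S' is the reverse complement of the length-k suffix of S."""
    if not 1 <= k <= len(s):
        raise ValueError("k must satisfy 1 <= k <= len(s)")
    return reverse_complement(s[len(s) - k:]) + s
```

For example, `right_hairpin_completion("ACG", 2)` appends the reverse complement of the prefix `AC`, and `left_hairpin_completion("ACG", 2)` prepends the reverse complement of the suffix `CG`. The hairpin completion distance counts the minimum number of such operations needed to turn $S$ into $T$.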
-
Instabilities in the Yellow Hypergiant domain
Authors:
Wolfgang Glatzel,
Michaela Kraus
Abstract:
Yellow Hypergiants (YHGs) are massive stars that are commonly interpreted to be in a post-red supergiant evolutionary state. These objects can undergo outbursts on timescales of decades, which are suspected to be due to instabilities in the envelope. To test this conjecture, the stability of envelope models for YHGs with respect to infinitesimal, radial perturbations is investigated. Violent strange mode instabilities with growth rates in the dynamical regime are identified if the luminosity to mass ratio exceeds $\approx 10^4$ in solar units. For the observed parameters of YHGs we thus predict instability. The strange mode instabilities persist over the entire range of effective temperatures from red to blue supergiants. Due to short thermal timescales and dominant radiation pressure in the envelopes of YHGs, a nonadiabatic stability analysis is mandatory, and the adiabatic analysis underlying the common perception is irrelevant. Contrary to the prevailing opinion, the models considered here do not exhibit any adiabatic instability.
Submitted 21 March, 2024;
originally announced March 2024.
-
IGANN Sparse: Bridging Sparsity and Interpretability with Non-linear Insight
Authors:
Theodor Stoecker,
Nico Hambauer,
Patrick Zschech,
Mathias Kraus
Abstract:
Feature selection is a critical component in predictive analytics that significantly affects the prediction accuracy and interpretability of models. Intrinsic methods for feature selection are built directly into model learning, providing a fast and attractive option for large amounts of data. Machine learning algorithms such as penalized regression models (e.g., lasso) are the most common choice when it comes to built-in feature selection. However, they fail to capture non-linear relationships, which ultimately affects their ability to predict outcomes in intricate datasets. In this paper, we propose IGANN Sparse, a novel machine learning model from the family of generalized additive models, which promotes sparsity through a non-linear feature selection process during training. This ensures interpretability through improved model sparsity without sacrificing predictive performance. Moreover, IGANN Sparse serves as an exploratory tool for information systems researchers to unveil important non-linear relationships in domains that are characterized by complex patterns. Our ongoing research is directed at a thorough evaluation of the IGANN Sparse model, including user studies that allow us to assess how well users of the model can benefit from the reduced number of features. This will allow for a deeper understanding of the interactions between linear vs. non-linear modeling, number of selected features, and predictive performance.
Submitted 17 March, 2024;
originally announced March 2024.
-
Breaking Abbe's diffraction limit with harmonic deactivation microscopy
Authors:
Kevin Murzyn,
Maarten L. S. van der Geest,
Leo Guery,
Zhonghui Nie,
Pieter van Essen,
Stefan Witte,
Peter M. Kraus
Abstract:
Nonlinear optical microscopy provides elegant means for label-free imaging of biological samples and condensed matter systems. The widespread areas of application could even be increased if resolution was improved, which is currently limited by the famous Abbe diffraction limit. Super-resolution techniques can break the diffraction limit but rely on fluorescent labeling. This makes them incompatible with (sub-)femtosecond temporal resolution and applications that demand the absence of labeling. Here, we introduce harmonic deactivation microscopy (HADES) for breaking the diffraction limit in non-fluorescent samples. By controlling the harmonic generation process on the quantum level with a second donut-shaped pulse, we confine the third harmonic generation to three times below the original focus size and use this pulse for scanning microscopy. We demonstrate that resolution improvement by deactivation is more efficient for higher harmonic orders, and only limited by the maximum applicable deactivation-pulse fluence. This provides a route towards sub-100~nm resolution in a regular nonlinear microscope. The new capability of label-free super-resolution can find immediate applications in condensed matter physics, semiconductor metrology, and biomedical imaging.
Submitted 11 March, 2024;
originally announced March 2024.
-
United We Pretrain, Divided We Fail! Representation Learning for Time Series by Pretraining on 75 Datasets at Once
Authors:
Maurice Kraus,
Felix Divo,
David Steinmann,
Devendra Singh Dhami,
Kristian Kersting
Abstract:
In natural language processing and vision, pretraining is utilized to learn effective representations. Unfortunately, the success of pretraining does not easily carry over to time series due to potential mismatch between sources and target. Actually, common belief is that multi-dataset pretraining does not work for time series! Au contraire, we introduce a new self-supervised contrastive pretraining approach to learn one encoding from many unlabeled and diverse time series datasets, so that the single learned representation can then be reused in several target domains for, say, classification. Specifically, we propose the XD-MixUp interpolation method and the Soft Interpolation Contextual Contrasting (SICC) loss. Empirically, this outperforms both supervised training and other self-supervised pretraining methods when finetuning on low-data regimes. This disproves the common belief: We can actually learn from multiple time series datasets, even from 75 at once.
Submitted 23 February, 2024;
originally announced February 2024.
-
Towards complete all-optical emission control of high-harmonic generation from solids
Authors:
Pieter J. van Essen,
Zhonghui Nie,
Brian de Keijzer,
Peter M. Kraus
Abstract:
Optical modulation of high-harmonics generation in solids enables the detection of material properties such as the band structure and promising new applications such as super-resolution imaging in semiconductors. Various recent studies have shown optical modulation of high-harmonics generation in solids; in particular, suppression of high-harmonics generation has been observed with synchronized or delayed multi-pulse sequences. Here we provide an overview of the underlying mechanisms attributed to this suppression and provide a perspective on the challenges and opportunities regarding these mechanisms. All-optical control of high-harmonic generation allows for femtosecond, and in the future possibly subfemtosecond, switching, which has numerous possible applications. These range from super-resolution microscopy to nanoscale controlled chemistry and highly tunable nonlinear light sources.
Submitted 23 February, 2024;
originally announced February 2024.
-
Right on Time: Revising Time Series Models by Constraining their Explanations
Authors:
Maurice Kraus,
David Steinmann,
Antonia Wüst,
Andre Kokozinski,
Kristian Kersting
Abstract:
The reliability of deep time series models is often compromised by their tendency to rely on confounding factors, which may lead to incorrect outputs. Our newly recorded, naturally confounded dataset named P2S from a real mechanical production line emphasizes this. To avoid "Clever-Hans" moments in time series, i.e., to mitigate confounders, we introduce the method Right on Time (RioT). RioT enables, for the first time, interactions with model explanations across both the time and frequency domain. Feedback on explanations in both domains is then used to constrain the model, steering it away from the annotated confounding factors. The dual-domain interaction strategy is crucial for effectively addressing confounders in time series datasets. We empirically demonstrate that RioT can effectively guide models away from the wrong reasons in P2S as well as popular time series classification and forecasting datasets.
Submitted 19 June, 2024; v1 submitted 20 February, 2024;
originally announced February 2024.
-
Towards Faithful and Robust LLM Specialists for Evidence-Based Question-Answering
Authors:
Tobias Schimanski,
Jingwei Ni,
Mathias Kraus,
Elliott Ash,
Markus Leippold
Abstract:
Advances towards more faithful and traceable answers of Large Language Models (LLMs) are crucial for various research and practical endeavors. One avenue in reaching this goal is basing the answers on reliable sources. However, this Evidence-Based QA has proven to work insufficiently with LLMs in terms of citing the correct sources (source quality) and truthfully representing the information within sources (answer attributability). In this work, we systematically investigate how to robustly fine-tune LLMs for better source quality and answer attributability. Specifically, we introduce a data generation pipeline with automated data quality filters, which can synthesize diversified high-quality training and testing data at scale. We further introduce four test sets to benchmark the robustness of fine-tuned specialist models. Extensive evaluation shows that fine-tuning on synthetic data improves performance on both in- and out-of-distribution. Furthermore, we show that data quality, which can be drastically improved by proposed quality filters, matters more than quantity in improving Evidence-Based QA.
Submitted 3 June, 2024; v1 submitted 13 February, 2024;
originally announced February 2024.
-
Rings, shells, and arc structures around B[e] supergiants: I. Classical tools of non-linear hydrodynamics
Authors:
Dieter H. Nickeler,
Michaela Kraus
Abstract:
Structures in circumstellar matter reflect both fast processes and quasi-equilibrium states. A geometrical diversity of emitting circumstellar matter is observed around evolved massive stars, in particular around B[e] supergiants. We recapitulate classical analytical tools of linear and non-linear potential theory, such as Cole-Hopf transformations and Grad-Shafranov theory, and develop them further to explain occurrence of the circumstellar matter structures and their dynamics. We use potential theory to formulate the non-linear hydrodynamical equations and test dilatations of the quasi-equilibrium initial conditions. We find that a wide range of flow patterns can basically be generated and the time scales can switch, based on initial conditions, and lead to eruptive processes, reinforcing that the non-linear fluid environment includes both quasi-stationary structures and fast processes like finite-time singularities. Some constraints and imposed symmetries can lead to Keplerian orbits, while other constraints can deliver quasi-Keplerian ones. The threshold is given by a characteristic density at the stellar surface.
Submitted 16 January, 2024;
originally announced January 2024.
-
A Globally Convergent Algorithm for Neural Network Parameter Optimization Based on Difference-of-Convex Functions
Authors:
Daniel Tschernutter,
Mathias Kraus,
Stefan Feuerriegel
Abstract:
We propose an algorithm for optimizing the parameters of single hidden layer neural networks. Specifically, we derive a blockwise difference-of-convex (DC) functions representation of the objective function. Based on the latter, we propose a block coordinate descent (BCD) approach that we combine with a tailored difference-of-convex functions algorithm (DCA). We prove global convergence of the proposed algorithm. Furthermore, we mathematically analyze the convergence rate of parameters and the convergence rate in value (i.e., the training loss). We give conditions under which our algorithm converges linearly or even faster depending on the local shape of the loss function. We confirm our theoretical derivations numerically and compare our algorithm against state-of-the-art gradient-based solvers in terms of both training loss and test loss.
Submitted 15 January, 2024;
originally announced January 2024.
-
Gravitational Bremsstrahlung in Black-Hole Scattering at $\mathcal{O}(G^3)$: Linear-in-Spin Effects
Authors:
Lara Bohnenblust,
Harald Ita,
Manfred Kraus,
Johannes Schlenk
Abstract:
We compute the far-field time-domain waveform of the gravitational waves produced in the scattering of two spinning massive objects. The results include linear-in-spin ($S$) couplings and first-order gravitational corrections ($G^3$), and are valid for encounters in the weak-field regime. Employing a field-theory framework based on the scattering of massive scalar and vector particles coupled to Einstein-Hilbert gravity, we derive results for the leading and next-to-leading spectral waveforms. We provide analytic expressions for the required scattering data, which include trees, one-loop amplitudes and their cuts. The expressions are extracted from numerical amplitude evaluations with the Caravel program, using analytic reconstruction techniques applied in the classical limit. We confirm a recent prediction for infrared physics of the classical observable, and observe the surprising appearance of an ultraviolet singularity, which drops out in the far-field waveform.
Submitted 22 December, 2023;
originally announced December 2023.
-
Volume-Preserving Transformers for Learning Time Series Data with Structure
Authors:
Benedikt Brantner,
Guillaume de Romemont,
Michael Kraus,
Zeyuan Li
Abstract:
Two of the many trends in neural network research of the past few years have been (i) the learning of dynamical systems, especially with recurrent neural networks such as long short-term memory networks (LSTMs) and (ii) the introduction of transformer neural networks for natural language processing (NLP) tasks. Both of these trends have created enormous amounts of traction, particularly the second one: transformer networks now dominate the field of NLP. Even though some work has been performed on the intersection of these two trends, those efforts were largely limited to using the vanilla transformer directly without adjusting its architecture for the setting of a physical system. In this work we use a transformer-inspired neural network to learn a dynamical system and furthermore (for the first time) imbue it with structure-preserving properties to improve long-term stability. This is shown to be of great advantage when applying the neural network to real-world applications.
Submitted 1 May, 2024; v1 submitted 18 December, 2023;
originally announced December 2023.
-
Symplectic Autoencoders for Model Reduction of Hamiltonian Systems
Authors:
Benedikt Brantner,
Michael Kraus
Abstract:
Many applications, such as optimization, uncertainty quantification and inverse problems, require repeatedly performing simulations of large-dimensional physical systems for different choices of parameters. This can be prohibitively expensive.
In order to save computational cost, one can construct surrogate models by expressing the system in a low-dimensional basis, obtained from training data. This is referred to as model reduction.
Past investigations have shown that, when performing model reduction of Hamiltonian systems, it is crucial to preserve the symplectic structure associated with the system in order to ensure long-term numerical stability.
Up to this point structure-preserving reductions have largely been limited to linear transformations. We propose a new neural network architecture in the spirit of autoencoders, which are established tools for dimension reduction and feature extraction in data science, to obtain more general mappings.
In order to train the network, a non-standard gradient descent approach is applied that leverages the differential-geometric structure emerging from the network design.
The new architecture is shown to significantly outperform existing designs in accuracy.
Submitted 15 December, 2023;
originally announced December 2023.
-
Two-Loop Master Integrals for Leading-Color $pp\to t\bar{t}H$ Amplitudes with a Light-Quark Loop
Authors:
F. Febres Cordero,
G. Figueiredo,
M. Kraus,
B. Page,
L. Reina
Abstract:
We compute the two-loop master integrals for leading-color QCD scattering amplitudes including a closed light-quark loop in $t\bar{t}H$ production at hadron colliders. Exploiting numerical evaluations in modular arithmetic, we construct a basis of master integrals satisfying a system of differential equations in $ε$-factorized form. We present the analytic form of the differential equations in terms of a minimal set of differential one-forms. We explore properties of the function space of analytic solutions to the differential equations in terms of iterative integrals which can be exploited for studying the analytic form of related scattering amplitudes. Finally, we solve the differential equations using generalized series expansions to numerically evaluate the master integrals in physical phase space. As the first computation of a set of two-loop seven-scale master integrals, our results provide valuable input for analytic studies of scattering amplitudes in processes involving massive particles and a large number of kinematic scales.
Submitted 13 December, 2023;
originally announced December 2023.
-
On the stability and pulsation in models of B[e] star MWC 137
Authors:
Sugyan Parida,
Abhay Pratap Yadav,
Michaela Kraus,
Wolfgang Glatzel,
Yogesh Chandra Joshi,
Santosh Joshi
Abstract:
B[e] type stars are characterised by strong emission lines, photometric and spectroscopic variabilities and unsteady mass-loss rates. MWC 137 is a galactic B[e] type star situated in the constellation Orion. Recent photometric observation of MWC 137 by TESS has revealed variabilities with a dominant period of 1.9 d. The origin of this variability is not known but is suspected to be stellar pulsation. To understand the nature and origin of this variability, we have constructed three different sets of models of MWC 137 and performed non-adiabatic linear stability analysis. Several low order modes are found to be unstable, among which models having mass in the range of 31 to 34 M$_{\odot}$ and 43 to 46 M$_{\odot}$ have periods close to 1.9 d. The evolution of instabilities in the non-linear regime for a model having solar chemical composition and a mass of 45 M$_{\odot}$ leads to finite amplitude pulsation with a period of 1.9 d. Therefore in the present study we confirm that this variability in MWC 137 is due to pulsation. Evolutionary tracks passing through the location of MWC 137 in the HR diagram indicate that the star is either in the post main sequence evolutionary phase or about to enter this evolutionary phase.
Submitted 21 November, 2023;
originally announced November 2023.
-
ClimateBERT-NetZero: Detecting and Assessing Net Zero and Reduction Targets
Authors:
Tobias Schimanski,
Julia Bingler,
Camilla Hyslop,
Mathias Kraus,
Markus Leippold
Abstract:
Public and private actors struggle to assess the vast amounts of information about sustainability commitments made by various institutions. To address this problem, we create a novel tool for automatically detecting corporate, national, and regional net zero and reduction targets in three steps. First, we introduce an expert-annotated data set with 3.5K text samples. Second, we train and release ClimateBERT-NetZero, a natural language classifier to detect whether a text contains a net zero or reduction target. Third, we showcase its analysis potential with two use cases: We first demonstrate how ClimateBERT-NetZero can be combined with conventional question-answering (Q&A) models to analyze the ambitions displayed in net zero and reduction targets. Furthermore, we employ the ClimateBERT-NetZero model on quarterly earning call transcripts and outline how communication patterns evolve over time. Our experiments demonstrate promising pathways for extracting and analyzing net zero and emission reduction targets at scale.
Submitted 12 October, 2023;
originally announced October 2023.
-
Searching 2D-Strings for Matching Frames
Authors:
Itai Boneh,
Dvir Fried,
Shay Golan,
Matan Kraus,
Adrian Miclaus,
Arseny Shur
Abstract:
We introduce the natural notion of a matching frame in a $2$-dimensional string. A matching frame in a $2$-dimensional $n\times m$ string $M$ is a rectangle such that the strings written on the horizontal sides of the rectangle are identical, and so are the strings written on the vertical sides of the rectangle. Formally, a matching frame in $M$ is a tuple $(u,d,\ell,r)$ such that $M[u][\ell ..r] = M[d][\ell ..r]$ and $M[u..d][\ell] = M[u..d][r]$.
In this paper, we present an algorithm for finding the maximum perimeter matching frame in a matrix $M$ in $\tilde{O}(n^{2.5})$ time (assuming $n \ge m$). Additionally, for every constant $ε > 0$ we present a near-linear $(1-ε)$-approximation algorithm for the maximum perimeter of a matching frame.
In the development of the aforementioned algorithms, we introduce inventive technical elements and uncover distinctive structural properties that we believe will captivate the curiosity of the community.
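The formal definition above can be made concrete with a small sketch. The code below is only a brute-force illustration of the definition, not the $\tilde{O}(n^{2.5})$ algorithm from the paper. The function names, the 0-based indexing, the restriction to non-degenerate frames ($u < d$, $\ell < r$), and the perimeter formula $2((d-u)+(r-\ell))$ are all assumptions made for this example.

```python
from typing import List, Optional, Tuple

Frame = Tuple[int, int, int, int]  # (u, d, l, r), 0-based, inclusive


def is_matching_frame(M: List[str], u: int, d: int, l: int, r: int) -> bool:
    """Check M[u][l..r] == M[d][l..r] and M[u..d][l] == M[u..d][r]."""
    if not (0 <= u <= d < len(M) and 0 <= l <= r < len(M[0])):
        return False
    # Horizontal sides: top row segment equals bottom row segment.
    rows_match = M[u][l:r + 1] == M[d][l:r + 1]
    # Vertical sides: left column segment equals right column segment.
    cols_match = all(M[i][l] == M[i][r] for i in range(u, d + 1))
    return rows_match and cols_match


def max_perimeter_frame_bruteforce(M: List[str]) -> Optional[Frame]:
    """Try every non-degenerate rectangle; far slower than the paper's bound.

    The perimeter is taken to be 2 * ((d - u) + (r - l)), an assumed
    convention for this sketch.
    """
    n, m = len(M), len(M[0])
    best: Optional[Frame] = None
    best_perimeter = -1
    for u in range(n):
        for d in range(u + 1, n):
            for l in range(m):
                for r in range(l + 1, m):
                    if is_matching_frame(M, u, d, l, r):
                        perimeter = 2 * ((d - u) + (r - l))
                        if perimeter > best_perimeter:
                            best, best_perimeter = (u, d, l, r), perimeter
    return best


if __name__ == "__main__":
    grid = ["abca",
            "xyzx",
            "abca"]
    print(max_perimeter_frame_bruteforce(grid))
```

A checker of this kind is mainly useful as a reference against which a faster implementation can be tested on small inputs.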
Submitted 18 April, 2024; v1 submitted 4 October, 2023;
originally announced October 2023.
-
Counterfactual Image Generation for adversarially robust and interpretable Classifiers
Authors:
Rafael Bischof,
Florian Scheidegger,
Michael A. Kraus,
A. Cristiano I. Malossi
Abstract:
Neural Image Classifiers are effective but inherently hard to interpret and susceptible to adversarial attacks. Solutions to both problems exist, among others, in the form of counterfactual example generation to enhance explainability or adversarial augmentation of training datasets for improved robustness. However, existing methods address only one of the issues. We propose a unified framework leveraging image-to-image translation Generative Adversarial Networks (GANs) to produce counterfactual samples that highlight salient regions for interpretability and act as adversarial samples to augment the dataset for more robustness. This is achieved by combining the classifier and discriminator into a single model that attributes real images to their respective classes and flags generated images as "fake". We assess the method's effectiveness by evaluating (i) the produced explainability masks on a semantic segmentation task for concrete cracks and (ii) the model's resilience against the Projected Gradient Descent (PGD) attack on a fruit defects detection problem. Our produced saliency maps are highly descriptive, achieving competitive IoU values compared to classical segmentation models despite being trained exclusively on classification labels. Furthermore, the model exhibits improved robustness to adversarial attacks, and we show how the discriminator's "fakeness" value serves as an uncertainty measure of the predictions.
Submitted 1 October, 2023;
originally announced October 2023.
-
A structure-preserving particle discretisation for the Lenard-Bernstein collision operator
Authors:
Sandra Jeyakumar,
Michael Kraus,
Matthew Hole,
David Pfefferlé
Abstract:
Collisions are an important dissipation mechanism in plasmas. In one-dimensional modelling, a commonly used collision operator is the Lenard-Bernstein operator, or its modified energy- and momentum-conserving counterpart. When approximating such operators numerically, it is important to respect their structure in order to satisfy the laws of thermodynamics. It is, however, challenging to discretise such operators in a structure-preserving way when considering particle methods. In this work, we present a macro-particle discretisation of the Lenard-Bernstein collision operator that is energy and momentum preserving.
Submitted 1 February, 2024; v1 submitted 28 September, 2023;
originally announced September 2023.
-
Unveiling the evolutionary state of three B supergiant stars: PU Gem, $ε$ CMa and $η$ CMa
Authors:
Julieta P. Sánchez Arias,
Péter Németh,
Elisson S. G. de Almeida,
Matias A. Ruiz Diaz,
Michaela Kraus,
Maximiliano Haucke
Abstract:
We aim to combine asteroseismology, spectroscopy, and evolutionary models to establish a comprehensive picture of the evolution of Galactic blue supergiant stars (BSG). To start such an investigation, we selected three BSG candidates for our analysis: HD 42087 (PU Gem), HD 52089 ($ε$ CMa) and HD 58350 ($η$ CMa). These stars show pulsations and were suspected to be in an evolutionary stage either preceding or succeeding the red supergiant (RSG) stage.
For our analysis, we utilized the 2-min cadence TESS data to study the photometric variability and obtained new spectroscopic observations at the CASLEO observatory. We calculated CMFGEN non-LTE radiative transfer models and derived stellar and wind parameters using the iterative spectral analysis pipeline XTGRID. The spectral modeling was limited to changing only the effective temperature, surface gravity, CNO abundances, and mass-loss rates. Finally, we compared the derived metal abundances with predictions from Geneva stellar evolution models. The frequency spectra of all three stars show either stochastic oscillations, nonradial strange modes, or a rotational splitting.
We conclude that the rather short sectoral observing windows of TESS prevent establishing a reliable mode identification of low frequencies connected to mass-loss variabilities. The spectral analysis confirmed gradual changes in the mass-loss rates and the derived CNO abundances comply with the values reported in the literature. We were able to achieve a quantitative match with stellar evolution models for the stellar masses and luminosities. However, the spectroscopic surface abundances turned out to be inconsistent with theoretical predictions. The stars show N enrichment, typical for CNO cycle processed material, but the abundance ratios do not reflect the associated levels of C and O depletion.
Submitted 24 August, 2023;
originally announced August 2023.
-
Data-Driven Allocation of Preventive Care With Application to Diabetes Mellitus Type II
Authors:
Mathias Kraus,
Stefan Feuerriegel,
Maytal Saar-Tsechansky
Abstract:
Problem Definition. Increasing costs of healthcare highlight the importance of effective disease prevention. However, decision models for allocating preventive care are lacking.
Methodology/Results. In this paper, we develop a data-driven decision model for determining a cost-effective allocation of preventive treatments to patients at risk. Specifically, we combine counterfactual inference, machine learning, and optimization techniques to build a scalable decision model that can exploit high-dimensional medical data, such as the data found in modern electronic health records. Our decision model is evaluated based on electronic health records from 89,191 prediabetic patients. We compare the allocation of preventive treatments (metformin) prescribed by our data-driven decision model with that of current practice. We find that if our approach is applied to the U.S. population, it can yield annual savings of $1.1 billion. Finally, we analyze the cost-effectiveness under varying budget levels.
Managerial Implications. Our work supports decision-making in health management, with the goal of achieving effective disease prevention at lower costs. Importantly, our decision model is generic and can thus be used for effective allocation of preventive care for other preventable diseases.
Submitted 14 August, 2023;
originally announced August 2023.
-
CHATREPORT: Democratizing Sustainability Disclosure Analysis through LLM-based Tools
Authors:
Jingwei Ni,
Julia Bingler,
Chiara Colesanti-Senni,
Mathias Kraus,
Glen Gostlow,
Tobias Schimanski,
Dominik Stammbach,
Saeid Ashraf Vaghefi,
Qian Wang,
Nicolas Webersinke,
Tobias Wekhof,
Tingyu Yu,
Markus Leippold
Abstract:
In the face of climate change, are companies really taking substantial steps toward more sustainable operations? A comprehensive answer lies in the dense, information-rich landscape of corporate sustainability reports. However, the sheer volume and complexity of these reports make human analysis very costly. Therefore, only a few entities worldwide have the resources to analyze these reports at scale, which leads to a lack of transparency in sustainability reporting. Empowering stakeholders with LLM-based automatic analysis tools can be a promising way to democratize sustainability report analysis. However, developing such tools is challenging due to (1) the hallucination of LLMs and (2) the inefficiency of bringing domain experts into the AI development loop. In this paper, we introduce ChatReport, a novel LLM-based system to automate the analysis of corporate sustainability reports, addressing existing challenges by (1) making the answers traceable to reduce the harm of hallucination and (2) actively involving domain experts in the development loop. We make our methodology, annotated datasets, and generated analyses of 1015 reports publicly available.
Submitted 11 October, 2023; v1 submitted 28 July, 2023;
originally announced July 2023.
-
BCD spectrophotometry for massive stars in transition phases
Authors:
Y. J. Aidelman,
M. Borges Fernandes,
L. S. Cidale,
A. Smith Castelli,
M. L. Arias,
J. Zorec,
M. Kraus,
A. Torres,
T. B. Souza,
Y. R. Cochetti
Abstract:
Context. Stars in transition phases, like those showing the B[e] phenomenon and luminous blue variables (LBVs), undergo strong, often irregular mass ejection events. The prediction of these phases in stellar evolution models is therefore extremely difficult if not impossible. As a result, their effective temperatures, their luminosities and even their true nature are not fully known.
Aims. A suitable procedure to derive the stellar parameters of these types of objects is to use the BCD spectrophotometric classification system, based on the analysis of the Balmer discontinuity. The BCD parameters ($λ_1$, $D$) are independent of interstellar extinction and circumstellar contributions.
Methods. We obtained low-resolution spectra for 14 stars with the B[e] phenomenon and LBVs. Using the BCD method, we derived the stellar and physical parameters. The study was complemented with the information provided by the JHK colour-colour diagram.
Results. For each star, the BCD system gives a complete set of fundamental parameters and related quantities such as luminosity and distance. We confirmed HK Ori, HD 323771 and HD 52721 as pre-main sequence HAe/B[e], AS 202 and HD 85567 as FS CMa-type, and HD 62623 as sgB[e] stars. We classified Hen 3-847, CD-24 5721, and HD 53367 as young B[e] stars or FS CMa-type candidates, and HD 58647 as a slightly evolved B[e] star. In addition, Hen 3-1398 is an sgB[e] and MWC 877, CPD-59 2854 and LHA 120-S 65 are LBV candidates. The stellar parameters of the latter two LBVs are determined for the first time.
Conclusions. Our results emphasise that the BCD system is a highly valuable tool to derive stellar parameters and physical properties of B-type stars in transition phases. This method can be combined with near-IR colour-colour diagrams to determine or confirm the evolutionary stage of emission-line stars with dust disks.
Submitted 21 July, 2023;
originally announced July 2023.
-
The cubic Dirac operator on compact quotients of the oscillator group
Authors:
Ines Kath,
Margarita Kraus
Abstract:
We determine the spectrum of Kostant's cubic Dirac operator $D^{1/3}$ on locally symmetric Lorentzian manifolds of the form $Γ\backslash {\rm Osc}_1$, where ${\rm Osc}_1$ is the four-dimensional oscillator group and $Γ\subset {\rm Osc}_1$ is a (cocompact) lattice. Moreover, we give an explicit decomposition of the regular representation of ${\rm Osc}_1$ on $L^2$-sections of the spinor bundle into irreducible subrepresentations and we determine the eigenspaces of $D^{1/3}$.
Submitted 4 July, 2023;
originally announced July 2023.
-
New Insight into the FS CMa System MWC 645 from Near-Infrared and Optical Spectroscopy
Authors:
Andrea F. Torres,
María L. Arias,
Michaela Kraus,
Lorena V. Mercanti,
Tõnis Eenmäe
Abstract:
The B[e] phenomenon is manifested by a heterogeneous group of stars surrounded by gaseous and dusty circumstellar envelopes with similar physical conditions. Among these stars, the FS CMa-type objects are suspected to be binary systems, which could be experiencing or have undergone a mass-transfer process that could explain the large amount of material surrounding them. We aim to contribute to the knowledge of a recently confirmed binary, MWC 645, which could be undergoing an active mass-transfer process. We present near-infrared and optical spectra, identify atomic and molecular spectral features, and derive different quantitative properties of line profiles. Based on publicly available photometric data, we search for periodicity in the light curve and model the spectral energy distribution. We have detected molecular bands of CO in absorption at 1.62 $μ$m and 2.3 $μ$m for the first time. We derive an upper limit for the effective temperature of the cool binary component. We found a correlation between the enhancement of the H$α$ emission and the decrease in optical brightness that could be associated with mass-ejection events or an increase in mass loss. We outline the global properties of the envelope, possibly responsible for brightness variations due to a variable extinction, and briefly speculate on different possible scenarios.
Submitted 28 June, 2023;
originally announced June 2023.
-
Paradigm Shift in Sustainability Disclosure Analysis: Empowering Stakeholders with CHATREPORT, a Language Model-Based Tool
Authors:
Jingwei Ni,
Julia Bingler,
Chiara Colesanti-Senni,
Mathias Kraus,
Glen Gostlow,
Tobias Schimanski,
Dominik Stammbach,
Saeid Ashraf Vaghefi,
Qian Wang,
Nicolas Webersinke,
Tobias Wekhof,
Tingyu Yu,
Markus Leippold
Abstract:
This paper introduces a novel approach to enhance Large Language Models (LLMs) with expert knowledge to automate the analysis of corporate sustainability reports by benchmarking them against the Task Force for Climate-Related Financial Disclosures (TCFD) recommendations. Corporate sustainability reports are crucial in assessing organizations' environmental and social risks and impacts. However, the vast amount of information in these reports often makes human analysis too costly. As a result, only a few entities worldwide have the resources to analyze these reports, which could lead to a lack of transparency. While AI-powered tools can automatically analyze the data, they are prone to inaccuracies as they lack domain-specific expertise. This paper introduces a novel approach to enhance LLMs with expert knowledge to automate the analysis of corporate sustainability reports. We christen our tool CHATREPORT, and apply it in a first use case to assess corporate climate risk disclosures following the TCFD recommendations. CHATREPORT results from collaborating with experts in climate science, finance, economic policy, and computer science, demonstrating how domain experts can be involved in developing AI tools. We make our prompt templates, generated data, and scores available to the public to encourage transparency.
Submitted 16 November, 2023; v1 submitted 27 June, 2023;
originally announced June 2023.
-
Dense Molecular Environments of B[e] Supergiants and Yellow Hypergiants
Authors:
Michaela Kraus,
Michalis Kourniotis,
Maria Laura Arias,
Andrea F. Torres,
Dieter H. Nickeler
Abstract:
Massive stars expel large amounts of mass during their late evolutionary phases. We aim to unveil the physical conditions within the warm molecular environments of B[e] supergiants (B[e]SGs) and yellow hypergiants (YHGs), which are known to be embedded in circumstellar shells and disks. We present K-band spectra of two B[e]SGs from the Large Magellanic Cloud and four Galactic YHGs. The CO band emission detected from the B[e]SGs LHA 120-S 12 and LHA 120-S 134 suggests that these stars are surrounded by stable rotating molecular rings. The spectra of the YHGs display a rather diverse appearance. The objects 6 Cas and V509 Cas lack any molecular features. The star [FMR2006] 15 displays blue-shifted CO bands in emission, which might be explained by a possible close to pole-on oriented bipolar outflow. In contrast, HD 179821 shows blue-shifted CO bands in absorption. While the star itself is too hot to form molecules in its outer atmosphere, we propose that it might have experienced a recent outburst. We speculate that we currently can only see the approaching part of the expelled matter because the star itself might still block the receding parts of a (possibly) expanding gas shell.
Submitted 19 June, 2023;
originally announced June 2023.
-
NLO QCD predictions for off-shell $t\bar{t}W$ production in association with a light jet at the LHC
Authors:
Huan-Yu Bi,
Manfred Kraus,
Minos Reinartz,
Malgorzata Worek
Abstract:
In view of the persisting tension between theoretical predictions and the LHC data for the $pp \to t\bar{t}W^\pm$ production process, we present the state-of-the-art full off-shell NLO QCD result for $pp \to t\bar{t}W^+\, j+X$. We concentrate on the multi-lepton decay channel at the LHC with $\sqrt{s}= 13$ TeV. In our calculation off-shell top quarks and gauge bosons are described by Breit-Wigner propagators, furthermore, double-, single- as well as non-resonant top-quark contributions along with all interference effects are consistently incorporated at the matrix element level. We present results for both integrated and differential fiducial cross sections for various renormalisation and factorisation scale settings and different PDF sets. With a fairly inclusive choice of cuts and regardless of the scale and PDF choice, non-flat differential ${\cal K}$-factors are obtained for many observables that we have examined. Since from an experimental point of view, both processes $pp \to t\bar{t}W^\pm j+X$ and $pp\to t\bar{t}W^\pm +X$ consist of similar final states we investigate the effect of additional jet activity on the integrated and differential fiducial cross sections. For this purpose, the normalised differential distributions for $pp \to e^+ν_e\, μ^-\barν_μ\, τ^+ν_τ\, b\bar{b} \,j+X$ and $pp \to e^+ν_e\, μ^-\barν_μ\, τ^+ν_τ\, b\bar{b} +X$ are compared. The theoretical results for the latter process are also recalculated.
Submitted 7 September, 2023; v1 submitted 5 May, 2023;
originally announced May 2023.
-
Large-Scale Ejecta of Z CMa -- Proper Motion Study and New Features Discovered
Authors:
Tiina Liimets,
Michaela Kraus,
Lydia Cidale,
Sergey Karpov,
Anthony Marston
Abstract:
Z Canis Majoris is a fascinating early-type binary with a Herbig Be primary and a FU Orionis-type secondary. Both of the stars exhibit sub-arcsecond jet-like ejecta. In addition, the primary is associated with the extended jet as well as with the large-scale outflow. In this study, we investigate further the nature of the large-scale outflow, which has not been studied since its discovery almost three and a half decades ago. We present proper motion measurements of individual features of the large-scale outflow and determine their kinematical ages. Furthermore, with our newly acquired deep images, we have discovered additional faint arc-shaped features that can be associated with the central binary.
Submitted 4 May, 2023;
originally announced May 2023.
-
Development of a Trust-Aware User Simulator for Statistical Proactive Dialog Modeling in Human-AI Teams
Authors:
Matthias Kraus,
Ron Riekenbrauck,
Wolfgang Minker
Abstract:
The concept of a Human-AI team has gained increasing attention in recent years. For effective collaboration between humans and AI teammates, proactivity is crucial for close coordination and effective communication. However, the design of adequate proactivity for AI-based systems to support humans is still an open question and a challenging topic. In this paper, we present the development of a corpus-based user simulator for training and testing proactive dialog policies. The simulator incorporates informed knowledge about proactive dialog and its effect on user trust and simulates user behavior and personal information, including socio-demographic features and personality traits. Two different simulation approaches were compared, and a task-step-based approach yielded better overall results due to enhanced modeling of sequential dependencies. This research presents a promising avenue for exploring and evaluating appropriate proactive strategies in a dialog game setting for improving Human-AI teams.
Submitted 18 June, 2023; v1 submitted 24 April, 2023;
originally announced April 2023.
-
chatClimate: Grounding Conversational AI in Climate Science
Authors:
Saeid Ashraf Vaghefi,
Qian Wang,
Veruska Muccione,
Jingwei Ni,
Mathias Kraus,
Julia Bingler,
Tobias Schimanski,
Chiara Colesanti-Senni,
Nicolas Webersinke,
Christian Huggel,
Markus Leippold
Abstract:
Large Language Models (LLMs) have made significant progress in recent years, achieving remarkable results in question-answering tasks (QA). However, they still face two major challenges: hallucination and outdated information after the training phase. These challenges take center stage in critical domains like climate change, where obtaining accurate and up-to-date information from reliable sources in a limited time is essential and difficult. To overcome these barriers, one potential solution is to provide LLMs with access to external, scientifically accurate, and robust sources (long-term memory) to continuously update their knowledge and prevent the propagation of inaccurate, incorrect, or outdated information. In this study, we enhanced GPT-4 by integrating the information from the Sixth Assessment Report of the Intergovernmental Panel on Climate Change (IPCC AR6), the most comprehensive, up-to-date, and reliable source in this domain. We present our conversational AI prototype, available at www.chatclimate.ai, and demonstrate its ability to answer challenging questions accurately in three different QA scenarios: asking from 1) GPT-4, 2) chatClimate, and 3) hybrid chatClimate. The answers and their sources were evaluated by our team of IPCC authors, who used their expert knowledge to score the accuracy of the answers from 1 (very-low) to 5 (very-high). The evaluation showed that the hybrid chatClimate provided more accurate answers, highlighting the effectiveness of our solution. This approach can be easily scaled for chatbots in specific domains, enabling the delivery of reliable and accurate information.
Submitted 28 April, 2023; v1 submitted 11 April, 2023;
originally announced April 2023.
-
Enhancing Large Language Models with Climate Resources
Authors:
Mathias Kraus,
Julia Anna Bingler,
Markus Leippold,
Tobias Schimanski,
Chiara Colesanti Senni,
Dominik Stammbach,
Saeid Ashraf Vaghefi,
Nicolas Webersinke
Abstract:
Large language models (LLMs) have significantly transformed the landscape of artificial intelligence by demonstrating their ability in generating human-like text across diverse topics. However, despite their impressive capabilities, LLMs lack recent information and often employ imprecise language, which can be detrimental in domains where accuracy is crucial, such as climate change. In this study, we make use of recent ideas to harness the potential of LLMs by viewing them as agents that access multiple sources, including databases containing recent and precise information about organizations, institutions, and companies. We demonstrate the effectiveness of our method through a prototype agent that retrieves emission data from ClimateWatch (https://www.climatewatchdata.org/) and leverages general Google search. By integrating these resources with LLMs, our approach overcomes the limitations associated with imprecise language and delivers more reliable and accurate information in the critical domain of climate change. This work paves the way for future advancements in LLMs and their application in domains where precision is of paramount importance.
Submitted 31 March, 2023;
originally announced April 2023.
-
Linking High-Harmonic Generation and Strong-Field Ionization in Bulk Crystals
Authors:
Peter Jürgens,
Sylvianne D. C. Roscam Abbing,
Mark Mero,
Graham G. Brown,
Marc J. J. Vrakking,
Alexandre Mermillod-Blondin,
Peter M. Kraus,
Anton Husakou
Abstract:
The generation of high-order harmonics in bulk solids subjected to intense ultrashort laser pulses has opened up new avenues for research in extreme nonlinear optics and light-matter interaction on sub-cycle timescales. Despite significant advancement over the past decade, a complete understanding of the involved phenomena is still lacking. High-harmonic generation in solids is currently understood as arising from nonlinear intraband currents, interband recollision and ionization-related phenomena. As all of these mechanisms involve or rely upon laser-driven excitation we combine measurements of the angular dependence of nonlinear absorption and high-order harmonic generation in bulk crystals to demonstrate the relation between high-harmonic emission and nonlinear, laser-induced ionization in solids.
An unambiguous correlation between the emission of harmonics and laser-induced ionization is found experimentally and is supported by numerical solutions of the semiconductor Bloch equations and calculations of orientation-dependent ionization rates using maximally localized Wannier functions.
Submitted 20 March, 2023;
originally announced March 2023.
-
ForDigitStress: A multi-modal stress dataset employing a digital job interview scenario
Authors:
Alexander Heimerl,
Pooja Prajod,
Silvan Mertes,
Tobias Baur,
Matthias Kraus,
Ailin Liu,
Helen Risack,
Nicolas Rohleder,
Elisabeth André,
Linda Becker
Abstract:
We present a multi-modal stress dataset that uses digital job interviews to induce stress. The dataset provides multi-modal data of 40 participants including audio, video (motion capturing, facial recognition, eye tracking) as well as physiological information (photoplethysmography, electrodermal activity). In addition to that, the dataset contains time-continuous annotations for stress and occurred emotions (e.g. shame, anger, anxiety, surprise). In order to establish a baseline, five different machine learning classifiers (Support Vector Machine, K-Nearest Neighbors, Random Forest, Long-Short-Term Memory Network) have been trained and evaluated on the proposed dataset for a binary stress classification task. The best-performing classifier achieved an accuracy of 88.3% and an F1-score of 87.5%.
Submitted 14 March, 2023;
originally announced March 2023.
-
A metriplectic formulation of polarized radiative transfer
Authors:
Vincent Bosboom,
Michael Kraus,
Matthias Schlottbom
Abstract:
We present a metriplectic formulation of the radiative transfer equation with polarization and varying refractive index and show that this formulation automatically satisfies the first two laws of thermodynamics. In particular, the derived antisymmetric bracket enjoys the Jacobi identity. To obtain this formulation we suitably transform the equation and show that important physical quantities derived from the solution remain invariant under such a transformation.
Submitted 18 January, 2023;
originally announced January 2023.
-
Does It Affect You? Social and Learning Implications of Using Cognitive-Affective State Recognition for Proactive Human-Robot Tutoring
Authors:
Matthias Kraus,
Diana Betancourt,
Wolfgang Minker
Abstract:
Using robots in educational contexts has already been shown to be beneficial for a student's learning and social behaviour. To elevate them to the next level of providing more effective and human-like tutoring, the ability to adapt to the user and to express proactivity is fundamental. By acting proactively, intelligent robotic tutors anticipate possible situations where problems for the student may arise and act in advance for preventing negative outcomes. Still, the decisions of when and how to behave proactively are open questions. Therefore, this paper deals with the investigation of how the student's cognitive-affective states can be used by a robotic tutor for triggering proactive tutoring dialogue. In doing so, it is aimed to improve the learning experience. For this reason, a concept learning task scenario was observed where a robotic assistant proactively helped when negative user states were detected. In a learning task, the user's states of frustration and confusion were deemed to have negative effects on the outcome of the task and were used to trigger proactive behaviour. In an empirical user study with 40 undergraduate and doctoral students, we studied whether the initiation of proactive behaviour after the detection of signs of confusion and frustration improves the student's concentration and trust in the agent. Additionally, we investigated which level of proactive dialogue is useful for promoting the student's concentration and trust. The results show that high proactive behaviour harms trust, especially when triggered during negative cognitive-affective states but contributes to keeping the student focused on the task when triggered in these states. Based on our study results, we further discuss future steps for improving the proactive assistance of robotic tutoring systems.
Submitted 20 December, 2022;
originally announced December 2022.
-
Design Space Exploration and Explanation via Conditional Variational Autoencoders in Meta-model-based Conceptual Design of Pedestrian Bridges
Authors:
Vera M. Balmer,
Sophia V. Kuhn,
Rafael Bischof,
Luis Salamanca,
Walter Kaufmann,
Fernando Perez-Cruz,
Michael A. Kraus
Abstract:
For conceptual design, engineers rely on conventional iterative (often manual) techniques. Emerging parametric models facilitate design space exploration based on quantifiable performance metrics, yet remain time-consuming and computationally expensive. Pure optimisation methods, however, ignore qualitative aspects (e.g. aesthetics or construction methods). This paper provides a performance-driven design exploration framework to augment the human designer through a Conditional Variational Autoencoder (CVAE), which serves as forward performance predictor for given design features as well as an inverse design feature predictor conditioned on a set of performance requests. The CVAE is trained on 18'000 synthetically generated instances of a pedestrian bridge in Switzerland. Sensitivity analysis is employed for explainability and informing designers about (i) relations of the model between features and/or performances and (ii) structural improvements under user-defined objectives. A case study proved our framework's potential to serve as a future co-pilot for conceptual design studies of pedestrian bridges and beyond.
Submitted 29 November, 2022;
originally announced November 2022.
-
Theory advances for $t\bar{t}W$ multi-lepton predictions
Authors:
Manfred Kraus
Abstract:
We report on recent theoretical advances in the description of the $pp\to t\bar{t}W$ process. First, we discuss a comparison of many state-of-the-art predictions for multi-lepton signatures including the leading QCD contributions at $\mathcal{O}(\alpha_s^3\alpha^6)$ as well as subleading EW contributions at $\mathcal{O}(\alpha_s\alpha^8)$. Furthermore, we briefly discuss recent improvements using multi-jet merging techniques.
Submitted 28 November, 2022;
originally announced November 2022.
-
Improving Proactive Dialog Agents Using Socially-Aware Reinforcement Learning
Authors:
Matthias Kraus,
Nicolas Wagner,
Ron Riekenbrauck,
Wolfgang Minker
Abstract:
The next step for intelligent dialog agents is to escape their role as silent bystanders and become proactive. Well-defined proactive behavior may improve human-machine cooperation, as the agent takes a more active role during interaction and takes responsibility off the user. However, proactivity is a double-edged sword because poorly executed pre-emptive actions may have a devastating effect not only on the task outcome but also on the relationship with the user. For designing adequate proactive dialog strategies, we propose a novel approach including both social as well as task-relevant features in the dialog. Here, the primary goal is to optimize proactive behavior so that it is task-oriented (implying high task success and efficiency) while also being socially effective by fostering user trust. Including both aspects in the reward function for training a proactive dialog agent using reinforcement learning showed the benefit of our approach for more successful human-machine cooperation.
Submitted 22 June, 2023; v1 submitted 25 November, 2022;
originally announced November 2022.
-
V838 Mon: A slow waking up of Sleeping Beauty?
Authors:
T. Liimets,
I. Kolka,
M. Kraus,
T. Eenmäe,
T. Tuvikene,
T. Augusteijn,
L. Antunes Amaral,
A. A. Djupvik,
J. H. Telting,
B. Deshev,
E. Kankare,
J. Kankare,
J. E. Lindberg,
T. M. Amby,
T. Pursimo,
A. Somero,
A. Thygesen,
P. A. Strøm
Abstract:
Context. V838 Monocerotis is a peculiar binary that underwent an immense stellar explosion in 2002, leaving behind an expanding cool supergiant and a hot B3V companion. Five years after the outburst, the B3V companion disappeared from view, and it has so far not recovered. Aims. We investigate the changes in the light curve and spectral features. Methods. A monitoring campaign has been performed during the past 13 years with the Nordic Optical Telescope to obtain optical photometric and spectroscopic data. The data sets are used to analyse the temporal evolution of the spectral features and the spectral energy distribution, and to characterize the object. Results. Our photometric data show a steady brightening in all bands during the past 13 years, which is particularly prominent in the blue. This rise is also reflected in the spectra, showing a gradual relative increase in the continuum flux at shorter wavelengths. In addition, a slow brightening of the Hα emission line starting in 2015 was detected. These changes might imply that the B3V companion is slowly reappearing. During the same time interval, our analysis reveals a considerable change in the observed colours of the object along with a steady decrease in the strength and width of molecular absorption bands in our low-resolution spectra. These changes suggest a rising temperature of the cool supergiant along with a weakening of its wind, most likely combined with a slow recovery of the secondary due to the evaporation of the dust and accretion of the material from the shell in which the hot companion is embedded. From our medium-resolution spectra, we find that the heliocentric radial velocity of the atomic absorption line of Ti I 6556.06 Å has been stable for more than a decade. We propose that Ti I lines are tracing the velocity of the red supergiant in V838 Mon, and not representing the infalling matter as previously stated.
Submitted 30 November, 2022; v1 submitted 12 November, 2022;
originally announced November 2022.
-
Reducing Down(stream)time: Pretraining Molecular GNNs using Heterogeneous AI Accelerators
Authors:
Jenna A. Bilbrey,
Kristina M. Herman,
Henry Sprueill,
Soritis S. Xantheas,
Payel Das,
Manuel Lopez Roldan,
Mike Kraus,
Hatem Helal,
Sutanay Choudhury
Abstract:
The demonstrated success of transfer learning has popularized approaches that involve pretraining models from massive data sources and subsequent finetuning towards a specific task. While such approaches have become the norm in fields such as natural language processing, implementation and evaluation of transfer learning approaches for chemistry are in the early stages. In this work, we demonstrate finetuning for downstream tasks on a graph neural network (GNN) trained over a molecular database containing 2.7 million water clusters. The use of Graphcore IPUs as an AI accelerator for training molecular GNNs reduces training time from a reported 2.7 days on 0.5M clusters to 1.2 hours on 2.7M clusters. Finetuning the pretrained model for downstream tasks of molecular dynamics and transfer to a different potential energy surface took only 8.3 hours and 28 minutes, respectively, on a single GPU.
Submitted 8 November, 2022;
originally announced November 2022.
-
Construction of Global Solutions to the Linearized Field Equations for Causal Variational Principles
Authors:
Felix Finster,
Margarita Kraus
Abstract:
We give a novel construction of global solutions to the linearized field equations for causal variational principles. The method is to glue together local solutions supported in lens-shaped regions. As applications, causal Green's operators and cone structures are introduced.
Submitted 3 January, 2024; v1 submitted 29 October, 2022;
originally announced October 2022.
-
Efficient extreme-ultraviolet high-order wave mixing from laser-dressed silica
Authors:
Sylvianne D. C. Roscam Abbing,
Filippo Campi,
Brian de Keijzer,
Corentin Morice,
Zhuang-Yan Zhang,
Maarten L. S. van der Geest,
Peter M. Kraus
Abstract:
The emission of high-order harmonics from solids \cite{ghimire11a,schubert14a,luu15a,golde08a} under intense laser-pulse irradiation is revolutionizing our understanding of strong-field solid-light interactions \cite{ghimire11a,schubert14a,luu15a,vampa15b,yoshikawa17a,hafez18a,jurgens20a}, while simultaneously opening avenues towards novel, all-solid, coherent, short-wavelength table-top sources with tailored emission profiles and nanoscale light-field control \cite{franz19a,roscamCLEO21}. To date, broadband spectra have been generated well into the extreme-ultraviolet (XUV) \cite{luu15a,luu18b,han19a,uzan20a}, but the comparatively low conversion efficiency still lags behind gas-based high-harmonic generation (HHG) sources \cite{luu15a,luu18b} and has hindered wider-spread applications. Here, we overcome the low conversion efficiency by two-color wave mixing. A quantum theory reveals that our experiments follow a novel generation mechanism where the conventional interband and intraband nonlinear dynamics are boosted by Floquet-Bloch dressed states that make solid HHG in the XUV more efficient by at least one order of magnitude. Emission intensity scalings that follow perturbative optical wave mixing, combined with the angular separation of the emitted frequencies, make our approach a decisive step for all-solid coherent XUV sources and for studying light-engineered materials.
Submitted 30 September, 2022;
originally announced September 2022.
-
Report of the Topical Group on Top quark physics and heavy flavor production for Snowmass 2021
Authors:
Reinhard Schwienhorst,
Doreen Wackeroth,
Kaustubh Agashe,
Simone Alioli,
Javier Aparisi,
Giuseppe Bevilacqua,
Huan-Yu Bi,
Raymond Brock,
Abel Gutierrez Camacho,
Fernando Febres Cordero,
Jorge de Blas,
Regina Demina,
Yong Du,
Gauthier Durieux,
Jarrett Fein,
Roberto Franceschini,
Juan Fuster,
Maria Vittoria Garzelli,
Alessandro Gavardi,
Jason Gombas,
Christoph Grojean,
Jiale Gu,
Marco Guzzi,
Heribertus Bayu Hartanto,
Andre Hoang
, et al. (46 additional authors not shown)
Abstract:
This report summarizes the work of the Energy Frontier Topical Group on EW Physics: Heavy flavor and top quark physics (EF03) of the 2021 Community Summer Study (Snowmass). It aims to highlight the physics potential of top-quark studies and heavy-flavor production processes (bottom and charm) at the HL-LHC and possible future hadron and lepton colliders and running scenarios.
Submitted 6 November, 2022; v1 submitted 22 September, 2022;
originally announced September 2022.
-
Improved Perception of AEC Construction Details via Immersive Teaching in Virtual Reality
Authors:
Michael Kraus,
Romana Rust,
Maximilian Rietschel,
Daniel Hall
Abstract:
This work proposes, implements and tests an immersive framework upon Virtual Reality (VR) for comprehension, knowledge development and learning process assisting an improved perception of complex spatial arrangements in AEC in comparison to the traditional 2D projection drawing-based method. The research focuses on the prototypical example of construction details as a traditionally difficult teaching task for conveying geometric and semantic information to students. Our mixed-methods study analyses test results of two test panel groups upon different questions about geometric and functional aspects of the construction detail as well as surveys and interviews of participating lecturers, students and laypersons towards their experience using the VR tool. The quantitative analysis of the test results proves that for participants with little pre-existing knowledge (such as novice students), a significantly better learning score for the test group is detected. Moreover, both groups rated the VR experience as an enjoyable and engaging way of learning. Analysis of survey results towards the VR experience reveals that students, lecturers and professionals alike enjoyed the VR experience more than traditional learning of the construction detail. During the post-experiment qualitative evaluation in the form of interviews, the panel expressed an improved understanding, increased enthusiasm for the topic, and greater desire for other topics to be presented using VR tools. The expressed better understanding of design concepts after the VR experience by the students is statistically significant on average in the exam results. The results support our core assumption that the presentation of contextual 3D models is a promising teaching approach to illustrate content.
Submitted 21 September, 2022;
originally announced September 2022.
-
Environmental Claim Detection
Authors:
Dominik Stammbach,
Nicolas Webersinke,
Julia Anna Bingler,
Mathias Kraus,
Markus Leippold
Abstract:
To transition to a green economy, environmental claims made by companies must be reliable, comparable, and verifiable. To analyze such claims at scale, automated methods are needed to detect them in the first place. However, there exist no datasets or models for this. Thus, this paper introduces the task of environmental claim detection. To accompany the task, we release an expert-annotated dataset and models trained on this dataset. We preview one potential application of such models: We detect environmental claims made in quarterly earnings calls and find that the number of environmental claims has steadily increased since the Paris Agreement in 2015.
Submitted 26 May, 2023; v1 submitted 1 September, 2022;
originally announced September 2022.