-
Provable Privacy Advantages of Decentralized Federated Learning via Distributed Optimization
Authors:
Wenrui Yu,
Qiongxiu Li,
Milan Lopuhaä-Zwakenberg,
Mads Græsbøll Christensen,
Richard Heusdens
Abstract:
Federated learning (FL) emerged as a paradigm designed to improve data privacy by enabling data to reside at its source, thus embedding privacy as a core consideration in FL architectures, whether centralized or decentralized. Contrasting with recent findings by Pasquini et al., which suggest that decentralized FL does not empirically offer any additional privacy or security benefits over centralized models, our study provides compelling evidence to the contrary. We demonstrate that decentralized FL, when deploying distributed optimization, provides enhanced privacy protection - both theoretically and empirically - compared to centralized approaches. The challenge of quantifying privacy loss through iterative processes has traditionally constrained the theoretical exploration of FL protocols. We overcome this by conducting a pioneering in-depth information-theoretical privacy analysis for both frameworks. Our analysis, considering both eavesdropping and passive adversary models, successfully establishes bounds on privacy leakage. We show information theoretically that the privacy loss in decentralized FL is upper bounded by the loss in centralized FL. Compared to the centralized case where local gradients of individual participants are directly revealed, a key distinction of optimization-based decentralized FL is that the relevant information includes differences of local gradients over successive iterations and the aggregated sum of different nodes' gradients over the network. This information complicates the adversary's attempt to infer private data. To bridge our theoretical insights with practical applications, we present detailed case studies involving logistic regression and deep neural networks. These examples demonstrate that while privacy leakage remains comparable in simpler models, complex models like deep neural networks exhibit lower privacy risks under decentralized FL.
Submitted 12 July, 2024;
originally announced July 2024.
-
The Potential Impact of Noise Correlation in Next-generation Gravitational Wave Detectors
Authors:
Isaac C. F. Wong,
Peter T. H. Pang,
Milan Wils,
Francesco Cireddu,
Walter Del Pozzo,
Tjonnie G. F. Li
Abstract:
Building upon the statistical formulation for parameter estimation in the presence of correlated noise proposed by Cireddu et al., we present an initial study to incorporate the effects of correlated noise into the analyses of various detector designs' performance. We consider a configuration of two L-shaped detectors located in the European Union, and compare the expectation of parameter estimation between the non-colocated configuration and a hypothetical colocated configuration. In our study, we posit the existence of low-frequency correlated noise within the $5\text{ Hz}$ to $10\text{ Hz}$ range for the colocated detector configuration, with a varying degree of correlation. In this specific detector setup, our observations indicate an enhancement in the precision of intrinsic parameter measurements as the degree of correlation increases. This trend suggests that higher degrees of noise correlation may beneficially influence the accuracy of parameter estimation. In particular, when the noise is highly correlated, the uncertainty on chirp mass decreases by up to $30\%$. The absence of an inter-European baseline does hinder the estimation of the extrinsic parameters. However, given a realistic global network with the additional detector located in the United States, the uncertainty of extrinsic parameters is significantly reduced. This reduction is further amplified as the degree of noise correlation increases. When noise correlation exceeds a certain level, the colocated configuration outperforms the non-colocated one, reducing the $90\%$ credible area of sky location by up to $10\%$. We conclude that noise correlation significantly impacts detector performance, potentially altering both quantitative and qualitative outcomes. Thus, we recommend including noise correlation in comprehensive assessments of third-generation gravitational wave detector designs.
Submitted 11 July, 2024;
originally announced July 2024.
-
Safe and Reliable Training of Learning-Based Aerospace Controllers
Authors:
Udayan Mandal,
Guy Amir,
Haoze Wu,
Ieva Daukantas,
Fletcher Lee Newell,
Umberto Ravaioli,
Baoluo Meng,
Michael Durling,
Kerianne Hobbs,
Milan Ganai,
Tobey Shim,
Guy Katz,
Clark Barrett
Abstract:
In recent years, deep reinforcement learning (DRL) approaches have generated highly successful controllers for a myriad of complex domains. However, the opaque nature of these models limits their applicability in aerospace systems and safety-critical domains, in which a single mistake can have dire consequences. In this paper, we present novel advancements in both the training and verification of DRL controllers, which can help ensure their safe behavior. We showcase a design-for-verification approach utilizing k-induction and demonstrate its use in verifying liveness properties. In addition, we give a brief overview of neural Lyapunov Barrier certificates and summarize their capabilities on a case study. Finally, we describe several other novel reachability-based approaches which, despite failing to provide guarantees of interest, could be effective for verification of other DRL systems, and could be of further interest to the community.
Submitted 9 July, 2024;
originally announced July 2024.
-
Entropy Computing: A Paradigm for Optimization in an Open Quantum System
Authors:
Lac Nguyen,
Mohammad-Ali Miri,
R. Joseph Rupert,
Wesley Dyk,
Sam Wu,
Nick Vrahoretis,
Irwin Huang,
Milan Begliarbekov,
Nicholas Chancellor,
Uchenna Chukwu,
Pranav Mahamuni,
Cesar Martinez-Delgado,
David Haycraft,
Carrie Spear,
Mark Campanelli,
Russell Huffman,
Yong Meng Sua,
Yuping Huang
Abstract:
Modern quantum technologies using matter are designed as closed quantum systems to isolate them from interactions with the environment. This design paradigm greatly constrains the scalability and limits practical implementation of such systems. Here, we introduce a novel computing paradigm, entropy computing, that works by conditioning a quantum reservoir thereby enabling the stabilization of a ground state. In this work, we experimentally demonstrate the feasibility of entropy computing by building a hybrid photonic-electronic computer that uses measurement-based feedback to solve non-convex optimization problems. The system functions by using temporal photonic modes to create qudits in order to encode probability amplitudes in the time-frequency degree of freedom of a photon. This scheme, when coupled with electronic interconnects, allows us to encode an arbitrary Hamiltonian into the system and solve non-convex continuous variables and combinatorial optimization problems. We show that the proposed entropy computing paradigm can act as a scalable and versatile platform for tackling a large range of NP-hard optimization problems.
Submitted 5 July, 2024;
originally announced July 2024.
-
Large-scale quantum reservoir learning with an analog quantum computer
Authors:
Milan Kornjača,
Hong-Ye Hu,
Chen Zhao,
Jonathan Wurtz,
Phillip Weinberg,
Majd Hamdan,
Andrii Zhdanov,
Sergio H. Cantu,
Hengyun Zhou,
Rodrigo Araiza Bravo,
Kevin Bagnall,
James I. Basham,
Joseph Campo,
Adam Choukri,
Robert DeAngelo,
Paige Frederick,
David Haines,
Julian Hammett,
Ning Hsu,
Ming-Guang Hu,
Florian Huber,
Paul Niklas Jepsen,
Ningyuan Jia,
Thomas Karolyshyn,
Minho Kwon, et al. (28 additional authors not shown)
Abstract:
Quantum machine learning has gained considerable attention as quantum technology advances, presenting a promising approach for efficiently learning complex data patterns. Despite this promise, most contemporary quantum methods require significant resources for variational parameter optimization and face issues with vanishing gradients, leading to experiments that are either limited in scale or lack potential for quantum advantage. To address this, we develop a general-purpose, gradient-free, and scalable quantum reservoir learning algorithm that harnesses the quantum dynamics of neutral-atom analog quantum computers to process data. We experimentally implement the algorithm, achieving competitive performance across various categories of machine learning tasks, including binary and multi-class classification, as well as timeseries prediction. Effective and improving learning is observed with increasing system sizes of up to 108 qubits, demonstrating the largest quantum machine learning experiment to date. We further observe comparative quantum kernel advantage in learning tasks by constructing synthetic datasets based on the geometric differences between generated quantum and classical data kernels. Our findings demonstrate the potential of utilizing classically intractable quantum correlations for effective machine learning. We expect these results to stimulate further extensions to different quantum hardware and machine learning paradigms, including early fault-tolerant hardware and generative machine learning tasks.
Submitted 2 July, 2024;
originally announced July 2024.
-
Standardized Data-Parallel Rendering Using ANARI
Authors:
Ingo Wald,
Stefan Zellmann,
Jefferson Amstutz,
Qi Wu,
Kevin Griffin,
Milan Jaros,
Stefan Wesner
Abstract:
We propose and discuss a paradigm that allows for expressing data-parallel rendering with the classically non-parallel ANARI API. We propose this as a new standard for data-parallel sci-vis rendering, describe two different implementations of this paradigm, and use multiple sample integrations into existing apps to show how easy it is to adopt this paradigm, and what can be gained from doing so.
Submitted 28 June, 2024;
originally announced July 2024.
-
Moiré lattice of twisted bilayer graphene as template for non-covalent functionalization
Authors:
Tobias Dierke,
Stefan Wolff,
Roland Gillen,
Jasmin Eisenkolb,
Tamara Nagel,
Sabine Maier,
Milan Kivala,
Frank Hauke,
Andreas Hirsch,
Janina Maultzsch
Abstract:
We present a novel approach to achieve spatial variations in the degree of non-covalent functionalization of twisted bilayer graphene (tBLG). The tBLG with twist angles varying between ~5° and 7° was non-covalently functionalized with 1,4,5,8,9,11-hexaazatriphenylenehexacarbonitrile (HATCN) molecules. Our results show a correlation between the degree of functionalization and the twist angle of tBLG. This correlation was determined through Raman spectroscopy, where areas with larger twist angles exhibited a lower HATCN peak intensity compared to areas with smaller twist angles. We suggest that the HATCN adsorption follows the moiré pattern of tBLG by avoiding AA-stacked areas and attaching predominantly to areas with a local AB-stacking order of tBLG, forming an overall ABA-stacking configuration. This is supported by density functional theory (DFT) calculations. Our work highlights the role of the moiré lattice in controlling the non-covalent functionalization of tBLG. Our approach can be generalized for designing nanoscale patterns on two-dimensional (2D) materials using moiré structures as a template.
Submitted 25 June, 2024;
originally announced June 2024.
-
Code-Optimise: Self-Generated Preference Data for Correctness and Efficiency
Authors:
Leonidas Gee,
Milan Gritta,
Gerasimos Lampouras,
Ignacio Iacobacci
Abstract:
Code Language Models have been trained to generate accurate solutions, typically with no regard for runtime. On the other hand, previous works that explored execution optimisation have observed corresponding drops in functional correctness. To that end, we introduce Code-Optimise, a framework that incorporates both correctness (passed, failed) and runtime (quick, slow) as learning signals via self-generated preference data. Our framework is both lightweight and robust as it dynamically selects solutions to reduce overfitting while avoiding a reliance on larger models for learning signals. Code-Optimise achieves significant improvements in pass@k while decreasing the competitive baseline runtimes by an additional 6% for in-domain data and up to 3% for out-of-domain data. As a byproduct, the average length of the generated solutions is reduced by up to 48% on MBPP and 23% on HumanEval, resulting in faster and cheaper inference. The generated data and codebase will be open-sourced at www.open-source.link.
Submitted 18 June, 2024;
originally announced June 2024.
-
Open-Source Web Service with Morphological Dictionary-Supplemented Deep Learning for Morphosyntactic Analysis of Czech
Authors:
Milan Straka,
Jana Straková
Abstract:
We present an open-source web service for Czech morphosyntactic analysis. The system combines a deep learning model with rescoring by a high-precision morphological dictionary at inference time. We show that our hybrid method surpasses two competitive baselines: the deep learning model ensures generalization to out-of-vocabulary words and better disambiguation, improving over the existing morphological analyser MorphoDiTa, while at the same time benefiting from inference-time guidance by a manually curated morphological dictionary. We achieve 50% error reduction in lemmatization and 58% error reduction in POS tagging over MorphoDiTa, while also offering dependency parsing. The model is trained on one of the currently largest Czech morphosyntactic corpora, the PDT-C 1.0, with the trained models available at https://hdl.handle.net/11234/1-5293. We provide the tool as a web service deployed at https://lindat.mff.cuni.cz/services/udpipe/. The source code is available at GitHub (https://github.com/ufal/udpipe/tree/udpipe-2), along with a Python client for simple use. The documentation for the models can be found at https://ufal.mff.cuni.cz/udpipe/2/models#czech_pdtc1.0_model.
Submitted 18 June, 2024;
originally announced June 2024.
-
Incentive Contracts and Peer Effects in the Workplace
Authors:
Marc Claveria-Mayol,
Pau Milán,
Nicolás Oviedo-Dávila
Abstract:
Risk-averse workers in a team exert effort to produce joint output. Workers' incentives are connected via chains of productivity spillovers, represented by a network of peer-effects. We study the problem of a principal offering wage contracts that simultaneously incentivize and insure agents. We solve for the optimal linear contract for any network and show that optimal incentives are loaded more heavily on workers that are more central in a specific way. We conveniently link firm profits to network structure via the network's spectral properties. When firms can't personalize contracts, better connected workers extract rents. In this case, a group composition result follows: large within-group differences in centrality can decrease the firm's profits. Finally, we find that modular production has important implications for how peer structures distribute incentives.
Submitted 17 June, 2024;
originally announced June 2024.
-
The Choquet-Deny Property for Groupoids
Authors:
Tey Berendschot,
Soham Chakraborty,
Milan Donvil,
Se-Jin Kim,
Mario Klisse
Abstract:
A countable discrete group is called Choquet-Deny if for any non-degenerate probability measure on the group, the corresponding space of bounded harmonic functions is trivial. Building on the previous work of Jaworski, a complete characterization of Choquet-Deny groups was recently achieved by Frisch, Hartman, Tamuz, and Ferdowski. In this article, we extend the study of the Choquet-Deny property to the framework of discrete measured groupoids. Our primary result offers a complete characterization of this property in terms of the isotropy groups and the equivalence relation associated with the given groupoid. Additionally, we use the implications derived from our main theorem to classify the Choquet-Deny property of transformation groupoids.
Submitted 25 June, 2024; v1 submitted 7 June, 2024;
originally announced June 2024.
-
CWRCzech: 100M Query-Document Czech Click Dataset and Its Application to Web Relevance Ranking
Authors:
Josef Vonášek,
Milan Straka,
Rostislav Krč,
Lenka Lasoňová,
Ekaterina Egorova,
Jana Straková,
Jakub Náplava
Abstract:
We present CWRCzech, Click Web Ranking dataset for Czech, a 100M query-document Czech click dataset for relevance ranking with user behavior data collected from search engine logs of Seznam.cz. To the best of our knowledge, CWRCzech is the largest click dataset with raw text published so far. It provides document positions in the search results as well as information about user behavior: 27.6M clicked documents and 10.8M dwell times. In addition, we also publish a manually annotated Czech test for the relevance task, containing nearly 50k query-document pairs, each annotated by at least 2 annotators. Finally, we analyze how the user behavior data improve relevance ranking and show that models trained on data automatically harnessed at sufficient scale can surpass the performance of models trained on human annotated data. CWRCzech is published under an academic non-commercial license and is available to the research community at https://github.com/seznam/CWRCzech.
Submitted 31 May, 2024;
originally announced May 2024.
-
Duality and degeneracy lifting in two-dimensional electron liquids on SrTiO$_3$(001)
Authors:
Igor Sokolović,
Eduardo B. Guedes,
Thomas P. van Waas,
Samuel Poncé,
Craig Polley,
Michael Schmid,
Ulrike Diebold,
Milan Radović,
Martin Setvín,
J. Hugo Dil
Abstract:
Two-dimensional electron liquids (2DELs) have increasing technological relevance for ultrafast electronics and spintronics, yet significant gaps in their fundamental understanding are exemplified on the prototypical SrTiO$_3$. We correlate the exact SrTiO$_3$(001) surface structure with distinct 2DELs through combined microscopic angle-resolved photoemission spectroscopy and non-contact atomic force microscopy on truly bulk-terminated surfaces that alleviate structural uncertainties inherent to this long-studied system. The SrO termination is shown to develop a 2DEL following the creation of oxygen vacancies, unlike the intrinsically metallic TiO$_2$ termination. Differences in degeneracy of the 2DELs, that share the same band filling and identical band bending, are assigned to polar distortions of the Ti atoms in combination with spin order, supported with the extraction of fundamental electron-phonon coupling strength. These results not only resolve the ambiguities regarding 2DELs on SrTiO$_3$ thus far, but also pave the way to manipulating band filling and spin order in oxide 2DELs in general.
Submitted 31 May, 2024; v1 submitted 29 May, 2024;
originally announced May 2024.
-
On attractor behavior in braneworld constant-roll inflation
Authors:
Goran S. Djordjevic,
Neven Bilić,
Dragoljub D. Dimitrijevic,
Milan Milosevic,
Marko Stojanovic
Abstract:
We investigate in detail the attractor behavior of some inflationary models based on braneworld dynamics under the constant-roll condition. We describe the dynamics of the models, assuming that the second slow-roll parameter remains constant during inflation. We show that the dynamics of the considered models have the property of a cosmological attractor.
Submitted 28 May, 2024;
originally announced May 2024.
-
Photonic Bose-Einstein condensation in the continuum limit
Authors:
Andris Erglis,
Milan Radonjić,
Stefan Yoshi Buhmann
Abstract:
We investigate the properties of the photon Bose-Einstein condensate in the limit of small mode spacing. Alongside the well-known threshold of the phase transition at large mode spacings, we find an emergence of a second threshold for sufficiently small mode spacings, defining the crossover to a fully condensed state. Furthermore, we present our findings for the mode occupations in the precondensate or supercooling region towards the continuum limit.
Submitted 27 May, 2024;
originally announced May 2024.
-
Probing Berry curvature in magnetic topological insulators through resonant infrared magnetic circular dichroism
Authors:
Seul-Ki Bac,
Florian le Mardelé,
Jiashu Wang,
Mykhaylo Ozerov,
Kota Yoshimura,
Ivan Mohelský,
Xingdan Sun,
Benjamin Piot,
Stefan Wimmer,
Andreas Ney,
Tatyana Orlova,
Maksym Zhukovskyi,
Günther Bauer,
Gunther Springholz,
Xinyu Liu,
Milan Orlita,
Kyungwha Park,
Yi-Ting Hsu,
Badih A. Assaf
Abstract:
Probing the quantum geometry and topology in condensed matter systems has relied heavily on static electronic transport experiments in magnetic fields. Yet, contact-free optical measurements have rarely been explored. Magnetic circular dichroism (MCD), the nonreciprocal absorption of circularly polarized light, was theoretically linked to the quantized anomalous Hall effect in magnetic insulators and can identify the bands and momenta responsible for the underlying Berry curvature (BC). Detecting BC through MCD faces two challenges: First, the relevant inter-band transitions usually generate MCD in the infrared (IR) range, requiring large samples with high quality. Second, while most magnetic materials are metallic, the relation between MCD and BC in metals remains unclear. Here, we report the observation of MCD in the IR range along with the anomalous Hall effect in thin film MnBi2Te4. Both phenomena emerge with a field-driven phase transition from an antiferromagnet to a canted ferromagnet. By theoretically relating the MCD to the anomalous Hall effect via BC in a metal, we show that this transition accompanies an abrupt onset of BC, signaling a topological phase transition from a topological insulator to a doped Chern insulator. Our density functional theory calculation suggests the MCD signal mainly originates from an optical transition at the Brillouin zone edge, hinting at a potential new source of BC away from the commonly considered Γ point. Our findings demonstrate a novel experimental approach for detecting BC and identifying the responsible bands and momenta, generally applicable to magnetic materials.
Submitted 24 May, 2024;
originally announced May 2024.
-
Exploring interacting bulk viscous model with decaying vacuum density
Authors:
Vinita Khatri,
C. P. Singh,
Milan Srivastava
Abstract:
In the present work, we study a cosmological model composed of viscous dark matter interacting with decaying vacuum energy in a spatially flat Universe. In the first part, we find the analytical solution of different cosmological parameters by assuming the physically viable forms of bulk viscosity and decaying vacuum density with the interaction term. The second part is dedicated to constraining the free parameters of the interacting viscous model with decaying vacuum energy by employing the latest observational data of $Pantheon+$, Cosmic Chronometer and $f(z)σ_{8}(z)$. We find that the interacting model deviates only slightly from the well-known concordance $Λ$CDM model and can effectively alleviate the current $H_0$ tension between local measurement by R21 and global measurement by Planck 2018, and the excess in the mass fluctuation amplitude $σ_{8}$ essentially vanishes in this context. We report the Hubble constants as $H_0=72.150^{+0.989}_{-0.779}$ and $72.202^{+0.796}_{-0.937}$ $\mathrm{km\,s^{-1}\,Mpc^{-1}}$, deceleration parameters as $q_0=-0.533 \pm 0.024$ and $-0.531 \pm 0.024$, and equation of state parameters as $w_0=-0.689 \pm 0.016$ and $-0.687 \pm 0.016$ for $Λ$CDM and interacting models, respectively. It is found that the interacting model is in good agreement with $Λ$CDM. Further, we discuss the amplitude of matter power spectrum $σ_8$ and its associated parameter $S_8$ using $f(z)σ_8(z)$ data. Finally, the information selection criterion and Bayesian inference are discussed to distinguish the interacting model from the $Λ$CDM model.
Submitted 24 May, 2024;
originally announced May 2024.
-
Robust pinned magnetisation in A2Ir2O7 iridates, the case of Er2Ir2O7 and Lu2Ir2O7 flux-grown single crystals
Authors:
Daniel Staško,
Filip Hájek,
Kristina Vlášková,
Jiří Kaštil,
Margarida Henriques,
Milan Klicpera
Abstract:
Reliable and profound studies of actual magnetic domain structure in rare-earth A2Ir2O7 pyrochlore iridates are frequently limited by insufficient sample quality or lack of single crystals. We report the magnetic properties of Lu2Ir2O7 and Er2Ir2O7 single crystals, synthesised for the first time. The paper focuses on the robust ferromagnetic component of magnetisation present in the material with the antiferromagnetically ordered state of the all-in-all-out (AIAO) type, which is discussed in the framework of AIAO and AOAI domains and interfaces on the geometrically frustrated lattice.
Submitted 23 May, 2024;
originally announced May 2024.
-
Formally Verifying Deep Reinforcement Learning Controllers with Lyapunov Barrier Certificates
Authors:
Udayan Mandal,
Guy Amir,
Haoze Wu,
Ieva Daukantas,
Fletcher Lee Newell,
Umberto J. Ravaioli,
Baoluo Meng,
Michael Durling,
Milan Ganai,
Tobey Shim,
Guy Katz,
Clark Barrett
Abstract:
Deep reinforcement learning (DRL) is a powerful machine learning paradigm for generating agents that control autonomous systems. However, the "black box" nature of DRL agents limits their deployment in real-world safety-critical applications. A promising approach for providing strong guarantees on an agent's behavior is to use Neural Lyapunov Barrier (NLB) certificates, which are learned functions over the system whose properties indirectly imply that an agent behaves as desired. However, NLB-based certificates are typically difficult to learn and even more difficult to verify, especially for complex systems. In this work, we present a novel method for training and verifying NLB-based certificates for discrete-time systems. Specifically, we introduce a technique for certificate composition, which simplifies the verification of highly-complex systems by strategically designing a sequence of certificates. When jointly verified with neural network verification engines, these certificates provide a formal guarantee that a DRL agent both achieves its goals and avoids unsafe behavior. Furthermore, we introduce a technique for certificate filtering, which significantly simplifies the process of producing formally verified certificates. We demonstrate the merits of our approach with a case study on providing safety and liveness guarantees for a DRL-controlled spacecraft.
Submitted 22 May, 2024;
originally announced May 2024.
-
Tools at the Frontiers of Quantitative Verification
Authors:
Roman Andriushchenko,
Alexander Bork,
Carlos E. Budde,
Milan Češka,
Kush Grover,
Ernst Moritz Hahn,
Arnd Hartmanns,
Bryant Israelsen,
Nils Jansen,
Joshua Jeppson,
Sebastian Junges,
Maximilian A. Köhl,
Bettina Könighofer,
Jan Křetínský,
Tobias Meggendorfer,
David Parker,
Stefan Pranger,
Tim Quatmann,
Enno Ruijters,
Landon Taylor,
Matthias Volk,
Maximilian Weininger,
Zhen Zhang
Abstract:
The analysis of formal models that include quantitative aspects such as timing or probabilistic choices is performed by quantitative verification tools. Broad and mature tool support is available for computing basic properties such as expected rewards on basic models such as Markov chains. Previous editions of QComp, the comparison of tools for the analysis of quantitative formal models, focused on this setting. Many application scenarios, however, require more advanced property types such as LTL and parameter synthesis queries as well as advanced models like stochastic games and partially observable MDPs. For these, tool support is in its infancy today. This paper presents the outcomes of QComp 2023: a survey of the state of the art in quantitative verification tool support for advanced property types and models. With tools ranging from first research prototypes to well-supported integrations into established toolsets, this report highlights today's active areas and tomorrow's challenges in tool-focused research for quantitative verification.
Submitted 22 May, 2024;
originally announced May 2024.
-
Mitigating Text Toxicity with Counterfactual Generation
Authors:
Milan Bhan,
Jean-Noel Vittaut,
Nina Achache,
Victor Legrand,
Nicolas Chesneau,
Annabelle Blangero,
Juliette Murris,
Marie-Jeanne Lesot
Abstract:
Toxicity mitigation consists in rephrasing text in order to remove offensive or harmful meaning. Neural natural language processing (NLP) models have been widely used to target and mitigate textual toxicity. However, existing methods fail to detoxify text while preserving the initial non-toxic meaning at the same time. In this work, we propose to apply counterfactual generation methods from the eXplainable AI (XAI) field to target and mitigate textual toxicity. In particular, we perform text detoxification by applying local feature importance and counterfactual generation methods to a toxicity classifier distinguishing between toxic and non-toxic texts. We carry out text detoxification through counterfactual generation on three datasets and compare our approach to three competitors. Automatic and human evaluations show that recently developed NLP counterfactual generators can mitigate toxicity accurately while better preserving the meaning of the initial text as compared to classical detoxification methods. Finally, we take a step back from using automated detoxification tools, and discuss how to manage the polysemous nature of toxicity and the risk of malicious use of detoxification tools. This work is the first to bridge the gap between counterfactual generation and text detoxification and paves the way towards more practical application of XAI methods.
Submitted 16 May, 2024;
originally announced May 2024.
-
HumanRankEval: Automatic Evaluation of LMs as Conversational Assistants
Authors:
Milan Gritta,
Gerasimos Lampouras,
Ignacio Iacobacci
Abstract:
Language models (LMs) as conversational assistants recently became popular tools that help people accomplish a variety of tasks. These typically result from adapting LMs pretrained on general domain text sequences through further instruction-tuning and possibly preference optimisation methods. The evaluation of such LMs would ideally be performed using human judgement; however, this is not scalable. On the other hand, automatic evaluation featuring auxiliary LMs as judges and/or knowledge-based tasks is scalable but struggles with assessing conversational ability and adherence to instructions. To help accelerate the development of LMs as conversational assistants, we propose a novel automatic evaluation task: HumanRankEval (HRE). It consists of a large-scale, diverse and high-quality set of questions, each with several answers authored and scored by humans. To perform evaluation, HRE ranks these answers based on their log-likelihood under the LM's distribution, and subsequently calculates their correlation with the corresponding human rankings. We support HRE's efficacy by investigating how efficiently it separates pretrained and instruction-tuned LMs of various sizes. We show that HRE correlates well with human judgements and is particularly responsive to model changes following instruction-tuning.
Submitted 15 May, 2024;
originally announced May 2024.
-
Autodetachment of diatomic carbon anions from long-lived high-rotation quartet states
Authors:
Viviane C. Schmidt,
Roman Čurík,
Milan Ončák,
Klaus Blaum,
Sebastian George,
Jürgen Göck,
Manfred Grieser,
Florian Grussie,
Robert von Hahn,
Claude Krantz,
Holger Kreckel,
Oldřich Novotný,
Kaija Spruck,
Andreas Wolf
Abstract:
Highly excited C$_2{}^{-}$ ions prominently feature electron detachment at a mean decay time near 3 milliseconds with hitherto unexplained origin. Considering various sources of unimolecular decay, we attribute the signal to the electronic C$^4Σ^+_u$ state. Quartet C$_2{}^{-}$ levels are found to be stabilized against autodetachment by high rotation. Time constants of their rotationally assisted autodetachment into levels opening energetically at lower rotation are calculated by a theory based on the non-local resonance model. For some final levels of significantly less rotation the results conclusively explain the puzzling observations.
Submitted 10 May, 2024;
originally announced May 2024.
-
Unimolecular processes in diatomic carbon anions at high rotational excitation
Authors:
Viviane C. Schmidt,
Roman Čurík,
Milan Ončák,
Klaus Blaum,
Sebastian George,
Jürgen Göck,
Manfred Grieser,
Florian Grussie,
Robert von Hahn,
Claude Krantz,
Holger Kreckel,
Oldřich Novotný,
Kaija Spruck,
Andreas Wolf
Abstract:
On the millisecond to second time scale, stored beams of diatomic carbon anions C$_2{}^-$ from a sputter ion source feature unimolecular decay of yet unexplained origin by electron emission and fragmentation. To account for the magnitude and time dependence of the experimental rates, levels with high rotational and vibrational excitation are modeled for the lowest electronic states of C$_2{}^-$, also including the lowest quartet potential. Energies, spontaneous radiative decay rates (including spin-forbidden quartet-level decay), and tunneling dissociation rates are determined for a large number of highly excited C$_2{}^-$ levels and their population in sputter-type ion sources is considered. For the quartet levels, the stability against autodetachment is addressed and recently calculated rates of rotationally assisted autodetachment are applied. Non-adiabatic vibrational autodetachment rates of high vibrational levels in the doublet C$_2{}^-$ ground potential are also calculated. The results are combined to model the experimental unimolecular decay signals. Comparison of the modeled to the experimental rates measured at the Cryogenic Storage Ring (CSR) gives strong evidence that C$_2{}^-$ ions in quasi-stable levels of the quartet electronic states are the so far unidentified source of unimolecular decay.
Submitted 10 May, 2024;
originally announced May 2024.
-
On Probabilistic and Causal Reasoning with Summation Operators
Authors:
Duligur Ibeling,
Thomas F. Icard,
Milan Mossé
Abstract:
Ibeling et al. (2023) axiomatize increasingly expressive languages of causation and probability, and Mosse et al. (2024) show that reasoning (specifically the satisfiability problem) in each causal language is as difficult, from a computational complexity perspective, as reasoning in its merely probabilistic or "correlational" counterpart. Introducing a summation operator to capture common devices that appear in applications -- such as the $do$-calculus of Pearl (2009) for causal inference, which makes ample use of marginalization -- van der Zander et al. (2023) partially extend these earlier complexity results to causal and probabilistic languages with marginalization. We complete this extension, fully characterizing the complexity of probabilistic and causal reasoning with summation, demonstrating that these again remain equally difficult. Surprisingly, allowing free variables for random variable values results in a system that is undecidable, so long as the ranges of these random variables are unrestricted. We finally axiomatize these languages featuring marginalization (or more generally summation), resolving open questions posed by Ibeling et al. (2023).
Submitted 18 May, 2024; v1 submitted 5 May, 2024;
originally announced May 2024.
-
PhilHumans: Benchmarking Machine Learning for Personal Health
Authors:
Vadim Liventsev,
Vivek Kumar,
Allmin Pradhap Singh Susaiyah,
Zixiu Wu,
Ivan Rodin,
Asfand Yaar,
Simone Balloccu,
Marharyta Beraziuk,
Sebastiano Battiato,
Giovanni Maria Farinella,
Aki Härmä,
Rim Helaoui,
Milan Petkovic,
Diego Reforgiato Recupero,
Ehud Reiter,
Daniele Riboni,
Raymond Sterling
Abstract:
The use of machine learning in Healthcare has the potential to improve patient outcomes as well as broaden the reach and affordability of Healthcare. The history of other application areas indicates that strong benchmarks are essential for the development of intelligent systems. We present Personal Health Interfaces Leveraging HUman-MAchine Natural interactions (PhilHumans), a holistic suite of benchmarks for machine learning across different Healthcare settings - talk therapy, diet coaching, emergency care, intensive care, obstetric sonography - as well as different learning settings, such as action anticipation, timeseries modeling, insight mining, language modeling, computer vision, reinforcement learning and program synthesis.
Submitted 16 May, 2024; v1 submitted 4 May, 2024;
originally announced May 2024.
-
An Adaptive Approach for Infinitely Many-armed Bandits under Generalized Rotting Constraints
Authors:
Jung-hun Kim,
Milan Vojnovic,
Se-Young Yun
Abstract:
In this study, we consider the infinitely many-armed bandit problems in a rested rotting setting, where the mean reward of an arm may decrease with each pull, while otherwise, it remains unchanged. We explore two scenarios regarding the rotting of rewards: one in which the cumulative amount of rotting is bounded by $V_T$, referred to as the slow-rotting case, and the other in which the cumulative number of rotting instances is bounded by $S_T$, referred to as the abrupt-rotting case. To address the challenge posed by rotting rewards, we introduce an algorithm that utilizes UCB with an adaptive sliding window, designed to manage the bias and variance trade-off arising due to rotting rewards. Our proposed algorithm achieves tight regret bounds for both slow and abrupt rotting scenarios. Lastly, we demonstrate the performance of our algorithm using numerical experiments.
Submitted 24 May, 2024; v1 submitted 22 April, 2024;
originally announced April 2024.
-
A multi-robot system for the detection of explosive devices
Authors:
Ken Hasselmann,
Mario Malizia,
Rafael Caballero,
Fabio Polisano,
Shashank Govindaraj,
Jakob Stigler,
Oleksii Ilchenko,
Milan Bajic,
Geert De Cubber
Abstract:
In order to clear the world of the threat posed by landmines and other explosive devices, robotic systems can play an important role. However, the development of such field robots that need to operate in hazardous conditions requires the careful consideration of multiple aspects related to the perception, mobility, and collaboration capabilities of the system. In the framework of a European challenge, the Artificial Intelligence for Detection of Explosive Devices - eXtended (AIDEDeX) project proposes to design a heterogeneous multi-robot system with advanced sensor fusion algorithms. This system is specifically designed to detect and classify improvised explosive devices, explosive ordnances, and landmines. This project integrates specialised sensors, including electromagnetic induction, ground penetrating radar, X-Ray backscatter imaging, Raman spectrometers, and multimodal cameras, to achieve comprehensive threat identification and localisation. The proposed system comprises a fleet of unmanned ground vehicles and unmanned aerial vehicles. This article details the operational phases of the AIDEDeX system, from rapid terrain exploration using unmanned aerial vehicles to specialised detection and classification by unmanned ground vehicles equipped with a robotic manipulator. Initially focusing on a centralised approach, the project will also explore the potential of a decentralised control architecture, taking inspiration from swarm robotics to provide a robust, adaptable, and scalable solution for explosive detection.
Submitted 22 April, 2024;
originally announced April 2024.
-
Long-lived oscillations of false and true vacuum states in neutral atom systems
Authors:
Siva Darbha,
Milan Kornjača,
Fangli Liu,
Jan Balewski,
Mark R. Hirsbrunner,
Pedro Lopes,
Sheng-Tao Wang,
Roel Van Beeumen,
Katherine Klymko,
Daan Camps
Abstract:
Metastable false vacuum states arise in a range of quantum systems and can be observed in various dynamical scenarios, including decay, bubble nucleation, and long-lived oscillations. False vacuum phenomenology has been examined in quantum many-body systems, notably in 1D ferromagnetic Ising spin systems and superfluids. In this paper, we study long-lived oscillations of false and true vacuum states in 1D antiferromagnetic neutral atom chains with long-range Rydberg interactions. We use a staggered local detuning field to achieve confinement. Using theoretical and numerical models, we identify novel spectral signatures of quasiparticle oscillations distinct to antiferromagnetic neutral atom systems and interpret them using a classical energy model of deconfinement from Rydberg tails. Finally, we evaluate the experimental accessibility of our proposed setup on current neutral-atom platforms and discuss experimental feasibility and constraints.
Submitted 18 April, 2024;
originally announced April 2024.
-
False vacuum decay and nucleation dynamics in neutral atom systems
Authors:
Siva Darbha,
Milan Kornjača,
Fangli Liu,
Jan Balewski,
Mark R. Hirsbrunner,
Pedro Lopes,
Sheng-Tao Wang,
Roel Van Beeumen,
Daan Camps,
Katherine Klymko
Abstract:
False vacuum decay and nucleation offer the opportunity to study non-equilibrium dynamical phenomena in quantum many-body systems with confinement. Recent work has examined false vacuum decay in 1D ferromagnetic Ising spins and superfluids. In this paper, we study false vacuum nucleation dynamics in 1D antiferromagnetic neutral atom chains with Rydberg interactions, using both numerical simulations and analytic modeling. We apply a staggered local detuning field to generate the false and true vacuum states. Our efforts focus on two dynamical regimes: decay and annealing. In the first, we corroborate the phenomenological decay rate scaling and determine the associated parameter range for the decay process; in the second, we uncover and elucidate a procedure to anneal the false vacuum from the initial to the final system, with intermediate nucleation events. We further propose experimental protocols to prepare the required states and perform quenches on near-term neutral atom quantum simulators, examining the experimental feasibility of our proposed setup and parameter regime.
Submitted 18 April, 2024;
originally announced April 2024.
-
Social Choice Should Guide AI Alignment in Dealing with Diverse Human Feedback
Authors:
Vincent Conitzer,
Rachel Freedman,
Jobst Heitzig,
Wesley H. Holliday,
Bob M. Jacobs,
Nathan Lambert,
Milan Mossé,
Eric Pacuit,
Stuart Russell,
Hailey Schoelkopf,
Emanuel Tewolde,
William S. Zwicker
Abstract:
Foundation models such as GPT-4 are fine-tuned to avoid unsafe or otherwise problematic behavior, such as helping to commit crimes or producing racist text. One approach to fine-tuning, called reinforcement learning from human feedback, learns from humans' expressed preferences over multiple outputs. Another approach is constitutional AI, in which the input from humans is a list of high-level principles. But how do we deal with potentially diverging input from humans? How can we aggregate the input into consistent data about "collective" preferences or otherwise use it to make collective choices about model behavior? In this paper, we argue that the field of social choice is well positioned to address these questions, and we discuss ways forward for this agenda, drawing on discussions in a recent workshop on Social Choice for AI Ethics and Safety held in Berkeley, CA, USA in December 2023.
Submitted 4 June, 2024; v1 submitted 15 April, 2024;
originally announced April 2024.
-
$ε$-isomorphisms for rank one $(\varphi,Γ)$-modules over Lubin-Tate Robba rings
Authors:
Milan Malcic,
Rustam Steingart,
Otmar Venjakob,
Max Witzelsperger
Abstract:
Inspired by Nakamura's work (arXiv:1305.0880) on $ε$-isomorphisms for $(\varphi,Γ)$-modules over (relative) Robba rings with respect to the cyclotomic theory, we formulate an analogous conjecture for $L$-analytic Lubin-Tate $(\varphi_L,Γ_L)$-modules over (relative) Robba rings for any finite extension $L$ of $\mathbb{Q}_p.$ In contrast to Kato's and Nakamura's setting, our conjecture involves $L$-analytic cohomology instead of continuous cohomology within the generalized Herr complex. Similarly, we restrict to the identity components of $D_{cris}$ and $D_{dR},$ respectively. For rank one modules of the above type or slightly more generally for trianguline ones, we construct $ε$-isomorphisms for their Lubin-Tate deformations satisfying the desired interpolation property.
Submitted 15 April, 2024;
originally announced April 2024.
-
Integrability of the sub-Riemannian geodesic flow of the left-invariant metric on the Heisenberg group
Authors:
Milan Pavlovic,
Tijana Sukilovic
Abstract:
In this paper, we study two different classes of normal geodesic flows corresponding to the left-invariant sub-Riemannian metric on the $(2n+1)$-dimensional Heisenberg group. The first class corresponds to the left-invariant distribution, while the second corresponds to the right-invariant one. We prove that corresponding Hamiltonian L-L and L-R systems are completely integrable.
Submitted 9 April, 2024;
originally announced April 2024.
-
An Overview of Absolute Value Equations: From Theory to Solution Methods and Challenges
Authors:
Milan Hladík,
Hossein Moosaei,
Fakhrodin Hashemi,
Saeed Ketabchi,
Panos M. Pardalos
Abstract:
This paper provides a thorough exploration of the absolute value equations $Ax-|x|=b$, a seemingly straightforward concept that has gained heightened attention in recent years. It is an NP-hard and nondifferentiable problem and equivalent to the standard linear complementarity problem. Offering a comprehensive review of existing literature, the study delves into theorems concerning the existence and nonexistence of solutions to the absolute value equations, along with numerical methods for effectively addressing this complex equation. Going beyond conventional approaches, the paper investigates strategies for obtaining solutions with minimal norms, techniques for correcting infeasible systems, and other pertinent topics. By pinpointing challenging issues and emphasizing open problems, this paper serves as a valuable guide for shaping the future research trajectory in this dynamic and multifaceted field.
Submitted 9 April, 2024;
originally announced April 2024.
-
Generalized Positive Energy Representations of the Group of Compactly Supported Diffeomorphisms
Authors:
Bas Janssens,
Milan Niestijl
Abstract:
Motivated by asymptotic symmetry groups in general relativity, we consider projective unitary representations $(\overlineρ, \mathcal{H})$ of the Lie group $\mathrm{Diff}_c(M)$ of compactly supported diffeomorphisms of a smooth manifold $M$ that satisfy a so-called generalized positive energy condition. In particular, this captures representations that are in a suitable sense compatible with a KMS state on the von Neumann algebra generated by $\overlineρ$. We show that if $M$ is connected and $\dim(M) > 1$, then any such representation is necessarily trivial on the identity component $\mathrm{Diff}_c(M)_0$. As an intermediate step towards this result, we determine the continuous second Lie algebra cohomology $H^2_{\mathrm{ct}}(\mathcal{X}_c(M), \mathbb{R})$ of the Lie algebra of compactly supported vector fields (which is subtly different from Gelfand--Fuks cohomology).
Submitted 9 April, 2024;
originally announced April 2024.
-
ÚFAL LatinPipe at EvaLatin 2024: Morphosyntactic Analysis of Latin
Authors:
Milan Straka,
Jana Straková,
Federica Gamba
Abstract:
We present LatinPipe, the winning submission to the EvaLatin 2024 Dependency Parsing shared task. Our system consists of a fine-tuned concatenation of base and large pre-trained LMs, with a dot-product attention head for parsing and softmax classification heads for morphology to jointly learn both dependency parsing and morphological analysis. It is trained by sampling from seven publicly available Latin corpora, utilizing additional harmonization of annotations to achieve a more unified annotation style. Before fine-tuning, we train the system for a few initial epochs with frozen weights. We also add additional local relative contextualization by stacking the BiLSTM layers on top of the Transformer(s). Finally, we ensemble output probability distributions from seven randomly instantiated networks for the final submission. The code is available at https://github.com/ufal/evalatin2024-latinpipe.
Submitted 29 May, 2024; v1 submitted 8 April, 2024;
originally announced April 2024.
-
Engineering quantum states with neutral atoms
Authors:
Jan Balewski,
Milan Kornjaca,
Katherine Klymko,
Siva Darbha,
Mark R. Hirsbrunner,
Pedro Lopes,
Fangli Liu,
Daan Camps
Abstract:
Aquila, an analog quantum simulation platform developed by QuEra Computing, supports control of the position and coherent evolution of up to 256 neutral atoms. This study details novel experimental protocols designed for analog quantum simulators that generate Bell state entanglement far away from the blockade regime, construct a Z2 state with a defect induced by an ancilla, and optimize the driving fields schedule to prepare excited states with enhanced fidelity.
We additionally evaluate the effectiveness of readout error mitigation techniques in improving the fidelity of measurement results. All experiments were executed on Aquila from QuEra and facilitated by the AWS Braket interface.
Our experimental results closely align with theoretical predictions and numerical simulations. The insights gained from this study showcase Aquila's capabilities in handling complex quantum simulations and computations, and also pave the way for new avenues of research in quantum information processing and physics that employ programmable analog hardware platforms.
Submitted 5 April, 2024;
originally announced April 2024.
-
Photochemistry upon charge separation in triphenylamine derivatives from fs to $\mathrm{\mu}$s
Authors:
Hendrik J. Brockmann,
Letao Huang,
Felix Hainer,
Danyellen Galindo,
Angelina Jocic,
Milan Kivala,
Andreas Dreuw,
Tiago Buckup
Abstract:
Quantum chemical methods and time-resolved laser spectroscopy are employed to elucidate ultrafast charge separation processes in triphenylamine (TPA) derivatives upon photoexcitation. When changing the ambient solvent from generic ones to those capable of accepting electrons, such as chloroform, a vastly extended and multifaceted photochemistry is observed. Following the initial excitation, two concurrent charge transfer processes are identified. Firstly, when the TPA derivative and solvent molecules are correctly positioned, an electron transfer to the solvent molecule with immediate charge separation takes place. Consequently, this process gives rise to the formation of the corresponding radical cation of the TPA derivative. This highly reactive species can subsequently combine with other TPA derivative molecules to yield dimeric species. Secondly, when the molecular positioning upon photoexcitation is not optimal, relaxation back to the $\mathrm{S_1}$ state occurs. From this state, an electron transfer process leads to the formation of a charge transfer complex. In this complex, the negatively charged solvent molecule remains closely associated with the positively charged TPA derivative. Within 30 picoseconds, the charges within this complex recombine, yielding a triplet state. This transition to the triplet state is driven by a lower reaction barrier for charge separation compared to the formation of the singlet state.
Submitted 4 April, 2024;
originally announced April 2024.
-
From News to Summaries: Building a Hungarian Corpus for Extractive and Abstractive Summarization
Authors:
Botond Barta,
Dorina Lakatos,
Attila Nagy,
Milán Konor Nyist,
Judit Ács
Abstract:
Training summarization models requires substantial amounts of training data. However, for lower-resource languages like Hungarian, openly available models and datasets are notably scarce. To address this gap, our paper introduces HunSum-2, an open-source Hungarian corpus suitable for training abstractive and extractive summarization models. The dataset is assembled from segments of the Common Crawl corpus that undergo thorough cleaning, preprocessing and deduplication. In addition to abstractive summarization, we generate sentence-level labels for extractive summarization using sentence similarity. We train baseline models for both extractive and abstractive summarization using the collected dataset. To demonstrate the effectiveness of the trained models, we perform both quantitative and qualitative evaluation. Our dataset, models and code are publicly available, encouraging replication, further research, and real-world applications across various domains.
Submitted 12 April, 2024; v1 submitted 4 April, 2024;
originally announced April 2024.
-
A Survey on Error-Bounded Lossy Compression for Scientific Datasets
Authors:
Sheng Di,
Jinyang Liu,
Kai Zhao,
Xin Liang,
Robert Underwood,
Zhaorui Zhang,
Milan Shah,
Yafan Huang,
Jiajun Huang,
Xiaodong Yu,
Congrong Ren,
Hanqi Guo,
Grant Wilkins,
Dingwen Tao,
Jiannan Tian,
Sian Jin,
Zizhe Jian,
Daoce Wang,
MD Hasanur Rahman,
Boyuan Zhang,
Jon C. Calhoun,
Guanpeng Li,
Kazutomo Yoshii,
Khalid Ayed Alharthi,
Franck Cappello
Abstract:
Error-bounded lossy compression has been effective in significantly reducing the data storage/transfer burden while preserving the reconstructed data fidelity very well. Many error-bounded lossy compressors have been developed over the years for a wide range of parallel and distributed use cases. These lossy compressors are designed with distinct compression models and design principles, such that each of them features particular pros and cons. In this paper we provide a comprehensive survey of emerging error-bounded lossy compression techniques for different use cases, each involving big data to process. The key contribution is fourfold. (1) We summarize an insightful taxonomy of lossy compression into 6 classic compression models. (2) We provide a comprehensive survey of 10+ commonly used compression components/modules used in error-bounded lossy compressors. (3) We provide a comprehensive survey of 10+ state-of-the-art error-bounded lossy compressors as well as how they combine the various compression modules in their designs. (4) We provide a comprehensive survey of lossy compression for 10+ modern scientific applications and use cases. We believe this survey is useful to multiple communities including scientific applications, high-performance computing, lossy compression, and big data.
Submitted 3 April, 2024;
originally announced April 2024.
-
First determination of the angular dependence of rise and decay times of solar radio bursts using multi-spacecraft observations
Authors:
Nicolina Chrysaphi,
Milan Maksimovic,
Eduard P. Kontar,
Antonio Vecchio,
Xingyao Chen,
Aikaterini Pesini
Abstract:
Radio photons interact with anisotropic density fluctuations in the heliosphere, which can alter their trajectory and influence properties deduced from observations. This is particularly evident in solar radio observations, where anisotropic scattering leads to highly-directional radio emissions. Consequently, observers at varying locations will measure different properties, including different source sizes, source positions, and intensities. However, it is not known if measurements of the decay time of solar radio bursts are also affected by the observer's position. Decay times are dominated by scattering effects, and so are frequently used as proxies of the level of density fluctuations in the heliosphere, making the identification of any location-related dependence crucial. We combine multi-vantage observations of interplanetary Type III bursts from four non-collinear, angularly-separated spacecraft with simulations, to investigate the dependence of both the decay- and rise-time measurements on the separation of the observer from the source. We propose a function to characterise the entire time profile of radio signals, allowing for the simultaneous estimation of the peak flux, decay time, and rise time, while demonstrating that the rise phase of radio bursts has a non-constant, non-exponential growth rate. We determine that the decay and rise times are independent of the observer's position, identifying them as the only properties to remain unaffected, thus not requiring corrections for the observer's location. Moreover, we examine the ratio between the rise and decay times, finding that it does not depend on the frequency. Therefore, we provide the first evidence that the rise phase is also significantly impacted by scattering effects, adding to our understanding of the plasma emission process.
Submitted 1 April, 2024;
originally announced April 2024.
-
Adversary-Robust Graph-Based Learning of WSIs
Authors:
Saba Heidari Gheshlaghi,
Milan Aryal,
Nasim Yahyasoltani,
Masoud Ganji
Abstract:
Enhancing the robustness of deep learning models against adversarial attacks is crucial, especially in critical domains like healthcare where significant financial interests heighten the risk of such attacks. Whole slide images (WSIs) are high-resolution, digitized versions of tissue samples mounted on glass slides, scanned using sophisticated imaging equipment. The digital analysis of WSIs presents unique challenges due to their gigapixel size and multi-resolution storage format. In this work, we aim to improve the robustness of cancer Gleason grading classification systems against adversarial attacks, addressing challenges at both the image and graph levels. We develop a novel graph-based model which uses a GNN to extract features from the graph representation of WSIs. A denoising module, along with a pooling layer, is incorporated to manage the impact of adversarial attacks on the WSIs. The process concludes with a transformer module that classifies various grades of prostate cancer based on the processed data. To assess the effectiveness of the proposed method, we conducted a comparative analysis using two scenarios. Initially, we trained and tested the model without the denoiser using WSIs that had not been exposed to any attack. We then introduced a range of attacks at either the image or graph level and processed them through the proposed network. The performance of the model was evaluated in terms of accuracy and kappa scores. The results from this comparison showed a significant improvement in cancer diagnosis accuracy, highlighting the robustness and efficiency of the proposed method in handling adversarial challenges in the context of medical imaging.
Submitted 21 March, 2024;
originally announced March 2024.
-
Practical End-to-End Optical Music Recognition for Pianoform Music
Authors:
Jiří Mayer,
Milan Straka,
Jan Hajič jr.,
Pavel Pecina
Abstract:
The majority of recent progress in Optical Music Recognition (OMR) has been achieved with Deep Learning methods, especially models following the end-to-end paradigm, reading input images and producing a linear sequence of tokens. Unfortunately, many music scores, especially piano music, cannot be easily converted to a linear sequence. This has led OMR researchers to use custom linearized encodings, instead of broadly accepted structured formats for music notation. The diversity of these encodings makes it difficult to compare the performance of OMR systems directly. To bring recent OMR model progress closer to useful results: (a) We define a sequential format called Linearized MusicXML, which allows an end-to-end model to be trained directly while maintaining close cohesion and compatibility with the industry-standard MusicXML format. (b) We create a dev and test set for benchmarking typeset OMR with MusicXML ground truth based on the OpenScore Lieder corpus. They contain 1,438 and 1,493 pianoform systems, each with an image from IMSLP. (c) We train and fine-tune an end-to-end model to serve as a baseline on the dataset and employ the TEDn metric to evaluate the model. We also test our model against the recently published synthetic pianoform dataset GrandStaff and surpass the state-of-the-art results.
Submitted 20 March, 2024;
originally announced March 2024.
-
Elastic properties and thermodynamic anomalies of supersolids
Authors:
Milan Rakic,
Andrew F. Ho,
Derek K. K. Lee
Abstract:
We study a supersolid in the context of a Gross-Pitaevskii theory with a non-local effective potential. We employ a homogenisation technique which allows us to calculate the elastic moduli, supersolid fraction and other state variables of the system. Our methodology is verified against numerical simulations of elastic deformations. We can also verify that the long-wavelength Goldstone modes that emerge from this technique agree with Bogoliubov theory. We find a thermodynamic anomaly: the supersolid does not obey the thermodynamic relation $\partial P / \partial V \bigr|_N = - n \, \partial P / \partial N \bigr|_V$, which we claim is a feature unique to supersolids.
Submitted 20 March, 2024;
originally announced March 2024.
-
Observation of spin-electric transitions in a molecular exchange qubit
Authors:
Florian le Mardelé,
Ivan Mohelský,
Jan Wyzula,
Milan Orlita,
Philippe Turek,
Filippo Troiani,
Athanassios K. Boudalis
Abstract:
Electric fields represent an ideal means for controlling spins at the nanoscale and, more specifically, for manipulating protected degrees of freedom in multispin systems. Here we perform low-temperature magnetic far-IR spectroscopy on a molecular spin triangle (Fe3) and provide the first experimental evidence of spin-electric transitions in polynuclear complexes. The co-presence of electric- and magnetic-dipole transitions allows us to estimate the spin-electric coupling. Based on spin Hamiltonian simulations of the spectra, we identify the observed transitions and introduce the concept of a generalized exchange qubit. This applies to a wide class of molecular spin triangles, and includes the scalar chirality and the partial spin sum qubits as special cases.
Submitted 17 March, 2024;
originally announced March 2024.
-
Determination of Energetic Positions of Electronic States and the Exciton Dynamics in a pi-Expanded N-Heterotriangulene Derivative Adsorbed on Au(111)
Authors:
Jakob Steidel,
Ina Michalsky,
Mohsen Ajdari,
Milan Kivala,
Petra Tegeder
Abstract:
Bridged triarylamines, so-called N-heterotriangulenes (N-HTAs), are promising organic semiconductors for applications in optoelectronic devices. The electronic structure at organic/metal interfaces and within thin films, as well as the dynamics of electronically excited states after optical excitation, are essential for the performance of organic-molecule-based devices. Here, we investigated the energy level alignment and the excited state dynamics of an N-HTA derivative adsorbed on Au(111) by means of energy- and time-resolved two-photon photoemission spectroscopy. We quantitatively determined the energetic positions of several occupied and unoccupied molecular (transport levels) and excitonic states (optical gap) in detail. A transport gap of 3.20 eV and an optical gap of 2.58 eV are determined, resulting in an exciton binding energy of 0.62 eV. With the first time-resolved investigation on an N-HTA compound we gained insights into the exciton dynamics and resolved processes on the femtosecond to picosecond timescale.
Submitted 14 March, 2024;
originally announced March 2024.
-
Electronic Properties of Interfaces between N-Heterotriangulene Donors and Strong Tetracyanoquinodimethane Acceptors
Authors:
Mohsen Ajdari,
Ronja Pappenberger,
Christian Walla,
Ina Michalsky,
Friedrich Maass,
Milan Kivala,
Andreas Dreuw,
Petra Tegeder
Abstract:
N-heterotriangulenes (N-HTAs) represent a class of functional molecules with high potential for optoelectronic materials, for example as electron donating compounds in donor/acceptor (D/A) systems. The capability of two different N-HTAs, N-HTA 550 and N-HTA 557, the latter containing an additional 7-membered ring, to act as electron donors at interfaces with strong tetracyanoquinodimethane (TCNQ and F4TCNQ) acceptors is studied using high-resolution electron energy loss spectroscopy in combination with state-of-the-art quantum chemical calculations. For TCNQ/N-HTA bilayer systems adsorbed on Au(111), low-energy (< 2.5 eV) electronic transitions, which are attributed to charge transfer (CT) states, are identified for all four D/A combinations. Based on substantial quantum chemical calculations, the generation of ground state CT complexes is excluded. Instead, CT in the excited state, in which an electron-stimulated CT from the N-HTAs to TCNQs is the underlying process, is proposed. The energies of the CT states are determined by the values of the ionization potential and electron affinity of the involved donor and acceptor.
Submitted 14 March, 2024;
originally announced March 2024.
-
Observation of quantum oscillations near the Mott-Ioffe-Regel limit in CaAs3
Authors:
Yuxiang Wang,
Minhao Zhao,
Jinglei Zhang,
Wenbin Wu,
Shichao Li,
Yong Zhang,
Wenxiang Jiang,
Nesta Benno Joseph,
Liangcai Xu,
Yicheng Mou,
Yunkun Yang,
Pengliang Leng,
Yong Zhang,
Li Pi,
Alexey Suslov,
Mykhaylo Ozerov,
Jan Wyzula,
Milan Orlita,
Fengfeng Zhu,
Yi Zhang,
Xufeng Kou,
Zengwei Zhu,
Awadhesh Narayan,
Dong Qian,
Jinsheng Wen
, et al. (3 additional authors not shown)
Abstract:
The Mott-Ioffe-Regel limit sets the lower bound of carrier mean free path for coherent quasiparticle transport. Metallicity beyond this limit is of great interest because it is often closely related to quantum criticality and unconventional superconductivity. Progress along this direction mainly focuses on the strange-metal behaviors originating from the evolution of quasiparticle scattering rate such as linear-in-temperature resistivity, while the quasiparticle coherence phenomena in this regime are much less explored due to the short mean free path at the diffusive bound. Here we report the observation of quantum oscillations from Landau quantization near the Mott-Ioffe-Regel limit in CaAs3. Despite the insulator-like temperature dependence of resistivity, CaAs3 presents giant magnetoresistance and prominent Shubnikov-de Haas oscillations from Fermi surfaces, indicating highly coherent band transport. In contrast, the quantum oscillation is absent in the magnetic torque. The quasiparticle effective mass increases systematically with magnetic fields, manifesting a much larger value than the expectation given by magneto-infrared spectroscopy. This suggests a strong many-body renormalization effect near the Fermi surface. We find that these unconventional behaviors may be explained by the interplay between the mobility edge and the van Hove singularity, which results in the formation of coherent cyclotron orbits emerging at the diffusive bound. Our results call for further study on the electron correlation effect of the van Hove singularity.
Submitted 14 March, 2024;
originally announced March 2024.
-
Fuzzy Fault Trees Formalized
Authors:
Thi Kim Nhung Dang,
Milan Lopuhaä-Zwakenberg,
Mariëlle Stoelinga
Abstract:
Fault tree analysis is a vital method of assessing safety risks. It helps to identify potential causes of accidents, assess their likelihood and severity, and suggest preventive measures. Quantitative analysis of fault trees is often done via the dependability metrics that compute the system's failure behaviour over time. However, the lack of precise data is a major obstacle to quantitative analysis, and so to reliability analysis. Fuzzy logic is a popular framework for dealing with ambiguous values and has applications in many domains. A number of fuzzy approaches to fault tree analysis have been proposed, but -- to the best of our knowledge -- none of them provide rigorous definitions or algorithms for computing fuzzy unreliability values. In this paper, we define a rigorous framework for fuzzy unreliability values. In addition, we provide a bottom-up algorithm to efficiently calculate fuzzy reliability for a system. The algorithm incorporates the $\alpha$-cut method, that is, it performs binary algebraic operations on intervals on horizontally discretised $\alpha$-cut representations of fuzzy numbers. The method preserves the nonlinearity of fuzzy unreliability. Finally, we illustrate the results obtained from two case studies.
Submitted 13 March, 2024;
originally announced March 2024.
-
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Authors:
Gemini Team,
Petko Georgiev,
Ving Ian Lei,
Ryan Burnell,
Libin Bai,
Anmol Gulati,
Garrett Tanzer,
Damien Vincent,
Zhufeng Pan,
Shibo Wang,
Soroosh Mariooryad,
Yifan Ding,
Xinyang Geng,
Fred Alcober,
Roy Frostig,
Mark Omernick,
Lexi Walker,
Cosmin Paduraru,
Christina Sorokin,
Andrea Tacchetti,
Colin Gaffney,
Samira Daruki,
Olcan Sercinoglu,
Zach Gleicher,
Juliette Love
, et al. (1092 additional authors not shown)
Abstract:
In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February version on the great majority of capabilities and benchmarks; (2) Gemini 1.5 Flash, a more lightweight variant designed for efficiency with minimal regression in quality. Gemini 1.5 models achieve near-perfect recall on long-context retrieval tasks across modalities, improve the state-of-the-art in long-document QA, long-video QA and long-context ASR, and match or surpass Gemini 1.0 Ultra's state-of-the-art performance across a broad set of benchmarks. Studying the limits of Gemini 1.5's long-context ability, we find continued improvement in next-token prediction and near-perfect retrieval (>99%) up to at least 10M tokens, a generational leap over existing models such as Claude 3.0 (200k) and GPT-4 Turbo (128k). Finally, we highlight real-world use cases, such as Gemini 1.5 collaborating with professionals on completing their tasks achieving 26 to 75% time savings across 10 different job categories, as well as surprising new capabilities of large language models at the frontier; when given a grammar manual for Kalamang, a language with fewer than 200 speakers worldwide, the model learns to translate English to Kalamang at a similar level to a person who learned from the same content.
Submitted 14 June, 2024; v1 submitted 8 March, 2024;
originally announced March 2024.