-
Measuring the arrival time of an electron wave packet using a dynamical potential barrier
Authors:
Wanki Park,
H. -S. Sim,
Sungguen Ryu
Abstract:
A time-dependent potential barrier has been used to probe the arrival-time distribution of the wave packet of a hot electron by raising the barrier to block the packet upon arrival of the packet at the barrier. To see whether the barrier precisely detects the distribution, it is necessary to study an error caused by a finite rising speed of the barrier. For this purpose, we study transmission of an electron wave packet through the dynamical barrier, and identify two regimes, the semiclassical regime and the quasistatic regime. In each regime, we calculate the arrival-time distribution reconstructed by using the barrier and quantify the error in the detection, the difference of the temporal uncertainty between the wave-packet distribution and the reconstructed distribution. Our finding suggests that for precise detection, the time scale, in which the barrier height rises over the energy distribution of the wave packet and the tunneling energy window of the barrier, has to be much shorter than the temporal uncertainty of the wave packet. The analytical results are confirmed with numerical calculations.
Submitted 6 December, 2023;
originally announced December 2023.
-
Aggressive Trajectory Tracking for Nano Quadrotors Using Embedded Nonlinear Model Predictive Control
Authors:
Muhammad Kazim,
Hyunjae Sim,
Gihun Shin,
Hwancheol Hwang,
Kwang-Ki K. Kim
Abstract:
This paper presents an aggressive trajectory tracking method for a small lightweight nano-quadrotor using nonlinear model predictive control (NMPC) based on acados. Controlling a nano quadrotor for accurate trajectory tracking at high speed in dynamic environments is challenging due to complex aerodynamic forces that introduce significant disturbances and large positional tracking errors. These aerodynamic effects are difficult to identify and require feedback control that compensates for them in real time. NMPC allows the nano-quadrotor to control its motion in real time based on onboard sensor measurements, making it well-suited for tasks such as aggressive maneuvers and navigation in complex and dynamic environments. The software package acados enables the implementation of the NMPC algorithm on embedded systems, which is particularly important for nano-quadrotors due to their limited computational resources. Our autonomous navigation system is developed based on an AI-deck, a GAP8-based parallel ultra-low-power computing platform, with onboard sensors of a multi-ranger deck and a flow deck. The proposed method of NMPC-based trajectory tracking control is tested in simulation and the results demonstrate its effectiveness in trajectory tracking while considering the dynamic environments. It is also tested on real nano-quadrotor hardware, a 27-g Crazyflie 2.1, with a customized MCU running embedded NMPC, in which accurate trajectory tracking results are achieved in dynamic real-world environments.
Submitted 1 December, 2023;
originally announced December 2023.
-
Thermal Hall effects due to topological spin fluctuations in YMnO$_3$
Authors:
Ha-Leem Kim,
Takuma Saito,
Heejun Yang,
Hiroaki Ishizuka,
Matthew John Coak,
Jun Han Lee,
Hasung Sim,
Yoon Seok Oh,
Naoto Nagaosa,
Je-Geun Park
Abstract:
The thermal Hall effect in magnetic insulators has been considered a powerful method for examining the topological nature of charge-neutral quasiparticles such as magnons. Yet, unlike the kagome system, the triangular lattice has received less attention for studying the thermal Hall effect because the scalar spin chirality cancels out between adjacent triangles. However, such cancellation cannot be perfect if the triangular lattice is distorted, which could open the possibility of a non-zero thermal Hall effect. Here, we report that the trimerized triangular lattice of multiferroic hexagonal manganite YMnO$_3$ produces a highly unusual thermal Hall effect due to topological spin fluctuations with the additional intricacy of a Dzyaloshinskii-Moriya interaction under an applied magnetic field. We conclude the thermal Hall conductivity arises from the system's topological nature of spin fluctuations. Our theoretical calculations demonstrate that the thermal Hall conductivity is also related in this material to the splitting of the otherwise degenerate two chiralities, left and right, of its 120$^{\circ}$ magnetic structure. Our result is one of the most unusual cases of topological physics due to this broken $Z_2$ symmetry of the chirality in the supposedly paramagnetic state of YMnO$_3$, with strong topological spin fluctuations. These new mechanisms in this important class of materials are crucial in exploring new thermal Hall physics and exotic excitations.
Submitted 19 November, 2023;
originally announced November 2023.
-
DenseNet and Support Vector Machine classifications of major depressive disorder using vertex-wise cortical features
Authors:
Vladimir Belov,
Tracy Erwin-Grabner,
Ling-Li Zeng,
Christopher R. K. Ching,
Andre Aleman,
Alyssa R. Amod,
Zeynep Basgoze,
Francesco Benedetti,
Bianca Besteher,
Katharina Brosch,
Robin Bülow,
Romain Colle,
Colm G. Connolly,
Emmanuelle Corruble,
Baptiste Couvy-Duchesne,
Kathryn Cullen,
Udo Dannlowski,
Christopher G. Davey,
Annemiek Dols,
Jan Ernsting,
Jennifer W. Evans,
Lukas Fisch,
Paola Fuentes-Claramonte,
Ali Saffet Gonul,
Ian H. Gotlib
, et al. (63 additional authors not shown)
Abstract:
Major depressive disorder (MDD) is a complex psychiatric disorder that affects the lives of hundreds of millions of individuals around the globe. Even today, researchers debate if morphological alterations in the brain are linked to MDD, likely due to the heterogeneity of this disorder. The application of deep learning tools to neuroimaging data, capable of capturing complex non-linear patterns, has the potential to provide diagnostic and predictive biomarkers for MDD. However, previous attempts to demarcate MDD patients and healthy controls (HC) based on segmented cortical features via linear machine learning approaches have reported low accuracies. In this study, we used globally representative data from the ENIGMA-MDD working group containing an extensive sample of people with MDD (N=2,772) and HC (N=4,240), which allows a comprehensive analysis with generalizable results. Based on the hypothesis that integration of vertex-wise cortical features can improve classification performance, we evaluated the classification of a DenseNet and a Support Vector Machine (SVM), with the expectation that the former would outperform the latter. As we analyzed a multi-site sample, we additionally applied the ComBat harmonization tool to remove potential nuisance effects of site. We found that both classifiers exhibited close to chance performance (balanced accuracy DenseNet: 51%; SVM: 53%), when estimated on unseen sites. Slightly higher classification performance (balanced accuracy DenseNet: 58%; SVM: 55%) was found when the cross-validation folds contained subjects from all sites, indicating site effect. In conclusion, the integration of vertex-wise morphometric features and the use of the non-linear classifier did not lead to the differentiability between MDD and HC. Our results support the notion that MDD classification on this combination of features and classifiers is unfeasible.
Submitted 18 November, 2023;
originally announced November 2023.
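The balanced accuracy figures quoted in the abstract above can be read as the mean of per-class recall, which is why roughly 50% corresponds to chance for two classes even when group sizes differ (here N=2,772 vs N=4,240). The following is a minimal sketch of that metric only; the function name and the toy labels are illustrative assumptions and do not come from the paper.

```python
def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall over the classes present in y_true."""
    classes = sorted(set(y_true))
    recalls = []
    for c in classes:
        # Indices of all samples whose true label is c.
        idx = [i for i, t in enumerate(y_true) if t == c]
        correct = sum(1 for i in idx if y_pred[i] == c)
        recalls.append(correct / len(idx))
    return sum(recalls) / len(recalls)


# Toy, made-up labels: 2 patients (1) and 4 controls (0).
y_true = [1, 1, 0, 0, 0, 0]
# A classifier that always predicts the majority class.
y_majority = [0, 0, 0, 0, 0, 0]
print(balanced_accuracy(y_true, y_majority))
```

A majority-class predictor scores 4/6 in plain accuracy on these toy labels but only 0.5 in balanced accuracy, which is the sense in which the reported 51% and 53% are close to chance.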
-
Efficacy of Wolbachia-mediated sterility to suppress dengue: a synthetic control study
Authors:
Jue Tao Lim,
Somya Bansal,
Chee Seng Chong,
Borame Dickens,
Youming Ng,
Lu Deng,
Caleb Lee,
Li Yun Tan,
Grace Chain,
Pei Ma,
Shuzhen Sim,
Cheong Huat Tan,
Alex R Cook,
Lee Ching Ng
Abstract:
In a study conducted in Singapore, a country prone to dengue outbreaks due to its climate and urban population, researchers examined the effectiveness of releasing male Aedes aegypti mosquitoes infected with Wolbachia (wAlbB strain) to reduce dengue transmission. These infected males, when mating with wild-type females, produced non-viable eggs, leading to vector suppression. Extensive field trials involving over 600,000 residents in four townships were conducted from 2018 to 2022. The results showed a 57% decline in total dengue incidence and a 64% decline in clustered dengue incidence. This approach offers promise for large-scale dengue control in regions facing rising dengue cases, providing a critical solution in combating the disease.
Submitted 16 November, 2023;
originally announced November 2023.
-
Sandi: A System for Accountability and Applications in Direct Communication (Extended Abstract)
Authors:
F. Betül Durak,
Kim Laine,
Simon Langowski,
Radames Cruz Moreno,
Robert Sim,
Shrey Jain
Abstract:
Reputation systems guide our decision making both in life and work: which restaurant to eat at, which vendor to buy from, which software dependencies to use, and who or what to trust. These systems are often based on old ideas and are failing in the face of modern threats. Fraudsters have found ways to manipulate them, undermining their integrity and utility. Generative AI adds to the problem by enabling the creation of real-looking fake narratives at scale, creating a false sense of consensus. Meanwhile, the need for reliable reputation concepts is more important than ever, as wrong decisions lead to increasingly severe outcomes: wasted time, poor service, and a feeling of injustice at best, fraud, identity theft, and ransomware at worst.
In this extended abstract we introduce Sandi, a new kind of reputation system with a single well-defined purpose: to create trust through accountability in one-to-one transactions. Examples of such transactions include sending an email or making a purchase online. Sandi has strong security and privacy properties that make it suitable for use also in sensitive contexts. Furthermore, Sandi can guarantee reputation integrity and transparency for its registered users.
As a primary application, we envision how Sandi could counter fraud and abuse in direct communication. Concretely, message senders request a cryptographic tag from Sandi that they send along with their message. If the receiver finds the message inappropriate, they can report the sender using this tag. Notably, only senders need registered accounts and do not need to manage long-term keys. The design of Sandi ensures compatibility with any communication system that allows for small binary data transmission.
Submitted 8 November, 2023;
originally announced November 2023.
-
Multivessel Coronary Artery Segmentation and Stenosis Localisation using Ensemble Learning
Authors:
Muhammad Bilal,
Dinis Martinho,
Reiner Sim,
Adnan Qayyum,
Hunaid Vohra,
Massimo Caputo,
Taofeek Akinosho,
Sofiat Abioye,
Zaheer Khan,
Waleed Niaz,
Junaid Qadir
Abstract:
Coronary angiography analysis is a common clinical task performed by cardiologists to diagnose coronary artery disease (CAD) through an assessment of atherosclerotic plaque accumulation. This study introduces an end-to-end machine learning solution developed as part of our solution for the MICCAI 2023 Automatic Region-based Coronary Artery Disease diagnostics using x-ray angiography imagEs (ARCADE) challenge, which aims to benchmark solutions for multivessel coronary artery segmentation and potential stenotic lesion localisation from X-ray coronary angiograms. We adopted a robust baseline model training strategy to progressively improve performance, comprising five successive stages of binary class pretraining, multivessel segmentation, fine-tuning using class frequency weighted dataloaders, fine-tuning using F1-based curriculum learning strategy (F1-CLS), and finally multi-target angiogram view classifier-based collective adaptation. Unlike many other medical imaging procedures, this task exhibits a notable degree of interobserver variability. Our ensemble model combines the outputs from six baseline models using the weighted ensembling approach, which our analysis shows doubles the predictive accuracy of the proposed solution. The final prediction was further refined, targeting the correction of misclassified blobs. Our solution achieved a mean F1 score of $37.69\%$ for coronary artery segmentation, and $39.41\%$ for stenosis localisation, positioning our team in the 5th position on both leaderboards. This work demonstrates the potential of automated tools to aid CAD diagnosis, guide interventions, and improve the accuracy of stent injections in clinical settings.
Submitted 27 October, 2023;
originally announced October 2023.
-
Privately Aligning Language Models with Reinforcement Learning
Authors:
Fan Wu,
Huseyin A. Inan,
Arturs Backurs,
Varun Chandrasekaran,
Janardhan Kulkarni,
Robert Sim
Abstract:
Positioned between pre-training and user deployment, aligning large language models (LLMs) through reinforcement learning (RL) has emerged as a prevailing strategy for training instruction-following models such as ChatGPT. In this work, we initiate the study of privacy-preserving alignment of LLMs through Differential Privacy (DP) in conjunction with RL. Following the influential work of Ziegler et al. (2020), we study two dominant paradigms: (i) alignment via RL without a human in the loop (e.g., positive review generation) and (ii) alignment via RL from human feedback (RLHF) (e.g., summarization in a human-preferred way). We give a new DP framework to achieve alignment via RL, and prove its correctness. Our experimental results validate the effectiveness of our approach, offering competitive utility while ensuring strong privacy protections.
Submitted 3 May, 2024; v1 submitted 25 October, 2023;
originally announced October 2023.
-
No-Regret Learning and Equilibrium Computation in Quantum Games
Authors:
Wayne Lin,
Georgios Piliouras,
Ryann Sim,
Antonios Varvitsiotis
Abstract:
As quantum processors advance, the emergence of large-scale decentralized systems involving interacting quantum-enabled agents is on the horizon. Recent research efforts have explored quantum versions of Nash and correlated equilibria as solution concepts of strategic quantum interactions, but these approaches did not directly connect to decentralized adaptive setups where agents possess limited information. This paper delves into the dynamics of quantum-enabled agents within decentralized systems that employ no-regret algorithms to update their behaviors over time. Specifically, we investigate two-player quantum zero-sum games and polymatrix quantum zero-sum games, showing that no-regret algorithms converge to separable quantum Nash equilibria in time-average. In the case of general multi-player quantum games, our work leads to a novel solution concept, (separable) quantum coarse correlated equilibria (QCCE), as the convergent outcome of the time-averaged behavior of no-regret algorithms, offering a natural solution concept for decentralized quantum systems. Finally, we show that computing QCCEs can be formulated as a semidefinite program and establish the existence of entangled (i.e., non-separable) QCCEs, which cannot be approached via the current paradigm of no-regret learning.
Submitted 14 November, 2023; v1 submitted 12 October, 2023;
originally announced October 2023.
-
A Machine Learning Approach to Predicting Single Event Upsets
Authors:
Archit Gupta,
Chong Yock Eng,
Deon Lim Meng Wee,
Rashna Analia Ahmed,
See Min Sim
Abstract:
A single event upset (SEU) is a critical soft error that occurs in semiconductor devices on exposure to ionising particles from space environments. SEUs cause bit flips in the memory component of semiconductors. This creates a multitude of safety hazards as stored information becomes less reliable. Currently, SEUs are only detected several hours after their occurrence. CREMER, the model presented in this paper, predicts SEUs in advance using machine learning. CREMER uses only positional data to predict SEU occurrence, making it robust, inexpensive and scalable. Upon implementation, the improved reliability of memory devices will create a digitally safer environment onboard space vehicles.
Submitted 9 October, 2023;
originally announced October 2023.
-
Serving Deep Learning Model in Relational Databases
Authors:
Alexandre Eichenberger,
Qi Lin,
Saif Masood,
Hong Min,
Alexander Sim,
Jie Wang,
Yida Wang,
Kesheng Wu,
Binhang Yuan,
Lixi Zhou,
Jia Zou
Abstract:
Serving deep learning (DL) models on relational data has become a critical requirement across diverse commercial and scientific domains, sparking growing interest recently. In this visionary paper, we embark on a comprehensive exploration of representative architectures to address the requirement. We highlight three pivotal paradigms: The state-of-the-art DL-Centric architecture offloads DL computations to dedicated DL frameworks. The potential UDF-Centric architecture encapsulates one or more tensor computations into User Defined Functions (UDFs) within the database system. The potential Relation-Centric architecture aims to represent a large-scale tensor computation through relational operators. While each of these architectures demonstrates promise in specific use scenarios, we identify urgent requirements for seamless integration of these architectures and the middle ground between these architectures. We delve into the gaps that impede the integration and explore innovative strategies to close them. We present a pathway to establish a novel database system for enabling a broad class of data-intensive DL inference applications.
Submitted 9 October, 2023; v1 submitted 7 October, 2023;
originally announced October 2023.
-
Profit: Benchmarking Personalization and Robustness Trade-off in Federated Prompt Tuning
Authors:
Liam Collins,
Shanshan Wu,
Sewoong Oh,
Khe Chai Sim
Abstract:
In many applications of federated learning (FL), clients desire models that are personalized using their local data, yet are also robust in the sense that they retain general global knowledge. However, the presence of data heterogeneity across clients induces a fundamental trade-off between personalization (i.e., adaptation to a local distribution) and robustness (i.e., not forgetting previously learned general knowledge). It is critical to understand how to navigate this personalization vs robustness trade-off when designing federated systems, which are increasingly moving towards a paradigm of fine-tuning large foundation models. Due to limited computational and communication capabilities in most federated settings, this foundation model fine-tuning must be done using parameter-efficient fine-tuning (PEFT) approaches. While some recent work has studied federated approaches to PEFT, the personalization vs robustness trade-off of federated PEFT has been largely unexplored. In this work, we take a step towards bridging this gap by benchmarking fundamental FL algorithms -- FedAvg and FedSGD plus personalization (via client local fine-tuning) -- applied to one of the most ubiquitous PEFT approaches to large language models (LLMs) -- prompt tuning -- in a multitude of hyperparameter settings under varying levels of data heterogeneity. Our results show that federated-trained prompts can be surprisingly robust when using a small learning rate with many local epochs for personalization, especially when using an adaptive optimizer as the client optimizer during federated training. We also demonstrate that simple approaches such as adding regularization and interpolating two prompts are effective in improving the personalization vs robustness trade-off in computation-limited settings with few local updates allowed for personalization.
Submitted 6 October, 2023;
originally announced October 2023.
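Two of the operations named in the abstract above, FedAvg-style aggregation and interpolation of two prompts, reduce to weighted averages of parameter vectors. The sketch below illustrates only that arithmetic under simplifying assumptions: prompts are flat lists of floats, aggregation weights are client sample counts, and the names `fedavg` and `interpolate` are hypothetical. It is not the benchmark's implementation.

```python
def fedavg(client_params, client_sizes):
    """Average client parameter vectors, weighted by local sample counts."""
    total = sum(client_sizes)
    dim = len(client_params[0])
    return [
        sum(p[j] * n for p, n in zip(client_params, client_sizes)) / total
        for j in range(dim)
    ]


def interpolate(global_prompt, personal_prompt, alpha):
    """Blend two prompts: alpha=0 keeps the global one, alpha=1 the personal one."""
    return [
        (1 - alpha) * g + alpha * p
        for g, p in zip(global_prompt, personal_prompt)
    ]


# Toy, made-up values: two clients holding 1 and 3 samples.
global_prompt = fedavg([[1.0, 0.0], [3.0, 4.0]], [1, 3])
print(global_prompt)
print(interpolate([0.0, 2.0], [2.0, 4.0], 0.5))
```

In this reading, `alpha` is the knob for the trade-off the abstract describes: values near 0 favor robustness (the global prompt), values near 1 favor personalization.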
-
Sweeping Heterogeneity with Smart MoPs: Mixture of Prompts for LLM Task Adaptation
Authors:
Chen Dun,
Mirian Hipolito Garcia,
Guoqing Zheng,
Ahmed Hassan Awadallah,
Anastasios Kyrillidis,
Robert Sim
Abstract:
Large Language Models (LLMs) have the ability to solve a variety of tasks, such as text summarization and mathematical questions, just out of the box, but they are often trained with a single task in mind. Due to high computational costs, the current trend is to use prompt instruction tuning to better adjust monolithic, pretrained LLMs for new -- but often individual -- downstream tasks. Thus, how one would expand prompt tuning to handle -- concomitantly -- heterogeneous tasks and data distributions is a widely open question. To address this gap, we suggest the use of \emph{Mixture of Prompts}, or MoPs, associated with smart gating functionality: the latter -- whose design is one of the contributions of this paper -- can identify relevant skills embedded in different groups of prompts and dynamically assign combined experts (i.e., collection of prompts), based on the target task. Additionally, MoPs are empirically agnostic to any model compression technique applied -- for efficiency reasons -- as well as instruction data source and task composition. In practice, MoPs can simultaneously mitigate prompt training "interference" in multi-task, multi-source scenarios (e.g., task and data heterogeneity across sources), as well as possible implications from model approximations. As a highlight, MoPs manage to decrease final perplexity from $\sim20\%$ up to $\sim70\%$, as compared to baselines, in the federated scenario, and from $\sim 3\%$ up to $\sim30\%$ in the centralized scenario.
Submitted 5 October, 2023; v1 submitted 4 October, 2023;
originally announced October 2023.
-
Machine Learning-Enabled Precision Position Control and Thermal Regulation in Advanced Thermal Actuators
Authors:
Seyed Mo Mirvakili,
Ehsan Haghighat,
Douglas Sim
Abstract:
With their unique combination of characteristics - an energy density almost 100 times that of human muscle, and a power density of 5.3 kW/kg, similar to a jet engine's output - Nylon artificial muscles stand out as particularly apt for robotics applications. However, the necessity of integrating sensors and controllers poses a limitation to their practical usage. Here we report a constant power open-loop controller based on machine learning. We show that we can control the position of a nylon artificial muscle without external sensors. To this end, we construct a mapping from a desired displacement trajectory to a required power using an ensemble encoder-style feed-forward neural network. The neural controller is carefully trained on a physics-based denoised dataset and can be fine-tuned to accommodate various types of thermal artificial muscles, irrespective of the presence or absence of hysteresis.
Submitted 4 October, 2023;
originally announced October 2023.
-
Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm
Authors:
Weiran Wang,
Zelin Wu,
Diamantino Caseiro,
Tsendsuren Munkhdalai,
Khe Chai Sim,
Pat Rondon,
Golan Pundak,
Gan Song,
Rohit Prabhavalkar,
Zhong Meng,
Ding Zhao,
Tara Sainath,
Pedro Moreno Mengibar
Abstract:
Contextual biasing refers to the problem of biasing the automatic speech recognition (ASR) systems towards rare entities that are relevant to the specific user or application scenarios. We propose algorithms for contextual biasing based on the Knuth-Morris-Pratt algorithm for pattern matching. During beam search, we boost the score of a token extension if it extends matching into a set of biasing phrases. Our method simulates the classical approaches often implemented in the weighted finite state transducer (WFST) framework, but avoids the FST language altogether, with careful considerations on memory footprint and efficiency on tensor processing units (TPUs) by vectorization. Without introducing additional model parameters, our method achieves significant word error rate (WER) reductions on biasing test sets by itself, and yields further performance gain when combined with a model-based biasing method.
Submitted 29 September, 2023;
originally announced October 2023.
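The matching step the abstract above builds on can be sketched with the classical Knuth-Morris-Pratt failure function applied to token sequences. This is a minimal single-phrase illustration, assuming a non-empty biasing phrase; the `biased_score` rule and its `bonus` parameter are hypothetical and are not taken from the paper, which handles a set of phrases and vectorizes the computation for TPUs.

```python
def failure_function(pattern):
    """fail[i] = length of the longest proper prefix of pattern[:i+1]
    that is also a suffix of it (the classical KMP table)."""
    fail = [0] * len(pattern)
    k = 0
    for i in range(1, len(pattern)):
        while k > 0 and pattern[i] != pattern[k]:
            k = fail[k - 1]
        if pattern[i] == pattern[k]:
            k += 1
        fail[i] = k
    return fail


def advance(pattern, fail, state, token):
    """Return the matched-prefix length after consuming one more token."""
    if state == len(pattern):
        # A full match was just completed; fall back before continuing.
        state = fail[state - 1]
    while state > 0 and token != pattern[state]:
        state = fail[state - 1]
    if token == pattern[state]:
        state += 1
    return state


def biased_score(base_score, old_state, new_state, bonus=1.0):
    """Hypothetical boost: reward tokens that lengthen the match and
    take the reward back when the partial match is lost."""
    return base_score + bonus * (new_state - old_state)


phrase = ["a", "b", "a", "b"]  # toy biasing phrase, as tokens
fail = failure_function(phrase)
state, states = 0, []
for tok in ["a", "b", "a", "b", "a", "b"]:
    state = advance(phrase, fail, state, tok)
    states.append(state)
print(fail, states)
```

During beam search, each hypothesis would carry its own `state`, so extending a hypothesis by one token costs an amortized constant number of comparisons rather than a rescan of the phrase.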
-
Massive End-to-end Models for Short Search Queries
Authors:
Weiran Wang,
Rohit Prabhavalkar,
Dongseong Hwang,
Qiujia Li,
Khe Chai Sim,
Bo Li,
James Qin,
Xingyu Cai,
Adam Stooke,
Zhong Meng,
CJ Zheng,
Yanzhang He,
Tara Sainath,
Pedro Moreno Mengibar
Abstract:
In this work, we investigate two popular end-to-end automatic speech recognition (ASR) models, namely Connectionist Temporal Classification (CTC) and RNN-Transducer (RNN-T), for offline recognition of voice search queries, with up to 2B model parameters. The encoders of our models use the neural architecture of Google's universal speech model (USM), with additional funnel pooling layers to significantly reduce the frame rate and speed up training and inference. We perform extensive studies on vocabulary size, time reduction strategy, and its generalization performance on long-form test sets. Despite the speculation that, as the model size increases, CTC can be as good as RNN-T which builds label dependency into the prediction, we observe that a 900M RNN-T clearly outperforms a 1.8B CTC and is more tolerant to severe time reduction, although the WER gap can be largely removed by LM shallow fusion.
Submitted 22 September, 2023;
originally announced September 2023.
-
SN 2022jli: a type Ic supernova with periodic modulation of its light curve and an unusually long rise
Authors:
Moore T.,
Smartt S. J.,
Nicholl M.,
Srivastav S.,
Stevance H. F.,
Jess D. B.,
Grant S. D. T.,
Fulton M. D.,
Rhodes L.,
Sim S. A.,
Hirai R.,
Podsiadlowski P.,
Anderson J. P.,
Ashall C.,
Bate W.,
Fender R.,
Gutierrez C. P.,
Howell D. A.,
Huber M. E.,
Inserra C.,
Leloudas G.,
Monard L. A. G.,
Muller-Bravo T. E.,
Shappee B. J.,
Smith K. W.
, et al. (20 additional authors not shown)
Abstract:
We present multi-wavelength photometry and spectroscopy of SN 2022jli, an unprecedented Type Ic supernova discovered in the galaxy NGC 157 at a distance of $\approx$ 23 Mpc. The multi-band light curves reveal many remarkable characteristics. Peaking at a magnitude of $g=15.11\pm0.02$, the high-cadence photometry reveals 12.5$\pm0.2\ $day periodic undulations superimposed on the 200 day supernova decline. This periodicity is observed in the light curves from nine separate filter and instrument configurations with peak-to-peak amplitudes of $\simeq$ 0.1 mag. This is the first time that repeated periodic oscillations, over many cycles, have been detected in a supernova light curve. SN 2022jli also displays an extreme early excess which fades over $\approx$ 25 days followed by a rise to a peak luminosity of $L_{\rm opt} = 10^{42.1}$ erg s$^{-1}$. Although the exact explosion epoch is not constrained by data, the time from explosion to maximum light is $\gtrsim$ 59 days. The luminosity can be explained by a large ejecta mass ($M_{\rm ej}\approx12\pm6$M$_{\odot}$) powered by $^{56}$Ni but we find difficulty in quantitatively modelling the early excess with circumstellar interaction and cooling. Collision between the supernova ejecta and a binary companion is a possible source of this emission. We discuss the origin of the periodic variability in the light curve, including interaction of the SN ejecta with nested shells of circumstellar matter and neutron stars colliding with binary companions.
Submitted 22 September, 2023;
originally announced September 2023.
-
Privacy-Preserving In-Context Learning with Differentially Private Few-Shot Generation
Authors:
Xinyu Tang,
Richard Shin,
Huseyin A. Inan,
Andre Manoel,
Fatemehsadat Mireshghallah,
Zinan Lin,
Sivakanth Gopi,
Janardhan Kulkarni,
Robert Sim
Abstract:
We study the problem of in-context learning (ICL) with large language models (LLMs) on private datasets. This scenario poses privacy risks, as LLMs may leak or regurgitate the private examples demonstrated in the prompt. We propose a novel algorithm that generates synthetic few-shot demonstrations from the private dataset with formal differential privacy (DP) guarantees, and show empirically that it can achieve effective ICL. We conduct extensive experiments on standard benchmarks and compare our algorithm with non-private ICL and zero-shot solutions. Our results demonstrate that our algorithm can achieve competitive performance with strong privacy levels. These results open up new possibilities for ICL with privacy protection for a broad range of applications.
Submitted 27 January, 2024; v1 submitted 20 September, 2023;
originally announced September 2023.
-
Improving Speech Recognition for African American English With Audio Classification
Authors:
Shefali Garg,
Zhouyuan Huo,
Khe Chai Sim,
Suzan Schwartz,
Mason Chua,
Alëna Aksënova,
Tsendsuren Munkhdalai,
Levi King,
Darryl Wright,
Zion Mengesha,
Dongseong Hwang,
Tara Sainath,
Françoise Beaufays,
Pedro Moreno Mengibar
Abstract:
Automatic speech recognition (ASR) systems have been shown to have large quality disparities between the language varieties they are intended or expected to recognize. One way to mitigate this is to train or fine-tune models with more representative datasets. But this approach can be hindered by limited in-domain data for training and evaluation. We propose a new way to improve the robustness of a US English short-form speech recognizer using a small amount of out-of-domain (long-form) African American English (AAE) data. We use CORAAL, YouTube and Mozilla Common Voice to train an audio classifier to approximately output whether an utterance is AAE or some other variety including Mainstream American English (MAE). By combining the classifier output with coarse geographic information, we can select a subset of utterances from a large corpus of untranscribed short-form queries for semi-supervised learning at scale. Fine-tuning on this data results in a 38.5% relative word error rate disparity reduction between AAE and MAE without reducing MAE quality.
Submitted 16 September, 2023;
originally announced September 2023.
-
Towards inferring the geometry of kilonovae
Authors:
Christine E. Collins,
Luke J. Shingles,
Andreas Bauswein,
Stuart A. Sim,
Theodoros Soultanis,
Vimal Vijayan,
Andreas Floers,
Oliver Just,
Gerrit Leck,
Georgios Lioutas,
Gabriel Martínez-Pinedo,
Albert Sneppen,
Darach Watson,
Zewei Xiong
Abstract:
Recent analysis of the kilonova, AT2017gfo, has indicated that this event was highly spherical. This may challenge hydrodynamics simulations of binary neutron star mergers, which usually predict a range of asymmetries, and radiative transfer simulations show a strong direction dependence. Here we investigate whether the synthetic spectra from a 3D kilonova simulation of asymmetric ejecta from a hydrodynamical merger simulation can be compatible with the observational constraints suggesting a high degree of sphericity in AT2017gfo. Specifically, we determine whether fitting a simple P-Cygni line profile model leads to a value for the photospheric velocity that is consistent with the value obtained from the expanding photosphere method. We would infer that our kilonova simulation is highly spherical at early times, when the spectra resemble a blackbody distribution. The two independently inferred photospheric velocities can be very similar, implying a high degree of sphericity, which can be as spherical as inferred for AT2017gfo, demonstrating that the photosphere can appear spherical even for asymmetrical ejecta. The last-interaction velocities of radiation escaping the simulation show a high degree of sphericity, supporting the inferred symmetry of the photosphere. We find that when the synthetic spectra resemble a blackbody the expanding photosphere method can be used to obtain an accurate luminosity distance (within 4-7 per cent).
Submitted 23 February, 2024; v1 submitted 11 September, 2023;
originally announced September 2023.
-
Automatic Data Transformation Using Large Language Model: An Experimental Study on Building Energy Data
Authors:
Ankita Sharma,
Xuanmao Li,
Hong Guan,
Guoxin Sun,
Liang Zhang,
Lanjun Wang,
Kesheng Wu,
Lei Cao,
Erkang Zhu,
Alexander Sim,
Teresa Wu,
Jia Zou
Abstract:
Existing approaches to automatic data transformation are insufficient to meet the requirements in many real-world scenarios, such as the building sector. First, there is no convenient interface for domain experts to provide domain knowledge easily. Second, they require significant training data collection overheads. Third, the accuracy suffers from complicated schema changes. To bridge this gap, we present a novel approach that leverages the unique capabilities of large language models (LLMs) in coding, complex reasoning, and zero-shot learning to generate SQL code that transforms the source datasets into the target datasets. We demonstrate the viability of this approach by designing an LLM-based framework, termed SQLMorpher, which comprises a prompt generator that integrates the initial prompt with optional domain knowledge and historical patterns in external databases. It also implements an iterative prompt optimization mechanism that automatically improves the prompt based on flaw detection. The key contributions of this work include (1) pioneering an end-to-end LLM-based solution for data transformation, (2) developing a benchmark dataset of 105 real-world building energy data transformation problems, and (3) conducting an extensive empirical evaluation where our approach achieved 96% accuracy in all 105 problems. SQLMorpher demonstrates the effectiveness of utilizing LLMs in complex, domain-specific challenges, highlighting their potential to drive sustainable solutions.
Submitted 6 September, 2023; v1 submitted 5 September, 2023;
originally announced September 2023.
-
Deep Imitation Learning for Humanoid Loco-manipulation through Human Teleoperation
Authors:
Mingyo Seo,
Steve Han,
Kyutae Sim,
Seung Hyeon Bang,
Carlos Gonzalez,
Luis Sentis,
Yuke Zhu
Abstract:
We tackle the problem of developing humanoid loco-manipulation skills with deep imitation learning. The difficulty of collecting task demonstrations and training policies for humanoids with a high degree of freedom presents substantial challenges. We introduce TRILL, a data-efficient framework for training humanoid loco-manipulation policies from human demonstrations. In this framework, we collect human demonstration data through an intuitive Virtual Reality (VR) interface. We employ the whole-body control formulation to transform task-space commands by human operators into the robot's joint-torque actuation while stabilizing its dynamics. By employing high-level action abstractions tailored for humanoid loco-manipulation, our method can efficiently learn complex sensorimotor skills. We demonstrate the effectiveness of TRILL in simulation and on a real-world robot for performing various loco-manipulation tasks. Videos and additional materials can be found on the project page: https://ut-austin-rpl.github.io/TRILL.
Submitted 19 November, 2023; v1 submitted 5 September, 2023;
originally announced September 2023.
-
Using Large Language Models for Cybersecurity Capture-The-Flag Challenges and Certification Questions
Authors:
Wesley Tann,
Yuancheng Liu,
Jun Heng Sim,
Choon Meng Seah,
Ee-Chien Chang
Abstract:
The assessment of cybersecurity Capture-The-Flag (CTF) exercises involves participants finding text strings or ``flags'' by exploiting system vulnerabilities. Large Language Models (LLMs) are natural-language models trained on vast amounts of words to understand and generate text; they can perform well on many CTF challenges. Such LLMs are freely available to students. In the context of CTF exercises in the classroom, this raises concerns about academic integrity. Educators must understand LLMs' capabilities to modify their teaching to accommodate generative AI assistance. This research investigates the effectiveness of LLMs, particularly in the realm of CTF challenges and questions. Here we evaluate three popular LLMs, OpenAI ChatGPT, Google Bard, and Microsoft Bing. First, we assess the LLMs' question-answering performance on five Cisco certifications with varying difficulty levels. Next, we qualitatively study the LLMs' abilities in solving CTF challenges to understand their limitations. We report on the experience of using the LLMs for seven test cases in all five types of CTF challenges. In addition, we demonstrate how jailbreak prompts can bypass and break LLMs' ethical safeguards. The paper concludes by discussing LLMs' impact on CTF exercises and its implications.
Submitted 20 August, 2023;
originally announced August 2023.
-
Unprecedented early flux excess in the hybrid 02es-like type Ia supernova 2022ywc indicates interaction with circumstellar material
Authors:
Shubham Srivastav,
T. Moore,
M. Nicholl,
M. R. Magee,
S. J. Smartt,
M. D. Fulton,
S. A. Sim,
J. M. Pollin,
L. Galbany,
C. Inserra,
A. Kozyreva,
Takashi J. Moriya,
F. P. Callan,
X. Sheng,
K. W. Smith,
J. S. Sommer,
J. P. Anderson,
M. Deckers,
M. Gromadzki,
T. E. Müller-Bravo,
G. Pignata,
A. Rest,
D. R. Young
Abstract:
We present optical photometric and spectroscopic observations of the 02es-like type Ia supernova (SN) 2022ywc. The transient occurred in the outskirts of an elliptical host galaxy and showed a striking double-peaked light curve with an early excess feature detected in the ATLAS orange and cyan bands. The early excess is remarkably luminous with an absolute magnitude $\sim -19$, comparable in luminosity to the subsequent radioactively-driven second peak. The spectra resemble the hybrid 02es-like SN 2016jhr, that is considered to be a helium shell detonation candidate. We investigate different physical mechanisms that could power such a prominent early excess and rule out massive helium shell detonation, surface $^{56}$Ni distribution and ejecta-companion interaction. We conclude that SN ejecta interacting with circumstellar material (CSM) is the most viable scenario. Semi-analytical modelling with MOSFiT indicates that SN ejecta interacting with $\sim 0.05\,$M$_{\odot}$ of CSM at a distance of $\sim 10^{14}$ cm can explain the extraordinary light curve. A double-degenerate scenario may explain the origin of the CSM, either by tidally-stripped material from the secondary white dwarf, or disk-originated matter launched along polar axes following the disruption and accretion of the secondary white dwarf. A non-spherical CSM configuration could suggest that a small fraction of 02es-like events viewed along a favourable line of sight may be expected to display a very conspicuous early excess like SN 2022ywc.
Submitted 25 September, 2023; v1 submitted 11 August, 2023;
originally announced August 2023.
-
Project Florida: Federated Learning Made Easy
Authors:
Daniel Madrigal Diaz,
Andre Manoel,
Jialei Chen,
Nalin Singal,
Robert Sim
Abstract:
We present Project Florida, a system architecture and software development kit (SDK) enabling deployment of large-scale Federated Learning (FL) solutions across a heterogeneous device ecosystem. Federated learning is an approach to machine learning based on a strong data sovereignty principle, i.e., that privacy and security of data is best enabled by storing it at its origin, whether on end-user devices or in segregated cloud storage silos. Federated learning enables model training across devices and silos while the training data remains within its security boundary, by distributing a model snapshot to a client running inside the boundary, running client code to update the model, and then aggregating updated snapshots across many clients in a central orchestrator. Deploying a FL solution requires implementation of complex privacy and security mechanisms as well as scalable orchestration infrastructure. Scale and performance is a paramount concern, as the model training process benefits from full participation of many client devices, which may have a wide variety of performance characteristics. Project Florida aims to simplify the task of deploying cross-device FL solutions by providing cloud-hosted infrastructure and accompanying task management interfaces, as well as a multi-platform SDK supporting most major programming languages including C++, Java, and Python, enabling FL training across a wide range of operating system (OS) and hardware specifications. The architecture decouples service management from the FL workflow, enabling a cloud service provider to deliver FL-as-a-service (FLaaS) to ML engineers and application developers. We present an overview of Florida, including a description of the architecture, sample code, and illustrative experiments demonstrating system capabilities.
Submitted 21 July, 2023;
originally announced July 2023.
-
Control- & Task-Aware Optimal Design of Actuation System for Legged Robots using Binary Integer Linear Programming
Authors:
Youngwoo Sim,
Guillermo Colin,
Joao Ramos
Abstract:
Athletic robots demand a whole-body actuation system design that utilizes motors up to the boundaries of their performance. However, creating such robots poses challenges of integrating design principles and reasoning of practical design choices. This paper presents a design framework that guides designers to find optimal design choices to create an actuation system that can rapidly generate torques and velocities required to achieve a given set of tasks, by minimizing inertia and leveraging cooperation between actuators. The framework serves as an interactive tool for designers who are in charge of providing design rules and candidate components such as motors, reduction mechanism, and coupling mechanisms between actuators and joints. A binary integer linear optimization explores design combinations to find optimal components that can achieve a set of tasks. The framework is demonstrated with 200 optimal design studies of a biped with 5-degree-of-freedom (DoF) legs, focusing on the effect of achieving multiple tasks (walking, lifting), constraining the mass budget of all motors in the system and the use of coupling mechanisms. The result provides a comprehensive view of how design choices and rules affect reflected inertia, copper loss of motors, and force capability of optimal actuation systems.
Submitted 21 July, 2023;
originally announced July 2023.
-
Contrastive Graph Pooling for Explainable Classification of Brain Networks
Authors:
Jiaxing Xu,
Qingtian Bian,
Xinhang Li,
Aihu Zhang,
Yiping Ke,
Miao Qiao,
Wei Zhang,
Wei Khang Jeremy Sim,
Balázs Gulyás
Abstract:
Functional magnetic resonance imaging (fMRI) is a commonly used technique to measure neural activation. Its application has been particularly important in identifying underlying neurodegenerative conditions such as Parkinson's, Alzheimer's, and Autism. Recent analysis of fMRI data models the brain as a graph and extracts features by graph neural networks (GNNs). However, the unique characteristics of fMRI data require a special design of GNN. Tailoring GNN to generate effective and domain-explainable features remains challenging. In this paper, we propose a contrastive dual-attention block and a differentiable graph pooling method called ContrastPool to better utilize GNN for brain networks, meeting fMRI-specific requirements. We apply our method to 5 resting-state fMRI brain network datasets of 3 diseases and demonstrate its superiority over state-of-the-art baselines. Our case study confirms that the patterns extracted by our method match the domain knowledge in neuroscience literature, and disclose direct and interesting insights. Our contributions underscore the potential of ContrastPool for advancing the understanding of brain networks and neurodegenerative conditions. The source code is available at https://github.com/AngusMonroe/ContrastPool.
Submitted 12 April, 2024; v1 submitted 7 July, 2023;
originally announced July 2023.
-
Effectiveness and predictability of in-network storage cache for scientific workflows
Authors:
Caitlin Sim,
Kesheng Wu,
Alex Sim,
Inder Monga,
Chin Guok,
Frank Wurthwein,
Diego Davila,
Harvey Newman,
Justas Balcas
Abstract:
Large scientific collaborations often have multiple scientists accessing the same set of files while doing different analyses, which creates repeated accesses to the large amounts of shared data located far away. These data accesses have long latency due to distance and occupy the limited bandwidth available over the wide-area network. To reduce the wide-area network traffic and the data access latency, regional data storage caches have been installed as a new networking service. To study the effectiveness of such a cache system in scientific applications, we examine the Southern California Petabyte Scale Cache for a high-energy physics experiment. By examining about 3TB of operational logs, we show that this cache removed 67.6% of file requests from the wide-area network and reduced the traffic volume on the wide-area network by 12.3TB (or 35.4%) on an average day. The reduction in the traffic volume (35.4%) is less than the reduction in file counts (67.6%) because the larger files are less likely to be reused. Due to this difference in data access patterns, the cache system has implemented a policy to avoid evicting smaller files when processing larger files. We also build a machine learning model to study the predictability of the cache behavior. Tests show that this model is able to accurately predict the cache accesses, cache misses, and network throughput, making the model useful for future studies on resource provisioning and planning.
Submitted 20 July, 2023;
originally announced July 2023.
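The gap between the file-count reduction and the traffic-volume reduction reported in the abstract above can be illustrated with a small sketch. The function name `cache_reduction`, the record layout, and the synthetic log are hypothetical and are not taken from the paper's data; the sketch only shows how the two fractions diverge when large files are rarely served from cache.

```python
def cache_reduction(records):
    """Compute cache effectiveness from access records.

    records: iterable of (size, served_from_cache) pairs, where size is in
    any consistent unit and served_from_cache is a bool.
    Returns (fraction of requests served from cache,
             fraction of bytes served from cache).
    """
    total_files = hit_files = 0
    total_bytes = hit_bytes = 0
    for size, hit in records:
        total_files += 1
        total_bytes += size
        if hit:
            hit_files += 1
            hit_bytes += size
    if total_files == 0 or total_bytes == 0:
        raise ValueError("need at least one record with non-zero size")
    return hit_files / total_files, hit_bytes / total_bytes


if __name__ == "__main__":
    # Synthetic log (illustrative only): three small files are reused from
    # the cache, one large file has to cross the wide-area network.
    log = [(1, True), (1, True), (1, True), (10, False)]
    file_fraction, byte_fraction = cache_reduction(log)
    print(file_fraction, byte_fraction)
```

With a log shaped like this, most requests are cache hits while most bytes still travel over the wide-area network, which is the same qualitative pattern the abstract attributes to large files being reused less often.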
-
DC-DFT for Open Shells: How to Deal with Spin Contamination
Authors:
Hayoung Yu,
Suhwan Song,
Seungsoo Nam,
Kieron Burke,
Eunji Sim
Abstract:
Density functional theory (DFT) is widely used to predict chemical properties, but its accuracy is limited by functional approximations and their approximate self-consistent densities. Density-corrected DFT (DC-DFT) is the study of the errors due to densities and Hartree-Fock DFT (HF-DFT) uses HF densities to improve energetics. With increasing use of HF-DFT, the question of how to address strong spin contamination in the HF calculation becomes increasingly important. We compare two different open-shell HF densities across 13 different DFT functionals and two DC-DFT methods. For significant spin contamination, ROHF densities outperform UHF densities by as much as a factor of 3, depending on the energy functional, and ROHF-DFT improves over self-consistent DFT for most of the tested functionals. We refine the DC(HF)-DFT algorithm, recommending ROHF-DFT in cases of severe spin contamination.
Submitted 20 July, 2023;
originally announced July 2023.
-
Whole-Body Dynamic Telelocomotion: A Step-to-Step Dynamics Approach to Human Walking Reference Generation
Authors:
Guillermo Colin,
Joseph Byrnes,
Youngwoo Sim,
Patrick Wensing,
Joao Ramos
Abstract:
Teleoperated humanoid robots hold significant potential as physical avatars for humans in hazardous and inaccessible environments, with the goal of channeling human intelligence and sensorimotor skills through these robotic counterparts. Precise coordination between humans and robots is crucial for accomplishing whole-body behaviors involving locomotion and manipulation. To progress successfully, dynamic synchronization between humans and humanoid robots must be achieved. This work enhances advancements in whole-body dynamic telelocomotion, addressing challenges in robustness. By embedding the hybrid and underactuated nature of bipedal walking into a virtual human walking interface, we achieve dynamically consistent walking gait generation. Additionally, we integrate a reactive robot controller into a whole-body dynamic telelocomotion framework, thus allowing the realization of telelocomotion behaviors on the full-body dynamics of a bipedal robot. Real-time telelocomotion simulation experiments validate the effectiveness of our methods, demonstrating that a trained human pilot can dynamically synchronize with a simulated bipedal robot, achieving sustained locomotion, controlling walking speeds within the range of 0.0 m/s to 0.3 m/s, and enabling backward walking for distances of up to 2.0 m. This research contributes to advancing teleoperated humanoid robots and paves the way for future developments in synchronized locomotion between humans and bipedal robots.
Submitted 21 July, 2023; v1 submitted 19 July, 2023;
originally announced July 2023.
-
Helium as a signature of the double detonation in Type Ia supernovae
Authors:
Christine E. Collins,
Stuart A. Sim,
Luke. J. Shingles,
Sabrina Gronow,
Friedrich K. Roepke,
Ruediger Pakmor,
Ivo R. Seitenzahl,
Markus Kromer
Abstract:
The double detonation is a widely discussed mechanism to explain Type Ia supernovae from explosions of sub-Chandrasekhar mass white dwarfs. In this scenario, a helium detonation is ignited in a surface helium shell on a carbon/oxygen white dwarf, which leads to a secondary carbon detonation. Explosion simulations predict high abundances of unburnt helium in the ejecta; however, radiative transfer simulations have not been able to fully address whether helium spectral features would form. This is because helium cannot be sufficiently excited to form spectral features by thermal processes, but can be excited by collisions with non-thermal electrons, which most studies have neglected. We carry out a full non-local thermodynamic equilibrium (non-LTE) radiative transfer simulation for an instance of a double detonation explosion model, and include a non-thermal treatment of fast electrons. We find a clear He I λ 10830 feature which is strongest in the first few days after explosion and becomes weaker with time. Initially this feature is blended with the Mg II λ 10927 feature but over time separates to form a secondary feature to the blue wing of the Mg II λ 10927 feature. We compare our simulation to observations of iPTF13ebh, which showed a similar feature to the blue wing of the Mg II λ 10927 feature, previously identified as C I. Our simulation shows a good match to the evolution of this feature and we identify it as high velocity He I λ 10830. This suggests that He I λ 10830 could be a signature of the double detonation scenario.
Submitted 17 July, 2023;
originally announced July 2023.
-
BiRP: Learning Robot Generalized Bimanual Coordination using Relative Parameterization Method on Human Demonstration
Authors:
Junjia Liu,
Hengyi Sim,
Chenzui Li,
Fei Chen
Abstract:
Human bimanual manipulation can perform more complex tasks than a simple combination of two single arms, which is credited to the spatio-temporal coordination between the arms. However, the description of bimanual coordination is still an open topic in robotics. This makes it difficult to give an explainable coordination paradigm, let alone applied to robotics. In this work, we divide the main bim…
▽ More
Human bimanual manipulation can perform more complex tasks than a simple combination of two single arms, which is credited to the spatio-temporal coordination between the arms. However, the description of bimanual coordination is still an open topic in robotics. This makes it difficult to give an explainable coordination paradigm, let alone applied to robotics. In this work, we divide the main bimanual tasks in human daily activities into two types: leader-follower and synergistic coordination. Then we propose a relative parameterization method to learn these types of coordination from human demonstration. It represents coordination as Gaussian mixture models from bimanual demonstration to describe the change in the importance of coordination throughout the motions by probability. The learned coordinated representation can be generalized to new task parameters while ensuring spatio-temporal coordination. We demonstrate the method using synthetic motions and human demonstration data and deploy it to a humanoid robot to perform a generalized bimanual coordination motion. We believe that this easy-to-use bimanual learning from demonstration (LfD) method has the potential to be used as a data augmentation plugin for robot large manipulation model training. The corresponding codes are open-sourced in https://github.com/Skylark0924/Rofunc.
Submitted 12 July, 2023;
originally announced July 2023.
-
Explicit Cocycle of the Dedekind-Rademacher Cohomology Class and the Darmon-Dasgupta Measures
Authors:
Jae Hyung Sim
Abstract:
The work of Darmon, Pozzi, and Vonk has recently shown that the RM-values of the Dedekind-Rademacher cocycle $J_{DR}$ are Gross-Stark units up to a controlled torsion. In the aforementioned work, it is remarked that the measure-valued cohomology class $μ_{DR}$ which underlies $J_{DR}$ is the level 1 incarnation of earlier constructions by Darmon and Dasgupta. In this paper, we make this relationship explicit by computing a concrete cocycle representative of $μ_{DR}$ by tracing the construction of the cohomology class and comparing periods of weight 2 Eisenstein series. While maintaining a global perspective in our computations, we configure the appropriate method of smoothing cocycles which exactly yields the $p$-adic measures of Darmon and Dasgupta when applied to $μ_{DR}$. These methods will also explain the optional degree zero condition imposed in Darmon and Dasgupta's work which was remarked upon in works of Fleischer and Liu as well as Dasgupta and Kakde.
Submitted 22 March, 2024; v1 submitted 1 July, 2023;
originally announced July 2023.
-
Self-consistent 3D radiative transfer for kilonovae: directional spectra from merger simulations
Authors:
Luke J. Shingles,
Christine E. Collins,
Vimal Vijayan,
Andreas Flörs,
Oliver Just,
Gerrit Leck,
Zewei Xiong,
Andreas Bauswein,
Gabriel Martínez-Pinedo,
Stuart A. Sim
Abstract:
We present three-dimensional radiative transfer calculations for the ejecta from a neutron star merger that include line-by-line opacities for tens of millions of bound-bound transitions, composition from an r-process nuclear network, and time-dependent thermalization of decay products from individual $α$ and $β^-$ decay reactions. In contrast to expansion opacities and other wavelength-binned treatments, a line-by-line treatment enables us to include fluorescence effects and associate spectral features with the emitting and absorbing lines of individual elements. We find variations in the synthetic observables with both the polar and azimuthal viewing angles. The spectra exhibit blended features with strong interactions by Ce III, Sr II, Y II, and Zr II that vary with time and viewing direction. We demonstrate the importance of wavelength calibration of atomic data using a model with calibrated Sr, Y, and Zr data, and find major differences in the resulting spectra, including a better agreement with AT2017gfo. The synthetic spectra for near-polar inclination show a feature at around 8000 A, similar to AT2017gfo. However, they evolve on a more rapid timescale, likely due to the low ejecta mass (0.005 M$_\odot$) as we take into account only the early ejecta. The comparatively featureless spectra for equatorial observers give a tentative prediction that future observations of edge-on kilonovae will appear substantially different from AT2017gfo. We also show that 1D models obtained by spherically averaging the 3D ejecta lead to dramatically different direction-integrated luminosities and spectra compared to full 3D calculations.
Submitted 1 September, 2023; v1 submitted 30 June, 2023;
originally announced June 2023.
-
Modelling the spectra of the kilonova AT2017gfo -- II: Beyond the photospheric epochs
Authors:
J. H. Gillanders,
S. A. Sim,
S. J. Smartt,
S. Goriely,
A. Bauswein
Abstract:
Binary neutron star mergers are the first confirmed site of element nucleosynthesis by the rapid neutron-capture process (r-process). The kilonova AT2017gfo is the only electromagnetic counterpart of a neutron star merger spectroscopically observed. We analyse the entire spectral sequence of AT2017gfo (from merger to +10.4 days) and identify seven emission-like features. We confirm that the prominent 1.08 um feature can be explained by the Sr II near-infrared triplet evolving from a P-Cygni profile through to pure emission. We calculate the expected strength of the [Sr II] doublet and show that its absence requires highly clumped ejecta. Near-infrared features at 1.58 and 2.07 um emerge after three days and become more prominent as the spectra evolve. We model these as optically thick P-Cygni profiles and alternatively as pure emission features (with FWHM = 35600 +/- 6600 km/s), and favour the latter interpretation. The profile of the strong 2.07 um emission feature is best reproduced with two lines, centred at 2.059 and 2.135 um. We search for candidate ions for all prominent features in the spectra. Strong, permitted transitions of La III, Ce III, Gd III, Ra II and Ac I are plausible candidates for the emission features. If any of these features are produced by intrinsically weak, forbidden transitions, we highlight candidate ions spanning the three r-process peaks. The second r-process peak elements Te and I have plausible matches to multiple features. We highlight the need for more detailed and quantitative atomic line transition data.
Submitted 27 November, 2023; v1 submitted 26 June, 2023;
originally announced June 2023.
-
Hierarchical entanglement shells of multichannel Kondo clouds
Authors:
Jeongmin Shim,
Donghoon Kim,
H. -S. Sim
Abstract:
Impurities or boundaries often impose nontrivial boundary conditions on a gapless bulk, resulting in distinct boundary universality classes for a given bulk, phase transitions, and non-Fermi liquids in diverse systems. The underlying boundary states, however, remain largely unexplored. This is related to the fundamental issue of how a Kondo cloud spatially forms to screen a magnetic impurity in a metal. Here we predict the quantum-coherent spatial and energy structure of multichannel Kondo clouds, representative boundary states involving competing non-Fermi liquids, by studying quantum entanglement between the impurity and the channels. Entanglement shells of distinct non-Fermi liquids coexist in the structure, depending on the channels. As temperature increases, the shells become suppressed one by one from the outside, and the remaining outermost shell determines the thermal phase of each channel. Detection of the entanglement shells is experimentally feasible. Our findings suggest a guide to studying other boundary states and boundary-bulk entanglement.
Submitted 18 June, 2023;
originally announced June 2023.
-
FedJETs: Efficient Just-In-Time Personalization with Federated Mixture of Experts
Authors:
Chen Dun,
Mirian Hipolito Garcia,
Guoqing Zheng,
Ahmed Hassan Awadallah,
Robert Sim,
Anastasios Kyrillidis,
Dimitrios Dimitriadis
Abstract:
One of the goals in Federated Learning (FL) is to create personalized models that can adapt to the context of each participating client, while utilizing knowledge from a shared global model. Yet, often, personalization requires a fine-tuning step using clients' labeled data in order to achieve good performance. This may not be feasible in scenarios where incoming clients are fresh and/or have privacy concerns. It, then, remains open how one can achieve just-in-time personalization in these scenarios. We propose FedJETs, a novel solution by using a Mixture-of-Experts (MoE) framework within a FL setup. Our method leverages the diversity of the clients to train specialized experts on different subsets of classes, and a gating function to route the input to the most relevant expert(s). Our gating function harnesses the knowledge of a pretrained model common expert to enhance its routing decisions on-the-fly. As a highlight, our approach can improve accuracy up to 18\% in state of the art FL settings, while maintaining competitive zero-shot performance. In practice, our method can handle non-homogeneous data distributions, scale more efficiently, and improve the state-of-the-art performance on common FL benchmarks.
Submitted 4 October, 2023; v1 submitted 14 June, 2023;
originally announced June 2023.
-
Edit Distance based RL for RNNT decoding
Authors:
Dongseong Hwang,
Changwan Ryu,
Khe Chai Sim
Abstract:
RNN-T is currently considered the industry standard in ASR due to its exceptional WERs in various benchmark tests and its ability to support seamless streaming and longform transcription. However, its biggest drawback lies in the significant discrepancy between its training and inference objectives. During training, RNN-T maximizes all alignment probabilities by teacher forcing, while during inference, it uses beam search which may not necessarily find the maximum probable alignment. Additionally, RNN-T's inability to experience mistakes during teacher forcing training makes it more problematic when a mistake occurs in inference. To address this issue, this paper proposes a Reinforcement Learning method that minimizes the gap between training and inference time. Our Edit Distance based RL (EDRL) approach computes rewards based on the edit distance, and trains the network at every action level. The proposed approach yielded SoTA WERs on LibriSpeech for the 600M Conformer RNN-T model.
Submitted 14 July, 2023; v1 submitted 31 May, 2023;
originally announced June 2023.
-
Learning Off-Road Terrain Traversability with Self-Supervisions Only
Authors:
Junwon Seo,
Sungdae Sim,
Inwook Shim
Abstract:
Estimating the traversability of terrain should be reliable and accurate in diverse conditions for autonomous driving in off-road environments. However, learning-based approaches often yield unreliable results when confronted with unfamiliar contexts, and it is challenging to obtain manual annotations frequently for new circumstances. In this paper, we introduce a method for learning traversability from images that utilizes only self-supervision and no manual labels, enabling it to easily learn traversability in new circumstances. To this end, we first generate self-supervised traversability labels from past driving trajectories by labeling regions traversed by the vehicle as highly traversable. Using the self-supervised labels, we then train a neural network that identifies terrains that are safe to traverse from an image using a one-class classification algorithm. Additionally, we supplement the limitations of self-supervised labels by incorporating methods of self-supervised learning of visual representations. To conduct a comprehensive evaluation, we collect data in a variety of driving environments and perceptual conditions and show that our method produces reliable estimations in various environments. In addition, the experimental results validate that our method outperforms other self-supervised traversability estimation methods and achieves comparable performances with supervised learning methods trained on manually labeled data.
Submitted 30 May, 2023;
originally announced May 2023.
-
Synthetic Light Curves and Spectra from a Self-Consistent 2D Simulation of an Ultra-strippped Supernova
Authors:
Thomas Maunder,
Bernhard Müller,
Fionntan Callan,
Stuart Sim,
Alexander Heger
Abstract:
Spectroscopy is an important tool for providing insights into the structure of core-collapse supernova explosions. We use the Monte Carlo radiative transfer code ARTIS to compute synthetic spectra and light curves based on a two-dimensional explosion model of an ultra-stripped supernova. These calculations are designed both to identify observable fingerprints of ultra-stripped supernovae and as a proof-of-principle for using synthetic spectroscopy to constrain the nature of stripped-envelope supernovae more broadly. We predict very characteristic spectral and photometric features for our ultra-stripped explosion model, but find that these do not match observed ultra-stripped supernova candidates like SN 2005ek. With a peak bolometric luminosity of $6.8\times10^{41}\,\mathrm{erg}\,\mathrm{s}^{-1}$, a peak magnitude of $-15.9\,\mathrm{mag}$ in R-band, and $Δm_{15,\mathrm{R}}=3.50$, the model is even fainter and evolves even faster than SN 2005ek as the closest possible analogue in photometric properties. The predicted spectra are extremely unusual. The most prominent features are Mg II lines at 2,800 Angstrom and 4,500 Angstrom and the infrared Ca triplet at late times. The Mg lines are sensitive to the multi-dimensional structure of the model and are viewing-angle dependent. They disappear due to line blanketing by Fe group elements in a spherically averaged model with additional microscopic mixing. In future studies, multi-D radiative transfer calculations need to be applied to a broader range of models to elucidate the nature of observed Type Ib/c supernovae.
Submitted 27 May, 2023;
originally announced May 2023.
-
Interactive Natural Language Processing
Authors:
Zekun Wang,
Ge Zhang,
Kexin Yang,
Ning Shi,
Wangchunshu Zhou,
Shaochun Hao,
Guangzheng Xiong,
Yizhi Li,
Mong Yuan Sim,
Xiuying Chen,
Qingqing Zhu,
Zhenzhu Yang,
Adam Nik,
Qi Liu,
Chenghua Lin,
Shi Wang,
Ruibo Liu,
Wenhu Chen,
Ke Xu,
Dayiheng Liu,
Yike Guo,
Jie Fu
Abstract:
Interactive Natural Language Processing (iNLP) has emerged as a novel paradigm within the field of NLP, aimed at addressing limitations in existing frameworks while aligning with the ultimate goals of artificial intelligence. This paradigm considers language models as agents capable of observing, acting, and receiving feedback iteratively from external entities. Specifically, language models in this context can: (1) interact with humans for better understanding and addressing user needs, personalizing responses, aligning with human values, and improving the overall user experience; (2) interact with knowledge bases for enriching language representations with factual knowledge, enhancing the contextual relevance of responses, and dynamically leveraging external information to generate more accurate and informed responses; (3) interact with models and tools for effectively decomposing and addressing complex tasks, leveraging specialized expertise for specific subtasks, and fostering the simulation of social behaviors; and (4) interact with environments for learning grounded representations of language, and effectively tackling embodied tasks such as reasoning, planning, and decision-making in response to environmental observations. This paper offers a comprehensive survey of iNLP, starting by proposing a unified definition and framework of the concept. We then provide a systematic classification of iNLP, dissecting its various components, including interactive objects, interaction interfaces, and interaction methods. We proceed to delve into the evaluation methodologies used in the field, explore its diverse applications, scrutinize its ethical and safety issues, and discuss prospective research directions. This survey serves as an entry point for researchers who are interested in this rapidly evolving area and offers a broad view of the current landscape and future trajectory of iNLP.
Submitted 22 May, 2023;
originally announced May 2023.
-
Debiased Automatic Speech Recognition for Dysarthric Speech via Sample Reweighting with Sample Affinity Test
Authors:
Eungbeom Kim,
Yunkee Chae,
Jaeheon Sim,
Kyogu Lee
Abstract:
Automatic speech recognition systems based on deep learning are mainly trained under empirical risk minimization (ERM). Since ERM utilizes the averaged performance on the data samples regardless of a group such as healthy or dysarthric speakers, ASR systems are unaware of the performance disparities across the groups. This results in biased ASR systems whose performance differences among groups are severe. In this study, we aim to improve the ASR system in terms of group robustness for dysarthric speakers. To achieve our goal, we present a novel approach, sample reweighting with sample affinity test (Re-SAT). Re-SAT systematically measures the debiasing helpfulness of the given data sample and then mitigates the bias by debiasing helpfulness-based sample reweighting. Experimental results demonstrate that Re-SAT contributes to improved ASR performance on dysarthric speech without performance degradation on healthy speech.
Submitted 27 June, 2023; v1 submitted 22 May, 2023;
originally announced May 2023.
-
Intra-atomic Hund's exchange interaction determines spin states and energetics of Li-rich layered sulfides for battery applications
Authors:
Jae-Hoon Sim,
D. D. Sarma,
Jean-Marie Tarascon,
Silke Biermann
Abstract:
Motivated by experimental suggestions of anionic redox processes helping to design higher energy lithium-ion battery cathode materials, we investigate this effect using first-principles electronic structure calculations for Li-rich layered sulfides. We identify the determination of the energetic contribution of intra-atomic Hund's exchange coupling as a major obstacle to a reliable theoretical description. We overcome this challenge by developing a particularly efficient flavor of charge-self-consistent combined density functional + dynamical mean-field theory (DFT+DMFT) calculations. Our scheme allows us to describe the spin ground states of the transition metal d shell, the electronic structure of the materials, and its energetics. As a result of the high-spin to low-spin transition, the average intercalation voltage shows intriguing non-monotonic behavior. We rationalize these findings by an analysis of the fluctuations of spin and charge degrees of freedom. Our work demonstrates the relevance of most recent insights into correlated electron materials for the physics of functional materials such as Li-ion battery compounds.
Submitted 15 May, 2023;
originally announced May 2023.
-
Fault-tolerant quantum algorithm for symmetry-adapted perturbation theory
Authors:
Cristian L. Cortes,
Matthias Loipersberger,
Robert M. Parrish,
Sam Morley-Short,
William Pol,
Sukin Sim,
Mark Steudtner,
Christofer S. Tautermann,
Matthias Degroote,
Nikolaj Moll,
Raffaele Santagati,
Michael Streif
Abstract:
The efficient computation of observables beyond the total energy is a key challenge and opportunity for fault-tolerant quantum computing approaches in quantum chemistry. Here we consider the symmetry-adapted perturbation theory (SAPT) components of the interaction energy as a prototypical example of such an observable. We provide a guide for calculating this observable on a fault-tolerant quantum computer while optimizing the required computational resources. Specifically, we present a quantum algorithm that estimates interaction energies at the first-order SAPT level with a Heisenberg-limited scaling. To this end, we exploit a high-order tensor factorization and block encoding technique that efficiently represents each SAPT observable. To quantify the computational cost of our methodology, we provide resource estimates in terms of the required number of logical qubits and Toffoli gates to execute our algorithm for a range of benchmark molecules, also taking into account the cost of the eigenstate preparation and the cost of block encoding the SAPT observables. Finally, we perform the resource estimation for a heme and artemisinin complex as a representative large-scale system encountered in drug design, highlighting our algorithm's performance in this new benchmark study and discussing possible bottlenecks that may be improved in future work.
Submitted 15 May, 2023; v1 submitted 11 May, 2023;
originally announced May 2023.
-
Undercover Deepfakes: Detecting Fake Segments in Videos
Authors:
Sanjay Saha,
Rashindrie Perera,
Sachith Seneviratne,
Tamasha Malepathirana,
Sanka Rasnayaka,
Deshani Geethika,
Terence Sim,
Saman Halgamuge
Abstract:
The recent renaissance in generative models, driven primarily by the advent of diffusion models and iterative improvement in GAN methods, has enabled many creative applications. However, each advancement is also accompanied by a rise in the potential for misuse. In the arena of the deepfake generation, this is a key societal issue. In particular, the ability to modify segments of videos using such generative techniques creates a new paradigm of deepfakes which are mostly real videos altered slightly to distort the truth. This paradigm has been under-explored by the current deepfake detection methods in the academic literature. In this paper, we present a deepfake detection method that can address this issue by performing deepfake prediction at the frame and video levels. To facilitate testing our method, we prepared a new benchmark dataset where videos have both real and fake frame sequences with very subtle transitions. We provide a benchmark on the proposed dataset with our detection method which utilizes the Vision Transformer based on Scaling and Shifting to learn spatial features, and a Timeseries Transformer to learn temporal features of the videos to help facilitate the interpretation of possible deepfakes. Extensive experiments on a variety of deepfake generation methods show excellent results by the proposed method on temporal segmentation and classical video-level predictions as well. In particular, the paradigm we address will form a powerful tool for the moderation of deepfakes, where human oversight can be better targeted to the parts of videos suspected of being deepfakes. All experiments can be reproduced at: github.com/rgb91/temporal-deepfake-segmentation.
Submitted 24 August, 2023; v1 submitted 11 May, 2023;
originally announced May 2023.
-
Shedding Light on Microscopic Details: 2D Spectroscopy of 1D Quantum Ising Magnets
Authors:
GiBaik Sim,
Frank Pollmann,
Johannes Knolle
Abstract:
The identification of microscopic models describing the low-energy properties of correlated materials has been a central goal of spectroscopic measurements. We demonstrate how 2D non-linear spectroscopy can be used to distinguish effective spin models whose linear responses show similar behavior. Motivated by recent experiments on the quasi-1D Ising magnet CoNb$_2$O$_6$, we focus on two proposed models, the ferromagnetic twisted Kitaev chain with bond dependent interactions and the transverse field Ising model. The dynamical spin structure factor probed in linear response displays similar broad spectra for both models from their fermionic domain wall excitations. In sharp contrast, the 2D non-linear spectra of the two models show clear qualitative differences: those of the twisted Kitaev model contain off-diagonal peaks originating from the bond dependent interactions and transitions between different fermion bands absent in the transverse field Ising model. We discuss the different signatures of spin fractionalization in integrable and non-integrable regimes of the models and their connection to experiments.
Submitted 8 May, 2023;
originally announced May 2023.
-
Analyzing Transatlantic Network Traffic over Scientific Data Caches
Authors:
Z. Deng,
A. Sim,
K. Wu,
C. Guok,
D. Hazen,
I. Monga,
F. Andrijauskas,
F. Wuerthwein,
D. Weitzel
Abstract:
Large scientific collaborations often share huge volumes of data around the world. Consequently a significant amount of network bandwidth is needed for data replication and data access. Users in the same region may possibly share resources as well as data, especially when they are working on related topics with similar datasets. In this work, we study the network traffic patterns and resource utilization for scientific data caches connecting European networks to the US. We explore the efficiency of resource utilization, especially for network traffic which consists mostly of transatlantic data transfers, and the potential for having more caching node deployments. Our study shows that these data caches reduced network traffic volume by 97% during the study period. This demonstrates that such caching nodes are effective in reducing wide-area network traffic.
Submitted 17 July, 2023; v1 submitted 1 May, 2023;
originally announced May 2023.
-
Magneto-Mechanical Bilayer Metasurface with Global Area-Preserving Density Tunability for Acoustic Wave Regulation
Authors:
Jay Sim,
Shuai Wu,
Jize Dai,
Ruike Renee Zhao
Abstract:
Metasurfaces have extensive potential in acoustic cloaking, optical scattering, and electromagnetic antenna due to their unprecedented properties and the ability to conform to curved substrates. Active metasurfaces have attracted significant research attention because of their on-demand tunable properties and performances through shape reconfigurations. They normally achieve active properties through internal structural deformations, which often lead to changes in overall dimensions. This also demands the corresponding alterations of the conforming substrate, which could be a significant limitation for their practical applications. To date, achieving area-preserving active metasurfaces with distinct shape reconfigurations remains a prominent challenge. In this paper, we present magneto-mechanical bilayer metasurfaces that demonstrate area density tunability with area-preserving capability. The bilayer metasurfaces primarily consist of two arrays of magnetic soft materials with distinct magnetization distributions. Under an external magnetic field, each layer behaves differently, which allows the metasurface to reconfigure its shape into multiple modes and thus significantly tune its area density without changing its overall dimensions. The area-preserving multimodal shape reconfigurations are further exploited as active acoustic wave regulators to tune bandgaps and wave propagations. The bilayer approach thus provides a new concept to design area-preserving active metasurfaces for broader practical applications.
Submitted 12 April, 2023;
originally announced April 2023.
-
Modernizing Old Photos Using Multiple References via Photorealistic Style Transfer
Authors:
Agus Gunawan,
Soo Ye Kim,
Hyeonjun Sim,
Jae-Ho Lee,
Munchurl Kim
Abstract:
This paper is the first to present old photo modernization using multiple references, performing stylization and enhancement in a unified manner. In order to modernize old photos, we propose a novel multi-reference-based old photo modernization (MROPM) framework consisting of a network MROPM-Net and a novel synthetic data generation scheme. MROPM-Net stylizes old photos using multiple references via photorealistic style transfer (PST) and further enhances the results to produce modern-looking images. Meanwhile, the synthetic data generation scheme trains the network to effectively utilize multiple references to perform modernization. To evaluate the performance, we propose a new benchmark dataset of old photos (CHD) consisting of diverse natural indoor and outdoor scenes. Extensive experiments show that the proposed method outperforms other baselines in performing modernization on real old photos, even though no old photos were used during training. Moreover, our method can appropriately select styles from multiple references for each semantic region in the old photo to further improve the modernization performance.
Submitted 10 April, 2023;
originally announced April 2023.
-
Fault-tolerant quantum computation of molecular observables
Authors:
Mark Steudtner,
Sam Morley-Short,
William Pol,
Sukin Sim,
Cristian L. Cortes,
Matthias Loipersberger,
Robert M. Parrish,
Matthias Degroote,
Nikolaj Moll,
Raffaele Santagati,
Michael Streif
Abstract:
Over the past three decades significant reductions have been made to the cost of estimating ground-state energies of molecular Hamiltonians with quantum computers. However, comparatively little attention has been paid to estimating the expectation values of other observables with respect to said ground states, which is important for many industrial applications. In this work we present a novel expectation value estimation (EVE) quantum algorithm which can be applied to estimate the expectation values of arbitrary observables with respect to any of the system's eigenstates. In particular, we consider two variants of EVE: std-EVE, based on standard quantum phase estimation, and QSP-EVE, which utilizes quantum signal processing (QSP) techniques. We provide rigorous error analysis for both variants and minimize the number of individual phase factors for QSP-EVE. These error analyses enable us to produce constant-factor quantum resource estimates for both std-EVE and QSP-EVE across a variety of molecular systems and observables. For the systems considered, we show that QSP-EVE reduces (Toffoli) gate counts by up to three orders of magnitude and reduces qubit width by up to 25% compared to std-EVE. While estimated resource counts remain far too high for the first generations of fault-tolerant quantum computers, our estimates mark a first of their kind for both the application of expectation value estimation and modern QSP-based techniques.
Submitted 27 October, 2023; v1 submitted 24 March, 2023;
originally announced March 2023.