-
CAAP: Context-Aware Action Planning Prompting to Solve Computer Tasks with Front-End UI Only
Authors:
Junhee Cho,
Jihoon Kim,
Daseul Bae,
**ho Choo,
Youngjune Gwon,
Yeong-Dae Kwon
Abstract:
Software robots have long been deployed in Robotic Process Automation (RPA) to automate mundane and repetitive computer tasks. The advent of Large Language Models (LLMs) with advanced reasoning capabilities has set the stage for these agents to now undertake more complex and even previously unseen tasks. However, the LLM-based automation techniques in recent literature frequently rely on HTML source codes for input, limiting their application to web environments. Moreover, the information contained in HTML codes is often inaccurate or incomplete, making the agent less reliable for practical applications. We propose an LLM-based agent that functions solely on the basis of screenshots for recognizing environments, while leveraging in-context learning to eliminate the need for collecting large datasets of human demonstration. Our strategy, named Context-Aware Action Planning (CAAP) prompting, encourages the agent to meticulously review the context from various angles. Through our proposed methodology, we achieve a success rate of 94.4% on 67 types of MiniWoB++ problems, utilizing only 1.48 demonstrations per problem type. Our method offers the potential for broader applications, especially for tasks that require inter-application coordination on computers or smartphones, showcasing a significant advancement in the field of automation agents. Codes and models are accessible at https://github.com/caap-agent/caap-agent.
Submitted 11 June, 2024;
originally announced June 2024.
-
Optimality conditions at infinity for nonsmooth minimax programming
Authors:
Nguyen Van Tuyen,
Kwan Deok Bae,
Do Sang Kim
Abstract:
This paper is devoted to the study of optimality conditions at infinity in nonsmooth minimax programming problems and applications. By means of the limiting subdifferential and normal cone at infinity, we derive necessary and sufficient optimality conditions of Karush-Kuhn-Tucker type for nonsmooth minimax programming problems with constraints. The obtained results are applied to a nonsmooth vector optimization problem.
Submitted 16 May, 2024;
originally announced May 2024.
-
Topological Floquet engineering of a three-band optical lattice with dual-mode resonant driving
Authors:
Dalmin Bae,
Junyoung Park,
Myeonghyeon Kim,
Haneul Kwak,
Junhwan Kwon,
Yong-il Shin
Abstract:
We present a Floquet framework for controlling topological features of a one-dimensional optical lattice system with dual-mode resonant driving, in which both the amplitude and phase of the lattice potential are modulated simultaneously. We investigate a three-band model consisting of the three lowest orbitals and elucidate the formation of a cross-linked two-leg ladder through an indirect interband coupling via an off-resonant band. We numerically demonstrate the emergence of topologically nontrivial bands within the driven system, and a topological charge pumping phenomenon with cyclic parameter changes in the dual-mode resonant driving. Finally, we show that the band topology in the driven three-band system is protected by parity-time reversal symmetry.
Submitted 16 May, 2024;
originally announced May 2024.
-
Monitoring AGNs with H$β$ Asymmetry. IV. First Reverberation Mapping Results of 14 AGNs
Authors:
T. E. Zastrocky,
Michael S. Brotherton,
Pu Du,
Jacob N. McLane,
Kianna A. Olson,
D. A. Dale,
H. A. Kobulnicky,
Jaya Maithil,
My L. Nguyen,
William T. Chick,
David H. Kasper,
Derek Hand,
C. Adelman,
Z. Carter,
G. Murphree,
M. Oeur,
T. Roth,
S. Schonsberg,
M. J. Caradonna,
J. Favro,
A. J. Ferguson,
I. M. Gonzalez,
L. M. Hadding,
H. D. Hagler,
C. J. Rogers
, et al. (19 additional authors not shown)
Abstract:
We report first-time reverberation mapping results for 14 AGNs from the ongoing Monitoring AGNs with H$β$ Asymmetry campaign (MAHA). These results utilize optical spectra obtained with the Long Slit Spectrograph on the Wyoming Infrared 2.3m Telescope between 2017 November and 2023 May. MAHA combines long-duration monitoring with high cadence. We report results from multiple observing seasons for 9 of the 14 objects. These results include H$β$ time lags, supermassive black hole masses, and velocity-resolved time lags. The velocity-resolved lags allow us to investigate the kinematics of the broad-line region.
Submitted 10 April, 2024;
originally announced April 2024.
-
Large language models surpass human experts in predicting neuroscience results
Authors:
Xiaoliang Luo,
Akilles Rechardt,
Guangzhi Sun,
Kevin K. Nejad,
Felipe Yáñez,
Bati Yilmaz,
Kangjoo Lee,
Alexandra O. Cohen,
Valentina Borghesani,
Anton Pashkov,
Daniele Marinazzo,
Jonathan Nicholas,
Alessandro Salatiello,
Ilia Sucholutsky,
Pasquale Minervini,
Sepehr Razavi,
Roberta Rocca,
Elkhan Yusifov,
Tereza Okalova,
Nianlong Gu,
Martin Ferianc,
Mikail Khona,
Kaustubh R. Patil,
Pui-Shee Lee,
Rui Mata
, et al. (14 additional authors not shown)
Abstract:
Scientific discoveries often hinge on synthesizing decades of research, a task that potentially outstrips human information processing capacities. Large language models (LLMs) offer a solution. LLMs trained on the vast scientific literature could potentially integrate noisy yet interrelated findings to forecast novel results better than human experts. To evaluate this possibility, we created BrainBench, a forward-looking benchmark for predicting neuroscience results. We find that LLMs surpass experts in predicting experimental outcomes. BrainGPT, an LLM we tuned on the neuroscience literature, performed better yet. Like human experts, when LLMs were confident in their predictions, they were more likely to be correct, which presages a future where humans and LLMs team together to make discoveries. Our approach is not neuroscience-specific and is transferable to other knowledge-intensive endeavors.
Submitted 21 June, 2024; v1 submitted 4 March, 2024;
originally announced March 2024.
-
Hydrogen bonding in water under extreme confinement unveiled by nanoscale vibrational spectroscopy and simulations
Authors:
Xintong Xu,
Xin **,
Matthias Kuehne,
De-Liang Bao,
Joel Martis,
Yu-Ming Tu,
Cody L. Ritt,
Juan Carlos Idrobo,
Michael S. Strano,
Arun Majumdar,
Sokrates T. Pantelides,
Jordan A. Hachtel
Abstract:
Fluids under extreme confinement exhibit distinctly new properties compared to their bulk analogs. Understanding the structure and intermolecular bonding of confined water lays the foundation for creating and improving applications at the water-energy nexus. However, probing confined water experimentally at the length scale of intermolecular and surface forces has remained a challenge. Here, we report a combined experiment/theory framework to reveal changes in the H-bonding environment and the underlying molecular structure of confined water inside individual carbon nanotubes. H-bonding is directly probed through the O-H stretch frequency with vibrational electron energy-loss spectroscopy and compared to spectra from molecular-dynamics simulations based on density-functional theory. Experimental spectra show that water in larger carbon nanotubes exhibits the bonded O-H vibrations of bulk water, but at smaller diameters, the frequency blueshifts to near the 'free' O-H stretch found in water vapor and at hydrophobic surfaces. The matching simulations reveal that, in addition to steric confinement, the tube's vibrations play a key role in breaking up the H-bond network, resulting in an orientationally-dispersed, non-H-bonded phase. Furthermore, the temperature-dependence of the vibrations is investigated, providing insights into phase transitions and the confined-water density. This research demonstrates the potential of the experiment/theory framework to explore unprecedented aspects of structure and bonding in confined fluids.
Submitted 27 February, 2024;
originally announced February 2024.
-
Can GNN be Good Adapter for LLMs?
Authors:
Xuanwen Huang,
Kaiqiao Han,
Yang Yang,
Dezheng Bao,
Quan** Tao,
Ziwei Chai,
Qi Zhu
Abstract:
Recently, large language models (LLMs) have demonstrated superior capabilities in understanding and zero-shot learning on textual data, promising significant advances for many text-related domains. In the graph domain, various real-world scenarios also involve textual data, where tasks and node features can be described by text. These text-attributed graphs (TAGs) have broad applications in social media, recommendation systems, etc. Thus, this paper explores how to utilize LLMs to model TAGs. Previous methods for TAG modeling are based on million-scale LMs. When scaled up to billion-scale LLMs, they face huge challenges in computational costs. Additionally, they ignore the zero-shot inference capabilities of LLMs. Therefore, we propose GraphAdapter, which uses a graph neural network (GNN) as an efficient adapter in collaboration with LLMs to tackle TAGs. In terms of efficiency, the GNN adapter introduces only a few trainable parameters and can be trained with low computation costs. The entire framework is trained using auto-regression on node text (next token prediction). Once trained, GraphAdapter can be seamlessly fine-tuned with task-specific prompts for various downstream tasks. Through extensive experiments across multiple real-world TAGs, GraphAdapter based on Llama 2 gains an average improvement of approximately 5% in terms of node classification. Furthermore, GraphAdapter can also adapt to other language models, including RoBERTa and GPT-2. The promising results demonstrate that GNNs can serve as effective adapters for LLMs in TAG modeling.
Submitted 20 February, 2024;
originally announced February 2024.
-
Clustering Inductive Biases with Unrolled Networks
Authors:
Jonathan Huml,
Abiy Tasissa,
Demba Ba
Abstract:
The classical sparse coding (SC) model represents visual stimuli as a linear combination of a handful of learned basis functions that are Gabor-like when trained on natural image data. However, the Gabor-like filters learned by classical sparse coding far overpredict well-tuned simple cell receptive field profiles observed empirically. While neurons fire sparsely, neuronal populations are also organized in physical space by their sensitivity to certain features. In V1, this organization is a smooth progression of orientations along the cortical sheet. A number of subsequent models have either discarded the sparse dictionary learning framework entirely or have yet to take advantage of the surge in unrolled, neural dictionary learning architectures. A key missing theme of these updates is a stronger notion of structured sparsity. We propose an autoencoder architecture (WLSC) whose latent representations are implicitly, locally organized for spectral clustering through a Laplacian quadratic form of a bipartite graph, which generates a diverse set of artificial receptive fields that match primate data in V1 as faithfully as recent contrastive frameworks like Local Low Dimensionality (LLD) that discard sparse dictionary learning. By unifying sparse and smooth coding in models of the early visual cortex through our autoencoder, we also show that our regularization can be interpreted as early-stage specialization of receptive fields to certain classes of stimuli; that is, we induce a weak clustering bias for later stages of cortex where functional and spatial segregation (i.e. topography) are known to occur. The results show an imperative for spatial regularization of both the receptive fields and firing rates to begin to describe feature disentanglement in V1 and beyond.
Submitted 29 November, 2023;
originally announced February 2024.
-
Ruddlesden-Popper chalcogenides push the limit of mechanical stiffness and glass-like thermal conductivity in crystals
Authors:
Md Shafkat Bin Hoque,
Eric R. Hoglund,
Boyang Zhao,
De-Liang Bao,
Hao Zhou,
Sandip Thakur,
Eric Osei-Agyemang,
Khalid Hattar,
Ethan A. Scott,
Mythili Surendran,
John A. Tomko,
John T. Gaskins,
Kiumars Aryana,
Sara Makarem,
Ganesh Balasubramanian,
Ashutosh Giri,
Tianli Feng,
Jordan A. Hachtel,
Jayakanth Ravichandran,
Sokrates T. Pantelides,
Patrick E. Hopkins
Abstract:
Insulating materials featuring ultralow thermal conductivity for diverse applications also require robust mechanical properties. Conventional thinking, however, which correlates strong bonding with high atomic-vibration-mediated heat conduction, led to diverse weakly bonded materials that feature ultralow thermal conductivity and low elastic moduli. One must, therefore, search for strongly-bonded materials in which heat transport is impeded by other means. Here, we report intrinsic, glass-like, ultralow thermal conductivity and ultrahigh elastic-modulus/thermal-conductivity ratio in single-crystalline, BaZrS$_3$-derived, Ruddlesden-Popper phases Ba$_{n+1}$Zr$_n$S$_{3n+1}$, n = 2, 3. Their key features are strong anharmonicity and intra-unit-cell rock-salt blocks. The latter produce strongly bonded intrinsic superlattices, impeding heat conduction by broadband reduction of phonon velocities and mean free paths and concomitant strong phonon localization. The present study initiates a paradigm of "mechanically stiff phonon glasses".
Submitted 5 December, 2023;
originally announced December 2023.
-
Triple Flares within Five Years in ztf18aanlzzf: An Enhanced Tidal Disruption Rate in ULIRGs?
Authors:
Dong-Wei Bao,
Wei-Jian Guo,
Zhi-Xiang Zhang,
Cheng Cheng,
Zhu-Heng Yao,
Yan-Rong Li,
Ye-Fei Yuan,
Jian-Min Wang,
Chao-Wei Tsai,
Zhi-Qiang Chen
Abstract:
We present a noteworthy transient event in the optical light curves of ztf18aanlzzf (SDSS J161259.83+421940.3), identified as a Narrow Line Seyfert 1 (NLS1) exhibiting merging patterns in the optical image. The 16-year long-term archived light curve revealed that this target stays in a steady state, while three flares occurred within the past 5 years with time separations ranging from 1 year to 3.5 years. The flare patterns of rapid brightening and slow decline following the peak, coupled with distinctive spectral features with strong He II and rare appearance of Bowen fluorescence line emissions, indicate at least two Tidal Disruption Event (TDE) flares in ztf18aanlzzf with a time separation of 3.5 years. We also apply TiDE light curve modeling and obtain a Black Hole (BH) mass of $\sim 10^{6}\ M_{\odot}$, which is consistent with the BH mass measured from single-epoch spectra. Besides, the observed time lag of $3.90_{-2.00}^{+2.06}$ days between the g and r bands strongly disagrees with the prediction of the standard accretion disk model, highlighting the intricate interaction in the inner region related to the TDE. The recurrence gap of these TDEs, surpassing the previously reported repeated TDEs, can be attributed to binary star tidal disruption by a binary SMBH. Notably, the frequent TDE flares observed in this ULIRG-like target align with findings in a previous report for another ULIRG, suggesting a potentially elevated TDE rate in ULIRGs. Systematic variability studies of ULIRGs may help verify whether ULIRGs indeed have higher TDE rates.
Submitted 28 November, 2023;
originally announced November 2023.
-
On the Opportunities of Green Computing: A Survey
Authors:
You Zhou,
Xiu**g Lin,
Xiang Zhang,
Maolin Wang,
Gangwei Jiang,
Huakang Lu,
Yupeng Wu,
Kai Zhang,
Zhe Yang,
Kehang Wang,
Yongduo Sui,
Fengwei Jia,
Zuoli Tang,
Yao Zhao,
Hongxuan Zhang,
Tiannuo Yang,
Weibo Chen,
Yunong Mao,
Yi Li,
De Bao,
Yu Li,
Hongrui Liao,
Ting Liu,
**gwen Liu,
**chi Guo
, et al. (16 additional authors not shown)
Abstract:
Artificial Intelligence (AI) has achieved significant advancements in technology and research over several decades of development, and is widely used in many areas including computer vision, natural language processing, time-series analysis, speech synthesis, etc. During the age of deep learning, especially with the rise of Large Language Models, a large majority of researchers' attention is paid to pursuing new state-of-the-art (SOTA) results, resulting in ever-increasing model size and computational complexity. The need for high computing power brings higher carbon emission and undermines research fairness by preventing small or medium-sized research institutions and companies with limited funding from participating in research. To tackle the challenges of computing resources and environmental impact of AI, Green Computing has become a hot research topic. In this survey, we give a systematic overview of the technologies used in Green Computing. We propose the framework of Green Computing and divide it into four key components: (1) Measures of Greenness, (2) Energy-Efficient AI, (3) Energy-Efficient Computing Systems and (4) AI Use Cases for Sustainability. For each component, we discuss the research progress made and the commonly used techniques to optimize the AI efficiency. We conclude that this new research direction has the potential to address the conflicts between resource constraints and AI development. We encourage more researchers to pay attention to this direction and make AI more environmentally friendly.
Submitted 8 November, 2023; v1 submitted 1 November, 2023;
originally announced November 2023.
-
Phonon vortices at heavy impurities in two-dimensional materials
Authors:
De-Liang Bao,
Mingquan Xu,
Ao-Wen Li,
Gang Su,
Wu Zhou,
Sokrates T. Pantelides
Abstract:
The advent of monochromated electron energy-loss spectroscopy has enabled atomic-resolution vibrational spectroscopy, which triggered interest in spatially localized or quasi-localized vibrational modes in materials. Here we report the discovery of phonon vortices at heavy impurities in two-dimensional materials. We use density-functional-theory calculations for two configurations of Si impurities in graphene, Si-C3 and Si-C4, to examine atom-projected phonon densities of states and display the atomic-displacement patterns for select modes that are dominated by impurity displacements. The vortices are driven by large displacements of the impurities, and reflect local symmetries. Similar vortices are found at phosphorus impurities in hexagonal boron nitride, suggesting that they may be a feature of heavy impurities in crystalline materials. Phonon vortices at defects are expected to play a role in thermal conductivity and other properties.
Submitted 12 October, 2023;
originally announced October 2023.
-
An Efficient Algorithm for Clustered Multi-Task Compressive Sensing
Authors:
Alexander Lin,
Demba Ba
Abstract:
This paper considers clustered multi-task compressive sensing, a hierarchical model that solves multiple compressive sensing tasks by finding clusters of tasks that leverage shared information to mutually improve signal reconstruction. The existing inference algorithm for this model is computationally expensive and does not scale well in high dimensions. The main bottleneck involves repeated matrix inversion and log-determinant computation for multiple large covariance matrices. We propose a new algorithm that substantially accelerates model inference by avoiding the need to explicitly compute these covariance matrices. Our approach combines Monte Carlo sampling with iterative linear solvers. Our experiments reveal that compared to the existing baseline, our algorithm can be up to thousands of times faster and an order of magnitude more memory-efficient.
Submitted 30 September, 2023;
originally announced October 2023.
-
Prompt-based Node Feature Extractor for Few-shot Learning on Text-Attributed Graphs
Authors:
Xuanwen Huang,
Kaiqiao Han,
Dezheng Bao,
Quan** Tao,
Zhisheng Zhang,
Yang Yang,
Qi Zhu
Abstract:
Text-attributed Graphs (TAGs) are commonly found in the real world, such as social networks and citation networks, and consist of nodes represented by textual descriptions. Currently, mainstream machine learning methods on TAGs involve a two-stage modeling approach: (1) unsupervised node feature extraction with pre-trained language models (PLMs); and (2) supervised learning using Graph Neural Networks (GNNs). However, we observe that these representations, which have undergone large-scale pre-training, do not significantly improve performance with a limited amount of training samples. The main issue is that existing methods have not effectively integrated information from the graph and downstream tasks simultaneously. In this paper, we propose a novel framework called G-Prompt, which combines a graph adapter and task-specific prompts to extract node features. First, G-Prompt introduces a learnable GNN layer (i.e., adapter) at the end of PLMs, which is fine-tuned to better capture the masked tokens considering graph neighborhood information. After the adapter is trained, G-Prompt incorporates task-specific prompts to obtain interpretable node representations for the downstream task. Our experiment results demonstrate that our proposed method outperforms current state-of-the-art (SOTA) methods on few-shot node classification. More importantly, in zero-shot settings, the G-Prompt embeddings can not only provide better task interpretability than vanilla PLMs but also achieve comparable performance with fully-supervised baselines.
Submitted 6 September, 2023;
originally announced September 2023.
-
Fast generation of Schrödinger cat states in a Kerr-tunable superconducting resonator
Authors:
X. L. He,
Yong Lu,
D. Q. Bao,
Hang Xue,
W. B. Jiang,
Zhen Wang,
A. F. Roudsari,
Per Delsing,
J. S. Tsai,
Z. R. Lin
Abstract:
Schrödinger cat states, quantum superpositions of macroscopically distinct classical states, are an important resource for quantum communication, quantum metrology and quantum computation. Especially, cat states in a phase space protected against phase-flip errors can be used as a logical qubit. However, cat states, normally generated in three-dimensional cavities, are facing the challenges of scalability and controllability. Here, we present a novel strategy to generate and store cat states in a coplanar superconducting circuit by the fast modulation of Kerr nonlinearity. At the Kerr-free work point, our cat states are passively preserved due to the vanishing Kerr effect. We are able to prepare a 2-component cat state in our chip-based device with a fidelity reaching 89.1% under a 96 ns gate time. Our scheme shows an excellent route to constructing a chip-based bosonic quantum processor.
Submitted 28 August, 2023;
originally announced August 2023.
-
Long-term multiwavelength monitoring and reverberation mapping of NGC 2617 during a changing-look event
Authors:
V. L. Oknyansky,
M. S. Brotherton,
S. S. Tsygankov,
A. V. Dodin,
A. M. Tatarnikov,
P. Du,
D. -W. Bao,
M. A. Burlak,
N. P. Ikonnikova,
V. M. Lipunov,
E. S. Gorbovskoy,
V. G. Metlov,
A. A. Belinski,
N. I. Shatsky,
S. G. Zheltouhov,
N. A. Maslennikova,
J. -M. Wang,
S. Zhai,
F. -N. Fang,
Y. -X. Fu,
H. -R. Bai,
D. Kasper,
N. A. Huseynov,
J. N. McLane,
J. Maithil
, et al. (10 additional authors not shown)
Abstract:
We present the results of photometric and spectroscopic monitoring campaigns of the changing-look AGN NGC 2617 carried out from 2016 until 2022 and covering the wavelength range from the X-ray to the near-IR. The facilities included the telescopes of the SAI MSU, MASTER Global Robotic Net, the 2.3-m WIRO telescope, Swift, and others. We found significant variability at all wavelengths and, specifically, in the intensities and profiles of the broad Balmer lines. We measured time delays of ~ 6 days (~ 8 days) in the responses of the H-beta (H-alpha) line to continuum variations. We found the X-ray variations to correlate well with the UV and optical (with a small time delay of a few days for longer wavelengths). The K-band lagged the B band by 14 ± 4 days during the last 3 seasons, which is significantly shorter than the delays reported previously by the 2016 and 2017-2019 campaigns. Near-IR variability arises from two different emission regions: the outer part of the accretion disc and a more distant dust component. The HK-band variability is governed primarily by dust. The Balmer decrement of the broad-line components is inversely correlated with the UV flux. The change of the object's type, from Sy1 to Sy1.8, was recorded over a period of ~ 8 years. We interpret these changes as a combination of two factors: changes in the accretion rate and dust recovery along the line of sight.
Submitted 23 August, 2023; v1 submitted 9 August, 2023;
originally announced August 2023.
-
Probabilistic Unrolling: Scalable, Inverse-Free Maximum Likelihood Estimation for Latent Gaussian Models
Authors:
Alexander Lin,
Bahareh Tolooshams,
Yves Atchadé,
Demba Ba
Abstract:
Latent Gaussian models have a rich history in statistics and machine learning, with applications ranging from factor analysis to compressed sensing to time series analysis. The classical method for maximizing the likelihood of these models is the expectation-maximization (EM) algorithm. For problems with high-dimensional latent variables and large datasets, EM scales poorly because it needs to invert as many large covariance matrices as the number of data points. We introduce probabilistic unrolling, a method that combines Monte Carlo sampling with iterative linear solvers to circumvent matrix inversion. Our theoretical analyses reveal that unrolling and backpropagation through the iterations of the solver can accelerate gradient estimation for maximum likelihood estimation. In experiments on simulated and real data, we demonstrate that probabilistic unrolling learns latent Gaussian models up to an order of magnitude faster than gradient EM, with minimal losses in model performance.
Submitted 5 June, 2023;
originally announced June 2023.
-
Variations of the Kibble-Zurek scaling exponents of trapped Bose gases
Authors:
Tenzin Rabga,
Yangheon Lee,
Dalmin Bae,
Myeonghyeon Kim,
Yong-il Shin
Abstract:
We study the vortex nucleation dynamics in inhomogeneous atomic Bose gases quenched into a superfluid phase and investigate the dependence of the Kibble-Zurek (KZ) scaling exponent on the underlying trap configuration. For samples in a number of different inhomogeneous traps, we observe the characteristic power-law scaling of the vortex number with the thermal quench rate, as well as an enhanced vortex suppression in the outer regions with lower particle density, in agreement with the causality effect as encapsulated in the inhomogeneous Kibble-Zurek mechanism (IKZM). However, the measured KZ scaling exponents show significant differences from the theoretical estimates, and furthermore their trends as a function of the underlying trap configuration deviate from the IKZM prediction. We also investigate the early-time coarsening effect using a two-step quench protocol as proposed in a recent study and show that the interpretation of the measurement results without including the causality effect might be misleading. This paper provides a comprehensive study of vortex formation dynamics in quenched Bose gases confined in inhomogeneous trapping potentials and calls for a refined theoretical framework for quantitative understanding of the phase transition and defect formation processes in such inhomogeneous systems.
Submitted 28 November, 2023; v1 submitted 30 May, 2023;
originally announced May 2023.
-
Learning Linear Groups in Neural Networks
Authors:
Emmanouil Theodosis,
Karim Helwani,
Demba Ba
Abstract:
Employing equivariance in neural networks leads to greater parameter efficiency and improved generalization performance through the encoding of domain knowledge in the architecture; however, the majority of existing approaches require an a priori specification of the desired symmetries. We present a neural network architecture, Linear Group Networks (LGNs), for learning linear groups acting on the weight space of neural networks. Linear groups are desirable due to their inherent interpretability, as they can be represented as finite matrices. LGNs learn groups without any supervision or knowledge of the hidden symmetries in the data and the groups can be mapped to well known operations in machine learning. We use LGNs to learn groups on multiple datasets while considering different downstream tasks; we demonstrate that the linear group structure depends on both the data distribution and the considered task.
Submitted 29 May, 2023;
originally announced May 2023.
-
SHAPER: Can You Hear the Shape of a Jet?
Authors:
Demba Ba,
Akshunna S. Dogra,
Rikab Gambhir,
Abiy Tasissa,
Jesse Thaler
Abstract:
The identification of interesting substructures within jets is an important tool for searching for new physics and probing the Standard Model at colliders. Many of these substructure tools have previously been shown to take the form of optimal transport problems, in particular the Energy Mover's Distance (EMD). In this work, we show that the EMD is in fact the natural structure for comparing collider events, which accounts for its recent success in understanding event and jet substructure. We then present a Shape Hunting Algorithm using Parameterized Energy Reconstruction (SHAPER), which is a general framework for defining and computing shape-based observables. SHAPER generalizes N-jettiness from point clusters to any extended, parametrizable shape. This is accomplished by efficiently minimizing the EMD between events and parameterized manifolds of energy flows representing idealized shapes, implemented using the dual-potential Sinkhorn approximation of the Wasserstein metric. We show how the geometric language of observables as manifolds can be used to define novel observables with built-in infrared-and-collinear safety. We demonstrate the efficacy of the SHAPER framework by performing empirical jet substructure studies using several examples of new shape-based observables.
Submitted 20 July, 2023; v1 submitted 23 February, 2023;
originally announced February 2023.
-
Sparse, Geometric Autoencoder Models of V1
Authors:
Jonathan Huml,
Abiy Tasissa,
Demba Ba
Abstract:
The classical sparse coding model represents visual stimuli as a linear combination of a handful of learned basis functions that are Gabor-like when trained on natural image data. However, the Gabor-like filters learned by classical sparse coding far overpredict well-tuned simple cell receptive field (SCRF) profiles. A number of subsequent models have either discarded the sparse dictionary learning framework entirely or have yet to take advantage of the surge in unrolled, neural dictionary learning architectures. A key missing theme of these updates is a stronger notion of \emph{structured sparsity}. We propose an autoencoder architecture whose latent representations are implicitly, locally organized for spectral clustering, which begets artificial neurons better matched to observed primate data. The weighted-$\ell_1$ (WL) constraint in the autoencoder objective function maintains core ideas of the sparse coding framework, yet also offers a promising path to describe the differentiation of receptive fields in terms of a discriminative hierarchy in future work.
Submitted 22 February, 2023;
originally announced February 2023.
-
Broad-line region in NGC 4151 monitored by two decades of reverberation mapping campaigns. I. Evolution of structure and kinematics
Authors:
Yong-Jie Chen,
Dong-Wei Bao,
Shuo Zhai,
Feng-Na Fang,
Chen Hu,
Pu Du,
Sen Yang,
Zhu-Heng Yao,
Yan-Rong Li,
Michael S. Brotherton,
Jacob N. McLane,
T. E. Zastrocky,
Kianna A. Olson,
Edi Bon,
Hua-Rui Bai,
Yi-Xin Fu,
Jun-Rong Liu,
Yi-Lin Wang,
Jaya Maithil,
H. A. Kobulnicky,
D. A. Dale,
C. Adelman,
M. J. Caradonna,
Z. Carter,
J. Favro
, et al. (11 additional authors not shown)
Abstract:
We report the results of long-term reverberation mapping (RM) campaigns of the nearby active galactic nucleus (AGN) NGC 4151, spanning from 1994 to 2022, based on archived observations of the FAST Spectrograph Publicly Archived Programs and our new observations with the 2.3m telescope at the Wyoming Infrared Observatory. We reduce and calibrate all the spectra in a consistent way, and derive light curves of the broad H$β$ line and 5100\,Å continuum. Continuum light curves are also constructed using public archival photometric data to increase sampling cadences. We subtract the host galaxy contamination using {\it HST} imaging to correct fluxes of the calibrated light curves. Utilizing the long-term archival photometric data, we complete the absolute flux-calibration of the AGN continuum. We find that the H$β$ time delays are correlated with the 5100\,Å luminosities as $τ_{\rm Hβ}\propto L_{5100}^{0.46\pm0.16}$. This is remarkably consistent with the global size-luminosity relationship of AGNs from Bentz et al. (2013). Moreover, the data sets for five of the seasons allow us to obtain the velocity-resolved delays of the H$β$ line, showing diverse structures (outflows, inflows and disks). Combining our results with previous independent measurements, we find the measured dynamics of the H$β$ broad-line region (BLR) are possibly related to the long-term trend of the luminosity. There is also a possible additional time lag of $\sim$1.86 years between the variation in BLR radius and luminosity. These results suggest that dynamical changes in the BLR may be driven by the effects of radiation pressure.
Submitted 15 January, 2023;
originally announced January 2023.
-
Learning unfolded networks with a cyclic group structure
Authors:
Emmanouil Theodosis,
Demba Ba
Abstract:
Deep neural networks lack straightforward ways to incorporate domain knowledge and are notoriously considered black boxes. Prior works attempted to inject domain knowledge into architectures implicitly through data augmentation. Building on recent advances on equivariant neural networks, we propose networks that explicitly encode domain knowledge, specifically equivariance with respect to rotations. By using unfolded architectures, a rich framework that originated from sparse coding and has theoretical guarantees, we present interpretable networks with sparse activations. The equivariant unfolded networks compete favorably with baselines, with only a fraction of their parameters, as showcased on (rotated) MNIST and CIFAR-10.
Submitted 16 November, 2022;
originally announced November 2022.
-
Unrolled Compressed Blind-Deconvolution
Authors:
Bahareh Tolooshams,
Satish Mulleti,
Demba Ba,
Yonina C. Eldar
Abstract:
The problem of sparse multichannel blind deconvolution (S-MBD) arises frequently in many engineering applications such as radar/sonar/ultrasound imaging. To reduce its computational and implementation cost, we propose a compression method that enables blind recovery from far fewer measurements than the full received signal in time. The proposed compression measures the signal through a filter followed by a subsampling, allowing for a significant reduction in implementation cost. We derive theoretical guarantees for the identifiability and recovery of a sparse filter from compressed measurements. Our results allow for the design of a wide class of compression filters. We then propose a data-driven unrolled learning framework to learn the compression filter and solve the S-MBD problem. The encoder is a recurrent inference network that maps compressed measurements into an estimate of sparse filters. We demonstrate that our unrolled learning method is more robust to choices of source shapes and has better recovery performance compared to optimization-based methods. Finally, in data-limited applications (few-shot learning), we highlight the superior generalization capability of unrolled learning compared to conventional deep learning.
Submitted 18 May, 2023; v1 submitted 28 September, 2022;
originally announced September 2022.
-
Suppression of Spontaneous Defect Formation in Inhomogeneous Bose Gases
Authors:
Myeonghyeon Kim,
Tenzin Rabga,
Yangheon Lee,
Junhong Goo,
Dalmin Bae,
Yong-il Shin
Abstract:
In phase transition dynamics involving symmetry breaking, topological defects can be spontaneously created, but their creation is suppressed in a spatially inhomogeneous system due to the spreading of the ordered phase information. We demonstrate the defect suppression effect in a trapped atomic Bose gas which is quenched into a superfluid phase. The spatial distribution of created defects is measured for various quench times and it is shown that for slower quenches, the spontaneous defect production is relatively more suppressed in the sample's outer region with higher atomic density gradient. The power-law scaling of the local defect density with the quench time is enhanced in the outer region, which is consistent with the Kibble-Zurek mechanism including the causality effect due to the spatial inhomogeneity of the system. This work opens an avenue in the study of nonequilibrium phase transition dynamics using the defect position information.
Submitted 3 August, 2022;
originally announced August 2022.
-
Direct Visualization of Localized Vibrations at Complex Grain Boundaries
Authors:
Eric R. Hoglund,
De-Liang Bao,
Andrew O'Hara,
Thomas W. Pfeifer,
Md Shafkat Bin Hoque,
Sara Makarem,
James M. Howe,
Sokrates T. Pantelides,
Patrick E. Hopkins,
Jordan A. Hachtel
Abstract:
Grain boundaries (GBs) are a prolific microstructural feature that dominates the functionality of a wide class of materials. The change in functionality at a GB is a direct result of unique local atomic arrangements, different from those in the grain, that have driven extensive experimental and theoretical studies correlating atomic-scale GB structures to macroscopic electronic, infrared-optical, and thermal properties. Here, we examine a SrTiO3 GB using atomic-resolution aberration-corrected scanning transmission electron microscopy (STEM) and ultra-high-energy-resolution monochromated electron energy-loss spectroscopy (EELS), in conjunction with density functional theory (DFT) calculations. This combination enables the direct correlation of the GB structure, composition, and chemical bonding with atomic vibrations within the GB dislocation-cores. We observe that nonstoichiometry and changes in coordination and bonding at the GB lead to a redistribution of vibrational states at the GB and its dislocation-cores relative to the bounding grains. The access to localized vibrations within GBs provided by ultrahigh spatial/spectral resolution EELS, correlated with atomic coordination, bonding, and stoichiometry and validated by theory, provides a direct route to quantifying the impact of individual boundaries on macroscopic properties.
Submitted 24 August, 2022; v1 submitted 30 July, 2022;
originally announced August 2022.
-
Large-Scale Deep Learning for Multi-Jet Event Classification
Authors:
Jiwoong Kim,
Dongsung Bae,
Kihyeon Cho,
Junghwan Goh,
Jaegyoon Hahm,
Taeyoung Hong,
Soonwook Hwang,
Minsik Kim,
Sungwon Kim,
Tongil Kim,
Chang-Seong Moon,
Hunjoo Myung,
Hokyeong Nam,
Changhyun Yoo,
Hwidong Yoo
Abstract:
We report the largest-scale application of deep learning with High Performance Computing (HPC) to physics analysis with CMS simulation data in proton-proton collisions at 13 TeV. We build a Convolutional Neural Network (CNN) model that takes low-level information as images considering the geometry of the CMS detector and use this model to discriminate \textit{R}-parity violating supersymmetry (RPV SUSY) events from the background events with inelastic quantum process from the Standard Model (QCD multi-jet). We compare the classification performance of the CNN method with that of the widely used cut-based method. The signal efficiency (and expected significance) of the CNN method is 1.85 (1.2) times higher than that of the cut-based method. To speed up training, the model is trained on the Nurion HPC system at the Korea Institute of Science and Technology Information, which is equipped with thousands of parallel \texttt{Xeon Phi} CPUs. Notably, our CNN model shows scalability up to 1024 nodes.
Submitted 30 May, 2023; v1 submitted 24 July, 2022;
originally announced July 2022.
-
Twisted bilayer zigzag-graphene nanoribbon junctions with tunable edge states
Authors:
Dongfei Wang,
De-Liang Bao,
Qi Zheng,
Chang-Tian Wang,
Shiyong Wang,
Peng Fan,
Shantanu Mishra,
Lei Tao,
Yao Xiao,
Li Huang,
Xinliang Feng,
Klaus Müllen,
Yu-Yang Zhang,
Roman Fasel,
Pascal Ruffieux,
Shixuan Du,
Hong-Jun Gao
Abstract:
Stacking two-dimensional layered materials such as graphene and transition metal dichalcogenides with nonzero interlayer twist angles has recently become attractive because of the emergence of novel physical properties. Stacking of one-dimensional nanomaterials offers the lateral stacking offset as an additional parameter for modulating the resulting material properties. Here, we report that the edge states of twisted bilayer zigzag graphene nanoribbons (TBZGNRs) can be tuned with both the twist angle and the stacking offset. Strong edge state variations in the stacking region are first revealed by density functional theory (DFT) calculations. We construct and characterize TBZGNR systems on a Au(111) surface using scanning tunneling microscopy. A detailed analysis of three prototypical orthogonal TBZGNR junctions exhibiting different stacking offsets by means of scanning tunneling spectroscopy reveals emergent near-zero-energy states. From a comparison with DFT calculations, we conclude that the emergent edge states originate from the formation of flat bands whose energy and spin degeneracy are highly tunable with the stacking offset. Our work highlights fundamental differences between 2D and 1D twistronics and spurs further investigation of twisted one-dimensional systems.
Submitted 27 February, 2023; v1 submitted 4 July, 2022;
originally announced July 2022.
-
Monitoring AGNs with H$β$ Asymmetry. III. Long-term Reverberation Mapping Results of 15 Palomar-Green Quasars
Authors:
Dong-Wei Bao,
Michael S. Brotherton,
Pu Du,
Jacob N. McLane,
T. E. Zastrocky,
Kianna A. Olson,
Feng-Na Fang,
Shuo Zhai,
Zheng-Peng Huang,
Kai Wang,
Bi-Xuan Zhao,
Sha-Sha Li,
Sen Yang,
Yong-Jie Chen,
Jun-Rong Liu,
Zhu-Heng Yao,
Yue-Chang Peng,
Wei-Jian Guo,
Yu-Yang Songsheng,
Yan-Rong Li,
Bo-Wei Jiang,
David H. Kasper,
William T. Chick,
My L. Nguyen,
Jaya Maithil
, et al. (20 additional authors not shown)
Abstract:
In this third paper of the series reporting on the reverberation mapping (RM) campaign of active galactic nuclei with asymmetric H$β$ emission-line profiles, we present results for 15 Palomar-Green (PG) quasars using spectra obtained between the end of 2016 and May 2021. This campaign combines long time spans with relatively high cadence. For 8 objects, both the time lags obtained from the entire light curves and the measurements from individual observing seasons are provided. Reverberation mapping of 9 of our targets has been attempted for the first time, while the results for 6 others can be compared with previous campaigns. We measure the H$β$ time lags over periods of years and estimate their black hole masses. The long duration of the campaign enables us to investigate their broad line region (BLR) geometry and kinematics for different years by using velocity-resolved lags, which demonstrate signatures of diverse BLR geometry and kinematics. The BLR geometry and kinematics of individual objects are discussed. In this sample, the BLR kinematics of Keplerian/virialized motion and inflow is more common than outflow.
Submitted 1 July, 2022;
originally announced July 2022.
-
Literature Review to Collect Conceptual Variables of Scenario Methods for Establishing a Conceptual Scenario Framework
Authors:
Young-Min Baek,
Esther Cho,
Donghwan Shin,
Doo-Hwan Bae
Abstract:
Over recent decades, scenarios and scenario-based software/system engineering have been actively employed as essential tools to handle intricate problems, validate requirements, and support stakeholders' communication. However, despite the widespread use of scenarios, several challenges have kept engineers from more willingly utilizing scenario-based engineering approaches (i.e., scenario methods) in their projects. First, the term scenario has numerous published definitions, so a well-established shared understanding of scenarios and scenario methods is lacking. Second, the conceptual basis for engineers developing or employing scenarios is missing. To establish shared understanding and to find common denominators of scenario methods, this study leverages well-defined metamodeling and conceptualization that systematically investigate the concepts under analysis and define core entities and their relations. By conducting a semi-systematic literature review, conceptual variables are collected and conceptualized as a conceptual meta-model. As a result, this study introduces scenario variables (SVs) that represent constructs/semantics of scenario descriptions, according to 4 levels of constructs of a scenario method. To evaluate the comprehensibility and applicability of the defined variables, we analyze five existing scenario methods and their instances in automated driving system (ADS) domains. The results showed that our conceptual model and its constituent scenario variables adequately support the understanding of a scenario method and provide a means for comparative analysis between different scenario methods.
Submitted 17 May, 2022;
originally announced May 2022.
-
Environment Imitation: Data-Driven Environment Model Generation Using Imitation Learning for Efficient CPS Goal Verification
Authors:
Yong-Jun Shin,
Donghwan Shin,
Doo-Hwan Bae
Abstract:
Cyber-Physical Systems (CPS) continuously interact with their physical environments through software controllers that observe the environments and determine actions. Engineers can verify to what extent the CPS under analysis can achieve given goals by analyzing its Field Operational Test (FOT) logs. However, it is challenging to repeat many FOTs to obtain statistically significant results due to their cost and risk in practice. To address this challenge, simulation-based verification can be a good alternative for efficient CPS goal verification, but it requires an accurate virtual environment model that can replace the real environment that interacts with the CPS in a closed loop. This paper proposes a novel data-driven approach that automatically generates the virtual environment model from a small amount of FOT logs. We formally define the environment model generation problem and solve it using Imitation Learning (IL) algorithms. In addition, we propose three specific use cases of our approach in the evolutionary CPS development. To validate our approach, we conduct a case study using a simplified autonomous vehicle with a lane-keeping system. The case study results show that our approach can generate accurate virtual environment models for CPS goal verification at a low cost through simulations.
Submitted 14 April, 2022;
originally announced April 2022.
-
BABD: A Bitcoin Address Behavior Dataset for Pattern Analysis
Authors:
Yuexin Xiang,
Yuchen Lei,
Ding Bao,
Wei Ren,
Tiantian Li,
Qingqing Yang,
Wenmao Liu,
Tianqing Zhu,
Kim-Kwang Raymond Choo
Abstract:
Cryptocurrencies are no longer just the preferred option for cybercriminal activities on darknets, due to the increasing adoption in mainstream applications. This is partly due to the transparency associated with the underpinning ledgers, where any individual can access the record of a transaction on the public ledger. In this paper, we build a dataset comprising Bitcoin transactions between 12 July 2019 and 26 May 2021. This dataset (hereafter referred to as BABD-13) contains 13 types of Bitcoin addresses, 5 categories of indicators with 148 features, and 544,462 labeled data, which is the largest labeled Bitcoin address behavior dataset publicly available to our knowledge. We then use our proposed dataset on common machine learning models, namely: k-nearest neighbors algorithm, decision tree, random forest, multilayer perceptron, and XGBoost. The results show that the accuracy rates of these machine learning models for the multi-classification task on our proposed dataset are between 93.24% and 97.13%. We also analyze the proposed features and their relationships from the experiments, and propose a k-hop subgraph generation algorithm to extract a k-hop subgraph from the entire Bitcoin transaction graph constructed by the directed heterogeneous multigraph starting from a specific Bitcoin address node (e.g., a known transaction associated with a criminal investigation). Besides, we initially analyze the behavior patterns of different types of Bitcoin addresses according to the extracted features.
Submitted 5 May, 2022; v1 submitted 10 April, 2022;
originally announced April 2022.
-
Vortex shedding frequency of a moving obstacle in a Bose-Einstein condensate
Authors:
Younghoon Lim,
Yangheon Lee,
Junhong Goo,
Dalmin Bae,
Yong-il Shin
Abstract:
We experimentally investigate the periodic vortex shedding dynamics in a highly oblate Bose-Einstein condensate using a moving penetrable Gaussian obstacle. The shedding frequency $f_v$ is measured as a function of the obstacle velocity $v$ and characterized by a linear relationship of $f_v=a(v-v_c)$ with $v_c$ being the critical velocity. The proportionality constant $a$ is linearly decreased with a decrease in the obstacle strength, whereas $v_c$ approaches the speed of sound. When the obstacle size increases, both $a$ and $v_c$ are decreased. The critical vortex shedding is further investigated for an oscillating obstacle and found to be consistent with the measured $f_v$. When the obstacle's maximum velocity exceeds $v_c$ but its oscillation amplitude is not large enough to create a vortex dipole, we observe that vortices are generated in the low-density boundary region of the trapped condensate, which is attributed to the phonon emission from the oscillating obstacle. Finally, we discuss a possible asymptotic association of $a$ with the Strouhal number in the context of universal shedding dynamics of a superfluid.
Submitted 27 February, 2022;
originally announced February 2022.
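The linear relationship $f_v=a(v-v_c)$ quoted in the abstract above can be illustrated with a small least-squares fit. This is only a sketch on synthetic, noise-free data in arbitrary units: the function name `fit_shedding_law` and all numerical values are hypothetical and are not taken from the paper.

```python
import numpy as np


def fit_shedding_law(v, f_v):
    """Least-squares fit of f_v = a * (v - v_c); returns (a, v_c)."""
    # A straight line f_v = slope * v + intercept has a = slope
    # and v_c = -intercept / slope.
    slope, intercept = np.polyfit(v, f_v, 1)
    return slope, -intercept / slope


# Synthetic example data (hypothetical values, arbitrary units).
a_true, v_c_true = 2.0, 0.5
v = np.linspace(0.6, 1.5, 10)      # obstacle velocities above v_c
f_v = a_true * (v - v_c_true)      # ideal shedding frequencies

a_fit, v_c_fit = fit_shedding_law(v, f_v)
```

With measured data, the same fit would return the proportionality constant and the critical velocity as the slope and the zero crossing of the fitted line.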
-
High-Dimensional Sparse Bayesian Learning without Covariance Matrices
Authors:
Alexander Lin,
Andrew H. Song,
Berkin Bilgic,
Demba Ba
Abstract:
Sparse Bayesian learning (SBL) is a powerful framework for tackling the sparse coding problem. However, the most popular inference algorithms for SBL become too expensive for high-dimensional settings, due to the need to store and compute a large covariance matrix. We introduce a new inference scheme that avoids explicit construction of the covariance matrix by solving multiple linear systems in parallel to obtain the posterior moments for SBL. Our approach couples a little-known diagonal estimation result from numerical linear algebra with the conjugate gradient algorithm. On several simulations, our method scales better than existing approaches in computation time and memory, especially for structured dictionaries capable of fast matrix-vector multiplication.
Submitted 25 February, 2022;
originally announced February 2022.
-
Synthetic Hall ladder with tunable magnetic flux
Authors:
Jeong Ho Han,
Dalmin Bae,
Yong-il Shin
Abstract:
We describe a synthetic three-leg Hall ladder system with a tunable magnetic flux for neutral $^{173}$Yb atoms in a one-dimensional optical lattice. The ladder legs are formed by three hyperfine ground spin states of the atoms, and the complex interleg links are generated through Raman couplings between the spin states using multiple laser beams. The effective magnetic flux through a ladder plaquette, $φ$, is controlled by the angles of the Raman laser beams with the lattice axis. We investigate the quench dynamics of the Hall ladder system for $φ\approx\frac{π}{3}, \frac{π}{2},$ and $\frac{2π}{3}$ after a sudden application of the Raman coupling in various interleg link configurations. The semi-classical trajectory of the atoms in the plane of the spin composition and lattice position exhibits the characteristic motion for the effective magnetic field. In a tube configuration with the three legs cyclically linked, the quench evolution was observed to be substantially damped, which is attributed to the random flux threading the Hall tube.
Submitted 17 January, 2022;
originally announced January 2022.
-
Universal Early Coarsening of Quenched Bose Gases
Authors:
Junhong Goo,
Yangheon Lee,
Younghoon Lim,
Dalmin Bae,
Tenzin Rabga,
Yong-il Shin
Abstract:
We investigate the early coarsening dynamics of an atomic Bose gas quenched into a superfluid phase. Using a two-step quench protocol, we effectively control the cooling rates, $r_1$ and $r_2$, during and after passing through the critical region, respectively, and measure the number of quantum vortices spontaneously created in the system. The latter cooling rate $r_2$ regulates the temperature during the condensate growth, consequently controlling the early coarsening dynamics in the defect formation. We find that the defect number shows a scaling behavior with $r_2$ regardless of the initial cooling rate $r_1$, indicating universal coarsening dynamics in the early stage of condensate growth. Our results demonstrate that early coarsening not only reduces the defect density but also affects its scaling with the quench rate, which is beyond the Kibble-Zurek mechanism.
Submitted 10 December, 2021;
originally announced December 2021.
-
Mixture Model Auto-Encoders: Deep Clustering through Dictionary Learning
Authors:
Alexander Lin,
Andrew H. Song,
Demba Ba
Abstract:
State-of-the-art approaches for clustering high-dimensional data utilize deep auto-encoder architectures. Many of these networks require a large number of parameters and suffer from a lack of interpretability, due to the black-box nature of the auto-encoders. We introduce Mixture Model Auto-Encoders (MixMate), a novel architecture that clusters data by performing inference on a generative model. Derived from the perspective of sparse dictionary learning and mixture models, MixMate comprises several auto-encoders, each tasked with reconstructing data in a distinct cluster, while enforcing sparsity in the latent space. Through experiments on various image datasets, we show that MixMate achieves competitive performance compared to state-of-the-art deep clustering algorithms, while using orders of magnitude fewer parameters.
Submitted 25 February, 2022; v1 submitted 9 October, 2021;
originally announced October 2021.
-
A two-step machine learning approach for crop disease detection: an application of GAN and UAV technology
Authors:
Aaditya Prasad,
Nikhil Mehta,
Matthew Horak,
Wan D. Bae
Abstract:
Automated plant diagnosis is a technology that promises large increases in cost-efficiency for agriculture. However, multiple problems reduce the effectiveness of drones, including the inverse relationship between resolution and speed and the lack of adequate labeled training data. This paper presents a two-step machine learning approach that analyzes low-fidelity and high-fidelity images in sequence, preserving efficiency as well as accuracy. Two data-generators are also used to minimize class imbalance in the high-fidelity dataset and to produce low-fidelity data that is representative of UAV images. The analysis of applications and methods is conducted on a database of high-fidelity apple tree images which are corrupted with class imbalance. The application begins by generating high-fidelity data using generative networks and then uses this novel data alongside the original high-fidelity data to produce low-fidelity images. A machine-learning identifier identifies plants and labels them as potentially diseased or not. A machine learning classifier is then given the potentially diseased plant images and returns actual diagnoses for these plants. The results show an accuracy of 96.3% for the high-fidelity system and a 75.5% confidence level for our low-fidelity system. Our drone technology shows promising results in accuracy when compared to labor-based methods of diagnosis.
Submitted 18 September, 2021;
originally announced September 2021.
-
Reverberation Mapping of Two Luminous Quasars: the Broad-line Region Structure and Black Hole Mass
Authors:
Sha-Sha Li,
Sen Yang,
Zi-Xu Yang,
Yong-Jie Chen,
Yu-Yang Songsheng,
He-Zhen Liu,
Pu Du,
Bin Luo,
Zhe Yu,
Chen Hu,
Bo-Wei Jiang,
Dong-Wei Bao,
Wei-Jian Guo,
Zhi-Xiang Zhang,
Yan-Rong Li,
Ming Xiao,
Kai-Xing Lu,
Luis C. Ho,
Jing-Min Bai,
Wei-Hao Bian,
Jesús Aceituno,
Takeo Minezaki,
Mitsuru Kokubo,
Jian-Min Wang
Abstract:
We report the results of a multi-year spectroscopic and photometric monitoring campaign of two luminous quasars, PG~0923+201 and PG~1001+291, both located at the high-luminosity end of the broad-line region (BLR) size-luminosity relation with optical luminosities above $10^{45}~{\rm erg~s^{-1}}$. PG~0923+201 is for the first time monitored, and PG~1001+291 was previously monitored but our campaign has a much longer temporal baseline. We detect time lags of variations of the broad H$β$, H$γ$, Fe {\sc ii} lines with respect to those of the 5100~Å continuum. The velocity-resolved delay map of H$β$ in PG~0923+201 indicates a complicated structure with a mix of Keplerian disk-like motion and outflow, and the map of H$β$ in PG~1001+291 shows a signature of Keplerian disk-like motion. Assuming a virial factor of $f_{\rm BLR}=1$ and FWHM line widths, we measure the black hole mass to be $118_{-16}^{+11}\times 10^7 M_{\odot}$ for PG~0923+201 and $3.33_{-0.54}^{+0.62}\times 10^7 M_{\odot}$ for PG~1001+291. Their respective accretion rates are estimated to be $0.21_{-0.07}^{+0.06} \times L_{\rm Edd}\,c^{-2}$ and $679_{-227}^{+259}\times L_{\rm Edd}\,c^{-2}$, indicating that PG~0923+201 is a sub-Eddington accretor and PG~1001+291 is a super-Eddington accretor. While the H$β$ time lag of PG~0923+201 agrees with the size-luminosity relation, the time lag of PG~1001+291 shows a significant deviation, confirming that in high-luminosity AGN the BLR size depends on both luminosity and Eddington ratio. Black hole mass estimates from single AGN spectra will be over-estimated at high luminosities and redshifts if this effect is not taken into account.
Submitted 10 June, 2021;
originally announced June 2021.
-
Stable and Interpretable Unrolled Dictionary Learning
Authors:
Bahareh Tolooshams,
Demba Ba
Abstract:
The dictionary learning problem, representing data as a combination of a few atoms, has long stood as a popular method for learning representations in statistics and signal processing. The most popular dictionary learning algorithm alternates between sparse coding and dictionary update steps, and a rich literature has studied its theoretical convergence. The success of dictionary learning relies on access to a "good" initial estimate of the dictionary and the ability of the sparse coding step to provide an unbiased estimate of the code. The growing popularity of unrolled sparse coding networks has led to the empirical finding that backpropagation through such networks performs dictionary learning. We offer the theoretical analysis of these empirical results through PUDLE, a Provable Unrolled Dictionary LEarning method. We provide conditions on the network initialization and data distribution sufficient to recover and preserve the support of the latent code. Additionally, we address two challenges; first, the vanilla unrolled sparse coding computes a biased code estimate, and second, gradients during backpropagated learning can become unstable. We show approaches to reduce the bias of the code estimate in the forward pass, and that of the dictionary estimate in the backward pass. We propose strategies to resolve the learning instability by tuning network parameters and modifying the loss function. Overall, we highlight the impact of loss, unrolling, and backpropagation on convergence. We complement our findings through synthetic and image denoising experiments. Finally, we demonstrate PUDLE's interpretability, a driving factor in designing deep networks based on iterative optimizations, by building a mathematical relation between network weights, its output, and the training set.
Submitted 2 August, 2022; v1 submitted 31 May, 2021;
originally announced June 2021.
-
Covariance-Free Sparse Bayesian Learning
Authors:
Alexander Lin,
Andrew H. Song,
Berkin Bilgic,
Demba Ba
Abstract:
Sparse Bayesian learning (SBL) is a powerful framework for tackling the sparse coding problem while also providing uncertainty quantification. The most popular inference algorithms for SBL exhibit prohibitively large computational costs for high-dimensional problems due to the need to maintain a large covariance matrix. To resolve this issue, we introduce a new method for accelerating SBL inference -- named covariance-free expectation maximization (CoFEM) -- that avoids explicit computation of the covariance matrix. CoFEM solves multiple linear systems to obtain unbiased estimates of the posterior statistics needed by SBL. This is accomplished by exploiting innovations from numerical linear algebra such as preconditioned conjugate gradient and a little-known diagonal estimation rule. For a large class of compressed sensing matrices, we provide theoretical justifications for why our method scales well in high-dimensional settings. Through simulations, we show that CoFEM can be up to thousands of times faster than existing baselines without sacrificing coding accuracy. Through applications to calcium imaging deconvolution and multi-contrast MRI reconstruction, we show that CoFEM enables SBL to tractably tackle high-dimensional sparse coding problems of practical interest.
Submitted 8 April, 2022; v1 submitted 21 May, 2021;
originally announced May 2021.
-
Nanoscale Phonon Spectroscopy Reveals Emergent Interface Vibrational Structure of Superlattices
Authors:
Eric R. Hoglund,
De-Liang Bao,
Andrew O'Hara,
Sara Makarem,
Zachary T. Piontkowski,
Joseph R. Matson,
Ajay K. Yadav,
Ryan C. Haisimaier,
Roman Engel-Herbert,
Jon F. Ihlefeld,
Jayakanth Ravichandran,
Ramamoorthy Ramesh,
Joshua D. Caldwell,
Thomas E. Beechem,
John A. Tomko,
Jordan A. Hachtel,
Sokrates T. Pantelides,
Patrick E. Hopkins,
James M. Howe
Abstract:
As the length-scales of materials decrease, heterogeneities associated with interfaces approach the importance of the surrounding materials. Emergent electronic and magnetic interface properties in superlattices have been studied extensively by both experiments and theory. $^{1-6}$ However, the presence of interfacial vibrations that impact phonon-mediated responses, like thermal conductivity $^{7,8}$, has only been inferred in experiments indirectly. While it is accepted that intrinsic phonons change near boundaries $^{9,10}$, the physical mechanisms and length-scales through which interfacial effects influence materials remain unclear. Herein, we demonstrate the localized vibrational response associated with the interfaces in SrTiO$_3$-CaTiO$_3$ superlattices by combining advanced scanning transmission electron microscopy imaging and spectroscopy and density-functional-theory calculations. Symmetries atypical of either constituent material are observed within a few atomic planes near the interface. The local symmetries create local phonon modes that determine the global response of the superlattice once the spacing of the interfaces approaches the phonon spatial extent. The results provide direct visualization and quantification, illustrating the progression of the local symmetries and interface vibrations as they come to determine the vibrational response of an entire superlattice; stated differently, the progression from a material with interfaces, to a material dominated by interfaces, to a material of interfaces as the period decreases. Direct observation of such local atomic and vibrational phenomena demonstrates that their spatial extent needs to be quantified to understand macroscopic behavior. Tailoring interfaces, and knowing their local vibrational response, provides a means of pursuing designer solids having emergent infrared and thermal responses.
Submitted 4 October, 2021; v1 submitted 20 May, 2021;
originally announced May 2021.
-
AGN STORM 2: I. First results: A Change in the Weather of Mrk 817
Authors:
Erin Kara,
Missagh Mehdipour,
Gerard A. Kriss,
Edward M. Cackett,
Nahum Arav,
Aaron J. Barth,
Doyee Byun,
Michael S. Brotherton,
Gisella De Rosa,
Jonathan Gelbord,
Juan V. Hernandez Santisteban,
Chen Hu,
Jelle Kaastra,
Hermine Landt,
Yan-Rong Li,
Jake A. Miller,
John Montano,
Ethan Partington,
Jesus Aceituno,
Jin-Ming Bai,
Dongwei Bao,
Misty C. Bentz,
Thomas G. Brink,
Doron Chelouche,
Yong-Jie Chen
, et al. (47 additional authors not shown)
Abstract:
We present the first results from the ongoing, intensive, multi-wavelength monitoring program of the luminous Seyfert 1 galaxy Mrk 817. While this AGN was, in part, selected for its historically unobscured nature, we discovered that the X-ray spectrum is highly absorbed, and there are new blueshifted, broad and narrow UV absorption lines, which suggest that a dust-free, ionized obscurer located at the inner broad line region partially covers the central source. Despite the obscuration, we measure UV and optical continuum reverberation lags consistent with a centrally illuminated Shakura-Sunyaev thin accretion disk, and measure reverberation lags associated with the optical broad line region, as expected. However, in the first 55 days of the campaign, when the obscuration was becoming most extreme, we observe a de-coupling of the UV continuum and the UV broad emission line variability. The correlation recovers in the next 42 days of the campaign, as Mrk 817 enters a less obscured state. The short CIV and Ly alpha lags suggest that the accretion disk extends beyond the UV broad line region.
Submitted 12 May, 2021;
originally announced May 2021.
-
Asymptotic properties in the Probit-Zero-inflated Binomial regression model
Authors:
Aba Diop,
Demba Bocar Ba,
Fatimata Lo
Abstract:
Zero-inflated regression models have had wide application recently and have proven useful in modeling data with many zeros. The zero-inflated Binomial (ZIB) regression model is an extension of the ordinary binomial distribution that takes into account the excess of zeros. In comparing the probit model to the logistic model, many authors believe that there is little theoretical justification in choosing one formulation over the other in most circumstances involving binary responses. The logit model is considered to be computationally simpler but it is based on a more restrictive assumption of error independence, although many other generalizations have dealt with that assumption as well. By contrast, the probit model assumes that random errors have a multivariate normal distribution. This assumption makes the probit model attractive because the normal distribution provides a good approximation to many other distributions. In this paper, we develop a maximum likelihood estimation procedure for the parameters of a zero-inflated Binomial regression model with probit link function for both components of the model. We establish the existence, consistency and asymptotic normality of the proposed estimator.
Submitted 2 May, 2021;
originally announced May 2021.
-
Weighed $\ell_1$ on the simplex: Compressive sensing meets locality
Authors:
Abiy Tasissa,
Pranay Tankala,
Demba Ba
Abstract:
Sparse manifold learning algorithms combine techniques in manifold learning and sparse optimization to learn features that could be utilized for downstream tasks. The standard setting of compressive sensing cannot be immediately applied to this setup. Due to the intrinsic geometric structure of data, dictionary atoms might be redundant and do not satisfy the restricted isometry property or coherence condition. In addition, manifold learning emphasizes learning local geometry which is not reflected in a standard $\ell_1$ minimization problem. We propose weighted $\ell_0$ and weighted $\ell_1$ metrics that encourage representation via neighborhood atoms suited for dictionary based manifold learning. Assuming that the data is generated from a Delaunay triangulation, we show the equivalence of weighted $\ell_1$ and weighted $\ell_0$. We discuss an optimization program that learns the dictionaries and sparse coefficients and demonstrate the utility of our regularization on synthetic and real datasets.
Submitted 28 April, 2021;
originally announced April 2021.
-
Multi-Wavelength Monitoring and Reverberation Mapping of a Changing Look Event in the Seyfert Galaxy NGC 3516
Authors:
V. L. Oknyansky,
M. S. Brotherton,
S. S. Tsygankov,
A. V. Dodin,
D. -W. Bao,
B. -X. Zhao,
P. Du,
M. A. Burlak,
N. P. Ikonnikova,
A. M. Tatarnikov,
A. A. Belinski,
A. A. Fedoteva,
N. I. Shatsky,
E. O. Mishin,
S. G. Zheltouhov,
S. A. Potanin,
J. -M. Wang,
J. N. McLane,
H. A. Kobulnicky,
D. A. Dale,
T. E. Zastrocky,
J. Maithil,
K. A. Olson,
C. Adelman,
Z. Carter
, et al. (4 additional authors not shown)
Abstract:
We present the results of photometric and spectroscopic monitoring campaigns of the changing look AGN NGC 3516 carried out in 2018 to 2020 covering the wavelength range from the X-ray to the optical. The facilities included the telescopes of the CMO SAI MSU, the 2.3-m WIRO telescope, and the XRT and UVOT of Swift. We found that NGC 3516 brightened to a high state and could be classified as Sy1.5 during the late spring of 2020. We have measured time delays in the responses of the Balmer and He II 4686 lines to continuum variations. In the case of the best-characterized broad H-beta line, the delay to continuum variability is about 17 days in the blue wing and is clearly shorter, 9 days, in the red, which is suggestive of inflow. As the broad lines strengthened, the blue side came to dominate the Balmer lines, resulting in very asymmetric profiles with blueshifted peaks during this high state. During the outburst the X-ray flux reached its maximum on 1 April 2020 and it was the highest value ever observed for NGC 3516 by the Swift observatory. The X-ray hard photon index became softer, about 1.8 in the maximum on 21 Apr 2020 compared to the mean about 0.7 during earlier epochs before 2020. We have found that the UV and optical variations correlated well (with a small time delay of 1-2 days) with the X-ray until the beginning of April 2020, but later, until the end of Jun. 2020, these variations were not correlated. We suggest that this fact may be a consequence of partial obscuration by Compton-thick clouds crossing the line of sight.
Submitted 11 May, 2021; v1 submitted 22 April, 2021;
originally announced April 2021.
-
Gaussian Process Convolutional Dictionary Learning
Authors:
Andrew H. Song,
Bahareh Tolooshams,
Demba Ba
Abstract:
Convolutional dictionary learning (CDL), the problem of estimating shift-invariant templates from data, is typically conducted in the absence of a prior/structure on the templates. In data-scarce or low signal-to-noise ratio (SNR) regimes, learned templates overfit the data and lack smoothness, which can affect the predictive performance of downstream tasks. To address this limitation, we propose GPCDL, a convolutional dictionary learning framework that enforces priors on templates using Gaussian Processes (GPs). With the focus on smoothness, we show theoretically that imposing a GP prior is equivalent to Wiener filtering the learned templates, thereby suppressing high-frequency components and promoting smoothness. We show that the algorithm is a simple extension of the classical iteratively reweighted least squares algorithm, independent of the choice of GP kernels. This property allows one to experiment flexibly with different smoothness assumptions. Through simulation, we show that GPCDL learns smooth dictionaries with better accuracy than the unregularized alternative across a range of SNRs. Through an application to neural spiking data, we show that GPCDL learns a more accurate and visually-interpretable smooth dictionary, leading to superior predictive performance compared to non-regularized CDL, as well as parametric alternatives.
Submitted 24 November, 2021; v1 submitted 28 March, 2021;
originally announced April 2021.
-
On the convergence of group-sparse autoencoders
Authors:
Emmanouil Theodosis,
Bahareh Tolooshams,
Pranay Tankala,
Abiy Tasissa,
Demba Ba
Abstract:
Recent approaches in the theoretical analysis of model-based deep learning architectures have studied the convergence of gradient descent in shallow ReLU networks that arise from generative models whose hidden layers are sparse. Motivated by the success of architectures that impose structured forms of sparsity, we introduce and study a group-sparse autoencoder that accounts for a variety of generative models, and utilizes a group-sparse ReLU activation function to force the non-zero units at a given layer to occur in blocks. For clustering models, inputs that result in the same group of active units belong to the same cluster. We proceed to analyze the gradient dynamics of a shallow instance of the proposed autoencoder, trained with data adhering to a group-sparse generative model. In this setting, we theoretically prove the convergence of the network parameters to a neighborhood of the generating matrix. We validate our model through numerical analysis and highlight the superior performance of networks with a group-sparse ReLU compared to networks that utilize traditional ReLUs, both in sparse coding and in parameter recovery tasks. We also provide real data experiments to corroborate the simulated results, and emphasize the clustering capabilities of structured sparsity models.
Submitted 21 January, 2022; v1 submitted 13 February, 2021;
originally announced February 2021.
-
K-Deep Simplex: Deep Manifold Learning via Local Dictionaries
Authors:
Pranay Tankala,
Abiy Tasissa,
James M. Murphy,
Demba Ba
Abstract:
We propose K-Deep Simplex (KDS) which, given a set of data points, learns a dictionary comprising synthetic landmarks, along with representation coefficients supported on a simplex. KDS integrates manifold learning and sparse coding/dictionary learning by combining a reconstruction term, as in classical dictionary learning, with a novel local weighted $\ell_1$ penalty that encourages each data point to represent itself as a convex combination of nearby landmarks. We solve the proposed optimization program using alternating minimization and design an efficient, interpretable autoencoder using algorithm unrolling. We theoretically analyze the proposed program by relating the weighted $\ell_1$ penalty in KDS to a weighted $\ell_0$ program. Assuming that the data are generated from a Delaunay triangulation, we prove the equivalence of the weighted $\ell_1$ and weighted $\ell_0$ programs. If the representation coefficients are given, we prove that the resulting dictionary is unique. Further, we show that low-dimensional representations can be efficiently obtained from the covariance of the coefficient matrix. We apply KDS to the unsupervised clustering problem and prove theoretical performance guarantees. Experiments show that the algorithm is highly efficient and performs competitively on synthetic and real data sets.
Submitted 14 January, 2023; v1 submitted 3 December, 2020;
originally announced December 2020.
-
Monitoring AGNs with Hβ Asymmetry. II. Reverberation Mapping of Three Seyfert Galaxies Historically Displaying Hβ Profiles with Changing Asymmetry: Mrk 79, NGC 3227, and Mrk 841
Authors:
Michael S. Brotherton,
Pu Du,
Ming Xiao,
Dong-Wei Bao,
Bixuan Zhao,
Jacob N. McLane,
Kianna A. Olson,
Kai Wang,
Zheng-Peng Huang,
Chen Hu,
David H. Kasper,
William T. Chick,
My L. Nguyen,
Jaya Maithil,
Derek Hand,
Yan-Rong Li,
Luis C. Ho,
Jin-Ming Bai,
Wei-Hao Bian,
Jian-Min Wang
Abstract:
We report the results of reverberation mapping three bright Seyfert galaxies, Mrk 79, NGC 3227, and Mrk 841, from a campaign conducted from December 2016 to May 2017 with the Wyoming Infrared Observatory (WIRO) 2.3-meter telescope. All three of these targets have shown asymmetric broad H$β$ emission lines in the past, although their emission lines were relatively symmetric during our observations. We measured Hβ time lags for all three targets and estimated masses of their black holes -- for the first time in the case of Mrk 841. For Mrk 79 and NGC 3227, the data are of sufficient quality to resolve distinct time lags as a function of velocity and to compute two-dimensional velocity-delay maps. Mrk 79 shows smaller time lags for high-velocity gas but the distribution is not symmetric, and its complex velocity-delay map could result from the combination of both inflowing and outflowing Hβ emitting disks that may be part of a single larger structure. NGC 3227 shows the largest time lags for blueshifted gas and the two-dimensional velocity-delay map suggests a disk with some inflow. We compare our results with previous work and find evidence for different time lags despite similar luminosities, as well as evolving broad line region structures.
Submitted 11 November, 2020;
originally announced November 2020.