-
J-Guard: Journalism Guided Adversarially Robust Detection of AI-generated News
Authors:
Tharindu Kumarage,
Amrita Bhattacharjee,
Djordje Padejski,
Kristy Roschke,
Dan Gillmor,
Scott Ruston,
Huan Liu,
Joshua Garland
Abstract:
The rapid proliferation of AI-generated text online is profoundly resha** the information landscape. Among various types of AI-generated text, AI-generated news presents a significant threat as it can be a prominent source of misinformation online. While several recent efforts have focused on detecting AI-generated text in general, these methods require enhanced reliability, given concerns about…
▽ More
The rapid proliferation of AI-generated text online is profoundly resha** the information landscape. Among various types of AI-generated text, AI-generated news presents a significant threat as it can be a prominent source of misinformation online. While several recent efforts have focused on detecting AI-generated text in general, these methods require enhanced reliability, given concerns about their vulnerability to simple adversarial attacks. Furthermore, due to the eccentricities of news writing, applying these detection methods for AI-generated news can produce false positives, potentially damaging the reputation of news organizations. To address these challenges, we leverage the expertise of an interdisciplinary team to develop a framework, J-Guard, capable of steering existing supervised AI text detectors for detecting AI-generated news while boosting adversarial robustness. By incorporating stylistic cues inspired by the unique journalistic attributes, J-Guard effectively distinguishes between real-world journalism and AI-generated news articles. Our experiments on news articles generated by a vast array of AI models, including ChatGPT (GPT3.5), demonstrate the effectiveness of J-Guard in enhancing detection capabilities while maintaining an average performance decrease of as low as 7% when faced with adversarial attacks.
△ Less
Submitted 6 September, 2023;
originally announced September 2023.
-
The QUATRO Application Suite: Quantum Computing for Models of Human Cognition
Authors:
Raghavendra Pradyumna Pothukuchi,
Leon Lufkin,
Yu Jun Shen,
Alejandro Simon,
Rome Thorstenson,
Bernardo Eilert Trevisan,
Michael Tu,
Mudi Yang,
Ben Foxman,
Viswanatha Srinivas Pothukuchi,
Gunnar Ep**,
Thi Ha Kyaw,
Bryant J Jongkees,
Yongshan Ding,
Jerome R Busemeyer,
Jonathan D Cohen,
Abhishek Bhattacharjee
Abstract:
Research progress in quantum computing has, thus far, focused on a narrow set of application domains. Expanding the suite of quantum application domains is vital for the discovery of new software toolchains and architectural abstractions. In this work, we unlock a new class of applications ripe for quantum computing research -- computational cognitive modeling. Cognitive models are critical to und…
▽ More
Research progress in quantum computing has, thus far, focused on a narrow set of application domains. Expanding the suite of quantum application domains is vital for the discovery of new software toolchains and architectural abstractions. In this work, we unlock a new class of applications ripe for quantum computing research -- computational cognitive modeling. Cognitive models are critical to understanding and replicating human intelligence. Our work connects computational cognitive models to quantum computer architectures for the first time. We release QUATRO, a collection of quantum computing applications from cognitive models. The development and execution of QUATRO shed light on gaps in the quantum computing stack that need to be closed to ease programming and drive performance. Among several contributions, we propose and study ideas pertaining to quantum cloud scheduling (using data from gate- and annealing-based quantum computers), parallelization, and more. In the long run, we expect our research to lay the groundwork for more versatile quantum computer systems in the future.
△ Less
Submitted 8 December, 2023; v1 submitted 1 September, 2023;
originally announced September 2023.
-
$W(0,b)$ algebra and the dual theory of 3D asymptotically flat higher spin gravity
Authors:
Nabamita Banerjee,
Arindam Bhattacharjee,
Surajit Biswas,
Arpita Mitra,
Debangshu Mukherjee
Abstract:
BMS algebra in three spacetime dimensions can be deformed into a two parameter family of algebra known as $W(a,b)$ algebra. For $a=0$, we show that other than $W(0,-1)$, no other $W(0,b)$ algebra admits a non-degenerate bilinear and thus one can not have a Chern-Simons gauge theory formulation with them. However, they may appear in a three-dimensional gravity description, where we also need to hav…
▽ More
BMS algebra in three spacetime dimensions can be deformed into a two parameter family of algebra known as $W(a,b)$ algebra. For $a=0$, we show that other than $W(0,-1)$, no other $W(0,b)$ algebra admits a non-degenerate bilinear and thus one can not have a Chern-Simons gauge theory formulation with them. However, they may appear in a three-dimensional gravity description, where we also need to have a spin 2 generator, that comes from the $(a=0,b=-1)$ sector. In the present work, we have demonstrated that the asymptotic symmetry algebra of a spin 3 gravity theory on flat spacetime has both the $W(0,-1)$ and $W(0,-2)$ algebras as subalgebras. We have also constructed a dual boundary field theory for this higher spin gravity theory by using the Chern-Simons/Wess-Zumino-Witten correspondence.
△ Less
Submitted 22 August, 2023;
originally announced August 2023.
-
Fighting Fire with Fire: Can ChatGPT Detect AI-generated Text?
Authors:
Amrita Bhattacharjee,
Huan Liu
Abstract:
Large language models (LLMs) such as ChatGPT are increasingly being used for various use cases, including text content generation at scale. Although detection methods for such AI-generated text exist already, we investigate ChatGPT's performance as a detector on such AI-generated text, inspired by works that use ChatGPT as a data labeler or annotator. We evaluate the zero-shot performance of ChatG…
▽ More
Large language models (LLMs) such as ChatGPT are increasingly being used for various use cases, including text content generation at scale. Although detection methods for such AI-generated text exist already, we investigate ChatGPT's performance as a detector on such AI-generated text, inspired by works that use ChatGPT as a data labeler or annotator. We evaluate the zero-shot performance of ChatGPT in the task of human-written vs. AI-generated text detection, and perform experiments on publicly available datasets. We empirically investigate if ChatGPT is symmetrically effective in detecting AI-generated or human-written text. Our findings provide insight on how ChatGPT and similar LLMs may be leveraged in automated detection pipelines by simply focusing on solving a specific aspect of the problem and deriving the rest from that solution. All code and data is available at https://github.com/AmritaBh/ChatGPT-as-Detector.
△ Less
Submitted 17 August, 2023; v1 submitted 2 August, 2023;
originally announced August 2023.
-
HyDe: A Hybrid PCM/FeFET/SRAM Device-search for Optimizing Area and Energy-efficiencies in Analog IMC Platforms
Authors:
Abhiroop Bhattacharjee,
Abhishek Moitra,
Priyadarshini Panda
Abstract:
Today, there are a plethora of In-Memory Computing (IMC) devices- SRAMs, PCMs & FeFETs, that emulate convolutions on crossbar-arrays with high throughput. Each IMC device offers its own pros & cons during inference of Deep Neural Networks (DNNs) on crossbars in terms of area overhead, programming energy and non-idealities. A design-space exploration is, therefore, imperative to derive a hybrid-dev…
▽ More
Today, there are a plethora of In-Memory Computing (IMC) devices- SRAMs, PCMs & FeFETs, that emulate convolutions on crossbar-arrays with high throughput. Each IMC device offers its own pros & cons during inference of Deep Neural Networks (DNNs) on crossbars in terms of area overhead, programming energy and non-idealities. A design-space exploration is, therefore, imperative to derive a hybrid-device architecture optimized for accurate DNN inference under the impact of non-idealities from multiple devices, while maintaining competitive area & energy-efficiencies. We propose a two-phase search framework (HyDe) that exploits the best of all worlds offered by multiple devices to determine an optimal hybrid-device architecture for a given DNN topology. Our hybrid models achieve upto 2.30-2.74x higher TOPS/mm^2 at 22-26% higher energy-efficiencies than baseline homogeneous models for a VGG16 DNN topology. We further propose a feasible implementation of the HyDe-derived hybrid-device architectures in the 2.5D design space using chiplets to reduce design effort and cost in the hardware fabrication involving multiple technology processes.
△ Less
Submitted 24 October, 2023; v1 submitted 1 August, 2023;
originally announced August 2023.
-
Cognitive Engagement for STEM+C Education: Investigating Serious Game Impact on Graph Structure Learning with fNIRS
Authors:
Shayla Sharmin,
Reza Koiler,
Rifat Sadik,
Arpan Bhattacharjee,
Priyanka Raju Patre,
Pinar Kullu,
Charles Hohensee,
Nancy Getchell,
Roghayeh Leila Barmaki
Abstract:
For serious games on education, understanding the effectiveness of different learning methods in influencing cognitive processes remains a significant challenge. This study investigates the impact of serious games on graph structure learning. For this, we compared our in-house game-based learning (GBL) and video-based learning (VBL) methodologies by evaluating their effectiveness on cognitive proc…
▽ More
For serious games on education, understanding the effectiveness of different learning methods in influencing cognitive processes remains a significant challenge. This study investigates the impact of serious games on graph structure learning. For this, we compared our in-house game-based learning (GBL) and video-based learning (VBL) methodologies by evaluating their effectiveness on cognitive processes by oxygenated hemoglobin levels using functional near-infrared spectroscopy (fNIRS). We conducted a 2 x 1 between subjects preliminary study with twelve participants, involving two conditions: game and video. Both groups received equivalent content related to the basic structure of a graph, with comparable session lengths. The game group interacted with a quiz-based game, while the video group watched a pre-recorded video. The fNIRS was employed to capture cerebral signals from the prefrontal cortex, and participants completed pre- and post- questionnaires capturing user experience and knowledge gain. In our study, we noted that the mean levels of oxygenated hemoglobin were higher in the GBL group, suggesting the potential enhanced cognitive involvement. Our results show that the lateral prefrontal cortex (LPFC) has greater hemodynamic activity during the learning period. Moreover, knowledge gain analysis showed an increase in mean score in the GBL group compared to the VBL group. Although we did not observe statistically significant changes due to participant variability and sample size, this preliminary work contributes to understanding how GBL and VBL impact cognitive processes, providing insights for enhanced instructional design and educational game development. Additionally, it emphasizes the necessity for further investigation into the impact of GBL on cognitive engagement and learning outcomes.
△ Less
Submitted 7 March, 2024; v1 submitted 25 July, 2023;
originally announced July 2023.
-
Semantic Segmentation of Porosity in 4D Spatio-Temporal X-ray μCT of Titanium Coated Ni wires using Deep Learning
Authors:
Pradyumna Elavarthi,
Arun Bhattacharjee,
Ashley Paz y Puente,
Anca Ralescu
Abstract:
A fully convolutional neural network was used to measure the evolution of the volume fraction of two different Kirkendall pores during the homogenization of Ti coated Ni wires. Traditional methods like Otsus thresholding and the largest connected component analysis were used to obtain the masks for training the segmentation model. Once trained, the model was used to semantically segment the two ty…
▽ More
A fully convolutional neural network was used to measure the evolution of the volume fraction of two different Kirkendall pores during the homogenization of Ti coated Ni wires. Traditional methods like Otsus thresholding and the largest connected component analysis were used to obtain the masks for training the segmentation model. Once trained, the model was used to semantically segment the two types of pores at different stages in their evolution. Masks of the pores predicted by the network were then used to measure the volume fraction of porosity at 0 mins, 240 mins, and 480 mins of homogenization. The model predicted an increase in porosity for one type of pore and a decrease in porosity for another type of pore due to pore sintering, and it achieved an F1 Score of 0.95.
△ Less
Submitted 24 June, 2023;
originally announced June 2023.
-
Fast ion transport in quasisymmetric equilibria in the presence of a resonant Alfvénic perturbation
Authors:
Elizabeth J. Paul,
Harry E. Mynick,
Amitava Bhattacharjee
Abstract:
Significant progress has been made in designing magnetic fields that provide excellent confinement of the guiding enter trajectories of alpha particles using quasisymmetry (QS). Given the reduction in this transport channel, we assess the impact of resonant Alfvén eigenmodes (AEs) on the guiding center motion. The AE amplitudes are chosen to be consistent with experimental measurements and large-s…
▽ More
Significant progress has been made in designing magnetic fields that provide excellent confinement of the guiding enter trajectories of alpha particles using quasisymmetry (QS). Given the reduction in this transport channel, we assess the impact of resonant Alfvén eigenmodes (AEs) on the guiding center motion. The AE amplitudes are chosen to be consistent with experimental measurements and large-scale simulations. We evaluate the drift resonance condition, phase-space island width, and island overlap criterion for quasisymmetric configurations. Kinetic Poincaré plots elucidate features of the transport, including stiff transport above a critical perturbation amplitude. Our analysis highlights key departures from the AE-driven transport in tokamaks, such as the avoidance of phase-space island overlap in quasihelical configurations and the enhanced transport due to wide phase-space islands in low magnetic shear configurations. In configurations that are closer to QS, with QS deviations $δB/B_0 \lesssim 10^{-3}$, the transport is primarily driven by the AE, while configurations that are further from QS, $δB/B_0 \sim 10^{-2}$, experience significant transport due to the QS-breaking fields in addition to the AE.
△ Less
Submitted 9 September, 2023; v1 submitted 21 June, 2023;
originally announced June 2023.
-
Examining the Role and Limits of Batchnorm Optimization to Mitigate Diverse Hardware-noise in In-memory Computing
Authors:
Abhiroop Bhattacharjee,
Abhishek Moitra,
Youngeun Kim,
Yeshwanth Venkatesha,
Priyadarshini Panda
Abstract:
In-Memory Computing (IMC) platforms such as analog crossbars are gaining focus as they facilitate the acceleration of low-precision Deep Neural Networks (DNNs) with high area- & compute-efficiencies. However, the intrinsic non-idealities in crossbars, which are often non-deterministic and non-linear, degrade the performance of the deployed DNNs. In addition to quantization errors, most frequently…
▽ More
In-Memory Computing (IMC) platforms such as analog crossbars are gaining focus as they facilitate the acceleration of low-precision Deep Neural Networks (DNNs) with high area- & compute-efficiencies. However, the intrinsic non-idealities in crossbars, which are often non-deterministic and non-linear, degrade the performance of the deployed DNNs. In addition to quantization errors, most frequently encountered non-idealities during inference include crossbar circuit-level parasitic resistances and device-level non-idealities such as stochastic read noise and temporal drift. In this work, our goal is to closely examine the distortions caused by these non-idealities on the dot-product operations in analog crossbars and explore the feasibility of a nearly training-less solution via crossbar-aware fine-tuning of batchnorm parameters in real-time to mitigate the impact of the non-idealities. This enables reduction in hardware costs in terms of memory and training energy for IMC noise-aware retraining of the DNN weights on crossbars.
△ Less
Submitted 28 May, 2023;
originally announced May 2023.
-
Mitigating Catastrophic Forgetting in Long Short-Term Memory Networks
Authors:
Ketaki Joshi,
Raghavendra Pradyumna Pothukuchi,
Andre Wibisono,
Abhishek Bhattacharjee
Abstract:
Continual learning on sequential data is critical for many machine learning (ML) deployments. Unfortunately, LSTM networks, which are commonly used to learn on sequential data, suffer from catastrophic forgetting and are limited in their ability to learn multiple tasks continually. We discover that catastrophic forgetting in LSTM networks can be overcome in two novel and readily-implementable ways…
▽ More
Continual learning on sequential data is critical for many machine learning (ML) deployments. Unfortunately, LSTM networks, which are commonly used to learn on sequential data, suffer from catastrophic forgetting and are limited in their ability to learn multiple tasks continually. We discover that catastrophic forgetting in LSTM networks can be overcome in two novel and readily-implementable ways -- separating the LSTM memory either for each task or for each target label. Our approach eschews the need for explicit regularization, hypernetworks, and other complex methods. We quantify the benefits of our approach on recently-proposed LSTM networks for computer memory access prefetching, an important sequential learning problem in ML-based computer system optimization. Compared to state-of-the-art weight regularization methods to mitigate catastrophic forgetting, our approach is simple, effective, and enables faster learning. We also show that our proposal enables the use of small, non-regularized LSTM networks for complex natural language processing in the offline learning scenario, which was previously considered difficult.
△ Less
Submitted 26 May, 2023;
originally announced May 2023.
-
XPert: Peripheral Circuit & Neural Architecture Co-search for Area and Energy-efficient Xbar-based Computing
Authors:
Abhishek Moitra,
Abhiroop Bhattacharjee,
Youngeun Kim,
Priyadarshini Panda
Abstract:
The hardware-efficiency and accuracy of Deep Neural Networks (DNNs) implemented on In-memory Computing (IMC) architectures primarily depend on the DNN architecture and the peripheral circuit parameters. It is therefore essential to holistically co-search the network and peripheral parameters to achieve optimal performance. To this end, we propose XPert, which co-searches network architecture in ta…
▽ More
The hardware-efficiency and accuracy of Deep Neural Networks (DNNs) implemented on In-memory Computing (IMC) architectures primarily depend on the DNN architecture and the peripheral circuit parameters. It is therefore essential to holistically co-search the network and peripheral parameters to achieve optimal performance. To this end, we propose XPert, which co-searches network architecture in tandem with peripheral parameters such as the type and precision of analog-to-digital converters, crossbar column sharing and the layer-specific input precision using an optimization-based design space exploration. Compared to VGG16 baselines, XPert achieves 10.24x (4.7x) lower EDAP, 1.72x (1.62x) higher TOPS/W,1.93x (3x) higher TOPS/mm2 at 92.46% (56.7%) accuracy for CIFAR10 (TinyImagenet) datasets. The code for this paper is available at https://github.com/Intelligent-Computing-Lab-Yale/XPert.
△ Less
Submitted 21 November, 2023; v1 submitted 30 March, 2023;
originally announced March 2023.
-
Resonant instabilities mediated by drag and electrostatic interactions in laboratory and astrophysical dusty plasmas
Authors:
Ben Y. Israeli,
Amitava Bhattacharjee,
Hong Qin
Abstract:
Dusty plasmas are known to support a diverse range of instabilities, including both generalizations of standard plasma instabilities and ones caused by effects specific to dusty systems. It has been recently demonstrated that a novel broad class of streaming instabilities, termed resonant drag instabilities (RDIs), can be attributed to a particular resonance phenomenon, manifested by defective eig…
▽ More
Dusty plasmas are known to support a diverse range of instabilities, including both generalizations of standard plasma instabilities and ones caused by effects specific to dusty systems. It has been recently demonstrated that a novel broad class of streaming instabilities, termed resonant drag instabilities (RDIs), can be attributed to a particular resonance phenomenon, manifested by defective eigenvalues of the linearized dust/fluid system. In this work, it is demonstrated that this resonance phenomenon is not unique to RDIs and can be used as a framework to understand a wider range of instabilities, termed resonant instabilities. Particular attention is given to the filamentary ionization instability seen in laboratory dusty plasmas and to the two-stream instability. It is shown that, due to the commonalities in underlying physics between the dust-ion-acoustic two-stream instability and the acoustic RDI, these instabilities should be relevant in strongly overlap** regimes in astrophysical dusty plasmas. It is proposed that a similar overlap in the experimental accessibility of these modes (and of the filamentary instability) allows for the possibility of experimental investigation in the laboratory of complex and astrophysically relevant instability dynamics.
△ Less
Submitted 20 July, 2023; v1 submitted 23 March, 2023;
originally announced March 2023.
-
DeeBBAA: A benchmark Deep Black Box Adversarial Attack against Cyber-Physical Power Systems
Authors:
Arnab Bhattacharjee,
Tapan K. Saha,
Ashu Verma,
Sukumar Mishra
Abstract:
An increased energy demand, and environmental pressure to accommodate higher levels of renewable energy and flexible loads like electric vehicles have led to numerous smart transformations in the modern power systems. These transformations make the cyber-physical power system highly susceptible to cyber-adversaries targeting its numerous operations. In this work, a novel black box adversarial atta…
▽ More
An increased energy demand, and environmental pressure to accommodate higher levels of renewable energy and flexible loads like electric vehicles have led to numerous smart transformations in the modern power systems. These transformations make the cyber-physical power system highly susceptible to cyber-adversaries targeting its numerous operations. In this work, a novel black box adversarial attack strategy is proposed targeting the AC state estimation operation of an unknown power system using historical data. Specifically, false data is injected into the measurements obtained from a small subset of the power system components which leads to significant deviations in the state estimates. Experiments carried out on the IEEE 39 bus and 118 bus test systems make it evident that the proposed strategy, called DeeBBAA, can evade numerous conventional and state-of-the-art attack detection mechanisms with very high probability.
△ Less
Submitted 15 March, 2023;
originally announced March 2023.
-
Best arm identification in rare events
Authors:
Anirban Bhattacharjee,
Sushant Vijayan,
Sandeep K Juneja
Abstract:
We consider the best arm identification problem in the stochastic multi-armed bandit framework where each arm has a tiny probability of realizing large rewards while with overwhelming probability the reward is zero. A key application of this framework is in online advertising where click rates of advertisements could be a fraction of a single percent and final conversion to sales, while highly pro…
▽ More
We consider the best arm identification problem in the stochastic multi-armed bandit framework where each arm has a tiny probability of realizing large rewards while with overwhelming probability the reward is zero. A key application of this framework is in online advertising where click rates of advertisements could be a fraction of a single percent and final conversion to sales, while highly profitable, may again be a small fraction of the click rates. Lately, algorithms for BAI problems have been developed that minimise sample complexity while providing statistical guarantees on the correct arm selection. As we observe, these algorithms can be computationally prohibitive. We exploit the fact that the reward process for each arm is well approximated by a Compound Poisson process to arrive at algorithms that are faster, with a small increase in sample complexity. We analyze the problem in an asymptotic regime as rarity of reward occurrence reduces to zero, and reward amounts increase to infinity. This helps illustrate the benefits of the proposed algorithm. It also sheds light on the underlying structure of the optimal BAI algorithms in the rare event setting.
△ Less
Submitted 14 March, 2023;
originally announced March 2023.
-
Stylometric Detection of AI-Generated Text in Twitter Timelines
Authors:
Tharindu Kumarage,
Joshua Garland,
Amrita Bhattacharjee,
Kirill Trapeznikov,
Scott Ruston,
Huan Liu
Abstract:
Recent advancements in pre-trained language models have enabled convenient methods for generating human-like text at a large scale. Though these generation capabilities hold great potential for breakthrough applications, it can also be a tool for an adversary to generate misinformation. In particular, social media platforms like Twitter are highly susceptible to AI-generated misinformation. A pote…
▽ More
Recent advancements in pre-trained language models have enabled convenient methods for generating human-like text at a large scale. Though these generation capabilities hold great potential for breakthrough applications, it can also be a tool for an adversary to generate misinformation. In particular, social media platforms like Twitter are highly susceptible to AI-generated misinformation. A potential threat scenario is when an adversary hijacks a credible user account and incorporates a natural language generator to generate misinformation. Such threats necessitate automated detectors for AI-generated tweets in a given user's Twitter timeline. However, tweets are inherently short, thus making it difficult for current state-of-the-art pre-trained language model-based detectors to accurately detect at what point the AI starts to generate tweets in a given Twitter timeline. In this paper, we present a novel algorithm using stylometric signals to aid detecting AI-generated tweets. We propose models corresponding to quantifying stylistic changes in human and AI tweets in two related tasks: Task 1 - discriminate between human and AI-generated tweets, and Task 2 - detect if and when an AI starts to generate tweets in a given Twitter timeline. Our extensive experiments demonstrate that the stylometric features are effective in augmenting the state-of-the-art AI-generated text detectors.
△ Less
Submitted 7 March, 2023;
originally announced March 2023.
-
Self-consistent simulation of compressional Alfvén eigenmodes excited by runaway electrons
Authors:
Chang Liu,
Andrey Lvovskiy,
Carlos Paz-Soldan,
Stephen C. Jardin,
Amitava Bhattacharjee
Abstract:
Alfvénic modes in the current quench (CQ) stage of the tokamak disruption have been observed in experiments. In DIII-D the excitation of these modes is associated with the presence of high-energy runaway electrons, and a strong mode excitation is often associated with the failure of RE plateau formation. In this work we present results of self-consistent kinetic-MHD simulations of RE-driven compre…
▽ More
Alfvénic modes in the current quench (CQ) stage of the tokamak disruption have been observed in experiments. In DIII-D the excitation of these modes is associated with the presence of high-energy runaway electrons, and a strong mode excitation is often associated with the failure of RE plateau formation. In this work we present results of self-consistent kinetic-MHD simulations of RE-driven compressional Alfvén eigenmodes (CAEs) in DIII-D disruption scenarios, providing an explanation of the CQ modes. Simulation results reveal that high energy trapped REs can have resonance with the Alfvén mode through their precession motion, and the resonance frequency is proportional to the energy of REs. The mode frequencies and their relationship with the RE energy are consistent with experimental observation. The perturbed magnetic fields from the modes can lead to spatial diffusion of runaway electrons including the nonresonant passing ones, thus providing the theoretical basis for a potential approach for runaway electron mitigation.
△ Less
Submitted 6 March, 2023;
originally announced March 2023.
-
Structure of pressure-gradient-driven current singularity in ideal magnetohydrodynamic equilibrium
Authors:
Yi-Min Huang,
Yao Zhou,
Joaquim Loizu,
Stuart Hudson,
Amitava Bhattacharjee
Abstract:
Singular currents typically appear on rational surfaces in non-axisymmetric ideal magnetohydrodynamic equilibria with a continuum of nested flux surfaces and a continuous rotational transform. These currents have two components: a surface current (Dirac $δ$-function in flux surface labeling) that prevents the formation of magnetic islands and an algebraically divergent Pfirsch--Schlüter current de…
▽ More
Singular currents typically appear on rational surfaces in non-axisymmetric ideal magnetohydrodynamic equilibria with a continuum of nested flux surfaces and a continuous rotational transform. These currents have two components: a surface current (Dirac $δ$-function in flux surface labeling) that prevents the formation of magnetic islands and an algebraically divergent Pfirsch--Schlüter current density when a pressure gradient is present across the rational surface. At flux surfaces adjacent to the rational surface, the traditional treatment gives the Pfirsch--Schlüter current density scaling as $J\sim1/Δι$, where $Δι$ is the difference of the rotational transform relative to the rational surface. If the distance $s$ between flux surfaces is proportional to $Δι$, the scaling relation $J\sim1/Δι\sim1/s$ will lead to a paradox that the Pfirsch--Schlüter current is not integrable. In this work, we investigate this issue by considering the pressure-gradient-driven singular current in the Hahm\textendash Kulsrud\textendash Taylor problem, which is a prototype for singular currents arising from resonant magnetic perturbations. We show that not only the Pfirsch--Schlüter current density but also the diamagnetic current density are divergent as $\sim1/Δι$. However, due to the formation of a Dirac $δ$-function current sheet at the rational surface, the neighboring flux surfaces are strongly packed with $s\sim(Δι)^{2}$. Consequently, the singular current density $J\sim1/\sqrt{s}$, making the total current finite, thus resolving the paradox.
△ Less
Submitted 9 February, 2024; v1 submitted 3 March, 2023;
originally announced March 2023.
-
Periodic Korteweg-de Vries soliton potentials generate magnetic field strength with excellent quasisymmetry
Authors:
W. Sengupta,
N. Nikulsin,
S. Buller,
R. Madan,
E. J. Paul,
R. Nies,
A. A. Kaptanoglu,
S. R. Hudson,
A. Bhattacharjee
Abstract:
Quasisymmetry (QS) is a hidden symmetry of the magnetic field strength, $B$, that confines charged particles effectively in a three-dimensional toroidal plasma equilibrium. Here, we show that QS has a deep connection to the underlying symmetry that makes solitons possible. Our approach uncovers a hidden lower dimensionality of $B$ on a magnetic flux surface, which could make stellarator optimizati…
▽ More
Quasisymmetry (QS) is a hidden symmetry of the magnetic field strength, $B$, that confines charged particles effectively in a three-dimensional toroidal plasma equilibrium. Here, we show that QS has a deep connection to the underlying symmetry that makes solitons possible. Our approach uncovers a hidden lower dimensionality of $B$ on a magnetic flux surface, which could make stellarator optimization schemes significantly more efficient. Recent numerical breakthroughs have yielded configurations with excellent volumetric QS and surprisingly low magnetic shear. Our approach elucidates why the magnetic shear is low in these configurations. Furthermore, we deduce an upper bound on the maximum toroidal volume that can be quasisymmetric and verify it for the Landreman-Paul precise quasiaxisymmetric (QA) stellarator configuration. In the neighborhood of the outermost surface, we show that the B approaches the form of the 1-soliton reflectionless potential. We present three independent approaches to demonstrate that quasisymmetric $B$ is described by well-known integrable systems such as the Korteweg-de Vries (KdV) equation. The first approach is weakly nonlinear multiscale perturbation theory, which highlights the crucial role that magnetic shear plays in QS. We show that the overdetermined problem of finding quasisymmetric vacuum fields admits solutions for which the rotational transform is not free but highly constrained. We obtain the KdV equation (and, more specifically, Gardner's equation for certain choices of parameters). Our second approach is non-perturbative and based on ensuring single-valuedness of $B$, which directly leads to its Painlevé property shared by the KdV equation. Our third approach uses machine learning, trained on a large dataset of numerically optimized quasisymmetric stellarators. We robustly recover the KdV (and Gardner's) equation from the data.
△ Less
Submitted 24 May, 2024; v1 submitted 27 February, 2023;
originally announced February 2023.
-
XploreNAS: Explore Adversarially Robust & Hardware-efficient Neural Architectures for Non-ideal Xbars
Authors:
Abhiroop Bhattacharjee,
Abhishek Moitra,
Priyadarshini Panda
Abstract:
Compute In-Memory platforms such as memristive crossbars are gaining focus as they facilitate acceleration of Deep Neural Networks (DNNs) with high area and compute-efficiencies. However, the intrinsic non-idealities associated with the analog nature of computing in crossbars limits the performance of the deployed DNNs. Furthermore, DNNs are shown to be vulnerable to adversarial attacks leading to…
▽ More
Compute In-Memory platforms such as memristive crossbars are gaining focus as they facilitate acceleration of Deep Neural Networks (DNNs) with high area and compute-efficiencies. However, the intrinsic non-idealities associated with the analog nature of computing in crossbars limits the performance of the deployed DNNs. Furthermore, DNNs are shown to be vulnerable to adversarial attacks leading to severe security threats in their large-scale deployment. Thus, finding adversarially robust DNN architectures for non-ideal crossbars is critical to the safe and secure deployment of DNNs on the edge. This work proposes a two-phase algorithm-hardware co-optimization approach called XploreNAS that searches for hardware-efficient & adversarially robust neural architectures for non-ideal crossbar platforms. We use the one-shot Neural Architecture Search (NAS) approach to train a large Supernet with crossbar-awareness and sample adversarially robust Subnets therefrom, maintaining competitive hardware-efficiency. Our experiments on crossbars with benchmark datasets (SVHN, CIFAR10 & CIFAR100) show upto ~8-16% improvement in the adversarial robustness of the searched Subnets against a baseline ResNet-18 model subjected to crossbar-aware adversarial training. We benchmark our robust Subnets for Energy-Delay-Area-Products (EDAPs) using the Neurosim tool and find that with additional hardware-efficiency driven optimizations, the Subnets attain ~1.5-1.6x lower EDAPs than ResNet-18 baseline.
△ Less
Submitted 15 April, 2023; v1 submitted 15 February, 2023;
originally announced February 2023.
-
DeepCAM: A Fully CAM-based Inference Accelerator with Variable Hash Lengths for Energy-efficient Deep Neural Networks
Authors:
Duy-Thanh Nguyen,
Abhiroop Bhattacharjee,
Abhishek Moitra,
Priyadarshini Panda
Abstract:
With ever increasing depth and width in deep neural networks to achieve state-of-the-art performance, deep learning computation has significantly grown, and dot-products remain dominant in overall computation time. Most prior works are built on conventional dot-product where weighted input summation is used to represent the neuron operation. However, another implementation of dot-product based on…
▽ More
With ever increasing depth and width in deep neural networks to achieve state-of-the-art performance, deep learning computation has significantly grown, and dot-products remain dominant in overall computation time. Most prior works are built on conventional dot-product where weighted input summation is used to represent the neuron operation. However, another implementation of dot-product based on the notion of angles and magnitudes in the Euclidean space has attracted limited attention. This paper proposes DeepCAM, an inference accelerator built on two critical innovations to alleviate the computation time bottleneck of convolutional neural networks. The first innovation is an approximate dot-product built on computations in the Euclidean space that can replace addition and multiplication with simple bit-wise operations. The second innovation is a dynamic size content addressable memory-based (CAM-based) accelerator to perform bit-wise operations and accelerate the CNNs with a lower computation time. Our experiments on benchmark image recognition datasets demonstrate that DeepCAM is up to 523x and 3498x faster than Eyeriss and traditional CPUs like Intel Skylake, respectively. Furthermore, the energy consumed by our DeepCAM approach is 2.16x to 109x less compared to Eyeriss.
△ Less
Submitted 9 February, 2023;
originally announced February 2023.
-
Towards Detecting Harmful Agendas in News Articles
Authors:
Melanie Subbiah,
Amrita Bhattacharjee,
Yilun Hua,
Tharindu Kumarage,
Huan Liu,
Kathleen McKeown
Abstract:
Manipulated news online is a growing problem which necessitates the use of automated systems to curtail its spread. We argue that while misinformation and disinformation detection have been studied, there has been a lack of investment in the important open challenge of detecting harmful agendas in news articles; identifying harmful agendas is critical to flag news campaigns with the greatest poten…
▽ More
Manipulated news online is a growing problem which necessitates the use of automated systems to curtail its spread. We argue that while misinformation and disinformation detection have been studied, there has been a lack of investment in the important open challenge of detecting harmful agendas in news articles; identifying harmful agendas is critical to flag news campaigns with the greatest potential for real world harm. Moreover, due to real concerns around censorship, harmful agenda detectors must be interpretable to be effective. In this work, we propose this new task and release a dataset, NewsAgendas, of annotated news articles for agenda identification. We show how interpretable systems can be effective on this task and demonstrate that they can perform comparably to black-box models.
△ Less
Submitted 2 August, 2023; v1 submitted 31 January, 2023;
originally announced February 2023.
-
Computational Solar Energy -- Ensemble Learning Methods for Prediction of Solar Power Generation based on Meteorological Parameters in Eastern India
Authors:
Debojyoti Chakraborty,
Jayeeta Mondal,
Hrishav Bakul Barua,
Ankur Bhattacharjee
Abstract:
The challenges in applications of solar energy lies in its intermittency and dependency on meteorological parameters such as; solar radiation, ambient temperature, rainfall, wind-speed etc., and many other physical parameters like dust accumulation etc. Hence, it is important to estimate the amount of solar photovoltaic (PV) power generation for a specific geographical location. Machine learning (…
▽ More
The challenges in applications of solar energy lies in its intermittency and dependency on meteorological parameters such as; solar radiation, ambient temperature, rainfall, wind-speed etc., and many other physical parameters like dust accumulation etc. Hence, it is important to estimate the amount of solar photovoltaic (PV) power generation for a specific geographical location. Machine learning (ML) models have gained importance and are widely used for prediction of solar power plant performance. In this paper, the impact of weather parameters on solar PV power generation is estimated by several Ensemble ML (EML) models like Bagging, Boosting, Stacking, and Voting for the first time. The performance of chosen ML algorithms is validated by field dataset of a 10kWp solar PV power plant in Eastern India region. Furthermore, a complete test-bed framework has been designed for data mining as well as to select appropriate learning models. It also supports feature selection and reduction for dataset to reduce space and time complexity of the learning models. The results demonstrate greater prediction accuracy of around 96% for Stacking and Voting EML models. The proposed work is a generalized one and can be very useful for predicting the performance of large-scale solar PV power plants also.
△ Less
Submitted 21 January, 2023;
originally announced January 2023.
-
A Multi-Site Accelerator-Rich Processing Fabric for Scalable Brain-Computer Interfacing
Authors:
Karthik Sriram,
Raghavendra Pradyumna Pothukuchi,
Michał Gerasimiuk,
Oliver Ye,
Muhammed Ugur,
Rajit Manohar,
Anurag Khandelwal,
Abhishek Bhattacharjee
Abstract:
Hull is an accelerator-rich distributed implantable Brain-Computer Interface (BCI) that reads biological neurons at data rates that are 2-3 orders of magnitude higher than the prior state of art, while supporting many neuroscientific applications. Prior approaches have restricted brain interfacing to tens of megabits per second in order to meet two constraints necessary for effective operation and…
▽ More
Hull is an accelerator-rich distributed implantable Brain-Computer Interface (BCI) that reads biological neurons at data rates that are 2-3 orders of magnitude higher than the prior state of art, while supporting many neuroscientific applications. Prior approaches have restricted brain interfacing to tens of megabits per second in order to meet two constraints necessary for effective operation and safe long-term implantation -- power dissipation under tens of milliwatts and response latencies in the tens of milliseconds. Hull also adheres to these constraints, but is able to interface with the brain at much higher data rates, thereby enabling, for the first time, BCI-driven research on and clinical treatment of brain-wide behaviors and diseases that require reading and stimulating many brain locations. Central to Hull's power efficiency is its realization as a distributed system of BCI nodes with accelerator-rich compute. Hull balances modular system layering with aggressive cross-layer hardware-software co-design to integrate compute, networking, and storage. The result is a lesson in designing networked distributed systems with hardware accelerators from the ground up.
△ Less
Submitted 8 January, 2023;
originally announced January 2023.
-
JT gravity from holographic reduction of 3D asymptotically flat spacetime
Authors:
Arindam Bhattacharjee,
Muktajyoti Saha
Abstract:
We attempt to understand the CFT$_1$ structure underlying (2+1)D gravity in flat spacetime via dimensional reduction. We observe that under superrotation, the hyperbolic (and dS$_2$) slices of flat spacetime transform to asymptotically (A)dS$_2$ slices. We consider a wedge region bounded by two such surfaces as End-of-the-World branes and employ Wedge holography to perform holographic reduction. W…
▽ More
We attempt to understand the CFT$_1$ structure underlying (2+1)D gravity in flat spacetime via dimensional reduction. We observe that under superrotation, the hyperbolic (and dS$_2$) slices of flat spacetime transform to asymptotically (A)dS$_2$ slices. We consider a wedge region bounded by two such surfaces as End-of-the-World branes and employ Wedge holography to perform holographic reduction. We show that once we consider fluctuating branes, the localised theory on the branes is Jackiw-Teitelboim (JT) theory. Finally, using the dual description of JT, we derive an 1D Schwarzian theory at the spatial slice of null infinity. In this dual Celestial (nearly) CFT, the superrotation mode of 3D plays the role of the Schwarzian derivative of the boundary time reparametrization mode.
△ Less
Submitted 2 February, 2023; v1 submitted 23 November, 2022;
originally announced November 2022.
-
Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach
Authors:
Siddhant Bhambri,
Amrita Bhattacharjee,
Dimitri Bertsekas
Abstract:
In this paper we address the solution of the popular Wordle puzzle, using new reinforcement learning methods, which apply more generally to adaptive control of dynamic systems and to classes of Partially Observable Markov Decision Process (POMDP) problems. These methods are based on approximation in value space and the rollout approach, admit a straightforward implementation, and provide improved…
▽ More
In this paper we address the solution of the popular Wordle puzzle, using new reinforcement learning methods, which apply more generally to adaptive control of dynamic systems and to classes of Partially Observable Markov Decision Process (POMDP) problems. These methods are based on approximation in value space and the rollout approach, admit a straightforward implementation, and provide improved performance over various heuristic approaches. For the Wordle puzzle, they yield on-line solution strategies that are very close to optimal at relatively modest computational cost. Our methods are viable for more complex versions of Wordle and related search problems, for which an optimal strategy would be impossible to compute. They are also applicable to a wide range of adaptive sequential decision problems that involve an unknown or frequently changing environment whose parameters are estimated on-line.
△ Less
Submitted 29 November, 2022; v1 submitted 14 November, 2022;
originally announced November 2022.
-
SpikeSim: An end-to-end Compute-in-Memory Hardware Evaluation Tool for Benchmarking Spiking Neural Networks
Authors:
Abhishek Moitra,
Abhiroop Bhattacharjee,
Runcong Kuang,
Gokul Krishnan,
Yu Cao,
Priyadarshini Panda
Abstract:
SNNs are an active research domain towards energy efficient machine intelligence. Compared to conventional ANNs, SNNs use temporal spike data and bio-plausible neuronal activation functions such as Leaky-Integrate Fire/Integrate Fire (LIF/IF) for data processing. However, SNNs incur significant dot-product operations causing high memory and computation overhead in standard von-Neumann computing pl…
▽ More
SNNs are an active research domain towards energy efficient machine intelligence. Compared to conventional ANNs, SNNs use temporal spike data and bio-plausible neuronal activation functions such as Leaky-Integrate Fire/Integrate Fire (LIF/IF) for data processing. However, SNNs incur significant dot-product operations causing high memory and computation overhead in standard von-Neumann computing platforms. Today, In-Memory Computing (IMC) architectures have been proposed to alleviate the "memory-wall bottleneck" prevalent in von-Neumann architectures. Although recent works have proposed IMC-based SNN hardware accelerators, the following have been overlooked- 1) the adverse effects of crossbar non-ideality on SNN performance due to repeated analog dot-product operations over multiple time-steps, 2) hardware overheads of essential SNN-specific components such as the LIF/IF and data communication modules. To this end, we propose SpikeSim, a tool that can perform realistic performance, energy, latency and area evaluation of IMC-mapped SNNs. SpikeSim consists of a practical monolithic IMC architecture called SpikeFlow for map** SNNs. Additionally, the non-ideality computation engine (NICE) and energy-latency-area (ELA) engine performs hardware-realistic evaluation of SpikeFlow-mapped SNNs. Based on 65nm CMOS implementation and experiments on CIFAR10, CIFAR100 and TinyImagenet datasets, we find that the LIF/IF neuronal module has significant area contribution (>11% of the total hardware area). We propose SNN topological modifications leading to 1.24x and 10x reduction in the neuronal module's area and the overall energy-delay-product value, respectively. Furthermore, in this work, we perform a holistic comparison between IMC implemented ANN and SNNs and conclude that lower number of time-steps are the key to achieve higher throughput and energy-efficiency for SNNs compared to 4-bit ANNs.
△ Less
Submitted 23 October, 2022;
originally announced October 2022.
-
Reconnection-Driven Energy Cascade in Magnetohydrodynamic Turbulence
Authors:
Chuanfei Dong,
Liang Wang,
Yi-Min Huang,
Luca Comisso,
Timothy A. Sandstrom,
Amitava Bhattacharjee
Abstract:
Magnetohydrodynamic turbulence regulates the transfer of energy from large to small scales in many astrophysical systems, including the solar atmosphere. We perform three-dimensional magnetohydrodynamic simulations with unprecedentedly large magnetic Reynolds number to reveal how rapid reconnection of magnetic field lines changes the classical paradigm of the turbulent energy cascade. By breaking…
▽ More
Magnetohydrodynamic turbulence regulates the transfer of energy from large to small scales in many astrophysical systems, including the solar atmosphere. We perform three-dimensional magnetohydrodynamic simulations with unprecedentedly large magnetic Reynolds number to reveal how rapid reconnection of magnetic field lines changes the classical paradigm of the turbulent energy cascade. By breaking elongated current sheets into chains of small magnetic flux ropes (or plasmoids), magnetic reconnection leads to a new range of turbulent energy cascade, where the rate of energy transfer is controlled by the growth rate of the plasmoids. As a consequence, the turbulent energy spectra steepen and attain a spectral index of -2.2 that is accompanied by changes in the anisotropy of turbulence eddies. The omnipresence of plasmoids and their consequences on, e.g., solar coronal heating, can be further explored with current and future spacecraft and telescopes.
△ Less
Submitted 19 October, 2022;
originally announced October 2022.
-
BanglaParaphrase: A High-Quality Bangla Paraphrase Dataset
Authors:
Ajwad Akil,
Najrin Sultana,
Abhik Bhattacharjee,
Rifat Shahriyar
Abstract:
In this work, we present BanglaParaphrase, a high-quality synthetic Bangla Paraphrase dataset curated by a novel filtering pipeline. We aim to take a step towards alleviating the low resource status of the Bangla language in the NLP domain through the introduction of BanglaParaphrase, which ensures quality by preserving both semantics and diversity, making it particularly useful to enhance other B…
▽ More
In this work, we present BanglaParaphrase, a high-quality synthetic Bangla Paraphrase dataset curated by a novel filtering pipeline. We aim to take a step towards alleviating the low resource status of the Bangla language in the NLP domain through the introduction of BanglaParaphrase, which ensures quality by preserving both semantics and diversity, making it particularly useful to enhance other Bangla datasets. We show a detailed comparative analysis between our dataset and models trained on it with other existing works to establish the viability of our synthetic paraphrase data generation pipeline. We are making the dataset and models publicly available at https://github.com/csebuetnlp/banglaparaphrase to further the state of Bangla NLP.
△ Less
Submitted 10 October, 2022;
originally announced October 2022.
-
Weak expectations of discrete quantum group algebras and crossed products
Authors:
Arnab Bhattacharjee,
Angshuman Bhattacharya
Abstract:
In this article we study analogues of the weak expectation property of discrete group C*-algebras and their crossed products, in the discrete quantum group setting, i.e., discrete quantum group C*-algebras and crossed products of C*-algebras with amenable discrete quantum groups.
In this article we study analogues of the weak expectation property of discrete group C*-algebras and their crossed products, in the discrete quantum group setting, i.e., discrete quantum group C*-algebras and crossed products of C*-algebras with amenable discrete quantum groups.
△ Less
Submitted 15 January, 2023; v1 submitted 30 August, 2022;
originally announced August 2022.
-
Radio Dichotomy in Quasars with H$β$ FWHM greater than $15,000$ km\,s$^{-1}$
Authors:
A. Chakraborty,
A. Bhattacharjee,
M. S. Brotherton,
R. Chatterjee,
S. Chatterjee,
M. Gilbert
Abstract:
It has been inferred from large unbiased samples that $10\%$-$15\%$ of all quasars are radio-loud (RL). Using the quasar catalog from the Sloan Digital Sky Survey, we show that the radio-loud fraction (RLF) for high broad line (HBL) quasars, containing H$β$ FWHM greater than $15,000$ km s$^{-1}$, is $\sim 57 \%$. While there is no significant difference between the RL and radio-quiet (RQ) populati…
▽ More
It has been inferred from large unbiased samples that $10\%$-$15\%$ of all quasars are radio-loud (RL). Using the quasar catalog from the Sloan Digital Sky Survey, we show that the radio-loud fraction (RLF) for high broad line (HBL) quasars, containing H$β$ FWHM greater than $15,000$ km s$^{-1}$, is $\sim 57 \%$. While there is no significant difference between the RL and radio-quiet (RQ) populations in our sample in terms of their black hole mass, Eddington ratio, and covering fraction (CF), optical continuum luminosity of the RL quasars are higher. The similarity in the distribution of their CF indicates that our analysis is unbiased in terms of the viewing angle of the HBL RL and RQ quasars. Hence, we conclude that the accretion disc luminosity of the RL quasars in our HBL sample is higher, which indicates a connection between a brighter disc and a more prominent jet. By comparing them with the non-HBL H$β$ broad emission line quasars, we find that the HBL sources have the lowest Eddington ratios in addition to having a very high RLF. That is consistent with the theories of jet formation, in which jets are launched from low Eddington ratio accreting systems. We find that the [O III] narrow emission line is stronger in the RL compared to RQ quasars in our HBL sample, which is consistent with previous findings in the literature, and may be caused by the interaction of the narrow line gas with the jet.
△ Less
Submitted 21 August, 2022;
originally announced August 2022.
-
Do chaotic field lines cause fast reconnection in coronal loops?
Authors:
Yi-Min Huang,
Amitava Bhattacharjee
Abstract:
Over the past decade, Boozer has argued that three-dimensional (3D) magnetic reconnection fundamentally differs from two-dimensional (2D) reconnection due to the fact that the separation between any pair of neighboring field lines almost always increases exponentially over distance in a 3D magnetic field. According to Boozer, this feature makes 3D field-line map** chaotic and exponentially sensi…
▽ More
Over the past decade, Boozer has argued that three-dimensional (3D) magnetic reconnection fundamentally differs from two-dimensional (2D) reconnection due to the fact that the separation between any pair of neighboring field lines almost always increases exponentially over distance in a 3D magnetic field. According to Boozer, this feature makes 3D field-line map** chaotic and exponentially sensitive to small non-ideal effects; consequently, 3D reconnection can occur without intense current sheets. We test Boozer's theory via ideal and resistive reduced magnetohydrodynamic simulations of the Boozer-Elder coronal loop model driven by sub-Alfvenic footpoint motions [A. H. Boozer and T. Elder, Physics of Plasmas 28, 062303 (2021)]. Our simulation results significantly differ from their predictions. The ideal simulation shows that Boozer and Elder under-predict the intensity of current density due to missing terms in their reduced model equations. Furthermore, resistive simulations of varying Lundquist numbers show that the maximal current density scales linearly rather than logarithmically with the Lundquist number.
△ Less
Submitted 25 November, 2022; v1 submitted 14 August, 2022;
originally announced August 2022.
-
Energetic particle loss mechanisms in reactor-scale equilibria close to quasisymmetry
Authors:
E. J. Paul,
A. Bhattacharjee,
M. Landreman,
D. Alex,
J. L. Velasco,
R. Nies
Abstract:
Collisionless physics primarily determines the transport of fusion-born alpha particles in 3D equilibria. Several transport mechanisms have been implicated in stellarator configurations, including stochastic diffusion due to class transitions, ripple trap**, and banana drift-convective orbits. Given the guiding center dynamics in a set of six quasihelical and quasiaxisymmetric equilibria, we per…
▽ More
Collisionless physics primarily determines the transport of fusion-born alpha particles in 3D equilibria. Several transport mechanisms have been implicated in stellarator configurations, including stochastic diffusion due to class transitions, ripple trap**, and banana drift-convective orbits. Given the guiding center dynamics in a set of six quasihelical and quasiaxisymmetric equilibria, we perform a classification of trap** states and transport mechanisms. In addition to banana drift convection and ripple transport, we observe substantial non-conservation of the parallel adiabatic invariant which can cause losses through diffusive banana tip motion. Furthermore, many lost trajectories undergo transitions between trap** classes on longer time scales, either with periodic or irregular behavior. We discuss possible optimization strategies for each of the relevant transport mechanisms. We perform a comparison between fast ion losses and metrics for the prevalence of mechanisms such as banana-drift convection [1], transitioning orbits, and wide orbit widths. Quasihelical configurations are found to have natural protection against ripple-trap** and diffusive banana tip motion leading to a reduction in prompt losses.
△ Less
Submitted 10 October, 2022; v1 submitted 3 August, 2022;
originally announced August 2022.
-
GEMv2: Multilingual NLG Benchmarking in a Single Line of Code
Authors:
Sebastian Gehrmann,
Abhik Bhattacharjee,
Abinaya Mahendiran,
Alex Wang,
Alexandros Papangelis,
Aman Madaan,
Angelina McMillan-Major,
Anna Shvets,
Ashish Upadhyay,
Bingsheng Yao,
Bryan Wilie,
Chandra Bhagavatula,
Chaobin You,
Craig Thomson,
Cristina Garbacea,
Dakuo Wang,
Daniel Deutsch,
Deyi Xiong,
Di **,
Dimitra Gkatzia,
Dragomir Radev,
Elizabeth Clark,
Esin Durmus,
Faisal Ladhak,
Filip Ginter
, et al. (52 additional authors not shown)
Abstract:
Evaluation in machine learning is usually informed by past choices, for example which datasets or metrics to use. This standardization enables the comparison on equal footing using leaderboards, but the evaluation choices become sub-optimal as better alternatives arise. This problem is especially pertinent in natural language generation which requires ever-improving suites of datasets, metrics, an…
▽ More
Evaluation in machine learning is usually informed by past choices, for example which datasets or metrics to use. This standardization enables the comparison on equal footing using leaderboards, but the evaluation choices become sub-optimal as better alternatives arise. This problem is especially pertinent in natural language generation which requires ever-improving suites of datasets, metrics, and human evaluation to make definitive claims. To make following best model evaluation practices easier, we introduce GEMv2. The new version of the Generation, Evaluation, and Metrics Benchmark introduces a modular infrastructure for dataset, model, and metric developers to benefit from each others work. GEMv2 supports 40 documented datasets in 51 languages. Models for all datasets can be evaluated online and our interactive data card creation and rendering tools make it easier to add new datasets to the living benchmark.
△ Less
Submitted 24 June, 2022; v1 submitted 22 June, 2022;
originally announced June 2022.
-
Examining the Robustness of Spiking Neural Networks on Non-ideal Memristive Crossbars
Authors:
Abhiroop Bhattacharjee,
Youngeun Kim,
Abhishek Moitra,
Priyadarshini Panda
Abstract:
Spiking Neural Networks (SNNs) have recently emerged as the low-power alternative to Artificial Neural Networks (ANNs) owing to their asynchronous, sparse, and binary information processing. To improve the energy-efficiency and throughput, SNNs can be implemented on memristive crossbars where Multiply-and-Accumulate (MAC) operations are realized in the analog domain using emerging Non-Volatile-Mem…
▽ More
Spiking Neural Networks (SNNs) have recently emerged as the low-power alternative to Artificial Neural Networks (ANNs) owing to their asynchronous, sparse, and binary information processing. To improve the energy-efficiency and throughput, SNNs can be implemented on memristive crossbars where Multiply-and-Accumulate (MAC) operations are realized in the analog domain using emerging Non-Volatile-Memory (NVM) devices. Despite the compatibility of SNNs with memristive crossbars, there is little attention to study on the effect of intrinsic crossbar non-idealities and stochasticity on the performance of SNNs. In this paper, we conduct a comprehensive analysis of the robustness of SNNs on non-ideal crossbars. We examine SNNs trained via learning algorithms such as, surrogate gradient and ANN-SNN conversion. Our results show that repetitive crossbar computations across multiple time-steps induce error accumulation, resulting in a huge performance drop during SNN inference. We further show that SNNs trained with a smaller number of time-steps achieve better accuracy when deployed on memristive crossbars.
△ Less
Submitted 20 June, 2022;
originally announced June 2022.
-
Exploring quantum properties of bipartite mixed states under coherent and incoherent basis
Authors:
Sovik Roy,
Anushree Bhattacharjee,
Chandrashekar Radhakrishnan,
Md. Manirul Ali,
Biplab Ghosh
Abstract:
Quantum coherence and quantum entanglement are two different manifestations of the superposition principle. In this article we show that the right choice of basis to be used to estimate coherence is the separable basis. The quantum coherence estimated using the Bell basis does not represent the coherence in the system, since there is a coherence in the system due to the choice of the basis states.…
▽ More
Quantum coherence and quantum entanglement are two different manifestations of the superposition principle. In this article we show that the right choice of basis to be used to estimate coherence is the separable basis. The quantum coherence estimated using the Bell basis does not represent the coherence in the system, since there is a coherence in the system due to the choice of the basis states. We first compute the entanglement and quantum coherence in the two qubit mixed states prepared using the Bell states and one of the states from the computational basis. The quantum coherence is estimated using the l1-norm of coherence, the entanglement is measured using the concurrence and the mixedness is measured using the linear entropy. Then we estimate these quantities in the Bell basis and establish that coherence should be measured only in separable basis, whereas entanglement and mixedness can be measured in any basis. We then calculate the teleportation fidelity of these mixed states and find the regions where the states have a fidelity greater than the classical teleportation fidelity. We also examine the violation of the Bell-CHSH inequality to verify the quantum nonlocal correlations in the system. The estimation of the above mentioned quantum correlations, teleportation fidelity and the verification of Bell-CHSH inequality is also done for bipartite states obtained from the tripartite systems by the tracing out of one of their qubits. We find that for some of these states teleportation is possible even when the Bell-CHSH inequality is not violated, signifying that nonlocality is not a necessary condition for quantum teleportation.
△ Less
Submitted 8 March, 2023; v1 submitted 16 June, 2022;
originally announced June 2022.
-
BanglaNLG and BanglaT5: Benchmarks and Resources for Evaluating Low-Resource Natural Language Generation in Bangla
Authors:
Abhik Bhattacharjee,
Tahmid Hasan,
Wasi Uddin Ahmad,
Rifat Shahriyar
Abstract:
This work presents BanglaNLG, a comprehensive benchmark for evaluating natural language generation (NLG) models in Bangla, a widely spoken yet low-resource language. We aggregate six challenging conditional text generation tasks under the BanglaNLG benchmark, introducing a new dataset on dialogue generation in the process. Furthermore, using a clean corpus of 27.5 GB of Bangla data, we pretrain Ba…
▽ More
This work presents BanglaNLG, a comprehensive benchmark for evaluating natural language generation (NLG) models in Bangla, a widely spoken yet low-resource language. We aggregate six challenging conditional text generation tasks under the BanglaNLG benchmark, introducing a new dataset on dialogue generation in the process. Furthermore, using a clean corpus of 27.5 GB of Bangla data, we pretrain BanglaT5, a sequence-to-sequence Transformer language model for Bangla. BanglaT5 achieves state-of-the-art performance in all of these tasks, outperforming several multilingual models by up to 9% absolute gain and 32% relative gain. We are making the new dialogue dataset and the BanglaT5 model publicly available at https://github.com/csebuetnlp/BanglaNLG in the hope of advancing future research on Bangla NLG.
△ Less
Submitted 11 February, 2023; v1 submitted 23 May, 2022;
originally announced May 2022.
-
Constructing the space of quasisymmetric stellarators
Authors:
Eduardo Rodriguez,
Wrick Sengupta,
Amitava Bhattacharjee
Abstract:
A simplified view of the space of optimised stellarators has the potential to guide and aid the design efforts of magnetic confinement configurations suitable for future fusion reactors. We present one such view for the class of quasisymmetric stellarators based on their approximate description near their centre (magnetic axis). The result is a space that captures existing designs and presents new…
▽ More
A simplified view of the space of optimised stellarators has the potential to guide and aid the design efforts of magnetic confinement configurations suitable for future fusion reactors. We present one such view for the class of quasisymmetric stellarators based on their approximate description near their centre (magnetic axis). The result is a space that captures existing designs and presents new ones, providing a common framework to study them. Such a simplified construction offers a basic topological approach, guided by certain theoretical and physical choices, which this paper presents in detail.
△ Less
Submitted 11 April, 2023; v1 submitted 21 April, 2022;
originally announced April 2022.
-
SATA: Sparsity-Aware Training Accelerator for Spiking Neural Networks
Authors:
Ruokai Yin,
Abhishek Moitra,
Abhiroop Bhattacharjee,
Youngeun Kim,
Priyadarshini Panda
Abstract:
Spiking Neural Networks (SNNs) have gained huge attention as a potential energy-efficient alternative to conventional Artificial Neural Networks (ANNs) due to their inherent high-sparsity activation. Recently, SNNs with backpropagation through time (BPTT) have achieved a higher accuracy result on image recognition tasks than other SNN training algorithms. Despite the success from the algorithm per…
▽ More
Spiking Neural Networks (SNNs) have gained huge attention as a potential energy-efficient alternative to conventional Artificial Neural Networks (ANNs) due to their inherent high-sparsity activation. Recently, SNNs with backpropagation through time (BPTT) have achieved a higher accuracy result on image recognition tasks than other SNN training algorithms. Despite the success from the algorithm perspective, prior works neglect the evaluation of the hardware energy overheads of BPTT due to the lack of a hardware evaluation platform for this SNN training algorithm. Moreover, although SNNs have long been seen as an energy-efficient counterpart of ANNs, a quantitative comparison between the training cost of SNNs and ANNs is missing. To address the aforementioned issues, in this work, we introduce SATA (Sparsity-Aware Training Accelerator), a BPTT-based training accelerator for SNNs. The proposed SATA provides a simple and re-configurable systolic-based accelerator architecture, which makes it easy to analyze the training energy for BPTT-based SNN training algorithms. By utilizing the sparsity, SATA increases its computation energy efficiency by $5.58 \times$ compared to the one without using sparsity. Based on SATA, we show quantitative analyses of the energy efficiency of SNN training and compare the training cost of SNNs and ANNs. The results show that, on Eyeriss-like systolic-based architecture, SNNs consume $1.27\times$ more total energy with sparsities when compared to ANNs. We find that such high training energy cost is from time-repetitive convolution operations and data movements during backpropagation. Moreover, to propel the future SNN training algorithm design, we provide several observations on energy efficiency for different SNN-specific training parameters and propose an energy estimation framework for SNN training. Code for our framework is made publicly available.
△ Less
Submitted 19 December, 2022; v1 submitted 11 April, 2022;
originally announced April 2022.
-
MIME: Adapting a Single Neural Network for Multi-task Inference with Memory-efficient Dynamic Pruning
Authors:
Abhiroop Bhattacharjee,
Yeshwanth Venkatesha,
Abhishek Moitra,
Priyadarshini Panda
Abstract:
Recent years have seen a paradigm shift towards multi-task learning. This calls for memory and energy-efficient solutions for inference in a multi-task scenario. We propose an algorithm-hardware co-design approach called MIME. MIME reuses the weight parameters of a trained parent task and learns task-specific threshold parameters for inference on multiple child tasks. We find that MIME results in…
▽ More
Recent years have seen a paradigm shift towards multi-task learning. This calls for memory and energy-efficient solutions for inference in a multi-task scenario. We propose an algorithm-hardware co-design approach called MIME. MIME reuses the weight parameters of a trained parent task and learns task-specific threshold parameters for inference on multiple child tasks. We find that MIME results in highly memory-efficient DRAM storage of neural-network parameters for multiple tasks compared to conventional multi-task inference. In addition, MIME results in input-dependent dynamic neuronal pruning, thereby enabling energy-efficient inference with higher throughput on a systolic-array hardware. Our experiments with benchmark datasets (child tasks)- CIFAR10, CIFAR100, and Fashion-MNIST, show that MIME achieves ~3.48x memory-efficiency and ~2.4-3.1x energy-savings compared to conventional multi-task inference in Pipelined task mode.
△ Less
Submitted 11 April, 2022;
originally announced April 2022.
-
Text Transformations in Contrastive Self-Supervised Learning: A Review
Authors:
Amrita Bhattacharjee,
Mansooreh Karami,
Huan Liu
Abstract:
Contrastive self-supervised learning has become a prominent technique in representation learning. The main step in these methods is to contrast semantically similar and dissimilar pairs of samples. However, in the domain of Natural Language Processing (NLP), the augmentation methods used in creating similar pairs with regard to contrastive learning (CL) assumptions are challenging. This is because…
▽ More
Contrastive self-supervised learning has become a prominent technique in representation learning. The main step in these methods is to contrast semantically similar and dissimilar pairs of samples. However, in the domain of Natural Language Processing (NLP), the augmentation methods used in creating similar pairs with regard to contrastive learning (CL) assumptions are challenging. This is because, even simply modifying a word in the input might change the semantic meaning of the sentence, and hence, would violate the distributional hypothesis. In this review paper, we formalize the contrastive learning framework, emphasize the considerations that need to be addressed in the data transformation step, and review the state-of-the-art methods and evaluations for contrastive representation learning in NLP. Finally, we describe some challenges and potential directions for learning better text representations using contrastive methods.
△ Less
Submitted 6 June, 2022; v1 submitted 22 March, 2022;
originally announced March 2022.
-
Research Opportunities in Plasma Astrophysics
Authors:
Stuart Bale,
Amitava Bhattacharjee,
Fausto Cattaneo,
Jemes Drake,
Hantao Ji,
Marty Lee,
Hui Li,
Edison Liang,
Marc Pound,
Stewart Prager,
Eliot Quataert,
Bruce Remington,
Robert Rosner,
Dmitri Ryutov,
Edward Thomas Jr,
Ellen Zweibel
Abstract:
Major scientific questions and research opportunities are described on 10 unprioritized plasma astrophysics topics: (1) magnetic reconnection, (2) collisionless shocks and particle acceleration, (3) waves and turbulence, (4) magnetic dynamos, (5) interface and shear instabilities, (6) angular momentum transport, (7) dusty plasmas, (8) radiative hydrodynamics, (9) relativistic, pair-dominated and s…
▽ More
Major scientific questions and research opportunities are described on 10 unprioritized plasma astrophysics topics: (1) magnetic reconnection, (2) collisionless shocks and particle acceleration, (3) waves and turbulence, (4) magnetic dynamos, (5) interface and shear instabilities, (6) angular momentum transport, (7) dusty plasmas, (8) radiative hydrodynamics, (9) relativistic, pair-dominated and strongly magnetized plasmas, (10) jets and outflows. Note that this is a conference report from a Workshop on Opportunities in Plasma Astrophysics (WOPA, https://w3.pppl.gov/conferences/2010/WOPA/) in January 2010, that attracted broad representation from the community and was supported by the U.S. Department of Energy, National Aeronautics and Space Administration, National Science Foundation, American Physical Society's Topical Group for Plasma Astrophysics and Division of Plasma Physics, and Center for Magnetic Self-Organization in Laboratory and Astrophysical Plasmas. Although there has been much planning and many developments in both science and infrastructure since the report was written, most of the motivation, priorities, problems and technical challenges discussed therein remain unaddressed and are relevant at the time of posting.
△ Less
Submitted 4 March, 2022;
originally announced March 2022.
-
On large-scale dynamos with stable stratification and the application to stellar radiative zones
Authors:
Valentin Skoutnev,
Jonathan Squire,
Amitava Bhattacharjee
Abstract:
Our understanding of large-scale magnetic fields in stellar radiative zones remains fragmented and incomplete. Such magnetic fields, which must be produced by some form of dynamo mechanism, are thought to dominate angular-momentum transport, making them crucial to stellar evolution. A major difficulty is the effect of stable stratification, which generally suppresses dynamo action. We explore the…
▽ More
Our understanding of large-scale magnetic fields in stellar radiative zones remains fragmented and incomplete. Such magnetic fields, which must be produced by some form of dynamo mechanism, are thought to dominate angular-momentum transport, making them crucial to stellar evolution. A major difficulty is the effect of stable stratification, which generally suppresses dynamo action. We explore the effects of stable stratification on mean-field dynamo theory with a particular focus on a non-helical large-scale dynamo (LSD) mechanism known as the magnetic shear-current effect. We find that the mechanism is robust to increasing stable stratification as long as the original requirements for its operation are met: a source of shear and non-helical magnetic fluctuations (e.g. from a small-scale dynamo). Both are plausibly sourced in the presence of differential rotation. Our idealized direct numerical simulations, supported by mean-field theory, demonstrate the generation of near equipartition large-scale toroidal fields. Additionally, a scan over magnetic Reynolds number shows no change in the growth or saturation of the LSD, providing good numerical evidence of a dynamo mechanism resilient to catastrophic quenching, which has been an issue for helical dynamos. These properties -- the absence of catastrophic quenching and robustness to stable stratification -- make the mechanism a plausible candidate for generating in-situ large-scale magnetic fields in stellar radiative zones.
△ Less
Submitted 16 September, 2022; v1 submitted 3 March, 2022;
originally announced March 2022.
-
Rate Coding or Direct Coding: Which One is Better for Accurate, Robust, and Energy-efficient Spiking Neural Networks?
Authors:
Youngeun Kim,
Hyoungseob Park,
Abhishek Moitra,
Abhiroop Bhattacharjee,
Yeshwanth Venkatesha,
Priyadarshini Panda
Abstract:
Recent Spiking Neural Networks (SNNs) works focus on an image classification task, therefore various coding techniques have been proposed to convert an image into temporal binary spikes. Among them, rate coding and direct coding are regarded as prospective candidates for building a practical SNN system as they show state-of-the-art performance on large-scale datasets. Despite their usage, there is…
▽ More
Recent Spiking Neural Networks (SNNs) works focus on an image classification task, therefore various coding techniques have been proposed to convert an image into temporal binary spikes. Among them, rate coding and direct coding are regarded as prospective candidates for building a practical SNN system as they show state-of-the-art performance on large-scale datasets. Despite their usage, there is little attention to comparing these two coding schemes in a fair manner. In this paper, we conduct a comprehensive analysis of the two codings from three perspectives: accuracy, adversarial robustness, and energy-efficiency. First, we compare the performance of two coding techniques with various architectures and datasets. Then, we measure the robustness of the coding techniques on two adversarial attack methods. Finally, we compare the energy-efficiency of two coding schemes on a digital hardware platform. Our results show that direct coding can achieve better accuracy especially for a small number of timesteps. In contrast, rate coding shows better robustness to adversarial attacks owing to the non-differentiable spike generation process. Rate coding also yields higher energy-efficiency than direct coding which requires multi-bit precision for the first layer. Our study explores the characteristics of two codings, which is an important design consideration for building SNNs. The code is made available at https://github.com/Intelligent-Computing-Lab-Yale/Rate-vs-Direct.
△ Less
Submitted 12 April, 2022; v1 submitted 31 January, 2022;
originally announced February 2022.
-
Phases and phase-transitions in quasisymmetric configuration space
Authors:
Eduardo Rodriguez,
Wrick Sengupta,
Amitava Bhattacharjee
Abstract:
We explore the structure of the space of quasisymmetric configurations identifying them by their magnetic axes, described as 3D closed curves. We demonstrate that this topological perspective divides the space of all configurations into well-separated quasisymmetric phases. Each phase is characterized by the self-linking number (a topological invariant), defining different symmetry configurations…
▽ More
We explore the structure of the space of quasisymmetric configurations identifying them by their magnetic axes, described as 3D closed curves. We demonstrate that this topological perspective divides the space of all configurations into well-separated quasisymmetric phases. Each phase is characterized by the self-linking number (a topological invariant), defining different symmetry configurations (quasi-axisymmetry or quasi-helical symmetry). The phase-transition manifolds correspond to quasi-isodynamic configurations. By considering some models for closed curves (most notably torus unknots), general features associated with these phases are explored. Some general criteria are also built and leveraged to provide a simple way to describe existing quasisymmetric designs. This constitutes the first step in a program to identify quasisymmetric configurations with a reduced set of functions and parameters, to deepen understanding of configuration space, and offer an alternative approach to stellarator optimization that begins with the magnetic axis and builds outward.
△ Less
Submitted 2 February, 2022;
originally announced February 2022.
-
Interplay of Three-Dimensional Instabilities and Magnetic Reconnection in the Explosive Onset of Magnetospheric Substorms
Authors:
Samuel R. Totorica,
Amitava Bhattacharjee
Abstract:
Magnetospheric substorms are preceded by a slow growth phase of magnetic flux loading and current sheet thinning in the tail. Extensive datasets have provided evidence of the triggering of instabilities at substorm onset, including magnetic reconnection and ballooning instabilities. Using an exact kinetic magnetotail equilibrium we present particle-in-cell simulations which capture the explosive n…
▽ More
Magnetospheric substorms are preceded by a slow growth phase of magnetic flux loading and current sheet thinning in the tail. Extensive datasets have provided evidence of the triggering of instabilities at substorm onset, including magnetic reconnection and ballooning instabilities. Using an exact kinetic magnetotail equilibrium we present particle-in-cell simulations which capture the explosive nature of substorms through a disruption of the dipolarization front by the ballooning instability. We use self-consistent particle tracking to determine the nonthermal particle acceleration mechanisms.
△ Less
Submitted 15 September, 2023; v1 submitted 16 January, 2022;
originally announced January 2022.
-
Examining and Mitigating the Impact of Crossbar Non-idealities for Accurate Implementation of Sparse Deep Neural Networks
Authors:
Abhiroop Bhattacharjee,
Lakshya Bhatnagar,
Priyadarshini Panda
Abstract:
Recently several structured pruning techniques have been introduced for energy-efficient implementation of Deep Neural Networks (DNNs) with lesser number of crossbars. Although, these techniques have claimed to preserve the accuracy of the sparse DNNs on crossbars, none have studied the impact of the inexorable crossbar non-idealities on the actual performance of the pruned networks. To this end,…
▽ More
Recently several structured pruning techniques have been introduced for energy-efficient implementation of Deep Neural Networks (DNNs) with lesser number of crossbars. Although, these techniques have claimed to preserve the accuracy of the sparse DNNs on crossbars, none have studied the impact of the inexorable crossbar non-idealities on the actual performance of the pruned networks. To this end, we perform a comprehensive study to show how highly sparse DNNs, that result in significant crossbar-compression-rate, can lead to severe accuracy losses compared to unpruned DNNs mapped onto non-ideal crossbars. We perform experiments with multiple structured-pruning approaches (such as, C/F pruning, XCS and XRS) on VGG11 and VGG16 DNNs with benchmark datasets (CIFAR10 and CIFAR100). We propose two mitigation approaches - Crossbar column rearrangement and Weight-Constrained-Training (WCT) - that can be integrated with the crossbar-map** of the sparse DNNs to minimize accuracy losses incurred by the pruned models. These help in mitigating non-idealities by increasing the proportion of low conductance synapses on crossbars, thereby improving their computational accuracies.
△ Less
Submitted 13 January, 2022;
originally announced January 2022.
-
Transonic accretion and winds around Pseudo-Kerr black holes and comparison with general relativistic solutions
Authors:
Abhrajit Bhattacharjee,
Sandip K. Chakrabarti,
Dipak Debnath
Abstract:
Spectral and timing properties of accretion flows on a black hole depend on their density and temperature distributions, which in turn come from the underlying dynamics. Thus, an accurate description of the flow which includes hydrodynamics and radiative transfer is a must to interpret the observational results. In the case of non-rotating black holes, Pseudo-Newtonian description of surrounding s…
▽ More
Spectral and timing properties of accretion flows on a black hole depend on their density and temperature distributions, which in turn come from the underlying dynamics. Thus, an accurate description of the flow which includes hydrodynamics and radiative transfer is a must to interpret the observational results. In the case of non-rotating black holes, Pseudo-Newtonian description of surrounding space-time enables one to make a significant progress in predicting spectral and timing properties. This formalism is lacking for spinning black holes. In this paper, we show that there exists an exact form of 'natural' potential derivable from the general relativistic (GR) radial momentum equation. Use of this potential in an otherwise Newtonian set of equations allows to describe transonic flows very accurately as is evidenced by comparing with solutions obtained from the full GR framework. We study the properties of the critical points and the centrifugal pressure supported shocks in the parameter space spanned by the specific energy and the angular momentum, and compare with the results of GR hydrodynamics. We show that this potential can safely be used for the entire range of Kerr parameter $-1<a<1$ for modeling of observational results around spinning black holes. We assume the flow to be inviscid. Thus, it is non-dissipative with constant energy and angular momentum. These assumptions are valid very close to the black hole as the infall timescale is much shorter as compared to the viscous timescale.
△ Less
Submitted 12 January, 2022;
originally announced January 2022.
-
Laser-Driven, Ion-Scale Magnetospheres in Laboratory Plasmas. I. Experimental Platform and First Results
Authors:
D. B. Schaeffer,
F. D. Cruz,
R. S. Dorst,
F. Cruz,
P. V. Heuer,
C. G. Constantin,
P. Pribyl,
C. Niemann,
L. O. Silva,
A. Bhattacharjee
Abstract:
Magnetospheres are a ubiquitous feature of magnetized bodies embedded in a plasma flow. While large planetary magnetospheres have been studied for decades by spacecraft, ion-scale "mini" magnetospheres can provide a unique environment to study kinetic-scale, collisionless plasma physics in the laboratory to help validate models of larger systems. In this work, we present preliminary experiments of…
▽ More
Magnetospheres are a ubiquitous feature of magnetized bodies embedded in a plasma flow. While large planetary magnetospheres have been studied for decades by spacecraft, ion-scale "mini" magnetospheres can provide a unique environment to study kinetic-scale, collisionless plasma physics in the laboratory to help validate models of larger systems. In this work, we present preliminary experiments of ion-scale magnetospheres performed on a unique high-repetition-rate platform developed for the Large Plasma Device (LAPD) at UCLA. The experiments utilize a high-repetition-rate laser to drive a fast plasma flow into a pulsed dipole magnetic field embedded in a uniform magnetized background plasma. 2D maps of magnetic field with high spatial and temporal resolution are measured with magnetic flux probes to examine the evolution of magnetosphere and current density structures for a range of dipole and upstream parameters. The results are further compared to 2D PIC simulations to identify key observational signatures of the kinetic-scale structures and dynamics of the laser-driven plasma. We find that distinct 2D kinetic-scale magnetopause and diamagnetic current structures are formed at higher dipole moments, and their locations are consistent with predictions based on pressure balances and energy conservation.
△ Less
Submitted 4 March, 2022; v1 submitted 6 January, 2022;
originally announced January 2022.
-
Understanding User Perspectives on Prompts for Brief Reflection on Troubling Emotions
Authors:
Ananya Bhattacharjee,
Pan Chen,
Linjia Zhou,
Abhijoy Mandal,
Jai Aggarwal,
Katie O'Leary,
Anne Hsu,
Alex Mariakakis,
Joseph Jay Williams
Abstract:
We investigate users' perspectives on an online reflective question activity (RQA) that prompts people to externalize their underlying emotions on a troubling situation. Inspired by principles of cognitive behavioral therapy, our 15-minute activity encourages self-reflection without a human or automated conversational partner. A deployment of our RQA on Amazon Mechanical Turk suggests that people…
▽ More
We investigate users' perspectives on an online reflective question activity (RQA) that prompts people to externalize their underlying emotions on a troubling situation. Inspired by principles of cognitive behavioral therapy, our 15-minute activity encourages self-reflection without a human or automated conversational partner. A deployment of our RQA on Amazon Mechanical Turk suggests that people perceive several benefits from our RQA, including structured awareness of their thoughts and problem-solving around managing their emotions. Quantitative evidence from a randomized experiment suggests people find that our RQA makes them feel less worried by their selected situation and worth the minimal time investment. A further two-week technology probe deployment with 11 participants indicates that people see benefits to doing this activity repeatedly, although the activity may get monotonous over time. In summary, this work demonstrates the promise of online reflection activities that carefully leverage principles of psychology in their design.
△ Less
Submitted 20 December, 2021;
originally announced December 2021.
-
CrossSum: Beyond English-Centric Cross-Lingual Summarization for 1,500+ Language Pairs
Authors:
Abhik Bhattacharjee,
Tahmid Hasan,
Wasi Uddin Ahmad,
Yuan-Fang Li,
Yong-Bin Kang,
Rifat Shahriyar
Abstract:
We present CrossSum, a large-scale cross-lingual summarization dataset comprising 1.68 million article-summary samples in 1,500+ language pairs. We create CrossSum by aligning parallel articles written in different languages via cross-lingual retrieval from a multilingual abstractive summarization dataset and perform a controlled human evaluation to validate its quality. We propose a multistage da…
▽ More
We present CrossSum, a large-scale cross-lingual summarization dataset comprising 1.68 million article-summary samples in 1,500+ language pairs. We create CrossSum by aligning parallel articles written in different languages via cross-lingual retrieval from a multilingual abstractive summarization dataset and perform a controlled human evaluation to validate its quality. We propose a multistage data sampling algorithm to effectively train a cross-lingual summarization model capable of summarizing an article in any target language. We also introduce LaSE, an embedding-based metric for automatically evaluating model-generated summaries. LaSE is strongly correlated with ROUGE and, unlike ROUGE, can be reliably measured even in the absence of references in the target language. Performance on ROUGE and LaSE indicate that our proposed model consistently outperforms baseline models. To the best of our knowledge, CrossSum is the largest cross-lingual summarization dataset and the first ever that is not centered around English. We are releasing the dataset, training and evaluation scripts, and models to spur future research on cross-lingual summarization. The resources can be found at https://github.com/csebuetnlp/CrossSum
△ Less
Submitted 25 May, 2023; v1 submitted 16 December, 2021;
originally announced December 2021.