Search | arXiv e-print repository

Magnetic Hysteresis Modeling with Neural Operators

Authors: Abhishek Chandra, Bram Daniels, Mitrofan Curti, Koen Tiels, Elena A. Lomonova

Abstract: Hysteresis modeling is crucial to comprehend the behavior of magnetic devices, facilitating optimal designs. Hitherto, deep learning-based methods employed to model hysteresis, face challenges in generalizing to novel input magnetic fields. This paper addresses the generalization challenge by proposing neural operators for modeling constitutive laws that exhibit magnetic hysteresis by learning a m… ▽ More Hysteresis modeling is crucial to comprehend the behavior of magnetic devices, facilitating optimal designs. Hitherto, deep learning-based methods employed to model hysteresis, face challenges in generalizing to novel input magnetic fields. This paper addresses the generalization challenge by proposing neural operators for modeling constitutive laws that exhibit magnetic hysteresis by learning a map** between magnetic fields. In particular, two prominent neural operators -- deep operator network and Fourier neural operator -- are employed to predict novel first-order reversal curves and minor loops, where novel means they are not used to train the model. In addition, a rate-independent Fourier neural operator is proposed to predict material responses at sampling rates different from those used during training to incorporate the rate-independent characteristics of magnetic hysteresis. The presented numerical experiments demonstrate that neural operators efficiently model magnetic hysteresis, outperforming the traditional neural recurrent methods on various metrics and generalizing to novel magnetic fields. The findings emphasize the advantages of using neural operators for modeling hysteresis under varying magnetic conditions, underscoring their importance in characterizing magnetic material based devices. △ Less

Submitted 3 July, 2024; originally announced July 2024.

Comments: 8 pages, 5 figures

arXiv:2407.03091 [pdf, other]

Performance Comparison of ROS2 Middlewares for Multi-robot Mesh Networks in Planetary Exploration

Authors: Loïck Pierre Chovet, Gabriel Manuel Garcia, Abhishek Bera, Antoine Richard, Kazuya Yoshida, Miguel Angel Olivares-Mendez

Abstract: Recent advancements in Multi-Robot Systems (MRS) and mesh network technologies pave the way for innovative approaches to explore extreme environments. The Artemis Accords, a series of international agreements, have further catalyzed this progress by fostering cooperation in space exploration, emphasizing the use of cutting-edge technologies. In parallel, the widespread adoption of the Robot Operat… ▽ More Recent advancements in Multi-Robot Systems (MRS) and mesh network technologies pave the way for innovative approaches to explore extreme environments. The Artemis Accords, a series of international agreements, have further catalyzed this progress by fostering cooperation in space exploration, emphasizing the use of cutting-edge technologies. In parallel, the widespread adoption of the Robot Operating System 2 (ROS 2) by companies across various sectors underscores its robustness and versatility. This paper evaluates the performances of available ROS 2 MiddleWare (RMW), such as FastRTPS, CycloneDDS and Zenoh, over a mesh network with a dynamic topology. The final choice of RMW is determined by the one that would fit the most the scenario: an exploration of the extreme extra-terrestrial environment using a MRS. The conducted study in a real environment highlights Zenoh as a potential solution for future applications, showing a reduced delay, reachability, and CPU usage while being competitive on data overhead and RAM usage over a dynamic mesh topology △ Less

Submitted 3 July, 2024; originally announced July 2024.

Comments: PrePrint

arXiv:2407.03055 [pdf]

doi 10.1063/5.0212604

Investigation of injector-coupled combustion dynamics in a methane-oxygen combustor using large eddy simulation and dynamic mode decomposition

Authors: Abhishek Sharma, Ashoke De, Sunil Kumar

Abstract: This paper uses a reactive flow large eddy simulation (LES) and decomposition techniques to study combustion instabilities in a methane-oxygen combustor. This work examines two case scenarios to elucidate the significance of injector-chamber frequency coupling as the cause of thermo-acoustic instability. Initial investigation in a well-known benchmark case of the continuously variable resonance co… ▽ More This paper uses a reactive flow large eddy simulation (LES) and decomposition techniques to study combustion instabilities in a methane-oxygen combustor. This work examines two case scenarios to elucidate the significance of injector-chamber frequency coupling as the cause of thermo-acoustic instability. Initial investigation in a well-known benchmark case of the continuously variable resonance combustor (CVRC) reports the potential instability mechanisms and the role of injector-chamber frequency coupling in thermo-acoustic instability. Subsequently, the multi-element rocket combustor case study identifies the critical resonant modes and highlights potential frequency coupling between the injector and the chamber region. The interplay between longitudinal pressure oscillations in the oxidizer post and transverse pressure waves in the chamber is responsible for the enhanced pressure dynamics in the combustor. The present work uses the dynamic mode decomposition (DMD) technique to reveal the evolution of acoustic modes in injector and chamber for CVRC and multi-element combustor. The dominant pressure mode forms found by DMD analysis also showcase the role of injector-chamber frequency coupling in amplified combustion dynamics. The results demonstrate how the predominant cause of combustion instability in rocket combustors can be effectively determined using the high-fidelity LES framework in conjunction with the modal decomposition technique. △ Less

Submitted 3 July, 2024; originally announced July 2024.

Journal ref: Physics of Fluids, 36, 065116 (2024)

arXiv:2407.02997 [pdf, other]

Holographic CFT thermodynamics of charged, rotating black holes in $D=4$ dimension

Authors: Abhishek Baruah, Prabwal Phukon

Abstract: We study the holographic thermodynamics of $4-D$ Kerr-Newman AdS black holes. We consider the conformal thermal states dual to KN AdS black holes and work out the corresponding thermodynamics in 10 ensembles. These ensembles are: fixed $(\mathcal{Q},\mathcal{J},\mathcal{V},C)$, fixed $(\mathcal{Q},Ω,\mathcal{V},C)$, fixed $(\varphi,Ω,\mathcal{V},C)$, fixed $(\varphi,\mathcal{J},\mathcal{V},C)$, fi… ▽ More We study the holographic thermodynamics of $4-D$ Kerr-Newman AdS black holes. We consider the conformal thermal states dual to KN AdS black holes and work out the corresponding thermodynamics in 10 ensembles. These ensembles are: fixed $(\mathcal{Q},\mathcal{J},\mathcal{V},C)$, fixed $(\mathcal{Q},Ω,\mathcal{V},C)$, fixed $(\varphi,Ω,\mathcal{V},C)$, fixed $(\varphi,\mathcal{J},\mathcal{V},C)$, fixed $(\mathcal{Q},\mathcal{J},p,C)$, fixed $(\mathcal{Q},Ω,p,C)$,fixed $(\varphi,\mathcal{J},p,C)$,fixed $(\mathcal{Q},\mathcal{J},p,μ)$ and fixed $(\varphi,Ω,p,μ)$ ensembles. Here $\varphi$, $\mathcal{Q}$, $Ω$, $\mathcal{J}$, $p$, $\mathcal{V}$, $C$ and $μ$ denotes the electric potential, electric charge, angular velocity, angular momentum, CFT pressure, CFT volume, central charge, and chemical potential respectively. In the fixed $(\mathcal{Q},\mathcal{J},\mathcal{V},C)$ ensemble, we observe a first order phase transition for $\mathcal{Q}<\mathcal{Q}_{crit}$, $\mathcal{J}<\mathcal{J}_{crit}$ and $C>C_{crit}$. In the fixed $(\mathcal{Q},Ω,\mathcal{V}, C)$ ensemble, we again find a first-order phase transition for $\mathcal{Q}<\mathcal{Q}_{crit}$, $Ω<Ω_{crit}$ and $C>C_{crit}$. The fixed $(\varphi,Ω,\mathcal{V}, C)$ ensemble is characterized by a confinement/de-confinement phase transition. In the fixed $(\varphi,\mathcal{J},\mathcal{V},C)$ ensemble, we see a first order phase transition for $\mathcal{J}<\mathcal{J}_{crit}$, $\varphi<\varphi_{crit}$ and $C>C_{crit}$. Finally, in the fixed $(\mathcal{Q},\mathcal{J},p, C)$,$(\mathcal{Q},Ω,p, C)$,$(\varphi,\mathcal{J},p, C)$,$(\mathcal{Q},\mathcal{J},p,μ)$,$(\varphi,Ω,p,μ)$ ensembles, we do not observe any critical behavior or phase transition. △ Less

Submitted 3 July, 2024; originally announced July 2024.

Comments: 30 pages, 18 figures

arXiv:2407.02798 [pdf, other]

Game-Based Discovery: Harnessing Mini-Games within Primary Games for Scientific Data Collection and Problem Solving

Authors: Abhishek Phadke, Mamta Yadav, Stanislav Ustymenko

Abstract: In the popular video game Batman: Arkham Knight, produced by Rocksteady Studios and released in 2015, the primary protagonist of the game is Batman, a vigilante dressed as a bat, fighting crime from the shadows in the fictitious city of Gotham. The game involves a real-world player who takes up the role of Batman to solve a peculiar side mission wherein they have to reconstruct the clean DNA seque… ▽ More In the popular video game Batman: Arkham Knight, produced by Rocksteady Studios and released in 2015, the primary protagonist of the game is Batman, a vigilante dressed as a bat, fighting crime from the shadows in the fictitious city of Gotham. The game involves a real-world player who takes up the role of Batman to solve a peculiar side mission wherein they have to reconstruct the clean DNA sequence of a human and separate it from mutant DNA to manufacture an antidote to cure the villain. Although this is undoubtedly a fascinating part of the game, one that was absent in previous Batman games, it showcases an interesting notion of using mini-games embedded within primary games to achieve a particular real-world research objective. Although the DNA data used in this case was not real, there are multiple such instances in video games where mini-games have been used for an underlying motive besides entertainment. Based on popular case studies incorporating a similar method, this study characterizes the methodology of designing mini-games within primary games for research purposes into a descriptive framework, highlighting the process's advantages and limitations. It is concluded that these mini-games not only facilitate a deeper understanding of complex scientific concepts but also accelerate data processing and analysis by leveraging crowd-sourced human intuition and pattern recognition capabilities. This paper argues for strategically incorporating miniaturized, gamified elements into established video games that are mainly intended for recreational purposes. △ Less

Submitted 2 July, 2024; originally announced July 2024.

Comments: 6 pages, 4 figures

arXiv:2407.02584 [pdf, other]

Dissipative tidal effects to next-to-leading order and constraints on the dissipative tidal deformability using gravitational wave data

Authors: Abhishek Hegade K. R., Justin L. Ripley, Nicolás Yunes

Abstract: Dissipative tidal interactions can be used to probe the out-of-equilibrium physics of neutron stars using gravitational wave observations. In this paper, we present the first post-Newtonian (PN) corrections to the orbital dynamics of a binary system containing objects whose tidal interactions have a dissipative contribution. We derive the 1PN-accurate equations of motion in the center-of-mass fram… ▽ More Dissipative tidal interactions can be used to probe the out-of-equilibrium physics of neutron stars using gravitational wave observations. In this paper, we present the first post-Newtonian (PN) corrections to the orbital dynamics of a binary system containing objects whose tidal interactions have a dissipative contribution. We derive the 1PN-accurate equations of motion in the center-of-mass frame and a generalized energy-balance law that is valid for dissipative tidal interactions. We show how mass and energy loss due to the absorption of orbital energy change the orbital dynamics and derive the next-to-leading order correction to the gravitational wave phase of a binary system in a quasi-circular orbit containing initially non-spinning components. We then use this waveform model to constrain, for the first time, the individual dissipative tidal deformabilities of each of the binary components that generated the GW170817 event using real data. We find that the GW170817 data requires $Ξ_{1} \lesssim 1121$ and $Ξ_{2} \lesssim 1692$ at 90\% confidence, where $Ξ_{1,2}$ are the individual tidal deformabilities of the primary and secondary binary components that produced the GW170817 event. △ Less

Submitted 2 July, 2024; originally announced July 2024.

Comments: 15 pages, 2 figures. Comments are welcome

arXiv:2407.02343 [pdf, other]

A detailed study of the very-high-energy Crab pulsar emission with the LST-1

Authors: CTA-LST Project, :, K. Abe, S. Abe, A. Abhishek, F. Acero, A. Aguasca-Cabot, I. Agudo, N. Alvarez Crespo, L. A. Antonelli, C. Aramo, A. Arbet-Engels, C. Arcaro, M. Artero, K. Asano, P. Aubert, A. Baktash, A. Bamba, A. Baquero Larriva, L. Baroncelli, U. Barres de Almeida, J. A. Barrio, I. Batkovic, J. Baxter, J. Becerra González , et al. (272 additional authors not shown)

Abstract: Context: There are currently three pulsars firmly detected by imaging atmospheric Cherenkov telescopes (IACTs), two of them reaching TeV energies, challenging models of very-high-energy (VHE) emission in pulsars. More precise observations are needed to better characterize pulsar emission at these energies. The LST-1 is the prototype of the Large-Sized Telescope, that will be part of the Cherenkov… ▽ More Context: There are currently three pulsars firmly detected by imaging atmospheric Cherenkov telescopes (IACTs), two of them reaching TeV energies, challenging models of very-high-energy (VHE) emission in pulsars. More precise observations are needed to better characterize pulsar emission at these energies. The LST-1 is the prototype of the Large-Sized Telescope, that will be part of the Cherenkov Telescope Array Observatory (CTAO). Its improved performance over previous IACTs makes it well suited for studying pulsars. Aims: To study the Crab pulsar emission with the LST-1, improving and complementing the results from other telescopes. These observations can also be used to characterize the potential of the LST-1 to study other pulsars and detect new ones. Methods: We analyzed a total of $\sim$103 hours of gamma-ray observations of the Crab pulsar conducted with the LST-1 in the period from September 2020 to January 2023. The observations were carried out at zenith angles less than 50 degrees. A new analysis of the Fermi-LAT data was also performed, including $\sim$14 years of observations. Results: The Crab pulsar phaseogram, long-term light-curve, and phase-resolved spectra are reconstructed with the LST-1 from 20 GeV to 450 GeV for P1 and up to 700 GeV for P2. The pulsed emission is detected with a significance of 15.2$σ$. The two characteristic emission peaks of the Crab pulsar are clearly detected (>10$σ$), as well as the so-called bridge emission (5.7$σ$). We find that both peaks are well described by power laws, with spectral indices of $\sim$3.44 and $\sim$3.03 respectively. The joint analysis of Fermi-LAT and LST-1 data shows a good agreement between both instruments in the overlap** energy range. The detailed results obtained in the first observations of the Crab pulsar with LST-1 show the potential that CTAO will have to study this type of sources. △ Less

Submitted 2 July, 2024; originally announced July 2024.

Comments: Accepted by A&A

arXiv:2407.01351 [pdf, other]

Probing the connection between IceCube neutrinos and MOJAVE AGN

Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (399 additional authors not shown)

Abstract: Active Galactic Nuclei (AGN) are prime candidate sources of the high-energy, astrophysical neutrinos detected by IceCube. This is demonstrated by the real-time multi-messenger detection of the blazar TXS 0506+056 and the recent evidence of neutrino emission from NGC 1068 from a separate time-averaged study. However, the production mechanism of the astrophysical neutrinos in AGN is not well establi… ▽ More Active Galactic Nuclei (AGN) are prime candidate sources of the high-energy, astrophysical neutrinos detected by IceCube. This is demonstrated by the real-time multi-messenger detection of the blazar TXS 0506+056 and the recent evidence of neutrino emission from NGC 1068 from a separate time-averaged study. However, the production mechanism of the astrophysical neutrinos in AGN is not well established which can be resolved via correlation studies with photon observations. For neutrinos produced due to photohadronic interactions in AGN, in addition to a correlation of neutrinos with high-energy photons, there would also be a correlation of neutrinos with photons emitted at radio wavelengths. In this work, we perform an in-depth stacking study of the correlation between 15 GHz radio observations of AGN reported in the MOJAVE XV catalog, and ten years of neutrino data from IceCube. We also use a time-dependent approach which improves the statistical power of the stacking analysis. No significant correlation was found for both analyses and upper limits are reported. When compared to the IceCube diffuse flux, at 100 TeV and for a spectral index of 2.5, the upper limits derived are $\sim3\%$ and $\sim9\%$ for the time-averaged and time-dependent case, respectively. △ Less

Submitted 1 July, 2024; originally announced July 2024.

Comments: 14 Pages 7 Figures

arXiv:2406.19304 [pdf, other]

Understanding Routing-Induced Censorship Changes Globally

Authors: Abhishek Bhaskar, Paul Pearce

Abstract: Internet censorship is pervasive, with significant effort dedicated to understanding what is censored, and where. Prior censorship work however have identified significant inconsistencies in their results; experiments show unexplained non-determinism thought to be caused by censor load, end-host geographic diversity, or incomplete censorship -- inconsistencies which impede reliable, repeatable and… ▽ More Internet censorship is pervasive, with significant effort dedicated to understanding what is censored, and where. Prior censorship work however have identified significant inconsistencies in their results; experiments show unexplained non-determinism thought to be caused by censor load, end-host geographic diversity, or incomplete censorship -- inconsistencies which impede reliable, repeatable and correct understanding of global censorship. In this work we investigate the extent to which Equal-cost Multi-path (ECMP) routing is the cause for these inconsistencies, develo** methods to measure and compensate for them. We find ECMP routing significantly changes observed censorship across protocols, censor mechanisms, and in 17 countries. We identify that previously observed non-determinism or regional variations are attributable to measurements between fixed end-hosts taking different routes based on Flow-ID; i.e., choice of intra-subnet source IP or ephemeral source port leads to differences in observed censorship. To achieve this we develop new route-stable censorship measurement methods that allow consistent measurement of DNS, HTTP, and HTTPS censorship. We find ECMP routing yields censorship changes across 42% of IPs and 51% of ASes, but that impact is not uniform. We identify numerous causes of the behavior, ranging from likely failed infrastructure, to routes to the same end-host taking geographically diverse paths which experience differences in censorship en-route. Finally, we explore our results in the context of prior global measurement studies, exploring first the applicability of our findings to prior observed variations, and then demonstrating how specific experiments from two studies could be impacted by, and specific results are explainable by, ECMP routing. Our work points to methods for improving future studies, reducing inconsistencies and increasing repeatability. △ Less

Submitted 27 June, 2024; originally announced June 2024.

Comments: In Proceedings of the 2024 ACM SIGSAC Conference on Computer and Communications Security (CCS 2024)

arXiv:2406.18899 [pdf, other]

Autonomous Control of a Novel Closed Chain Five Bar Active Suspension via Deep Reinforcement Learning

Authors: Nishesh Singh, Sidharth Ramesh, Abhishek Shankar, Jyotishka Duttagupta, Leander Stephen D'Souza, Sanjay Singh

Abstract: Planetary exploration requires traversal in environments with rugged terrains. In addition, Mars rovers and other planetary exploration robots often carry sensitive scientific experiments and components onboard, which must be protected from mechanical harm. This paper deals with an active suspension system focused on chassis stabilisation and an efficient traversal method while encountering unavoi… ▽ More Planetary exploration requires traversal in environments with rugged terrains. In addition, Mars rovers and other planetary exploration robots often carry sensitive scientific experiments and components onboard, which must be protected from mechanical harm. This paper deals with an active suspension system focused on chassis stabilisation and an efficient traversal method while encountering unavoidable obstacles. Soft Actor-Critic (SAC) was applied along with Proportional Integral Derivative (PID) control to stabilise the chassis and traverse large obstacles at low speeds. The model uses the rover's distance from surrounding obstacles, the height of the obstacle, and the chassis' orientation to actuate the control links of the suspension accurately. Simulations carried out in the Gazebo environment are used to validate the proposed active system. △ Less

Submitted 30 June, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

Comments: 15 pages, 11 figures

ACM Class: I.2.9

arXiv:2406.17904 [pdf]

Application of Liquid Rank Reputation System for Twitter Trend Analysis on Bitcoin

Authors: Abhishek Saxena, Anton Kolonin

Abstract: Analyzing social media trends can create a win-win situation for both creators and consumers. Creators can receive fair compensation, while consumers gain access to engaging, relevant, and personalized content. This paper proposes a new model for analyzing Bitcoin trends on Twitter by incorporating a 'liquid democracy' approach based on user reputation. This system aims to identify the most impact… ▽ More Analyzing social media trends can create a win-win situation for both creators and consumers. Creators can receive fair compensation, while consumers gain access to engaging, relevant, and personalized content. This paper proposes a new model for analyzing Bitcoin trends on Twitter by incorporating a 'liquid democracy' approach based on user reputation. This system aims to identify the most impactful trends and their influence on Bitcoin prices and trading volume. It uses a Twitter sentiment analysis model based on a reputation rating system to determine the impact on Bitcoin price change and traded volume. In addition, the reputation model considers the users' higher-order friends on the social network (the initial Twitter input channels in our case study) to improve the accuracy and diversity of the reputation results. We analyze Bitcoin-related news on Twitter to understand how trends and user sentiment, measured through our Liquid Rank Reputation System, affect Bitcoin price fluctuations and trading activity within the studied time frame. This reputation model can also be used as an additional layer in other trend and sentiment analysis models. The paper proposes the implementation, challenges, and future scope of the liquid rank reputation model. △ Less

Submitted 25 June, 2024; originally announced June 2024.

Comments: Under publication in 2024 Ural-Siberian Conference on Biomedical Engineering, Radioelectronics and Information Technology, Yekaterinburg, Russia

arXiv:2406.17630 [pdf, other]

KANQAS: Kolmogorov Arnold Network for Quantum Architecture Search

Authors: Akash Kundu, Aritra Sarkar, Abhishek Sadhu

Abstract: Quantum architecture search~(QAS) is a promising direction for optimization and automated design of quantum circuits towards quantum advantage. Recent techniques in QAS focus on machine learning-based approaches from reinforcement learning, like deep Q-network. While multi-layer perceptron-based deep Q-networks have been applied for QAS, their interpretability remains challenging due to the high n… ▽ More Quantum architecture search~(QAS) is a promising direction for optimization and automated design of quantum circuits towards quantum advantage. Recent techniques in QAS focus on machine learning-based approaches from reinforcement learning, like deep Q-network. While multi-layer perceptron-based deep Q-networks have been applied for QAS, their interpretability remains challenging due to the high number of parameters. In this work, we evaluate the practicality of KANs in quantum architecture search problems, analyzing their efficiency in terms of the probability of success, frequency of optimal solutions and their dependencies on various degrees of freedom of the network. In a noiseless scenario, the probability of success and the number of optimal quantum circuit configurations to generate the multi-qubit maximally entangled states are significantly higher than MLPs. Moreover in noisy scenarios, KAN can achieve a better fidelity in approximating maximally entangled state than MLPs, where the performance of the MLP significantly depends on the choice of activation function. Further investigation reveals that KAN requires a very small number of learnable parameters compared to MLPs, however, the average time of executing each episode for KAN is much higher. △ Less

Submitted 25 June, 2024; originally announced June 2024.

Comments: 10 pages and 4 figures

arXiv:2406.16630 [pdf, other]

Linac_Gen: integrating machine learning and particle-in-cell methods for enhanced beam dynamics at Fermilab

Authors: Abhishek Pathak

Abstract: Here, we introduce Linac_Gen, a tool developed at Fermilab, which combines machine learning algorithms with Particle-in-Cell methods to advance beam dynamics in linacs. Linac_Gen employs techniques such as Random Forest, Genetic Algorithms, Support Vector Machines, and Neural Networks, achieving a tenfold increase in speed for phase-space matching in linacs over traditional methods through the use… ▽ More Here, we introduce Linac_Gen, a tool developed at Fermilab, which combines machine learning algorithms with Particle-in-Cell methods to advance beam dynamics in linacs. Linac_Gen employs techniques such as Random Forest, Genetic Algorithms, Support Vector Machines, and Neural Networks, achieving a tenfold increase in speed for phase-space matching in linacs over traditional methods through the use of genetic algorithms. Crucially, Linac_Gen's adept handling of 3D field maps elevates the precision and realism in simulating beam instabilities and resonances, marking a key advancement in the field. Benchmarked against established codes, Linac_Gen demonstrates not only improved efficiency and precision in beam dynamics studies but also in the design and optimization of linac systems, as evidenced in its application to Fermilab's PIP-II linac project. This work represents a notable advancement in accelerator physics, marrying ML with PIC methods to set new standards for efficiency and accuracy in accelerator design and research. Linac_Gen exemplifies a novel approach in accelerator technology, offering substantial improvements in both theoretical and practical aspects of beam dynamics. △ Less

Submitted 24 June, 2024; originally announced June 2024.

arXiv:2406.16008 [pdf, other]

Found in the Middle: Calibrating Positional Attention Bias Improves Long Context Utilization

Authors: Cheng-Yu Hsieh, Yung-Sung Chuang, Chun-Liang Li, Zifeng Wang, Long T. Le, Abhishek Kumar, James Glass, Alexander Ratner, Chen-Yu Lee, Ranjay Krishna, Tomas Pfister

Abstract: Large language models (LLMs), even when specifically trained to process long input contexts, struggle to capture relevant information located in the middle of their input. This phenomenon has been known as the lost-in-the-middle problem. In this work, we make three contributions. First, we set out to understand the factors that cause this phenomenon. In doing so, we establish a connection between… ▽ More Large language models (LLMs), even when specifically trained to process long input contexts, struggle to capture relevant information located in the middle of their input. This phenomenon has been known as the lost-in-the-middle problem. In this work, we make three contributions. First, we set out to understand the factors that cause this phenomenon. In doing so, we establish a connection between lost-in-the-middle to LLMs' intrinsic attention bias: LLMs exhibit a U-shaped attention bias where the tokens at the beginning and at the end of its input receive higher attention, regardless of their relevance. Second, we mitigate this positional bias through a calibration mechanism, found-in-the-middle, that allows the model to attend to contexts faithfully according to their relevance, even though when they are in the middle. Third, we show found-in-the-middle not only achieves better performance in locating relevant information within a long context, but also eventually leads to improved retrieval-augmented generation (RAG) performance across various tasks, outperforming existing methods by up to 15 percentage points. These findings open up future directions in understanding LLM attention bias and its potential consequences. △ Less

Submitted 3 July, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

Comments: ACL Findings 2024

arXiv:2406.15653 [pdf, other]

Circular Polarization of Simulated Images of Black Holes

Authors: Abhishek V. Joshi, Ben S. Prather, Chi-kwan Chan, Maciek Wielgus, Charles F. Gammie

Abstract: Models of the resolved Event Horizon Telescope (EHT) sources Sgr A* and M87* are constrained by observations at multiple wavelengths, resolutions, polarizations, and time cadences. In this paper we compare unresolved circular polarization (CP) measurements to a library of models, where each model is characterized by a distribution of CP over time. In the library we vary the spin of the black hole,… ▽ More Models of the resolved Event Horizon Telescope (EHT) sources Sgr A* and M87* are constrained by observations at multiple wavelengths, resolutions, polarizations, and time cadences. In this paper we compare unresolved circular polarization (CP) measurements to a library of models, where each model is characterized by a distribution of CP over time. In the library we vary the spin of the black hole, the magnetic field strength at the horizon (i.e. both SANE and MAD models), the observer inclination, a parameter for the maximum ion-electron temperature ratio assuming a thermal plasma, and the direction of the magnetic field dipole moment. We find that ALMA observations of Sgr A* are inconsistent with all edge-on ($i = 90^\circ$) models. Restricting attention to the magnetically arrested disk (MAD) models favored by earlier EHT studies of Sgr A*, we find that only models with magnetic dipole moment pointing away from the observer are consistent with ALMA data. We also note that in 26 of the 27 passing MAD models the accretion flow rotates clockwise on the sky. We provide a table of the mean and standard deviation of the CP distributions for all model parameters along with their trends. △ Less

Submitted 21 June, 2024; originally announced June 2024.

Comments: 33 pages, 17 figures, 2 tables. Accepted for publication in ApJ

arXiv:2406.15649 [pdf, other]

Efficient Human Pose Estimation: Leveraging Advanced Techniques with MediaPipe

Authors: Sandeep Singh Sengar, Abhishek Kumar, Owen Singh

Abstract: This study presents significant enhancements in human pose estimation using the MediaPipe framework. The research focuses on improving accuracy, computational efficiency, and real-time processing capabilities by comprehensively optimising the underlying algorithms. Novel modifications are introduced that substantially enhance pose estimation accuracy across challenging scenarios, such as dynamic m… ▽ More This study presents significant enhancements in human pose estimation using the MediaPipe framework. The research focuses on improving accuracy, computational efficiency, and real-time processing capabilities by comprehensively optimising the underlying algorithms. Novel modifications are introduced that substantially enhance pose estimation accuracy across challenging scenarios, such as dynamic movements and partial occlusions. The improved framework is benchmarked against traditional models, demonstrating considerable precision and computational speed gains. The advancements have wide-ranging applications in augmented reality, sports analytics, and healthcare, enabling more immersive experiences, refined performance analysis, and advanced patient monitoring. The study also explores the integration of these enhancements within mobile and embedded systems, addressing the need for computational efficiency and broader accessibility. The implications of this research set a new benchmark for real-time human pose estimation technologies and pave the way for future innovations in the field. The implementation code for the paper is available at https://github.com/avhixd/Human_pose_estimation. △ Less

Submitted 21 June, 2024; originally announced June 2024.

arXiv:2406.15593 [pdf, other]

News Deja Vu: Connecting Past and Present with Semantic Search

Authors: Brevin Franklin, Emily Silcock, Abhishek Arora, Tom Bryan, Melissa Dell

Abstract: Social scientists and the general public often analyze contemporary events by drawing parallels with the past, a process complicated by the vast, noisy, and unstructured nature of historical texts. For example, hundreds of millions of page scans from historical newspapers have been noisily transcribed. Traditional sparse methods for searching for relevant material in these vast corpora, e.g., with… ▽ More Social scientists and the general public often analyze contemporary events by drawing parallels with the past, a process complicated by the vast, noisy, and unstructured nature of historical texts. For example, hundreds of millions of page scans from historical newspapers have been noisily transcribed. Traditional sparse methods for searching for relevant material in these vast corpora, e.g., with keywords, can be brittle given complex vocabularies and OCR noise. This study introduces News Deja Vu, a novel semantic search tool that leverages transformer large language models and a bi-encoder approach to identify historical news articles that are most similar to modern news queries. News Deja Vu first recognizes and masks entities, in order to focus on broader parallels rather than the specific named entities being discussed. Then, a contrastively trained, lightweight bi-encoder retrieves historical articles that are most similar semantically to a modern query, illustrating how phenomena that might seem unique to the present have varied historical precedents. Aimed at social scientists, the user-friendly News Deja Vu package is designed to be accessible for those who lack extensive familiarity with deep learning. It works with large text datasets, and we show how it can be deployed to a massive scale corpus of historical, open-source news articles. While human expertise remains important for drawing deeper insights, News Deja Vu provides a powerful tool for exploring parallels in how people have perceived past and present. △ Less

Submitted 21 June, 2024; originally announced June 2024.

arXiv:2406.15576 [pdf, other]

Contrastive Entity Coreference and Disambiguation for Historical Texts

Authors: Abhishek Arora, Emily Silcock, Leander Heldring, Melissa Dell

Abstract: Massive-scale historical document collections are crucial for social science research. Despite increasing digitization, these documents typically lack unique cross-document identifiers for individuals mentioned within the texts, as well as individual identifiers from external knowledgebases like Wikipedia/Wikidata. Existing entity disambiguation methods often fall short in accuracy for historical… ▽ More Massive-scale historical document collections are crucial for social science research. Despite increasing digitization, these documents typically lack unique cross-document identifiers for individuals mentioned within the texts, as well as individual identifiers from external knowledgebases like Wikipedia/Wikidata. Existing entity disambiguation methods often fall short in accuracy for historical documents, which are replete with individuals not remembered in contemporary knowledgebases. This study makes three key contributions to improve cross-document coreference resolution and disambiguation in historical texts: a massive-scale training dataset replete with hard negatives - that sources over 190 million entity pairs from Wikipedia contexts and disambiguation pages - high-quality evaluation data from hand-labeled historical newswire articles, and trained models evaluated on this historical benchmark. We contrastively train bi-encoder models for coreferencing and disambiguating individuals in historical texts, achieving accurate, scalable performance that identifies out-of-knowledgebase individuals. Our approach significantly surpasses other entity disambiguation models on our historical newswire benchmark. Our models also demonstrate competitive performance on modern entity disambiguation benchmarks, particularly certain news disambiguation datasets. △ Less

Submitted 21 June, 2024; originally announced June 2024.

arXiv:2406.14008 [pdf, other]

AMC: Access to Miss Correlation Prefetcher for Evolving Graph Analytics

Authors: Abhishek Singh, Christian Schulte, Xiaochen Guo

Abstract: Modern memory hierarchies work well with applications that have good spatial locality. Evolving (dynamic) graphs are important applications widely used to model graphs and networks with edge and vertex changes. They exhibit irregular memory access patterns and suffer from a high miss ratio and long miss penalty. Prefetching can be employed to predict and fetch future demand misses. However, curren… ▽ More Modern memory hierarchies work well with applications that have good spatial locality. Evolving (dynamic) graphs are important applications widely used to model graphs and networks with edge and vertex changes. They exhibit irregular memory access patterns and suffer from a high miss ratio and long miss penalty. Prefetching can be employed to predict and fetch future demand misses. However, current hardware prefetchers can not efficiently predict for applications with irregular memory accesses. In evolving graph applications, vertices that do not change during graph changes exhibit the same access correlation patterns. Current temporal prefetchers use one-to-one or one-to-many correlation to exploit these patterns. Similar patterns are recorded in the same entry, which causes aliasing and can lead to poor prefetch accuracy and coverage. This work proposes a software-assisted hardware prefetcher for evolving graphs. The key idea is to record the correlations between a sequence of vertex accesses and the following misses and then prefetch when the same vertex access sequence occurs in the future. The proposed Access-to-Miss Correlation (AMC) prefetcher provides a lightweight programming interface to identify the data structures of interest and sets the iteration boundary to update the correlation table. For the evaluated applications, AMC achieves a geomean speedup of 1.5x as compared to the best-performing prefetcher in prior work (VLDP). AMC can achieve an average of 62% accuracy and coverage, whereas VLDP has an accuracy of 31% and coverage of 23%. △ Less

Submitted 20 June, 2024; originally announced June 2024.

Comments: 14 pages, 16 figures

ACM Class: C.1.1

arXiv:2406.13742 [pdf, other]

Superfluid stiffness of twisted multilayer graphene superconductors

Authors: Abhishek Banerjee, Zeyu Hao, Mary Kreidel, Patrick Ledwith, Isabelle Phinney, Jeong Min Park, Andrew M. Zimmerman, Kenji Watanabe, Takashi Taniguchi, Robert M Westervelt, Pablo Jarillo-Herrero, Pavel A. Volkov, Ashvin Vishwanath, Kin Chung Fong, Philip Kim

Abstract: The robustness of the macroscopic quantum nature of a superconductor can be characterized by the superfluid stiffness, $ρ_s$, a quantity that describes the energy required to vary the phase of the macroscopic quantum wave function. In unconventional superconductors, such as cuprates, the low-temperature behavior of $ρ_s$ drastically differs from that of conventional superconductors due to quasipar… ▽ More The robustness of the macroscopic quantum nature of a superconductor can be characterized by the superfluid stiffness, $ρ_s$, a quantity that describes the energy required to vary the phase of the macroscopic quantum wave function. In unconventional superconductors, such as cuprates, the low-temperature behavior of $ρ_s$ drastically differs from that of conventional superconductors due to quasiparticle excitations from gapless points (nodes) in momentum space. Intensive research on the recently discovered magic-angle twisted graphene family has revealed, in addition to superconducting states, strongly correlated electronic states associated with spontaneously broken symmetries, inviting the study of $ρ_s$ to uncover the potentially unconventional nature of its superconductivity. Here we report the measurement of $ρ_s$ in magic-angle twisted trilayer graphene (TTG), revealing unconventional nodal-gap superconductivity. Utilizing radio-frequency reflectometry techniques to measure the kinetic inductive response of superconducting TTG coupled to a microwave resonator, we find a linear temperature dependence of $ρ_s$ at low temperatures and nonlinear Meissner effects in the current bias dependence, both indicating nodal structures in the superconducting order parameter. Furthermore, the do** dependence shows a linear correlation between the zero temperature $ρ_s$ and the superconducting transition temperature $T_c$, reminiscent of Uemura's relation in cuprates, suggesting phase-coherence-limited superconductivity. Our results provide strong evidence for nodal superconductivity in TTG and put strong constraints on the mechanisms of these graphene-based superconductors. △ Less

Submitted 19 June, 2024; originally announced June 2024.

arXiv:2406.13494 [pdf, other]

Quantum steering under constrained free-will

Authors: Abhishek Sadhu, Siddhartha Das

Abstract: Quantum steering is a kind of bipartite quantum correlations where one party's measurement remotely alters the state of another party. In an adversarial scenario, there could be a hidden variable introducing a bias in the choice of measurement settings of the parties. However, observers without access to the hidden variable are unaware of this bias. The main focus of this work is to analyze quantu… ▽ More Quantum steering is a kind of bipartite quantum correlations where one party's measurement remotely alters the state of another party. In an adversarial scenario, there could be a hidden variable introducing a bias in the choice of measurement settings of the parties. However, observers without access to the hidden variable are unaware of this bias. The main focus of this work is to analyze quantum steering without assuming that the parties freely choose their measurement settings. For this, we introduce the measurement-dependent (MD-)steering scenario where the measurement settings chosen by the parties are biased by an adversary. In such a scenario, we present a class of inequalities to test for MD-steerable correlations. Further, we discuss the implications of violating such inequalities in certifying randomness from quantum extremal behaviors. We also assume that an adversary might prepare an assemblage as a mixture of MD-steerable and MD-unsteerable assemblages and provide a bound on the measurement dependence for the observed correlation to remain MD-steerable. △ Less

Submitted 19 June, 2024; originally announced June 2024.

Comments: 10 pages, 5 figures

arXiv:2406.13473 [pdf, other]

Snowy Scenes,Clear Detections: A Robust Model for Traffic Light Detection in Adverse Weather Conditions

Authors: Shivank Garg, Abhishek Baghel, Amit Agarwal, Durga Toshniwal

Abstract: With the rise of autonomous vehicles and advanced driver-assistance systems (ADAS), ensuring reliable object detection in all weather conditions is crucial for safety and efficiency. Adverse weather like snow, rain, and fog presents major challenges for current detection systems, often resulting in failures and potential safety risks. This paper introduces a novel framework and pipeline designed t… ▽ More With the rise of autonomous vehicles and advanced driver-assistance systems (ADAS), ensuring reliable object detection in all weather conditions is crucial for safety and efficiency. Adverse weather like snow, rain, and fog presents major challenges for current detection systems, often resulting in failures and potential safety risks. This paper introduces a novel framework and pipeline designed to improve object detection under such conditions, focusing on traffic signal detection where traditional methods often fail due to domain shifts caused by adverse weather. We provide a comprehensive analysis of the limitations of existing techniques. Our proposed pipeline significantly enhances detection accuracy in snow, rain, and fog. Results show a 40.8% improvement in average IoU and F1 scores compared to naive fine-tuning and a 22.4% performance increase in domain shift scenarios, such as training on artificial snow and testing on rain images. △ Less

Submitted 19 June, 2024; originally announced June 2024.

arXiv:2406.13266 [pdf]

doi 10.13140/RG.2.2.12433.85604/2

Advancements in Orthopaedic Arm Segmentation: A Comprehensive Review

Authors: Abhishek Swami, Snehal Farande, Atharv Patil, Atharva Parle, Vivekanand Mane, Prathamesh Thorat

Abstract: The most recent advances in medical imaging that have transformed diagnosis, especially in the case of interpreting X-ray images, are actively involved in the healthcare sector. The advent of digital image processing technology and the implementation of deep learning models such as Convolutional Neural Networks (CNNs) have made the analysis of X-rays much more accurate and efficient. In this artic… ▽ More The most recent advances in medical imaging that have transformed diagnosis, especially in the case of interpreting X-ray images, are actively involved in the healthcare sector. The advent of digital image processing technology and the implementation of deep learning models such as Convolutional Neural Networks (CNNs) have made the analysis of X-rays much more accurate and efficient. In this article, some essential techniques such as edge detection, region-growing technique, and thresholding approach, and the deep learning models such as variants of YOLOv8-which is the best object detection and segmentation framework-are reviewed. We further investigate that the traditional image processing techniques like segmentation are very much simple and provides the alternative to the advanced methods as well. Our review gives useful knowledge on the practical usage of the innovative and traditional approaches of manual X-ray interpretation. The discovered information will help professionals and researchers to gain more profound knowledge in digital interpretation techniques in medical imaging. △ Less

Submitted 19 June, 2024; originally announced June 2024.

Comments: 29 pages, 20 figures

MSC Class: 68T07

arXiv:2406.12203 [pdf, other]

InterIntent: Investigating Social Intelligence of LLMs via Intention Understanding in an Interactive Game Context

Authors: Ziyi Liu, Abhishek Anand, Pei Zhou, Jen-tse Huang, Jieyu Zhao

Abstract: Large language models (LLMs) have demonstrated the potential to mimic human social intelligence. However, most studies focus on simplistic and static self-report or performance-based tests, which limits the depth and validity of the analysis. In this paper, we developed a novel framework, InterIntent, to assess LLMs' social intelligence by map** their ability to understand and manage intentions… ▽ More Large language models (LLMs) have demonstrated the potential to mimic human social intelligence. However, most studies focus on simplistic and static self-report or performance-based tests, which limits the depth and validity of the analysis. In this paper, we developed a novel framework, InterIntent, to assess LLMs' social intelligence by map** their ability to understand and manage intentions in a game setting. We focus on four dimensions of social intelligence: situational awareness, self-regulation, self-awareness, and theory of mind. Each dimension is linked to a specific game task: intention selection, intention following, intention summarization, and intention guessing. Our findings indicate that while LLMs exhibit high proficiency in selecting intentions, achieving an accuracy of 88\%, their ability to infer the intentions of others is significantly weaker, trailing human performance by 20\%. Additionally, game performance correlates with intention understanding, highlighting the importance of the four components towards success in this game. These findings underline the crucial role of intention understanding in evaluating LLMs' social intelligence and highlight the potential of using social deduction games as a complex testbed to enhance LLM evaluation. InterIntent contributes a structured approach to bridging the evaluation gap in social intelligence within multiplayer games. △ Less

Submitted 17 June, 2024; originally announced June 2024.

arXiv:2406.11988 [pdf, other]

Decomposed evaluations of geographic disparities in text-to-image models

Authors: Abhishek Sureddy, Dishant Padalia, Nandhinee Periyakaruppa, Oindrila Saha, Adina Williams, Adriana Romero-Soriano, Megan Richards, Polina Kirichenko, Melissa Hall

Abstract: Recent work has identified substantial disparities in generated images of different geographic regions, including stereotypical depictions of everyday objects like houses and cars. However, existing measures for these disparities have been limited to either human evaluations, which are time-consuming and costly, or automatic metrics evaluating full images, which are unable to attribute these dispa… ▽ More Recent work has identified substantial disparities in generated images of different geographic regions, including stereotypical depictions of everyday objects like houses and cars. However, existing measures for these disparities have been limited to either human evaluations, which are time-consuming and costly, or automatic metrics evaluating full images, which are unable to attribute these disparities to specific parts of the generated images. In this work, we introduce a new set of metrics, Decomposed Indicators of Disparities in Image Generation (Decomposed-DIG), that allows us to separately measure geographic disparities in the depiction of objects and backgrounds in generated images. Using Decomposed-DIG, we audit a widely used latent diffusion model and find that generated images depict objects with better realism than backgrounds and that backgrounds in generated images tend to contain larger regional disparities than objects. We use Decomposed-DIG to pinpoint specific examples of disparities, such as stereotypical background generation in Africa, struggling to generate modern vehicles in Africa, and unrealistically placing some objects in outdoor settings. Informed by our metric, we use a new prompting structure that enables a 52% worst-region improvement and a 20% average improvement in generated background diversity. △ Less

Submitted 17 June, 2024; originally announced June 2024.

arXiv:2406.11773 [pdf, other]

On localizing groups of exotic diffeomorphisms of 4-manifolds

Authors: Hokuto Konno, Abhishek Mallick

Abstract: Ruberman in the 90's showed that the group of exotic diffeomorphisms of closed 4-manifolds can be infinitely generated. We provide various results on the question of when such infinite generation can localize to a smaller embedded submanifold of the original manifold. Our results include: (1) All known infinitely generated groups of exotic diffeomorphisms of 4-manifolds detected by families Seiber… ▽ More Ruberman in the 90's showed that the group of exotic diffeomorphisms of closed 4-manifolds can be infinitely generated. We provide various results on the question of when such infinite generation can localize to a smaller embedded submanifold of the original manifold. Our results include: (1) All known infinitely generated groups of exotic diffeomorphisms of 4-manifolds detected by families Seiberg-Witten theory do not localize to any topologically (locally-flatly) embedded rational homology balls in the ambient 4-manifold. (2) Many exotic diffeomorphisms cannot be obtained as Dehn twists along homology spheres (under mild assumptions). (3) There is no contractible 4-manifolds with Seifert fibered boundary that have a universal property for exotic diffeomorphisms analogous to a universal cork. In addition, there is no universal compact 4-manifold $W$ such that the set of exotic diffeomorphisms of a 4-manifold can localize to an embedding of $W$. (4) Certain infinite generations of exotic diffeomorphism groups do localize to a non-compact subset $V$ with a small Betti number, but not to any compact subset of $V$. (5) An analogous result holds for map** class groups of 4-manifolds. △ Less

Submitted 17 June, 2024; originally announced June 2024.

Comments: 26 pages, 3 figures

Report number: RIKEN-iTHEMS-Report-24

arXiv:2406.10756 [pdf, other]

Astronomical Spectroscopy with Skipper CCDs: First Results from a Skipper CCD Focal Plane Prototype at SIFS

Authors: Edgar Marrufo Villalpando, Alex Drlica-Wagner, Brandon Roach, Marco Bonati, Abhishek Bakshi, Julia Campa, Gustavo Cancelo, Braulio Cancino, Claudio R. Chavez, Fernando Chierchie, Juan Estrada, Guillermo Fernandez Moroni, Luciano Fraga, Manuel E. Gaido, Stephen E. Holland, Rachel Hur, Michelle Jonas, Peter Moore, Eduardo Paolini, Andrés A. Plazas Malagón, Leandro Stefanazzi, Javier Tiffenberg, Ken Treptou, Sho Uemura, Neal Wilcer

Abstract: We present the first on-sky results from an ultra-low-readout-noise Skipper CCD focal plane prototype for the SOAR Integral Field Spectrograph (SIFS). The Skipper CCD focal plane consists of four 6k x 1k, 15 $μ$m pixel, fully-depleted, p-channel devices that have been thinned to ~250 $μ$m, backside processed, and treated with an anti-reflective coating. These Skipper CCDs were configured for astro… ▽ More We present the first on-sky results from an ultra-low-readout-noise Skipper CCD focal plane prototype for the SOAR Integral Field Spectrograph (SIFS). The Skipper CCD focal plane consists of four 6k x 1k, 15 $μ$m pixel, fully-depleted, p-channel devices that have been thinned to ~250 $μ$m, backside processed, and treated with an anti-reflective coating. These Skipper CCDs were configured for astronomical spectroscopy, i.e., single-sample readout noise < 4.3 e- rms/pixel, the ability to achieve multi-sample readout noise $\ll$ 1 e- rms/pixel, full-well capacities ~40,000-65,000 e-, low dark current and charge transfer inefficiency (~2 x 10$^{-4}$ e-/pixel/s and 3.44 x 10$^{-7}$, respectively), and an absolute quantum efficiency of $\gtrsim$ 80% between 450 nm and 980 nm ($\gtrsim$ 90% between 600 nm and 900 nm). We optimized the readout sequence timing to achieve sub-electron noise (~0.5 e- rms/pixel) in a region of 2k x 4k pixels and photon-counting noise (~0.22 e- rms/pixel) in a region of 220 x 4k pixels, each with a readout time of $\lesssim$ 17 min. We observed two quasars (HB89 1159+123 and QSO J1621-0042) at redshift z ~ 3.5, two high-redshift galaxy clusters (CL J1001+0220 and SPT-CL J2040-4451), an emission line galaxy at z = 0.3239, a candidate member star of the Boötes II ultra-faint dwarf galaxy, and five CALSPEC spectrophotometric standard stars (HD074000, HD60753, HD106252, HD101452, HD200654). We present charge-quantized, photon-counting observations of the quasar HB89 1159+123 and show the detector sensitivity increase for faint spectral features. We demonstrate signal-to-noise performance improvements for SIFS observations in the low-background, readout-noise-dominated regime. We outline scientific studies that will leverage the SIFS-Skipper CCD data and new detector architectures that utilize the Skipper floating gate amplifier with faster readout times. △ Less

Submitted 15 June, 2024; originally announced June 2024.

Comments: 20 pages, 13 figures, 1 table; Proc. SPIE

Report number: FERMILAB-CONF-24-0305-LDRD-PPD

arXiv:2406.09490 [pdf, other]

Newswire: A Large-Scale Structured Database of a Century of Historical News

Authors: Emily Silcock, Abhishek Arora, Luca D'Amico-Wong, Melissa Dell

Abstract: In the U.S. historically, local newspapers drew their content largely from newswires like the Associated Press. Historians argue that newswires played a pivotal role in creating a national identity and shared understanding of the world, but there is no comprehensive archive of the content sent over newswires. We reconstruct such an archive by applying a customized deep learning pipeline to hundred… ▽ More In the U.S. historically, local newspapers drew their content largely from newswires like the Associated Press. Historians argue that newswires played a pivotal role in creating a national identity and shared understanding of the world, but there is no comprehensive archive of the content sent over newswires. We reconstruct such an archive by applying a customized deep learning pipeline to hundreds of terabytes of raw image scans from thousands of local newspapers. The resulting dataset contains 2.7 million unique public domain U.S. newswire articles, written between 1878 and 1977. Locations in these articles are georeferenced, topics are tagged using customized neural topic classification, named entities are recognized, and individuals are disambiguated to Wikipedia using a novel entity disambiguation model. To construct the Newswire dataset, we first recognize newspaper layouts and transcribe around 138 millions structured article texts from raw image scans. We then use a customized neural bi-encoder model to de-duplicate reproduced articles, in the presence of considerable abridgement and noise, quantifying how widely each article was reproduced. A text classifier is used to ensure that we only include newswire articles, which historically are in the public domain. The structured data that accompany the texts provide rich information about the who (disambiguated individuals), what (topics), and where (georeferencing) of the news that millions of Americans read over the course of a century. We also include Library of Congress metadata information about the newspapers that ran the articles on their front pages. The Newswire dataset is useful both for large language modeling - expanding training data beyond what is available from modern web texts - and for studying a diversity of questions in computational linguistics, social science, and the digital humanities. △ Less

Submitted 13 June, 2024; originally announced June 2024.

Comments: arXiv admin note: text overlap with arXiv:2306.17810, arXiv:2308.12477

arXiv:2406.09091 [pdf, other]

Simulations of distributed-phase-reference quantum key distribution protocols

Authors: Venkat Abhignan, Abhishek Jamunkar, Gokul Nair, Mohit Mittal, Megha Shrivastava

Abstract: Quantum technology can enable secure communication for cryptography purposes using quantum key distribution. Quantum key distribution protocols provide a secret key between two users with security guaranteed by the laws of quantum mechanics. To define the proper implementation of a quantum key distribution system using a particular cryptography protocol, it is crucial to critically and meticulousl… ▽ More Quantum technology can enable secure communication for cryptography purposes using quantum key distribution. Quantum key distribution protocols provide a secret key between two users with security guaranteed by the laws of quantum mechanics. To define the proper implementation of a quantum key distribution system using a particular cryptography protocol, it is crucial to critically and meticulously assess the device's performance due to technological limitations in the components used. We perform simulations on the ANSYS Interconnect platform to characterise the practical implementation of these devices using distributed-phase-reference protocols differential-phase-shift and coherent-one-way quantum key distribution. Further, we briefly describe and simulate some possible eavesdrop** attempts, backflash attack, trojan-horse attack and detector-blinding attack exploiting the device imperfections. △ Less

Submitted 13 June, 2024; originally announced June 2024.

arXiv:2406.07676 [pdf, other]

FastAST: Accelerating Audio Spectrogram Transformer via Token Merging and Cross-Model Knowledge Distillation

Authors: Swarup Ranjan Behera, Abhishek Dhiman, Karthik Gowda, Aalekhya Satya Narayani

Abstract: Audio classification models, particularly the Audio Spectrogram Transformer (AST), play a crucial role in efficient audio analysis. However, optimizing their efficiency without compromising accuracy remains a challenge. In this paper, we introduce FastAST, a framework that integrates Token Merging (ToMe) into the AST framework. FastAST enhances inference speed without requiring extensive retrainin… ▽ More Audio classification models, particularly the Audio Spectrogram Transformer (AST), play a crucial role in efficient audio analysis. However, optimizing their efficiency without compromising accuracy remains a challenge. In this paper, we introduce FastAST, a framework that integrates Token Merging (ToMe) into the AST framework. FastAST enhances inference speed without requiring extensive retraining by merging similar tokens in audio spectrograms. Furthermore, during training, FastAST brings about significant speed improvements. The experiments indicate that FastAST can increase audio classification throughput with minimal impact on accuracy. To mitigate the accuracy impact, we integrate Cross-Model Knowledge Distillation (CMKD) into the FastAST framework. Integrating ToMe and CMKD into AST results in improved accuracy compared to AST while maintaining faster inference speeds. FastAST represents a step towards real-time, resource-efficient audio analysis. △ Less

Submitted 11 June, 2024; originally announced June 2024.

Comments: Accepted to Interspeech 2024

MSC Class: 68T10

arXiv:2406.07153 [pdf, other]

EEG classification for visual brain decoding with spatio-temporal and transformer based paradigms

Authors: Akanksha Sharma, Jyoti Nigam, Abhishek Rathore, Arnav Bhavsar

Abstract: In this work, we delve into the EEG classification task in the domain of visual brain decoding via two frameworks, involving two different learning paradigms. Considering the spatio-temporal nature of EEG data, one of our frameworks is based on a CNN-BiLSTM model. The other involves a CNN-Transformer architecture which inherently involves the more versatile attention based learning paradigm. In bo… ▽ More In this work, we delve into the EEG classification task in the domain of visual brain decoding via two frameworks, involving two different learning paradigms. Considering the spatio-temporal nature of EEG data, one of our frameworks is based on a CNN-BiLSTM model. The other involves a CNN-Transformer architecture which inherently involves the more versatile attention based learning paradigm. In both cases, a special 1D-CNN feature extraction module is used to generate the initial embeddings with 1D convolutions in the time and the EEG channel domains. Considering the EEG signals are noisy, non stationary and the discriminative features are even less clear (than in semantically structured data such as text or image), we also follow a window-based classification followed by majority voting during inference, to yield labels at a signal level. To illustrate how brain patterns correlate with different image classes, we visualize t-SNE plots of the BiLSTM embeddings alongside brain activation maps for the top 10 classes. These visualizations provide insightful revelations into the distinct neural signatures associated with each visual category, showcasing the BiLSTM's capability to capture and represent the discriminative brain activity linked to visual stimuli. We demonstrate the performance of our approach on the updated EEG-Imagenet dataset with positive comparisons with state-of-the-art methods. △ Less

Submitted 11 June, 2024; originally announced June 2024.

Comments: The paper has been submitted at ICPR 2024. It contains 15 pages with 7 images

arXiv:2406.07140 [pdf, other]

Constraints on Lorentz invariance violation from the extraordinary Mrk 421 flare of 2014 using a novel analysis method

Authors: MAGIC Collaboration, S. Abe, J. Abhir, A. Abhishek, V. A. Acciari, A. Aguasca-Cabot, I. Agudo, T. Aniello, S. Ansoldi, L. A. Antonelli, A. Arbet Engels, C. Arcaro, M. Artero, K. Asano, A. Babić, A. Baquero, U. Barres de Almeida, J. A. Barrio, I. Batković, A. Bautista, J. Baxter, J. Becerra González, W. Bednarek, E. Bernardini, J. Bernete , et al. (192 additional authors not shown)

Abstract: The Lorentz Invariance Violation (LIV), a proposed consequence of certain quantum gravity (QG) scenarios, could instigate an energy-dependent group velocity for ultra-relativistic particles. This energy dependence, although suppressed by the massive QG energy scale $E_\mathrm{QG}$, expected to be on the level of the Planck energy $1.22 \times 10^{19}$ GeV, is potentially detectable in astrophysica… ▽ More The Lorentz Invariance Violation (LIV), a proposed consequence of certain quantum gravity (QG) scenarios, could instigate an energy-dependent group velocity for ultra-relativistic particles. This energy dependence, although suppressed by the massive QG energy scale $E_\mathrm{QG}$, expected to be on the level of the Planck energy $1.22 \times 10^{19}$ GeV, is potentially detectable in astrophysical observations. In this scenario, the cosmological distances traversed by photons act as an amplifier for this effect. By leveraging the observation of a remarkable flare from the blazar Mrk\,421, recorded at energies above 100 GeV by the MAGIC telescopes on the night of April 25 to 26, 2014, we look for time delays scaling linearly and quadratically with the photon energies. Using for the first time in LIV studies a binned-likelihood approach we set constraints on the QG energy scale. For the linear scenario, we set $95\%$ lower limits $E_\mathrm{QG}>2.7\times10^{17}$ GeV for the subluminal case and $E_\mathrm{QG}> 3.6 \times10^{17}$ GeV for the superluminal case. For the quadratic scenario, the $95\%$ lower limits for the subluminal and superluminal cases are $E_\mathrm{QG}>2.6 \times10^{10}$ GeV and $E_\mathrm{QG}>2.5\times10^{10}$ GeV, respectively. △ Less

Submitted 11 June, 2024; originally announced June 2024.

arXiv:2406.05831 [pdf]

Electronic, optical, and transport properties of alkali metal oxides (Cs2O): A DFT study

Authors: Anjali Kumari, Kamal Kumar, Abhishek Kumar Mishra, Ramesh Sharma

Abstract: The electronic, structural, optical, and thermoelectric properties of the Cs2O cubic structure have been investigated using density functional theory (DFT). The calculations utilize a full relativistic version of the full-potential augmented plane-wave plus local orbitals method, which is based on density functional theory, employing both the GGA and LDA approximations. Additionally, we employed t… ▽ More The electronic, structural, optical, and thermoelectric properties of the Cs2O cubic structure have been investigated using density functional theory (DFT). The calculations utilize a full relativistic version of the full-potential augmented plane-wave plus local orbitals method, which is based on density functional theory, employing both the GGA and LDA approximations. Additionally, we employed the GGA proposed by Trans-Blaha (GGA-mBJ) for band structure computations, revealing the indirect band gap nature of Cs2O. The optical properties are also addressed by computing the refractive index, extinction coefficient, and complex dielectric tensor. The electrical conductivity, Seebeck coefficient, and thermal conductivity exhibit temperature-dependent variations, indicating the formation of a thermoelectric material. Our findings indicate that the compound under investigation is categorized as a p-type semiconductor, with the majority of charge carriers responsible for conduction being holes rather than electrons. △ Less

Submitted 9 June, 2024; originally announced June 2024.

Comments: 21 pages

arXiv:2406.05793 [pdf, ps, other]

Existence of Positive Solutions for Generalized Fractional Brézis-Nirenberg Problem

Authors: Rohit Kumar, Abhishek Sarkar

Abstract: In this article, we study the fractional Brézis-Nirenberg type problem on whole domain $\mathbb{R}^N$ associated with the fractional $p$-Laplace operator. To be precise, we want to study the following problem: \begin{equation*} (-Δ)_{p}^{s}u - λw |u|^{p-2}u= |u|^{p_{s}^{*}-2}u \quad \text{in} ~\mathcal{D}^{s,p}(\mathbb{R}^{N}), \end{equation*} where… ▽ More In this article, we study the fractional Brézis-Nirenberg type problem on whole domain $\mathbb{R}^N$ associated with the fractional $p$-Laplace operator. To be precise, we want to study the following problem: \begin{equation*} (-Δ)_{p}^{s}u - λw |u|^{p-2}u= |u|^{p_{s}^{*}-2}u \quad \text{in} ~\mathcal{D}^{s,p}(\mathbb{R}^{N}), \end{equation*} where $s\in (0,1),~p \in (1,\frac{N}{s}), ~p_{s}^{*}= \frac{Np}{N-sp}$ and the operator $(-Δ)_{p}^{s}$ is the fractional $p$-Laplace operator. The space $\mathcal{D}^{s,p}(\mathbb{R}^{N})$ is the completion of $C_c^\infty(\mathbb{R}^N)$ with respect to the Gaglairdo semi-norm. In this article, we prove the existence of a positive solution to this problem by allowing the Hardy weight $w$ to change its sign. △ Less

Submitted 9 June, 2024; originally announced June 2024.

Comments: 28 pages

MSC Class: 35B09; 35R11

arXiv:2406.05123 [pdf, other]

Born-Oppenheimer Potentials for $SU(3)$ Gauge Theory

Authors: Fareed Alasiri, Eric Braaten, Abhishek Mohapatra

Abstract: We develop parameterizations of 8 of the lowest Born-Oppenheimer potentials for quarkonium hybrid mesons as functions of the separation $r$ of the static quark and antiquark sources. The parameters are determined by fitting results calculated using pure $SU(3)$ lattice gauge theory. The parameterizations have the correct limiting behavior at small $r$, where the potentials form multiplets associat… ▽ More We develop parameterizations of 8 of the lowest Born-Oppenheimer potentials for quarkonium hybrid mesons as functions of the separation $r$ of the static quark and antiquark sources. The parameters are determined by fitting results calculated using pure $SU(3)$ lattice gauge theory. The parameterizations have the correct limiting behavior at small $r$, where the potentials form multiplets associated with gluelumps. They have the correct limiting behavior at large $r$, where the potentials form multiplets associated with excitations of a relativistic string. There is a narrow avoided crossing in the small-$r$ region between two potentials with the same Born-Oppenheimer quantum numbers. △ Less

Submitted 7 June, 2024; originally announced June 2024.

Comments: 37 pages, 10 figures, 4 tables

Report number: TUM-EFT 188/24

arXiv:2406.04803 [pdf, other]

Embracing Nonlinearity and Geometry: A dimensional analysis guided design of shock absorbing materials

Authors: Abhishek Gupta, Komal Chawla, Ramathasan Thevamaran

Abstract: Protective applications require energy-absorbing materials that are soft and compressible enough to absorb kinetic energy from impacts, yet stiff enough to bear crushing loads. Achieving this balance requires careful consideration of both mechanical properties and geometric design. Conventional shock-absorbing pads are made of very thick foams that exhibit a plateau of constant stress in their str… ▽ More Protective applications require energy-absorbing materials that are soft and compressible enough to absorb kinetic energy from impacts, yet stiff enough to bear crushing loads. Achieving this balance requires careful consideration of both mechanical properties and geometric design. Conventional shock-absorbing pads are made of very thick foams that exhibit a plateau of constant stress in their stress-strain response. Contrary to this belief, we report that foams with a nonlinear stress-strain response can be useful to achieve simultaneously thin and lightweight protective pads. We introduce a new framework for the thickness or volume-constrained design of compact and lightweight protective foams while ensuring the desired structural integrity and mechanical performance. Our streamlined dimensional analysis approach provides geometric constraints on the dimensionless thickness and cross-sectional area of a protective foam with a given stress-strain response to limit the acceleration and compressive strain within desired critical limits. We also identify optimal mechanical properties that will result in the most compact and lightest protective foam layer for absorbing a given kinetic energy of impact. Guided by this design framework, we achieve optimal protective properties in hierarchically architected vertically aligned carbon nanotube (VACNT) foams, enabling next generation protective applications in extreme environments. △ Less

Submitted 7 June, 2024; originally announced June 2024.

arXiv:2406.03183 [pdf, other]

Geometric Localization of Homology Cycles

Authors: Amritendu Dhar, Vijay Natarajan, Abhishek Rathod

Abstract: Computing an optimal cycle in a given homology class, also referred to as the homology localization problem, is known to be an NP-hard problem in general. Furthermore, there is currently no known optimality criterion that localizes classes geometrically and admits a stability property under the setting of persistent homology. We present a geometric optimization of the cycles that is computable in… ▽ More Computing an optimal cycle in a given homology class, also referred to as the homology localization problem, is known to be an NP-hard problem in general. Furthermore, there is currently no known optimality criterion that localizes classes geometrically and admits a stability property under the setting of persistent homology. We present a geometric optimization of the cycles that is computable in polynomial time and is stable in an approximate sense. Tailoring our search criterion to different settings, we obtain various optimization problems like optimal homologous cycle, minimum homology basis, and minimum persistent homology basis. In practice, the (trivial) exact algorithm is computationally expensive despite having a worst case polynomial runtime. Therefore, we design approximation algorithms for the above problems and study their performance experimentally. These algorithms have reasonable runtimes for moderate sized datasets and the cycles computed by these algorithms are consistently of high quality as demonstrated via experiments on multiple datasets. △ Less

Submitted 5 June, 2024; originally announced June 2024.

Comments: To Appear in CCCG 2024 : Proc. 36th Canadian Conference on Computational Geometry

ACM Class: I.3.5

arXiv:2406.02742 [pdf, ps, other]

Tolerant Algorithms for Learning with Arbitrary Covariate Shift

Authors: Surbhi Goel, Abhishek Shetty, Konstantinos Stavropoulos, Arsen Vasilyan

Abstract: We study the problem of learning under arbitrary distribution shift, where the learner is trained on a labeled set from one distribution but evaluated on a different, potentially adversarially generated test distribution. We focus on two frameworks: PQ learning [Goldwasser, A. Kalai, Y. Kalai, Montasser NeurIPS 2020], allowing abstention on adversarially generated parts of the test distribution, a… ▽ More We study the problem of learning under arbitrary distribution shift, where the learner is trained on a labeled set from one distribution but evaluated on a different, potentially adversarially generated test distribution. We focus on two frameworks: PQ learning [Goldwasser, A. Kalai, Y. Kalai, Montasser NeurIPS 2020], allowing abstention on adversarially generated parts of the test distribution, and TDS learning [Klivans, Stavropoulos, Vasilyan COLT 2024], permitting abstention on the entire test distribution if distribution shift is detected. All prior known algorithms either rely on learning primitives that are computationally hard even for simple function classes, or end up abstaining entirely even in the presence of a tiny amount of distribution shift. We address both these challenges for natural function classes, including intersections of halfspaces and decision trees, and standard training distributions, including Gaussians. For PQ learning, we give efficient learning algorithms, while for TDS learning, our algorithms can tolerate moderate amounts of distribution shift. At the core of our approach is an improved analysis of spectral outlier-removal techniques from learning with nasty noise. Our analysis can (1) handle arbitrarily large fraction of outliers, which is crucial for handling arbitrary distribution shifts, and (2) obtain stronger bounds on polynomial moments of the distribution after outlier removal, yielding new insights into polynomial regression under distribution shifts. Lastly, our techniques lead to novel results for tolerant testable learning [Rubinfeld and Vasilyan STOC 2023], and learning with nasty noise. △ Less

Submitted 4 June, 2024; originally announced June 2024.

arXiv:2406.02704 [pdf, other]

Quantum-enabled continuous microwave-to-optics frequency conversion

Authors: Han Zhao, William David Chen, Abhishek Kejriwal, Mohammad Mirhosseini

Abstract: A quantum interface between microwave and optical photons is essential for entangling remote superconducting quantum processors. To preserve fragile quantum states, a transducer must operate efficiently while generating less than one photon of noise referred to its input. Here, we present a platform that meets these criteria, utilizing a combination of electrostatic and optomechanical interactions… ▽ More A quantum interface between microwave and optical photons is essential for entangling remote superconducting quantum processors. To preserve fragile quantum states, a transducer must operate efficiently while generating less than one photon of noise referred to its input. Here, we present a platform that meets these criteria, utilizing a combination of electrostatic and optomechanical interactions in devices made entirely from crystalline silicon. This platform's small mechanical dissipation and low optical absorption enable ground-state radiative cooling, resulting in quantum-enabled operation with a continuous laser drive. Under the optimal settings for high efficiency (low noise), we measure an external efficiency of $2.2\%$ ($0.47\%$) and an input-referred added noise of $0.94$ ($0.58$) in microwave-to-optics conversion. We quantify the transducer throughput using the efficiency-bandwidth product, finding it exceeds previous demonstrations with similar noise performance by approximately two orders of magnitude, thereby paving a practical path to interconnecting remote superconducting qubits. △ Less

Submitted 4 June, 2024; originally announced June 2024.

arXiv:2406.02523 [pdf, other]

RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots

Authors: Soroush Nasiriany, Abhiram Maddukuri, Lance Zhang, Adeet Parikh, Aaron Lo, Abhishek Joshi, Ajay Mandlekar, Yuke Zhu

Abstract: Recent advancements in Artificial Intelligence (AI) have largely been propelled by scaling. In Robotics, scaling is hindered by the lack of access to massive robot datasets. We advocate using realistic physical simulation as a means to scale environments, tasks, and datasets for robot learning methods. We present RoboCasa, a large-scale simulation framework for training generalist robots in everyd… ▽ More Recent advancements in Artificial Intelligence (AI) have largely been propelled by scaling. In Robotics, scaling is hindered by the lack of access to massive robot datasets. We advocate using realistic physical simulation as a means to scale environments, tasks, and datasets for robot learning methods. We present RoboCasa, a large-scale simulation framework for training generalist robots in everyday environments. RoboCasa features realistic and diverse scenes focusing on kitchen environments. We provide thousands of 3D assets across over 150 object categories and dozens of interactable furniture and appliances. We enrich the realism and diversity of our simulation with generative AI tools, such as object assets from text-to-3D models and environment textures from text-to-image models. We design a set of 100 tasks for systematic evaluation, including composite tasks generated by the guidance of large language models. To facilitate learning, we provide high-quality human demonstrations and integrate automated trajectory generation methods to substantially enlarge our datasets with minimal human burden. Our experiments show a clear scaling trend in using synthetically generated robot data for large-scale imitation learning and show great promise in harnessing simulation data in real-world tasks. Videos and open-source code are available at https://robocasa.ai/ △ Less

Submitted 4 June, 2024; originally announced June 2024.

Comments: RSS 2024

arXiv:2406.01132 [pdf, other]

Investigating a Device Independence Quantum Random Number Generation

Authors: Vardaan Mongia, Abhishek Kumar, Shashi Prabhakar, Anindya Banerji, R. P. Singh

Abstract: Quantum random number generation (QRNG) is a resource that is a necessity in the field of cryptography. However, its certification has been challenging. In this article, we certify randomness with the aid of quantum entanglement in a device independent setting, where we choose two-photon interference for source characterisation. The CHSH inequality violation and quantum state tomography are used a… ▽ More Quantum random number generation (QRNG) is a resource that is a necessity in the field of cryptography. However, its certification has been challenging. In this article, we certify randomness with the aid of quantum entanglement in a device independent setting, where we choose two-photon interference for source characterisation. The CHSH inequality violation and quantum state tomography are used as independent checks on the measurement devices. These measures ensure the unpredictability of quantum random number generation. This work can be easily extended to faster randomness expansion protocols. △ Less

Submitted 3 June, 2024; originally announced June 2024.

Comments: Comments and suggestions are welcomed

arXiv:2406.00503 [pdf, other]

Schrödinger Bridge with Quadratic State Cost is Exactly Solvable

Authors: Alexis M. H. Teter, Wenqing Wang, Abhishek Halder

Abstract: Schrödinger bridge is a diffusion process that steers a given distribution to another in a prescribed time while minimizing the effort to do so. It can be seen as the stochastic dynamical version of the optimal mass transport, and has growing applications in generative diffusion models and stochastic optimal control. In this work, we propose a regularized variant of the Schrödinger bridge with a q… ▽ More Schrödinger bridge is a diffusion process that steers a given distribution to another in a prescribed time while minimizing the effort to do so. It can be seen as the stochastic dynamical version of the optimal mass transport, and has growing applications in generative diffusion models and stochastic optimal control. In this work, we propose a regularized variant of the Schrödinger bridge with a quadratic state cost-to-go that incentivizes the optimal sample paths to stay close to a nominal level. Unlike the conventional Schrödinger bridge, the regularization induces a state-dependent rate of killing and creation of probability mass, and its solution requires determining the Markov kernel of a reaction-diffusion partial differential equation. We derive this Markov kernel in closed form. Our solution recovers the heat kernel in the vanishing regularization (i.e., diffusion without reaction) limit, thereby recovering the solution of the conventional Schrödinger bridge. Our results enable the use of dynamic Sinkhorn recursion for computing the Schrödinger bridge with a quadratic state cost-to-go, which would otherwise be challenging to use in this setting. We deduce properties of the new kernel and explain its connections with certain exactly solvable models in quantum mechanics. △ Less

Submitted 16 June, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

arXiv:2405.20953 [pdf, other]

Final Physics Design of Proton Improvement Plan-II At Fermilab

Authors: Abhishek Pathak, Arun Saini, Eduard Pozdeyev

Abstract: This paper presents the final physics design of the Proton Improvement Plan-II (PIP-II) at Fermilab, focusing on the linear accelerator (Linac) and its beam transfer line. We address the challenges in longitudinal and transverse lattice design, specifically targeting collective effects, parametric resonances, and space charge nonlinearities that impact beam stability and emittance control. The str… ▽ More This paper presents the final physics design of the Proton Improvement Plan-II (PIP-II) at Fermilab, focusing on the linear accelerator (Linac) and its beam transfer line. We address the challenges in longitudinal and transverse lattice design, specifically targeting collective effects, parametric resonances, and space charge nonlinearities that impact beam stability and emittance control. The strategies implemented effectively mitigate space charge complexities, resulting in significant improvements in beam quality -- evidenced by reduced emittance growth, lower beam halo, decreased loss, and better energy spread management. This comprehensive study is pivotal for the PIP-II project's success, providing valuable insights and approaches for future accelerator designs, especially in managing nonlinearities and enhancing beam dynamics. △ Less

Submitted 31 May, 2024; originally announced May 2024.

Report number: FERMILAB-CONF-24-0268-PIP2

arXiv:2405.20836 [pdf, other]

Solving partial differential equations with sampled neural networks

Authors: Chinmay Datar, Taniya Kapoor, Abhishek Chandra, Qing Sun, Iryna Burak, Erik Lien Bolager, Anna Veselovska, Massimo Fornasier, Felix Dietrich

Abstract: Approximation of solutions to partial differential equations (PDE) is an important problem in computational science and engineering. Using neural networks as an ansatz for the solution has proven a challenge in terms of training time and approximation accuracy. In this contribution, we discuss how sampling the hidden weights and biases of the ansatz network from data-agnostic and data-dependent pr… ▽ More Approximation of solutions to partial differential equations (PDE) is an important problem in computational science and engineering. Using neural networks as an ansatz for the solution has proven a challenge in terms of training time and approximation accuracy. In this contribution, we discuss how sampling the hidden weights and biases of the ansatz network from data-agnostic and data-dependent probability distributions allows us to progress on both challenges. In most examples, the random sampling schemes outperform iterative, gradient-based optimization of physics-informed neural networks regarding training time and accuracy by several orders of magnitude. For time-dependent PDE, we construct neural basis functions only in the spatial domain and then solve the associated ordinary differential equation with classical methods from scientific computing over a long time horizon. This alleviates one of the greatest challenges for neural PDE solvers because it does not require us to parameterize the solution in time. For second-order elliptic PDE in Barron spaces, we prove the existence of sampled networks with $L^2$ convergence to the solution. We demonstrate our approach on several time-dependent and static PDEs. We also illustrate how sampled networks can effectively solve inverse problems in this setting. Benefits compared to common numerical schemes include spectral convergence and mesh-free construction of basis functions. △ Less

Submitted 31 May, 2024; originally announced May 2024.

Comments: 16 pages, 15 figures

arXiv:2405.20110 [pdf]

Autonomous programmable microscopic electronic lablets optimized with digital control

Authors: Thomas Maeke, John McCaskill, Dominic Funke, Pierre Mayr, Abhishek Sharma, Uwe Tangen, Jürgen Oehm

Abstract: Lablets are autonomous microscopic particles with programmable CMOS electronics that can control electrokinetic phenomena and electrochemical reactions in solution via actuator and sensor microelectrodes. In this paper, we describe the design and fabrication of optimized singulated lablets (CMOS3) with dimensions 140x140x50 micrometers carrying an integrated coplanar encapsulated supercapacitor as… ▽ More Lablets are autonomous microscopic particles with programmable CMOS electronics that can control electrokinetic phenomena and electrochemical reactions in solution via actuator and sensor microelectrodes. In this paper, we describe the design and fabrication of optimized singulated lablets (CMOS3) with dimensions 140x140x50 micrometers carrying an integrated coplanar encapsulated supercapacitor as a rechargeable power supply. The lablets are designed to allow docking to one another or to a smart surface for interchange of energy, electronic information, and chemicals. The paper focusses on the digital and analog design of the lablets to allow significant programmable functionality in a microscopic footprint, including the control of autonomous actuation and sensing up to the level of being able to support a complete lablet self-reproduction life cycle, although experimentally this remains to be proven. The potential of lablets in autonomous sensing and control and for evolutionary experimentation are discussed. △ Less

Submitted 16 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

Comments: This article was originally submitted (2016) for review as one of a number of preprints as supporting information for the final review of the EU MICREAgents Project # 318671 (2012-2016). Here it is presented in slightly revised form. The version, v2 contains a reference to the verilog source code on GitHub

ACM Class: I.2.9; B.7.0; J.2; J.3; J.7; H.0

arXiv:2405.19984 [pdf, other]

Hydrodynamics of a hard-core non-polar active lattice gas

Authors: Ritwik Mukherjee, Soumyabrata Saha, Tridib Sadhu, Abhishek Dhar, Sanjib Sabhapandit

Abstract: We present a fluctuating hydrodynamic description of a non-polar active lattice gas model with excluded volume interactions that exhibits motility-induced phase separation under appropriate conditions. For quasi-one dimension and higher, stability analysis of the noiseless hydrodynamics gives quantitative bounds on the phase boundary of the motility-induced phase separation in terms of spinodal an… ▽ More We present a fluctuating hydrodynamic description of a non-polar active lattice gas model with excluded volume interactions that exhibits motility-induced phase separation under appropriate conditions. For quasi-one dimension and higher, stability analysis of the noiseless hydrodynamics gives quantitative bounds on the phase boundary of the motility-induced phase separation in terms of spinodal and binodal. Inclusion of the multiplicative noise in the fluctuating hydrodynamics describes the exponentially decaying two-point correlations in the stationary-state homogeneous phase. Our hydrodynamic description and theoretical predictions based on it are in excellent agreement with our Monte-Carlo simulations and pseudo-spectral iteration of the hydrodynamics equations. Our construction of hydrodynamics for this model is not suitable in strictly one-dimension with single-file constraints, and we argue that this breakdown is associated with micro-phase separation. △ Less

Submitted 30 May, 2024; originally announced May 2024.

Comments: 6 pages, 6 figures, + 16 supplemental pages

arXiv:2405.19907 [pdf]

DFT study of structural, electronic and optical properties of 2D MgO monolayer under bi-axial mechanical strain

Authors: Kamal Kumar, Anjali Kumari, Soni Mishra, Ramesh Sharma, Abhishek Kumar Mishra

Abstract: The structural, electronic, and dielectric (optical) properties of graphene-like 2D MgO monolayer have been explored through first-principles calculations under bi-axial tensile and compressive mechanical strain within a range of -10% to +10%. Our findings revealed that the pristine MgO monolayer is an indirect band gap semiconducting material and the semiconducting mature of MgO monolayer remains… ▽ More The structural, electronic, and dielectric (optical) properties of graphene-like 2D MgO monolayer have been explored through first-principles calculations under bi-axial tensile and compressive mechanical strain within a range of -10% to +10%. Our findings revealed that the pristine MgO monolayer is an indirect band gap semiconducting material and the semiconducting mature of MgO monolayer remains consistent under both compressive and tensile mechanical strain. This nature of MgO is confirmed through partial density of states (PDOS) as well as electronic band structure. PDOS exhibits the contribution of different atomic orbitals in bond formation and nature of bond, while band structure provides insight into electron transitions between energy levels of valance and conduction bands. All optical parameters (dielectric function, reflectivity, energy loss, refractive index, extinction coefficient and absorption) are plotted in an energy range 0-15 eV. Within this energy interval, MgO possesses the highest value of the refractive index (2.13) at 3.12 eV energy. Also, a detailed analysis of changes in the geometrical structure of MgO monolayer is provided. △ Less

Submitted 30 May, 2024; originally announced May 2024.

Comments: 6 figures

arXiv:2405.19463 [pdf, other]

Stochastic Optimization Algorithms for Instrumental Variable Regression with Streaming Data

Authors: Xuxing Chen, Abhishek Roy, Yifan Hu, Krishnakumar Balasubramanian

Abstract: We develop and analyze algorithms for instrumental variable regression by viewing the problem as a conditional stochastic optimization problem. In the context of least-squares instrumental variable regression, our algorithms neither require matrix inversions nor mini-batches and provides a fully online approach for performing instrumental variable regression with streaming data. When the true mode… ▽ More We develop and analyze algorithms for instrumental variable regression by viewing the problem as a conditional stochastic optimization problem. In the context of least-squares instrumental variable regression, our algorithms neither require matrix inversions nor mini-batches and provides a fully online approach for performing instrumental variable regression with streaming data. When the true model is linear, we derive rates of convergence in expectation, that are of order $\mathcal{O}(\log T/T)$ and $\mathcal{O}(1/T^{1-ι})$ for any $ι>0$, respectively under the availability of two-sample and one-sample oracles, respectively, where $T$ is the number of iterations. Importantly, under the availability of the two-sample oracle, our procedure avoids explicitly modeling and estimating the relationship between confounder and the instrumental variables, demonstrating the benefit of the proposed approach over recent works based on reformulating the problem as minimax optimization problems. Numerical experiments are provided to corroborate the theoretical results. △ Less

Submitted 29 May, 2024; originally announced May 2024.

arXiv:2405.19307 [pdf, other]

Data Efficient Behavior Cloning for Fine Manipulation via Continuity-based Corrective Labels

Authors: Abhay Deshpande, Liyiming Ke, Quinn Pfeifer, Abhishek Gupta, Siddhartha S. Srinivasa

Abstract: We consider imitation learning with access only to expert demonstrations, whose real-world application is often limited by covariate shift due to compounding errors during execution. We investigate the effectiveness of the Continuity-based Corrective Labels for Imitation Learning (CCIL) framework in mitigating this issue for real-world fine manipulation tasks. CCIL generates corrective labels by l… ▽ More We consider imitation learning with access only to expert demonstrations, whose real-world application is often limited by covariate shift due to compounding errors during execution. We investigate the effectiveness of the Continuity-based Corrective Labels for Imitation Learning (CCIL) framework in mitigating this issue for real-world fine manipulation tasks. CCIL generates corrective labels by learning a locally continuous dynamics model from demonstrations to guide the agent back toward expert states. Through extensive experiments on peg insertion and fine gras**, we provide the first empirical validation that CCIL can significantly improve imitation learning performance despite discontinuities present in contact-rich manipulation. We find that: (1) real-world manipulation exhibits sufficient local smoothness to apply CCIL, (2) generated corrective labels are most beneficial in low-data regimes, and (3) label filtering based on estimated dynamics model error enables performance gains. To effectively apply CCIL to robotic domains, we offer a practical instantiation of the framework and insights into design choices and hyperparameter selection. Our work demonstrates CCIL's practicality for alleviating compounding errors in imitation learning on physical robots. △ Less

Submitted 3 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

arXiv:2405.18508 [pdf, other]

Exploring waveforms with non-GR deviations for extreme mass-ratio inspirals

Authors: Shailesh Kumar, Rishabh Kumar Singh, Abhishek Chowdhuri, Arpan Bhattacharyya

Abstract: The fundamental process of detecting and examining the polarization modes of gravitational waves plays a pivotal role in enhancing our grasp on the precise mechanisms behind their generation. A thorough investigation is essential for delving deeper into the essence of gravitational waves and rigorously evaluating and validating the range of modified gravity theories. In this line of interest, a ge… ▽ More The fundamental process of detecting and examining the polarization modes of gravitational waves plays a pivotal role in enhancing our grasp on the precise mechanisms behind their generation. A thorough investigation is essential for delving deeper into the essence of gravitational waves and rigorously evaluating and validating the range of modified gravity theories. In this line of interest, a general description of black holes in theories beyond general relativity can serve a meaningful purpose where distinct deviation parameters can be mapped to solutions representing distinct theories. Employing a refined version of the deformed Kerr geometry, which is free from pathological behaviours such as unphysical divergences in the metric, we explore an extreme mass-ratio inspiral system, wherein a stellar-mass object perturbs a supermassive black hole. We compute the effects of deformation parameters on gravitational wave fluxes, orbital evolution and phase dynamics with leading order post-Newtonian corrections. With the waveform analysis, we assess the plausibility of detecting deviations from general relativity through observations facilitated by the Laser Interferometer Space Antenna (LISA), simultaneously constraining the extent of these deviations. Therefore, this analysis provides an understanding while highlighting the essential role of observations in advancing gravitational phenomena beyond general relativity. △ Less

Submitted 28 May, 2024; originally announced May 2024.

Comments: 26 pages, 3 Figures

Showing 1–50 of 3,137 results for author: Abhishek