-
Magnetic Hysteresis Modeling with Neural Operators
Authors:
Abhishek Chandra,
Bram Daniels,
Mitrofan Curti,
Koen Tiels,
Elena A. Lomonova
Abstract:
Hysteresis modeling is crucial to comprehend the behavior of magnetic devices, facilitating optimal designs. Hitherto, deep learning-based methods employed to model hysteresis face challenges in generalizing to novel input magnetic fields. This paper addresses the generalization challenge by proposing neural operators for modeling constitutive laws that exhibit magnetic hysteresis by learning a mapping between magnetic fields. In particular, two prominent neural operators -- deep operator network and Fourier neural operator -- are employed to predict novel first-order reversal curves and minor loops, where novel means they are not used to train the model. In addition, a rate-independent Fourier neural operator is proposed to predict material responses at sampling rates different from those used during training to incorporate the rate-independent characteristics of magnetic hysteresis. The presented numerical experiments demonstrate that neural operators efficiently model magnetic hysteresis, outperforming the traditional neural recurrent methods on various metrics and generalizing to novel magnetic fields. The findings emphasize the advantages of using neural operators for modeling hysteresis under varying magnetic conditions, underscoring their importance in characterizing magnetic material-based devices.
Submitted 3 July, 2024;
originally announced July 2024.
-
Performance Comparison of ROS2 Middlewares for Multi-robot Mesh Networks in Planetary Exploration
Authors:
Loïck Pierre Chovet,
Gabriel Manuel Garcia,
Abhishek Bera,
Antoine Richard,
Kazuya Yoshida,
Miguel Angel Olivares-Mendez
Abstract:
Recent advancements in Multi-Robot Systems (MRS) and mesh network technologies pave the way for innovative approaches to explore extreme environments. The Artemis Accords, a series of international agreements, have further catalyzed this progress by fostering cooperation in space exploration, emphasizing the use of cutting-edge technologies. In parallel, the widespread adoption of the Robot Operating System 2 (ROS 2) by companies across various sectors underscores its robustness and versatility. This paper evaluates the performance of available ROS 2 MiddleWare (RMW), such as FastRTPS, CycloneDDS and Zenoh, over a mesh network with a dynamic topology. The final choice of RMW is the one that best fits the scenario: exploration of an extreme extra-terrestrial environment using an MRS. The study, conducted in a real environment, highlights Zenoh as a potential solution for future applications, showing favorable delay, reachability, and CPU usage while being competitive on data overhead and RAM usage over a dynamic mesh topology.
Submitted 3 July, 2024;
originally announced July 2024.
-
Investigation of injector-coupled combustion dynamics in a methane-oxygen combustor using large eddy simulation and dynamic mode decomposition
Authors:
Abhishek Sharma,
Ashoke De,
Sunil Kumar
Abstract:
This paper uses a reactive flow large eddy simulation (LES) and decomposition techniques to study combustion instabilities in a methane-oxygen combustor. This work examines two case scenarios to elucidate the significance of injector-chamber frequency coupling as the cause of thermo-acoustic instability. An initial investigation of a well-known benchmark case, the continuously variable resonance combustor (CVRC), reports the potential instability mechanisms and the role of injector-chamber frequency coupling in thermo-acoustic instability. Subsequently, the multi-element rocket combustor case study identifies the critical resonant modes and highlights potential frequency coupling between the injector and the chamber region. The interplay between longitudinal pressure oscillations in the oxidizer post and transverse pressure waves in the chamber is responsible for the enhanced pressure dynamics in the combustor. The present work uses the dynamic mode decomposition (DMD) technique to reveal the evolution of acoustic modes in the injector and chamber for the CVRC and the multi-element combustor. The dominant pressure mode forms found by DMD analysis also showcase the role of injector-chamber frequency coupling in amplified combustion dynamics. The results demonstrate how the predominant cause of combustion instability in rocket combustors can be effectively determined using the high-fidelity LES framework in conjunction with the modal decomposition technique.
Submitted 3 July, 2024;
originally announced July 2024.
-
Holographic CFT thermodynamics of charged, rotating black holes in $D=4$ dimension
Authors:
Abhishek Baruah,
Prabwal Phukon
Abstract:
We study the holographic thermodynamics of $4-D$ Kerr-Newman AdS black holes. We consider the conformal thermal states dual to KN AdS black holes and work out the corresponding thermodynamics in 10 ensembles. These ensembles are: fixed $(\mathcal{Q},\mathcal{J},\mathcal{V},C)$, fixed $(\mathcal{Q},Ω,\mathcal{V},C)$, fixed $(\varphi,Ω,\mathcal{V},C)$, fixed $(\varphi,\mathcal{J},\mathcal{V},C)$, fixed $(\mathcal{Q},\mathcal{J},p,C)$, fixed $(\mathcal{Q},Ω,p,C)$, fixed $(\varphi,\mathcal{J},p,C)$, fixed $(\mathcal{Q},\mathcal{J},p,μ)$ and fixed $(\varphi,Ω,p,μ)$ ensembles. Here $\varphi$, $\mathcal{Q}$, $Ω$, $\mathcal{J}$, $p$, $\mathcal{V}$, $C$ and $μ$ denote the electric potential, electric charge, angular velocity, angular momentum, CFT pressure, CFT volume, central charge, and chemical potential, respectively. In the fixed $(\mathcal{Q},\mathcal{J},\mathcal{V},C)$ ensemble, we observe a first-order phase transition for $\mathcal{Q}<\mathcal{Q}_{crit}$, $\mathcal{J}<\mathcal{J}_{crit}$ and $C>C_{crit}$. In the fixed $(\mathcal{Q},Ω,\mathcal{V},C)$ ensemble, we again find a first-order phase transition for $\mathcal{Q}<\mathcal{Q}_{crit}$, $Ω<Ω_{crit}$ and $C>C_{crit}$. The fixed $(\varphi,Ω,\mathcal{V},C)$ ensemble is characterized by a confinement/de-confinement phase transition. In the fixed $(\varphi,\mathcal{J},\mathcal{V},C)$ ensemble, we see a first-order phase transition for $\mathcal{J}<\mathcal{J}_{crit}$, $\varphi<\varphi_{crit}$ and $C>C_{crit}$. Finally, in the fixed $(\mathcal{Q},\mathcal{J},p,C)$, $(\mathcal{Q},Ω,p,C)$, $(\varphi,\mathcal{J},p,C)$, $(\mathcal{Q},\mathcal{J},p,μ)$ and $(\varphi,Ω,p,μ)$ ensembles, we do not observe any critical behavior or phase transition.
Submitted 3 July, 2024;
originally announced July 2024.
-
Game-Based Discovery: Harnessing Mini-Games within Primary Games for Scientific Data Collection and Problem Solving
Authors:
Abhishek Phadke,
Mamta Yadav,
Stanislav Ustymenko
Abstract:
In the popular video game Batman: Arkham Knight, produced by Rocksteady Studios and released in 2015, the primary protagonist of the game is Batman, a vigilante dressed as a bat, fighting crime from the shadows in the fictitious city of Gotham. The game involves a real-world player who takes up the role of Batman to solve a peculiar side mission wherein they have to reconstruct the clean DNA sequence of a human and separate it from mutant DNA to manufacture an antidote to cure the villain. Although this is undoubtedly a fascinating part of the game, one that was absent in previous Batman games, it showcases an interesting notion of using mini-games embedded within primary games to achieve a particular real-world research objective. Although the DNA data used in this case was not real, there are multiple such instances in video games where mini-games have been used for an underlying motive besides entertainment. Based on popular case studies incorporating a similar method, this study characterizes the methodology of designing mini-games within primary games for research purposes into a descriptive framework, highlighting the process's advantages and limitations. It is concluded that these mini-games not only facilitate a deeper understanding of complex scientific concepts but also accelerate data processing and analysis by leveraging crowd-sourced human intuition and pattern recognition capabilities. This paper argues for strategically incorporating miniaturized, gamified elements into established video games that are mainly intended for recreational purposes.
Submitted 2 July, 2024;
originally announced July 2024.
-
Dissipative tidal effects to next-to-leading order and constraints on the dissipative tidal deformability using gravitational wave data
Authors:
Abhishek Hegade K. R.,
Justin L. Ripley,
Nicolás Yunes
Abstract:
Dissipative tidal interactions can be used to probe the out-of-equilibrium physics of neutron stars using gravitational wave observations. In this paper, we present the first post-Newtonian (PN) corrections to the orbital dynamics of a binary system containing objects whose tidal interactions have a dissipative contribution. We derive the 1PN-accurate equations of motion in the center-of-mass frame and a generalized energy-balance law that is valid for dissipative tidal interactions. We show how mass and energy loss due to the absorption of orbital energy change the orbital dynamics and derive the next-to-leading order correction to the gravitational wave phase of a binary system in a quasi-circular orbit containing initially non-spinning components. We then use this waveform model to constrain, for the first time, the individual dissipative tidal deformabilities of each of the binary components that generated the GW170817 event using real data. We find that the GW170817 data requires $Ξ_{1} \lesssim 1121$ and $Ξ_{2} \lesssim 1692$ at 90\% confidence, where $Ξ_{1,2}$ are the individual tidal deformabilities of the primary and secondary binary components that produced the GW170817 event.
Submitted 2 July, 2024;
originally announced July 2024.
-
A detailed study of the very-high-energy Crab pulsar emission with the LST-1
Authors:
CTA-LST Project,
K. Abe,
S. Abe,
A. Abhishek,
F. Acero,
A. Aguasca-Cabot,
I. Agudo,
N. Alvarez Crespo,
L. A. Antonelli,
C. Aramo,
A. Arbet-Engels,
C. Arcaro,
M. Artero,
K. Asano,
P. Aubert,
A. Baktash,
A. Bamba,
A. Baquero Larriva,
L. Baroncelli,
U. Barres de Almeida,
J. A. Barrio,
I. Batkovic,
J. Baxter,
J. Becerra González
, et al. (272 additional authors not shown)
Abstract:
Context: There are currently three pulsars firmly detected by imaging atmospheric Cherenkov telescopes (IACTs), two of them reaching TeV energies, challenging models of very-high-energy (VHE) emission in pulsars. More precise observations are needed to better characterize pulsar emission at these energies. The LST-1 is the prototype of the Large-Sized Telescope, which will be part of the Cherenkov Telescope Array Observatory (CTAO). Its improved performance over previous IACTs makes it well suited for studying pulsars. Aims: To study the Crab pulsar emission with the LST-1, improving and complementing the results from other telescopes. These observations can also be used to characterize the potential of the LST-1 to study other pulsars and detect new ones. Methods: We analyzed a total of $\sim$103 hours of gamma-ray observations of the Crab pulsar conducted with the LST-1 in the period from September 2020 to January 2023. The observations were carried out at zenith angles less than 50 degrees. A new analysis of the Fermi-LAT data was also performed, including $\sim$14 years of observations. Results: The Crab pulsar phaseogram, long-term light-curve, and phase-resolved spectra are reconstructed with the LST-1 from 20 GeV to 450 GeV for P1 and up to 700 GeV for P2. The pulsed emission is detected with a significance of 15.2$σ$. The two characteristic emission peaks of the Crab pulsar are clearly detected (>10$σ$), as well as the so-called bridge emission (5.7$σ$). We find that both peaks are well described by power laws, with spectral indices of $\sim$3.44 and $\sim$3.03, respectively. The joint analysis of Fermi-LAT and LST-1 data shows good agreement between the two instruments in the overlapping energy range. The detailed results obtained in the first observations of the Crab pulsar with LST-1 show the potential that CTAO will have to study this type of source.
Submitted 2 July, 2024;
originally announced July 2024.
-
Probing the connection between IceCube neutrinos and MOJAVE AGN
Authors:
R. Abbasi,
M. Ackermann,
J. Adams,
S. K. Agarwalla,
J. A. Aguilar,
M. Ahlers,
J. M. Alameddine,
N. M. Amin,
K. Andeen,
C. Argüelles,
Y. Ashida,
S. Athanasiadou,
L. Ausborm,
S. N. Axani,
X. Bai,
A. Balagopal V.,
M. Baricevic,
S. W. Barwick,
S. Bash,
V. Basu,
R. Bay,
J. J. Beatty,
J. Becker Tjus,
J. Beise,
C. Bellenghi
, et al. (399 additional authors not shown)
Abstract:
Active Galactic Nuclei (AGN) are prime candidate sources of the high-energy, astrophysical neutrinos detected by IceCube. This is demonstrated by the real-time multi-messenger detection of the blazar TXS 0506+056 and the recent evidence of neutrino emission from NGC 1068 from a separate time-averaged study. However, the production mechanism of the astrophysical neutrinos in AGN is not well established, a question that can be addressed via correlation studies with photon observations. For neutrinos produced due to photohadronic interactions in AGN, in addition to a correlation of neutrinos with high-energy photons, there would also be a correlation of neutrinos with photons emitted at radio wavelengths. In this work, we perform an in-depth stacking study of the correlation between 15 GHz radio observations of AGN reported in the MOJAVE XV catalog, and ten years of neutrino data from IceCube. We also use a time-dependent approach which improves the statistical power of the stacking analysis. No significant correlation was found in either analysis, and upper limits are reported. When compared to the IceCube diffuse flux, at 100 TeV and for a spectral index of 2.5, the upper limits derived are $\sim3\%$ and $\sim9\%$ for the time-averaged and time-dependent case, respectively.
Submitted 1 July, 2024;
originally announced July 2024.
-
Understanding Routing-Induced Censorship Changes Globally
Authors:
Abhishek Bhaskar,
Paul Pearce
Abstract:
Internet censorship is pervasive, with significant effort dedicated to understanding what is censored, and where. Prior censorship work, however, has identified significant inconsistencies in its results; experiments show unexplained non-determinism thought to be caused by censor load, end-host geographic diversity, or incomplete censorship -- inconsistencies which impede reliable, repeatable and correct understanding of global censorship. In this work we investigate the extent to which Equal-cost Multi-path (ECMP) routing is the cause of these inconsistencies, developing methods to measure and compensate for them. We find ECMP routing significantly changes observed censorship across protocols, censor mechanisms, and in 17 countries. We identify that previously observed non-determinism or regional variations are attributable to measurements between fixed end-hosts taking different routes based on Flow-ID; i.e., choice of intra-subnet source IP or ephemeral source port leads to differences in observed censorship. To achieve this we develop new route-stable censorship measurement methods that allow consistent measurement of DNS, HTTP, and HTTPS censorship. We find ECMP routing yields censorship changes across 42% of IPs and 51% of ASes, but that impact is not uniform. We identify numerous causes of the behavior, ranging from likely failed infrastructure, to routes to the same end-host taking geographically diverse paths which experience differences in censorship en-route. Finally, we explore our results in the context of prior global measurement studies, exploring first the applicability of our findings to prior observed variations, and then demonstrating how specific experiments from two studies could be impacted by, and specific results are explainable by, ECMP routing. Our work points to methods for improving future studies, reducing inconsistencies and increasing repeatability.
Submitted 27 June, 2024;
originally announced June 2024.
-
Autonomous Control of a Novel Closed Chain Five Bar Active Suspension via Deep Reinforcement Learning
Authors:
Nishesh Singh,
Sidharth Ramesh,
Abhishek Shankar,
Jyotishka Duttagupta,
Leander Stephen D'Souza,
Sanjay Singh
Abstract:
Planetary exploration requires traversal in environments with rugged terrains. In addition, Mars rovers and other planetary exploration robots often carry sensitive scientific experiments and components onboard, which must be protected from mechanical harm. This paper deals with an active suspension system focused on chassis stabilisation and an efficient traversal method while encountering unavoidable obstacles. Soft Actor-Critic (SAC) was applied along with Proportional Integral Derivative (PID) control to stabilise the chassis and traverse large obstacles at low speeds. The model uses the rover's distance from surrounding obstacles, the height of the obstacle, and the chassis' orientation to actuate the control links of the suspension accurately. Simulations carried out in the Gazebo environment are used to validate the proposed active system.
Submitted 30 June, 2024; v1 submitted 27 June, 2024;
originally announced June 2024.
-
Application of Liquid Rank Reputation System for Twitter Trend Analysis on Bitcoin
Authors:
Abhishek Saxena,
Anton Kolonin
Abstract:
Analyzing social media trends can create a win-win situation for both creators and consumers. Creators can receive fair compensation, while consumers gain access to engaging, relevant, and personalized content. This paper proposes a new model for analyzing Bitcoin trends on Twitter by incorporating a 'liquid democracy' approach based on user reputation. This system aims to identify the most impactful trends and their influence on Bitcoin prices and trading volume. It uses a Twitter sentiment analysis model based on a reputation rating system to determine the impact on Bitcoin price change and traded volume. In addition, the reputation model considers the users' higher-order friends on the social network (the initial Twitter input channels in our case study) to improve the accuracy and diversity of the reputation results. We analyze Bitcoin-related news on Twitter to understand how trends and user sentiment, measured through our Liquid Rank Reputation System, affect Bitcoin price fluctuations and trading activity within the studied time frame. This reputation model can also be used as an additional layer in other trend and sentiment analysis models. The paper presents the implementation, challenges, and future scope of the liquid rank reputation model.
Submitted 25 June, 2024;
originally announced June 2024.
-
KANQAS: Kolmogorov Arnold Network for Quantum Architecture Search
Authors:
Akash Kundu,
Aritra Sarkar,
Abhishek Sadhu
Abstract:
Quantum architecture search (QAS) is a promising direction for optimization and automated design of quantum circuits towards quantum advantage. Recent techniques in QAS focus on machine learning-based approaches from reinforcement learning, like deep Q-network. While multi-layer perceptron (MLP)-based deep Q-networks have been applied for QAS, their interpretability remains challenging due to the high number of parameters. In this work, we evaluate the practicality of Kolmogorov-Arnold networks (KANs) in quantum architecture search problems, analyzing their efficiency in terms of the probability of success, frequency of optimal solutions and their dependencies on various degrees of freedom of the network. In a noiseless scenario, the probability of success and the number of optimal quantum circuit configurations to generate the multi-qubit maximally entangled states are significantly higher than with MLPs. Moreover, in noisy scenarios, KAN can achieve a better fidelity in approximating the maximally entangled state than MLPs, where the performance of the MLP significantly depends on the choice of activation function. Further investigation reveals that KAN requires a very small number of learnable parameters compared to MLPs; however, the average time of executing each episode for KAN is much higher.
Submitted 25 June, 2024;
originally announced June 2024.
-
Linac_Gen: integrating machine learning and particle-in-cell methods for enhanced beam dynamics at Fermilab
Authors:
Abhishek Pathak
Abstract:
Here, we introduce Linac_Gen, a tool developed at Fermilab, which combines machine learning algorithms with Particle-in-Cell methods to advance beam dynamics in linacs. Linac_Gen employs techniques such as Random Forest, Genetic Algorithms, Support Vector Machines, and Neural Networks, achieving a tenfold increase in speed for phase-space matching in linacs over traditional methods through the use of genetic algorithms. Crucially, Linac_Gen's adept handling of 3D field maps elevates the precision and realism in simulating beam instabilities and resonances, marking a key advancement in the field. Benchmarked against established codes, Linac_Gen demonstrates not only improved efficiency and precision in beam dynamics studies but also in the design and optimization of linac systems, as evidenced in its application to Fermilab's PIP-II linac project. This work represents a notable advancement in accelerator physics, marrying ML with PIC methods to set new standards for efficiency and accuracy in accelerator design and research. Linac_Gen exemplifies a novel approach in accelerator technology, offering substantial improvements in both theoretical and practical aspects of beam dynamics.
Submitted 24 June, 2024;
originally announced June 2024.
-
Found in the Middle: Calibrating Positional Attention Bias Improves Long Context Utilization
Authors:
Cheng-Yu Hsieh,
Yung-Sung Chuang,
Chun-Liang Li,
Zifeng Wang,
Long T. Le,
Abhishek Kumar,
James Glass,
Alexander Ratner,
Chen-Yu Lee,
Ranjay Krishna,
Tomas Pfister
Abstract:
Large language models (LLMs), even when specifically trained to process long input contexts, struggle to capture relevant information located in the middle of their input. This phenomenon is known as the lost-in-the-middle problem. In this work, we make three contributions. First, we set out to understand the factors that cause this phenomenon. In doing so, we establish a connection between lost-in-the-middle and LLMs' intrinsic attention bias: LLMs exhibit a U-shaped attention bias where the tokens at the beginning and at the end of their input receive higher attention, regardless of their relevance. Second, we mitigate this positional bias through a calibration mechanism, found-in-the-middle, that allows the model to attend to contexts faithfully according to their relevance, even when they are in the middle. Third, we show found-in-the-middle not only achieves better performance in locating relevant information within a long context, but also eventually leads to improved retrieval-augmented generation (RAG) performance across various tasks, outperforming existing methods by up to 15 percentage points. These findings open up future directions in understanding LLM attention bias and its potential consequences.
Submitted 3 July, 2024; v1 submitted 23 June, 2024;
originally announced June 2024.
-
Circular Polarization of Simulated Images of Black Holes
Authors:
Abhishek V. Joshi,
Ben S. Prather,
Chi-kwan Chan,
Maciek Wielgus,
Charles F. Gammie
Abstract:
Models of the resolved Event Horizon Telescope (EHT) sources Sgr A* and M87* are constrained by observations at multiple wavelengths, resolutions, polarizations, and time cadences. In this paper we compare unresolved circular polarization (CP) measurements to a library of models, where each model is characterized by a distribution of CP over time. In the library we vary the spin of the black hole, the magnetic field strength at the horizon (i.e. both SANE and MAD models), the observer inclination, a parameter for the maximum ion-electron temperature ratio assuming a thermal plasma, and the direction of the magnetic field dipole moment. We find that ALMA observations of Sgr A* are inconsistent with all edge-on ($i = 90^\circ$) models. Restricting attention to the magnetically arrested disk (MAD) models favored by earlier EHT studies of Sgr A*, we find that only models with magnetic dipole moment pointing away from the observer are consistent with ALMA data. We also note that in 26 of the 27 passing MAD models the accretion flow rotates clockwise on the sky. We provide a table of the mean and standard deviation of the CP distributions for all model parameters along with their trends.
Submitted 21 June, 2024;
originally announced June 2024.
-
Efficient Human Pose Estimation: Leveraging Advanced Techniques with MediaPipe
Authors:
Sandeep Singh Sengar,
Abhishek Kumar,
Owen Singh
Abstract:
This study presents significant enhancements in human pose estimation using the MediaPipe framework. The research focuses on improving accuracy, computational efficiency, and real-time processing capabilities by comprehensively optimising the underlying algorithms. Novel modifications are introduced that substantially enhance pose estimation accuracy across challenging scenarios, such as dynamic movements and partial occlusions. The improved framework is benchmarked against traditional models, demonstrating considerable precision and computational speed gains. The advancements have wide-ranging applications in augmented reality, sports analytics, and healthcare, enabling more immersive experiences, refined performance analysis, and advanced patient monitoring. The study also explores the integration of these enhancements within mobile and embedded systems, addressing the need for computational efficiency and broader accessibility. The implications of this research set a new benchmark for real-time human pose estimation technologies and pave the way for future innovations in the field. The implementation code for the paper is available at https://github.com/avhixd/Human_pose_estimation.
Submitted 21 June, 2024;
originally announced June 2024.
-
News Deja Vu: Connecting Past and Present with Semantic Search
Authors:
Brevin Franklin,
Emily Silcock,
Abhishek Arora,
Tom Bryan,
Melissa Dell
Abstract:
Social scientists and the general public often analyze contemporary events by drawing parallels with the past, a process complicated by the vast, noisy, and unstructured nature of historical texts. For example, hundreds of millions of page scans from historical newspapers have been noisily transcribed. Traditional sparse methods for searching for relevant material in these vast corpora, e.g., with keywords, can be brittle given complex vocabularies and OCR noise. This study introduces News Deja Vu, a novel semantic search tool that leverages transformer large language models and a bi-encoder approach to identify historical news articles that are most similar to modern news queries. News Deja Vu first recognizes and masks entities, in order to focus on broader parallels rather than the specific named entities being discussed. Then, a contrastively trained, lightweight bi-encoder retrieves historical articles that are most similar semantically to a modern query, illustrating how phenomena that might seem unique to the present have varied historical precedents. Aimed at social scientists, the user-friendly News Deja Vu package is designed to be accessible for those who lack extensive familiarity with deep learning. It works with large text datasets, and we show how it can be deployed to a massive scale corpus of historical, open-source news articles. While human expertise remains important for drawing deeper insights, News Deja Vu provides a powerful tool for exploring parallels in how people have perceived past and present.
Submitted 21 June, 2024;
originally announced June 2024.
-
Contrastive Entity Coreference and Disambiguation for Historical Texts
Authors:
Abhishek Arora,
Emily Silcock,
Leander Heldring,
Melissa Dell
Abstract:
Massive-scale historical document collections are crucial for social science research. Despite increasing digitization, these documents typically lack unique cross-document identifiers for individuals mentioned within the texts, as well as individual identifiers from external knowledgebases like Wikipedia/Wikidata. Existing entity disambiguation methods often fall short in accuracy for historical documents, which are replete with individuals not remembered in contemporary knowledgebases. This study makes three key contributions to improve cross-document coreference resolution and disambiguation in historical texts: a massive-scale training dataset replete with hard negatives, which sources over 190 million entity pairs from Wikipedia contexts and disambiguation pages; high-quality evaluation data from hand-labeled historical newswire articles; and trained models evaluated on this historical benchmark. We contrastively train bi-encoder models for coreferencing and disambiguating individuals in historical texts, achieving accurate, scalable performance that identifies out-of-knowledgebase individuals. Our approach significantly surpasses other entity disambiguation models on our historical newswire benchmark. Our models also demonstrate competitive performance on modern entity disambiguation benchmarks, particularly certain news disambiguation datasets.
Submitted 21 June, 2024;
originally announced June 2024.
-
AMC: Access to Miss Correlation Prefetcher for Evolving Graph Analytics
Authors:
Abhishek Singh,
Christian Schulte,
Xiaochen Guo
Abstract:
Modern memory hierarchies work well with applications that have good spatial locality. Evolving (dynamic) graphs are important applications widely used to model graphs and networks with edge and vertex changes. They exhibit irregular memory access patterns and suffer from a high miss ratio and long miss penalty. Prefetching can be employed to predict and fetch future demand misses. However, current hardware prefetchers cannot efficiently predict for applications with irregular memory accesses. In evolving graph applications, vertices that do not change during graph changes exhibit the same access correlation patterns. Current temporal prefetchers use one-to-one or one-to-many correlation to exploit these patterns. Similar patterns are recorded in the same entry, which causes aliasing and can lead to poor prefetch accuracy and coverage. This work proposes a software-assisted hardware prefetcher for evolving graphs. The key idea is to record the correlations between a sequence of vertex accesses and the following misses and then prefetch when the same vertex access sequence occurs in the future. The proposed Access-to-Miss Correlation (AMC) prefetcher provides a lightweight programming interface to identify the data structures of interest and sets the iteration boundary to update the correlation table. For the evaluated applications, AMC achieves a geomean speedup of 1.5x as compared to the best-performing prefetcher in prior work (VLDP). AMC can achieve an average of 62% accuracy and coverage, whereas VLDP has an accuracy of 31% and coverage of 23%.
Submitted 20 June, 2024;
originally announced June 2024.
-
Superfluid stiffness of twisted multilayer graphene superconductors
Authors:
Abhishek Banerjee,
Zeyu Hao,
Mary Kreidel,
Patrick Ledwith,
Isabelle Phinney,
Jeong Min Park,
Andrew M. Zimmerman,
Kenji Watanabe,
Takashi Taniguchi,
Robert M Westervelt,
Pablo Jarillo-Herrero,
Pavel A. Volkov,
Ashvin Vishwanath,
Kin Chung Fong,
Philip Kim
Abstract:
The robustness of the macroscopic quantum nature of a superconductor can be characterized by the superfluid stiffness, $ρ_s$, a quantity that describes the energy required to vary the phase of the macroscopic quantum wave function. In unconventional superconductors, such as cuprates, the low-temperature behavior of $ρ_s$ drastically differs from that of conventional superconductors due to quasiparticle excitations from gapless points (nodes) in momentum space. Intensive research on the recently discovered magic-angle twisted graphene family has revealed, in addition to superconducting states, strongly correlated electronic states associated with spontaneously broken symmetries, inviting the study of $ρ_s$ to uncover the potentially unconventional nature of its superconductivity. Here we report the measurement of $ρ_s$ in magic-angle twisted trilayer graphene (TTG), revealing unconventional nodal-gap superconductivity. Utilizing radio-frequency reflectometry techniques to measure the kinetic inductive response of superconducting TTG coupled to a microwave resonator, we find a linear temperature dependence of $ρ_s$ at low temperatures and nonlinear Meissner effects in the current bias dependence, both indicating nodal structures in the superconducting order parameter. Furthermore, the doping dependence shows a linear correlation between the zero temperature $ρ_s$ and the superconducting transition temperature $T_c$, reminiscent of Uemura's relation in cuprates, suggesting phase-coherence-limited superconductivity. Our results provide strong evidence for nodal superconductivity in TTG and put strong constraints on the mechanisms of these graphene-based superconductors.
Submitted 19 June, 2024;
originally announced June 2024.
-
Quantum steering under constrained free-will
Authors:
Abhishek Sadhu,
Siddhartha Das
Abstract:
Quantum steering is a kind of bipartite quantum correlation where one party's measurement remotely alters the state of another party. In an adversarial scenario, there could be a hidden variable introducing a bias in the choice of measurement settings of the parties. However, observers without access to the hidden variable are unaware of this bias. The main focus of this work is to analyze quantum steering without assuming that the parties freely choose their measurement settings. For this, we introduce the measurement-dependent (MD-)steering scenario where the measurement settings chosen by the parties are biased by an adversary. In such a scenario, we present a class of inequalities to test for MD-steerable correlations. Further, we discuss the implications of violating such inequalities in certifying randomness from quantum extremal behaviors. We also assume that an adversary might prepare an assemblage as a mixture of MD-steerable and MD-unsteerable assemblages and provide a bound on the measurement dependence for the observed correlation to remain MD-steerable.
Submitted 19 June, 2024;
originally announced June 2024.
-
Snowy Scenes, Clear Detections: A Robust Model for Traffic Light Detection in Adverse Weather Conditions
Authors:
Shivank Garg,
Abhishek Baghel,
Amit Agarwal,
Durga Toshniwal
Abstract:
With the rise of autonomous vehicles and advanced driver-assistance systems (ADAS), ensuring reliable object detection in all weather conditions is crucial for safety and efficiency. Adverse weather like snow, rain, and fog presents major challenges for current detection systems, often resulting in failures and potential safety risks. This paper introduces a novel framework and pipeline designed to improve object detection under such conditions, focusing on traffic signal detection where traditional methods often fail due to domain shifts caused by adverse weather. We provide a comprehensive analysis of the limitations of existing techniques. Our proposed pipeline significantly enhances detection accuracy in snow, rain, and fog. Results show a 40.8% improvement in average IoU and F1 scores compared to naive fine-tuning and a 22.4% performance increase in domain shift scenarios, such as training on artificial snow and testing on rain images.
Submitted 19 June, 2024;
originally announced June 2024.
-
Advancements in Orthopaedic Arm Segmentation: A Comprehensive Review
Authors:
Abhishek Swami,
Snehal Farande,
Atharv Patil,
Atharva Parle,
Vivekanand Mane,
Prathamesh Thorat
Abstract:
Recent advances in medical imaging have transformed diagnosis in the healthcare sector, especially the interpretation of X-ray images. The advent of digital image processing technology and the implementation of deep learning models such as Convolutional Neural Networks (CNNs) have made the analysis of X-rays much more accurate and efficient. This article reviews essential techniques such as edge detection, region growing, and thresholding, along with deep learning models such as variants of YOLOv8, which is the best object detection and segmentation framework. We further find that traditional image processing techniques such as segmentation are simple and provide an alternative to the advanced methods. Our review offers useful knowledge on the practical use of innovative and traditional approaches to manual X-ray interpretation. These findings will help professionals and researchers gain deeper knowledge of digital interpretation techniques in medical imaging.
Submitted 19 June, 2024;
originally announced June 2024.
-
InterIntent: Investigating Social Intelligence of LLMs via Intention Understanding in an Interactive Game Context
Authors:
Ziyi Liu,
Abhishek Anand,
Pei Zhou,
Jen-tse Huang,
Jieyu Zhao
Abstract:
Large language models (LLMs) have demonstrated the potential to mimic human social intelligence. However, most studies focus on simplistic and static self-report or performance-based tests, which limits the depth and validity of the analysis. In this paper, we developed a novel framework, InterIntent, to assess LLMs' social intelligence by mapping their ability to understand and manage intentions in a game setting. We focus on four dimensions of social intelligence: situational awareness, self-regulation, self-awareness, and theory of mind. Each dimension is linked to a specific game task: intention selection, intention following, intention summarization, and intention guessing. Our findings indicate that while LLMs exhibit high proficiency in selecting intentions, achieving an accuracy of 88%, their ability to infer the intentions of others is significantly weaker, trailing human performance by 20%. Additionally, game performance correlates with intention understanding, highlighting the importance of the four components towards success in this game. These findings underline the crucial role of intention understanding in evaluating LLMs' social intelligence and highlight the potential of using social deduction games as a complex testbed to enhance LLM evaluation. InterIntent contributes a structured approach to bridging the evaluation gap in social intelligence within multiplayer games.
Submitted 17 June, 2024;
originally announced June 2024.
-
Decomposed evaluations of geographic disparities in text-to-image models
Authors:
Abhishek Sureddy,
Dishant Padalia,
Nandhinee Periyakaruppa,
Oindrila Saha,
Adina Williams,
Adriana Romero-Soriano,
Megan Richards,
Polina Kirichenko,
Melissa Hall
Abstract:
Recent work has identified substantial disparities in generated images of different geographic regions, including stereotypical depictions of everyday objects like houses and cars. However, existing measures for these disparities have been limited to either human evaluations, which are time-consuming and costly, or automatic metrics evaluating full images, which are unable to attribute these disparities to specific parts of the generated images. In this work, we introduce a new set of metrics, Decomposed Indicators of Disparities in Image Generation (Decomposed-DIG), that allows us to separately measure geographic disparities in the depiction of objects and backgrounds in generated images. Using Decomposed-DIG, we audit a widely used latent diffusion model and find that generated images depict objects with better realism than backgrounds and that backgrounds in generated images tend to contain larger regional disparities than objects. We use Decomposed-DIG to pinpoint specific examples of disparities, such as stereotypical background generation in Africa, struggling to generate modern vehicles in Africa, and unrealistically placing some objects in outdoor settings. Informed by our metric, we use a new prompting structure that enables a 52% worst-region improvement and a 20% average improvement in generated background diversity.
Submitted 17 June, 2024;
originally announced June 2024.
-
On localizing groups of exotic diffeomorphisms of 4-manifolds
Authors:
Hokuto Konno,
Abhishek Mallick
Abstract:
Ruberman in the 90's showed that the group of exotic diffeomorphisms of closed 4-manifolds can be infinitely generated. We provide various results on the question of when such infinite generation can localize to a smaller embedded submanifold of the original manifold. Our results include: (1) All known infinitely generated groups of exotic diffeomorphisms of 4-manifolds detected by families Seiberg-Witten theory do not localize to any topologically (locally-flatly) embedded rational homology balls in the ambient 4-manifold. (2) Many exotic diffeomorphisms cannot be obtained as Dehn twists along homology spheres (under mild assumptions). (3) There are no contractible 4-manifolds with Seifert fibered boundary that have a universal property for exotic diffeomorphisms analogous to a universal cork. In addition, there is no universal compact 4-manifold $W$ such that the set of exotic diffeomorphisms of a 4-manifold can localize to an embedding of $W$. (4) Certain infinite generations of exotic diffeomorphism groups do localize to a non-compact subset $V$ with a small Betti number, but not to any compact subset of $V$. (5) An analogous result holds for mapping class groups of 4-manifolds.
Submitted 17 June, 2024;
originally announced June 2024.
-
Astronomical Spectroscopy with Skipper CCDs: First Results from a Skipper CCD Focal Plane Prototype at SIFS
Authors:
Edgar Marrufo Villalpando,
Alex Drlica-Wagner,
Brandon Roach,
Marco Bonati,
Abhishek Bakshi,
Julia Campa,
Gustavo Cancelo,
Braulio Cancino,
Claudio R. Chavez,
Fernando Chierchie,
Juan Estrada,
Guillermo Fernandez Moroni,
Luciano Fraga,
Manuel E. Gaido,
Stephen E. Holland,
Rachel Hur,
Michelle Jonas,
Peter Moore,
Eduardo Paolini,
Andrés A. Plazas Malagón,
Leandro Stefanazzi,
Javier Tiffenberg,
Ken Treptou,
Sho Uemura,
Neal Wilcer
Abstract:
We present the first on-sky results from an ultra-low-readout-noise Skipper CCD focal plane prototype for the SOAR Integral Field Spectrograph (SIFS). The Skipper CCD focal plane consists of four 6k x 1k, 15 $μ$m pixel, fully-depleted, p-channel devices that have been thinned to ~250 $μ$m, backside processed, and treated with an anti-reflective coating. These Skipper CCDs were configured for astronomical spectroscopy, i.e., single-sample readout noise < 4.3 e- rms/pixel, the ability to achieve multi-sample readout noise $\ll$ 1 e- rms/pixel, full-well capacities ~40,000-65,000 e-, low dark current and charge transfer inefficiency (~2 x 10$^{-4}$ e-/pixel/s and 3.44 x 10$^{-7}$, respectively), and an absolute quantum efficiency of $\gtrsim$ 80% between 450 nm and 980 nm ($\gtrsim$ 90% between 600 nm and 900 nm). We optimized the readout sequence timing to achieve sub-electron noise (~0.5 e- rms/pixel) in a region of 2k x 4k pixels and photon-counting noise (~0.22 e- rms/pixel) in a region of 220 x 4k pixels, each with a readout time of $\lesssim$ 17 min. We observed two quasars (HB89 1159+123 and QSO J1621-0042) at redshift z ~ 3.5, two high-redshift galaxy clusters (CL J1001+0220 and SPT-CL J2040-4451), an emission line galaxy at z = 0.3239, a candidate member star of the Boötes II ultra-faint dwarf galaxy, and five CALSPEC spectrophotometric standard stars (HD074000, HD60753, HD106252, HD101452, HD200654). We present charge-quantized, photon-counting observations of the quasar HB89 1159+123 and show the detector sensitivity increase for faint spectral features. We demonstrate signal-to-noise performance improvements for SIFS observations in the low-background, readout-noise-dominated regime. We outline scientific studies that will leverage the SIFS-Skipper CCD data and new detector architectures that utilize the Skipper floating gate amplifier with faster readout times.
Submitted 15 June, 2024;
originally announced June 2024.
-
Newswire: A Large-Scale Structured Database of a Century of Historical News
Authors:
Emily Silcock,
Abhishek Arora,
Luca D'Amico-Wong,
Melissa Dell
Abstract:
In the U.S. historically, local newspapers drew their content largely from newswires like the Associated Press. Historians argue that newswires played a pivotal role in creating a national identity and shared understanding of the world, but there is no comprehensive archive of the content sent over newswires. We reconstruct such an archive by applying a customized deep learning pipeline to hundreds of terabytes of raw image scans from thousands of local newspapers. The resulting dataset contains 2.7 million unique public domain U.S. newswire articles, written between 1878 and 1977. Locations in these articles are georeferenced, topics are tagged using customized neural topic classification, named entities are recognized, and individuals are disambiguated to Wikipedia using a novel entity disambiguation model. To construct the Newswire dataset, we first recognize newspaper layouts and transcribe around 138 million structured article texts from raw image scans. We then use a customized neural bi-encoder model to de-duplicate reproduced articles, in the presence of considerable abridgement and noise, quantifying how widely each article was reproduced. A text classifier is used to ensure that we only include newswire articles, which historically are in the public domain. The structured data that accompany the texts provide rich information about the who (disambiguated individuals), what (topics), and where (georeferencing) of the news that millions of Americans read over the course of a century. We also include Library of Congress metadata information about the newspapers that ran the articles on their front pages. The Newswire dataset is useful both for large language modeling - expanding training data beyond what is available from modern web texts - and for studying a diversity of questions in computational linguistics, social science, and the digital humanities.
Submitted 13 June, 2024;
originally announced June 2024.
-
Simulations of distributed-phase-reference quantum key distribution protocols
Authors:
Venkat Abhignan,
Abhishek Jamunkar,
Gokul Nair,
Mohit Mittal,
Megha Shrivastava
Abstract:
Quantum technology can enable secure communication for cryptography purposes using quantum key distribution. Quantum key distribution protocols provide a secret key between two users with security guaranteed by the laws of quantum mechanics. To define the proper implementation of a quantum key distribution system using a particular cryptography protocol, it is crucial to critically and meticulously assess the device's performance due to technological limitations in the components used. We perform simulations on the ANSYS Interconnect platform to characterise the practical implementation of these devices using the distributed-phase-reference protocols differential-phase-shift and coherent-one-way quantum key distribution. Further, we briefly describe and simulate some possible eavesdropping attempts (backflash, trojan-horse, and detector-blinding attacks) that exploit the device imperfections.
Submitted 13 June, 2024;
originally announced June 2024.
-
FastAST: Accelerating Audio Spectrogram Transformer via Token Merging and Cross-Model Knowledge Distillation
Authors:
Swarup Ranjan Behera,
Abhishek Dhiman,
Karthik Gowda,
Aalekhya Satya Narayani
Abstract:
Audio classification models, particularly the Audio Spectrogram Transformer (AST), play a crucial role in efficient audio analysis. However, optimizing their efficiency without compromising accuracy remains a challenge. In this paper, we introduce FastAST, a framework that integrates Token Merging (ToMe) into the AST framework. FastAST enhances inference speed without requiring extensive retraining by merging similar tokens in audio spectrograms. Furthermore, during training, FastAST brings about significant speed improvements. The experiments indicate that FastAST can increase audio classification throughput with minimal impact on accuracy. To mitigate the accuracy impact, we integrate Cross-Model Knowledge Distillation (CMKD) into the FastAST framework. Integrating ToMe and CMKD into AST results in improved accuracy compared to AST while maintaining faster inference speeds. FastAST represents a step towards real-time, resource-efficient audio analysis.
Submitted 11 June, 2024;
originally announced June 2024.
-
EEG classification for visual brain decoding with spatio-temporal and transformer based paradigms
Authors:
Akanksha Sharma,
Jyoti Nigam,
Abhishek Rathore,
Arnav Bhavsar
Abstract:
In this work, we delve into the EEG classification task in the domain of visual brain decoding via two frameworks, involving two different learning paradigms. Considering the spatio-temporal nature of EEG data, one of our frameworks is based on a CNN-BiLSTM model. The other involves a CNN-Transformer architecture which inherently involves the more versatile attention-based learning paradigm. In both cases, a special 1D-CNN feature extraction module is used to generate the initial embeddings with 1D convolutions in the time and the EEG channel domains. Considering that EEG signals are noisy and non-stationary and that the discriminative features are even less clear (than in semantically structured data such as text or image), we also use window-based classification followed by majority voting during inference, to yield labels at a signal level. To illustrate how brain patterns correlate with different image classes, we visualize t-SNE plots of the BiLSTM embeddings alongside brain activation maps for the top 10 classes. These visualizations provide insightful revelations into the distinct neural signatures associated with each visual category, showcasing the BiLSTM's capability to capture and represent the discriminative brain activity linked to visual stimuli. We demonstrate the performance of our approach on the updated EEG-Imagenet dataset with positive comparisons with state-of-the-art methods.
Submitted 11 June, 2024;
originally announced June 2024.
-
Constraints on Lorentz invariance violation from the extraordinary Mrk 421 flare of 2014 using a novel analysis method
Authors:
MAGIC Collaboration,
S. Abe,
J. Abhir,
A. Abhishek,
V. A. Acciari,
A. Aguasca-Cabot,
I. Agudo,
T. Aniello,
S. Ansoldi,
L. A. Antonelli,
A. Arbet Engels,
C. Arcaro,
M. Artero,
K. Asano,
A. Babić,
A. Baquero,
U. Barres de Almeida,
J. A. Barrio,
I. Batković,
A. Bautista,
J. Baxter,
J. Becerra González,
W. Bednarek,
E. Bernardini,
J. Bernete
, et al. (192 additional authors not shown)
Abstract:
The Lorentz Invariance Violation (LIV), a proposed consequence of certain quantum gravity (QG) scenarios, could instigate an energy-dependent group velocity for ultra-relativistic particles. This energy dependence, although suppressed by the massive QG energy scale $E_\mathrm{QG}$, expected to be on the level of the Planck energy $1.22 \times 10^{19}$ GeV, is potentially detectable in astrophysical observations. In this scenario, the cosmological distances traversed by photons act as an amplifier for this effect. By leveraging the observation of a remarkable flare from the blazar Mrk 421, recorded at energies above 100 GeV by the MAGIC telescopes on the night of April 25 to 26, 2014, we look for time delays scaling linearly and quadratically with the photon energies. Using for the first time in LIV studies a binned-likelihood approach, we set constraints on the QG energy scale. For the linear scenario, we set $95\%$ lower limits $E_\mathrm{QG}>2.7\times10^{17}$ GeV for the subluminal case and $E_\mathrm{QG}> 3.6 \times10^{17}$ GeV for the superluminal case. For the quadratic scenario, the $95\%$ lower limits for the subluminal and superluminal cases are $E_\mathrm{QG}>2.6 \times10^{10}$ GeV and $E_\mathrm{QG}>2.5\times10^{10}$ GeV, respectively.
Submitted 11 June, 2024;
originally announced June 2024.
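For orientation, the linear and quadratic delay scalings mentioned in the abstract are often written in the following form. This is a sketch of a commonly used parameterization, not necessarily the exact expression adopted by the authors; the symbols $E_h$, $E_l$ (higher and lower photon energies), $n$ (1 for linear, 2 for quadratic), $D_n(z)$, $H_0$, $\Omega_m$ and $\Omega_\Lambda$ are assumptions introduced here and do not appear in the abstract.

```latex
% Assumed standard parameterization of the LIV-induced arrival-time difference
% between photons of energies E_h > E_l emitted simultaneously at redshift z.
% Sign convention (assumed): "+" subluminal, "-" superluminal.
\begin{align*}
  \Delta t &\simeq \pm \frac{n+1}{2}\,
      \frac{E_h^{\,n} - E_l^{\,n}}{E_\mathrm{QG}^{\,n}}\, D_n(z), \\
  D_n(z) &= \frac{1}{H_0} \int_0^{z}
      \frac{(1+z')^{n}}{\sqrt{\Omega_m (1+z')^{3} + \Omega_\Lambda}}\,
      \mathrm{d}z' .
\end{align*}
```

In this form the distance term $D_n(z)$ plays the role of the amplifier described in the abstract, while the ratio $(E/E_\mathrm{QG})^n$ carries the suppression.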
-
Electronic, optical, and transport properties of alkali metal oxides (Cs2O): A DFT study
Authors:
Anjali Kumari,
Kamal Kumar,
Abhishek Kumar Mishra,
Ramesh Sharma
Abstract:
The electronic, structural, optical, and thermoelectric properties of the Cs2O cubic structure have been investigated using density functional theory (DFT). The calculations utilize a fully relativistic version of the full-potential augmented plane-wave plus local orbitals method, employing both the GGA and LDA approximations. Additionally, we employed the GGA-mBJ approach proposed by Tran and Blaha for band structure computations, revealing the indirect band gap nature of Cs2O. The optical properties are also addressed by computing the refractive index, extinction coefficient, and complex dielectric tensor. The electrical conductivity, Seebeck coefficient, and thermal conductivity exhibit temperature-dependent variations, indicating the formation of a thermoelectric material. Our findings indicate that the compound under investigation is categorized as a p-type semiconductor, with the majority of charge carriers responsible for conduction being holes rather than electrons.
Submitted 9 June, 2024;
originally announced June 2024.
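As a reminder of how the optical quantities named in the abstract relate to one another, the textbook relations for a non-magnetic, optically isotropic medium are sketched below. They are given under the assumption that the cubic structure allows the dielectric tensor to be treated as a scalar function $\varepsilon(\omega)$; the symbols $\varepsilon_1$, $\varepsilon_2$, $n$ and $k$ are introduced here and are not taken from the abstract.

```latex
% Assumed textbook relations: complex dielectric function -> refractive index n
% and extinction coefficient k, for a non-magnetic isotropic medium.
\begin{align*}
  \varepsilon(\omega) &= \varepsilon_1(\omega) + i\,\varepsilon_2(\omega)
      = \bigl(n(\omega) + i\,k(\omega)\bigr)^{2}, \\
  n(\omega) &= \sqrt{\frac{\sqrt{\varepsilon_1^{2} + \varepsilon_2^{2}} + \varepsilon_1}{2}}, \qquad
  k(\omega) = \sqrt{\frac{\sqrt{\varepsilon_1^{2} + \varepsilon_2^{2}} - \varepsilon_1}{2}} .
\end{align*}
```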
-
Existence of Positive Solutions for Generalized Fractional Brézis-Nirenberg Problem
Authors:
Rohit Kumar,
Abhishek Sarkar
Abstract:
In this article, we study the fractional Brézis-Nirenberg type problem on the whole domain $\mathbb{R}^N$ associated with the fractional $p$-Laplace operator. To be precise, we want to study the following problem: \begin{equation*}
(-\Delta)_{p}^{s}u - \lambda w |u|^{p-2}u= |u|^{p_{s}^{*}-2}u \quad \text{in} ~\mathcal{D}^{s,p}(\mathbb{R}^{N}), \end{equation*} where $s\in (0,1),~p \in (1,\frac{N}{s}), ~p_{s}^{*}= \frac{Np}{N-sp}$ and the operator $(-\Delta)_{p}^{s}$ is the fractional $p$-Laplace operator. The space $\mathcal{D}^{s,p}(\mathbb{R}^{N})$ is the completion of $C_c^\infty(\mathbb{R}^N)$ with respect to the Gagliardo semi-norm. In this article, we prove the existence of a positive solution to this problem by allowing the Hardy weight $w$ to change its sign.
Submitted 9 June, 2024;
originally announced June 2024.
-
Born-Oppenheimer Potentials for $SU(3)$ Gauge Theory
Authors:
Fareed Alasiri,
Eric Braaten,
Abhishek Mohapatra
Abstract:
We develop parameterizations of 8 of the lowest Born-Oppenheimer potentials for quarkonium hybrid mesons as functions of the separation $r$ of the static quark and antiquark sources. The parameters are determined by fitting results calculated using pure $SU(3)$ lattice gauge theory. The parameterizations have the correct limiting behavior at small $r$, where the potentials form multiplets associated with gluelumps. They have the correct limiting behavior at large $r$, where the potentials form multiplets associated with excitations of a relativistic string. There is a narrow avoided crossing in the small-$r$ region between two potentials with the same Born-Oppenheimer quantum numbers.
Submitted 7 June, 2024;
originally announced June 2024.
-
Embracing Nonlinearity and Geometry: A dimensional analysis guided design of shock absorbing materials
Authors:
Abhishek Gupta,
Komal Chawla,
Ramathasan Thevamaran
Abstract:
Protective applications require energy-absorbing materials that are soft and compressible enough to absorb kinetic energy from impacts, yet stiff enough to bear crushing loads. Achieving this balance requires careful consideration of both mechanical properties and geometric design. Conventional shock-absorbing pads are made of very thick foams that exhibit a plateau of constant stress in their stress-strain response. Contrary to this belief, we report that foams with a nonlinear stress-strain response can be useful to achieve simultaneously thin and lightweight protective pads. We introduce a new framework for the thickness or volume-constrained design of compact and lightweight protective foams while ensuring the desired structural integrity and mechanical performance. Our streamlined dimensional analysis approach provides geometric constraints on the dimensionless thickness and cross-sectional area of a protective foam with a given stress-strain response to limit the acceleration and compressive strain within desired critical limits. We also identify optimal mechanical properties that will result in the most compact and lightest protective foam layer for absorbing a given kinetic energy of impact. Guided by this design framework, we achieve optimal protective properties in hierarchically architected vertically aligned carbon nanotube (VACNT) foams, enabling next generation protective applications in extreme environments.
Submitted 7 June, 2024;
originally announced June 2024.
-
Geometric Localization of Homology Cycles
Authors:
Amritendu Dhar,
Vijay Natarajan,
Abhishek Rathod
Abstract:
Computing an optimal cycle in a given homology class, also referred to as the homology localization problem, is known to be an NP-hard problem in general. Furthermore, there is currently no known optimality criterion that localizes classes geometrically and admits a stability property under the setting of persistent homology. We present a geometric optimization of the cycles that is computable in polynomial time and is stable in an approximate sense. Tailoring our search criterion to different settings, we obtain various optimization problems like optimal homologous cycle, minimum homology basis, and minimum persistent homology basis. In practice, the (trivial) exact algorithm is computationally expensive despite having a worst-case polynomial runtime. Therefore, we design approximation algorithms for the above problems and study their performance experimentally. These algorithms have reasonable runtimes for moderate-sized datasets, and the cycles computed by these algorithms are consistently of high quality, as demonstrated via experiments on multiple datasets.
Submitted 5 June, 2024;
originally announced June 2024.
-
Tolerant Algorithms for Learning with Arbitrary Covariate Shift
Authors:
Surbhi Goel,
Abhishek Shetty,
Konstantinos Stavropoulos,
Arsen Vasilyan
Abstract:
We study the problem of learning under arbitrary distribution shift, where the learner is trained on a labeled set from one distribution but evaluated on a different, potentially adversarially generated test distribution. We focus on two frameworks: PQ learning [Goldwasser, A. Kalai, Y. Kalai, Montasser NeurIPS 2020], allowing abstention on adversarially generated parts of the test distribution, and TDS learning [Klivans, Stavropoulos, Vasilyan COLT 2024], permitting abstention on the entire test distribution if distribution shift is detected. All prior known algorithms either rely on learning primitives that are computationally hard even for simple function classes, or end up abstaining entirely even in the presence of a tiny amount of distribution shift.
We address both these challenges for natural function classes, including intersections of halfspaces and decision trees, and standard training distributions, including Gaussians. For PQ learning, we give efficient learning algorithms, while for TDS learning, our algorithms can tolerate moderate amounts of distribution shift. At the core of our approach is an improved analysis of spectral outlier-removal techniques from learning with nasty noise. Our analysis can (1) handle an arbitrarily large fraction of outliers, which is crucial for handling arbitrary distribution shifts, and (2) obtain stronger bounds on polynomial moments of the distribution after outlier removal, yielding new insights into polynomial regression under distribution shifts. Lastly, our techniques lead to novel results for tolerant testable learning [Rubinfeld and Vasilyan STOC 2023], and learning with nasty noise.
Submitted 4 June, 2024;
originally announced June 2024.
-
Quantum-enabled continuous microwave-to-optics frequency conversion
Authors:
Han Zhao,
William David Chen,
Abhishek Kejriwal,
Mohammad Mirhosseini
Abstract:
A quantum interface between microwave and optical photons is essential for entangling remote superconducting quantum processors. To preserve fragile quantum states, a transducer must operate efficiently while generating less than one photon of noise referred to its input. Here, we present a platform that meets these criteria, utilizing a combination of electrostatic and optomechanical interactions in devices made entirely from crystalline silicon. This platform's small mechanical dissipation and low optical absorption enable ground-state radiative cooling, resulting in quantum-enabled operation with a continuous laser drive. Under the optimal settings for high efficiency (low noise), we measure an external efficiency of $2.2\%$ ($0.47\%$) and an input-referred added noise of $0.94$ ($0.58$) in microwave-to-optics conversion. We quantify the transducer throughput using the efficiency-bandwidth product, finding it exceeds previous demonstrations with similar noise performance by approximately two orders of magnitude, thereby paving a practical path to interconnecting remote superconducting qubits.
Submitted 4 June, 2024;
originally announced June 2024.
-
RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
Authors:
Soroush Nasiriany,
Abhiram Maddukuri,
Lance Zhang,
Adeet Parikh,
Aaron Lo,
Abhishek Joshi,
Ajay Mandlekar,
Yuke Zhu
Abstract:
Recent advancements in Artificial Intelligence (AI) have largely been propelled by scaling. In Robotics, scaling is hindered by the lack of access to massive robot datasets. We advocate using realistic physical simulation as a means to scale environments, tasks, and datasets for robot learning methods. We present RoboCasa, a large-scale simulation framework for training generalist robots in everyday environments. RoboCasa features realistic and diverse scenes focusing on kitchen environments. We provide thousands of 3D assets across over 150 object categories and dozens of interactable furniture and appliances. We enrich the realism and diversity of our simulation with generative AI tools, such as object assets from text-to-3D models and environment textures from text-to-image models. We design a set of 100 tasks for systematic evaluation, including composite tasks generated by the guidance of large language models. To facilitate learning, we provide high-quality human demonstrations and integrate automated trajectory generation methods to substantially enlarge our datasets with minimal human burden. Our experiments show a clear scaling trend in using synthetically generated robot data for large-scale imitation learning and show great promise in harnessing simulation data in real-world tasks. Videos and open-source code are available at https://robocasa.ai/
Submitted 4 June, 2024;
originally announced June 2024.
-
Investigating a Device Independence Quantum Random Number Generation
Authors:
Vardaan Mongia,
Abhishek Kumar,
Shashi Prabhakar,
Anindya Banerji,
R. P. Singh
Abstract:
Quantum random number generation (QRNG) is a resource that is a necessity in the field of cryptography. However, its certification has been challenging. In this article, we certify randomness with the aid of quantum entanglement in a device independent setting, where we choose two-photon interference for source characterisation. The CHSH inequality violation and quantum state tomography are used as independent checks on the measurement devices. These measures ensure the unpredictability of quantum random number generation. This work can be easily extended to faster randomness expansion protocols.
Submitted 3 June, 2024;
originally announced June 2024.
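The CHSH test named in the abstract can be illustrated with a minimal sketch. It assumes an idealized two-qubit singlet state with the textbook correlation $E(a,b) = -\cos(a-b)$ and the standard optimal measurement angles; the function names `singlet_correlation` and `chsh_value` are hypothetical, and nothing here reproduces the authors' photonic setup or data.

```python
import math


def singlet_correlation(a: float, b: float) -> float:
    """Ideal quantum correlation E(a, b) = -cos(a - b) for a singlet state.

    Assumption: textbook spin-1/2 singlet, noiseless detectors.
    """
    return -math.cos(a - b)


def chsh_value(a, a_prime, b, b_prime, corr=singlet_correlation) -> float:
    """CHSH combination S = |E(a,b) - E(a,b') + E(a',b) + E(a',b')|."""
    return abs(
        corr(a, b) - corr(a, b_prime) + corr(a_prime, b) + corr(a_prime, b_prime)
    )


# Standard angle choice that maximizes S for the singlet state.
a, a_prime = 0.0, math.pi / 2
b, b_prime = math.pi / 4, 3 * math.pi / 4

s = chsh_value(a, a_prime, b, b_prime)
print(f"S = {s:.4f}")  # local hidden-variable models satisfy S <= 2
```

With these angles the ideal value is $2\sqrt{2}$, above the classical bound of 2; a measured violation of that bound is what supports a device-independent randomness claim.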
-
Schrödinger Bridge with Quadratic State Cost is Exactly Solvable
Authors:
Alexis M. H. Teter,
Wenqing Wang,
Abhishek Halder
Abstract:
Schrödinger bridge is a diffusion process that steers a given distribution to another in a prescribed time while minimizing the effort to do so. It can be seen as the stochastic dynamical version of the optimal mass transport, and has growing applications in generative diffusion models and stochastic optimal control. In this work, we propose a regularized variant of the Schrödinger bridge with a quadratic state cost-to-go that incentivizes the optimal sample paths to stay close to a nominal level. Unlike the conventional Schrödinger bridge, the regularization induces a state-dependent rate of killing and creation of probability mass, and its solution requires determining the Markov kernel of a reaction-diffusion partial differential equation. We derive this Markov kernel in closed form. Our solution recovers the heat kernel in the vanishing regularization (i.e., diffusion without reaction) limit, thereby recovering the solution of the conventional Schrödinger bridge. Our results enable the use of dynamic Sinkhorn recursion for computing the Schrödinger bridge with a quadratic state cost-to-go, which would otherwise be challenging to use in this setting. We deduce properties of the new kernel and explain its connections with certain exactly solvable models in quantum mechanics.
Submitted 16 June, 2024; v1 submitted 1 June, 2024;
originally announced June 2024.
-
Final Physics Design of Proton Improvement Plan-II At Fermilab
Authors:
Abhishek Pathak,
Arun Saini,
Eduard Pozdeyev
Abstract:
This paper presents the final physics design of the Proton Improvement Plan-II (PIP-II) at Fermilab, focusing on the linear accelerator (Linac) and its beam transfer line. We address the challenges in longitudinal and transverse lattice design, specifically targeting collective effects, parametric resonances, and space charge nonlinearities that impact beam stability and emittance control. The strategies implemented effectively mitigate space charge complexities, resulting in significant improvements in beam quality -- evidenced by reduced emittance growth, lower beam halo, decreased loss, and better energy spread management. This comprehensive study is pivotal for the PIP-II project's success, providing valuable insights and approaches for future accelerator designs, especially in managing nonlinearities and enhancing beam dynamics.
Submitted 31 May, 2024;
originally announced May 2024.
-
Solving partial differential equations with sampled neural networks
Authors:
Chinmay Datar,
Taniya Kapoor,
Abhishek Chandra,
Qing Sun,
Iryna Burak,
Erik Lien Bolager,
Anna Veselovska,
Massimo Fornasier,
Felix Dietrich
Abstract:
Approximation of solutions to partial differential equations (PDE) is an important problem in computational science and engineering. Using neural networks as an ansatz for the solution has proven a challenge in terms of training time and approximation accuracy. In this contribution, we discuss how sampling the hidden weights and biases of the ansatz network from data-agnostic and data-dependent probability distributions allows us to progress on both challenges. In most examples, the random sampling schemes outperform iterative, gradient-based optimization of physics-informed neural networks regarding training time and accuracy by several orders of magnitude. For time-dependent PDE, we construct neural basis functions only in the spatial domain and then solve the associated ordinary differential equation with classical methods from scientific computing over a long time horizon. This alleviates one of the greatest challenges for neural PDE solvers because it does not require us to parameterize the solution in time. For second-order elliptic PDE in Barron spaces, we prove the existence of sampled networks with $L^2$ convergence to the solution. We demonstrate our approach on several time-dependent and static PDEs. We also illustrate how sampled networks can effectively solve inverse problems in this setting. Benefits compared to common numerical schemes include spectral convergence and mesh-free construction of basis functions.
Submitted 31 May, 2024;
originally announced May 2024.
-
Autonomous programmable microscopic electronic lablets optimized with digital control
Authors:
Thomas Maeke,
John McCaskill,
Dominic Funke,
Pierre Mayr,
Abhishek Sharma,
Uwe Tangen,
Jürgen Oehm
Abstract:
Lablets are autonomous microscopic particles with programmable CMOS electronics that can control electrokinetic phenomena and electrochemical reactions in solution via actuator and sensor microelectrodes. In this paper, we describe the design and fabrication of optimized singulated lablets (CMOS3) with dimensions 140x140x50 micrometers carrying an integrated coplanar encapsulated supercapacitor as a rechargeable power supply. The lablets are designed to allow docking to one another or to a smart surface for interchange of energy, electronic information, and chemicals. The paper focusses on the digital and analog design of the lablets to allow significant programmable functionality in a microscopic footprint, including the control of autonomous actuation and sensing up to the level of being able to support a complete lablet self-reproduction life cycle, although experimentally this remains to be proven. The potential of lablets in autonomous sensing and control and for evolutionary experimentation is discussed.
Submitted 16 June, 2024; v1 submitted 30 May, 2024;
originally announced May 2024.
-
Hydrodynamics of a hard-core non-polar active lattice gas
Authors:
Ritwik Mukherjee,
Soumyabrata Saha,
Tridib Sadhu,
Abhishek Dhar,
Sanjib Sabhapandit
Abstract:
We present a fluctuating hydrodynamic description of a non-polar active lattice gas model with excluded volume interactions that exhibits motility-induced phase separation under appropriate conditions. In quasi-one dimension and higher, stability analysis of the noiseless hydrodynamics gives quantitative bounds on the phase boundary of the motility-induced phase separation in terms of spinodal and binodal. Inclusion of the multiplicative noise in the fluctuating hydrodynamics describes the exponentially decaying two-point correlations in the stationary-state homogeneous phase. Our hydrodynamic description and theoretical predictions based on it are in excellent agreement with our Monte-Carlo simulations and pseudo-spectral iteration of the hydrodynamics equations. Our construction of hydrodynamics for this model is not suitable in strictly one dimension with single-file constraints, and we argue that this breakdown is associated with micro-phase separation.
Submitted 30 May, 2024;
originally announced May 2024.
-
DFT study of structural, electronic and optical properties of 2D MgO monolayer under bi-axial mechanical strain
Authors:
Kamal Kumar,
Anjali Kumari,
Soni Mishra,
Ramesh Sharma,
Abhishek Kumar Mishra
Abstract:
The structural, electronic, and dielectric (optical) properties of graphene-like 2D MgO monolayer have been explored through first-principles calculations under bi-axial tensile and compressive mechanical strain within a range of -10% to +10%. Our findings revealed that the pristine MgO monolayer is an indirect band gap semiconducting material and the semiconducting nature of MgO monolayer remains consistent under both compressive and tensile mechanical strain. This nature of MgO is confirmed through partial density of states (PDOS) as well as electronic band structure. PDOS exhibits the contribution of different atomic orbitals in bond formation and nature of bond, while band structure provides insight into electron transitions between energy levels of valence and conduction bands. All optical parameters (dielectric function, reflectivity, energy loss, refractive index, extinction coefficient and absorption) are plotted over the energy range 0-15 eV. Within this energy interval, MgO possesses the highest value of the refractive index (2.13) at 3.12 eV energy. Also, a detailed analysis of changes in the geometrical structure of MgO monolayer is provided.
Submitted 30 May, 2024;
originally announced May 2024.
-
Stochastic Optimization Algorithms for Instrumental Variable Regression with Streaming Data
Authors:
Xuxing Chen,
Abhishek Roy,
Yifan Hu,
Krishnakumar Balasubramanian
Abstract:
We develop and analyze algorithms for instrumental variable regression by viewing the problem as a conditional stochastic optimization problem. In the context of least-squares instrumental variable regression, our algorithms require neither matrix inversions nor mini-batches and provide a fully online approach for performing instrumental variable regression with streaming data. When the true model is linear, we derive rates of convergence in expectation that are of order $\mathcal{O}(\log T/T)$ and $\mathcal{O}(1/T^{1-\iota})$ for any $\iota>0$ under the availability of two-sample and one-sample oracles, respectively, where $T$ is the number of iterations. Importantly, under the availability of the two-sample oracle, our procedure avoids explicitly modeling and estimating the relationship between the confounder and the instrumental variables, demonstrating the benefit of the proposed approach over recent works based on reformulating the problem as minimax optimization problems. Numerical experiments are provided to corroborate the theoretical results.
Submitted 29 May, 2024;
originally announced May 2024.
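The idea of a fully online update under a two-sample oracle can be sketched on a toy scalar problem. This is an illustrative reading of the abstract, not the authors' algorithm: the data-generating model, the step-size schedule, and the names `simulate`, `theta_two` and `theta_naive` are all assumptions. The two-sample oracle is taken to mean access to a second draw of the regressor that is independent of the first given the same instrument.

```python
import random


def simulate(theta_star: float = 2.0, steps: int = 20000, seed: int = 0):
    """Toy streaming IV regression with a scalar instrument z and confounder u.

    Assumed model (hypothetical):
        x = z + 0.5 * u + small noise
        y = theta_star * x + u + small noise
    Returns (two-sample estimate, naive least-squares SGD estimate).
    """
    rng = random.Random(seed)
    theta_two = 0.0    # update using an independent second draw of x given z
    theta_naive = 0.0  # plain least-squares SGD that ignores the confounder
    for t in range(steps):
        step = 1.0 / (t + 50)  # decaying step size, an assumed schedule
        z = rng.gauss(0.0, 1.0)
        u = rng.gauss(0.0, 1.0)
        x = z + 0.5 * u + 0.1 * rng.gauss(0.0, 1.0)
        y = theta_star * x + u + 0.1 * rng.gauss(0.0, 1.0)
        # Second regressor sample for the same z, with a fresh confounder draw.
        x2 = z + 0.5 * rng.gauss(0.0, 1.0) + 0.1 * rng.gauss(0.0, 1.0)
        # Conditioned on z, x2 * (x * theta - y) has mean z^2 * (theta - theta_star),
        # so this step is an unbiased stochastic gradient step toward theta_star.
        theta_two -= step * x2 * (x * theta_two - y)
        # Reusing x in both factors picks up the x-u correlation and is biased.
        theta_naive -= step * x * (x * theta_naive - y)
    return theta_two, theta_naive


theta_two, theta_naive = simulate()
print(f"two-sample estimate: {theta_two:.3f}")
print(f"naive estimate:      {theta_naive:.3f}")
```

Each iteration uses a single streamed observation and no matrix inversion, which is the property the abstract emphasizes; the naive update is included only to show the confounding bias that the second sample removes in this toy model.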
-
Data Efficient Behavior Cloning for Fine Manipulation via Continuity-based Corrective Labels
Authors:
Abhay Deshpande,
Liyiming Ke,
Quinn Pfeifer,
Abhishek Gupta,
Siddhartha S. Srinivasa
Abstract:
We consider imitation learning with access only to expert demonstrations, whose real-world application is often limited by covariate shift due to compounding errors during execution. We investigate the effectiveness of the Continuity-based Corrective Labels for Imitation Learning (CCIL) framework in mitigating this issue for real-world fine manipulation tasks. CCIL generates corrective labels by learning a locally continuous dynamics model from demonstrations to guide the agent back toward expert states. Through extensive experiments on peg insertion and fine grasping, we provide the first empirical validation that CCIL can significantly improve imitation learning performance despite discontinuities present in contact-rich manipulation. We find that: (1) real-world manipulation exhibits sufficient local smoothness to apply CCIL, (2) generated corrective labels are most beneficial in low-data regimes, and (3) label filtering based on estimated dynamics model error enables performance gains. To effectively apply CCIL to robotic domains, we offer a practical instantiation of the framework and insights into design choices and hyperparameter selection. Our work demonstrates CCIL's practicality for alleviating compounding errors in imitation learning on physical robots.
Submitted 3 June, 2024; v1 submitted 29 May, 2024;
originally announced May 2024.
-
Exploring waveforms with non-GR deviations for extreme mass-ratio inspirals
Authors:
Shailesh Kumar,
Rishabh Kumar Singh,
Abhishek Chowdhuri,
Arpan Bhattacharyya
Abstract:
The fundamental process of detecting and examining the polarization modes of gravitational waves plays a pivotal role in enhancing our grasp on the precise mechanisms behind their generation. A thorough investigation is essential for delving deeper into the essence of gravitational waves and rigorously evaluating and validating the range of modified gravity theories. In this line of interest, a general description of black holes in theories beyond general relativity can serve a meaningful purpose where distinct deviation parameters can be mapped to solutions representing distinct theories. Employing a refined version of the deformed Kerr geometry, which is free from pathological behaviours such as unphysical divergences in the metric, we explore an extreme mass-ratio inspiral system, wherein a stellar-mass object perturbs a supermassive black hole. We compute the effects of deformation parameters on gravitational wave fluxes, orbital evolution and phase dynamics with leading order post-Newtonian corrections. With the waveform analysis, we assess the plausibility of detecting deviations from general relativity through observations facilitated by the Laser Interferometer Space Antenna (LISA), simultaneously constraining the extent of these deviations. Therefore, this analysis provides an understanding while highlighting the essential role of observations in advancing gravitational phenomena beyond general relativity.
Submitted 28 May, 2024;
originally announced May 2024.