Search | arXiv e-print repository

Dissipative chaos and steady state of open Tavis-Cummings dimer

Authors: Debabrata Mondal, Andrey Kolovsky, S. Sinha

Abstract: We consider a coupled atom-photon system described by the Tavis-Cummings dimer (two coupled cavities) in the presence of photon loss and atomic pum**, to investigate the quantum signature of dissipative chaos. The appropriate classical limit of the model allows us to obtain a phase diagram identifying different dynamical phases, especially the onset of chaos. Both classically and quantum mechani… ▽ More We consider a coupled atom-photon system described by the Tavis-Cummings dimer (two coupled cavities) in the presence of photon loss and atomic pum**, to investigate the quantum signature of dissipative chaos. The appropriate classical limit of the model allows us to obtain a phase diagram identifying different dynamical phases, especially the onset of chaos. Both classically and quantum mechanically, we demonstrate the emergence of a steady state in the chaotic regime and analyze its properties. The interplay between quantum fluctuation and chaos leads to enhanced mixing dynamics and dephasing, resulting in the formation of an incoherent photonic fluid. The steady state exhibits an intriguing phenomenon of subsystem thermalization even outside the chaotic regime; however, its effective temperature increases with the degree of chaos. Moreover, the statistical properties of the steady state show a close connection with the random matrix theory. Finally, we discuss the experimental relevance of our findings, which can be tested in cavity and circuit quantum electrodynamics setups. △ Less

Submitted 30 June, 2024; originally announced July 2024.

arXiv:2406.13302 [pdf, other]

Situational Instructions Database: Task Guidance in Dynamic Environments

Authors: Muhammad Saif Ullah Khan, Sankalp Sinha, Didier Stricker, Muhammad Zeshan Afzal

Abstract: The Situational Instructions Database (SID) addresses the need for enhanced situational awareness in artificial intelligence (AI) systems operating in dynamic environments. By integrating detailed scene graphs with dynamically generated, task-specific instructions, SID provides a novel dataset that allows AI systems to perform complex, real-world tasks with improved context sensitivity and operati… ▽ More The Situational Instructions Database (SID) addresses the need for enhanced situational awareness in artificial intelligence (AI) systems operating in dynamic environments. By integrating detailed scene graphs with dynamically generated, task-specific instructions, SID provides a novel dataset that allows AI systems to perform complex, real-world tasks with improved context sensitivity and operational accuracy. This dataset leverages advanced generative models to simulate a variety of realistic scenarios based on the 3D Semantic Scene Graphs (3DSSG) dataset, enriching it with scenario-specific information that details environmental interactions and tasks. SID facilitates the development of AI applications that can adapt to new and evolving conditions without extensive retraining, supporting research in autonomous technology and AI-driven decision-making processes. This dataset is instrumental in develo** robust, context-aware AI agents capable of effectively navigating and responding to unpredictable settings. Available for research and development, SID serves as a critical resource for advancing the capabilities of intelligent systems in complex environments. Dataset available at \url{https://github.com/mindgarage/situational-instructions-database}. △ Less

Submitted 19 June, 2024; originally announced June 2024.

Comments: 9 pages, 6 figures

arXiv:2406.12191 [pdf, ps, other]

Quantum $K$-invariants via Quot schemes I

Authors: Shubham Sinha, Ming Zhang

Abstract: We study the virtual Euler characteristics of sheaves over Quot schemes of curves, establishing that these invariants fit into a topological quantum field theory (TQFT) valued in $\mathbb{Z}[[q]]$. Utilizing Quot scheme compactifications alongside the TQFT framework, we derive presentations of the small quantum $K$-ring of the Grassmannian. Our approach offers a new method for finding explicit for… ▽ More We study the virtual Euler characteristics of sheaves over Quot schemes of curves, establishing that these invariants fit into a topological quantum field theory (TQFT) valued in $\mathbb{Z}[[q]]$. Utilizing Quot scheme compactifications alongside the TQFT framework, we derive presentations of the small quantum $K$-ring of the Grassmannian. Our approach offers a new method for finding explicit formulas for quantum $K$-invariants. △ Less

Submitted 17 June, 2024; originally announced June 2024.

Comments: 44 pages

MSC Class: 14N35; 14N10; 19E08; 14M15

arXiv:2406.10764 [pdf, other]

GNOME: Generating Negotiations through Open-Domain Map** of Exchanges

Authors: Darshan Deshpande, Shambhavi Sinha, Anirudh Ravi Kumar, Debaditya Pal, Jonathan May

Abstract: Language Models have previously shown strong negotiation capabilities in closed domains where the negotiation strategy prediction scope is constrained to a specific setup. In this paper, we first show that these models are not generalizable beyond their original training domain despite their wide-scale pretraining. Following this, we propose an automated framework called GNOME, which processes exi… ▽ More Language Models have previously shown strong negotiation capabilities in closed domains where the negotiation strategy prediction scope is constrained to a specific setup. In this paper, we first show that these models are not generalizable beyond their original training domain despite their wide-scale pretraining. Following this, we propose an automated framework called GNOME, which processes existing human-annotated, closed-domain datasets using Large Language Models and produces synthetic open-domain dialogues for negotiation. GNOME improves the generalizability of negotiation systems while reducing the expensive and subjective task of manual data curation. Through our experimental setup, we create a benchmark comparing encoder and decoder models trained on existing datasets against datasets created through GNOME. Our results show that models trained on our dataset not only perform better than previous state of the art models on domain specific strategy prediction, but also generalize better to previously unseen domains. △ Less

Submitted 15 June, 2024; originally announced June 2024.

arXiv:2406.10247 [pdf, other]

QCQA: Quality and Capacity-aware grouped Query Attention

Authors: Vinay Joshi, Prashant Laddha, Shambhavi Sinha, Om Ji Omer, Sreenivas Subramoney

Abstract: Excessive memory requirements of key and value features (KV-cache) present significant challenges in the autoregressive inference of large language models (LLMs), restricting both the speed and length of text generation. Approaches such as Multi-Query Attention (MQA) and Grouped Query Attention (GQA) mitigate these challenges by grou** query heads and consequently reducing the number of correspo… ▽ More Excessive memory requirements of key and value features (KV-cache) present significant challenges in the autoregressive inference of large language models (LLMs), restricting both the speed and length of text generation. Approaches such as Multi-Query Attention (MQA) and Grouped Query Attention (GQA) mitigate these challenges by grou** query heads and consequently reducing the number of corresponding key and value heads. However, MQA and GQA decrease the KV-cache size requirements at the expense of LLM accuracy (quality of text generation). These methods do not ensure an optimal tradeoff between KV-cache size and text generation quality due to the absence of quality-aware grou** of query heads. To address this issue, we propose Quality and Capacity-Aware Grouped Query Attention (QCQA), which identifies optimal query head grou**s using an evolutionary algorithm with a computationally efficient and inexpensive fitness function. We demonstrate that QCQA achieves a significantly better tradeoff between KV-cache capacity and LLM accuracy compared to GQA. For the Llama2 $7\,$B model, QCQA achieves $\mathbf{20}$\% higher accuracy than GQA with similar KV-cache size requirements in the absence of fine-tuning. After fine-tuning both QCQA and GQA, for a similar KV-cache size, QCQA provides $\mathbf{10.55}\,$\% higher accuracy than GQA. Furthermore, QCQA requires $40\,$\% less KV-cache size than GQA to attain similar accuracy. The proposed quality and capacity-aware grou** of query heads can serve as a new paradigm for KV-cache optimization in autoregressive LLM inference. △ Less

Submitted 8 June, 2024; originally announced June 2024.

arXiv:2406.08787 [pdf, ps, other]

A Survey on Compositional Learning of AI Models: Theoretical and Experimetnal Practices

Authors: Sania Sinha, Tanawan Premsri, Parisa Kordjamshidi

Abstract: Compositional learning, mastering the ability to combine basic concepts and construct more intricate ones, is crucial for human cognition, especially in human language comprehension and visual perception. This notion is tightly connected to generalization over unobserved situations. Despite its integral role in intelligence, there is a lack of systematic theoretical and experimental research metho… ▽ More Compositional learning, mastering the ability to combine basic concepts and construct more intricate ones, is crucial for human cognition, especially in human language comprehension and visual perception. This notion is tightly connected to generalization over unobserved situations. Despite its integral role in intelligence, there is a lack of systematic theoretical and experimental research methodologies, making it difficult to analyze the compositional learning abilities of computational models. In this paper, we survey the literature on compositional learning of AI models and the connections made to cognitive studies. We identify abstract concepts of compositionality in cognitive and linguistic studies and connect these to the computational challenges faced by language and vision models in compositional reasoning. We overview the formal definitions, tasks, evaluation benchmarks, variety of computational models, and theoretical findings. We cover modern studies on large language models to provide a deeper understanding of the cutting-edge compositional capabilities exhibited by state-of-the-art AI models and pinpoint important directions for future research. △ Less

Submitted 12 June, 2024; originally announced June 2024.

arXiv:2406.02521 [pdf, other]

Superconducting magic-angle twisted trilayer graphene hosts competing magnetic order and moiré inhomogeneities

Authors: Ayshi Mukherjee, Surat Layek, Subhajit Sinha, Ritajit Kundu, Alisha H. Marchawala, Mahesh Hingankar, Joydip Sarkar, L. D. Varma Sangani, Heena Agarwal, Sanat Ghosh, Aya Batoul Tazi, Kenji Watanabe, Takashi Taniguchi, Abhay N. Pasupathy, Arijit Kundu, Mandar M. Deshmukh

Abstract: The microscopic mechanism of superconductivity in the magic-angle twisted graphene family, including magic-angle twisted trilayer graphene (MATTG), is poorly understood. Properties of MATTG, like Pauli limit violation, suggest unconventional superconductivity. Theoretical studies propose proximal magnetic states in the phase diagram, but direct experimental evidence is lacking. We show direct evid… ▽ More The microscopic mechanism of superconductivity in the magic-angle twisted graphene family, including magic-angle twisted trilayer graphene (MATTG), is poorly understood. Properties of MATTG, like Pauli limit violation, suggest unconventional superconductivity. Theoretical studies propose proximal magnetic states in the phase diagram, but direct experimental evidence is lacking. We show direct evidence for an in-plane magnetic order proximal to the superconducting state using two complementary electrical transport measurements. First, we probe the superconducting phase by using statistically significant switching events from superconducting to the dissipative state of MATTG. The system behaves like a network of Josephson junctions due to lattice relaxation-induced moiré inhomogeneity in the system. We observe non-monotonic and hysteretic responses in the switching distributions as a function of temperature and in-plane magnetic field. Second, in normal regions doped slightly away from the superconducting regime, we observe hysteresis in magnetoresistance with an in-plane magnetic field; showing evidence for in-plane magnetic order that vanishes $\sim$900 mK. Additionally, we show a broadened Berezinskii-Kosterlitz-Thouless transition due to relaxation-induced moiré inhomogeneity. We find superfluid stiffness $J_{\mathrm{s}}$$\sim$0.15 K with strong temperature dependence. Theoretically, the magnetic and superconducting order arising from the magnetic order's fluctuations have been proposed - we show direct evidence for both. Our observation that the hysteretic magnetoresistance is sensitive to the in-plane field may constrain possible intervalley-coherent magnetic orders and the resulting superconductivity that arises from its fluctuations. △ Less

Submitted 4 June, 2024; originally announced June 2024.

Comments: 22 pages, 8 figures, and supplementary information

arXiv:2405.19653 [pdf, other]

SysCaps: Language Interfaces for Simulation Surrogates of Complex Systems

Authors: Patrick Emami, Zhaonan Li, Saumya Sinha, Truc Nguyen

Abstract: Data-driven simulation surrogates help computational scientists study complex systems. They can also help inform impactful policy decisions. We introduce a learning framework for surrogate modeling where language is used to interface with the underlying system being simulated. We call a language description of a system a "system caption", or SysCap. To address the lack of datasets of paired natura… ▽ More Data-driven simulation surrogates help computational scientists study complex systems. They can also help inform impactful policy decisions. We introduce a learning framework for surrogate modeling where language is used to interface with the underlying system being simulated. We call a language description of a system a "system caption", or SysCap. To address the lack of datasets of paired natural language SysCaps and simulation runs, we use large language models (LLMs) to synthesize high-quality captions. Using our framework, we train multimodal text and timeseries regression models for two real-world simulators of complex energy systems. Our experiments demonstrate the feasibility of designing language interfaces for real-world surrogate models at comparable accuracy to standard baselines. We qualitatively and quantitatively show that SysCaps unlock text-prompt-style surrogate modeling and new generalization abilities beyond what was previously possible. We will release the generated SysCaps datasets and our code to support follow-on studies. △ Less

Submitted 29 May, 2024; originally announced May 2024.

Comments: 17 pages. Under review

arXiv:2405.13809 [pdf, other]

Self-trap** phenomenon, multistability and chaos in open anisotropic Dicke dimer

Authors: G. Vivek, Debabrata Mondal, Subhadeep Chakraborty, S. Sinha

Abstract: We investigate semiclassical dynamics of coupled atom-photon interacting system described by a dimer of anisotropic Dicke model in the presence of photon loss, exhibiting a rich variety of non-linear dynamics. Based on symmetries and dynamical classification, we characterize and chart out various dynamical phases in a phase diagram. A key feature of this system is the multistability of different d… ▽ More We investigate semiclassical dynamics of coupled atom-photon interacting system described by a dimer of anisotropic Dicke model in the presence of photon loss, exhibiting a rich variety of non-linear dynamics. Based on symmetries and dynamical classification, we characterize and chart out various dynamical phases in a phase diagram. A key feature of this system is the multistability of different dynamical states, particularly the coexistence of various superradiant phases as well as limit cycles. Remarkably, this dimer system manifests self-trap** phenomena, resulting in a photon population imbalance between the cavities. Such a self-trapped state arises from saddle-node bifurcation, which can be understood from an equivalent Landau-Ginzburg description. Additionally, we identify a unique class of oscillatory dynamics self-trapped limit cycle hosting self-trap** of photons. The absence of stable dynamical phases leads to the onset of chaos, which is diagnosed using the saturation value of the decorrelator dynamics. Moreover, in a narrow region, the self-trapped states can coexist with chaotic attractor, which may have intriguing consequences in quantum dynamics. Finally, we discuss the experimental relevance of our findings, which can be tested in cavity and circuit quantum electrodynamics setups. △ Less

Submitted 22 May, 2024; originally announced May 2024.

arXiv:2405.11446 [pdf, other]

MAML-en-LLM: Model Agnostic Meta-Training of LLMs for Improved In-Context Learning

Authors: Sanchit Sinha, Yuguang Yue, Victor Soto, Mayank Kulkarni, Jianhua Lu, Aidong Zhang

Abstract: Adapting large language models (LLMs) to unseen tasks with in-context training samples without fine-tuning remains an important research problem. To learn a robust LLM that adapts well to unseen tasks, multiple meta-training approaches have been proposed such as MetaICL and MetaICT, which involve meta-training pre-trained LLMs on a wide variety of diverse tasks. These meta-training approaches esse… ▽ More Adapting large language models (LLMs) to unseen tasks with in-context training samples without fine-tuning remains an important research problem. To learn a robust LLM that adapts well to unseen tasks, multiple meta-training approaches have been proposed such as MetaICL and MetaICT, which involve meta-training pre-trained LLMs on a wide variety of diverse tasks. These meta-training approaches essentially perform in-context multi-task fine-tuning and evaluate on a disjointed test set of tasks. Even though they achieve impressive performance, their goal is never to compute a truly general set of parameters. In this paper, we propose MAML-en-LLM, a novel method for meta-training LLMs, which can learn truly generalizable parameters that not only perform well on disjointed tasks but also adapts to unseen tasks. We see an average increase of 2% on unseen domains in the performance while a massive 4% improvement on adaptation performance. Furthermore, we demonstrate that MAML-en-LLM outperforms baselines in settings with limited amount of training data on both seen and unseen domains by an average of 2%. Finally, we discuss the effects of type of tasks, optimizers and task complexity, an avenue barely explored in meta-training literature. Exhaustive experiments across 7 task settings along with two data settings demonstrate that models trained with MAML-en-LLM outperform SOTA meta-training approaches. △ Less

Submitted 19 May, 2024; originally announced May 2024.

Comments: KDD 2024, 11 pages(9 main, 2 ref, 1 App) Openreview https://openreview.net/forum?id=JwecLNhWDy&referrer=%5BAuthor%20Console%5D(%2Fgroup%3Fid%3DKDD.org%2F2024%2FResearch_Track%2FAuthors%23your-submissions)

arXiv:2405.08959 [pdf, other]

doi 10.1038/s41578-024-00671-4

Tunable moiré materials for probing Berry physics and topology

Authors: Pratap Chandra Adak, Subhajit Sinha, Amit Agarwal, Mandar M. Deshmukh

Abstract: Berry curvature physics and quantum geometric effects have been instrumental in advancing topological condensed matter physics in recent decades. Although Landau level-based flat bands and conventional 3D solids have been pivotal in exploring rich topological phenomena, they are constrained by their limited ability to undergo dynamic tuning. In stark contrast, moiré systems have risen as a versati… ▽ More Berry curvature physics and quantum geometric effects have been instrumental in advancing topological condensed matter physics in recent decades. Although Landau level-based flat bands and conventional 3D solids have been pivotal in exploring rich topological phenomena, they are constrained by their limited ability to undergo dynamic tuning. In stark contrast, moiré systems have risen as a versatile platform for engineering bands and manipulating the distribution of Berry curvature in momentum space. These moiré systems not only harbor tunable topological bands, modifiable through a plethora of parameters, but also provide unprecedented access to large length scales and low energy scales. Furthermore, they offer unique opportunities stemming from the symmetry-breaking mechanisms and electron correlations associated with the underlying flat bands that are beyond the reach of conventional crystalline solids. A diverse array of tools, encompassing quantum electron transport in both linear and non-linear response regimes and optical excitation techniques, provide direct avenues for investigating Berry physics. This review navigates the evolving landscape of tunable moiré materials, highlighting recent experimental breakthroughs in the field of topological physics. Additionally, we delineate several challenges and offer insights into promising avenues for future research. △ Less

Submitted 14 May, 2024; originally announced May 2024.

Comments: This is a version submitted to Nature Reviews Materials. The document contains 8 figures and 32 pages

Journal ref: Nature Reviews Materials (2024)

arXiv:2405.04739 [pdf]

Pressure induced metallization and loss of surface magnetism in FeSi

Authors: Yuhang Deng, Farhad Taraporevala, Haozhe Wang, Eric Lee-Wong, Camilla M. Moir, **hyuk Lim, Shubham Sinha, Weiwei Xie, James Hamlin, Yogesh Vohra, M. Brian Maple

Abstract: Single crystalline FeSi samples with a conducting surface state (CSS) were studied under high pressure ($\textit{P}$) and magnetic field ($\textit{B}$) by means of electrical resistance ($\textit{R}$) measurements to explore how the bulk semiconducting state and the surface state are tuned by the application of pressure. We found that the energy gap ($Δ$) associated with the semiconducting bulk ph… ▽ More Single crystalline FeSi samples with a conducting surface state (CSS) were studied under high pressure ($\textit{P}$) and magnetic field ($\textit{B}$) by means of electrical resistance ($\textit{R}$) measurements to explore how the bulk semiconducting state and the surface state are tuned by the application of pressure. We found that the energy gap ($Δ$) associated with the semiconducting bulk phase begins to close abruptly at a critical pressure ($P_{cr}$) of ~10 GPa and the bulk material becomes metallic with no obvious sign of any emergent phases or non-Fermi liquid behavior in $\textit{R}$($\textit{T}$) in the neighborhood of $P_{cr}$ above 3 K. Moreover, the metallic phase appears to remain at near-ambient pressure upon release of the pressure. Interestingly, the hysteresis in the $\textit{R}$($\textit{T}$) curve associated with the magnetically ordered CSS decreases with pressure and vanishes at $P_{cr}$, while the slope of the $\textit{R}$($\textit{B}$) curve, d$\textit{R}$/d$\textit{B}$, which has a negative value for $\textit{P}$ < $P_{cr}$, decreases in magnitude with $\textit{P}$ and changes sign at $P_{cr}$. Thus, the CSS and the corresponding two-dimensional magnetic order collapse at $P_{cr}$ where the energy gap $Δ$ of the bulk material starts to close abruptly, revealing the connection between the CSS and the semiconducting bulk state in FeSi. △ Less

Submitted 7 May, 2024; originally announced May 2024.

arXiv:2405.03660 [pdf, other]

CICA: Content-Injected Contrastive Alignment for Zero-Shot Document Image Classification

Authors: Sankalp Sinha, Muhammad Saif Ullah Khan, Talha Uddin Sheikh, Didier Stricker, Muhammad Zeshan Afzal

Abstract: Zero-shot learning has been extensively investigated in the broader field of visual recognition, attracting significant interest recently. However, the current work on zero-shot learning in document image classification remains scarce. The existing studies either focus exclusively on zero-shot inference, or their evaluation does not align with the established criteria of zero-shot evaluation in th… ▽ More Zero-shot learning has been extensively investigated in the broader field of visual recognition, attracting significant interest recently. However, the current work on zero-shot learning in document image classification remains scarce. The existing studies either focus exclusively on zero-shot inference, or their evaluation does not align with the established criteria of zero-shot evaluation in the visual recognition domain. We provide a comprehensive document image classification analysis in Zero-Shot Learning (ZSL) and Generalized Zero-Shot Learning (GZSL) settings to address this gap. Our methodology and evaluation align with the established practices of this domain. Additionally, we propose zero-shot splits for the RVL-CDIP dataset. Furthermore, we introduce CICA (pronounced 'ki-ka'), a framework that enhances the zero-shot learning capabilities of CLIP. CICA consists of a novel 'content module' designed to leverage any generic document-related textual information. The discriminative features extracted by this module are aligned with CLIP's text and image features using a novel 'coupled-contrastive' loss. Our module improves CLIP's ZSL top-1 accuracy by 6.7% and GZSL harmonic mean by 24% on the RVL-CDIP dataset. Our module is lightweight and adds only 3.3% more parameters to CLIP. Our work sets the direction for future research in zero-shot document classification. △ Less

Submitted 6 May, 2024; originally announced May 2024.

Comments: 18 Pages, 4 Figures and Accepted in ICDAR 2024

arXiv:2405.01960 [pdf, other]

Proliferation-driven mechanical feedback regulates cell dynamics in growing tissues

Authors: Sumit Sinha, Xin Li, Abdul N Malmi-Kakkada, D. Thirumalai

Abstract: Local stresses in a tissue, a collective property, regulate cell division and apoptosis. In turn, cell growth and division induce active stresses in the tissue. As a consequence, there is a feedback between cell growth and local stresses. However, how the cell dynamics depend on local stress-dependent cell division and the feedback strength is not fully understood. Here, we probe the consequences… ▽ More Local stresses in a tissue, a collective property, regulate cell division and apoptosis. In turn, cell growth and division induce active stresses in the tissue. As a consequence, there is a feedback between cell growth and local stresses. However, how the cell dynamics depend on local stress-dependent cell division and the feedback strength is not fully understood. Here, we probe the consequences of stress-mediated growth and cell division on cell dynamics using agent-based simulations of a two-dimensional growing tissue. We discover a rich dynamical behavior of individual cells, ranging from jamming (mean square displacement, $Δ(t) \sim t^α$ with $α$ less than unity), to hyperdiffusion ($α> 2$) depending on cell division rate and the strength of the mechanical feedback. Strikingly, $Δ(t)$ is determined by the tissue growth law, which quantifies cell proliferation (number of cells $N(t)$ as a function of time). The growth law ($N(t) \sim t^λ$ at long times) is regulated by the critical pressure that controls the strength of the mechanical feedback and the ratio between cell division-apoptosis rates. We show that $λ\sim α$, which implies that higher growth rate leads to a greater degree of cell migration. The variations in cell motility are linked to the emergence of highly persistent forces extending over several cell cycle times. Our predictions are testable using cell-tracking imaging techniques. △ Less

Submitted 3 May, 2024; originally announced May 2024.

Comments: 5 figures. arXiv admin note: text overlap with arXiv:2202.04806

arXiv:2405.00349 [pdf, other]

A Self-explaining Neural Architecture for Generalizable Concept Learning

Authors: Sanchit Sinha, Guangzhi Xiong, Aidong Zhang

Abstract: With the wide proliferation of Deep Neural Networks in high-stake applications, there is a growing demand for explainability behind their decision-making process. Concept learning models attempt to learn high-level 'concepts' - abstract entities that align with human understanding, and thus provide interpretability to DNN architectures. However, in this paper, we demonstrate that present SOTA conc… ▽ More With the wide proliferation of Deep Neural Networks in high-stake applications, there is a growing demand for explainability behind their decision-making process. Concept learning models attempt to learn high-level 'concepts' - abstract entities that align with human understanding, and thus provide interpretability to DNN architectures. However, in this paper, we demonstrate that present SOTA concept learning approaches suffer from two major problems - lack of concept fidelity wherein the models fail to learn consistent concepts among similar classes and limited concept interoperability wherein the models fail to generalize learned concepts to new domains for the same task. Kee** these in mind, we propose a novel self-explaining architecture for concept learning across domains which - i) incorporates a new concept saliency network for representative concept selection, ii) utilizes contrastive learning to capture representative domain invariant concepts, and iii) uses a novel prototype-based concept grounding regularization to improve concept alignment across domains. We demonstrate the efficacy of our proposed approach over current SOTA concept learning approaches on four widely used real-world datasets. Empirical results show that our method improves both concept fidelity measured through concept overlap and concept interoperability measured through domain adaptation performance. △ Less

Submitted 5 May, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

Comments: IJCAI 2024. 16 pages (7 main content, 2 references, 7 Appendix) Code available at https://github.com/sanchit97/secl

arXiv:2404.13883 [pdf, other]

Decoherence of a charged Brownian particle in a magnetic field : an analysis of the roles of coupling via position and momentum variables

Authors: Suraka Bhattacharjee, Koushik Mandal, Supurna Sinha

Abstract: The study of decoherence plays a key role in our understanding of the transition from the quantum to the classical world. Typically, one considers a system coupled to an external bath which forms a model for an open quantum system. While most of the studies pertain to a position coupling between the system and the environment, some involve a momentum coupling, giving rise to an anomalous diffusive… ▽ More The study of decoherence plays a key role in our understanding of the transition from the quantum to the classical world. Typically, one considers a system coupled to an external bath which forms a model for an open quantum system. While most of the studies pertain to a position coupling between the system and the environment, some involve a momentum coupling, giving rise to an anomalous diffusive model. Here we have gone beyond existing studies and analysed the quantum Langevin dynamics of a harmonically oscillating charged Brownian particle in the presence of a magnetic field and coupled to an Ohmic heat bath via both position and momentum couplings. The presence of both position and momentum couplings leads to a stronger interaction with the environment, resulting in a faster loss of coherence compared to a situation where only position coupling is present. The rate of decoherence can be tuned by controlling the relative strengths of the position and momentum coupling parameters. In addition, the magnetic field results in the slowing down of the loss of information from the system, irrespective of the nature of coupling between the system and the bath. Our results can be experimentally verified by designing a suitable ion trap setup. △ Less

Submitted 22 April, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

Comments: 15 pages, 3 figures. Expert comments on our manuscript are always welcome

arXiv:2404.06405 [pdf, other]

Wu's Method can Boost Symbolic AI to Rival Silver Medalists and AlphaGeometry to Outperform Gold Medalists at IMO Geometry

Authors: Shiven Sinha, Ameya Prabhu, Ponnurangam Kumaraguru, Siddharth Bhat, Matthias Bethge

Abstract: Proving geometric theorems constitutes a hallmark of visual reasoning combining both intuitive and logical skills. Therefore, automated theorem proving of Olympiad-level geometry problems is considered a notable milestone in human-level automated reasoning. The introduction of AlphaGeometry, a neuro-symbolic model trained with 100 million synthetic samples, marked a major breakthrough. It solved 2… ▽ More Proving geometric theorems constitutes a hallmark of visual reasoning combining both intuitive and logical skills. Therefore, automated theorem proving of Olympiad-level geometry problems is considered a notable milestone in human-level automated reasoning. The introduction of AlphaGeometry, a neuro-symbolic model trained with 100 million synthetic samples, marked a major breakthrough. It solved 25 of 30 International Mathematical Olympiad (IMO) problems whereas the reported baseline based on Wu's method solved only ten. In this note, we revisit the IMO-AG-30 Challenge introduced with AlphaGeometry, and find that Wu's method is surprisingly strong. Wu's method alone can solve 15 problems, and some of them are not solved by any of the other methods. This leads to two key findings: (i) Combining Wu's method with the classic synthetic methods of deductive databases and angle, ratio, and distance chasing solves 21 out of 30 methods by just using a CPU-only laptop with a time limit of 5 minutes per problem. Essentially, this classic method solves just 4 problems less than AlphaGeometry and establishes the first fully symbolic baseline strong enough to rival the performance of an IMO silver medalist. (ii) Wu's method even solves 2 of the 5 problems that AlphaGeometry failed to solve. Thus, by combining AlphaGeometry with Wu's method we set a new state-of-the-art for automated theorem proving on IMO-AG-30, solving 27 out of 30 problems, the first AI method which outperforms an IMO gold medalist. △ Less

Submitted 11 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

Comments: Work in Progress. Released for wider feedback

arXiv:2403.18074 [pdf, other]

Every Shot Counts: Using Exemplars for Repetition Counting in Videos

Authors: Saptarshi Sinha, Alexandros Stergiou, Dima Damen

Abstract: Video repetition counting infers the number of repetitions of recurring actions or motion within a video. We propose an exemplar-based approach that discovers visual correspondence of video exemplars across repetitions within target videos. Our proposed Every Shot Counts (ESCounts) model is an attention-based encoder-decoder that encodes videos of varying lengths alongside exemplars from the same… ▽ More Video repetition counting infers the number of repetitions of recurring actions or motion within a video. We propose an exemplar-based approach that discovers visual correspondence of video exemplars across repetitions within target videos. Our proposed Every Shot Counts (ESCounts) model is an attention-based encoder-decoder that encodes videos of varying lengths alongside exemplars from the same and different videos. In training, ESCounts regresses locations of high correspondence to the exemplars within the video. In tandem, our method learns a latent that encodes representations of general repetitive motions, which we use for exemplar-free, zero-shot inference. Extensive experiments over commonly used datasets (RepCount, Countix, and UCFRep) showcase ESCounts obtaining state-of-the-art performance across all three datasets. On RepCount, ESCounts increases the off-by-one from 0.39 to 0.56 and decreases the mean absolute error from 0.38 to 0.21. Detailed ablations further demonstrate the effectiveness of our method. △ Less

Submitted 26 March, 2024; originally announced March 2024.

Comments: Project website: https://sinhasaptarshi.github.io/escounts

arXiv:2403.17707 [pdf, other]

Effect of light-assisted tunable interaction on the position response function of cold atoms

Authors: Anirban Misra, Urbashi Satpathi, Supurna Sinha, Sanjukta Roy, Saptarishi Chaudhuri

Abstract: The position response of a particle subjected to a perturbation is of general interest in physics. We study the modification of the position response function of an ensemble of cold atoms in a magneto-optical trap in the presence of tunable light-assisted interactions. We subject the cold atoms to an intense laser light tuned near the photoassociation resonance and observe the position response of… ▽ More The position response of a particle subjected to a perturbation is of general interest in physics. We study the modification of the position response function of an ensemble of cold atoms in a magneto-optical trap in the presence of tunable light-assisted interactions. We subject the cold atoms to an intense laser light tuned near the photoassociation resonance and observe the position response of the atoms subjected to a sudden displacement. Surprisingly, we observe that the entire cold atomic cloud undergoes collective oscillations. We use a generalised quantum Langevin approach to theoretically analyse the results of the experiments and find good agreement. △ Less

Submitted 26 March, 2024; originally announced March 2024.

Comments: 20 pages, 7 Figures

arXiv:2403.15869 [pdf, other]

Evolution beats random chance: Performance-dependent network evolution for enhanced computational capacity

Authors: Manish Yadav, Sudeshna Sinha, Merten Stender

Abstract: The quest to understand structure-function relationships in networks across scientific disciplines has intensified. However, the optimal network architecture remains elusive, particularly for complex information processing. Therefore, we investigate how optimal and specific network structures form to efficiently solve distinct tasks using a novel framework of performance-dependent network evolutio… ▽ More The quest to understand structure-function relationships in networks across scientific disciplines has intensified. However, the optimal network architecture remains elusive, particularly for complex information processing. Therefore, we investigate how optimal and specific network structures form to efficiently solve distinct tasks using a novel framework of performance-dependent network evolution, leveraging reservoir computing principles. Our study demonstrates that task-specific minimal network structures obtained through this framework consistently outperform networks generated by alternative growth strategies and Erdős-Rényi random networks. Evolved networks exhibit unexpected sparsity and adhere to scaling laws in node-density space while showcasing a distinctive asymmetry in input and information readout nodes distribution. Consequently, we propose a heuristic for quantifying task complexity from performance-dependently evolved networks, offering valuable insights into the evolutionary dynamics of network structure-function relationships. Our findings not only advance the fundamental understanding of process-specific network evolution but also shed light on the design and optimization of complex information processing mechanisms, notably in machine learning. △ Less

Submitted 26 March, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

Comments: 22 pages, 6 figures

arXiv:2402.18454 [pdf, other]

Prospects for measuring time variation of astrophysical neutrino sources at dark matter detectors

Authors: Yi Zhuang, Louis E. Strigari, Lei **, Samiran Sinha

Abstract: We study the prospects for measuring the time variation of solar and atmospheric neutrino fluxes at future large-scale Xenon and Argon dark matter detectors. For solar neutrinos, a yearly time variation arises from the eccentricity of the Earth's orbit, and, for charged current interactions, from a smaller energy-dependent day-night variation to due flavor regeneration as neutrinos travel through… ▽ More We study the prospects for measuring the time variation of solar and atmospheric neutrino fluxes at future large-scale Xenon and Argon dark matter detectors. For solar neutrinos, a yearly time variation arises from the eccentricity of the Earth's orbit, and, for charged current interactions, from a smaller energy-dependent day-night variation to due flavor regeneration as neutrinos travel through the Earth. For a 100-ton Xenon detector running for 10 years with a Xenon-136 fraction of $\lesssim 0.1\%$, in the electron recoil channel a time-variation amplitude of about 0.8\% is detectable with a power of 90\% and the level of significance of 10\%. This is sufficient to detect time variation due to eccentricity, which has amplitude of $\sim 3\%$. In the nuclear recoil channel, the detectable amplitude is about 10\% under current detector resolution and efficiency conditions, and this generally reduces to about 1\% for improved detector resolution and efficiency, the latter of which is sufficient to detect time variation due to eccentricity. Our analysis assumes both known and unknown periods. We provide scalings to determine the sensitivity to an arbitrary time-varying amplitude as a function of detector parameters. Identifying the time variation of the neutrino fluxes will be important for distinguishing neutrinos from dark matter signals and other detector-related backgrounds, and extracting properties of neutrinos that can be uniquely studied in dark matter experiments. △ Less

Submitted 28 February, 2024; originally announced February 2024.

Comments: 23 pages, 15 figures

arXiv:2402.15589 [pdf, other]

Prompting LLMs to Compose Meta-Review Drafts from Peer-Review Narratives of Scholarly Manuscripts

Authors: Shubhra Kanti Karmaker Santu, Sanjeev Kumar Sinha, Naman Bansal, Alex Knipper, Souvika Sarkar, John Salvador, Yash Mahajan, Sri Guttikonda, Mousumi Akter, Matthew Freestone, Matthew C. Williams Jr

Abstract: One of the most important yet onerous tasks in the academic peer-reviewing process is composing meta-reviews, which involves understanding the core contributions, strengths, and weaknesses of a scholarly manuscript based on peer-review narratives from multiple experts and then summarizing those multiple experts' perspectives into a concise holistic overview. Given the latest major developments in… ▽ More One of the most important yet onerous tasks in the academic peer-reviewing process is composing meta-reviews, which involves understanding the core contributions, strengths, and weaknesses of a scholarly manuscript based on peer-review narratives from multiple experts and then summarizing those multiple experts' perspectives into a concise holistic overview. Given the latest major developments in generative AI, especially Large Language Models (LLMs), it is very compelling to rigorously study the utility of LLMs in generating such meta-reviews in an academic peer-review setting. In this paper, we perform a case study with three popular LLMs, i.e., GPT-3.5, LLaMA2, and PaLM2, to automatically generate meta-reviews by prompting them with different types/levels of prompts based on the recently proposed TELeR taxonomy. Finally, we perform a detailed qualitative study of the meta-reviews generated by the LLMs and summarize our findings and recommendations for prompting LLMs for this complex task. △ Less

Submitted 23 February, 2024; originally announced February 2024.

ACM Class: I.2.7

arXiv:2402.15037 [pdf, other]

Analyzing Games in Maker Protocol Part One: A Multi-Agent Influence Diagram Approach Towards Coordination

Authors: Abhimanyu Nag, Samrat Gupta, Sudipan Sinha, Arka Datta

Abstract: Decentralized Finance (DeFi) ecosystems, exemplified by the Maker Protocol, rely on intricate games to maintain stability and security. Understanding the dynamics of these games is crucial for ensuring the robustness of the system. This motivating research proposes a novel methodology leveraging Multi-Agent Influence Diagrams (MAID), originally proposed by Koller and Milch, to dissect and analyze… ▽ More Decentralized Finance (DeFi) ecosystems, exemplified by the Maker Protocol, rely on intricate games to maintain stability and security. Understanding the dynamics of these games is crucial for ensuring the robustness of the system. This motivating research proposes a novel methodology leveraging Multi-Agent Influence Diagrams (MAID), originally proposed by Koller and Milch, to dissect and analyze the games within the Maker stablecoin protocol. By representing users and governance of the Maker protocol as agents and their interactions as edges in a graph, we capture the complex network of influences governing agent behaviors. Furthermore in the upcoming papers, we will show a Nash Equilibrium model to elucidate strategies that promote coordination and enhance economic security within the ecosystem. Through this approach, we aim to motivate the use of this method to introduce a new method of formal verification of game theoretic security in DeFi platforms. △ Less

Submitted 22 February, 2024; originally announced February 2024.

arXiv:2402.12629 [pdf, other]

Television Discourse Decoded: Comprehensive Multimodal Analytics at Scale

Authors: Anmol Agarwal, Pratyush Priyadarshi, Shiven Sinha, Shrey Gupta, Hitkul Jangra, Kiran Garimella, Ponnurangam Kumaraguru

Abstract: In this paper, we tackle the complex task of analyzing televised debates, with a focus on a prime time news debate show from India. Previous methods, which often relied solely on text, fall short in capturing the multimedia essence of these debates. To address this gap, we introduce a comprehensive automated toolkit that employs advanced computer vision and speech-to-text techniques for large-scal… ▽ More In this paper, we tackle the complex task of analyzing televised debates, with a focus on a prime time news debate show from India. Previous methods, which often relied solely on text, fall short in capturing the multimedia essence of these debates. To address this gap, we introduce a comprehensive automated toolkit that employs advanced computer vision and speech-to-text techniques for large-scale multimedia analysis. Utilizing state-of-the-art computer vision algorithms and speech-to-text methods, we transcribe, diarize, and analyze thousands of YouTube videos of prime-time television debates in India. These debates are a central part of Indian media but have been criticized for compromised journalistic integrity and excessive dramatization. Our toolkit provides concrete metrics to assess bias and incivility, capturing a comprehensive multimedia perspective that includes text, audio utterances, and video frames. Our findings reveal significant biases in topic selection and panelist representation, along with alarming levels of incivility. This work offers a scalable, automated approach for future research in multimedia analysis, with profound implications for the quality of public discourse and democratic debate. We will make our data analysis pipeline and collected data publicly available to catalyze further research in this domain. △ Less

Submitted 19 February, 2024; originally announced February 2024.

arXiv:2402.08823 [pdf, other]

RanDumb: A Simple Approach that Questions the Efficacy of Continual Representation Learning

Authors: Ameya Prabhu, Shiven Sinha, Ponnurangam Kumaraguru, Philip H. S. Torr, Ozan Sener, Puneet K. Dokania

Abstract: We propose RanDumb to examine the efficacy of continual representation learning. RanDumb embeds raw pixels using a fixed random transform which approximates an RBF-Kernel, initialized before seeing any data, and learns a simple linear classifier on top. We present a surprising and consistent finding: RanDumb significantly outperforms the continually learned representations using deep networks acro… ▽ More We propose RanDumb to examine the efficacy of continual representation learning. RanDumb embeds raw pixels using a fixed random transform which approximates an RBF-Kernel, initialized before seeing any data, and learns a simple linear classifier on top. We present a surprising and consistent finding: RanDumb significantly outperforms the continually learned representations using deep networks across numerous continual learning benchmarks, demonstrating the poor performance of representation learning in these scenarios. RanDumb stores no exemplars and performs a single pass over the data, processing one sample at a time. It complements GDumb, operating in a low-exemplar regime where GDumb has especially poor performance. We reach the same consistent conclusions when RanDumb is extended to scenarios with pretrained models replacing the random transform with pretrained feature extractor. Our investigation is both surprising and alarming as it questions our understanding of how to effectively design and train models that require efficient continual representation learning, and necessitates a principled reinvestigation of the widely explored problem formulation itself. Our code is available at https://github.com/drimpossible/RanDumb. △ Less

Submitted 13 February, 2024; originally announced February 2024.

Comments: Tech Report

arXiv:2402.05466 [pdf, other]

Engineering End-to-End Remote Labs using IoT-based Retrofitting

Authors: K. S. Viswanadh, Akshit Gureja, Nagesh Walchatwar, Rishabh Agrawal, Shiven Sinha, Sachin Chaudhari, Karthik Vaidhyanathan, Venkatesh Choppella, Prabhakar Bhimalapuram, Harikumar Kandath, Aftab Hussain

Abstract: Remote labs are a groundbreaking development in the education industry, providing students with access to laboratory education anytime, anywhere. However, most remote labs are costly and difficult to scale, especially in develo** countries. With this as a motivation, this paper proposes a new remote labs (RLabs) solution that includes two use case experiments: Vanishing Rod and Focal Length. The… ▽ More Remote labs are a groundbreaking development in the education industry, providing students with access to laboratory education anytime, anywhere. However, most remote labs are costly and difficult to scale, especially in develo** countries. With this as a motivation, this paper proposes a new remote labs (RLabs) solution that includes two use case experiments: Vanishing Rod and Focal Length. The hardware experiments are built at a low-cost by retrofitting Internet of Things (IoT) components. They are also made portable by designing miniaturised and modular setups. The software architecture designed as part of the solution seamlessly supports the scalability of the experiments, offering compatibility with a wide range of hardware devices and IoT platforms. Additionally, it can live-stream remote experiments without needing dedicated server space for the stream. The software architecture also includes an automation suite that periodically checks the status of the experiments using computer vision (CV). RLabs is qualitatively evaluated against seven non-functional attributes - affordability, portability, scalability, compatibility, maintainability, usability, and universality. Finally, user feedback was collected from a group of students, and the scores indicate a positive response to the students' learning and the platform's usability. △ Less

Submitted 8 February, 2024; originally announced February 2024.

Comments: 30 pages, 7 tables and 20 figures. Submitted to ACM Transactions on IoT

arXiv:2402.04466 [pdf, other]

Towards Deterministic End-to-end Latency for Medical AI Systems in NVIDIA Holoscan

Authors: Soham Sinha, Shekhar Dwivedi, Mahdi Azizian

Abstract: The introduction of AI and ML technologies into medical devices has revolutionized healthcare diagnostics and treatments. Medical device manufacturers are keen to maximize the advantages afforded by AI and ML by consolidating multiple applications onto a single platform. However, concurrent execution of several AI applications, each with its own visualization components, leads to unpredictable end… ▽ More The introduction of AI and ML technologies into medical devices has revolutionized healthcare diagnostics and treatments. Medical device manufacturers are keen to maximize the advantages afforded by AI and ML by consolidating multiple applications onto a single platform. However, concurrent execution of several AI applications, each with its own visualization components, leads to unpredictable end-to-end latency, primarily due to GPU resource contentions. To mitigate this, manufacturers typically deploy separate workstations for distinct AI applications, thereby increasing financial, energy, and maintenance costs. This paper addresses these challenges within the context of NVIDIA's Holoscan platform, a real-time AI system for streaming sensor data and images. We propose a system design optimized for heterogeneous GPU workloads, encompassing both compute and graphics tasks. Our design leverages CUDA MPS for spatial partitioning of compute workloads and isolates compute and graphics processing onto separate GPUs. We demonstrate significant performance improvements across various end-to-end latency determinism metrics through empirical evaluation with real-world Holoscan medical device applications. For instance, the proposed design reduces maximum latency by 21-30% and improves latency distribution flatness by 17-25% for up to five concurrent endoscopy tool tracking AI applications, compared to a single-GPU baseline. Against a default multi-GPU setup, our optimizations decrease maximum latency by 35% for up to six concurrent applications by improving GPU utilization by 42%. This paper provides clear design insights for AI applications in the edge-computing domain including medical systems, where performance predictability of concurrent and heterogeneous GPU workloads is a critical requirement. △ Less

Submitted 6 February, 2024; originally announced February 2024.

ACM Class: C.3; J.7; D.2.11; D.2.10; D.4.8

arXiv:2402.01980 [pdf, other]

SOCIALITE-LLAMA: An Instruction-Tuned Model for Social Scientific Tasks

Authors: Gourab Dey, Adithya V Ganesan, Yash Kumar Lal, Manal Shah, Shreyashee Sinha, Matthew Matero, Salvatore Giorgi, Vivek Kulkarni, H. Andrew Schwartz

Abstract: Social science NLP tasks, such as emotion or humor detection, are required to capture the semantics along with the implicit pragmatics from text, often with limited amounts of training data. Instruction tuning has been shown to improve the many capabilities of large language models (LLMs) such as commonsense reasoning, reading comprehension, and computer programming. However, little is known about… ▽ More Social science NLP tasks, such as emotion or humor detection, are required to capture the semantics along with the implicit pragmatics from text, often with limited amounts of training data. Instruction tuning has been shown to improve the many capabilities of large language models (LLMs) such as commonsense reasoning, reading comprehension, and computer programming. However, little is known about the effectiveness of instruction tuning on the social domain where implicit pragmatic cues are often needed to be captured. We explore the use of instruction tuning for social science NLP tasks and introduce Socialite-Llama -- an open-source, instruction-tuned Llama. On a suite of 20 social science tasks, Socialite-Llama improves upon the performance of Llama as well as matches or improves upon the performance of a state-of-the-art, multi-task finetuned model on a majority of them. Further, Socialite-Llama also leads to improvement on 5 out of 6 related social tasks as compared to Llama, suggesting instruction tuning can lead to generalized social understanding. All resources including our code, model and dataset can be found through bit.ly/socialitellama. △ Less

Submitted 14 March, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

Comments: Short paper accepted to EACL 2024. 4 pgs, 2 tables

arXiv:2401.18083 [pdf, other]

Improved Scene Landmark Detection for Camera Localization

Authors: Tien Do, Sudipta N. Sinha

Abstract: Camera localization methods based on retrieval, local feature matching, and 3D structure-based pose estimation are accurate but require high storage, are slow, and are not privacy-preserving. A method based on scene landmark detection (SLD) was recently proposed to address these limitations. It involves training a convolutional neural network (CNN) to detect a few predetermined, salient, scene-spe… ▽ More Camera localization methods based on retrieval, local feature matching, and 3D structure-based pose estimation are accurate but require high storage, are slow, and are not privacy-preserving. A method based on scene landmark detection (SLD) was recently proposed to address these limitations. It involves training a convolutional neural network (CNN) to detect a few predetermined, salient, scene-specific 3D points or landmarks and computing camera pose from the associated 2D-3D correspondences. Although SLD outperformed existing learning-based approaches, it was notably less accurate than 3D structure-based methods. In this paper, we show that the accuracy gap was due to insufficient model capacity and noisy labels during training. To mitigate the capacity issue, we propose to split the landmarks into subgroups and train a separate network for each subgroup. To generate better training labels, we propose using dense reconstructions to estimate visibility of scene landmarks. Finally, we present a compact architecture to improve memory efficiency. Accuracy wise, our approach is on par with state of the art structure based methods on the INDOOR-6 dataset but runs significantly faster and uses less storage. Code and models can be found at https://github.com/microsoft/SceneLandmarkLocalization. △ Less

Submitted 31 January, 2024; originally announced January 2024.

Comments: To be presented at 3DV 2024

arXiv:2401.14064 [pdf, other]

Quantum Treatment of the Current through Plasma-Metal Junction: Fundamentals

Authors: Muthukumar Balasundaram, Suraj Kumar Sinha

Abstract: We study the quantum nature of current through plasma-probe junction from the viewpoint of the metal probe. The intrinsic material properties of the metal and their influence on the nature of the observed current are theoretically worked out. The novel idea is that the plasma-sheath at the plasma-probe junction is treated as a potential barrier, and in analogy with the current conduction through a… ▽ More We study the quantum nature of current through plasma-probe junction from the viewpoint of the metal probe. The intrinsic material properties of the metal and their influence on the nature of the observed current are theoretically worked out. The novel idea is that the plasma-sheath at the plasma-probe junction is treated as a potential barrier, and in analogy with the current conduction through a metal-metal junction, the current through the plasma-sheath is treated as a quantum barrier penetration problem. Essentially, we obtain an expression for the electron-current as a function of the bias voltage in its full range, thereby unlocking the intricate dependency of the current on the material properties of the probe. △ Less

Submitted 25 January, 2024; originally announced January 2024.

Comments: 5 pages, 5 figures

arXiv:2401.09549 [pdf, other]

Interferometric Single-Shot Parity Measurement in an InAs-Al Hybrid Device

Authors: Morteza Aghaee, Alejandro Alcaraz Ramirez, Zulfi Alam, Rizwan Ali, Mariusz Andrzejczuk, Andrey Antipov, Mikhail Astafev, Amin Barzegar, Bela Bauer, Jonathan Becker, Umesh Kumar Bhaskar, Alex Bocharov, Srini Boddapati, David Bohn, Jouri Bommer, Leo Bourdet, Arnaud Bousquet, Samuel Boutin, Lucas Casparis, Benjamin James Chapman, Sohail Chatoor, Anna Wulff Christensen, Cassandra Chua, Patrick Codd, William Cole , et al. (137 additional authors not shown)

Abstract: The fusion of non-Abelian anyons or topological defects is a fundamental operation in measurement-only topological quantum computation. In topological superconductors, this operation amounts to a determination of the shared fermion parity of Majorana zero modes. As a step towards this, we implement a single-shot interferometric measurement of fermion parity in indium arsenide-aluminum heterostruct… ▽ More The fusion of non-Abelian anyons or topological defects is a fundamental operation in measurement-only topological quantum computation. In topological superconductors, this operation amounts to a determination of the shared fermion parity of Majorana zero modes. As a step towards this, we implement a single-shot interferometric measurement of fermion parity in indium arsenide-aluminum heterostructures with a gate-defined nanowire. The interferometer is formed by tunnel-coupling the proximitized nanowire to quantum dots. The nanowire causes a state-dependent shift of these quantum dots' quantum capacitance of up to 1 fF. Our quantum capacitance measurements show flux h/2e-periodic bimodality with a signal-to-noise ratio of 1 in 3.7 $μ$s at optimal flux values. From the time traces of the quantum capacitance measurements, we extract a dwell time in the two associated states that is longer than 1 ms at in-plane magnetic fields of approximately 2 T. These results are consistent with a measurement of the fermion parity encoded in a pair of Majorana zero modes that are separated by approximately 3 $μ$m and subjected to a low rate of poisoning by non-equilibrium quasiparticles. The large capacitance shift and long poisoning time enable a parity measurement error probability of 1%. △ Less

Submitted 2 April, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

Comments: Added data on a second measurement of device A and a measurement of device B, expanded discussion of a trivial scenario. Refs added, author list updated

arXiv:2401.01596 [pdf, other]

MedSumm: A Multimodal Approach to Summarizing Code-Mixed Hindi-English Clinical Queries

Authors: Akash Ghosh, Arkadeep Acharya, Prince Jha, Aniket Gaudgaul, Rajdeep Majumdar, Sriparna Saha, Aman Chadha, Raghav Jain, Setu Sinha, Shivani Agarwal

Abstract: In the healthcare domain, summarizing medical questions posed by patients is critical for improving doctor-patient interactions and medical decision-making. Although medical data has grown in complexity and quantity, the current body of research in this domain has primarily concentrated on text-based methods, overlooking the integration of visual cues. Also prior works in the area of medical quest… ▽ More In the healthcare domain, summarizing medical questions posed by patients is critical for improving doctor-patient interactions and medical decision-making. Although medical data has grown in complexity and quantity, the current body of research in this domain has primarily concentrated on text-based methods, overlooking the integration of visual cues. Also prior works in the area of medical question summarisation have been limited to the English language. This work introduces the task of multimodal medical question summarization for codemixed input in a low-resource setting. To address this gap, we introduce the Multimodal Medical Codemixed Question Summarization MMCQS dataset, which combines Hindi-English codemixed medical queries with visual aids. This integration enriches the representation of a patient's medical condition, providing a more comprehensive perspective. We also propose a framework named MedSumm that leverages the power of LLMs and VLMs for this task. By utilizing our MMCQS dataset, we demonstrate the value of integrating visual information from images to improve the creation of medically detailed summaries. This multimodal strategy not only improves healthcare decision-making but also promotes a deeper comprehension of patient queries, paving the way for future exploration in personalized and responsive medical care. Our dataset, code, and pre-trained models will be made publicly available. △ Less

Submitted 3 January, 2024; originally announced January 2024.

Comments: ECIR 2024

arXiv:2312.14799 [pdf, other]

Disorder-induced non-linear growth of viscously-unstable immiscible two-phase flow fingers in porous media

Authors: Santanu Sinha, Yves Méheust, Hursanay Fyhn, Subhadeep Roy, Alex Hansen

Abstract: The immiscible displacement of a fluid by another one inside a porous medium produces different types of patterns depending on the capillary number Ca and viscosity ratio M. At high Ca, viscous fingers resulting from the viscous instability between fluid-fluid interfaces are believed to exhibit the same Laplacian growth behavior as viscously-unstable fingers observed in Hele-Shaw cells by Saffman… ▽ More The immiscible displacement of a fluid by another one inside a porous medium produces different types of patterns depending on the capillary number Ca and viscosity ratio M. At high Ca, viscous fingers resulting from the viscous instability between fluid-fluid interfaces are believed to exhibit the same Laplacian growth behavior as viscously-unstable fingers observed in Hele-Shaw cells by Saffman and Taylor [1], or as diffusion limited aggregates (DLA) [2]. I.e., the interface velocity depends linearly on the local gradient of the physical field that drives the growth process (for two-phase flow, the pressure field). However, steady-state two-phase flow in porous media is known to exhibit a regime for which the flow rate depends as a non-linear power law on the global pressure drop, due to the disorder in the capillary barriers at pore throats. A similar nonlinear growth regime was also evidenced experimentally for viscously-unstable drainage in two-dimensional porous media 20 years ago [3]. Here we revisit this flow regime using dynamic pore-network modeling, and explore the non-linearity in the growth properties. We characterize the previously-unstudied dependencies of the statistical finger width and nonlinear growth law's exponent on Ca, and discuss quantitatively, based on theoretical arguments, how disorder in the capillary barriers controls the growth process' non-linearity, and why the flow regime crosses over to Laplacian growth at sufficiently high Ca. In addition, the statistical properties of the fingering patterns are compared to those of Saffman-Taylor fingers, DLA growth patterns, and the results from the aforementioned previous experimental study. △ Less

Submitted 22 December, 2023; originally announced December 2023.

Comments: 18 pages, 13 figures

arXiv:2312.11541 [pdf, other]

CLIPSyntel: CLIP and LLM Synergy for Multimodal Question Summarization in Healthcare

Authors: Akash Ghosh, Arkadeep Acharya, Raghav Jain, Sriparna Saha, Aman Chadha, Setu Sinha

Abstract: In the era of modern healthcare, swiftly generating medical question summaries is crucial for informed and timely patient care. Despite the increasing complexity and volume of medical data, existing studies have focused solely on text-based summarization, neglecting the integration of visual information. Recognizing the untapped potential of combining textual queries with visual representations of… ▽ More In the era of modern healthcare, swiftly generating medical question summaries is crucial for informed and timely patient care. Despite the increasing complexity and volume of medical data, existing studies have focused solely on text-based summarization, neglecting the integration of visual information. Recognizing the untapped potential of combining textual queries with visual representations of medical conditions, we introduce the Multimodal Medical Question Summarization (MMQS) Dataset. This dataset, a major contribution to our work, pairs medical queries with visual aids, facilitating a richer and more nuanced understanding of patient needs. We also propose a framework, utilizing the power of Contrastive Language Image Pretraining(CLIP) and Large Language Models(LLMs), consisting of four modules that identify medical disorders, generate relevant context, filter medical concepts, and craft visually aware summaries. Our comprehensive framework harnesses the power of CLIP, a multimodal foundation model, and various general-purpose LLMs, comprising four main modules: the medical disorder identification module, the relevant context generation module, the context filtration module for distilling relevant medical concepts and knowledge, and finally, a general-purpose LLM to generate visually aware medical question summaries. Leveraging our MMQS dataset, we showcase how visual cues from images enhance the generation of medically nuanced summaries. This multimodal approach not only enhances the decision-making process in healthcare but also fosters a more nuanced understanding of patient queries, laying the groundwork for future research in personalized and responsive medical care △ Less

Submitted 15 December, 2023; originally announced December 2023.

Comments: AAAI 2024

arXiv:2312.03572 [pdf, ps, other]

Generalized $α$-Observational Entropy

Authors: Shivam Sinha, S. Aravinda

Abstract: Recognizing the inadequacy of existing measures for thermodynamic entropy, recent research focuses on observational Eetropy (OE) as a promising alternative, offering practical applicability and theoretical insights. In this work, we extend the scope of observational entropy by generalizing it to a parameterized version called $α$-Observational entropy ($α$-OE). $α$-OE is expressed in terms of the… ▽ More Recognizing the inadequacy of existing measures for thermodynamic entropy, recent research focuses on observational Eetropy (OE) as a promising alternative, offering practical applicability and theoretical insights. In this work, we extend the scope of observational entropy by generalizing it to a parameterized version called $α$-Observational entropy ($α$-OE). $α$-OE is expressed in terms of the Petz-Rényi relative entropy between the states on which a quantum-to-classical channel is applied. It is also expressed by using Sandwitched relative entropy. We prove various properties of the $α$-OE, which are the generalization of the properties of OE, including the monotonically increasing of $α$-OE as a function of refinement of coarse-graining. The generalized quantum relative entropies play a central role in many areas of quantum information theory, and we provide a connection of these entropic quantities to thermodynamic properties. △ Less

Submitted 6 December, 2023; originally announced December 2023.

Comments: 8 pages. Preliminary draft

arXiv:2312.00894 [pdf, other]

Leveraging Large Language Models to Improve REST API Testing

Authors: Myeongsoo Kim, Tyler Stennett, Dhruv Shah, Saurabh Sinha, Alessandro Orso

Abstract: The widespread adoption of REST APIs, coupled with their growing complexity and size, has led to the need for automated REST API testing tools. Current tools focus on the structured data in REST API specifications but often neglect valuable insights available in unstructured natural-language descriptions in the specifications, which leads to suboptimal test coverage. Recently, to address this gap,… ▽ More The widespread adoption of REST APIs, coupled with their growing complexity and size, has led to the need for automated REST API testing tools. Current tools focus on the structured data in REST API specifications but often neglect valuable insights available in unstructured natural-language descriptions in the specifications, which leads to suboptimal test coverage. Recently, to address this gap, researchers have developed techniques that extract rules from these human-readable descriptions and query knowledge bases to derive meaningful input values. However, these techniques are limited in the types of rules they can extract and prone to produce inaccurate results. This paper presents RESTGPT, an innovative approach that leverages the power and intrinsic context-awareness of Large Language Models (LLMs) to improve REST API testing. RESTGPT takes as input an API specification, extracts machine-interpretable rules, and generates example parameter values from natural-language descriptions in the specification. It then augments the original specification with these rules and values. Our evaluations indicate that RESTGPT outperforms existing techniques in both rule extraction and value generation. Given these promising results, we outline future research directions for advancing REST API testing through LLMs. △ Less

Submitted 29 January, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

Comments: To be published in the 46th IEEE/ACM International Conference on Software Engineering - New Ideas and Emerging Results Track (ICSE-NIER 2024)

arXiv:2311.18820 [pdf, other]

Adversarial Attacks and Defenses for Wireless Signal Classifiers using CDI-aware GANs

Authors: Sujata Sinha, Alkan Soysal

Abstract: We introduce a Channel Distribution Information (CDI)-aware Generative Adversarial Network (GAN), designed to address the unique challenges of adversarial attacks in wireless communication systems. The generator in this CDI-aware GAN maps random input noise to the feature space, generating perturbations intended to deceive a target modulation classifier. Its discriminators play a dual role: one en… ▽ More We introduce a Channel Distribution Information (CDI)-aware Generative Adversarial Network (GAN), designed to address the unique challenges of adversarial attacks in wireless communication systems. The generator in this CDI-aware GAN maps random input noise to the feature space, generating perturbations intended to deceive a target modulation classifier. Its discriminators play a dual role: one enforces that the perturbations follow a Gaussian distribution, making them indistinguishable from Gaussian noise, while the other ensures these perturbations account for realistic channel effects and resemble no-channel perturbations. Our proposed CDI-aware GAN can be used as an attacker and a defender. In attack scenarios, the CDI-aware GAN demonstrates its prowess by generating robust adversarial perturbations that effectively deceive the target classifier, outperforming known methods. Furthermore, CDI-aware GAN as a defender significantly improves the target classifier's resilience against adversarial attacks. △ Less

Submitted 30 November, 2023; originally announced November 2023.

arXiv:2311.18101 [pdf, other]

On the role of mechanical feedback in synchronous to asynchronous transition during embryogenesis

Authors: Abdul Malmi-Kakkada, Sumit Sinha, D. Thirumalai

Abstract: Experiments have shown that during the initial stage of Zebrafish morphogenesis a synchronous to asynchronous transition (SAT) occurs, as the cells divide extremely rapidly. In the synchronous phase, the cells divide in unison unlike in the asynchronous phase. Despite the widespread observation of SAT in experiments, a theory to calculate the critical number of cell cycles, $n^{*}$, at which async… ▽ More Experiments have shown that during the initial stage of Zebrafish morphogenesis a synchronous to asynchronous transition (SAT) occurs, as the cells divide extremely rapidly. In the synchronous phase, the cells divide in unison unlike in the asynchronous phase. Despite the widespread observation of SAT in experiments, a theory to calculate the critical number of cell cycles, $n^{*}$, at which asynchronous growth emerges does not exist. Here, using a model for the cell cycle, with the assumption that cell division times are Gaussian distributed with broadening, we predict $n^{*}$ and the time at which the SAT occurs. The theoretical results are in excellent agreement with experiments. The theory, supplemented by agent based simulations, establish that the SAT emerges as a consequence of biomechanical feedback on cell division. The emergence of asynchronous phase is due to linearly increasing fluctuations in the cell cycle times with each round of cell division. We also make several testable predictions, which would further shed light on the role of biomechanical feedback on the growth of multicellular systems. △ Less

Submitted 29 November, 2023; originally announced November 2023.

Comments: 16 pages, 4 figures and Supplementary Information

arXiv:2311.17039 [pdf, other]

Optimal control of interacting active particles on complex landscapes

Authors: Sumit Sinha, Vishaal Krishnan, L Mahadevan

Abstract: Active many-body systems composed of many interacting degrees of freedom often operate out of equilibrium, giving rise to non-trivial emergent behaviors which can be functional in both evolved and engineered contexts. This naturally suggests the question of control to optimize function. Using navigation as a paradigm for function, we deploy the language of stochastic optimal control theory to form… ▽ More Active many-body systems composed of many interacting degrees of freedom often operate out of equilibrium, giving rise to non-trivial emergent behaviors which can be functional in both evolved and engineered contexts. This naturally suggests the question of control to optimize function. Using navigation as a paradigm for function, we deploy the language of stochastic optimal control theory to formulate the inverse problem of shepherding a system of interacting active particles across a complex landscape. We implement a solution to this high-dimensional problem using an Adjoint-based Path Integral Control (APIC) algorithm that combines the power of recently introduced continuous-time back-propagation methods and automatic differentiation with the classical Feynman-Kac path integral formulation in statistical mechanics. Numerical experiments for controlling individual and interacting particles in complex landscapes show different classes of successful navigation strategies as a function of landscape complexity, as well as the intrinsic noise and drive of the active particles. However, in all cases, we see the emergence of paths that correspond to traversal along the edges of ridges and ravines, which we can understand using a variational analysis. We also show that the work associated with optimal strategies is inversely proportional to the length of the time horizon of optimal control, a result that follows from scaling considerations. All together, our approach serves as a foundational framework to control active non-equilibrium systems optimally to achieve functionality, embodied as a path on a high-dimensional manifold. △ Less

Submitted 28 November, 2023; originally announced November 2023.

Comments: 27 pages, 6 figures

arXiv:2311.16330 [pdf, ps, other]

MITP Colours in Darkness workshop summary report

Authors: Jonathan Butterworth, Cesare Cazzaniga, Aran Garcia-Bellido, Deepak Kar, Suchita Kulkarni, Pedro Schwaller, Sukanya Sinha, Danielle Wilson-Edwards, Jose Zurita

Abstract: This report summarises the talks and discussions that took place over the course of the MITP Youngst@rs Colours in Darkness workshop 2023. All talks can be found at https://indico.mitp.uni-mainz.de/event/377/. This report summarises the talks and discussions that took place over the course of the MITP Youngst@rs Colours in Darkness workshop 2023. All talks can be found at https://indico.mitp.uni-mainz.de/event/377/. △ Less

Submitted 27 November, 2023; originally announced November 2023.

arXiv:2311.16209 [pdf, ps, other]

doi 10.1088/1402-4896/ad5912

Effect of Quantum Information Scrambling on Bound Entangled States

Authors: Suprabhat Sinha

Abstract: Spreading information in physical systems is a common phenomenon. However, when the information is quantum in nature, tracking, describing, and quantifying the information is a challenging task. Quantum information scrambling defines the quantum information propagating chaotically over the physical system. This article describes the effect of quantum information scrambling on bound entangled state… ▽ More Spreading information in physical systems is a common phenomenon. However, when the information is quantum in nature, tracking, describing, and quantifying the information is a challenging task. Quantum information scrambling defines the quantum information propagating chaotically over the physical system. This article describes the effect of quantum information scrambling on bound entangled states. A bound entangled state is a particular type of entangled state that carries noisy entanglement. The distillation of this type of entangled state is very difficult. In recent times, the usefulness of these states has been depicted in different applications. The outcome of this study exhibits that quantum information scrambling develops entanglement in the separable portion of the bound entangled states. Although quantum information scrambling reduces entanglement, the study pointed out that quantum information scrambling plays a significant role in activating the bound entangled states by introducing a certain amount of approximately stable entanglement. △ Less

Submitted 21 June, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

arXiv:2311.12585 [pdf]

An IoT-based Smart Parking System

Authors: Ridhi Choudhary, Arnav Sanjay Sinha, Krishna Jaiswal, Anurag Chandra

Abstract: The number of vehicles on the road is growing every day, thus there's a growing need to develop effective and hassle-free parking systems. Finding a parking space may be a big challenge, especially in crowded cities or areas with scheduled sporting or cultural events. The project suggests an automated parking system that makes use of technology like sensor systems and microcontrollers. In order to… ▽ More The number of vehicles on the road is growing every day, thus there's a growing need to develop effective and hassle-free parking systems. Finding a parking space may be a big challenge, especially in crowded cities or areas with scheduled sporting or cultural events. The project suggests an automated parking system that makes use of technology like sensor systems and microcontrollers. In order to make it easier for drivers to park in empty spots and cut down on the time and effort needed for manual searches, this system is made to identify empty parking spaces and display the available parking spots on an LCD screen. △ Less

Submitted 21 November, 2023; originally announced November 2023.

Comments: 3 pages

arXiv:2311.05469 [pdf]

doi 10.1002/adma.202300416

Skyrmion-Excited Spin Wave Fractal Network

Authors: Nan Tang, W. L. N. C. Liyanage, Sergio A. Montoya, Sheena Patel, Lizabeth J. Quigley, Alexander J. Grutter, Michael R. Fitzsimmons, Sunil Sinha, Julie A. Borchers, Eric E. Fullerton, Lisa DeBeer-Schmitt, Dustin A. Gilbert

Abstract: Magnetic skyrmions exhibit unique, technologically relevant pseudo-particle behaviors which arise from their topological protection, including well-defined, three-dimensional dynamic modes that occur at microwave frequencies. During dynamic excitation, spin waves are ejected into the interstitial regions between skyrmions, creating the magnetic equivalent of a turbulent sea. However, since the spi… ▽ More Magnetic skyrmions exhibit unique, technologically relevant pseudo-particle behaviors which arise from their topological protection, including well-defined, three-dimensional dynamic modes that occur at microwave frequencies. During dynamic excitation, spin waves are ejected into the interstitial regions between skyrmions, creating the magnetic equivalent of a turbulent sea. However, since the spin waves in these systems have a well-defined length scale, and the skyrmions are on an ordered lattice, ordered structures from spin wave interference can precipitate from the chaos. This work uses small angle neutron scattering (SANS) to capture the dynamics in hybrid skyrmions and investigate the spin wave structure. Performing simultaneous ferromagnetic resonance and SANS, the diffraction pattern shows a large increase in low-angle scattering intensity which is present only in the resonance condition. This scattering pattern is best fit using a mass fractal model, which suggests the spin waves form a long-range fractal network. The fractal structure is constructed of fundamental units with a size that encodes the spin wave emissions and are constrained by the skyrmion lattice. These results offer critical insights into the nanoscale dynamics of skyrmions, identify a new dynamic spin wave fractal structure, and demonstrates SANS as a unique tool to probe high-speed dynamics. △ Less

Submitted 9 November, 2023; originally announced November 2023.

Journal ref: Advanced Materials, 2300416 (2023)

arXiv:2310.12779 [pdf, other]

Emergence of a quasi-ergodic steady state in a dissipative Tavis-Cummings array

Authors: Debabrata Mondal, K. Sengupta, Subhasis Sinha

Abstract: In an atom-photon interacting system described by Tavis Cummings Hubbard (TCH) model, we demonstrate the emergence of a quasi-steady state in a dissipative environment that exhibits intriguing ergodic behavior. The TCH model undergoes a dissipative transition from normal to superradiant phase hosting a gapped Higgs and gapless Goldstone modes. However, in a large region of the phase diagram, the i… ▽ More In an atom-photon interacting system described by Tavis Cummings Hubbard (TCH) model, we demonstrate the emergence of a quasi-steady state in a dissipative environment that exhibits intriguing ergodic behavior. The TCH model undergoes a dissipative transition from normal to superradiant phase hosting a gapped Higgs and gapless Goldstone modes. However, in a large region of the phase diagram, the instability of the Goldstone mode leads to the disappearance of the stable superradiant phase. In this regime, the decorrelator dynamics reveals light cone spreading of the perturbations and positive Lyapunov exponent, indicating enhanced fluctuations. Remarkably, a quasi-steady state emerges under quench dynamics in this unstable regime; in this state, a class of collective quantities such as site averaged photon number and atomic excitations approach a steady value, in spite of large temporal fluctuations in corresponding microscopic variables. This quasi-steady state describes an incoherent fluid of photons with significant phase fluctuation. The phase space dynamics reveals a fascinating ergodic behavior in presence of dissipation, leading to the characterization of the dynamical variables into two distinct classes. The first class includes site-averaged photon numbers and atomic excitations; these exhibit a stationary distribution regardless of the initial condition indicating ergodic behavior. The second class of variables, particularly those related to phase in contrast, retain information about the initial conditions, resulting in a violation of ergodicity for finite size system. Additionally, the dynamical variables of the ergodic class exhibit fascinating collective scarring phenomenon as the peak of their distribution is attracted towards the unstable steady state, analogous to the single particle quantum scar. We discuss the relevance of our findings in the current experiments. △ Less

Submitted 19 October, 2023; originally announced October 2023.

arXiv:2310.06191 [pdf]

Investigating the Correlation between Force Output, Strains, and Pressure for Active Skeletal Muscle Contractions

Authors: Karan Taneja, Xiaolong He, John Hodgson, Usha Sinha, Shantanu Sinha, J. S. Chen

Abstract: Experimental observations suggest that the force output of the skeletal muscle tissue can be correlated to the intra-muscular pressure generated by the muscle belly. However, pressure often proves difficult to measure through in-vivo tests. Simulations on the other hand, offer a tool to model muscle contractions and analyze the relationship between muscle force generation and deformations as well… ▽ More Experimental observations suggest that the force output of the skeletal muscle tissue can be correlated to the intra-muscular pressure generated by the muscle belly. However, pressure often proves difficult to measure through in-vivo tests. Simulations on the other hand, offer a tool to model muscle contractions and analyze the relationship between muscle force generation and deformations as well as pressure outputs, enabling us to gain insight into correlations among experimentally measurable quantities such as principal and volumetric strains, and the force output. In this work, a correlation study is performed using Pearson's and Spearman's correlation coefficients on the force output of the skeletal muscle, the principal and volumetric strains experienced by the muscle and the pressure developed within the muscle belly as the muscle tissue undergoes isometric contractions due to varying activation profiles. The study reveals strong correlations between force output and the strains at all locations of the belly, irrespective of the type of activation profile used. This observation enables estimation on the contribution of various muscle groups to the total force by the experimentally measurable principal and volumetric strains in the muscle belly. It is also observed that pressure does not correlate well with force output due to stress relaxation near the boundary of muscle belly. △ Less

Submitted 9 October, 2023; originally announced October 2023.

arXiv:2310.04540 [pdf, other]

Multi-decadal Sea Level Prediction using Neural Networks and Spectral Clustering on Climate Model Large Ensembles and Satellite Altimeter Data

Authors: Saumya Sinha, John Fasullo, R. Steven Nerem, Claire Monteleoni

Abstract: Sea surface height observations provided by satellite altimetry since 1993 show a rising rate (3.4 mm/year) for global mean sea level. While on average, sea level has risen 10 cm over the last 30 years, there is considerable regional variation in the sea level change. Through this work, we predict sea level trends 30 years into the future at a 2-degree spatial resolution and investigate the future… ▽ More Sea surface height observations provided by satellite altimetry since 1993 show a rising rate (3.4 mm/year) for global mean sea level. While on average, sea level has risen 10 cm over the last 30 years, there is considerable regional variation in the sea level change. Through this work, we predict sea level trends 30 years into the future at a 2-degree spatial resolution and investigate the future patterns of the sea level change. We show the potential of machine learning (ML) in this challenging application of long-term sea level forecasting over the global ocean. Our approach incorporates sea level data from both altimeter observations and climate model simulations. We develop a supervised learning framework using fully connected neural networks (FCNNs) that can predict the sea level trend based on climate model projections. Alongside this, our method provides uncertainty estimates associated with the ML prediction. We also show the effectiveness of partitioning our spatial dataset and learning a dedicated ML model for each segmented region. We compare two partitioning strategies: one achieved using domain knowledge, and the other employing spectral clustering. Our results demonstrate that segmenting the spatial dataset with spectral clustering improves the ML predictions. △ Less

Submitted 6 October, 2023; originally announced October 2023.

arXiv:2310.03818 [pdf, other]

Diboride compounds doped with transition metals$\unicode{x2013}$a route to superconductivity through structure stabilization as well as defects

Authors: P. M. Dee, J. S. Kim, A. C. Hire, J. Lim, L. Fanfarillo, S. Sinha, J. J. Hamlin, R. G. Hennig, P. J. Hirschfeld, G. R. Stewart

Abstract: Recent investigations into MoB$_{2}$ have unveiled a direct connection between a pressure-induced structural transition to a P6/mmm space group structure and the emergence of superconductivity, producing critical temperatures up to 32 K at 100 GPa. This pressure-induced superconducting state underscores the potential of doped MoB$_{2}$ as a possible candidate for metastable superconductivity at am… ▽ More Recent investigations into MoB$_{2}$ have unveiled a direct connection between a pressure-induced structural transition to a P6/mmm space group structure and the emergence of superconductivity, producing critical temperatures up to 32 K at 100 GPa. This pressure-induced superconducting state underscores the potential of doped MoB$_{2}$ as a possible candidate for metastable superconductivity at ambient pressure. In this work, we demonstrate that do** by Zr, Hf, or Ta stabilizes the P6/mmm structure at ambient pressure and results in the realization of a superconducting state with critical temperatures ranging from 2.4 up to 8.5 K depending on the specific do**. We estimate the electron-phonon coupling $λ$ and the density of states based on resistivity and specific heat data, finding that $λ$ ranges from 0.4 - 0.6 for these compounds. Finally, to investigate the role of possible metastable defect structures on the critical temperature, we analyze MoB$_{2}$, MoB$_{2.5}$, and Nb/Zr-doped MoB$_{2}$ using rapid cooling techniques. Notably, splat-quenching produces samples with higher critical temperatures and even retains superconductivity in MoB$_{2}$ at ambient pressure, achieving a critical temperature of 4.5 K. △ Less

Submitted 11 October, 2023; v1 submitted 5 October, 2023; originally announced October 2023.

Comments: 11 pages, 10 figures (Updated version Oct. 11 2023 ==> arXiv title fixed)

arXiv:2310.01062 [pdf, other]

Bayesian inference methodology to characterize the dust emissivity at far-infrared and submillimeter frequencies

Authors: Debabrata Adak, Shabbir Shaikh, Srijita Sinha, Tuhin Ghosh, Francois Boulanger, Guilaine Lagache, Tarun Souradeep, Marc-Antoine Miville-Deschênes

Abstract: We present a Bayesian inference method to characterise the dust emission properties using the well-known dust-HI correlation in the diffuse interstellar medium at Planck frequencies $ν\ge 217$ GHz. We use the Galactic HI map from the Galactic All-Sky Survey (GASS) as a template to trace the Galactic dust emission. We jointly infer the pixel-dependent dust emissivity and the zero level present in t… ▽ More We present a Bayesian inference method to characterise the dust emission properties using the well-known dust-HI correlation in the diffuse interstellar medium at Planck frequencies $ν\ge 217$ GHz. We use the Galactic HI map from the Galactic All-Sky Survey (GASS) as a template to trace the Galactic dust emission. We jointly infer the pixel-dependent dust emissivity and the zero level present in the Planck intensity maps. We use the Hamiltonian Monte Carlo technique to sample the high dimensional parameter space ($D \sim 10^3$). We demonstrate that the methodology leads to unbiased recovery of dust emissivity per pixel and the zero level when applied to realistic Planck sky simulations over a 6300 deg$^2$ area around the Southern Galactic pole. As an application on data, we analyse the Planck intensity map at 353 GHz to jointly infer the pixel-dependent dust emissivity at Nside=32 resolution (1.8° pixel size) and the global offset. We find that the spatially varying dust emissivity has a mean of 0.031 $MJysr (10^{20} \mathrm{cm^{-2}})^{-1}$ and $1σ$ standard deviation of 0.007 $MJysr (10^{20} \mathrm{cm^{-2}})^{-1}$. The mean dust emissivity increases monotonically with increasing mean HI column density. We find that the inferred global offset is consistent with the expected level of Cosmic Infrared Background (CIB) monopole added to the Planck data at 353 GHz. This method is useful in studying the line-of-sight variations of dust spectral energy distribution in the multi-phase interstellar medium. △ Less

Submitted 29 May, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

Comments: 19 pages, 14 figures, Accepted for publication in MNRAS

arXiv:2309.15782 [pdf]

Joint-YODNet: A Light-weight Object Detector for UAVs to Achieve Above 100fps

Authors: Vipin Gautam, Shitala Prasad, Sharad Sinha

Abstract: Small object detection via UAV (Unmanned Aerial Vehicle) images captured from drones and radar is a complex task with several formidable challenges. This domain encompasses numerous complexities that impede the accurate detection and localization of small objects. To address these challenges, we propose a novel method called JointYODNet for UAVs to detect small objects, leveraging a joint loss fun… ▽ More Small object detection via UAV (Unmanned Aerial Vehicle) images captured from drones and radar is a complex task with several formidable challenges. This domain encompasses numerous complexities that impede the accurate detection and localization of small objects. To address these challenges, we propose a novel method called JointYODNet for UAVs to detect small objects, leveraging a joint loss function specifically designed for this task. Our method revolves around the development of a joint loss function tailored to enhance the detection performance of small objects. Through extensive experimentation on a diverse dataset of UAV images captured under varying environmental conditions, we evaluated different variations of the loss function and determined the most effective formulation. The results demonstrate that our proposed joint loss function outperforms existing methods in accurately localizing small objects. Specifically, our method achieves a recall of 0.971, and a F1Score of 0.975, surpassing state-of-the-art techniques. Additionally, our method achieves a [email protected](%) of 98.6, indicating its robustness in detecting small objects across varying scales △ Less

Submitted 27 September, 2023; originally announced September 2023.

arXiv:2309.15780 [pdf]

AaP-ReID: Improved Attention-Aware Person Re-identification

Authors: Vipin Gautam, Shitala Prasad, Sharad Sinha

Abstract: Person re-identification (ReID) is a well-known problem in the field of computer vision. The primary objective is to identify a specific individual within a gallery of images. However, this task is challenging due to various factors, such as pose variations, illumination changes, obstructions, and the presence ofconfusing backgrounds. Existing ReID methods often fail to capture discriminative feat… ▽ More Person re-identification (ReID) is a well-known problem in the field of computer vision. The primary objective is to identify a specific individual within a gallery of images. However, this task is challenging due to various factors, such as pose variations, illumination changes, obstructions, and the presence ofconfusing backgrounds. Existing ReID methods often fail to capture discriminative features (e.g., head, shoes, backpacks) and instead capture irrelevant features when the target is occluded. Motivated by the success of part-based and attention-based ReID methods, we improve AlignedReID++ and present AaP-ReID, a more effective method for person ReID that incorporates channel-wise attention into a ResNet-based architecture. Our method incorporates the Channel-Wise Attention Bottleneck (CWAbottleneck) block and can learn discriminating features by dynamically adjusting the importance ofeach channel in the feature maps. We evaluated Aap-ReID on three benchmark datasets: Market-1501, DukeMTMC-reID, and CUHK03. When compared with state-of-the-art person ReID methods, we achieve competitive results with rank-1 accuracies of 95.6% on Market-1501, 90.6% on DukeMTMC-reID, and 82.4% on CUHK03. △ Less

Submitted 27 September, 2023; originally announced September 2023.

Showing 1–50 of 698 results for author: Sinha, S