-
The PARADIGM project I: How early merger histories shape the present-day sizes of Milky-Way-mass galaxies
Authors:
Gandhali D. Joshi,
Andrew Pontzen,
Oscar Agertz,
Martin P. Rey,
Justin Read,
Annalisa Pillepich
Abstract:
The way in which mergers affect galaxy formation depends on both feedback processes, and on the geometry and strength of the mergers themselves. We introduce the PARADIGM project, where we study the response of a simulated Milky-Way-mass galaxy forming in a cosmological setting to differing merger histories, using genetically modified initial conditions. Each initial condition is simulated with the VINTERGATAN and IllustrisTNG codes. While VINTERGATAN has been developed with an emphasis on resolving the cold interstellar medium, IllustrisTNG uses a subgrid two-phase model and consequently scales to large volume simulations, making them ideal to examine complementary views on how merger histories and feedback interact. Our genetic modifications alter the mass ratio of an important $z \approx 2$ merger while maintaining the halo's $z=0$ mass. Whether simulated with VINTERGATAN or IllustrisTNG, smaller mass ratios for this early merger result in larger galaxies at $z=0$, due to a greater build up of a kinematically cold disc. We conclude that such broad trends are robustly reproducible; however, the normalization of the resulting stellar sizes is substantially different in the two codes (ranging between $0.5-1.7\ \rm{kpc}$ for VINTERGATAN but $1.3-7.0\ \rm{kpc}$ for IllustrisTNG). The VINTERGATAN galaxies systematically form stars earlier, leading to a larger bulge component. Despite the difference in size normalization, both simulation suites lie on the observed size-mass relation for their respective morphological types. In light of these results, we discuss the interplay between internal processes and large scale gravitational interactions and gas accretion, and how the two galaxy models converge on similar emergent trends but along different evolutionary pathways.
Submitted 28 June, 2024;
originally announced July 2024.
-
Hybrid Approach to Parallel Stochastic Gradient Descent
Authors:
Aakash Sudhirbhai Vora,
Dhrumil Chetankumar Joshi,
Aksh Kantibhai Patel
Abstract:
Stochastic Gradient Descent is used to train models on large datasets in order to reduce the training time. On top of that, data parallelism is widely used as a method to efficiently train neural networks using multiple worker nodes in parallel. Synchronous and asynchronous approaches to data parallelism are used by most systems to train the model in parallel. However, both of them have their drawbacks. We propose a third approach to data parallelism which is a hybrid between synchronous and asynchronous approaches, using both approaches to train the neural network. When the threshold function is selected appropriately to gradually shift all parameter aggregation from asynchronous to synchronous, we show that in a given time period our hybrid approach outperforms both asynchronous and synchronous approaches.
Submitted 27 June, 2024;
originally announced July 2024.
-
Cellular Automata model for period-$n$ synchronization: A new universality class
Authors:
Divya D. Joshi,
Prashant M. Gade
Abstract:
There are few known universality classes of absorbing phase transitions in one dimension, and most models fall in the well-known directed percolation (DP) class. Synchronization is a transition to an absorbing state, and this transition is often in the DP class. With local coupling, the transition is often to a fixed-point state. Transitions to a periodic synchronized state are seldom observed. Recently, a transition to a synchronized period-3 state that is not in the DP class was observed. We model it using a cellular automaton model with states 1 to $n$. The rules are: a) Each site in state $i$ changes to state $i+1$ for $i<n$ and to 1 if $i=n$. b) After this update, it takes the value of either neighbor unless it is in state 1. With these rules, we observe a transition to synchronization with critical exponents different from those of DP for $n>2$. For $n=2$, a different exponent is observed.
Submitted 20 June, 2024;
originally announced June 2024.
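The two update rules in the abstract above can be sketched as a short simulation step. This is only an illustrative reading, not the authors' code: the abstract does not say how or how often a site adopts a neighbor's value, so the copy probability `p`, the uniform choice between left and right neighbor, the periodic boundary, and the use of post-rule-(a) neighbor values are all assumptions, as are the names `step` and `is_synchronized`.

```python
import random


def step(states, n, p, rng):
    """One synchronous update of a ring of sites with states 1..n."""
    size = len(states)
    # Rule (a): every site advances cyclically, i -> i+1, and n -> 1.
    advanced = [s + 1 if s < n else 1 for s in states]
    # Rule (b): a site that is not in state 1 takes the value of one
    # neighbor. ASSUMPTION: this happens with probability p, and the
    # neighbor (left or right) is chosen uniformly at random.
    updated = list(advanced)
    for i, s in enumerate(advanced):
        if s != 1 and rng.random() < p:
            j = (i + rng.choice((-1, 1))) % size  # periodic boundary (assumed)
            updated[i] = advanced[j]
    return updated


def is_synchronized(states):
    """True when all sites share one state, i.e. the absorbing state."""
    return len(set(states)) == 1
```

Under these assumptions a fully synchronized lattice stays synchronized for any `p`, because copying a neighbor cannot introduce a new value; the lattice then cycles through the $n$ states with period $n$.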
-
Chiplets on Wheels: Review Paper on Holistic Chiplet Solutions for Autonomous Vehicles
Authors:
Swathi Narashiman,
Venkat A,
Divyaratna Joshi,
Deepak Sridhar,
Harish Rajesh,
Sanjay Sattva,
Aniruddha S,
Jayanth B,
Varun Manjunath,
Ragavendiran N
Abstract:
On the advent of the slow death of Moore's law, the silicon industry is moving towards a new era of chiplets. The automotive industry is experiencing a profound transformation towards software-defined vehicles, fueled by the surging demand for automotive compute chips, expected to reach 20-22 billion by 2030. High-performance compute (HPC) chips become instrumental in meeting the soaring demand for computational power. Various strategies, including centralized electrical and electronic architecture and the innovative Chiplet Systems, are under exploration. The latter, breaking down System-on-Chips (SoCs) into functional units, offers unparalleled customization and integration possibilities. The research accentuates the crucial open Chiplet ecosystem, fostering collaboration and enhancing supply chain resilience. In this paper, we address the unique challenges that arise when attempting to leverage chiplet-based architecture to design a holistic silicon solution for the automotive industry. We propose a throughput-oriented micro-architecture for ADAS and infotainment systems alongside a novel methodology to evaluate chiplet architectures. Further, we develop in-house simulation tools leveraging the gem5 framework to simulate latency and throughput. Finally, we perform an extensive design of thermally-aware chiplet placement and develop a micro-fluids-based cooling design.
Submitted 31 May, 2024;
originally announced June 2024.
-
TnT-LLM: Text Mining at Scale with Large Language Models
Authors:
Mengting Wan,
Tara Safavi,
Sujay Kumar Jauhar,
Yujin Kim,
Scott Counts,
Jennifer Neville,
Siddharth Suri,
Chirag Shah,
Ryen W White,
Longqi Yang,
Reid Andersen,
Georg Buscher,
Dhruv Joshi,
Nagu Rangan
Abstract:
Transforming unstructured text into structured and meaningful forms, organized by useful category labels, is a fundamental step in text mining for downstream analysis and application. However, most existing methods for producing label taxonomies and building text-based label classifiers still rely heavily on domain expertise and manual curation, making the process expensive and time-consuming. This is particularly challenging when the label space is under-specified and large-scale data annotations are unavailable. In this paper, we address these challenges with Large Language Models (LLMs), whose prompt-based interface facilitates the induction and use of large-scale pseudo labels. We propose TnT-LLM, a two-phase framework that employs LLMs to automate the process of end-to-end label generation and assignment with minimal human effort for any given use-case. In the first phase, we introduce a zero-shot, multi-stage reasoning approach which enables LLMs to produce and refine a label taxonomy iteratively. In the second phase, LLMs are used as data labelers that yield training samples so that lightweight supervised classifiers can be reliably built, deployed, and served at scale. We apply TnT-LLM to the analysis of user intent and conversational domain for Bing Copilot (formerly Bing Chat), an open-domain chat-based search engine. Extensive experiments using both human and automatic evaluation metrics demonstrate that TnT-LLM generates more accurate and relevant label taxonomies when compared against state-of-the-art baselines, and achieves a favorable balance between accuracy and efficiency for classification at scale. We also share our practical experiences and insights on the challenges and opportunities of using LLMs for large-scale text mining in real-world applications.
Submitted 18 March, 2024;
originally announced March 2024.
-
Direction of slip modulates the perception of slip distance and slip speed
Authors:
Ayesha Tooba Khan,
Deepak Joshi,
Biswarup Mukherjee
Abstract:
Purpose: The purpose of this study was to investigate the psychophysical understanding of the slip stimulus. We emphasized that the perception of slip and its characteristics, such as slip distance and slip speed, depends on the interaction between slip direction, slip distance, and slip speed. Methods: We developed a novel slip induction device to simulate the artificial sense of slip. We conducted a psychophysical experiment on eight healthy subjects. The experiment was designed to evaluate the effect of slip direction on slip perception as well as on the perception of slip distance and slip speed. A series of psychophysical questions were asked at the end of the slip stimulation to record the subjective responses of the participants. The average success rate (%) was used to quantify the subject responses. Results: We demonstrated that the perception of slip is independent of slip direction; however, the perception of slip distance and slip speed is significantly modulated by slip direction. We also observed that a significant interaction exists between slip distance and slip speed in the upward slip direction. It was also observed that the average success rate was significantly different for various combinations of slip distance and slip speed in the upward slip direction. Conclusions: Our study clearly establishes a significant interaction between the slip direction, slip distance, and slip speed for psychophysical understanding of the perception of slip distance and slip speed.
Submitted 8 March, 2024;
originally announced March 2024.
-
HASSOD: Hierarchical Adaptive Self-Supervised Object Detection
Authors:
Shengcao Cao,
Dhiraj Joshi,
Liang-Yan Gui,
Yu-Xiong Wang
Abstract:
The human visual perception system demonstrates exceptional capabilities in learning without explicit supervision and understanding the part-to-whole composition of objects. Drawing inspiration from these two abilities, we propose Hierarchical Adaptive Self-Supervised Object Detection (HASSOD), a novel approach that learns to detect objects and understand their compositions without human supervision. HASSOD employs a hierarchical adaptive clustering strategy to group regions into object masks based on self-supervised visual representations, adaptively determining the number of objects per image. Furthermore, HASSOD identifies the hierarchical levels of objects in terms of composition, by analyzing coverage relations between masks and constructing tree structures. This additional self-supervised learning task leads to improved detection performance and enhanced interpretability. Lastly, we abandon the inefficient multi-round self-training process utilized in prior methods and instead adapt the Mean Teacher framework from semi-supervised learning, which leads to a smoother and more efficient training process. Through extensive experiments on prevalent image datasets, we demonstrate the superiority of HASSOD over existing methods, thereby advancing the state of the art in self-supervised object detection. Notably, we improve Mask AR from 20.2 to 22.5 on LVIS, and from 17.0 to 26.0 on SA-1B. Project page: https://HASSOD-NeurIPS23.github.io.
Submitted 5 February, 2024;
originally announced February 2024.
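The Mean Teacher framework mentioned in the abstract above keeps a teacher model whose weights are an exponential moving average (EMA) of the student's weights. The sketch below shows only that generic EMA update, not HASSOD's training code; the plain dictionary of scalar weights, the function name `ema_update`, and the decay value are illustrative assumptions.

```python
def ema_update(teacher, student, decay=0.999):
    """Return teacher weights moved toward the student by an EMA step.

    teacher, student: dicts mapping a parameter name to a float
    (hypothetical stand-ins for real model tensors).
    decay: fraction of the old teacher weight that is kept.
    """
    return {
        name: decay * teacher[name] + (1.0 - decay) * student[name]
        for name in teacher
    }
```

Because the teacher changes slowly, the targets it produces for the student are smoother than those of the student itself, which is the usual motivation for this design.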
-
Don't Believe Everything You Read: Enhancing Summarization Interpretability through Automatic Identification of Hallucinations in Large Language Models
Authors:
Priyesh Vakharia,
Devavrat Joshi,
Meenal Chavan,
Dhananjay Sonawane,
Bhrigu Garg,
Parsa Mazaheri
Abstract:
Large Language Models (LLMs) are adept at text manipulation -- tasks such as machine translation and text summarization. However, these models can also be prone to hallucination, which can be detrimental to the faithfulness of any answers that the model provides. Recent works in combating hallucinations in LLMs deal with identifying hallucinated sentences and categorizing the different ways in which models hallucinate. This paper takes a deep dive into LLM behavior with respect to hallucinations, defines a token-level approach to identifying different kinds of hallucinations, and further utilizes this token-level tagging to improve the interpretability and faithfulness of LLMs in dialogue summarization tasks. Through this, the paper presents a new, enhanced dataset and a new training paradigm.
Submitted 2 April, 2024; v1 submitted 21 December, 2023;
originally announced December 2023.
-
R2D2: Reducing Redundancy and Duplication in Data Lakes
Authors:
Raunak Shah,
Koyel Mukherjee,
Atharv Tyagi,
Sai Keerthana Karnam,
Dhruv Joshi,
Shivam Bhosale,
Subrata Mitra
Abstract:
Enterprise data lakes often suffer from substantial amounts of duplicate and redundant data, with data volumes ranging from terabytes to petabytes. This leads to both increased storage costs and unnecessarily high maintenance costs for these datasets. In this work, we focus on identifying and reducing redundancy in enterprise data lakes by addressing the problem of 'dataset containment'. To the best of our knowledge, this is one of the first works that addresses table-level containment at a large scale.
We propose R2D2: a three-step hierarchical pipeline that efficiently identifies almost all instances of containment by progressively reducing the search space in the data lake. It first builds (i) a schema containment graph, followed by (ii) statistical min-max pruning, and finally, (iii) content level pruning. We further propose minimizing the total storage and access costs by optimally identifying redundant datasets that can be deleted (and reconstructed on demand) while respecting latency constraints.
We implement our system on Azure Databricks clusters using Apache Spark for enterprise data stored in ADLS Gen2, and on AWS clusters for open-source data. In contrast to existing modified baselines that are inaccurate or take several days to run, our pipeline can process an enterprise customer data lake at the TB scale in approximately 5 hours with high accuracy. We present theoretical results as well as extensive empirical validation on both enterprise (scale of TBs) and open-source datasets (scale of MBs - GBs), which showcase the effectiveness of our pipeline.
Submitted 20 December, 2023;
originally announced December 2023.
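The first two pruning stages named in the abstract above rest on simple necessary conditions for containment: a contained table cannot have a column the containing table lacks, and its per-column value range cannot extend beyond the containing table's range. The sketch below illustrates those two checks only; it is not the R2D2 implementation, and the function names and the dictionary-based column statistics are assumptions made for the example.

```python
def schema_contained(child_columns, parent_columns):
    """Schema check: every child column must also exist in the parent."""
    return set(child_columns) <= set(parent_columns)


def minmax_may_contain(child_stats, parent_stats):
    """Min-max check on per-column (min, max) statistics.

    child_stats, parent_stats: dicts mapping column name -> (min, max).
    Returns False when containment is impossible, True when the pair
    survives pruning and still needs a content-level comparison.
    """
    for column, (child_min, child_max) in child_stats.items():
        if column not in parent_stats:
            return False
        parent_min, parent_max = parent_stats[column]
        # A child value outside the parent's range rules out containment.
        if child_min < parent_min or child_max > parent_max:
            return False
    return True
```

Both checks can only discard candidate pairs, never confirm them, so any pair that passes would still need the content-level stage to decide containment.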
-
Deepfake Detection: Leveraging the Power of 2D and 3D CNN Ensembles
Authors:
Aagam Bakliwal,
Amit D. Joshi
Abstract:
In the dynamic realm of deepfake detection, this work presents an innovative approach to validate video content. The methodology blends advanced 2-dimensional and 3-dimensional Convolutional Neural Networks. The 3D model is uniquely tailored to capture spatiotemporal features via sliding filters, extending through both spatial and temporal dimensions. This configuration enables nuanced pattern recognition in pixel arrangement and temporal evolution across frames. Simultaneously, the 2D model leverages EfficientNet architecture, harnessing auto-scaling in Convolutional Neural Networks. Notably, this ensemble integrates Voting Ensembles and Adaptive Weighted Ensembling. Strategic prioritization of the 3-dimensional model's output capitalizes on its exceptional spatio-temporal feature extraction. Experimental validation underscores the effectiveness of this strategy, showcasing its potential in countering deepfake generation's deceptive practices.
Submitted 25 October, 2023;
originally announced October 2023.
-
Stable Bosonic Topological Edge Modes in the Presence of Many-Body Interactions
Authors:
Niclas Heinsdorf,
Darshan G. Joshi,
Hosho Katsura,
Andreas P. Schnyder
Abstract:
Many magnetic materials are predicted to exhibit bosonic topological edge modes in their excitation spectra, because of the nontrivial topology of their magnon, triplon or other quasi-particle band structures. However, there is a discrepancy between theory prediction and experimental observation, which suggests some underlying mechanism that intrinsically suppresses the expected experimental signatures, like the thermal Hall current. Many-body interactions that are not accounted for in the non-interacting quasi-particle picture are most often identified as the reason for the absence of the topological edge modes. Here we report stable bosonic edge modes at the boundaries of a ladder quantum paramagnet with gapped triplon excitations in the presence of the full many-body interaction. For the first time, we use tensor network methods to resolve topological edge modes in the time-dependent spin-spin correlations and the dynamical structure factor, which is directly accessible experimentally. We further show that these edge modes have anomalously long time coherence, discuss the topological phase diagram of the model, demonstrate the fractionalization of its low-lying excitations, and propose potential material candidates.
Submitted 26 September, 2023;
originally announced September 2023.
-
Unveiling Emotions from EEG: A GRU-Based Approach
Authors:
Sarthak Johari,
Gowri Namratha Meedinti,
Radhakrishnan Delhibabu,
Deepak Joshi
Abstract:
One of the most important study areas in affective computing is emotion identification using EEG data. In this study, the Gated Recurrent Unit (GRU) algorithm, which is a type of Recurrent Neural Networks (RNNs), is tested to see if it can use EEG signals to predict emotional states. Our publicly accessible dataset consists of resting neutral data as well as EEG recordings from people who were exposed to stimuli evoking happy, neutral, and negative emotions. For the best feature extraction, we pre-process the EEG data using artifact removal, bandpass filters, and normalization methods. With 100% accuracy on the validation set, our model produced outstanding results by utilizing the GRU's capacity to capture temporal dependencies. When compared to other machine learning techniques, our GRU model's Extreme Gradient Boosting Classifier had the highest accuracy. Our investigation of the confusion matrix revealed insightful information about the performance of the model, enabling precise emotion classification. This study emphasizes the potential of deep learning models like GRUs for emotion recognition and advances in affective computing. Our findings open up new possibilities for interacting with computers and comprehending how emotions are expressed through brainwave activity.
Submitted 20 July, 2023;
originally announced August 2023.
-
Noise removal methods on ambulatory EEG: A Survey
Authors:
Sarthak Johari,
Gowri Namratha Meedinti,
Radhakrishnan Delhibabu,
Deepak Joshi
Abstract:
Over many decades, research has been attempted on the removal of noise in ambulatory EEG. In this respect, an enormous number of research papers have been published on noise removal, and it is difficult to present a detailed review of all this literature. Therefore, in this paper, an attempt has been made to review the detection and removal of noise. More than 100 research papers have been discussed to discern the techniques for detecting and removing noise in ambulatory EEG. Further, the literature survey shows that the pattern recognition required to detect ambulatory method, eye open and close, varies with different conditions of EEG datasets. This is mainly due to the fact that EEG detected under different conditions has different characteristics. This, in turn, necessitates the identification of a pattern recognition technique to effectively distinguish EEG noise data under the various conditions of EEG data.
Submitted 16 July, 2023;
originally announced August 2023.
-
VINTERGATAN-GM: How do mergers affect the satellite populations of MW-like galaxies?
Authors:
Gandhali D. Joshi,
Andrew Pontzen,
Oscar Agertz,
Martin P. Rey,
Justin Read,
Florent Renaud
Abstract:
We investigate the impact of a galaxy's merger history on its system of satellites using the new \textsc{vintergatan-gm} suite of zoom-in hydrodynamical simulations of Milky Way-mass systems. The suite simulates five realizations of the same halo with targeted `genetic modifications' (GMs) of a $z \approx 2$ merger, but resulting in the same halo mass at $z=0$. We find that differences in the satellite stellar mass functions last for $2.25-4.25$ Gyr after the $z \approx 2$ merger; specifically, the haloes that have undergone smaller mergers host up to 60\% more satellites than those of the larger merger scenarios. However, by $z=0$ these differences in the satellite stellar mass functions have been erased. The differences in satellite numbers seen soon after the mergers are driven by several factors, including the timings of significant mergers (with $M_{\rm 200c}$ mass ratios $>1:30$ and bringing in $M_{\rm *} \geq 10^{8}{\rm M}_{\odot}$ at infall), the masses and satellite populations of the central and merging systems, and the subsequent extended history of smaller mergers. The results persist when measured at fixed central stellar mass rather than fixed time, implying that a host's recent merger history can be a significant source of scatter when reconstructing its dynamical properties from its satellite population.
Submitted 10 January, 2024; v1 submitted 5 July, 2023;
originally announced July 2023.
-
Machine Vision Using Cellphone Camera: A Comparison of deep networks for classifying three challenging denominations of Indian Coins
Authors:
Keyur D. Joshi,
Dhruv Shah,
Varshil Shah,
Nilay Gandhi,
Sanket J. Shah,
Sanket B. Shah
Abstract:
Indian currency coins come in a variety of denominations. Of all the varieties, Rs.1, Rs.2, and Rs.5 have similar diameters. The majority of the coin styles in market circulation for denominations of Rs.1 and Rs.2 coins are nearly the same except for the numerals on their reverse side. If a coin is resting on its obverse side, the correct denomination is not distinguishable by humans. Therefore, it was hypothesized that a digital image of a coin resting on either side could be classified into its correct denomination by training a deep neural network model. The digital images were generated by using cheap cell phone cameras. To find the most suitable deep neural network architecture, four were selected based on the preliminary analysis carried out for comparison. The results confirm that two of the four deep neural network models can classify the correct denomination from either side of a coin with an accuracy of 97%.
Submitted 12 May, 2023;
originally announced June 2023.
-
Stability Analysis of Fractional Difference Equations with Delay
Authors:
Divya D. Joshi,
Sachin Bhalekar,
Prashant M. Gade
Abstract:
Long-term memory is a feature observed in systems ranging from neural networks to epidemiological models. The memory in such systems is usually modeled by a time delay. Furthermore, nonlocal operators, such as the "fractional order difference", can also have a long-time memory. Therefore, fractional difference equations with delay are an appropriate model in a range of systems. Even so, there are not many detailed studies available on the stability analysis of fractional order systems with delay. In this work, we derive the stability conditions for linear fractional difference equations with a delay term $τ$. We give a detailed stability analysis for the cases $τ=1$ and $τ=2$. The results are extended to nonlinear maps.
Submitted 11 May, 2023;
originally announced May 2023.
-
Estimating related words computationally using language model from the Mahabharata -- an Indian epic
Authors:
Vrunda Gadesha,
Keyur D Joshi,
Shefali Naik
Abstract:
'Mahabharata' is the most popular among many Indian pieces of literature and is referred to in many domains for completely different purposes. The text itself has various dimensions and aspects which are useful to human beings in their personal and professional lives. This Indian epic was originally written in the Sanskrit language. Now, in the era of Natural Language Processing, Artificial Intelligence, Machine Learning, and Human-Computer Interaction, this text can be processed according to the domain requirement. It is interesting to process this text and get useful insights from the Mahabharata. The limitation of humans while analyzing the Mahabharata is that they always have a sentiment aspect towards the story narrated by the author. Apart from that, a human cannot memorize statistical or computational details, such as: which two words frequently occur together in one sentence? What is the average length of the sentences across the whole literature? Which word is the most popular word across the text? What are the lemmas of the words used across the sentences? Thus, in this paper, we propose an NLP pipeline to get some statistical and computational insights, along with the most relevant word searching method, from the largest epic 'Mahabharata'. We stacked the different text-processing approaches to articulate the best results, which can be further used in the various domains where the Mahabharata needs to be referred to.
Submitted 9 May, 2023;
originally announced May 2023.
-
Contrastive Mean Teacher for Domain Adaptive Object Detectors
Authors:
Shengcao Cao,
Dhiraj Joshi,
Liang-Yan Gui,
Yu-Xiong Wang
Abstract:
Object detectors often suffer from the domain gap between training (source domain) and real-world applications (target domain). Mean-teacher self-training is a powerful paradigm in unsupervised domain adaptation for object detection, but it struggles with low-quality pseudo-labels. In this work, we identify the intriguing alignment and synergy between mean-teacher self-training and contrastive learning. Motivated by this, we propose Contrastive Mean Teacher (CMT) -- a unified, general-purpose framework with the two paradigms naturally integrated to maximize beneficial learning signals. Instead of using pseudo-labels solely for final predictions, our strategy extracts object-level features using pseudo-labels and optimizes them via contrastive learning, without requiring labels in the target domain. When combined with recent mean-teacher self-training methods, CMT leads to new state-of-the-art target-domain performance: 51.9% mAP on Foggy Cityscapes, outperforming the previously best by 2.1% mAP. Notably, CMT can stabilize performance and provide more significant gains as pseudo-label noise increases.
Submitted 4 May, 2023;
originally announced May 2023.
-
Gradient-based Uncertainty Attribution for Explainable Bayesian Deep Learning
Authors:
Hanjing Wang,
Dhiraj Joshi,
Shiqiang Wang,
Qiang Ji
Abstract:
Predictions made by deep learning models are prone to data perturbations, adversarial attacks, and out-of-distribution inputs. To build a trusted AI system, it is therefore critical to accurately quantify the prediction uncertainties. While current efforts focus on improving uncertainty quantification accuracy and efficiency, there is a need to identify uncertainty sources and take actions to mitigate their effects on predictions. Therefore, we propose to develop explainable and actionable Bayesian deep learning methods to not only perform accurate uncertainty quantification but also explain the uncertainties, identify their sources, and propose strategies to mitigate the uncertainty impacts. Specifically, we introduce a gradient-based uncertainty attribution method to identify the most problematic regions of the input that contribute to the prediction uncertainty. Compared to existing methods, the proposed UA-Backprop has competitive accuracy, relaxed assumptions, and high efficiency. Moreover, we propose an uncertainty mitigation strategy that leverages the attribution results as attention to further improve the model performance. Both qualitative and quantitative evaluations are conducted to demonstrate the effectiveness of our proposed methods.
Submitted 10 April, 2023;
originally announced April 2023.
-
Terahertz probing of anisotropic conductivity and morphology of CuMnAs epitaxial thin films
Authors:
Peter Kubaščík,
Andrej Farkaš,
Kamil Olejník,
Tinkara Troha,
Matěj Hývl,
Filip Krizek,
Deep C. Joshi,
Tomáš Ostatnický,
Jiří Jechumtál,
Eva Schmoranzerová,
Richard P. Campion,
Jakub Zázvorka,
Vít Novák,
Petr Kužel,
Tomáš Jungwirth,
Petr Němec,
Lukáš Nádvorník
Abstract:
Antiferromagnetic CuMnAs thin films have attracted attention since the discovery of the manipulation of their magnetic structure via electrical, optical, and terahertz pulses of electric fields, enabling convenient approaches to switching between magnetoresistive states of the film for information storage. However, the magnetic structure and, thus, the efficiency of the manipulation can be affected by the film morphology and growth defects. In this study, we investigate the properties of CuMnAs thin films by probing the defect-related uniaxial anisotropy of electric conductivity by contact-free terahertz transmission spectroscopy. We show that the terahertz measurements conveniently detect the conductivity anisotropy, in a manner consistent with conventional DC Hall-bar measurements. Moreover, the terahertz technique allows for considerably finer determination of anisotropy axes and is less sensitive to local film degradation. Thanks to the averaging over a large detection area, the THz probing also allows for an analysis of strongly non-uniform thin films. Using scanning near-field terahertz and electron microscopies, we relate the observed anisotropic conductivity of CuMnAs to the elongation and orientation of growth defects, which influence the local microscopic conductivity. We also demonstrate control over the morphology of defects by using vicinal substrates.
Submitted 27 March, 2023;
originally announced March 2023.
-
Controlling Fractional Difference Equations Using Feedback
Authors:
Divya D. Joshi,
Sachin Bhalekar,
Prashant M. Gade
Abstract:
One of the most popular methods of controlling dynamical systems is feedback. It can be used without acquiring detailed knowledge of the underlying system. In this work, we study the stability of fractional-order linear difference equations under feedback. The stability results are derived for an arbitrary feedback time $τ$. We study the cases of $τ=1$ and $τ=2$ in further detail. The extension to the stability of fixed points under feedback for nonlinear fractional-order difference equations with fixed point $x_{*}=0$ is also carried out.
Submitted 13 March, 2023;
originally announced March 2023.
-
FedSpectral+: Spectral Clustering using Federated Learning
Authors:
Janvi Thakkar,
Devvrat Joshi
Abstract:
Clustering in graphs has been a well-known research problem, particularly because most Internet and social network data is in the form of graphs. Organizations widely use spectral clustering algorithms to find clusters in graph datasets. However, applying spectral clustering to a large dataset is challenging due to computational overhead. While distributed spectral clustering algorithms exist, they face the problems of data privacy and increased communication costs between the clients. Thus, in this paper, we propose a spectral clustering algorithm using federated learning (FL) to overcome these issues. FL is a privacy-protecting algorithm that accumulates model parameters from each local learner rather than collecting users' raw data, thus providing both scalability and data privacy. We developed two approaches: FedSpectral and FedSpectral+. FedSpectral is a baseline approach that uses local spectral clustering labels to aggregate the global spectral clustering by creating a similarity graph. FedSpectral+, a state-of-the-art approach, uses the power iteration method to learn the global spectral embedding by incorporating the entire graph data without access to the raw information distributed among the clients. We further designed our own similarity metric to compare the clustering quality of the distributed approach with that of the original/non-FL clustering. The proposed approach, FedSpectral+, obtained similarities of 98.85% and 99.8%, comparable to that of global clustering, on the ego-Facebook and email-Eu-core datasets.
Submitted 4 February, 2023;
originally announced February 2023.
-
HQAlign: Aligning nanopore reads for SV detection using current-level modeling
Authors:
Dhaivat Joshi,
Suhas Diggavi,
Mark J. P. Chaisson,
Sreeram Kannan
Abstract:
Motivation: Detection of structural variants (SV) from the alignment of sample DNA reads to the reference genome is an important problem in understanding human diseases. Long reads that can span repeat regions, along with an accurate alignment of these long reads play an important role in identifying novel SVs. Long read sequencers such as nanopore sequencing can address this problem by providing very long reads but with high error rates, making accurate alignment challenging. Many errors induced by nanopore sequencing have a bias because of the physics of the sequencing process and proper utilization of these error characteristics can play an important role in designing a robust aligner for SV detection problems. In this paper, we design and evaluate HQAlign, an aligner for SV detection using nanopore sequenced reads. The key ideas of HQAlign include (i) using basecalled nanopore reads along with the nanopore physics to improve alignments for SVs (ii) incorporating SV specific changes to the alignment pipeline (iii) adapting these into existing state-of-the-art long read aligner pipeline, minimap2 (v2.24), for efficient alignments.
Results: We show that HQAlign captures about 4%-6% complementary SVs across different datasets which are missed by minimap2 alignments, while having a standalone performance on par with minimap2 for real nanopore read data. For the common SV calls between HQAlign and minimap2, HQAlign improves the start and the end breakpoint accuracy for about 10%-50% of SVs across different datasets. Moreover, HQAlign improves the alignment rate to 89.35% from minimap2's 85.64% for nanopore read alignment to the recent telomere-to-telomere CHM13 assembly, and to 86.65% from 83.48% for nanopore read alignment to the GRCh37 human genome.
Submitted 10 January, 2023;
originally announced January 2023.
-
k-Means SubClustering: A Differentially Private Algorithm with Improved Clustering Quality
Authors:
Devvrat Joshi,
Janvi Thakkar
Abstract:
In today's data-driven world, the sensitivity of information has been a significant concern. With this data and additional information on the person's background, one can easily infer an individual's private data. Many differentially private iterative algorithms have been proposed in interactive settings to protect an individual's privacy from these inference attacks. The existing approaches compute differentially private (DP) centroids using the iterative Lloyd's algorithm and perturb the centroids with various DP mechanisms. These DP mechanisms do not guarantee convergence of differentially private iterative algorithms and degrade the quality of the clusters. Thus, in this work, we further extend the previous work on 'Differentially Private k-Means Clustering With Convergence Guarantee' by taking it as our baseline. The novelty of our approach is to sub-cluster the clusters and then select the centroid which has a higher probability of moving in the direction of the future centroid. At every Lloyd's step, noise is injected into the centroids using the exponential DP mechanism. The results of the experiments indicate that our approach outperforms the current state-of-the-art method, i.e., the baseline algorithm, in terms of clustering quality while maintaining the same differential privacy requirements. The clustering quality improved by factors of 4.13 and 2.83 over the baseline for the Wine and Breast_Cancer datasets, respectively.
Submitted 7 January, 2023;
originally announced January 2023.
-
Development and optimization of low power non-thermal plasma jet operational parameters for treating dyes and emerging contaminants
Authors:
Deepchandra Joshi,
G. Veda Prakash,
Shaikh Ziauddin Ahammad,
Satyananda Kar,
T R Sreekrishnan
Abstract:
Emerging contaminants (ECs) have come out as the latest class of environmental contaminants, which are highly recalcitrant and toxic in nature. Currently, no suitable rectification methods are available against the ECs, resulting in a continuous increase in their concentration. Non-thermal plasma, as an advanced oxidation process, has been emerging as a promising technology for EC treatment. In the present work, a detailed experimental study is carried out to evaluate the efficacy of a non-thermal plasma jet with two dyes, Rhodamine B and Methylene Blue, as model contaminants. The plasma jet provided complete dye decoloration in 30 min with an applied voltage of 6.5 kV. The hydroxyl radical (·OH), having the highest oxidation potential, acts as the main reactive species; besides acting directly on contaminants, it also acts indirectly by being converted into H2O2 and O3. Further, the effect of critical operational parameters, viz., sample pH, applied voltage (4.5-6.5 kV), conductivity (5-20 mS cm$^{-1}$), and sample distance, on plasma treatment efficacy was also examined. Out of all the assessed parameters, the applied voltage and sample conductivity were found to be the most significant operating parameters. A high voltage and low conductivity were found to favor the dye decoloration, while the pH effect was not that significant. To understand the influence of plasma discharge gas on treatment efficacy, all the experiments were conducted with Argon and Helium gases under a fixed geometrical configuration. Both gases provided a similar dye decoloration efficiency. The DBD plasma system, along with complete dye removal, also rendered maximum mineralization of 73% for Rhodamine B and 60% for Methylene Blue. Finally, the system's efficiency against actual ECs (four pharmaceutical compounds, viz., metformin, atenolol, acetaminophen, and ranitidine) and a microbial contaminant (Escherichia coli) was also tested.
Submitted 26 December, 2022;
originally announced December 2022.
-
VINTERGATAN-GM: The cosmological imprints of early mergers on Milky-Way-mass galaxies
Authors:
Martin P. Rey,
Oscar Agertz,
Tjitske K. Starkenburg,
Florent Renaud,
Gandhali D. Joshi,
Andrew Pontzen,
Nicolas F. Martin,
Diane K. Feuillet,
Justin I. Read
Abstract:
We present a new suite of cosmological zoom-in hydrodynamical ($\approx 20\, \mathrm{pc}$ spatial resolution) simulations of Milky-Way mass galaxies to study how a varying mass ratio for a Gaia-Sausage-Enceladus (GSE) progenitor impacts the $z=0$ chemodynamics of halo stars. Using the genetic modification approach, we create five cosmological histories for a Milky-Way-mass dark matter halo ($M_{200} \approx 10^{12} \, M_\mathrm{\odot}$), incrementally increasing the stellar mass ratio of a $z\approx2$ merger from 1:25 to 1:2, while fixing the galaxy's final dynamical, stellar mass and large-scale environment. We find markedly different morphologies at $z=0$ following this change in early history, with a growing merger resulting in increasingly compact and bulge-dominated galaxies. Despite this structural diversity, all galaxies show a radially-biased population of inner halo stars like the Milky-Way's GSE which, surprisingly, has a similar magnitude, age, $\rm [Fe/H]$ and $\rm [α/Fe]$ distribution whether the $z\approx2$ merger is more minor or major. This arises because a smaller ex-situ population at $z\approx2$ is compensated by a larger population formed in an earlier merger-driven starburst whose contribution to the GSE can grow dynamically over time, with both populations strongly overlapping in the $\rm [Fe/H]-\rm [α/Fe]$ plane. Our study demonstrates that multiple high-redshift histories can lead to similar $z=0$ chemodynamical features in the halo, highlighting the need for additional constraints to distinguish them, and the importance of considering the full spectrum of progenitors when interpreting $z=0$ data to reconstruct our Galaxy's past.
Submitted 15 March, 2023; v1 submitted 28 November, 2022;
originally announced November 2022.
-
Indian Commercial Truck License Plate Detection and Recognition for Weighbridge Automation
Authors:
Siddharth Agrawal,
Keyur D. Joshi
Abstract:
Detection and recognition of a license plate is important when automating weighbridge services. While many large databases are available for Latin and Chinese alphanumeric license plates, data for Indian license plates is inadequate. In particular, databases of Indian commercial truck license plates are inadequate, despite the fact that commercial vehicle license plate recognition plays a profound role in logistics management and weighbridge automation. Moreover, models to recognise license plates are not able to generalise effectively to such data due to its challenging nature and the high frequency of handwritten license plates, which leads to diverse font styles. Thus, a database and effective models to detect and recognise such license plates are crucial. This paper provides a database of commercial truck license plates. Using state-of-the-art models for real-time object detection (You Only Look Once Version 7) and scene text recognition (Permuted Autoregressive Sequence Models), our method outperforms the other cited references, where the maximum accuracy obtained was less than 90%, while we have achieved 95.82% accuracy in our algorithm implementation on the presented challenging license plate dataset. Index Terms: Automatic License Plate Recognition, character recognition, license plate detection, vision transformer.
Submitted 22 December, 2022; v1 submitted 23 November, 2022;
originally announced November 2022.
-
Spin density wave, Fermi liquid, and fractionalized phases in a theory of antiferromagnetic metals using paramagnons and bosonic spinons
Authors:
Alexander Nikolaenko,
Jonas von Milczewski,
Darshan G. Joshi,
Subir Sachdev
Abstract:
The pseudogap metal phase of the hole-doped cuprates can be described by small Fermi surfaces of electron-like quasiparticles, which enclose a volume violating the Luttinger relation. This violation requires the existence of additional fractionalized excitations which can be viewed as fractionalized remnants of the paramagnon. We fractionalize the paramagnon into bosonic spinons, and present a gauge theory of bosonic spinons, a Higgs field, and an ancilla layer of fermions coupled to the original electrons. Along with the small Fermi surface metal, this theory displays conventional phases: the Fermi liquid with a low-energy paramagnon mode, and phases with spin density wave order. We follow the evolution of the electronic photoemission spectrum across these quantum phase transitions. We consider both the two-sublattice Néel and incommensurate spin density wave phases.
Submitted 18 November, 2023; v1 submitted 18 November, 2022;
originally announced November 2022.
-
Satellites of Milky Way- and M31-like galaxies with TNG50: quenched fractions, gas content, and star formation histories
Authors:
Christoph Engler,
Annalisa Pillepich,
Gandhali D. Joshi,
Anna Pasquali,
Dylan Nelson,
Eva K. Grebel
Abstract:
We analyse the quenched fractions, gas content, and star formation histories of ~1200 satellite galaxies with $M_* \geq 5 \times 10^6~{\rm M}_\odot$ around 198 Milky Way- (MW) and Andromeda-like (M31) hosts in TNG50, the highest-resolution simulation of IllustrisTNG. Satellite quenched fractions are larger for smaller masses, for smaller distances to their host galaxy, and in the more massive M31-like compared to MW-like hosts. As satellites cross their host's virial radius, their gas content drops: most satellites within 300 kpc lack detectable gas reservoirs at $z=0$, unless they are massive like the Magellanic Clouds and M32. Nevertheless, their stellar assembly exhibits a large degree of diversity. On average, the cumulative star formation histories are more extended for brighter, more massive satellites with a later infall, and for those in less massive hosts. Based on these relationships, we can even infer infall periods for observed MW and M31 dwarfs: e.g. 0-4 Gyr ago for the Magellanic Clouds and Leo I, 4-8 and 0-2 Gyr ago for M32 and IC 10, respectively. Ram pressure stripping (in combination with tidal stripping) deprives TNG50 satellites of their gas reservoirs and ultimately quenches their star formation, even though only a few per cent of the present-day satellites around the 198 TNG50 MW/M31-like hosts appear as jellyfish. The typical time since quenching for currently quenched TNG50 satellites is $6.9^{+2.5}_{-3.3}~{\rm Gyr}$ ago. The TNG50 results are consistent with the quenched fractions and stellar assembly of observed MW and M31 satellites, however, satellites of the SAGA survey with $M_* \sim 10^{8-9}~{\rm M}_\odot$ exhibit lower quenched fractions than TNG50 and other, observed analogues.
Submitted 17 May, 2023; v1 submitted 31 October, 2022;
originally announced November 2022.
-
Merged-GHCIDR: Geometrical Approach to Reduce Image Data
Authors:
Devvrat Joshi,
Janvi Thakkar,
Siddharth Soni,
Shril Mody,
Rohan Patil,
Nipun Batra
Abstract:
The computational resources required to train a model have been increasing since the inception of deep networks. Training neural networks on massive datasets has become a challenging and time-consuming task. So, there arises a need to reduce the dataset without compromising the accuracy. In this paper, we present novel variations of an earlier approach called reduction through homogeneous clustering for reducing dataset size. The proposed methods are based on the idea of partitioning the dataset into homogeneous clusters and selecting images that contribute significantly to the accuracy. We propose two variations upon the baseline algorithm, Reduction through Homogeneous Clustering (RHC), to achieve better accuracy and training time: Geometrical Homogeneous Clustering for Image Data Reduction (GHCIDR) and Merged-GHCIDR. The intuition behind GHCIDR involves selecting data points by cluster weights and geometrical distribution of the training set. Merged-GHCIDR involves merging clusters having the same labels using complete linkage clustering. We used three deep learning models: Fully Connected Networks (FCN), VGG1, and VGG16. We experimented with the two variants on four datasets: MNIST, CIFAR10, Fashion-MNIST, and Tiny-Imagenet. Merged-GHCIDR with the same percentage reduction as RHC showed an increase of 2.8%, 8.9%, 7.6% and 3.5% accuracy on MNIST, Fashion-MNIST, CIFAR10, and Tiny-Imagenet, respectively.
Submitted 6 September, 2022;
originally announced September 2022.
-
Geometrical Homogeneous Clustering for Image Data Reduction
Authors:
Shril Mody,
Janvi Thakkar,
Devvrat Joshi,
Siddharth Soni,
Rohan Patil,
Nipun Batra
Abstract:
In this paper, we present novel variations of an earlier approach called the homogeneous clustering algorithm for reducing dataset size. The intuition behind the approaches proposed in this paper is to partition the dataset into homogeneous clusters and select some images which contribute significantly to the accuracy. Selected images are a proper subset of the training data and thus are human-readable. We propose four variations upon the baseline algorithm, RHC. The intuition behind the first approach, RHCKON, is that the boundary points contribute significantly towards the representation of clusters. It involves selecting the k farthest neighbours and one nearest neighbour of the centroid of each cluster. In the following two approaches (KONCW and CWKC), we introduce the concept of cluster weights. They are based on the fact that larger clusters contribute more than smaller clusters. The final variation is GHCIDR, which selects points based on the geometrical aspect of data distribution. We performed the experiments on two deep learning models: Fully Connected Networks (FCN) and VGG1. We experimented with the four variants on three datasets: MNIST, CIFAR10, and Fashion-MNIST. We found that GHCIDR gave the best accuracy of 99.35%, 81.10%, and 91.66% and a training data reduction of 87.27%, 32.34%, and 76.80% on MNIST, CIFAR10, and Fashion-MNIST, respectively.
Submitted 27 August, 2022;
originally announced August 2022.
-
Study of Low-dimensional Nonlinear Fractional Difference Equations of Complex Order
Authors:
Divya D Joshi,
Prashant M Gade,
Sachin Bhalekar
Abstract:
We study the fractional maps of complex order, $α_0e^{i r π/2}$ for $0<α_0<1$ and $0\le r<1$, in 1 and 2 dimensions. In two dimensions, we study the Hénon and Lozi maps, and in $1d$, we study the logistic, tent, Gauss, circle, and Bernoulli maps. The generalization in $2d$ can be done in two different ways which are not equivalent for fractional order and lead to different bifurcation diagrams. We observed that smooth maps such as the logistic, Gauss, and Hénon maps do not show chaos, while discontinuous maps such as the Lozi, Bernoulli, and circle maps show chaos. The tent map is continuous but not differentiable, and it shows chaos as well. In $2d$, we find that the complex fractional-order maps that show chaos also show multistability. Thus, it can be inferred that smooth maps of complex fractional order tend to show more regular behavior than discontinuous or non-differentiable maps.
Submitted 24 August, 2022;
originally announced August 2022.
-
Superconductivity of non-Fermi liquids described by Sachdev-Ye-Kitaev models
Authors:
Chenyuan Li,
Subir Sachdev,
Darshan G. Joshi
Abstract:
We investigate models of electrons in the Sachdev-Ye-Kitaev class with random and all-to-all electron hopping, electron spin exchange, and Cooper-pair hopping. An attractive on-site interaction between electrons leads to superconductivity at low temperatures. Depending on the relative strengths of the hopping and spin exchange, the normal state at the critical temperature is either a Fermi-liquid or a non-Fermi liquid. We present a large-$M$ (where spin symmetry is enlarged to SU$(M)$) study of the normal state to superconductor phase transition. We describe the transition temperature, the superconducting order parameter, and the electron spectral functions. We contrast between Fermi liquid and non-Fermi liquid normal states: we find that for weaker attractive on-site interaction there is a relative enhancement of $T_c$ when the normal state is a non-Fermi liquid, and correspondingly a strong deviation from BCS limit. Also, the phase transition in this case becomes a first-order transition for strong non-Fermi liquids. On the other hand, for stronger on-site interaction, there is no appreciable difference in $T_c$ between whether the superconductivity emerges from a Fermi liquid or a non-Fermi liquid. Notable features of superconductivity emerging from a non-Fermi liquid are that the superconducting electron spectral function is different from the Fermi-liquid case, with additional peaks at higher energies, and there is no Hebel-Slichter peak in the NMR relaxation rate in the non-Fermi liquid case.
△ Less
Submitted 9 November, 2022; v1 submitted 10 August, 2022;
originally announced August 2022.
-
Topological Optimized Convolutional Visual Recurrent Network for Brain Tumor Segmentation and Classification
Authors:
Dhananjay Joshi,
Kapil Kumar Nagwanshi,
Nitin S. Choubey,
Naveen Singh Rajput
Abstract:
In today's world of health care, brain tumor (BT) detection has become a common occurrence. However, the manual BT classification approach is time-consuming and only available at a few diagnostic centres. So Deep Convolutional Neural Network (DCNN) is introduced in the medical field for making accurate diagnoses and aiding in the patient's treatment before surgery. But these networks have problems such as overfitting and being unable to extract necessary features for classification. To overcome these problems, we developed the TDA-IPH and Convolutional Transfer learning and Visual Recurrent learning with Elephant Herding Optimization hyper-parameter tuning (CTVR-EHO) models for BT segmentation and classification. Initially, the Topological Data Analysis based Improved Persistent Homology (TDA-IPH) is designed to segment the BT image. Then, from the segmented image, features are extracted simultaneously using TL via the AlexNet model and Bidirectional Visual Long Short Term Memory (Bi-VLSTM). Elephant Herding Optimization (EHO) is used to tune the hyper parameters of both networks to get an optimal result. Finally, extracted features are concatenated and classified using the softmax activation layer. The simulation result of this proposed CTVR-EHO and TDA-IPH method is analysed based on some metrics such as precision, accuracy, recall, loss, and F score. When compared to other existing BT segmentation and classification models, the proposed CTVR-EHO and TDA-IPH approaches show high accuracy (99.8%), high recall (99.23%), high precision (99.67%), and high F score (99.59%).
Submitted 6 June, 2022;
originally announced July 2022.
-
SoUthern Cluster sCale Extended Source Survey (SUCCESS): A GMRT and MeerKAT study of nine massive galaxy clusters
Authors:
R. Kale,
V. Parekh,
M. Rahaman,
D. C. Joshi,
T. Venturi,
K. Kolokythas,
J. O. Chibueze,
S. Sikhosana,
D. Pillay,
K. Knowles
Abstract:
We aim to carry out a radio study of the SoUthern Cluster sCale Extended Source Survey (SUCCESS) sample consisting of twenty massive (M$_{500} > 5\times10^{14}$ M$_{\odot}$), nearby (redshift $<0.3$) and southern ($-50^{\circ} < δ< -30^\circ$) galaxy clusters detected by the Planck satellite and the South Pole Telescope. Here we report targeted GMRT observations (325/610 MHz) for a sub-sample of nine clusters. We also use the first data release of MeerKAT Galaxy Cluster Legacy Survey (1283 MHz) for five of these nine clusters. The properties of the mini-halo in RXC J0528.9-3927, a candidate mini-halo in A3322, the radio halo and candidate double relics in A3399, and the radio halo in RXC J0232.2-4420 are presented. We also report detection of candidate radio relics at distances 1 and 1.9 Mpc from the center of RXC J0232.2-4420. The southeast relic of A3399 is consistent with the radio power - mass scaling relation for radio relics, while the candidate relics around RXC J0232.2-4420 are outliers. This indicates an origin of the candidate relics near RXC J0232.2-4420 to be independent of this cluster and a cluster merger-shock origin for the relic in A3399. In this sub-sample of clusters 1/9 hosts a radio halo and double relics, 1/9 hosts a radio halo and 2/9 host mini-halos. The dynamical states based on X-ray morphology show that A3399 is a disturbed cluster; however, the radio halo cluster RXC J0232.2-4420 is relaxed, and the mini-halo clusters have intermediate morphologies, adding to the cases of the less commonly found associations.
Submitted 22 June, 2022;
originally announced June 2022.
-
Emergent $\mathbb{Z}_2$ gauge theories and topological excitations in Rydberg atom arrays
Authors:
Rhine Samajdar,
Darshan G. Joshi,
Yanting Teng,
Subir Sachdev
Abstract:
Strongly interacting arrays of Rydberg atoms provide versatile platforms for exploring exotic many-body phases and dynamics of correlated quantum systems. Motivated by recent experimental advances, we show that the combination of Rydberg interactions and appropriate lattice geometries naturally leads to emergent $\mathbb{Z}_2$ gauge theories endowed with matter fields. Based on this mapping, we describe how Rydberg platforms could realize two distinct classes of topological $\mathbb{Z}_2$ quantum spin liquids, which differ in their patterns of translational symmetry fractionalization. We also discuss the natures of the fractionalized excitations of these $\mathbb{Z}_2$ spin liquid states using both fermionic and bosonic parton theories, and illustrate their rich interplay with proximate solid phases.
Submitted 1 April, 2022;
originally announced April 2022.
-
Critical metallic phase in the overdoped random $t$-$J$ model
Authors:
Maine Christos,
Darshan G. Joshi,
Subir Sachdev,
Maria Tikhanovskaya
Abstract:
We investigate a model of electrons with random and all-to-all hopping and spin exchange interactions, with a constraint of no double occupancy. The model is studied in a Sachdev-Ye-Kitaev-like large-$M$ limit with SU($M$) spin symmetry. The saddle point equations of this model are similar to approximate dynamic mean field equations of realistic, non-random, $t$-$J$ models. We use numerical studies on both real and imaginary frequency axes, along with asymptotic analyses, to establish the existence of a critical non-Fermi-liquid metallic ground state at large doping, with the spin correlation exponent varying with doping. This critical solution possesses a time-reparametrization symmetry, akin to SYK models, which contributes a linear-in-temperature resistivity over the full range of doping where the solution is present. It is therefore an attractive mean-field description of the overdoped region of cuprates, where experiments have observed a linear-$T$ resistivity in a broad region. The critical metal also displays a strong particle-hole asymmetry, which is relevant to Seebeck coefficient measurements. We show that the critical metal has an instability to a low-doping spin-glass phase, and compute a critical doping value. We also describe the properties of this metallic spin-glass phase.
Submitted 2 June, 2022; v1 submitted 30 March, 2022;
originally announced March 2022.
-
Resonant thermal Hall effect of phonons coupled to dynamical defects
Authors:
Haoyu Guo,
Darshan G. Joshi,
Subir Sachdev
Abstract:
We present computations of the thermal Hall coefficient of phonons scattering off a defect with multiple energy levels. Using a microscopic formulation based on the Kubo formula, we find that the leading contribution perturbative in the phonon-defect coupling is proportional to the phonon lifetime, and has a `side-jump' interpretation. Consequently, the thermal Hall angle is independent of the phonon lifetime. The contribution to the thermal Hall coefficient is at resonance when the phonon energy equals a defect level spacing. Our results are obtained for three different defect models, which apply to different correlated electron materials. For the pseudogap regime of the cuprates, we propose a model of phonons coupled to an impurity quantum spin in the presence of quasi-static magnetic order with an isotropic Zeeman coupling to the applied field, and without spin-orbit interaction.
Submitted 31 October, 2022; v1 submitted 27 January, 2022;
originally announced January 2022.
-
Altering Backward Pass Gradients improves Convergence
Authors:
Bishshoy Das,
Milton Mondal,
Brejesh Lall,
Shiv Dutt Joshi,
Sumantra Dutta Roy
Abstract:
In standard neural network training, the gradients in the backward pass are determined by the forward pass. As a result, the two stages are coupled. This is how most neural networks are trained currently. However, gradient modification in the backward pass has seldom been studied in the literature. In this paper we explore decoupled training, where we alter the gradients in the backward pass. We propose a simple yet powerful method called PowerGrad Transform, which alters the gradients before the weight update in the backward pass and significantly enhances the predictive performance of the neural network. PowerGrad Transform trains the network to arrive at a better optimum at convergence. It is computationally extremely efficient, adding virtually no additional cost to either memory or compute, but results in improved final accuracies on both the training and test sets. PowerGrad Transform is easy to integrate into existing training routines, requiring just a few lines of code. PowerGrad Transform accelerates training and makes it possible for the network to better fit the training data. With decoupled training, PowerGrad Transform improves baseline accuracies for ResNet-50 by 0.73%, for SE-ResNet-50 by 0.66% and by more than 1.0% for the non-normalized ResNet-18 network on the ImageNet classification task.
Submitted 20 September, 2022; v1 submitted 24 November, 2021;
originally announced November 2021.
-
Stability and Dynamics of Complex Order Fractional Difference Equations
Authors:
Sachin Bhalekar,
Prashant M. Gade,
Divya Joshi
Abstract:
We extend the definition of $n$-dimensional difference equations to complex order $α\in \mathbb{C} $. We investigate the stability of linear systems defined by an $n$-dimensional matrix $A$ and derive conditions for the stability of equilibrium points for linear systems. For the one-dimensional case where $A =λ\in \mathbb {C}$, we find that the stability region, if any, is enclosed by a boundary curve, and we obtain a parametric equation for this curve. Furthermore, we find that there is no stable region if this parametric curve is self-intersecting. Even for $ λ\in \mathbb{R} $, the solutions can be complex, and the dynamics in one dimension are richer than in the case $ α\in \mathbb{R} $. These results can be extended to $n$ dimensions. For nonlinear systems, we observe that the stability of the linearized system determines the stability of the equilibrium point.
Submitted 24 November, 2021;
originally announced November 2021.
-
Novel Time Domain Based Upper-Limb Prosthesis Control using Incremental Learning Approach
Authors:
Sidharth Pancholi,
Amit M. Joshi,
Deepak Joshi,
Bradly S. Duerstock
Abstract:
The upper limb is vital for a wide range of human activities. The complete or partial loss of the upper limb has a significant impact on the daily activities of amputees. EMG carries important information about the human physique, which helps to decode the various functionalities of the human arm. EMG-signal-based bionics and prostheses have gained huge research attention over the past decade. Conventional EMG-PR based prostheses struggle to give accurate performance because of the off-line training used and their inability to compensate for electrode position shift and change in arm position. This work proposes an online-training and incremental-learning based system for upper limb prosthetic application. This system consists of an ADS1298 as AFE (analog front end) and a 32-bit ARM Cortex-M4 processor for DSP (digital signal processing). The system has been tested on both intact and amputated subjects. Time derivative moment based features have been implemented and utilized for effective pattern classification. Initially, the system was trained for four classes using the on-line training process; later, the number of classes was incremented on user demand up to eleven, and system performance was evaluated. The system yielded a completion rate of 100% for healthy and amputated subjects when four motions were considered. Completion rates of 94.33% and 92% were achieved for healthy subjects and amputees respectively when the number of classes was increased to eleven. The motion efficacy test was also evaluated for all the subjects. The highest efficacy rates of 91.23% and 88.64% were observed for intact and amputated subjects respectively.
Submitted 13 January, 2024; v1 submitted 25 August, 2021;
originally announced September 2021.
-
Auxiliary Heuristics for Frontier Based Planners
Authors:
Arsh Tangri,
Dhruv Joshi,
Ashalatha Nayak
Abstract:
Autonomous exploration of unknown environments is a vital function for robots and has applications in a wide variety of scenarios. Our focus primarily lies in its application for the task of efficient coverage of unknown environments. Various methods have been proposed for this task and frontier based methods are an efficient category in this class of methods. Efficiency is of utmost importance in exploration and heuristics play a critical role in guiding our search. In this work we demonstrate the ability of heuristics that are learnt by imitating clairvoyant oracles. These learnt heuristics can be used to predict the expected future return from selected states without building search trees, which are inefficient and limited by on-board compute. We also propose an additional filter-based heuristic which results in an enhancement in the performance of the frontier-based planner with respect to certain tasks such as coverage planning.
Submitted 26 August, 2021;
originally announced August 2021.
-
A Robust and Accurate Deep Learning based Pattern Recognition Framework for Upper Limb Prosthesis using sEMG
Authors:
Sidharth Pancholi,
Amit M. Joshi,
Deepak Joshi
Abstract:
In EMG based pattern recognition (EMG-PR), deep learning-based techniques have become more prominent for their self-regulating capability to extract discriminant features from large data-sets. Moreover, the performance of traditional machine learning-based methods is limited when categorizing more than a certain number of classes and degrades over time. In this paper, an accurate, robust, and fast convolutional neural network-based framework for EMG pattern identification is presented. To assess the performance of the proposed system, five publicly available benchmark data-sets of upper limb activities were used. These data-sets contain 49 to 52 upper limb motions (NinaPro DB1, NinaPro DB2, and NinaPro DB3), data with force variation, and data with arm position variation for intact and amputated subjects. Classification accuracies of 91.11% (53 classes), 89.45% (49 classes), 81.67% (49 classes of amputees), 95.67% (6 classes with force variation), and 99.11% (8 classes with arm position variation) have been observed during testing and validation. The performance of the proposed system is compared with state-of-the-art techniques in the literature. The findings demonstrate that classification accuracy and time complexity have improved significantly. Keras, TensorFlow's high-level API for constructing deep learning models, was used for signal pre-processing and deep-learning-based algorithms. The suggested method was run on an Intel 3.5GHz Core i7, 7th Gen CPU with 8GB DDR4 RAM.
Submitted 11 June, 2021; v1 submitted 4 June, 2021;
originally announced June 2021.
-
Measurement of two-point coherence functions of electromagnetic optical fields and applications of optical coherence
Authors:
Bhaskar Kanseri,
Deepa Joshi
Abstract:
For stationary light fields, the manifestation of statistical properties such as coherence and polarization is attributed to the same physical phenomenon, i.e. correlations in fluctuations of optical fields. In order to explain various properties associated with electromagnetic optical fields, both coherence and polarization need to be placed on the same footing. This leads to a two-point (space or time) generalization of single-point properties such as the Stokes parameters and the elements of the coherency matrix. This paper reviews the basic aspects concerning vectorial optical fields and the experimental methods developed during the last couple of decades for the measurement of two-point correlation functions of electromagnetic optical fields in the spatial and temporal domains. Studies related to coherence properties of optical fields have led to several important technological applications during the last seven decades, which are also discussed briefly in this review.
Submitted 23 May, 2021;
originally announced May 2021.
-
The Generalized Fourier Transform: A Unified Framework for the Fourier, Laplace, Mellin and $Z$ Transforms
Authors:
Pushpendra Singh,
Anubha Gupta,
Shiv Dutt Joshi
Abstract:
This paper introduces the Generalized Fourier transform (GFT), an extension or generalization of the Fourier transform (FT). The unilateral Laplace transform (LT) is observed to be a special case of the GFT. GFT, as proposed in this work, contributes significantly to the scholarly literature. There are many salient contributions of this work. Firstly, GFT is applicable to a much larger class of signals, some of which cannot be analyzed with FT and LT. For example, we have shown the applicability of GFT to polynomially decaying functions and super exponentials. Secondly, we demonstrate the efficacy of GFT in solving initial value problems (IVPs). Thirdly, the generalization presented for FT is extended to other integral transforms, with examples shown for the wavelet transform and the cosine transform. Likewise, a generalized Gamma function is also presented. One interesting application of GFT is the computation of generalized moments, for the otherwise non-finite moments, of any random variable such as the Cauchy random variable. Fourthly, we introduce the Fourier scale transform (FST), which utilizes GFT with the topological isomorphism of an exponential map. Lastly, we propose the Generalized Discrete-Time Fourier transform (GDTFT). The DTFT and the unilateral $z$-transform are shown to be special cases of the proposed GDTFT. The properties of GFT and GDTFT have also been discussed.
Submitted 12 February, 2021;
originally announced March 2021.
-
Critical anomalous metals near superconductivity in models with random interactions
Authors:
Chenyuan Li,
Darshan G. Joshi,
Subir Sachdev
Abstract:
Anomalous metals are observed in numerous experiments on disordered two-dimensional systems proximate to superconductivity. A characteristic feature of an anomalous metal is that its low temperature conductivity has a weakly temperature dependent value, significantly higher than that of a disordered Fermi liquid. We propose a dynamical mean-field model of an anomalous metal: interacting electrons similar in structure to that of the well-studied universal Hamiltonian of mesoscopic metallic grains, but with independent random interactions between pairs of sites, involving Cooper pair hopping and spin exchange. We find evidence for critical anomalous phases or points between a superconducting phase and a disordered Fermi liquid phase in this model. Our results are obtained by a renormalization group analysis in a weak coupling limit, and a complementary solution at large $M$ when the spin symmetry is generalized to USp($M$). The large $M$ limit describes the anomalous metal by fractionalization of the electron into spinons, holons, and doublons, with these partons forming critical non-Fermi liquids in the Sachdev-Ye-Kitaev class. We compute the low temperature conductivity in the large $M$ limit, and find temperature-independent values moderately enhanced from that in the disordered metal.
Submitted 29 March, 2021; v1 submitted 2 February, 2021;
originally announced February 2021.
-
The cumulative star-formation histories of dwarf galaxies with TNG50. I: Environment-driven diversity and connection to quenching
Authors:
Gandhali D. Joshi,
Annalisa Pillepich,
Dylan Nelson,
Elad Zinger,
Federico Marinacci,
Volker Springel,
Mark Vogelsberger,
Lars Hernquist
Abstract:
We present the cumulative star-formation histories (SFHs) of >15000 dwarf galaxies ($M_{*}=10^{7-10}M_{\odot}$) from the TNG50 run of the IllustrisTNG suite across a vast range of environments. The key factors determining the dwarfs' SFHs are their status as central or satellite and their stellar mass, with centrals and more massive dwarfs assembling their stellar mass at later times on average compared to satellites and lower mass dwarfs. The satellites (in hosts of total mass $M_{200c,\,host}=10^{12-14.3}M_{\odot}$) assembled 90% of their z=0 stellar mass ~$7.0_{-5.5}^{+3.3}$ Gyr ago, while the centrals did so only ~$1.0_{-0.5}^{+4.0}$ Gyr ago. TNG50 predicts a large diversity in SFHs for both centrals and satellites, so that the stacked cumulative SFHs are representative of the TNG50 dwarf populations only in an average sense and individual dwarfs can have significantly different cumulative SFHs. Satellite dwarfs with the highest stellar mass to host mass ratios have the latest stellar mass assembly. Satellites at fixed stellar and host halo mass, found closer to the cluster centre, or accreted at earlier times, show significantly earlier stellar mass assembly. These trends, as well as the shapes of the SFHs themselves, are a manifestation of the varying proportions within a given subsample of quenched vs. star-forming galaxies, which exhibit markedly distinct SFH shapes. We also find a subtle effect whereby satellite dwarfs in the most massive hosts at z=0 have higher SFRs at early times, well before final infall into their z=0 host, compared to a control sample of centrals mass-matched at the time of accretion. This suggests that the large-scale environment can have a mild effect even on future satellites by providing the conditions for enhanced SF at early epochs. Our results are useful theoretical predictions for comparison to future resolved-stellar-population observations.
Submitted 28 January, 2021;
originally announced January 2021.
-
Quantum Internet- Applications, Functionalities, Enabling Technologies, Challenges, and Research Directions
Authors:
Amoldeep Singh,
Kapal Dev,
Harun Siljak,
Hem Dutt Joshi,
Maurizio Magarini
Abstract:
The advanced notebooks, mobile phones, and internet applications that we use in today's world are all entrenched in classical communication bits of zeros and ones. Classical internet has laid its foundation originating from the amalgamation of mathematics and Claude Shannon's theory of information. But today's internet technology is a playground for eavesdroppers. This poses a serious challenge to various applications that rely on classical internet technology. This has motivated researchers to switch to new technologies that are fundamentally more secure. Exploring quantum effects, researchers paved the way to quantum networks that provide security, privacy and a range of capabilities such as quantum computation, communication and metrology. The realization of quantum internet requires quantum communication between various remote nodes through quantum channels guarded by quantum cryptographic protocols. Such networks rely upon quantum bits (qubits) that can simultaneously take the value of zeros and ones. The extraordinary properties of qubits, such as entanglement, teleportation and superposition, give quantum networks an edge over traditional networks in many ways. But at the same time transmitting qubits over long distances is a formidable task, and extensive research is under way on quantum teleportation over such distances, which will become a breakthrough in physically realizing quantum internet in the near future. In this paper, quantum internet functionalities, technologies, applications and open challenges have been extensively surveyed to help readers gain a basic understanding of the infrastructure required for the development of a global quantum internet.
Submitted 1 June, 2021; v1 submitted 12 January, 2021;
originally announced January 2021.
-
An analytical diabolo model for robotic learning and control
Authors:
Felix von Drigalski,
Devwrat Joshi,
Takayuki Murooka,
Kazutoshi Tanaka,
Masashi Hamaya,
Yoshihisa Ijiri
Abstract:
In this paper, we present a diabolo model that can be used for training agents in simulation to play diabolo, as well as running it on a real dual robot arm system. We first derive an analytical model of the diabolo-string system and compare its accuracy using data recorded via motion capture, which we release as a public dataset of skilled play with diabolos of different dynamics. We show that our model outperforms a deep-learning-based predictor, both in terms of precision and physically consistent behavior. Next, we describe a method based on optimal control to generate robot trajectories that produce the desired diabolo trajectory, as well as a system to transform higher-level actions into robot motions. Finally, we test our method on a real robot system by playing the diabolo, and throwing it to and catching it from a human player.
Submitted 17 November, 2020;
originally announced November 2020.
-
Signatures of a spin-1/2 cooperative paramagnet in the diluted triangular lattice of Y$_2$CuTiO$_6$
Authors:
S. Kundu,
Akmal Hossain,
Pranava Keerthi S,
Ranjan Das,
M. Baenitz,
Peter J. Baker,
Jean-Christophe Orain,
D. C. Joshi,
Roland Mathieu,
Priya Mahadevan,
Sumiran Pujari,
Subhro Bhattacharjee,
A. V. Mahajan,
D. D. Sarma
Abstract:
We present a combination of thermodynamic and dynamic experimental signatures of a disorder driven dynamic cooperative paramagnet in a 50% site diluted triangular lattice spin-1/2 system, Y$_2$CuTiO$_6$. Magnetic ordering and spin freezing are absent down to 50 mK, far below the Curie Weiss scale of ~-134 K. We observe scaling collapses of the magnetic field- and temperature-dependent magnetic heat capacity and magnetisation data, respectively, in conformity with expectations from the random singlet physics. Our experiments establish the suppression of any freezing scale, if at all present, by more than three orders of magnitude, opening a plethora of interesting possibilities such as disorder-stabilized long range quantum entangled ground states.
Submitted 10 September, 2020;
originally announced September 2020.