Search | arXiv e-print repository

How to Rent GPUs on a Budget

Authors: Zhouzi Li, Benjamin Berg, Arpan Mukhopadhyay, Mor Harchol-Balter

Abstract: The explosion in Machine Learning (ML) over the past ten years has led to a dramatic increase in demand for GPUs to train ML models. Because it is prohibitively expensive for most users to build and maintain a large GPU cluster, large cloud providers (Microsoft Azure, Amazon AWS, Google Cloud) have seen explosive growth in demand for renting cloud-based GPUs. In this cloud-computing paradigm, a user must specify their demand for GPUs at every moment in time, and will pay for every GPU-hour they use. ML training jobs are known to be parallelizable to different degrees. Given a stream of ML training jobs, a user typically wants to minimize the mean response time across all jobs. Here, the response time of a job denotes the time from when a job arrives until it is complete. Additionally, the user is constrained by some operating budget. Specifically, in this paper the user is constrained to use no more than $b$ GPUs per hour, over a long-run time average. The question is how to minimize mean response time while meeting the budget constraint. Because training jobs receive a diminishing marginal benefit from running on additional GPUs, allocating too many GPUs to a single training job can dramatically increase the overall cost paid by the user. Hence, an optimal rental policy must balance a tradeoff between training cost and mean response time. This paper derives the optimal rental policy for a stream of training jobs where the jobs have different levels of parallelizability (specified by a speedup function) and different job sizes (amounts of inherent work). We make almost no assumptions about the arrival process and about the job size distribution. Our optimal policy specifies how many GPUs to rent at every moment in time and how to allocate these GPUs.

Submitted 21 June, 2024; originally announced June 2024.
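
The tradeoff between training cost and response time described in the abstract above can be illustrated with a small sketch. It assumes a power-law speedup function s(k) = k^p with 0 < p < 1; this functional form, the function names, and the numbers are illustrative assumptions and are not taken from the paper.

```python
def speedup(k: float, p: float = 0.5) -> float:
    """Hypothetical concave speedup: running on k GPUs is k**p times faster."""
    return k ** p


def run_time(size: float, k: float, p: float = 0.5) -> float:
    """Time to finish a job with `size` units of inherent work on k GPUs."""
    return size / speedup(k, p)


def gpu_hours(size: float, k: float, p: float = 0.5) -> float:
    """GPU-hours billed for the job: k GPUs held for the whole run time."""
    return k * run_time(size, k, p)


if __name__ == "__main__":
    # More GPUs shorten the job but raise the bill, because the marginal
    # benefit of each additional GPU diminishes.
    for k in (1, 4, 16):
        print(k, run_time(16.0, k), gpu_hours(16.0, k))
```

Under this assumed speedup, quadrupling the number of GPUs halves the run time but doubles the GPU-hours paid, which is the tension a budget-constrained rental policy has to balance.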

arXiv:2406.09427 [pdf, other]

On Optimal Server Allocation for Moldable Jobs with Concave Speed-Up

Authors: Samira Ghanbarian, Arpan Mukhopadhyay, Ravi R. Mazumdar, Fabrice M. Guillemin

Abstract: A large proportion of jobs submitted to modern computing clusters and data centers are parallelizable and capable of running on a flexible number of computing cores or servers. Although allocating more servers to such a job results in a higher speed-up in the job's execution, it reduces the number of servers available to other jobs, which, in the worst case, can result in an incoming job not finding any available server to run immediately upon arrival. Hence, a key question to address is: how to optimally allocate servers to jobs such that (i) the average execution time across jobs is minimized and (ii) almost all jobs find at least one server immediately upon arrival. To address this question, we consider a system with $n$ servers, where jobs are parallelizable up to $d^{(n)}$ servers and the speed-up function of jobs is concave and increasing. Jobs not finding any available servers upon entry are blocked and lost. We propose a simple server allocation scheme that achieves the minimum average execution time of accepted jobs while ensuring that the blocking probability of jobs vanishes as the system becomes large ($n \to \infty$). This result is established for various traffic conditions as well as for heterogeneous workloads. To prove our result, we employ Stein's method, which also yields non-asymptotic bounds on the blocking probability and the mean execution time. Furthermore, our simulations show that the performance of the scheme is insensitive to the distribution of job execution times.

Submitted 15 April, 2024; originally announced June 2024.

MSC Class: 60J28 (Primary) 60K25; 68M20 (Secondary)

arXiv:2405.13205 [pdf, other]

Multi-Agent Reinforcement Learning with Hierarchical Coordination for Emergency Responder Stationing

Authors: Amutheezan Sivagnanam, Ava Pettet, Hunter Lee, Ayan Mukhopadhyay, Abhishek Dubey, Aron Laszka

Abstract: An emergency responder management (ERM) system dispatches responders, such as ambulances, when it receives requests for medical aid. ERM systems can also proactively reposition responders between predesignated waiting locations to cover any gaps that arise due to the prior dispatch of responders or significant changes in the distribution of anticipated requests. Optimal repositioning is computationally challenging due to the exponential number of ways to allocate responders between locations and the uncertainty in future requests. The state-of-the-art approach in proactive repositioning is a hierarchical approach based on spatial decomposition and online Monte Carlo tree search, which may require minutes of computation for each decision in a domain where seconds can save lives. We address the issue of long decision times by introducing a novel reinforcement learning (RL) approach, based on the same hierarchical decomposition, but replacing online search with learning. To address the computational challenges posed by large, variable-dimensional, and discrete state and action spaces, we propose: (1) actor-critic based agents that incorporate transformers to handle variable-dimensional states and actions, (2) projections to fixed-dimensional observations to handle complex states, and (3) combinatorial techniques to map continuous actions to discrete allocations. We evaluate our approach using real-world data from two U.S. cities, Nashville, TN and Seattle, WA. Our experiments show that compared to the state of the art, our approach reduces computation time per decision by three orders of magnitude, while also slightly reducing average ambulance response time by 5 seconds.

Submitted 8 June, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

arXiv:2404.19203 [pdf]

doi 10.1109/ITherm55368.2023.10177601

Thermal Performance of a Liquid-cooling Assisted Thin Wickless Vapor Chamber

Authors: Arani Mukhopadhyay, Anish Pal, Mohamad Jafari Gukeh, Constantine M. Megaridis

Abstract: The ever-increasing need for power consumption in electronic devices, coupled with the requirement for thinner size, calls for the development of efficient heat spreading components. Vapor chambers (VCs), because of their ability to effectively spread heat over a large area by two-phase heat transfer, seem ideal for such applications. However, creating thin and efficient vapor chambers that work over a wide range of power inputs is a persisting challenge. VCs that use wicks for circulating the phase-changing media suffer from capillary restrictions, dry-out, clogging, and increased size and weight, and can often be costly. Recent developments in wick-free wettability-patterned vapor chambers replace traditional wicks with laser-fabricated wickless components. An experimental setup allows for fast testing and experimental evaluation of water-charged VCs with liquid-assisted cooling. The sealed chamber can maintain vacuum for long durations, and can be used for testing of very thin wick-free VCs. This work extends our previous study by decreasing the overall thickness of the wick-free VC down to 3 mm and evaluates its performance. Furthermore, the impact of wettability patterns on VC performance is investigated by carrying out experiments in both non-patterned and patterned VCs. Experiments are first carried out on a wick-free VC with no wettability patterns, comprising an entirely superhydrophilic evaporator coupled with a hydrophobic condenser. Thereafter, wettability patterns that aid the rapid return of water to the heated site on the evaporator and improve condensation on the condenser of the vapor chamber are implemented. The thermal characteristics show that the patterned VCs outperform the non-patterned VCs under all scenarios. The patterned VCs exhibit low thermal resistance independent of fluid charging ratio, withstanding higher power inputs without thermal dry-outs.

Submitted 29 April, 2024; originally announced April 2024.

Comments: Presented at IEEE ITherm (Intersociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems) 2023. Orlando, FL, US. Corresponding: [email protected]

arXiv:2404.19195 [pdf]

doi 10.1109/ITherm55368.2023.10177653

Evaluation of Thermal Performance of a Wick-free Vapor Chamber in Power Electronics Cooling

Authors: Arani Mukhopadhyay, Anish Pal, Congbo Bao, Mohamad Jafari Gukeh, Sudip K. Mazumder, Constantine M. Megaridis

Abstract: Efficient thermal management in high-power electronics cooling can be achieved using phase-change heat transfer devices, such as vapor chambers. Traditional vapor chambers use wicks to transport condensate for efficient thermal exchange and to prevent "dry-out" of the evaporator. However, wicks in vapor chambers present significant design challenges arising out of large pressure drops across the wicking material, which slow down condensate transport rates and increase the chances for dry-out. Thicker wicks add to overall thermal resistance, while deterring the development of thinner devices by limiting the total thickness of the vapor chamber. Wickless vapor chambers eliminate the use of metal wicks entirely, by incorporating complementary wettability-patterned flat plates on both the evaporator and the condenser side. Such surface modifications enhance fluid transport on the evaporator side, while allowing the chambers to be virtually as thin as imaginable, thereby permitting design of thermally efficient thin electronic cooling devices. While wick-free vapor chambers have been studied and efficient design strategies have been suggested, we delve into real-life applications of wick-free vapor chambers in forced air cooling of high-power electronics. An experimental setup is developed wherein two Si-based MOSFETs in TO-247-3 packaging, having high conduction resistance, are connected in parallel and switched at 100 kHz to emulate high-frequency power electronics operations. A rectangular copper wick-free vapor chamber spreads heat laterally over a surface 13 times larger than the heating area. This chamber is cooled externally by a fan that circulates air at room temperature. The present experimental setup extends our previous work on wick-free vapor chambers, while demonstrating the effectiveness of low-cost air cooling in vapor-chamber enhanced high-power electronics applications.

Submitted 29 April, 2024; originally announced April 2024.

Comments: Presented at IEEE ITherm (Intersociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems) 2023, Orlando FL. Corresponding author: [email protected]

arXiv:2404.16533 [pdf, other]

AMEP: The Active Matter Evaluation Package for Python

Authors: Lukas Hecht, Kay-Robert Dormann, Kai Luca Spanheimer, Mahdieh Ebrahimi, Malte Cordts, Suvendu Mandal, Aritra K. Mukhopadhyay, Benno Liebchen

Abstract: The Active Matter Evaluation Package (AMEP) is a Python library for analyzing simulation data of particle-based and continuum simulations. It provides a powerful and simple interface for handling large data sets and for calculating and visualizing a broad variety of observables that are relevant to active matter systems. Examples range from the mean-square displacement and the structure factor to cluster-size distributions, Binder cumulants, and growth exponents. AMEP is written in pure Python and is based on powerful libraries such as NumPy, SciPy, Matplotlib, and scikit-image. Computationally expensive methods are parallelized and optimized to run efficiently on workstations, laptops, and high-performance computing architectures, and an HDF5-based data format is used in the backend to store and handle simulation data as well as analysis results. AMEP provides the first comprehensive framework for analyzing simulation results of both particle-based and continuum simulations (as well as experimental data) of active matter systems. In particular, AMEP also makes it possible to analyze simulations that combine particle-based and continuum techniques, such as those used to study the motion of bacteria in chemical fields or to model particle motion in a flow field. AMEP is available at https://amepproject.de and can be installed via conda and pip.

Submitted 25 April, 2024; originally announced April 2024.

Comments: See https://github.com/amepproject/amep for the source code and https://amepproject.de/ for the documentation
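
One of the observables named in the abstract above, the mean-square displacement, can be sketched directly in NumPy. This is a minimal illustration and deliberately does not use AMEP's own interface; the function name and the assumed array layout (frames, particles, spatial dimensions) are hypothetical choices made for this example.

```python
import numpy as np


def mean_square_displacement(traj: np.ndarray) -> np.ndarray:
    """MSD relative to the first frame.

    traj is assumed to have shape (n_frames, n_particles, n_dims) and to hold
    unwrapped particle positions. Returns an array of length n_frames.
    """
    disp = traj - traj[0]                      # displacement from frame 0
    squared = (disp ** 2).sum(axis=-1)         # |r(t) - r(0)|^2 per particle
    return squared.mean(axis=-1)               # average over particles


if __name__ == "__main__":
    # Ballistic toy trajectory: every particle moves one unit per frame
    # along each of two axes, so the MSD grows quadratically with time.
    frames = np.arange(4, dtype=float)[:, None, None]
    traj = frames * np.ones((1, 3, 2))
    print(mean_square_displacement(traj))
```

For real simulation output, positions would first need to be unwrapped across periodic boundaries, which this sketch does not attempt.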

arXiv:2404.13151 [pdf, other]

doi 10.1103/PhysRevLett.132.233403

Observation of Momentum Space Josephson Effects

Authors: Annesh Mukhopadhyay, Xi-Wang Luo, Colby Schimelfenig, M. K. H. Ome, Sean Mossman, Chuanwei Zhang, Peter Engels

Abstract: The momentum space Josephson effect describes the supercurrent flow between weakly coupled Bose-Einstein condensates (BECs) at two discrete momentum states. Here, we experimentally observe this exotic phenomenon using a BEC with Raman-induced spin-orbit coupling, where the tunneling between two local band minima is implemented by the momentum kick of an additional optical lattice. A sudden quench of the Raman detuning induces coherent spin-momentum oscillations of the BEC, which is analogous to the a.c. Josephson effect. We observe both plasma and regular Josephson oscillations in different parameter regimes. The experimental results agree well with the theoretical model and numerical simulation, and showcase the important role of nonlinear interactions. We also show that the measurement of the Josephson plasma frequency gives the Bogoliubov zero quasimomentum gap, which determines the mass of the corresponding pseudo-Goldstone mode, a long-sought phenomenon in particle physics. The observation of momentum space Josephson physics offers an exciting platform for quantum simulation and sensing utilizing momentum states as a synthetic degree.

Submitted 19 April, 2024; originally announced April 2024.

Comments: 12 pages, 11 figures

arXiv:2404.09184 [pdf, ps, other]

Some algebras with trivial rings of differential operators

Authors: Alapan Mukhopadhyay, Karen E. Smith

Abstract: Let $k$ be an arbitrary field. We construct examples of regular local $k$-algebras $R$ (of positive dimension) for which the ring of differential operators $D_k(R)$ is trivial in the sense that it contains {\it no} operators of positive order. The examples are excellent in characteristic zero but not in positive characteristic. These rings can be viewed as being non-singular but they are not simple as $D$-modules, laying to rest speculation that $D$-simplicity might characterize a nice class of singularities in general. In prime characteristic, the construction also provides examples of {\it regular} local rings $R$ (with fraction field a function field) whose Frobenius push-forward $F_*^eR$ is {\it indecomposable} as an $R$-module for all $e\in \mathbb N$. Along the way, we investigate hypotheses on a local ring $(R, m)$ under which $D$-simplicity for $R$ is equivalent to $D$-simplicity for its $m$-adic completion, and give examples of rings for which the differential operators do not behave well under completion. We also generalize a characterization of $D$-simplicity due to Jeffries in the $\mathbb N$-graded case: for a Noetherian local $k$-algebra $(R, m, k)$, $D$-simplicity of $R$ is equivalent to surjectivity of the natural map $D_k(R)\to D_k(R, k)$.

Submitted 14 April, 2024; originally announced April 2024.

Comments: Comments welcome

MSC Class: 13A35; 13N10; 13N15; 16S32

arXiv:2404.02149 [pdf, other]

Nambu-Goto equation from three-dimensional gravity

Authors: Avik Banerjee, Ayan Mukhopadhyay, Giuseppe Policastro

Abstract: We demonstrate that the solutions of three-dimensional gravity obtained by gluing two copies of a spacetime across a junction constituted of a tensile string are in one-to-one correspondence with the solutions of the Nambu-Goto equation in the same spacetime up to a finite number of rigid deformations. The non-linear Nambu-Goto equation satisfied by the average of the embedding coordinates of the junction emerges directly from the junction conditions along with the rigid deformations and corrections due to the tension. Therefore, the equivalence principle generalizes non-trivially to the string. Our results are valid both in three-dimensional flat and AdS spacetimes. In the context of AdS$_3$/CFT$_2$ correspondence, our setup could be used to describe a class of interfaces in the conformal field theory featuring relative time reparametrization at the interface which encodes the solution of the Nambu-Goto equation corresponding to the bulk junction.

Submitted 2 April, 2024; originally announced April 2024.

Comments: 27 pages, 1 figure

arXiv:2403.05997 [pdf]

doi 10.1007/978-981-99-6074-3_43

Time-dependent droplet detachment behaviour from wettability-engineered fibers during fog harvesting

Authors: Arijit Saha, Arkadeep Datta, Arani Mukhopadhyay, Amitava Datta, Ranjan Ganguly

Abstract: Water collection from natural and industrial fogs has recently been viewed as a viable freshwater source. An interesting outgrowth of the relevant research has focused on arresting the drift losses (un-evaporated and re-condensed water droplets present in the exhaust plume from industrial cooling towers). Such exploits in fog collection have implemented metal and polyester meshes as fog water collectors (FWC). Fog droplets impinge and deposit on mesh fibers. They coalesce with previously deposited liquid to evolve as larger drops before detaching from the fibers under their own weight, an event largely dependent on the mesh fiber wettability, diameter, and arrangement relative to the fog flow. To better estimate drainage and hence collection from these fibers, the study focuses on droplet detachment from differently wetted, horizontally positioned cylindrical fibers of various diameters, placed orthogonally in the path of an oncoming fog. Droplet detachment volume is found to increase with fiber diameter and fiber surface wettability. Interestingly, in a typical fogging condition, the detachment volume is also found to exhibit a time-dependent behaviour, altering the droplet detachment criteria otherwise predicted from emulation. Our current study sheds light on this unexplored phenomenon.

Submitted 9 March, 2024; originally announced March 2024.

Comments: Presented at the Fluid Mechanics and Fluid Power Conference, India (2022). Corresponding author: [email protected] . Lecture Notes in Mechanical Engineering. Springer, Singapore (2024)

arXiv:2403.04072 [pdf, other]

Forecasting and Mitigating Disruptions in Public Bus Transit Services

Authors: Chaeeun Han, Jose Paolo Talusan, Dan Freudberg, Ayan Mukhopadhyay, Abhishek Dubey, Aron Laszka

Abstract: Public transportation systems often suffer from unexpected fluctuations in demand and disruptions, such as mechanical failures and medical emergencies. These fluctuations and disruptions lead to delays and overcrowding, which are detrimental to the passengers' experience and to the overall performance of the transit service. To proactively mitigate such events, many transit agencies station substitute (reserve) vehicles throughout their service areas, which they can dispatch to augment or replace vehicles on routes that suffer overcrowding or disruption. However, determining the optimal locations where substitute vehicles should be stationed is a challenging problem due to the inherent randomness of disruptions and due to the combinatorial nature of selecting locations across a city. In collaboration with the transit agency of Nashville, TN, we address this problem by introducing data-driven statistical and machine-learning models for forecasting disruptions and an effective randomized local-search algorithm for selecting locations where substitute vehicles are to be stationed. Our research demonstrates promising results in proactive disruption management, offering a practical and easily implementable solution for transit agencies to enhance the reliability of their services. Our results resonate beyond mere operational efficiency: by advancing proactive strategies, our approach fosters more resilient and accessible public transportation, contributing to equitable urban mobility and ultimately benefiting the communities that rely on public transportation the most.

Submitted 6 March, 2024; originally announced March 2024.

arXiv:2403.03339 [pdf, other]

An Online Approach to Solving Public Transit Stationing and Dispatch Problem

Authors: Jose Paolo Talusan, Chaeeun Han, Ayan Mukhopadhyay, Aron Laszka, Dan Freudberg, Abhishek Dubey

Abstract: Public bus transit systems provide critical transportation services for large sections of modern communities. On-time performance and a reliable quality of service are therefore very important. Unfortunately, disruptions caused by overcrowding, vehicular failures, and road accidents often lead to service performance degradation. Though transit agencies keep a limited number of vehicles in reserve and dispatch them to relieve the affected routes during disruptions, the procedure is often ad hoc and has to rely on human experience and intuition to allocate resources (vehicles) to affected trips under uncertainty. In this paper, we describe a principled approach using non-myopic sequential decision procedures to solve the problem and decide (a) whether it is advantageous to anticipate problems and proactively station transit buses near areas with a high likelihood of disruptions and (b) whether and which vehicle to dispatch to a particular problem. Our approach was developed in partnership with the Metropolitan Transportation Authority for a mid-sized city in the USA. It models the system as a semi-Markov decision problem (solved as a Monte-Carlo tree search procedure) and shows that it is possible to obtain an answer to these two coupled decision problems in a way that maximizes the overall reward (number of people served). We sample many possible futures from generative models; each is assigned to a tree and processed using root parallelization. We validate our approach using 3 years of data from our partner agency. Our experiments show that the proposed framework serves 2% more passengers while reducing deadhead miles by 40%.

Submitted 5 March, 2024; originally announced March 2024.

arXiv:2402.02691 [pdf]

ALIVE: A Low-Cost Interactive Vaccine Storage Environment Module ensuring easy portability and remote tracking of operational logistics to the last mile

Authors: Arkadeep Datta, Arani Mukhopadhyay, Amitava Datta, Ranjan Ganguly

Abstract: The COVID-19 pandemic has profoundly reshaped our lives, prompting a search for solutions to its far-reaching effects. Vaccines emerged as a beacon of hope, yet reaching remote areas faces last-mile hurdles and cost issues arising from loss of vaccine potency due to poor temperature regulation of the storage units and unanticipated vaccine wastage en route, a common occurrence in conventional vaccine transportation methods. We introduce ALIVE, a low-cost Interactive Vaccine Storage Environment module. ALIVE provides an off-grid, self-sufficient solution for vaccine storage and transport, enabled by active cooling technology. ALIVE's innovation lies in its integration with the Internet of Things (IoT), allowing real-time monitoring and control. This IoT-enabled Application Programming Interface (API) features a data acquisition and environment parameter control system, managing oversight and decision-making. ALIVE's compact, lightweight design makes it adaptable to various logistical scenarios, while its versatility enables it to maintain both time-invariant and time-dependent thermophysical and spatial parameters. Operationalized through a PID algorithm, ALIVE ensures precise temperature control within the vaccine chamber. Its dynamic features, such as remote actuation and data sharing, demonstrate its adaptability and potential applications. Despite the frugal nature of development, the system promises significant benefits, including reduced vaccine loss and remote monitoring advantages. Collaborations with healthcare partners seek to further enhance ALIVE's readiness and expand its impact. ALIVE revolutionizes vaccine logistics, offering scalable, cost-effective solutions for bridging accessibility gaps in challenging distribution scenarios. Its adaptability positions it for widespread application, from last-mile vaccine delivery to environment-controlled supply chains and beyond.

Submitted 4 February, 2024; originally announced February 2024.

Comments: Presented at the International Conference on Robotics, Control, Automation, and Artificial Intelligence (RCAAI 2023). Corresponding: [email protected]
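
The abstract above states that chamber temperature is regulated through a PID algorithm. A generic discrete PID loop with output clamping can be sketched as follows. Everything here is an assumption made for illustration: the class and function names, the gains, the first-order thermal model of the chamber, and the setpoint are hypothetical and do not describe the ALIVE implementation.

```python
from typing import Optional


class PID:
    """Textbook discrete PID controller with a clamped output."""

    def __init__(self, kp: float, ki: float, kd: float,
                 out_min: float = 0.0, out_max: float = 1.0) -> None:
        self.kp, self.ki, self.kd = kp, ki, kd
        self.out_min, self.out_max = out_min, out_max
        self.integral = 0.0
        self.prev_error: Optional[float] = None

    def update(self, error: float, dt: float) -> float:
        # Derivative term; zero on the first call to avoid a startup kick.
        if self.prev_error is None:
            derivative = 0.0
        else:
            derivative = (error - self.prev_error) / dt
        self.prev_error = error
        candidate = self.integral + error * dt
        raw = self.kp * error + self.ki * candidate + self.kd * derivative
        out = min(max(raw, self.out_min), self.out_max)
        # Simple anti-windup: keep the new integral only when not saturated.
        if raw == out:
            self.integral = candidate
        return out


def simulate_chamber(setpoint: float = 5.0, ambient: float = 30.0,
                     steps: int = 2000, dt: float = 1.0) -> float:
    """Run the loop against a toy thermal model; return the final temperature."""
    pid = PID(kp=0.5, ki=0.02, kd=0.1)
    temp = ambient
    leak, cooling = 0.01, 0.5      # hypothetical heat-leak and cooler strengths
    for _ in range(steps):
        # Positive error means the chamber is too warm, so cool harder.
        power = pid.update(temp - setpoint, dt)
        temp += (leak * (ambient - temp) - cooling * power) * dt
    return temp


if __name__ == "__main__":
    print(simulate_chamber())
```

The integral term is what removes the steady-state offset here: holding the setpoint against the ambient heat leak requires a nonzero cooling power even when the error is zero.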

arXiv:2402.00017 [pdf, other]

Deploying ADVISER: Impact and Lessons from Using Artificial Intelligence for Child Vaccination Uptake in Nigeria

Authors: Opadele Kehinde, Ruth Abdul, Bose Afolabi, Parminder Vir, Corinne Namblard, Ayan Mukhopadhyay, Abiodun Adereni

Abstract: More than 5 million children under five years die from largely preventable or treatable medical conditions every year, with an overwhelmingly large proportion of deaths occurring in underdeveloped countries with low vaccination uptake. One of the United Nations' sustainable development goals (SDG 3) aims to end preventable deaths of newborns and children under five years of age. We focus on Nigeria, where the rate of infant mortality is appalling. In particular, low vaccination uptake in Nigeria is a major driver of more than 2,000 daily deaths of children under the age of five years. In this paper, we describe our collaboration with government partners in Nigeria to deploy ADVISER: AI-Driven Vaccination Intervention Optimiser. The framework, based on an integer linear program that seeks to maximize the cumulative probability of successful vaccination, is the first successful deployment of an AI-enabled toolchain for optimizing the allocation of health interventions in Nigeria. In this paper, we provide a background of the ADVISER framework and present results, lessons, and success stories of deploying ADVISER to more than 13,000 families in the state of Oyo, Nigeria.

Submitted 30 December, 2023; originally announced February 2024.

Comments: Accepted for publication at the AAAI Conference on Artificial Intelligence (AAAI-24)

arXiv:2401.06291 [pdf, other]

Frequency-Time Diffusion with Neural Cellular Automata

Authors: John Kalkhof, Arlene Kühn, Yannik Frisch, Anirban Mukhopadhyay

Abstract: Despite considerable success, large Denoising Diffusion Models (DDMs) with UNet backbone pose practical challenges, particularly on limited hardware and in processing gigapixel images. To address these limitations, we introduce two Neural Cellular Automata (NCA)-based DDMs: Diff-NCA and FourierDiff-NCA. Capitalizing on the local communication capabilities of NCA, Diff-NCA significantly reduces the parameter counts of NCA-based DDMs. Integrating Fourier-based diffusion enables global communication early in the diffusion process. This feature is particularly valuable in synthesizing complex images with important global features, such as the CelebA dataset. We demonstrate that even a 331k parameter Diff-NCA can generate 512x512 pathology slices, while FourierDiff-NCA (1.1m parameters) reaches a three times lower FID score of 43.86, compared to the four times bigger UNet (3.94m parameters) with a score of 128.2. Additionally, FourierDiff-NCA can perform diverse tasks such as super-resolution, out-of-distribution image synthesis, and inpainting without explicit training.

Submitted 13 May, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

arXiv:2401.05284 [pdf]

doi 10.1021/acs.langmuir.4c00075

Droplet morphology-based wettability tuning and design of fog harvesting mesh to minimize mesh-clogging

Authors: Arani Mukhopadhyay, Arkadeep Datta, Partha Sarathi Dutta, Amitava Datta, Ranjan Ganguly

Abstract: Fog harvesting relies on intercepting atmospheric or industrial fog by placing a porous obstacle, e.g., a mesh, and collecting the deposited water. In the face of global water scarcity, such fog harvesting has emerged as a viable alternative source of potable water. Typical fog harvesting meshes suffer from poor collection efficiency due to aerodynamic bypassing of the oncoming fog stream and poor collection of the deposited water from the mesh. One pestering challenge in this context is the frequent clogging up of mesh pores by the deposited fog water, which not only yields low drainage efficiency but also generates high aerodynamic resistance to the oncoming fog stream, thereby negatively impacting the fog collection efficiency. Minimizing the clogging is possible by rendering the mesh fiber superhydrophobic, but that entails other detrimental effects like premature dripping and flow-induced re-entrainment of water droplets into the fog stream from the mesh fiber. Herein, we improvise on the traditional interweaved metal mesh designs by defining critical parameters, viz., mesh pitch, shade coefficient, and fiber wettability, and deduce their optimal values from numerically and experimentally observed morphology of collected fog-water droplets under various operating scenarios. We extend our investigations over a varying range of mesh-wettability, including superhydrophilic and hydrophobic fibers, and go on to find optimal shade coefficients which would theoretically render clog-proof fog harvesting meshes. The aerodynamic, deposition, and overall collection efficiencies are characterized. Hydrophobic meshes with square pores, having fiber diameters smaller than the capillary length scale of water, and an optimal shade coefficient, are found to be the most effective design of such clog-proof meshes.

Submitted 3 April, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

Comments: Arani and Arkadeep contributed equally. Corresponding author: Prof. Ranjan Ganguly (Email: [email protected]). All work carried out in the Advanced Materials Research and Applications (AMRA) Laboratory, India

arXiv:2401.03197 [pdf, other]

Decision Making in Non-Stationary Environments with Policy-Augmented Search

Authors: Ava Pettet, Yunuo Zhang, Baiting Luo, Kyle Wray, Hendrik Baier, Aron Laszka, Abhishek Dubey, Ayan Mukhopadhyay

Abstract: Sequential decision-making under uncertainty is present in many important problems. Two popular approaches for tackling such problems are reinforcement learning and online search (e.g., Monte Carlo tree search). While the former learns a policy by interacting with the environment (typically done before execution), the latter uses a generative model of the environment to sample promising action trajectories at decision time. Decision-making is particularly challenging in non-stationary environments, where the environment in which an agent operates can change over time. Both approaches have shortcomings in such settings -- on the one hand, policies learned before execution become stale when the environment changes and relearning takes both time and computational effort. Online search, on the other hand, can return sub-optimal actions when there are limitations on allowed runtime. In this paper, we introduce \textit{Policy-Augmented Monte Carlo tree search} (PA-MCTS), which combines action-value estimates from an out-of-date policy with an online search using an up-to-date model of the environment. We prove theoretical results showing conditions under which PA-MCTS selects the one-step optimal action and also bound the error accrued while following PA-MCTS as a policy. We compare and contrast our approach with AlphaZero, another hybrid planning approach, and Deep Q Learning on several OpenAI Gym environments. Through extensive experiments, we show that under non-stationary settings with limited time constraints, PA-MCTS outperforms these baselines.

Submitted 20 January, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

Comments: Extended Abstract accepted for presentation at AAMAS 2024

arXiv:2401.01841 [pdf, other]

Act as You Learn: Adaptive Decision-Making in Non-Stationary Markov Decision Processes

Authors: Baiting Luo, Yunuo Zhang, Abhishek Dubey, Ayan Mukhopadhyay

Abstract: A fundamental (and largely open) challenge in sequential decision-making is dealing with non-stationary environments, where exogenous environmental conditions change over time. Such problems are traditionally modeled as non-stationary Markov decision processes (NSMDP). However, existing approaches for decision-making in NSMDPs have two major shortcomings: first, they assume that the updated environmental dynamics at the current time are known (although future dynamics can change); and second, planning is largely pessimistic, i.e., the agent acts ``safely'' to account for the non-stationary evolution of the environment. We argue that both these assumptions are invalid in practice -- updated environmental conditions are rarely known, and as the agent interacts with the environment, it can learn about the updated dynamics and avoid being pessimistic, at least in states whose dynamics it is confident about. We present a heuristic search algorithm called \textit{Adaptive Monte Carlo Tree Search (ADA-MCTS)} that addresses these challenges. We show that the agent can learn the updated dynamics of the environment over time and then act as it learns, i.e., if the agent is in a region of the state space about which it has updated knowledge, it can avoid being pessimistic. To quantify ``updated knowledge,'' we disintegrate the aleatoric and epistemic uncertainty in the agent's updated belief and show how the agent can use these estimates for decision-making. We compare the proposed approach with multiple state-of-the-art approaches in decision-making across multiple well-established open-source problems and empirically show that our approach is faster and highly adaptive without sacrificing safety.

Submitted 21 January, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

Comments: Accepted for publication at the International Conference on Autonomous Agents and MultiAgent Systems (AAMAS), 2024

arXiv:2401.00928 [pdf, other]

OSINT Research Studios: A Flexible Crowdsourcing Framework to Scale Up Open Source Intelligence Investigations

Authors: Anirban Mukhopadhyay, Sukrit Venkatagiri, Kurt Luther

Abstract: Open Source Intelligence (OSINT) investigations, which rely entirely on publicly available data such as social media, play an increasingly important role in solving crimes and holding governments accountable. The growing volume of data and complex nature of tasks, however, means there is a pressing need to scale and speed up OSINT investigations. Expert-led crowdsourcing approaches show promise but tend to either focus on narrow tasks or domains or require resource-intense, long-term relationships between expert investigators and crowds. We address this gap by providing a flexible framework that enables investigators across domains to enlist crowdsourced support for the discovery and verification of OSINT. We use a design-based research (DBR) approach to develop OSINT Research Studios (ORS), a sociotechnical system in which novice crowds are trained to support professional investigators with complex OSINT investigations. Through our qualitative evaluation, we found that ORS facilitates ethical and effective OSINT investigations across multiple domains. We also discuss broader implications of expert-crowd collaboration and opportunities for future work.

Submitted 1 January, 2024; originally announced January 2024.

Comments: To be published in CSCW 2024

arXiv:2312.10497 [pdf, other]

Diffusion Approximations of Speed-Aware Join-the-Shortest-Queue Scheme: Transient and Stationary Analysis

Authors: Sanidhay Bhambay, Burak Büke, Arpan Mukhopadhyay

Abstract: The Join-the-Shortest-Queue (JSQ) load balancing scheme is widely acknowledged for its effectiveness in minimizing the average response time for jobs in systems with identical servers. However, when applied to a heterogeneous server system with servers of different processing speeds, the JSQ scheme exhibits suboptimal performance. Recently, a variation of JSQ called the Speed-Aware-Join-the-Shortest-Queue (SA-JSQ) scheme has been shown to attain fluid limit optimality for systems with heterogeneous servers. In this paper, we examine the SA-JSQ scheme for heterogeneous server systems under the Halfin-Whitt regime. Our analysis begins by establishing that the scaled and centered version of the system state weakly converges to a diffusion process characterized by stochastic integral equations. Furthermore, we prove that the diffusion process is positive recurrent and the sequence of stationary measures for the scaled and centered queue length processes converge to the stationary measure for the limiting diffusion process. To achieve this result, we employ Stein's method with a generator expansion approach.

Submitted 16 December, 2023; originally announced December 2023.

MSC Class: 60K25 (Primary) 60F05; 68M20 (Secondary)

arXiv:2312.08442 [pdf, other]

Learning holographic horizons

Authors: Vishnu Jejjala, Sukrut Mondkar, Ayan Mukhopadhyay, Rishi Raj

Abstract: We apply machine learning to understand fundamental aspects of holographic duality, specifically the entropies obtained from the apparent and event horizon areas. We show that simple features of only the time series of the pressure anisotropy, namely the values and half-widths of the maxima and minima, the times these are attained, and the times of the first zeroes can predict the areas of the apparent and event horizons in the dual bulk geometry at all times with a fixed maximum length (30) of the input vector. Given that simple Vaidya-type metrics constructed just from the apparent and event horizon areas can be used to approximately obtain unequal time correlation functions, we argue that the corresponding entropy functions are the measures of information that need to be extracted from simple one-point functions to reconstruct specific aspects of correlation functions of the dual state with the best possible approximations.

Submitted 3 February, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

Comments: 10+10 pages, 1 Figure; Discussion improved, k-fold cross validation added

arXiv:2312.05124 [pdf, other]

doi 10.1016/j.physa.2024.129613

Repeated quantum game as a stochastic game: Effects of the shadow of the future and entanglement

Authors: Archan Mukhopadhyay, Saikat Sur, Tanay Saha, Shubhadeep Sadhukhan, Sagar Chakraborty

Abstract: We present a systematic investigation of the quantum games, constructed using a novel repeated game protocol, when played repeatedly ad infinitum. We focus on establishing that such repeated games -- by virtue of inherent quantum-mechanical randomness -- can be mapped to the paradigm of stochastic games. Subsequently, using the setup of two-player--two-action games, we explore the pure reactive strategies belonging to the set of reactive strategies, whose support in the quantum games is no longer countably finite but rather non-denumerably infinite. We find that how two pure strategies fare against each other is crucially dependent on the discount factor (the probability of occurrence of every subsequent round) and how much entangled the quantum states of the players are. We contrast the results obtained with the corresponding results in the classical setup and find fundamental differences between them: e.g., when the underlying game is the prisoner's dilemma, in the quantum game setup, always-defect strategy can be beaten by the tit-for-tat strategy for high enough discount factor.

Submitted 8 December, 2023; originally announced December 2023.

Journal ref: Physica A: Statistical Mechanics and its Applications. 2024;637:129613

arXiv:2311.14297 [pdf, other]

Identification of odd-frequency superconducting pairing in Josephson junctions

Authors: Subhajit Pal, Aabir Mukhopadhyay, Sourin Das

Abstract: Choosing the right spin polarization of electron enables its local injection into the helical edge state with a well-defined momentum direction, despite the uncertainty principle, owing to spin-momentum locking. This fact facilitates a direct identification of odd-frequency pairing through parity measurement (under frequency reversal) of the anomalous Green's function in a setup comprising multi-terminal Josephson junction on the helical edge state of a 2D topological insulator.

Submitted 24 November, 2023; originally announced November 2023.

Comments: 9 pages, 2 figures

arXiv:2311.02185 [pdf, ps, other]

doi 10.1007/JHEP04(2024)137

Searching for Minicharged Particles at the Energy Frontier with the MoEDAL-MAPP Experiment at the LHC

Authors: Vasiliki A. Mitsou, Marc de Montigny, Abhinab Mukhopadhyay, Pierre-Philippe A. Ouimet, James Pinfold, Ameir Shaa, Michael Staelens

Abstract: MoEDAL's Apparatus for Penetrating Particles (MAPP) Experiment is designed to expand the search for new physics at the LHC, significantly extending the physics program of the baseline MoEDAL Experiment. The Phase-1 MAPP detector (MAPP-1) is currently undergoing installation at the LHC's UA83 gallery adjacent to the LHCb/MoEDAL region at Interaction Point 8 and will begin data-taking in early 2024. The focus of the MAPP experiment is on the quest for new feebly interacting particles: avatars of new physics with extremely small Standard Model couplings, such as minicharged particles (mCPs). In this study, we present the results of a comprehensive analysis of MAPP-1's sensitivity to mCPs arising in the canonical model involving the kinetic mixing of a massless dark $U(1)$ gauge field with the Standard Model hypercharge gauge field. We focus on several dominant production mechanisms of mCPs at the LHC across the mass–mixing parameter space of interest to MAPP: Drell–Yan pair production, direct decays of heavy quarkonia and light vector mesons, and single Dalitz decays of pseudoscalar mesons. The $95\%$ confidence level background-free sensitivity of MAPP-1 for mCPs produced at the LHC's Run 3 and the HL-LHC through these mechanisms, along with projected constraints on the minicharged strongly interacting dark matter window, are reported. Our results indicate that MAPP-1 exhibits sensitivity to sizable regions of unconstrained parameter space and can probe effective charges as low as $8 \times 10^{-4}\:e$ and $6 \times 10^{-4}\:e$ for Run 3 and the HL-LHC, respectively.

Submitted 3 November, 2023; originally announced November 2023.

Comments: 11 pages, 7 figures

Journal ref: J. High Energ. Phys. 2024, 137 (2024)

arXiv:2311.00548 [pdf, other]

Continual atlas-based segmentation of prostate MRI

Authors: Amin Ranem, Camila González, Daniel Pinto dos Santos, Andreas M. Bucher, Ahmed E. Othman, Anirban Mukhopadhyay

Abstract: Continual learning (CL) methods designed for natural image classification often fail to reach basic quality standards for medical image segmentation. Atlas-based segmentation, a well-established approach in medical imaging, incorporates domain knowledge on the region of interest, leading to semantically coherent predictions. This is especially promising for CL, as it allows us to leverage structural information and strike an optimal balance between model rigidity and plasticity over time. When combined with privacy-preserving prototypes, this process offers the advantages of rehearsal-based CL without compromising patient privacy. We propose Atlas Replay, an atlas-based segmentation approach that uses prototypes to generate high-quality segmentation masks through image registration that maintain consistency even as the training distribution changes. We explore how our proposed method performs compared to state-of-the-art CL methods in terms of knowledge transferability across seven publicly available prostate segmentation datasets. Prostate segmentation plays a vital role in diagnosing prostate cancer; however, it poses challenges due to substantial anatomical variations, benign structural differences in older age groups, and fluctuating acquisition parameters. Our results show that Atlas Replay is both robust and generalizes well to yet-unseen domains while being able to maintain knowledge, unlike end-to-end segmentation methods. Our code base is available under https://github.com/MECLabTUDA/Atlas-Replay.

Submitted 6 November, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

arXiv:2310.16695 [pdf, other]

From Pointwise to Powerhouse: Initialising Neural Networks with Generative Models

Authors: Christian Harder, Moritz Fuchs, Yuri Tolkach, Anirban Mukhopadhyay

Abstract: Traditional initialisation methods, e.g. He and Xavier, have been effective in avoiding the problem of vanishing or exploding gradients in neural networks. However, they only use simple pointwise distributions, which model one-dimensional variables. Moreover, they ignore most information about the architecture and disregard past training experiences. These limitations can be overcome by employing generative models for initialisation. In this paper, we introduce two groups of new initialisation methods. First, we locally initialise weight groups by employing variational autoencoders. Secondly, we globally initialise full weight sets by employing graph hypernetworks. We thoroughly evaluate the impact of the employed generative models on state-of-the-art neural networks in terms of accuracy, convergence speed and ensembling. Our results show that global initialisations result in higher accuracy and faster initial convergence speed. However, the implementation through graph hypernetworks leads to diminished ensemble performance on out of distribution data. To counteract, we propose a modification called noise graph hypernetwork, which encourages diversity in the produced ensemble members. Furthermore, our approach might be able to transfer learned knowledge to different image distributions. Our work provides insights into the potential, the trade-offs and possible modifications of these new initialisation methods.

Submitted 25 October, 2023; originally announced October 2023.

ACM Class: J.3; I.5.1; I.5.4

arXiv:2310.16241 [pdf, other]

Task Grouping for Automated Multi-Task Machine Learning via Task Affinity Prediction

Authors: Afiya Ayman, Ayan Mukhopadhyay, Aron Laszka

Abstract: When a number of similar tasks have to be learned simultaneously, multi-task learning (MTL) models can attain significantly higher accuracy than single-task learning (STL) models. However, the advantage of MTL depends on various factors, such as the similarity of the tasks, the sizes of the datasets, and so on; in fact, some tasks might not benefit from MTL and may even incur a loss of accuracy compared to STL. Hence, the question arises: which tasks should be learned together? Domain experts can attempt to group tasks together following intuition, experience, and best practices, but manual grouping can be labor-intensive and far from optimal. In this paper, we propose a novel automated approach for task grouping. First, we study the affinity of tasks for MTL using four benchmark datasets that have been used extensively in the MTL literature, focusing on neural network-based MTL models. We identify inherent task features and STL characteristics that can help us to predict whether a group of tasks should be learned together using MTL or if they should be learned independently using STL. Building on this predictor, we introduce a randomized search algorithm, which employs the predictor to minimize the number of MTL trainings performed during the search for task groups. We demonstrate on the four benchmark datasets that our predictor-driven search approach can find better task groupings than existing baseline approaches.

Submitted 24 October, 2023; originally announced October 2023.

arXiv:2310.10270 [pdf, other]

$h$-function, Hilbert-Kunz density function and Frobenius-Poincaré function

Authors: Cheng Meng, Alapan Mukhopadhyay

Abstract: Given ideals $I,J$ of a noetherian local ring $(R, \mathfrak m)$ such that $I+J$ is $\mathfrak m$-primary and a finitely generated $R$-module $M$, we associate an invariant of $(M,R,I,J)$ called the $h$-function. Our results on $h$-functions allow extensions of the theories of Frobenius-Poincaré functions and Hilbert-Kunz density functions from the known graded case to the local case, answering a question of V. Trivedi. When $J$ is $\mathfrak m$-primary, we describe the support of the corresponding density function in terms of other invariants of $(R, I,J)$. We show that the support captures the $F$-threshold: $c^J(I)$, under mild assumptions, extending results of V. Trivedi and Watanabe. The $h$-function encodes Hilbert-Samuel, Hilbert-Kunz multiplicity and $F$-threshold of the ideal pair involved. Using this feature of $h$-functions, we provide an equivalent formulation of a conjecture of Huneke, Mustaţă, Takagi, Watanabe; recover a result of Smirnov and Betancourt; prove that a result of Hanes comparing multiplicities is equivalent to an a priori weaker containment condition on ideals. We also point out that a conjecture of Smirnov-Betancourt as stated is false and suggest a correction which we relate to the conjecture of Huneke et al. We develop the theory of $h$-functions in a more general setting which yields a density function for $F$-signature. A key to many results on $h$-functions is a `convexity technique' that we introduce, which in particular proves differentiability of Hilbert-Kunz density functions almost everywhere on $(0,\infty)$, thus contributing to another question of Trivedi.

Submitted 25 April, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

Comments: v2: substantial changes: applications added, results improved

arXiv:2310.09541 [pdf, ps, other]

Poissonian pair correlation for higher dimensional real sequences

Authors: Tanmoy Bera, Mithun Kumar Das, Anirban Mukhopadhyay

Abstract: In this article, we examine the Poissonian pair correlation (PPC) statistic for higher-dimensional real sequences. Specifically, we demonstrate that for $d\geq 3$, for almost all $(α_1,\ldots,α_d) \in \mathbb{R}^d$, the sequence $\big(\{x_nα_1\},\dots,\{x_nα_d\}\big)$ in $[0,1)^d$ has PPC conditionally on the additive energy bound of $(x_n).$ This bound is more relaxed compared to the additive energy bound for one dimension as discussed in [1]. We also establish the metric PPC for $(n^{θ_1},\ldots,n^{θ_d})$ provided that the $θ_i$'s are greater than one. More generally, we derive the PPC for $\big(\{x_n^{(1)}α_1\},\dots,\{x_n^{(d)}α_d\}\big) \in [0,1)^d$ for almost all $(α_1,\ldots,α_d) \in \mathbb{R}^d.$

Submitted 14 October, 2023; originally announced October 2023.

Comments: 21 pages

MSC Class: 11K06; 11J83; 11M06; 11J25; 11J71; 42B05; 11L07

arXiv:2310.00504 [pdf, other]

Exploring SAM Ablations for Enhancing Medical Segmentation in Radiology and Pathology

Authors: Amin Ranem, Niklas Babendererde, Moritz Fuchs, Anirban Mukhopadhyay

Abstract: Medical imaging plays a critical role in the diagnosis and treatment planning of various medical conditions, with radiology and pathology heavily reliant on precise image segmentation. The Segment Anything Model (SAM) has emerged as a promising framework for addressing segmentation challenges across different domains. In this white paper, we delve into SAM, breaking down its fundamental components and uncovering the intricate interactions between them. We also explore the fine-tuning of SAM and assess its profound impact on the accuracy and reliability of segmentation results, focusing on applications in radiology (specifically, brain tumor segmentation) and pathology (specifically, breast cancer segmentation). Through a series of carefully designed experiments, we analyze SAM's potential application in the field of medical imaging. We aim to bridge the gap between advanced segmentation techniques and the demanding requirements of healthcare, shedding light on SAM's transformative capabilities.

Submitted 30 September, 2023; originally announced October 2023.

arXiv:2309.15159 [pdf, other]

Anomalous topology and synthetic flat band in multi-terminal Josephson Junctions

Authors: Aabir Mukhopadhyay, Udit Khanna, Sourin Das

Abstract: Andreev bound states trapped in a multi-terminal Josephson junction (JJ) can be assigned a synthetic band topology owing to their periodic dependence on the Josephson phase bias. We demonstrate that the BdG symmetry adds a twist to this topological character, i.e., gap closing points may or \textit{may not} correspond to a change of Chern number, hence extending the standard paradigm for topological bands. We further show that the topology of Andreev bands depends only on the scattering matrix of the junction and is independent of the topological nature of the superconductors forming the JJ, hence indicating a universal behaviour of multi-terminal JJs. We also show that the chiral junction, supported by a quantum Hall state at the junction region, leads to flat Andreev bands (implying absence of DC Josephson effects) that are devoid of Berry curvature (implying absence of AC Josephson effects). Such electrically inert JJs may be useful for storage of quantum information in future quantum devices.

Submitted 26 September, 2023; originally announced September 2023.

arXiv:2309.15085 [pdf, other]

Statistics of Moduli Space of vector bundles II

Authors: Arijit Dey, Sampa Dey, Anirban Mukhopadhyay

Abstract: Let $X$ be a smooth irreducible projective curve of genus $g \geq 2$ over a finite field $\F_{q}$ of characteristic $p$ with $q$ elements such that the function field $\F_{q}(X)$ is a geometric Galois extension of the rational function field of degree $N.$ Assume $\gcd(n,d)=1$ and let $M_{L}(n,d)$ be the moduli space of rank $n$ stable vector bundles over $X$ with fixed determinant isomorphic to a $\mathbb F_q$-rational line bundle $L$. Suppose $N_q (M_L(n,d))$ denotes the cardinality of the set of $\F_{q}$-rational points of $M_{L}(n,d)$. We give an asymptotic bound of $\log(N_{q}(M_{L}(n,d)) - (n^2-1)(g-1)\log{q})$ for large genus $g,$ depending on $N$. Further, considering this logarithmic difference as a random variable, we prove a central limit theorem over a large family of hyperelliptic curves with uniform probability measure. Further, over the same family of hyperelliptic curves, we study the distribution of $\F_{q}$-rational points over the moduli space of rank $2$ stable vector bundles with trivial determinant $M^{s}_{\mathcal{O}_{H}}(2,0)$ and its Seshadri desingularisation ${\widetilde{N}}$ by choosing an appropriate random variable in each case. We also see that the corresponding random variables have a standard Gaussian distribution as $g$ and $q$ tend to infinity.

Submitted 26 September, 2023; originally announced September 2023.

Comments: 28 pages

MSC Class: 14D20; 11M38

arXiv:2309.10037 [pdf, other]

A stabilizer code model with non-invertible symmetries: Strange fractons, confinement, and non-commutative and non-Abelian fusion rules

Authors: Tanay Kibe, Ayan Mukhopadhyay, Pramod Padmanabhan

Abstract: We introduce a stabilizer code model with a qutrit at every edge on a square lattice and with non-invertible plaquette operators. The degeneracy of the ground state is topological as in the toric code, and it also has the usual deconfined excitations consisting of pairs of electric and magnetic charges. However, there are novel types of confined fractonic excitations composed of a cluster of adjacent faces (defects) with vanishing flux. They manifest confinement, and even larger configurations of these fractons are fully immobile although they acquire emergent internal degrees of freedom. Deconfined excitations change their nature in the presence of these fractonic defects. For instance, a magnetic monopole can exist anywhere on the lattice exterior to a fractonic defect cluster while electric charges acquire restricted mobility. These imply that our model featuring fractons is neither of type I, nor of type II. Furthermore, local operators which are symmetries can annihilate any ground state and also the full sector of states which can decay to a ground state under local perturbations. All these properties can be captured via a novel type of non-commutative and non-Abelian fusion category in which the product is associative but does not commute, and can be expressed as a sum of (operator) equivalence classes which includes that of the zero operator. We introduce many other variants of this model and discuss their relevance in quantum field theory.

Submitted 14 January, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

Comments: 43 pages, 16 figures; v2: more clarifications about the properties of symmetry transformations added; v3: expanded discussion section

arXiv:2309.07785 [pdf, other]

Combinatorial Proof of an Identity of Berkovich and Uncu

Authors: Aritram Dhar, Avi Mukhopadhyay

Abstract: The BG-rank BG($π$) of an integer partition $π$ is defined as $$\text{BG}(π) := i-j$$ where $i$ is the number of odd-indexed odd parts and $j$ is the number of even-indexed odd parts of $π$. In a recent work, Fu and Tang ask for a direct combinatorial proof of the following identity of Berkovich and Uncu $$B_{2N+ν}(k,q)=q^{2k^2-k}\left[\begin{matrix}2N+ν\\N+k\end{matrix}\right]_{q^2}$$ for any integer $k$ and non-negative integer $N$ where $ν\in \{0,1\}$, $B_N(k,q)$ is the generating function for partitions into distinct parts less than or equal to $N$ with BG-rank equal to $k$ and $\left[\begin{matrix}a+b\\b\end{matrix}\right]_q$ is a Gaussian binomial coefficient. In this paper, we provide a combinatorial proof of Berkovich and Uncu's identity along the lines of Vandervelde and Fu and Tang's idea.

Submitted 9 October, 2023; v1 submitted 14 September, 2023; originally announced September 2023.

Comments: 18 pages, 8 figures. Comments are welcome!

MSC Class: 05A15; 05A17; 05A19; 11P81; 11P83; 11P84
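
Illustrative note on the entry above: the BG-rank definition and the Berkovich and Uncu identity quoted in the abstract can be checked numerically for small cases. The sketch below is a brute-force check, not the combinatorial proof from the paper. It assumes the usual convention that parts are listed in non-increasing order and indexed from 1; all function names (`bg_rank`, `lhs_table`, `gaussian_binomial`, `rhs`, `identity_holds`) are hypothetical and chosen only for this example.

```python
from collections import defaultdict
from functools import lru_cache
from itertools import combinations


def bg_rank(parts):
    """BG-rank i - j: odd parts at odd positions minus odd parts at even positions.

    Assumption: positions are 1-based after sorting parts in non-increasing order.
    """
    ordered = sorted(parts, reverse=True)
    i = sum(1 for pos, p in enumerate(ordered, start=1) if p % 2 and pos % 2 == 1)
    j = sum(1 for pos, p in enumerate(ordered, start=1) if p % 2 and pos % 2 == 0)
    return i - j


def lhs_table(m):
    """Map k -> {weight: count} over partitions into distinct parts <= m.

    table[k] holds the coefficients of B_m(k, q), found by enumerating subsets.
    """
    table = defaultdict(lambda: defaultdict(int))
    for size in range(m + 1):
        for parts in combinations(range(1, m + 1), size):
            table[bg_rank(parts)][sum(parts)] += 1
    return table


@lru_cache(maxsize=None)
def gaussian_binomial(a, b):
    """Coefficients of [a choose b]_q as a tuple (index = power of q)."""
    if b < 0 or b > a:
        return (0,)
    if b == 0 or b == a:
        return (1,)
    # q-Pascal rule: [a, b] = [a-1, b-1] + q^b * [a-1, b]
    coeffs = [0] * (b * (a - b) + 1)
    for e, c in enumerate(gaussian_binomial(a - 1, b - 1)):
        coeffs[e] += c
    for e, c in enumerate(gaussian_binomial(a - 1, b)):
        coeffs[e + b] += c
    return tuple(coeffs)


def rhs(m, k):
    """q^(2k^2 - k) * [m choose N+k]_{q^2} with m = 2N + nu, as {exponent: coeff}."""
    n = m // 2
    shift = 2 * k * k - k
    return {shift + 2 * e: c for e, c in enumerate(gaussian_binomial(m, n + k)) if c}


def identity_holds(m):
    """Compare both sides of the identity for every possible BG-rank k."""
    table = lhs_table(m)
    return all(dict(table[k]) == rhs(m, k) for k in range(-m, m + 1))


if __name__ == "__main__":
    for m in range(9):
        print(m, identity_holds(m))
```

The enumeration visits all $2^m$ subsets of $\{1,\ldots,m\}$, so it is only practical for small $m$; it is meant as a sanity check of the stated definitions rather than an efficient method.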

arXiv:2309.02954 [pdf, other]

M3D-NCA: Robust 3D Segmentation with Built-in Quality Control

Authors: John Kalkhof, Anirban Mukhopadhyay

Abstract: Medical image segmentation relies heavily on large-scale deep learning models, such as UNet-based architectures. However, the real-world utility of such models is limited by their high computational requirements, which makes them impractical for resource-constrained environments such as primary care facilities and conflict zones. Furthermore, shifts in the imaging domain can render these models ineffective and even compromise patient safety if such errors go undetected. To address these challenges, we propose M3D-NCA, a novel methodology that leverages Neural Cellular Automata (NCA) segmentation for 3D medical images using n-level patchification. Moreover, we exploit the variance in M3D-NCA to develop a novel quality metric which can automatically detect errors in the segmentation process of NCAs. M3D-NCA outperforms UNet models that are two orders of magnitude larger in hippocampus and prostate segmentation by 2% Dice and can be run on a Raspberry Pi 4 Model B (2GB RAM). This highlights the potential of M3D-NCA as an effective and efficient alternative for medical image segmentation in resource-constrained environments.

Submitted 6 September, 2023; originally announced September 2023.

arXiv:2309.00688 [pdf, other]

Jointly Exploring Client Drift and Catastrophic Forgetting in Dynamic Learning

Authors: Niklas Babendererde, Moritz Fuchs, Camila Gonzalez, Yuri Tolkach, Anirban Mukhopadhyay

Abstract: Federated and Continual Learning have emerged as potential paradigms for the robust and privacy-aware use of Deep Learning in dynamic environments. However, Client Drift and Catastrophic Forgetting are fundamental obstacles to guaranteeing consistent performance. Existing work only addresses these problems separately, which neglects the fact that the root cause behind both forms of performance deterioration is connected. We propose a unified analysis framework for building a controlled test environment for Client Drift -- by perturbing a defined ratio of clients -- and Catastrophic Forgetting -- by shifting all clients with a particular strength. Our framework further leverages this new combined analysis by generating a 3D landscape of the combined performance impact from both. We demonstrate that the performance drop through Client Drift, caused by a certain share of shifted clients, is correlated to the drop from Catastrophic Forgetting resulting from a corresponding shift strength. Correlation tests between both problems for Computer Vision (CelebA) and Medical Imaging (PESO) support this new perspective, with an average Pearson rank correlation coefficient of over 0.94. Our framework's novel ability of combined spatio-temporal shift analysis allows us to investigate how both forms of distribution shift behave in mixed scenarios, opening a new pathway for better generalization. We show that a combination of moderate Client Drift and Catastrophic Forgetting can even improve the performance of the resulting model (causing a "Generalization Bump") compared to when only one of the shifts occurs individually. We apply a simple and commonly used method from Continual Learning in the federated setting and observe this phenomenon to be reoccurring, leveraging the ability of our framework to analyze existing and novel methods for Federated and Continual Learning.

Submitted 1 September, 2023; originally announced September 2023.

arXiv:2309.00060 [pdf, ps, other]

On the Performance of Large Loss Systems with Adaptive Multiserver Jobs

Authors: Samira Ghanbarian, Arpan Mukhopadhyay, Fabrice M. Guillemin, Ravi R. Mazumdar

Abstract: In this paper, we study systems where each job or request can be split into a flexible number of sub-jobs up to a maximum limit. The number of sub-jobs a job is split into depends on the number of available servers found upon its arrival. All sub-jobs of a job are then processed in parallel at different servers leading to a linear speed-up of the job. We refer to such jobs as {\em adaptive multi-server jobs}. We study the problem of optimal assignment of such jobs when each server can process at most one sub-job at any given instant and there is no waiting room in the system. We assume that, upon arrival, a job can only access a randomly sampled subset of $k(n)$ servers from a total of $n$ servers, and the number of sub-jobs is determined based on the number of idle servers within the sampled subset. We analyze the steady-state performance of the system when system load varies according to $λ(n) =1 - βn^{-α}$ for $α\in [0,1)$, and $β\geq 0$. Our interest is to find how large the subset $k(n)$ should be in order to have zero blocking and maximum speed-up in the limit as $n \to \infty$. We first characterize the system's performance when the jobs have access to the full system, i.e., $k(n)=n$. In this setting, we show that the blocking probability approaches zero at the rate $O(1/\sqrt{n})$ and the mean response time of accepted jobs approaches its minimum achievable value at rate $O(1/n)$. We then consider the case where the jobs only have access to a subset of servers, i.e., $k(n) < n$. We show that as long as $k(n)=ω(n^α)$, the same asymptotic performance can be achieved as in the case with full system access. In particular, for $k(n)=Θ(n^α\log n)$, we show that both the blocking probability and the mean response time approach their desired limits at rate $O(n^{-(1-α)/2})$.

Submitted 31 August, 2023; originally announced September 2023.

MSC Class: 60K25; 68M20

arXiv:2308.11690 [pdf]

The case for studying other planetary magnetospheres and atmospheres in Heliophysics

Authors: Ian J. Cohen, Chris Arridge, Abigail Azari, Chris Bard, George Clark, Frank Crary, Shannon Curry, Peter Delamere, Ryan M. Dewey, Gina A. DiBraccio, Chuanfei Dong, Alexander Drozdov, Austin Egert, Rachael Filwett, Jasper Halekas, Alexa Halford, Andréa Hughes, Katherine Garcia-Sage, Matina Gkioulidou, Charlotte Goetz, Cesare Grava, Michael Hirsch, Hans Leo F. Huybrighs, Peter Kollmann, Laurent Lamy, et al. (15 additional authors not shown)

Abstract: Heliophysics is the field that "studies the nature of the Sun, and how it influences the very nature of space - and, in turn, the atmospheres of planetary bodies and the technology that exists there." However, NASA's Heliophysics Division tends to limit study of planetary magnetospheres and atmospheres to only those of Earth. This leaves exploration and understanding of space plasma physics at other worlds to the purview of the Planetary Science and Astrophysics Divisions. This is detrimental to the study of space plasma physics in general since, although some cross-divisional funding opportunities do exist, vital elements of space plasma physics can be best addressed by extending the expertise of Heliophysics scientists to other stellar and planetary magnetospheres. However, the diverse worlds within the solar system provide crucial environmental conditions that are not replicated at Earth but can provide deep insight into fundamental space plasma physics processes. Studying planetary systems with Heliophysics objectives, comprehensive instrumentation, and new grant opportunities for analysis and modeling would enable a novel understanding of fundamental and universal processes of space plasma physics. As such, the Heliophysics community should be prepared to consider, prioritize, and fund dedicated Heliophysics efforts to planetary targets to specifically study space physics and aeronomy objectives.

Submitted 24 August, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

arXiv:2308.02587 [pdf, other]

Synthesising Rare Cataract Surgery Samples with Guided Diffusion Models

Authors: Yannik Frisch, Moritz Fuchs, Antoine Sanner, Felix Anton Ucar, Marius Frenzel, Joana Wasielica-Poslednik, Adrian Gericke, Felix Mathias Wagner, Thomas Dratsch, Anirban Mukhopadhyay

Abstract: Cataract surgery is a frequently performed procedure that demands automation and advanced assistance systems. However, gathering and annotating data for training such systems is resource intensive. The publicly available data also comprises severe imbalances inherent to the surgical process. Motivated by this, we analyse cataract surgery video data for the worst-performing phases of a pre-trained downstream tool classifier. The analysis demonstrates that imbalances deteriorate the classifier's performance on underrepresented cases. To address this challenge, we utilise a conditional generative model based on Denoising Diffusion Implicit Models (DDIM) and Classifier-Free Guidance (CFG). Our model can synthesise diverse, high-quality examples based on complex multi-class multi-label conditions, such as surgical phases and combinations of surgical tools. We affirm that the synthesised samples display tools that the classifier recognises. These samples are hard to differentiate from real images, even for clinical experts with more than five years of experience. Further, our synthetically extended data can improve the data sparsity problem for the downstream task of tool classification. The evaluations demonstrate that the model can generate valuable unseen examples, allowing the tool classifier to improve by up to 10% for rare cases. Overall, our approach can facilitate the development of automated assistance systems for cataract surgery by providing a reliable source of realistic synthetic data, which we make available for everyone.

Submitted 3 August, 2023; originally announced August 2023.

arXiv:2307.10384 [pdf, other]

doi 10.1140/epjc/s10052-024-12915-2

How Gubser flow ends in a holographic conformal theory

Authors: Avik Banerjee, Toshali Mitra, Ayan Mukhopadhyay, Alexander Soloviev

Abstract: Gubser flow is an axis-symmetric and boost-invariant evolution in a relativistic quantum field theory which is best studied by mapping $\mathbf{R}^{3,1}$ to $dS_{3}\times \mathbf{R}$ when the field theory has conformal symmetry. We show that at late de-Sitter time, which corresponds to large proper time and central region of the future wedge within $\mathbf{R}^{3,1}$, the holographic conformal field theory plasma can reach a state in which $\varepsilon = P_T = - P_L$, with $\varepsilon$, $P_T$ and $P_L$ being the energy density, transverse and longitudinal pressures, respectively. We further determine the full sub-leading behaviour of the energy-momentum tensor at late time. Restricting to flows in which the energy density decays at large transverse distance from the central axis in $\mathbf{R}^{3,1}$, we show that this decay should be faster than any power law. Furthermore, in this case the energy density also vanishes in $\mathbf{R}^{3,1}$ faster than any power as we go back to early proper time. Hydrodynamic behavior can appear in intermediate time.

Submitted 16 May, 2024; v1 submitted 19 July, 2023; originally announced July 2023.

Comments: 25 pages, 2 figures; version accepted in EPJ-C, Section 5 and concluding sections revised

Journal ref: Eur. Phys. J. C 84, 550 (2024)

arXiv:2307.04799 [pdf, other]

doi 10.1007/JHEP10(2023)096

Black hole complementarity from microstate models: A study of information replication and the encoding in the black hole interior

Authors: Tanay Kibe, Sukrut Mondkar, Ayan Mukhopadhyay, Hareram Swain

Abstract: We study how the black hole complementarity principle can emerge from quantum gravitational dynamics within a local semiclassical approximation. Further developing and then simplifying a microstate model based on the fragmentation instability of a near-extremal black hole, we find that the key to the replication (but not cloning) of infalling information is the decoupling of various degrees of freedom. The infalling matter decouples from the interior retaining a residual time-dependent quantum state in the hair which encodes the initial state of the matter non-isometrically. The non-linear ringdown of the interior after energy absorption and decoupling also encodes the initial state, and transfers the information to Hawking radiation. During the Hawking evaporation process, the fragmented throats decouple from each other and the hair decouples from the throats. We find that the hair mirrors infalling information after the decoupling time which scales with the logarithm of the entropy (at the time of infall) when the average mass per fragmented throat (a proxy for the temperature) is held fixed. The decoding protocol for the mirrored information does not require knowledge of the interior, and only limited information from the Hawking radiation, as can be argued to be necessitated by the complementarity principle. We discuss the scope of the model to illuminate various aspects of information processing in a black hole.

Submitted 2 October, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

Comments: 44 pages, 21 figures; v2: expanded conclusion and discussion section

Journal ref: J. High Energ. Phys. 2023, 96 (2023)

arXiv:2306.17601 [pdf, other]

Next-to-leading power corrections to the event shape variables

Authors: Neelima Agarwal, Melissa van Beekveld, Eric Laenen, Shubham Mishra, Ayan Mukhopadhyay, Anurag Tripathi

Abstract: We investigate the origin of next-to-leading power corrections to the event shapes thrust and $c$-parameter, at next-to-leading order. For both event shapes we trace the origin of such terms in the exact calculation, and compare with a recent approach involving the eikonal approximation and momentum shifts that follow from the Low-Burnett-Kroll-Del Duca theorem. We assess the differences both analytically and numerically. For the $c$-parameter both exact and approximate results are expressed in terms of elliptic integrals, but near the elastic limit it exhibits patterns similar to the thrust results.

Submitted 30 June, 2023; originally announced June 2023.

Comments: 23 pages, 8 figures and 1 table

arXiv:2306.10068 [pdf, ps, other]

Artificial Intelligence for Emergency Response

Authors: Ayan Mukhopadhyay

Abstract: Emergency response management (ERM) is a challenge faced by communities across the globe. First responders must respond to various incidents, such as fires, traffic accidents, and medical emergencies. They must respond quickly to incidents to minimize the risk to human life. Consequently, considerable attention has been devoted to studying emergency incidents and response in the last several decades. In particular, data-driven models help reduce human and financial loss and improve design codes, traffic regulations, and safety measures. This tutorial paper explores four sub-problems within emergency response: incident prediction, incident detection, resource allocation, and resource dispatch. We aim to present mathematical formulations for these problems and broad frameworks for each problem. We also share open-source (synthetic) data from a large metropolitan area in the USA for future work on data-driven emergency response.

Submitted 15 June, 2023; originally announced June 2023.

Comments: This is a pre-print for a book chapter to appear in Vorobeychik, Yevgeniy., and Mukhopadhyay, Ayan., (Eds.). (2023). \textit{Artificial Intelligence and Society}. ACM Press

arXiv:2306.06890 [pdf, ps, other]

On the irreducibility of extended Laguerre Polynomials

Authors: Anuj Jakhar, Srinivas Kotyada, Arunabha Mukhopadhyay

Abstract: Let $m\geq 1$ and $a_m$ be integers. Let $α$ be a rational number which is not a negative integer such that $α= \frac{u}{v}$ with $\gcd(u,v) = 1, v>0$. Let $φ(x)$ belonging to $\Z[x]$ be a monic polynomial which is irreducible modulo all the primes less than or equal to $vm+u$. Let $a_i(x)$ with $0\leq i\leq m-1$ belonging to $\Z[x]$ be polynomials having degree less than $\degφ(x)$. Assume that the content of $(a_ma_0(x))$ is not divisible by any prime less than or equal to $vm+u$. In this paper, we prove that the polynomials $L_{m,α}^φ(x) = \frac{1}{m!}(a_mφ(x)^m+\sum\limits_{j=0}^{m-1}b_ja_j(x)φ(x)^j)$ are irreducible over the rationals for all but finitely many $m$, where $b_j = \binom{m}{j}(m+α)(m-1+α)\cdots (j+1+α)~~~\mbox{ for }0\leq j\leq m-1$. Further, we show that $L_{m,α}^φ(x)$ is irreducible over rationals for each $α\in \{0, 1, 2, 3, 4\}$ unless $(m, α) \in \{ (1,0), (2,2), (4,4),(6,4)\}.$ For proving our results, we use the notion of $φ$-Newton polygon and some results from analytic number theory. We illustrate our results through examples.

Submitted 12 June, 2023; originally announced June 2023.

Comments: arXiv admin note: text overlap with arXiv:2306.01767, arXiv:2306.03294

arXiv:2305.12357 [pdf, other]

doi 10.1145/3563657.3595997

CoSINT: Designing a Collaborative Capture the Flag Competition to Investigate Misinformation

Authors: Sukrit Venkatagiri, Anirban Mukhopadhyay, David Hicks, Aaron Brantly, Kurt Luther

Abstract: Crowdsourced investigations shore up democratic institutions by debunking misinformation and uncovering human rights abuses. However, current crowdsourcing approaches rely on simplistic collaborative or competitive models and lack technological support, limiting their collective impact. Prior research has shown that blending elements of competition and collaboration can lead to greater performance and creativity, but crowdsourced investigations pose unique analytical and ethical challenges. In this paper, we employed a four-month-long Research through Design process to design and evaluate a novel interaction style called collaborative capture the flag competitions (CoCTFs). We instantiated this interaction style through CoSINT, a platform that enables a trained crowd to work with professional investigators to identify and investigate social media misinformation. Our mixed-methods evaluation showed that CoSINT leverages the complementary strengths of competition and collaboration, allowing a crowd to quickly identify and debunk misinformation. We also highlight tensions between competition versus collaboration and discuss implications for the design of crowdsourced investigations.

Submitted 21 May, 2023; originally announced May 2023.

Comments: To appear in ACM Designing Interactive Systems 2023 (DIS 2023). To cite this paper please use the official citation available here: https://doi.org/10.1145/3563657.3595997

Journal ref: Designing Interactive Systems Conference 2023

arXiv:2305.08438 [pdf, other]

Active adaptolates: motility-induced percolating structures with an adaptive packing geometry

Authors: Aritra K. Mukhopadhyay, Peter Schmelcher, Benno Liebchen

Abstract: It is well known that periodic potentials can be used to induce freezing and melting in colloids. Here, we transfer this concept to active systems and find the emergence of a so-far unknown active matter phase in between the frozen solid-like phase and the molten phase. This phase of "active adaptolates" adopts the geometry of the underlying lattice like the frozen phase, maintains ballistic dynamics like the molten phase, and percolates. In particular, this finding creates a route to use external fields for designing the intrinsic structure of active systems without qualitatively affecting their dynamics.

Submitted 2 February, 2024; v1 submitted 15 May, 2023; originally announced May 2023.

arXiv:2304.12381 [pdf, other]

Recognizing and generating unswitchable graphs

Authors: Asish Mukhopadhyay, Daniel John, Srivatsan Vasudevan

Abstract: In this paper, we show that unswitchable graphs are a proper subclass of split graphs, and exploit this fact to propose efficient algorithms for their recognition and generation.

Submitted 12 April, 2023; originally announced April 2023.

Comments: 13 pages, 14 figures

arXiv:2303.18085 [pdf, ps, other]

High Frobenius pushforwards generate the bounded derived category

Authors: Matthew R. Ballard, Srikanth B. Iyengar, Pat Lank, Alapan Mukhopadhyay, Josh Pollitz

Abstract: This work concerns generators for the bounded derived category of coherent sheaves over a noetherian scheme $X$ of prime characteristic. The main result is that when the Frobenius map on $X$ is finite, for any compact generator $G$ of $\mathsf{D}(X)$ the Frobenius pushforward $F ^e_*G$ generates the bounded derived category whenever $p^e$ is larger than the codepth of $X$, an invariant that is a measure of the singularity of $X$. The conclusion holds for all positive integers $e$ when $X$ is locally complete intersection. The question of when one can take $G=\mathcal{O}_X$ is also investigated. For smooth projective complete intersections it reduces to a question of generation of the Kuznetsov component.

Submitted 13 April, 2023; v1 submitted 31 March, 2023; originally announced March 2023.

Comments: 31 pages. Minor edits

MSC Class: 14A30 (primary); 13A35; 14G17; 13D09

arXiv:2303.00869 [pdf, ps, other]

The Power of Two Choices with Load Comparison Errors

Authors: Sanidhay Bhambay, Arpan Mukhopadhyay, Thirupathaiah Vasantam

Abstract: In this paper, we analyze the effects of erroneous load comparisons on the performance of the Po2 scheme. Specifically, we consider load-dependent and load-independent errors. In the load-dependent error model, an incoming job is sent to the server with the larger queue length among the two sampled servers with probability $ε$ if the difference in the queue lengths of the two sampled servers is less than or equal to a constant $g$; no error is made if the queue-length difference is higher than $g$. For this type of error, we show that the benefits of the Po2 scheme are retained as long as the system size is sufficiently large and $λ$ is sufficiently close to $1$. Furthermore, we show that, unlike the standard Po2 scheme, the performance of the Po2 scheme under this type of error can be worse than the random scheme if $ε > 1/2$ and $λ$ is sufficiently small. In the load-independent error model, the incoming job is sent to the sampled server with the {\em maximum load} with an error probability of $ε$ independent of the loads of the sampled servers. For this model, we show that the performance benefits of the Po2 scheme are retained only if $ε \leq 1/2$; for $ε > 1/2$ we show that the stability region of the system reduces and the system performs poorly in comparison to the {\em random scheme}.

Submitted 14 March, 2023; v1 submitted 1 March, 2023; originally announced March 2023.
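
The two error models described in the abstract above can be illustrated with a small dispatch routine. This is only a sketch of the decision rule as the abstract states it, not the authors' code: the function name `po2_dispatch`, the use of `g=None` to select the load-independent model, sampling two distinct servers, and breaking ties in favor of the first sampled server are all assumptions made for illustration.

```python
import random


def po2_dispatch(queues, eps, g=None, rng=random):
    """Return the index of the server that receives one arriving job.

    Hypothetical helper (not from the paper).
    g=None -> load-independent errors: the longer of the two sampled
              queues is chosen with probability eps, whatever the loads.
    g=int  -> load-dependent errors: the longer queue is chosen with
              probability eps only if the two sampled queue lengths
              differ by at most g; otherwise no error is made.
    """
    # Assumption: the two sampled servers are distinct.
    i, j = rng.sample(range(len(queues)), 2)
    # Assumption: ties are broken in favor of the first sampled server.
    shorter, longer = (i, j) if queues[i] <= queues[j] else (j, i)
    gap = queues[longer] - queues[shorter]

    error_possible = g is None or gap <= g
    if error_possible and rng.random() < eps:
        return longer   # erroneous comparison
    return shorter      # correct Po2 decision
```

With `eps=0` the routine reduces to the standard Po2 rule; the regime $ε > 1/2$ discussed in the abstract corresponds to the longer queue being picked more often than the shorter one whenever an error is possible.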

arXiv:2302.11137 [pdf, other]

Fairguard: Harness Logic-based Fairness Rules in Smart Cities

Authors: Yiqi Zhao, Ziyan An, Xuqing Gao, Ayan Mukhopadhyay, Meiyi Ma

Abstract: Smart cities operate on computational predictive frameworks that collect, aggregate, and utilize data from large-scale sensor networks. However, these frameworks are prone to multiple sources of data and algorithmic bias, which often lead to unfair prediction results. In this work, we first demonstrate that bias persists at a micro-level both temporally and spatially by studying real city data from Chattanooga, TN. To alleviate the issue of such bias, we introduce Fairguard, a micro-level temporal logic-based approach for fair smart city policy adjustment and generation in complex temporal-spatial domains. The Fairguard framework consists of two phases: first, we develop a static generator that is able to reduce data bias based on temporal logic conditions by minimizing correlations between selected attributes. Then, to ensure fairness in predictive algorithms, we design a dynamic component to regulate prediction results and generate future fair predictions by harnessing logic rules. Evaluations show that logic-enabled static Fairguard can effectively reduce the biased correlations while dynamic Fairguard can guarantee fairness on protected groups at run-time with minimal impact on overall performance.

Submitted 8 September, 2023; v1 submitted 21 February, 2023; originally announced February 2023.
