-
Superconductivity in three-dimensional interacting doped topological insulators
Authors:
Andras L. Szabo,
Bitan Roy
Abstract:
Three-dimensional doped Dirac insulators foster simply connected (in both topological and trivial regimes) and annular (deep inside the topological regime) Fermi surfaces (FSs) in the normal state, and allow on-site repulsions among fermions with opposite spin ($U_1$) and parity ($U_2$) eigenvalues. From an unbiased leading-order (one-loop) renormalization group analysis, controlled by a suitable…
▽ More
Three-dimensional doped Dirac insulators foster simply connected (in both topological and trivial regimes) and annular (deep inside the topological regime) Fermi surfaces (FSs) in the normal state, and allow on-site repulsions among fermions with opposite spin ($U_1$) and parity ($U_2$) eigenvalues. From an unbiased leading-order (one-loop) renormalization group analysis, controlled by a suitable $ε$ expansion, we show that this system develops strong propensity toward the nucleation of scalar $s$-wave and odd-parity pseudoscalar $p$-wave pairings, favored by repulsive $U_1$ and $U_2$ interactions, respectively, irrespective of the underlying FS topology. Our results can be pertinent for the observed superconductivity in various doped narrow gap semiconductors, and the theoretical foundation can readily be applied to investigate similar phenomenon in various doped topological materials.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Minsum Problem for Discrete and Weighted Set Flow on Dynamic Path Network
Authors:
Bubai Manna,
Bodhayan Roy,
Vorapong Suppakitpaisarn
Abstract:
In this research, we examine the minsum flow problem in dynamic path networks where flows are represented as discrete and weighted sets. The minsum flow problem has been widely studied for its relevance in finding evacuation routes during emergencies such as earthquakes. However, previous approaches often assume that individuals are separable and identical, which does not adequately account for th…
▽ More
In this research, we examine the minsum flow problem in dynamic path networks where flows are represented as discrete and weighted sets. The minsum flow problem has been widely studied for its relevance in finding evacuation routes during emergencies such as earthquakes. However, previous approaches often assume that individuals are separable and identical, which does not adequately account for the fact that some groups of people, such as families, need to move together and that some groups may be more important than others. To address these limitations, we modify the minsum flow problem to support flows represented as discrete and weighted sets. We also propose a 2-approximation pseudo-polynomial time algorithm to solve this modified problem for path networks with uniform capacity.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
Information-Theoretic Foundations for Neural Scaling Laws
Authors:
Hong Jun Jeon,
Benjamin Van Roy
Abstract:
Neural scaling laws aim to characterize how out-of-sample error behaves as a function of model and training dataset size. Such scaling laws guide allocation of a computational resources between model and data processing to minimize error. However, existing theoretical support for neural scaling laws lacks rigor and clarity, entangling the roles of information and optimization. In this work, we dev…
▽ More
Neural scaling laws aim to characterize how out-of-sample error behaves as a function of model and training dataset size. Such scaling laws guide allocation of a computational resources between model and data processing to minimize error. However, existing theoretical support for neural scaling laws lacks rigor and clarity, entangling the roles of information and optimization. In this work, we develop rigorous information-theoretic foundations for neural scaling laws. This allows us to characterize scaling laws for data generated by a two-layer neural network of infinite width. We observe that the optimal relation between data and model size is linear, up to logarithmic factors, corroborating large-scale empirical investigations. Concise yet general results of the kind we establish may bring clarity to this topic and inform future investigations.
△ Less
Submitted 27 June, 2024;
originally announced July 2024.
-
Covering Simple Orthogonal Polygons with Rectangles
Authors:
Aniket Basu Roy
Abstract:
We study the problem of Covering Orthogonal Polygons with Rectangles. For polynomial-time algorithms, the best-known approximation factor is $O(\sqrt{\log n})$ when the input polygon may have holes [Kumar and Ramesh, STOC '99, SICOMP '03], and there is a $2$-factor approximation algorithm known when the polygon is hole-free [Franzblau, SIDMA '89]. Arguably, an easier problem is the Boundary Cover…
▽ More
We study the problem of Covering Orthogonal Polygons with Rectangles. For polynomial-time algorithms, the best-known approximation factor is $O(\sqrt{\log n})$ when the input polygon may have holes [Kumar and Ramesh, STOC '99, SICOMP '03], and there is a $2$-factor approximation algorithm known when the polygon is hole-free [Franzblau, SIDMA '89]. Arguably, an easier problem is the Boundary Cover problem where we are interested in covering only the boundary of the polygon in contrast to the original problem where we are interested in covering the interior of the polygon, hence it is also referred as the Interior Cover problem. For the Boundary Cover problem, a $4$-factor approximation algorithm is known to exist and it is APX-hard when the polygon has holes [Berman and DasGupta, Algorithmica '94].
In this work, we investigate how effective is local search algorithm for the above covering problems on simple polygons. We prove that a simple local search algorithm yields a PTAS for the Boundary Cover problem when the polygon is simple. Our proof relies on the existence of planar supports on appropriate hypergraphs defined on the Boundary Cover problem instance. On the other hand, we construct instances where support graphs for the Interior Cover problem have arbitrarily large bicliques, thus implying that the same local search technique cannot yield a PTAS for this problem. We also show large locality gap for its dual problem, namely the Maximum Antirectangle problem.
△ Less
Submitted 23 June, 2024;
originally announced June 2024.
-
Notes on heating phase dynamics in Floquet CFTs and Modular quantization
Authors:
Suchetan Das,
Bobby Ezhuthachan,
Somnath Porey,
Baishali Roy
Abstract:
In this article, we explore the connection between the heating phase of periodically driven CFTs and the Modular Hamiltonian of a subregion in the vacuum state. We show that the heating phase Hamiltonian corresponds to the Modular Hamiltonian, with the fixed points map** to the endpoints of the subregion. In the bulk dual, we find that these fixed points correspond to the Ryu-Takayanagi surface…
▽ More
In this article, we explore the connection between the heating phase of periodically driven CFTs and the Modular Hamiltonian of a subregion in the vacuum state. We show that the heating phase Hamiltonian corresponds to the Modular Hamiltonian, with the fixed points map** to the endpoints of the subregion. In the bulk dual, we find that these fixed points correspond to the Ryu-Takayanagi surface of the AdS-Rindler wedge. Consequently, the entanglement entropy associated to the boundary interval within two fixed points exactly matches with the Rindler entropy of AdS-Rindler. We observe the emergent Virasoro algebra in the boundary quantization of the Modular Hamiltonian has a striking similarity with the emergent near Horizon Virasoro algebra. This is a consequence of the fact that while obtaining the boundary Virasoro algebra, a cut-off with conformal boundary condition around the fixed point is introduced, which in the bulk is related to a stretched horizon, with an emergent two-dimensional conformal symmetry. We also argue that as one tunes the parameter space of Floquet Hamiltonians to transition from the non-heating to the heating phase the operator algebra type changes from Von Neumann type $I$ to $III_1$ factor, providing a non-equilibrium analogue of the Hawking-Page transition.
△ Less
Submitted 11 July, 2024; v1 submitted 16 June, 2024;
originally announced June 2024.
-
Attention-Based Learning for Fluid State Interpolation and Editing in a Time-Continuous Framework
Authors:
Bruno Roy
Abstract:
In this work, we introduce FluidsFormer: a transformer-based approach for fluid interpolation within a continuous-time framework. By combining the capabilities of PITT and a residual neural network (RNN), we analytically predict the physical properties of the fluid state. This enables us to interpolate substep frames between simulated keyframes, enhancing the temporal smoothness and sharpness of a…
▽ More
In this work, we introduce FluidsFormer: a transformer-based approach for fluid interpolation within a continuous-time framework. By combining the capabilities of PITT and a residual neural network (RNN), we analytically predict the physical properties of the fluid state. This enables us to interpolate substep frames between simulated keyframes, enhancing the temporal smoothness and sharpness of animations. We demonstrate promising results for smoke interpolation and conduct initial experiments on liquids.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
Moving Mirrors, OTOCs and Scrambling
Authors:
Parthajit Biswas,
Bobby Ezhuthachan,
Arnab Kundu,
Baishali Roy
Abstract:
We explore the physics of scrambling in the moving mirror models, in which a two-dimensional CFT is subjected to a time-dependent boundary condition. It is well-known that by choosing an appropriate mirror profile, one can model quantum aspects of black holes in two-dimensions, ranging from Hawking radiation in an eternal black hole (for an "esca** mirror") to the recent realization of Page curv…
▽ More
We explore the physics of scrambling in the moving mirror models, in which a two-dimensional CFT is subjected to a time-dependent boundary condition. It is well-known that by choosing an appropriate mirror profile, one can model quantum aspects of black holes in two-dimensions, ranging from Hawking radiation in an eternal black hole (for an "esca** mirror") to the recent realization of Page curve in evaporating black holes (for a "kink mirror"). We explore a class of OTOCs in the presence of such a boundary and explicitly demonstrate the following primary aspects: First, we show that the dynamical CFT data directly affect an OTOC and maximally chaotic scrambling occurs for the esca** mirror for a large-$c$ CFT with identity block dominance. We further show that the exponential growth of OTOC associated with the physics of scrambling yields a power-law growth in the model for evaporating black holes which demonstrates a unitary dynamics in terms of a Page curve. We also demonstrate that, by tuning a parameter, one can naturally interpolate between an exponential growth associated to scrambling and a power-law growth in unitary dynamics. Our work explicitly exhibits the role of higher-point functions in CFT dynamics as well as the distinction between scrambling and Page curve. We also discuss several future possibilities based on this class of models.
△ Less
Submitted 9 June, 2024;
originally announced June 2024.
-
On Approximating the Dynamic and Discrete Network Flow Problem
Authors:
Bubai Manna,
Bodhayan Roy,
Vorapong Suppakitpaisarn
Abstract:
We examine the dynamic network flow problem under the assumption that the flow consists of discrete units. The dynamic network flow problem is commonly addressed in the context of develo** evacuation plans, where the flow is typically treated as a continuous quantity. However, real-world scenarios often involve moving groups, such as families, as single units. We demonstrate that solving the dyn…
▽ More
We examine the dynamic network flow problem under the assumption that the flow consists of discrete units. The dynamic network flow problem is commonly addressed in the context of develo** evacuation plans, where the flow is typically treated as a continuous quantity. However, real-world scenarios often involve moving groups, such as families, as single units. We demonstrate that solving the dynamic flow problem with this consideration is APX-hard. Conversely, we present a PTAS for instances where the base graph is a path with a constant number of nodes. We introduce a `ready time' constraint to the minsum bin packing problem, meaning certain items cannot be placed in specific bins, develop a PTAS for this modified problem, and apply our algorithms to the discrete and dynamic flow problem.
△ Less
Submitted 25 April, 2024;
originally announced April 2024.
-
Minimum Consistent Subset in Trees and Interval Graphs
Authors:
Aritra Banik,
Sayani Das,
Anil Maheshwari,
Bubai Manna,
Subhas C Nandy,
Krishna Priya K M,
Bodhayan Roy,
Sasanka Roy,
Abhishek Sahu
Abstract:
In the Minimum Consistent Subset (MCS) problem, we are presented with a connected simple undirected graph $G=(V,E)$, consisting of a vertex set $V$ of size $n$ and an edge set $E$. Each vertex in $V$ is assigned a color from the set $\{1,2,\ldots, c\}$. The objective is to determine a subset $V' \subseteq V$ with minimum possible cardinality, such that for every vertex $v \in V$, at least one of i…
▽ More
In the Minimum Consistent Subset (MCS) problem, we are presented with a connected simple undirected graph $G=(V,E)$, consisting of a vertex set $V$ of size $n$ and an edge set $E$. Each vertex in $V$ is assigned a color from the set $\{1,2,\ldots, c\}$. The objective is to determine a subset $V' \subseteq V$ with minimum possible cardinality, such that for every vertex $v \in V$, at least one of its nearest neighbors in $V'$ (measured in terms of the hop distance) shares the same color as $v$. The decision problem, indicating whether there exists a subset $V'$ of cardinality at most $l$ for some positive integer $l$, is known to be NP-complete even for planar graphs.
In this paper, we establish that the MCS problem for trees, when the number of colors $c$ is considered an input parameter, is NP-complete. We propose a fixed-parameter tractable (FPT) algorithm for MCS on trees running in $O(2^{6c}n^6)$ time, significantly improving the currently best-known algorithm whose running time is $O(2^{4c}n^{2c+3})$.
In an effort to comprehensively understand the computational complexity of the MCS problem across different graph classes, we extend our investigation to interval graphs. We show that it remains NP-complete for interval graphs, thus enriching graph classes where MCS remains intractable.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
OffRAMPS: An FPGA-based Intermediary for Analysis and Modification of Additive Manufacturing Control Systems
Authors:
Jason Blocklove,
Md Raz,
Prithwish Basu Roy,
Hammond Pearce,
Prashanth Krishnamurthy,
Farshad Khorrami,
Ramesh Karri
Abstract:
Cybersecurity threats in Additive Manufacturing (AM) are an increasing concern as AM adoption continues to grow. AM is now being used for parts in the aerospace, transportation, and medical domains. Threat vectors which allow for part compromise are particularly concerning, as any failure in these domains would have life-threatening consequences. A major challenge to investigation of AM part-compr…
▽ More
Cybersecurity threats in Additive Manufacturing (AM) are an increasing concern as AM adoption continues to grow. AM is now being used for parts in the aerospace, transportation, and medical domains. Threat vectors which allow for part compromise are particularly concerning, as any failure in these domains would have life-threatening consequences. A major challenge to investigation of AM part-compromises comes from the difficulty in evaluating and benchmarking both identified threat vectors as well as methods for detecting adversarial actions. In this work, we introduce a generalized platform for systematic analysis of attacks against and defenses for 3D printers. Our "OFFRAMPS" platform is based on the open-source 3D printer control board "RAMPS." OFFRAMPS allows analysis, recording, and modification of all control signals and I/O for a 3D printer. We show the efficacy of OFFRAMPS by presenting a series of case studies based on several Trojans, including ones identified in the literature, and show that OFFRAMPS can both emulate and detect these attacks, i.e., it can both change and detect arbitrary changes to the g-code print commands.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
$\mathbb{A}^1$-homotopy type of $\mathbb{A}^2 \setminus \left\{(0,0) \right\}$
Authors:
Utsav Choudhury,
Biman Roy
Abstract:
In this article we prove that any $\mathbb{A}^1$-connected smooth $k$-variety is $\mathbb{A}^1$-uniruled for any algebraically closed field $k$. We establish that if a non empty open subscheme $X$ of a smooth affine $k$-scheme is $\mathbb{A}^1$-weakly equivalent to $\mathbb{A}^2_{k} \setminus \left\{(0,0) \right\}$, then $X \cong \mathbb{A}^2_{k} \setminus \left\{(0,0) \right\}$ as $k$-varieties f…
▽ More
In this article we prove that any $\mathbb{A}^1$-connected smooth $k$-variety is $\mathbb{A}^1$-uniruled for any algebraically closed field $k$. We establish that if a non empty open subscheme $X$ of a smooth affine $k$-scheme is $\mathbb{A}^1$-weakly equivalent to $\mathbb{A}^2_{k} \setminus \left\{(0,0) \right\}$, then $X \cong \mathbb{A}^2_{k} \setminus \left\{(0,0) \right\}$ as $k$-varieties for any field $k$ of characteristic $0$.
△ Less
Submitted 1 April, 2024;
originally announced April 2024.
-
From Local Spin Nematicity to Altermagnets: Footprints of Band Topology
Authors:
Sanjib Kumar Das,
Bitan Roy
Abstract:
Altermagnets are crystallographic rotational symmetry breaking spin-ordered states, possessing a net zero magnetization despite manifesting Kramers non-degenerate bands. Here, we show that momentum-independent local spin nematic orders in monolayer, Bernal bilayer and rhombohedral trilayer graphene give rise to $p$-wave, $d$-wave and $f$-wave altermagnets, respectively, thereby inheriting topology…
▽ More
Altermagnets are crystallographic rotational symmetry breaking spin-ordered states, possessing a net zero magnetization despite manifesting Kramers non-degenerate bands. Here, we show that momentum-independent local spin nematic orders in monolayer, Bernal bilayer and rhombohedral trilayer graphene give rise to $p$-wave, $d$-wave and $f$-wave altermagnets, respectively, thereby inheriting topology of linear, quadratic and cubic free fermion band dispersions that are also described in terms of angular momentum $\ell=1,\; 2$ and $3$ harmonics in the reciprocal space. The same conclusions also hold inside a spin-triplet nematic superconductor, featuring Majorana altermagnets. Altogether, these findings highlight the importance of electronic band structure in identifying such exotic magnetic orders in quantum materials. We depict the effects of in-plane magnetic fields on altermagnets, and propose novel spin-disordered alter-valleymagnets in these systems.
△ Less
Submitted 8 April, 2024; v1 submitted 21 March, 2024;
originally announced March 2024.
-
Stabilizing topological superconductivity in disordered spin-orbit coupled semiconductor-superconductor heterostructures
Authors:
Binayyak B. Roy,
Rimika Jaiswal,
Tudor D. Stanescu,
Sumanta Tewari
Abstract:
We investigate theoretically a one-dimensional semiconductor-superconductor (SM-SC) heterostructure with Rashba spin-orbit coupling and parallel Zeeman field in the presence of disorder generated by random charged impurities and identify the optimal regimes for realizing topological superconductivity and Majorana zero modes. Using a Green's function approach, we show that upon increasing the disor…
▽ More
We investigate theoretically a one-dimensional semiconductor-superconductor (SM-SC) heterostructure with Rashba spin-orbit coupling and parallel Zeeman field in the presence of disorder generated by random charged impurities and identify the optimal regimes for realizing topological superconductivity and Majorana zero modes. Using a Green's function approach, we show that upon increasing the disorder strength the stable topological superconducting phase characterized by robust end-to-end Majorana correlations "migrates" toward larger values of the Zeeman field and can be stabilized by increasing the effective SM-SC coupling. Based on these findings, we propose a strategy for accessing a regime characterized by well-separated Majorana zero modes that is based on (a) enhancing the strength of the effective SM-SC coupling (e.g., through interface engineering) and (b) expanding the range of accessible Zeeman fields (e.g., by enhancing the gyromagnetic ratio or optimizing the parent superconductor, to enable the application of larger magnetic fields). While this strategy may still require some reduction of the disorder strength, this requirement is significantly less strict than the corresponding requirement in a strategy that focuses exclusively on disorder reduction.
△ Less
Submitted 29 February, 2024; v1 submitted 28 February, 2024;
originally announced February 2024.
-
Krylov Complexity in $2d$ CFTs with SL$(2,\mathbb{R})$ deformed Hamiltonians
Authors:
Vinay Malvimat,
Somnath Porey,
Baishali Roy
Abstract:
In this study, we analyze Krylov Complexity in two-dimensional conformal field theories subjected to deformed SL$(2,\mathbb{R})$ Hamiltonians. In the vacuum state, we find that the K-complexity exhibits a universal phase structure. The phase structure involves the K-complexity exhibiting an oscillatory behaviour in the non-heating phase, which contrasts with the exponential growth observed in the…
▽ More
In this study, we analyze Krylov Complexity in two-dimensional conformal field theories subjected to deformed SL$(2,\mathbb{R})$ Hamiltonians. In the vacuum state, we find that the K-complexity exhibits a universal phase structure. The phase structure involves the K-complexity exhibiting an oscillatory behaviour in the non-heating phase, which contrasts with the exponential growth observed in the heating phase, while it displays polynomial growth at the phase boundary. Furthermore, we extend our analysis to compute the K-complexity of a light operator in excited states, considering both large-c CFT and free field theory. In the free field theory, we find a state-independent phase structure of K-complexity. However, in the large-c CFT, the behavior varies, with the K-Complexity once again displaying exponential growth in the heating phase and polynomial growth at the phase boundary. Notably, the precise exponent governing this growth depends on the heaviness of the state under examination. In the non-heating phase, we observe a transition in K-complexity behavior from oscillatory to exponential growth, akin to findings in [1], as it represents a special case within the non-heating phase.
△ Less
Submitted 24 February, 2024;
originally announced February 2024.
-
Efficient Exploration for LLMs
Authors:
Vikranth Dwaracherla,
Seyed Mohammad Asghari,
Botao Hao,
Benjamin Van Roy
Abstract:
We present evidence of substantial benefit from efficient exploration in gathering human feedback to improve large language models. In our experiments, an agent sequentially generates queries while fitting a reward model to the feedback received. Our best-performing agent generates queries using double Thompson sampling, with uncertainty represented by an epistemic neural network. Our results demo…
▽ More
We present evidence of substantial benefit from efficient exploration in gathering human feedback to improve large language models. In our experiments, an agent sequentially generates queries while fitting a reward model to the feedback received. Our best-performing agent generates queries using double Thompson sampling, with uncertainty represented by an epistemic neural network. Our results demonstrate that efficient exploration enables high levels of performance with far fewer queries. Further, both uncertainty estimation and the choice of exploration scheme play critical roles.
△ Less
Submitted 4 June, 2024; v1 submitted 1 February, 2024;
originally announced February 2024.
-
An Information-Theoretic Analysis of In-Context Learning
Authors:
Hong Jun Jeon,
Jason D. Lee,
Qi Lei,
Benjamin Van Roy
Abstract:
Previous theoretical results pertaining to meta-learning on sequences build on contrived assumptions and are somewhat convoluted. We introduce new information-theoretic tools that lead to an elegant and very general decomposition of error into three components: irreducible error, meta-learning error, and intra-task error. These tools unify analyses across many meta-learning challenges. To illustra…
▽ More
Previous theoretical results pertaining to meta-learning on sequences build on contrived assumptions and are somewhat convoluted. We introduce new information-theoretic tools that lead to an elegant and very general decomposition of error into three components: irreducible error, meta-learning error, and intra-task error. These tools unify analyses across many meta-learning challenges. To illustrate, we apply them to establish new results about in-context learning with transformers. Our theoretical results characterizes how error decays in both the number of training sequences and sequence lengths. Our results are very general; for example, they avoid contrived mixing time assumptions made by all prior results that establish decay of error with sequence length.
△ Less
Submitted 27 January, 2024;
originally announced January 2024.
-
Adaptive Crowdsourcing Via Self-Supervised Learning
Authors:
Anmol Kagrecha,
Henrik Marklund,
Benjamin Van Roy,
Hong Jun Jeon,
Richard Zeckhauser
Abstract:
Common crowdsourcing systems average estimates of a latent quantity of interest provided by many crowdworkers to produce a group estimate. We develop a new approach -- predict-each-worker -- that leverages self-supervised learning and a novel aggregation scheme. This approach adapts weights assigned to crowdworkers based on estimates they provided for previous quantities. When skills vary across c…
▽ More
Common crowdsourcing systems average estimates of a latent quantity of interest provided by many crowdworkers to produce a group estimate. We develop a new approach -- predict-each-worker -- that leverages self-supervised learning and a novel aggregation scheme. This approach adapts weights assigned to crowdworkers based on estimates they provided for previous quantities. When skills vary across crowdworkers or their estimates correlate, the weighted sum offers a more accurate group estimate than the average. Existing algorithms such as expectation maximization can, at least in principle, produce similarly accurate group estimates. However, their computational requirements become onerous when complex models, such as neural networks, are required to express relationships among crowdworkers. Predict-each-worker accommodates such complexity as well as many other practical challenges. We analyze the efficacy of predict-each-worker through theoretical and computational studies. Among other things, we establish asymptotic optimality as the number of engagements per crowdworker grows.
△ Less
Submitted 1 February, 2024; v1 submitted 24 January, 2024;
originally announced January 2024.
-
Fusion of $^{7}$Li with $^{205}$Tl at near barrier energies
Authors:
V. V. Parkar,
Prasanna M.,
Ruchi Rathod,
V. Jha,
S. K. Pandit,
A. Shrivastava,
K. Mahata,
K. Ramachandran,
R. Palit,
Md. S. R. Laskar,
B. J. Roy,
Bhushan Kanagalekar,
B. G. Hegde
Abstract:
The complete and incomplete fusion cross sections for the $^{7}$Li+$^{205}$Tl reaction were measured at near barrier energies by online characteristic $γ$ ray detection technique. The complete fusion (CF) cross sections at energies above the Coulomb barrier were found to be suppressed by $\sim$ 26 \% compared to the coupled channel calculations. Reduced fusion cross sections for the present system…
▽ More
The complete and incomplete fusion cross sections for the $^{7}$Li+$^{205}$Tl reaction were measured at near barrier energies by online characteristic $γ$ ray detection technique. The complete fusion (CF) cross sections at energies above the Coulomb barrier were found to be suppressed by $\sim$ 26 \% compared to the coupled channel calculations. Reduced fusion cross sections for the present system at energies normalised to the Coulomb barrier were also found to be systematically lower than those with strongly bound projectiles forming a similar compound nucleus. The suppression observed in CF cross sections is found to be commensurate with the measured total incomplete fusion (ICF) cross sections. In the ICF cross sections, t capture is found to be dominant than $α$ capture at all the measured energies. The systematic study of available CF, ICF and total fusion (TF) data with $^7$Li projectile is performed.
△ Less
Submitted 15 January, 2024;
originally announced January 2024.
-
Contiguous Allocation of Indivisible Items on a Path
Authors:
Yasushi Kawase,
Bodhayan Roy,
Mohammad Azharuddin Sanpui
Abstract:
We study the problem of allocating indivisible items on a path among agents. The objective is to find a fair and efficient allocation in which each agent's bundle forms a contiguous block on the line. We demonstrate that, even when the valuations are binary additive, deciding whether every item can be allocated to an agent who wants it is NP-complete. Consequently, we provide two fixed-parameter t…
▽ More
We study the problem of allocating indivisible items on a path among agents. The objective is to find a fair and efficient allocation in which each agent's bundle forms a contiguous block on the line. We demonstrate that, even when the valuations are binary additive, deciding whether every item can be allocated to an agent who wants it is NP-complete. Consequently, we provide two fixed-parameter tractable (FPT) algorithms for maximizing utilitarian social welfare, with respect to the number of agents and the number of items. Additionally, we present a 2-approximation algorithm for the special case when the valuations are binary additive and the maximum utility is equal to the number of items. Furthermore, we establish that deciding whether the maximum egalitarian social welfare is at least 2 or at most 1 is NP-complete, even when the valuations are binary additive. We also explore the case where the order of the blocks of items allocated to the agents is predetermined. In this case, we show that both maximum utilitarian social welfare and egalitarian social welfare can be computed in polynomial time. However, we determine that checking the existence of an EF1 allocation is NP-complete, even when the valuations are binary additive.
△ Less
Submitted 8 January, 2024;
originally announced January 2024.
-
Dynamical stability and phase space analysis of an Emergent Universe with non-interacting and interacting fluids
Authors:
Bikash Chandra Roy,
Anirban Chanda,
Bikash Chandra Paul
Abstract:
We investigate the evolution of a flat Emergent Universe obtained with a non-linear equation of state (nEoS) in Einstein's general theory of Relativity. The nEoS is equivalent to three different types of barotropic cosmic fluids, which are found from the nEoS parameter. The EU began expanding initially with no interaction among the cosmic fluids. Assuming an interaction that sets in at a time…
▽ More
We investigate the evolution of a flat Emergent Universe obtained with a non-linear equation of state (nEoS) in Einstein's general theory of Relativity. The nEoS is equivalent to three different types of barotropic cosmic fluids, which are found from the nEoS parameter. The EU began expanding initially with no interaction among the cosmic fluids. Assuming an interaction that sets in at a time $t \geq t_i$ in the fluid components, we study the evolution of the EU that leads to the present observed universe. We adopt a dynamical system analysis method to obtain the critical points of the autonomous system for studying the evolution of an EU with or without interaction in fluid components. We also study the stability of critical points and draw the phase portraits. The density parameters and the corresponding cosmological parameters are obtained for both the non-interacting and interacting phases of the evolution dynamics.
△ Less
Submitted 5 January, 2024; v1 submitted 1 January, 2024;
originally announced January 2024.
-
RLHF and IIA: Perverse Incentives
Authors:
Wanqiao Xu,
Shi Dong,
Xiuyuan Lu,
Grace Lam,
Zheng Wen,
Benjamin Van Roy
Abstract:
Existing algorithms for reinforcement learning from human feedback (RLHF) can incentivize responses at odds with preferences because they are based on models that assume independence of irrelevant alternatives (IIA). The perverse incentives induced by IIA hinder innovations on query formats and learning algorithms.
Existing algorithms for reinforcement learning from human feedback (RLHF) can incentivize responses at odds with preferences because they are based on models that assume independence of irrelevant alternatives (IIA). The perverse incentives induced by IIA hinder innovations on query formats and learning algorithms.
△ Less
Submitted 1 February, 2024; v1 submitted 2 December, 2023;
originally announced December 2023.
-
Geometric Tracking Control of a Multi-rotor UAV for Partially Known Trajectories
Authors:
Yogesh Kumar,
S. B. Roy,
P. B. Sujit
Abstract:
This paper presents a trajectory-tracking controller for multi-rotor unmanned aerial vehicles (UAVs) in scenarios where only the desired position and heading are known without the higher-order derivatives. The proposed solution modifies the state-of-the-art geometric controller, effectively addressing challenges related to the non-existence of the desired attitude and ensuring positive total thrus…
▽ More
This paper presents a trajectory-tracking controller for multi-rotor unmanned aerial vehicles (UAVs) in scenarios where only the desired position and heading are known without the higher-order derivatives. The proposed solution modifies the state-of-the-art geometric controller, effectively addressing challenges related to the non-existence of the desired attitude and ensuring positive total thrust input for all time. We tackle the additional challenge of the non-availability of the higher derivatives of the trajectory by introducing novel nonlinear filter structures. We formalize theoretically the effect of these filter structures on the system error dynamics. Subsequently, through a rigorous theoretical analysis, we demonstrate that the proposed controller leads to uniformly ultimately bounded system error dynamics.
△ Less
Submitted 13 November, 2023;
originally announced November 2023.
-
KiD: A Hardware Design Framework Targeting Unified NTT Multiplication for CRYSTALS-Kyber and CRYSTALS-Dilithium on FPGA
Authors:
Suraj Mandal,
Debapriya Basu Roy
Abstract:
Large-degree polynomial multiplication is an integral component of post-quantum secure lattice-based cryptographic algorithms like CRYSTALS-Kyber and Dilithium. The computational complexity of large-degree polynomial multiplication can be reduced significantly through Number Theoretic Transformation (NTT). In this paper, we aim to develop a unified and shared NTT architecture that can support poly…
▽ More
Large-degree polynomial multiplication is an integral component of post-quantum secure lattice-based cryptographic algorithms like CRYSTALS-Kyber and Dilithium. The computational complexity of large-degree polynomial multiplication can be reduced significantly through Number Theoretic Transformation (NTT). In this paper, we aim to develop a unified and shared NTT architecture that can support polynomial multiplication for both CRYSTALS-Kyber and Dilithium. More specifically, in this paper, we have proposed three different unified architectures for NTT multiplication in CRYSTALS-Kyber and Dilithium with varying numbers of configurable radix-2 butterfly units. Additionally, the developed implementation is coupled with a conflict-free memory map** scheme that allows the architecture to be fully pipelined. We have validated our implementation on Artix-7, Zynq-7000 and Zynq Ultrascale+ FPGAs. Our standalone implementations for NTT multiplication for CRYSTALS-Kyber and Dilithium perform better than the existing works, and our unified architecture shows excellent area and timing performance compared to both standalone and existing unified implementations. This architecture can potentially be used for compact and efficient implementation for CRYSTALS-Kyber and Dilithium.
△ Less
Submitted 8 November, 2023;
originally announced November 2023.
-
Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling
Authors:
Zheqing Zhu,
Yueyang Liu,
Xu Kuang,
Benjamin Van Roy
Abstract:
Real-world applications of contextual bandits often exhibit non-stationarity due to seasonality, serendipity, and evolving social trends. While a number of non-stationary contextual bandit learning algorithms have been proposed in the literature, they excessively explore due to a lack of prioritization for information of enduring value, or are designed in ways that do not scale in modern applicati…
▽ More
Real-world applications of contextual bandits often exhibit non-stationarity due to seasonality, serendipity, and evolving social trends. While a number of non-stationary contextual bandit learning algorithms have been proposed in the literature, they excessively explore due to a lack of prioritization for information of enduring value, or are designed in ways that do not scale in modern applications with high-dimensional user-specific features and large action set, or both. In this paper, we introduce a novel non-stationary contextual bandit algorithm that addresses these concerns. It combines a scalable, deep-neural-network-based architecture with a carefully designed exploration mechanism that strategically prioritizes collecting information with the most lasting value in a non-stationary environment. Through empirical evaluations on two real-world recommendation datasets, which exhibit pronounced non-stationarity, we demonstrate that our approach significantly outperforms the state-of-the-art baselines.
△ Less
Submitted 14 October, 2023; v1 submitted 11 October, 2023;
originally announced October 2023.
-
Model non-Hermitian topological operators without skin effect
Authors:
Daniel J. Salib,
Sanjib Kumar Das,
Bitan Roy
Abstract:
We propose a general principle of constructing non-Hermitian (NH) operators for insulating and gapless topological phases in any dimension ($d$) that over an extended NH parameter regime feature real eigenvalues and zero-energy topological boundary modes, when in particular their Hermitian cousins are also topological. However, the topological zero modes disappear when the NH operators accommodate…
▽ More
We propose a general principle of constructing non-Hermitian (NH) operators for insulating and gapless topological phases in any dimension ($d$) that over an extended NH parameter regime feature real eigenvalues and zero-energy topological boundary modes, when in particular their Hermitian cousins are also topological. However, the topological zero modes disappear when the NH operators accommodate complex eigenvalues. These systems are always devoid of NH skin effects, thereby extending the realm of the bulk-boundary correspondence to NH systems in terms of solely the left or right zero-energy boundary localized eigenmodes. We showcase these general and robust outcomes for NH topological insulators in $d=1,2$ and $3$, encompassing their higher-order incarnations, as well as for NH topological Dirac, Weyl and nodal-loop semimetals. Possible realizations of proposed NH topological phases in designer materials, optical lattices and classical metamaterials are highlighted.
△ Less
Submitted 21 September, 2023;
originally announced September 2023.
-
Observational constraints on the Emergent Universe with interacting non-linear fluids and its stability analysis
Authors:
Anirban Chanda,
Bikash Chandra Roy,
Kazuharu Bamba,
Bikash Chandra Paul
Abstract:
We investigate a flat Emergent Universe (EU) with a nonlinear equation of state which is equivalent to three different compositions of fluids. In the EU, initially, the evolution of the universe began with no interaction, but as time evolves, an interaction sets in among the three fluids leading to the observed universe. The characteristic of an EU is that it is a singularity-free universe that ev…
▽ More
We investigate a flat Emergent Universe (EU) with a nonlinear equation of state which is equivalent to three different compositions of fluids. In the EU, initially, the evolution of the universe began with no interaction, but as time evolves, an interaction sets in among the three fluids leading to the observed universe. The characteristic of an EU is that it is a singularity-free universe that evolves with all the basic features of the early evolution. A given nonlinear equation of state parameter permits a universe with three different fluids. We get a universe with dark energy, cosmic string, and radiation domination to begin with, which at a later epoch transits into a universe with three different fluids with matter domination, dark matter, and dark energy for a given interaction strength among the cosmic fluids. Later the model parameters are constrained using the observed Hubble data and Type Ia Supernova (SnIa) data from the Pantheon data set. The classical stability analysis of the model is performed using the square speed of sound. It is found that a theoretically stable cosmological model can be obtained in this case, however, the model becomes classically unstable at the present epoch when the observational bounds on the model parameters are taken into account.
△ Less
Submitted 17 September, 2023;
originally announced September 2023.
-
Quantum Electrodynamics of Non-Hermitian Dirac Fermions
Authors:
Sk Asrap Murshed,
Bitan Roy
Abstract:
We develop an effective quantum electrodynamics for non-Hermitian (NH) Dirac materials interacting with photons. These systems are described by nonspatial symmetry protected Lorentz invariant NH Dirac operators, featuring two velocity parameters $v_{_{\rm H}}$ and $v_{_{\rm NH}}$ associated with the standard Hermitian and a masslike anti-Hermitian Dirac operators, respectively. They display linear…
▽ More
We develop an effective quantum electrodynamics for non-Hermitian (NH) Dirac materials interacting with photons. These systems are described by nonspatial symmetry protected Lorentz invariant NH Dirac operators, featuring two velocity parameters $v_{_{\rm H}}$ and $v_{_{\rm NH}}$ associated with the standard Hermitian and a masslike anti-Hermitian Dirac operators, respectively. They display linear energy-momentum relation, however, in terms of an effective Fermi velocity $v_{_{\rm F}}=\sqrt{v^2_{_{\rm H}}-v^2_{_{\rm NH}}}$ of NH Dirac fermions. Interaction with the fluctuating electromagnetic radiation then gives birth to an emergent Lorentz symmetry in this family of NH Dirac materials in the deep infrared regime, where the system possesses a unique terminal velocity $v_{_{\rm F}}=c$, with $c$ being the speed of light. While in two dimensions such a terminal velocity is set by the speed of light in the free space, dynamic screening in three spatial dimensions permits its nonuniversal values. Manifestations of such an emergent spacetime symmetry on the scale dependence of various physical observables in correlated NH Dirac materials are discussed.
△ Less
Submitted 25 January, 2024; v1 submitted 14 September, 2023;
originally announced September 2023.
-
Reusability Challenges of Scientific Workflows: A Case Study for Galaxy
Authors:
Khairul Alam,
Banani Roy,
Alexander Serebrenik
Abstract:
Scientific workflow has become essential in software engineering because it provides a structured approach to designing, executing, and analyzing scientific experiments. Software developers and researchers have developed hundreds of scientific workflow management systems so scientists in various domains can benefit from them by automating repetitive tasks, enhancing collaboration, and ensuring the…
▽ More
Scientific workflow has become essential in software engineering because it provides a structured approach to designing, executing, and analyzing scientific experiments. Software developers and researchers have developed hundreds of scientific workflow management systems so scientists in various domains can benefit from them by automating repetitive tasks, enhancing collaboration, and ensuring the reproducibility of their results. However, even for expert users, workflow creation is a complex task due to the dramatic growth of tools and data heterogeneity. Thus, scientists attempt to reuse existing workflows shared in workflow repositories. Unfortunately, several challenges prevent scientists from reusing those workflows. In this study, we thus first attempted to identify those reusability challenges. We also offered an action list and evidence-based guidelines to promote the reusability of scientific workflows. Our intensive manual investigation examined the reusability of existing workflows and exposed several challenges. The challenges preventing reusability include tool upgrading, tool support unavailability, design flaws, incomplete workflows, failure to load a workflow, etc. Such challenges and our action list offered guidelines to future workflow composers to create better workflows with enhanced reusability. In the future, we plan to develop a recommender system using reusable workflows that can assist scientists in creating effective and error-free workflows.
△ Less
Submitted 13 September, 2023;
originally announced September 2023.
-
Unveiling the potential of large language models in generating semantic and cross-language clones
Authors:
Palash R. Roy,
Ajmain I. Alam,
Farouq Al-omari,
Banani Roy,
Chanchal K. Roy,
Kevin A. Schneider
Abstract:
Semantic and Cross-language code clone generation may be useful for code reuse, code comprehension, refactoring and benchmarking. OpenAI's GPT model has potential in such clone generation as GPT is used for text generation. When developers copy/paste codes from Stack Overflow (SO) or within a system, there might be inconsistent changes leading to unexpected behaviours. Similarly, if someone posses…
▽ More
Semantic and Cross-language code clone generation may be useful for code reuse, code comprehension, refactoring and benchmarking. OpenAI's GPT model has potential in such clone generation as GPT is used for text generation. When developers copy/paste codes from Stack Overflow (SO) or within a system, there might be inconsistent changes leading to unexpected behaviours. Similarly, if someone possesses a code snippet in a particular programming language but seeks equivalent functionality in a different language, a semantic cross-language code clone generation approach could provide valuable assistance. In this study, using SemanticCloneBench as a vehicle, we evaluated how well the GPT-3 model could help generate semantic and cross-language clone variants for a given fragment.We have comprised a diverse set of code fragments and assessed GPT-3s performance in generating code variants.Through extensive experimentation and analysis, where 9 judges spent 158 hours to validate, we investigate the model's ability to produce accurate and semantically correct variants. Our findings shed light on GPT-3's strengths in code generation, offering insights into the potential applications and challenges of using advanced language models in software development. Our quantitative analysis yields compelling results. In the realm of semantic clones, GPT-3 attains an impressive accuracy of 62.14% and 0.55 BLEU score, achieved through few-shot prompt engineering. Furthermore, the model shines in transcending linguistic confines, boasting an exceptional 91.25% accuracy in generating cross-language clones
△ Less
Submitted 12 September, 2023;
originally announced September 2023.
-
Multiplierless Design of High-Speed Very Large Constant Multiplications
Authors:
Levent Aksoy,
Debapriya Basu Roy,
Malik Imran,
Samuel Pagliarini
Abstract:
In cryptographic algorithms, the constants to be multiplied by a variable can be very large due to security requirements. Thus, the hardware complexity of such algorithms heavily depends on the design architecture handling large constants. In this paper, we introduce an electronic design automation tool, called LEIGER, which can automatically generate the realizations of very large constant multip…
▽ More
In cryptographic algorithms, the constants to be multiplied by a variable can be very large due to security requirements. Thus, the hardware complexity of such algorithms heavily depends on the design architecture handling large constants. In this paper, we introduce an electronic design automation tool, called LEIGER, which can automatically generate the realizations of very large constant multiplications for low-complexity and high-speed applications, targeting the ASIC design platform. LEIGER can utilize the shift-adds architecture and use 3-input operations, i.e., carry-save adders (CSAs), where the number of CSAs is reduced using a prominent optimization algorithm. It can also generate constant multiplications under a hybrid design architecture, where 2-and 3-input operations are used at different stages. Moreover, it can describe constant multiplications under a design architecture using compressor trees. As a case study, high-speed Montgomery multiplication, which is a fundamental operation in cryptographic algorithms, is designed with its constant multiplication block realized under the proposed architectures. Experimental results indicate that LEIGER enables a designer to explore the trade-off between area and delay of the very large constant and Montgomery multiplications and leads to designs with area-delay product, latency, and energy consumption values significantly better than those obtained by a recently proposed algorithm.
△ Less
Submitted 12 September, 2023; v1 submitted 11 September, 2023;
originally announced September 2023.
-
Quantized thermal and spin transports of dirty planar topological superconductors
Authors:
Sanjib Kumar Das,
Bitan Roy
Abstract:
Nontrivial bulk topological invariants of quantum materials can leave their signatures on charge, thermal and spin transports. In two dimensions, their imprints can be experimentally measured from well-developed multiterminal Hall bar arrangements. Here, we numerically compute the low temperature ($T$) thermal ($κ_{xy}$) and zero temperature spin ($σ^{sp}_{xy}$) Hall conductivities, and longitudin…
▽ More
Nontrivial bulk topological invariants of quantum materials can leave their signatures on charge, thermal and spin transports. In two dimensions, their imprints can be experimentally measured from well-developed multiterminal Hall bar arrangements. Here, we numerically compute the low temperature ($T$) thermal ($κ_{xy}$) and zero temperature spin ($σ^{sp}_{xy}$) Hall conductivities, and longitudinal thermal conductance ($G^{th}_{xx}$) of various prominent two-dimensional fully gapped topological superconductors, belonging to distinct Altland-Zirnbauer symmetry classes, namely $p+ip$ (class D), $d+id$ (class C) and $p \pm ip$ (class DIII) paired states, in mesoscopic six-terminal Hall bar setups from the scattering matrix formalism using Kwant. In both clean and weak disorder limits, the time-reversal symmetry breaking $p+ip$ and $d+id$ pairings show half-quantized and quantized $κ_{xy}$ [in units of $κ_0=π^2 k^2_B T/(3h)$], respectively, while the latter one in addition accommodates a quantized $σ^{sp}_{xy}$ [in units of $σ^{sp}_0=\hbar/(8 π)$]. By contrast, the time-reversal invariant $p \pm ip$ pairing only displays a quantized $G^{th}_{xx}$ at low $T$ up to a moderate strength of disorder. In the strong disorder regime, all these topological responses ($κ_{xy}$, $σ^{sp}_{xy}$, and $G^{th}_{xx}$) vanish. Possible material platforms hosting such paired states and manifesting these robust topological thermal and spin responses are discussed.
△ Less
Submitted 2 May, 2024; v1 submitted 31 August, 2023;
originally announced August 2023.
-
Yukawa-Lorentz Symmetry in Non-Hermitian Dirac Materials
Authors:
Vladimir Juricic,
Bitan Roy
Abstract:
Lorentz spacetime symmetry represents a unifying feature of the fundamental forces, typically manifest at sufficiently high energies, while in quantum materials it emerges in the deep low-energy regime. However, its fate in quantum materials coupled to an environment thus far remained unexplored. We here introduce a general framework of constructing symmetry-protected Lorentz invariant non-Hermiti…
▽ More
Lorentz spacetime symmetry represents a unifying feature of the fundamental forces, typically manifest at sufficiently high energies, while in quantum materials it emerges in the deep low-energy regime. However, its fate in quantum materials coupled to an environment thus far remained unexplored. We here introduce a general framework of constructing symmetry-protected Lorentz invariant non-Hermitian (NH) Dirac semimetals (DSMs), realized by invoking masslike anti-Hermitian Dirac operators to its Hermitian counterpart. Such NH DSMs feature purely real or imaginary isotropic linear band dispersion, yielding a vanishing density of states. Dynamic mass orderings in NH DSMs thus take place for strong Hubbardlike local interactions through a quantum phase transition, hosting a non-Fermi liquid, beyond which the system becomes an insulator. We show that depending on the internal Clifford algebra between the NH Dirac operator and candidate mass order-parameter, the resulting quantum-critical fluid either remains coupled with the environment or recovers full Hermiticity by decoupling from the bath, while always enjoying an emergent Yukawa-Lorentz symmetry in terms of a unique terminal velocity. We showcase the competition between such mass orderings, their hallmarks on quasiparticle spectra in the ordered phases, and the relevance of our findings for correlated designer NH Dirac materials.
△ Less
Submitted 28 May, 2024; v1 submitted 31 August, 2023;
originally announced August 2023.
-
GPTCloneBench: A comprehensive benchmark of semantic clones and cross-language clones using GPT-3 model and SemanticCloneBench
Authors:
Ajmain Inqiad Alam,
Palash Ranjan Roy,
Farouq Al-omari,
Chanchal Kumar Roy,
Banani Roy,
Kevin Schneider
Abstract:
With the emergence of Machine Learning, there has been a surge in leveraging its capabilities for problem-solving across various domains. In the code clone realm, the identification of type-4 or semantic clones has emerged as a crucial yet challenging task. Researchers aim to utilize Machine Learning to tackle this challenge, often relying on the BigCloneBench dataset. However, it's worth noting t…
▽ More
With the emergence of Machine Learning, there has been a surge in leveraging its capabilities for problem-solving across various domains. In the code clone realm, the identification of type-4 or semantic clones has emerged as a crucial yet challenging task. Researchers aim to utilize Machine Learning to tackle this challenge, often relying on the BigCloneBench dataset. However, it's worth noting that BigCloneBench, originally not designed for semantic clone detection, presents several limitations that hinder its suitability as a comprehensive training dataset for this specific purpose. Furthermore, CLCDSA dataset suffers from a lack of reusable examples aligning with real-world software systems, rendering it inadequate for cross-language clone detection approaches. In this work, we present a comprehensive semantic clone and cross-language clone benchmark, GPTCloneBench by exploiting SemanticCloneBench and OpenAI's GPT-3 model. In particular, using code fragments from SemanticCloneBench as sample inputs along with appropriate prompt engineering for GPT-3 model, we generate semantic and cross-language clones for these specific fragments and then conduct a combination of extensive manual analysis, tool-assisted filtering, functionality testing and automated validation in building the benchmark. From 79,928 clone pairs of GPT-3 output, we created a benchmark with 37,149 true semantic clone pairs, 19,288 false semantic pairs(Type-1/Type-2), and 20,770 cross-language clones across four languages (Java, C, C#, and Python). Our benchmark is 15-fold larger than SemanticCloneBench, has more functional code examples for software systems and programming language support than CLCDSA, and overcomes BigCloneBench's qualities, quantification, and language variety limitations.
△ Less
Submitted 1 September, 2023; v1 submitted 26 August, 2023;
originally announced August 2023.
-
Maintaining Plasticity in Continual Learning via Regenerative Regularization
Authors:
Saurabh Kumar,
Henrik Marklund,
Benjamin Van Roy
Abstract:
In continual learning, plasticity refers to the ability of an agent to quickly adapt to new information. Neural networks are known to lose plasticity when processing non-stationary data streams. In this paper, we propose L2 Init, a simple approach for maintaining plasticity by incorporating in the loss function L2 regularization toward initial parameters. This is very similar to standard L2 regula…
▽ More
In continual learning, plasticity refers to the ability of an agent to quickly adapt to new information. Neural networks are known to lose plasticity when processing non-stationary data streams. In this paper, we propose L2 Init, a simple approach for maintaining plasticity by incorporating in the loss function L2 regularization toward initial parameters. This is very similar to standard L2 regularization (L2), the only difference being that L2 regularizes toward the origin. L2 Init is simple to implement and requires selecting only a single hyper-parameter. The motivation for this method is the same as that of methods that reset neurons or parameter values. Intuitively, when recent losses are insensitive to particular parameters, these parameters should drift toward their initial values. This prepares parameters to adapt quickly to new tasks. On problems representative of different types of nonstationarity in continual supervised learning, we demonstrate that L2 Init most consistently mitigates plasticity loss compared to previously proposed approaches.
△ Less
Submitted 3 October, 2023; v1 submitted 23 August, 2023;
originally announced August 2023.
-
Giant conductance of PSS:PEDOT micro-surfaces induced by microbubble lithography
Authors:
Anand Dev Ranjan,
Rakesh Sen,
Sumeet Kumar,
Rahul Vaippully,
Soumya Dutta,
Soumyajit Roy,
Basudev Roy,
Ayan Banerjee
Abstract:
We provide direct evidence of the effects of interface engineering of various substrates by Microbubble lithography (MBL). We choose a model organic plastic (or polymer) poly(3,4-ethylenedioxythiophene) polystyrene sulfonate (PEDOT:PSS), with conductivity of 140 S/cm, as a representative organic system to showcase our technique. Thus, we fabricate permanent patterns of PEDOT:PSS on glass, followed…
▽ More
We provide direct evidence of the effects of interface engineering of various substrates by Microbubble lithography (MBL). We choose a model organic plastic (or polymer) poly(3,4-ethylenedioxythiophene) polystyrene sulfonate (PEDOT:PSS), with conductivity of 140 S/cm, as a representative organic system to showcase our technique. Thus, we fabricate permanent patterns of PEDOT:PSS on glass, followed by a flexible PDMS substrate, and observe conductivity enhancement of 5 times on the former (694 S/cm), and 20 times (2844 S/cm) on the latter, without the use of external do** agents or invasive chemical treatment. Probing the patterned interface, we observe that MBL is able to tune the conformational states of PEDOT:PSS from coils in the pristine form, to extended coils on glass, and almost linear structures in PDMS due to its more malleable liquid-like interface. This results in higher ordering and vanishing grain boundaries leading to the highest conductivity of PEDOT:PSS on PDMS substrates.
△ Less
Submitted 26 July, 2023;
originally announced July 2023.
-
A comparison of machine learning surrogate models of street-scale flooding in Norfolk, Virginia
Authors:
Diana McSpadden,
Steven Goldenberg,
Binata Roy,
Malachi Schram,
Jonathan L. Goodall,
Heather Richter
Abstract:
Low-lying coastal cities, exemplified by Norfolk, Virginia, face the challenge of street flooding caused by rainfall and tides, which strain transportation and sewer systems and can lead to property damage. While high-fidelity, physics-based simulations provide accurate predictions of urban pluvial flooding, their computational complexity renders them unsuitable for real-time applications. Using d…
▽ More
Low-lying coastal cities, exemplified by Norfolk, Virginia, face the challenge of street flooding caused by rainfall and tides, which strain transportation and sewer systems and can lead to property damage. While high-fidelity, physics-based simulations provide accurate predictions of urban pluvial flooding, their computational complexity renders them unsuitable for real-time applications. Using data from Norfolk rainfall events between 2016 and 2018, this study compares the performance of a previous surrogate model based on a random forest algorithm with two deep learning models: Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU). This investigation underscores the importance of using a model architecture that supports the communication of prediction uncertainty and the effective integration of relevant, multi-modal features.
△ Less
Submitted 26 July, 2023;
originally announced July 2023.
-
A Definition of Continual Reinforcement Learning
Authors:
David Abel,
André Barreto,
Benjamin Van Roy,
Doina Precup,
Hado van Hasselt,
Satinder Singh
Abstract:
In a standard view of the reinforcement learning problem, an agent's goal is to efficiently identify a policy that maximizes long-term reward. However, this perspective is based on a restricted view of learning as finding a solution, rather than treating learning as endless adaptation. In contrast, continual reinforcement learning refers to the setting in which the best agents never stop learning.…
▽ More
In a standard view of the reinforcement learning problem, an agent's goal is to efficiently identify a policy that maximizes long-term reward. However, this perspective is based on a restricted view of learning as finding a solution, rather than treating learning as endless adaptation. In contrast, continual reinforcement learning refers to the setting in which the best agents never stop learning. Despite the importance of continual reinforcement learning, the community lacks a simple definition of the problem that highlights its commitments and makes its primary concepts precise and clear. To this end, this paper is dedicated to carefully defining the continual reinforcement learning problem. We formalize the notion of agents that "never stop learning" through a new mathematical language for analyzing and cataloging agents. Using this new language, we define a continual learning agent as one that can be understood as carrying out an implicit search process indefinitely, and continual reinforcement learning as the setting in which the best agents are all continual learning agents. We provide two motivating examples, illustrating that traditional views of multi-task reinforcement learning and continual supervised learning are special cases of our definition. Collectively, these definitions and perspectives formalize many intuitive concepts at the heart of learning, and open new research pathways surrounding continual learning agents.
△ Less
Submitted 1 December, 2023; v1 submitted 20 July, 2023;
originally announced July 2023.
-
On the Convergence of Bounded Agents
Authors:
David Abel,
André Barreto,
Hado van Hasselt,
Benjamin Van Roy,
Doina Precup,
Satinder Singh
Abstract:
When has an agent converged? Standard models of the reinforcement learning problem give rise to a straightforward definition of convergence: An agent converges when its behavior or performance in each environment state stops changing. However, as we shift the focus of our learning problem from the environment's state to the agent's state, the concept of an agent's convergence becomes significantly…
▽ More
When has an agent converged? Standard models of the reinforcement learning problem give rise to a straightforward definition of convergence: An agent converges when its behavior or performance in each environment state stops changing. However, as we shift the focus of our learning problem from the environment's state to the agent's state, the concept of an agent's convergence becomes significantly less clear. In this paper, we propose two complementary accounts of agent convergence in a framing of the reinforcement learning problem that centers around bounded agents. The first view says that a bounded agent has converged when the minimal number of states needed to describe the agent's future behavior cannot decrease. The second view says that a bounded agent has converged just when the agent's performance only changes if the agent's internal state changes. We establish basic properties of these two definitions, show that they accommodate typical views of convergence in standard settings, and prove several facts about their nature and relationship. We take these perspectives, definitions, and analysis to bring clarity to a central idea of the field.
△ Less
Submitted 20 July, 2023;
originally announced July 2023.
-
Artificial Intelligence for the Electron Ion Collider (AI4EIC)
Authors:
C. Allaire,
R. Ammendola,
E. -C. Aschenauer,
M. Balandat,
M. Battaglieri,
J. Bernauer,
M. Bondì,
N. Branson,
T. Britton,
A. Butter,
I. Chahrour,
P. Chatagnon,
E. Cisbani,
E. W. Cline,
S. Dash,
C. Dean,
W. Deconinck,
A. Deshpande,
M. Diefenthaler,
R. Ent,
C. Fanelli,
M. Finger,
M. Finger, Jr.,
E. Fol,
S. Furletov
, et al. (70 additional authors not shown)
Abstract:
The Electron-Ion Collider (EIC), a state-of-the-art facility for studying the strong force, is expected to begin commissioning its first experiments in 2028. This is an opportune time for artificial intelligence (AI) to be included from the start at this facility and in all phases that lead up to the experiments. The second annual workshop organized by the AI4EIC working group, which recently took…
▽ More
The Electron-Ion Collider (EIC), a state-of-the-art facility for studying the strong force, is expected to begin commissioning its first experiments in 2028. This is an opportune time for artificial intelligence (AI) to be included from the start at this facility and in all phases that lead up to the experiments. The second annual workshop organized by the AI4EIC working group, which recently took place, centered on exploring all current and prospective application areas of AI for the EIC. This workshop is not only beneficial for the EIC, but also provides valuable insights for the newly established ePIC collaboration at EIC. This paper summarizes the different activities and R&D projects covered across the sessions of the workshop and provides an overview of the goals, approaches and strategies regarding AI/ML in the EIC community, as well as cutting-edge techniques currently studied in other experiments.
△ Less
Submitted 17 July, 2023;
originally announced July 2023.
-
Continual Learning as Computationally Constrained Reinforcement Learning
Authors:
Saurabh Kumar,
Henrik Marklund,
Ashish Rao,
Yifan Zhu,
Hong Jun Jeon,
Yueyang Liu,
Benjamin Van Roy
Abstract:
An agent that efficiently accumulates knowledge to develop increasingly sophisticated skills over a long lifetime could advance the frontier of artificial intelligence capabilities. The design of such agents, which remains a long-standing challenge of artificial intelligence, is addressed by the subject of continual learning. This monograph clarifies and formalizes concepts of continual learning,…
▽ More
An agent that efficiently accumulates knowledge to develop increasingly sophisticated skills over a long lifetime could advance the frontier of artificial intelligence capabilities. The design of such agents, which remains a long-standing challenge of artificial intelligence, is addressed by the subject of continual learning. This monograph clarifies and formalizes concepts of continual learning, introducing a framework and set of tools to stimulate further research.
△ Less
Submitted 20 August, 2023; v1 submitted 10 July, 2023;
originally announced July 2023.
-
Towards Stirling engine using an optically confined particle subjected to asymmetric temperature profile
Authors:
Gokul Nalupurackal,
Muruga Lokesh,
Sarangi Suresh,
Srestha Roy,
Snigdhadev Chakraborty,
Jayesh Goswami,
Arnab Pal,
Basudev Roy
Abstract:
The realization of microscopic heat engines has gained a surge of research interest in statistical physics, soft matter, and biological physics. A typical microscopic heat engine employs a colloidal particle trapped in a confining potential, which is modulated in time to mimic the cycle operations. Here, we use a lanthanide-doped upconverting particle (UCP) suspended in a passive aqueous bath, whi…
▽ More
The realization of microscopic heat engines has gained a surge of research interest in statistical physics, soft matter, and biological physics. A typical microscopic heat engine employs a colloidal particle trapped in a confining potential, which is modulated in time to mimic the cycle operations. Here, we use a lanthanide-doped upconverting particle (UCP) suspended in a passive aqueous bath, which is highly absorptive at 975 nm and converts NIR photons to visible, as the working substance of the engine. When a single UCP is optically trapped with a 975 nm laser, it behaves like an active particle by executing motion subjected to an asymmetric temperature profile along the direction of propagation of the laser. The strong absorption of 975 nm light by the particle introduces a temperature gradient and results in significant thermophoretic diffusion along the temperature gradient. However, the activity of the particle vanishes when the trap** wavelength is switched to 1064 nm. We carefully regulate the wavelength-dependent activity of the particle to engineer all four cycles of a Stirling engine by using a combination of 1064 nm and 975 nm wavelengths. Since the motion of the particle is stochastic, the work done on the particle due to the stiffness modulation per cycle is random. We provide statistical estimation for this work averaged over 5 cycles which can be extended towards several cycles to make a Stirling engine. Our experiment proposes a robust set-up to systematically harness temperature which is a crucial factor behind building microscopic engines.
△ Less
Submitted 27 June, 2023;
originally announced June 2023.
-
Scalable Neural Contextual Bandit for Recommender Systems
Authors:
Zheqing Zhu,
Benjamin Van Roy
Abstract:
High-quality recommender systems ought to deliver both innovative and relevant content through effective and exploratory interactions with users. Yet, supervised learning-based neural networks, which form the backbone of many existing recommender systems, only leverage recognized user interests, falling short when it comes to efficiently uncovering unknown user preferences. While there has been so…
▽ More
High-quality recommender systems ought to deliver both innovative and relevant content through effective and exploratory interactions with users. Yet, supervised learning-based neural networks, which form the backbone of many existing recommender systems, only leverage recognized user interests, falling short when it comes to efficiently uncovering unknown user preferences. While there has been some progress with neural contextual bandit algorithms towards enabling online exploration through neural networks, their onerous computational demands hinder widespread adoption in real-world recommender systems. In this work, we propose a scalable sample-efficient neural contextual bandit algorithm for recommender systems. To do this, we design an epistemic neural network architecture, Epistemic Neural Recommendation (ENR), that enables Thompson sampling at a large scale. In two distinct large-scale experiments with real-world tasks, ENR significantly boosts click-through rates and user ratings by at least 9% and 6% respectively compared to state-of-the-art neural contextual bandit algorithms. Furthermore, it achieves equivalent performance with at least 29% fewer user interactions compared to the best-performing baseline algorithm. Remarkably, while accomplishing these improvements, ENR demands orders of magnitude fewer computational resources than neural contextual bandit baseline algorithms.
△ Less
Submitted 18 August, 2023; v1 submitted 26 June, 2023;
originally announced June 2023.
-
Toward Sustainable HPC: Carbon Footprint Estimation and Environmental Implications of HPC Systems
Authors:
Baolin Li,
Rohan Basu Roy,
Daniel Wang,
Siddharth Samsi,
Vijay Gadepally,
Devesh Tiwari
Abstract:
The rapid growth in demand for HPC systems has led to a rise in carbon footprint, which requires urgent intervention. In this work, we present a comprehensive analysis of the carbon footprint of high-performance computing (HPC) systems, considering the carbon footprint during both the hardware manufacturing and system operational stages. Our work employs HPC hardware component carbon footprint mod…
▽ More
The rapid growth in demand for HPC systems has led to a rise in carbon footprint, which requires urgent intervention. In this work, we present a comprehensive analysis of the carbon footprint of high-performance computing (HPC) systems, considering the carbon footprint during both the hardware manufacturing and system operational stages. Our work employs HPC hardware component carbon footprint modeling, regional carbon intensity analysis, and experimental characterization of the system life cycle to highlight the importance of quantifying the carbon footprint of HPC systems.
△ Less
Submitted 18 November, 2023; v1 submitted 22 June, 2023;
originally announced June 2023.
-
Neural ShDF: Reviving an Efficient and Consistent Mesh Segmentation Method
Authors:
Bruno Roy
Abstract:
Partitioning a polygonal mesh into meaningful parts can be challenging. Many applications require decomposing such structures for further processing in computer graphics. In the last decade, several methods were proposed to tackle this problem, at the cost of intensive computational times. Recently, machine learning has proven to be effective for the segmentation task on 3D structures. Nevertheles…
▽ More
Partitioning a polygonal mesh into meaningful parts can be challenging. Many applications require decomposing such structures for further processing in computer graphics. In the last decade, several methods were proposed to tackle this problem, at the cost of intensive computational times. Recently, machine learning has proven to be effective for the segmentation task on 3D structures. Nevertheless, these state-of-the-art methods are often hardly generalizable and require dividing the learned model into several specific classes of objects to avoid overfitting. We present a data-driven approach leveraging deep learning to encode a map** function prior to mesh segmentation for multiple applications. Our network reproduces a neighborhood map using our knowledge of the \textsl{Shape Diameter Function} (SDF) method using similarities among vertex neighborhoods. Our approach is resolution-agnostic as we downsample the input meshes and query the full-resolution structure solely for neighborhood contributions. Using our predicted SDF values, we can inject the resulting structure into a graph-cut algorithm to generate an efficient and robust mesh segmentation while considerably reducing the required computation times.
△ Less
Submitted 31 August, 2023; v1 submitted 14 June, 2023;
originally announced June 2023.
-
Light-activated memristor by Au-nanoparticle embedded HfO$_2$-bilayer/p-Si MOS device
Authors:
Ankita Sengupta,
Basudev Nag Chowdhury,
Bodhishatwa Roy,
Biswarup Satpati,
Satyaban Bhunia,
Sanatan Chattopadhyay
Abstract:
The current work proposes a novel scheme for develo** a light-activated non-filamentary memristor device by fabricating an Au-nanoparticle embedded HfO$_2$-bilayer/p-Si MOS structure. Under illumination, the electrons in such embedded Au-nanoparticles are excited from d-level to quantized s-p level and are swept out on application of an appropriate gate bias, leaving behind the holes without rec…
▽ More
The current work proposes a novel scheme for develo** a light-activated non-filamentary memristor device by fabricating an Au-nanoparticle embedded HfO$_2$-bilayer/p-Si MOS structure. Under illumination, the electrons in such embedded Au-nanoparticles are excited from d-level to quantized s-p level and are swept out on application of an appropriate gate bias, leaving behind the holes without recombination. Such photogenerated holes are confined within the nanoparticles and thus screen the external field to lead to a memristive effect in the device. The phenomenon is experimentally observed in the fabricated Pt/HfO$_2$-(layer-II)/Au-NPs/HfO$_2$-(layer-I)/p-Si devices, where such memristive effect is activated/deactivated by light pulses. The memory window and high-to-low resistance ratio of the device are obtained to be ~1 V and ~10, respectively, which suggest the performance of a standard state-of-the-art memristor. Further, the present device offers a voltage-sweep-endurance up to at least 150 cycles and the memory retention up to ~10,000 s. Such a device concept can be extended for a combination of different nanoparticles with various dimensions and dielectric layers to optimize their memristive effect for achieving CMOS-compatible memory devices with superior reliability.
△ Less
Submitted 27 May, 2023;
originally announced June 2023.
-
Hybrid symmetry class topological insulators
Authors:
Sanjib Kumar Das,
Bitan Roy
Abstract:
Traditional topological materials belong to different Altland-Zirnbauer symmetry classes (AZSCs) depending on their non-spatial symmetries. Here we introduce the notion of hybrid symmetry class topological insulators (HSCTIs): A fusion of two different AZSC topological insulators (TIs) such that they occupy orthogonal Cartesian hyperplanes and their universal massive Dirac Hamiltonian mutually ant…
▽ More
Traditional topological materials belong to different Altland-Zirnbauer symmetry classes (AZSCs) depending on their non-spatial symmetries. Here we introduce the notion of hybrid symmetry class topological insulators (HSCTIs): A fusion of two different AZSC topological insulators (TIs) such that they occupy orthogonal Cartesian hyperplanes and their universal massive Dirac Hamiltonian mutually anticommute. The boundaries of HSCTIs can also harbor TIs, typically affiliated with an AZSC different from the parent ones. As such, a fusion between planar quantum spin Hall and vertical Su-Schrieffer-Heeger insulators gives birth to a three-dimensional HSCTI, accommodating quantum anomalous Hall insulators and quantized Hall conductivity on the top and bottom surfaces. We extend this construction to encompass crystalline HSCTI and topological superconductors, and beyond three dimensions. Possible (meta)material platforms to harness HSCTIs are discussed.
△ Less
Submitted 25 May, 2023;
originally announced May 2023.
-
ConGraT: Self-Supervised Contrastive Pretraining for Joint Graph and Text Embeddings
Authors:
William Brannon,
Wonjune Kang,
Suyash Fulay,
Hang Jiang,
Brandon Roy,
Deb Roy,
Jad Kabbara
Abstract:
Learning on text-attributed graphs (TAGs), in which nodes are associated with one or more texts, has been the subject of much recent work. However, most approaches tend to make strong assumptions about the downstream task of interest, are reliant on hand-labeled data, or fail to equally balance the importance of both text and graph representations. In this work, we propose Contrastive Graph-Text p…
▽ More
Learning on text-attributed graphs (TAGs), in which nodes are associated with one or more texts, has been the subject of much recent work. However, most approaches tend to make strong assumptions about the downstream task of interest, are reliant on hand-labeled data, or fail to equally balance the importance of both text and graph representations. In this work, we propose Contrastive Graph-Text pretraining (ConGraT), a general, self-supervised approach for jointly learning separate representations of texts and nodes in a TAG. Our method trains a language model (LM) and a graph neural network (GNN) to align their representations in a common latent space using a batch-wise contrastive learning objective inspired by CLIP. We further propose an extension to the CLIP objective that leverages graph structure to incorporate information about inter-node similarity. Extensive experiments demonstrate that ConGraT outperforms baselines on various downstream tasks, including node and text category classification, link prediction, and language modeling. Finally, we present an application of our method to community detection in social graphs, which enables finding more textually grounded communities, rather than purely graph-based ones. Code and certain datasets are available at https://github.com/wwbrannon/congrat.
△ Less
Submitted 9 July, 2024; v1 submitted 23 May, 2023;
originally announced May 2023.
-
Shattering the Agent-Environment Interface for Fine-Tuning Inclusive Language Models
Authors:
Wanqiao Xu,
Shi Dong,
Dilip Arumugam,
Benjamin Van Roy
Abstract:
A centerpiece of the ever-popular reinforcement learning from human feedback (RLHF) approach to fine-tuning autoregressive language models is the explicit training of a reward model to emulate human feedback, distinct from the language model itself. This reward model is then coupled with policy-gradient methods to dramatically improve the alignment between language model outputs and desired respon…
▽ More
A centerpiece of the ever-popular reinforcement learning from human feedback (RLHF) approach to fine-tuning autoregressive language models is the explicit training of a reward model to emulate human feedback, distinct from the language model itself. This reward model is then coupled with policy-gradient methods to dramatically improve the alignment between language model outputs and desired responses. In this work, we adopt a novel perspective wherein a pre-trained language model is itself simultaneously a policy, reward function, and transition function. An immediate consequence of this is that reward learning and language model fine-tuning can be performed jointly and directly, without requiring any further downstream policy optimization. While this perspective does indeed break the traditional agent-environment interface, we nevertheless maintain that there can be enormous statistical benefits afforded by bringing to bear traditional algorithmic concepts from reinforcement learning. Our experiments demonstrate one concrete instance of this through efficient exploration based on the representation and resolution of epistemic uncertainty. In order to illustrate these ideas in a transparent manner, we restrict attention to a simple didactic data generating process and leave for future work extension to systems of practical scale.
△ Less
Submitted 19 May, 2023;
originally announced May 2023.
-
Magnetic catalysis in weakly interacting hyperbolic Dirac materials
Authors:
Noble Gluscevich,
Btan Roy
Abstract:
Due to linearly vanishing density of states, emergent massless Dirac quasiparticle resulting from the free fermion motion on a family of two-dimensional half-filled bipartite hyperbolic lattices feature dynamic mass generation through quantum phase transitions only for sufficiently strong finite-range Coulomb repulsion. As such, strong nearest-neighbor Coulomb repulsion ($V$) is conducive to the n…
▽ More
Due to linearly vanishing density of states, emergent massless Dirac quasiparticle resulting from the free fermion motion on a family of two-dimensional half-filled bipartite hyperbolic lattices feature dynamic mass generation through quantum phase transitions only for sufficiently strong finite-range Coulomb repulsion. As such, strong nearest-neighbor Coulomb repulsion ($V$) is conducive to the nucleation of a charge-density-wave (CDW) order with a staggered pattern of average fermionic density between two sublattices of bipartite hyperbolic lattices. Considering a collection of spinless fermions (for simplicity), here we show that application of strong external magnetic fields by virtue of producing a \emph{finite} density of states near the zero energy triggers the condensation of the CDW order even for \emph{infinitesimal} $V$. The proposed magnetic catalysis mechanism is operative for uniform as well as inhomogeneous (bell-shaped) magnetic fields. We present scaling of the CDW order with the total flux enclosed by hyperbolic Dirac materials for a wide range of (especially subcritical) $V$.
△ Less
Submitted 18 May, 2023;
originally announced May 2023.
-
Thermo-electric history effects and resistive switching in epitaxial thin film of Mott insulator V2O3
Authors:
Binoy Krishna De,
V. G. Sathe,
S. B. Roy
Abstract:
We report interesting thermo-electric history effects associated with an electric field-induced first order phase transition from Mott-insulator to the metallic state in the epitaxial thin film of V2O3. This phase transition results in tuneable resistive switching in V2O3. These findings are promising for novel technologies like optoelectronics and neuromorphic computing and may lead to highly ene…
▽ More
We report interesting thermo-electric history effects associated with an electric field-induced first order phase transition from Mott-insulator to the metallic state in the epitaxial thin film of V2O3. This phase transition results in tuneable resistive switching in V2O3. These findings are promising for novel technologies like optoelectronics and neuromorphic computing and may lead to highly energy-efficient switching applications of Mott insulators.
△ Less
Submitted 9 May, 2023;
originally announced May 2023.