Superconductivity in three-dimensional interacting doped topological insulators

Abstract: Three-dimensional doped Dirac insulators foster simply connected (in both topological and trivial regimes) and annular (deep inside the topological regime) Fermi surfaces (FSs) in the normal state, and allow on-site repulsions among fermions with opposite spin ($U_1$) and parity ($U_2$) eigenvalues. From an unbiased leading-order (one-loop) renormalization group analysis, controlled by a suitable $ε$ expansion, we show that this system develops strong propensity toward the nucleation of scalar $s$-wave and odd-parity pseudoscalar $p$-wave pairings, favored by repulsive $U_1$ and $U_2$ interactions, respectively, irrespective of the underlying FS topology. Our results can be pertinent for the observed superconductivity in various doped narrow gap semiconductors, and the theoretical foundation can readily be applied to investigate similar phenomenon in various doped topological materials.

Submitted 11 July, 2024; originally announced July 2024.

Comments: 5 Pages, 1 Figure, 2 Tables (Supplemental Material as ancillary file)

arXiv:2407.02177 [pdf, other]

Minsum Problem for Discrete and Weighted Set Flow on Dynamic Path Network

Authors: Bubai Manna, Bodhayan Roy, Vorapong Suppakitpaisarn

Abstract: In this research, we examine the minsum flow problem in dynamic path networks where flows are represented as discrete and weighted sets. The minsum flow problem has been widely studied for its relevance in finding evacuation routes during emergencies such as earthquakes. However, previous approaches often assume that individuals are separable and identical, which does not adequately account for the fact that some groups of people, such as families, need to move together and that some groups may be more important than others. To address these limitations, we modify the minsum flow problem to support flows represented as discrete and weighted sets. We also propose a 2-approximation pseudo-polynomial time algorithm to solve this modified problem for path networks with uniform capacity.

Submitted 2 July, 2024; originally announced July 2024.

arXiv:2407.01456 [pdf, other]

Information-Theoretic Foundations for Neural Scaling Laws

Authors: Hong Jun Jeon, Benjamin Van Roy

Abstract: Neural scaling laws aim to characterize how out-of-sample error behaves as a function of model and training dataset size. Such scaling laws guide allocation of a computational resources between model and data processing to minimize error. However, existing theoretical support for neural scaling laws lacks rigor and clarity, entangling the roles of information and optimization. In this work, we develop rigorous information-theoretic foundations for neural scaling laws. This allows us to characterize scaling laws for data generated by a two-layer neural network of infinite width. We observe that the optimal relation between data and model size is linear, up to logarithmic factors, corroborating large-scale empirical investigations. Concise yet general results of the kind we establish may bring clarity to this topic and inform future investigations.

Submitted 27 June, 2024; originally announced July 2024.

Comments: arXiv admin note: text overlap with arXiv:2212.01365

arXiv:2406.16209 [pdf, other]

Covering Simple Orthogonal Polygons with Rectangles

Authors: Aniket Basu Roy

Abstract: We study the problem of Covering Orthogonal Polygons with Rectangles. For polynomial-time algorithms, the best-known approximation factor is $O(\sqrt{\log n})$ when the input polygon may have holes [Kumar and Ramesh, STOC '99, SICOMP '03], and there is a $2$-factor approximation algorithm known when the polygon is hole-free [Franzblau, SIDMA '89]. Arguably, an easier problem is the Boundary Cover problem where we are interested in covering only the boundary of the polygon in contrast to the original problem where we are interested in covering the interior of the polygon, hence it is also referred as the Interior Cover problem. For the Boundary Cover problem, a $4$-factor approximation algorithm is known to exist and it is APX-hard when the polygon has holes [Berman and DasGupta, Algorithmica '94]. In this work, we investigate how effective is local search algorithm for the above covering problems on simple polygons. We prove that a simple local search algorithm yields a PTAS for the Boundary Cover problem when the polygon is simple. Our proof relies on the existence of planar supports on appropriate hypergraphs defined on the Boundary Cover problem instance. On the other hand, we construct instances where support graphs for the Interior Cover problem have arbitrarily large bicliques, thus implying that the same local search technique cannot yield a PTAS for this problem. We also show large locality gap for its dual problem, namely the Maximum Antirectangle problem.

Submitted 23 June, 2024; originally announced June 2024.

Comments: 29 pages, 19 figures

arXiv:2406.10899 [pdf, other]

Notes on heating phase dynamics in Floquet CFTs and Modular quantization

Authors: Suchetan Das, Bobby Ezhuthachan, Somnath Porey, Baishali Roy

Abstract: In this article, we explore the connection between the heating phase of periodically driven CFTs and the Modular Hamiltonian of a subregion in the vacuum state. We show that the heating phase Hamiltonian corresponds to the Modular Hamiltonian, with the fixed points mapping to the endpoints of the subregion. In the bulk dual, we find that these fixed points correspond to the Ryu-Takayanagi surface of the AdS-Rindler wedge. Consequently, the entanglement entropy associated to the boundary interval within two fixed points exactly matches with the Rindler entropy of AdS-Rindler. We observe the emergent Virasoro algebra in the boundary quantization of the Modular Hamiltonian has a striking similarity with the emergent near Horizon Virasoro algebra. This is a consequence of the fact that while obtaining the boundary Virasoro algebra, a cut-off with conformal boundary condition around the fixed point is introduced, which in the bulk is related to a stretched horizon, with an emergent two-dimensional conformal symmetry. We also argue that as one tunes the parameter space of Floquet Hamiltonians to transition from the non-heating to the heating phase the operator algebra type changes from Von Neumann type $I$ to $III_1$ factor, providing a non-equilibrium analogue of the Hawking-Page transition.

Submitted 11 July, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

Comments: 22 pages, 3 figures, references added and typos corrected

arXiv:2406.08188 [pdf, other]

doi 10.1145/3641234.3671085

Attention-Based Learning for Fluid State Interpolation and Editing in a Time-Continuous Framework

Authors: Bruno Roy

Abstract: In this work, we introduce FluidsFormer: a transformer-based approach for fluid interpolation within a continuous-time framework. By combining the capabilities of PITT and a residual neural network (RNN), we analytically predict the physical properties of the fluid state. This enables us to interpolate substep frames between simulated keyframes, enhancing the temporal smoothness and sharpness of animations. We demonstrate promising results for smoke interpolation and conduct initial experiments on liquids.

Submitted 12 June, 2024; originally announced June 2024.

Comments: 5 pages, 3 figures, submitted and accepted to SIGGRAPH

arXiv:2406.05772 [pdf, ps, other]

Moving Mirrors, OTOCs and Scrambling

Authors: Parthajit Biswas, Bobby Ezhuthachan, Arnab Kundu, Baishali Roy

Abstract: We explore the physics of scrambling in the moving mirror models, in which a two-dimensional CFT is subjected to a time-dependent boundary condition. It is well-known that by choosing an appropriate mirror profile, one can model quantum aspects of black holes in two-dimensions, ranging from Hawking radiation in an eternal black hole (for an "escaping mirror") to the recent realization of Page curve in evaporating black holes (for a "kink mirror"). We explore a class of OTOCs in the presence of such a boundary and explicitly demonstrate the following primary aspects: First, we show that the dynamical CFT data directly affect an OTOC and maximally chaotic scrambling occurs for the escaping mirror for a large-$c$ CFT with identity block dominance. We further show that the exponential growth of OTOC associated with the physics of scrambling yields a power-law growth in the model for evaporating black holes which demonstrates a unitary dynamics in terms of a Page curve. We also demonstrate that, by tuning a parameter, one can naturally interpolate between an exponential growth associated to scrambling and a power-law growth in unitary dynamics. Our work explicitly exhibits the role of higher-point functions in CFT dynamics as well as the distinction between scrambling and Page curve. We also discuss several future possibilities based on this class of models.

Submitted 9 June, 2024; originally announced June 2024.

Comments: 30 pages, 6 figures

arXiv:2404.16329 [pdf, other]

On Approximating the Dynamic and Discrete Network Flow Problem

Authors: Bubai Manna, Bodhayan Roy, Vorapong Suppakitpaisarn

Abstract: We examine the dynamic network flow problem under the assumption that the flow consists of discrete units. The dynamic network flow problem is commonly addressed in the context of developing evacuation plans, where the flow is typically treated as a continuous quantity. However, real-world scenarios often involve moving groups, such as families, as single units. We demonstrate that solving the dynamic flow problem with this consideration is APX-hard. Conversely, we present a PTAS for instances where the base graph is a path with a constant number of nodes. We introduce a `ready time' constraint to the minsum bin packing problem, meaning certain items cannot be placed in specific bins, develop a PTAS for this modified problem, and apply our algorithms to the discrete and dynamic flow problem.

Submitted 25 April, 2024; originally announced April 2024.

arXiv:2404.15487 [pdf, other]

Minimum Consistent Subset in Trees and Interval Graphs

Authors: Aritra Banik, Sayani Das, Anil Maheshwari, Bubai Manna, Subhas C Nandy, Krishna Priya K M, Bodhayan Roy, Sasanka Roy, Abhishek Sahu

Abstract: In the Minimum Consistent Subset (MCS) problem, we are presented with a connected simple undirected graph $G=(V,E)$, consisting of a vertex set $V$ of size $n$ and an edge set $E$. Each vertex in $V$ is assigned a color from the set $\{1,2,\ldots, c\}$. The objective is to determine a subset $V' \subseteq V$ with minimum possible cardinality, such that for every vertex $v \in V$, at least one of its nearest neighbors in $V'$ (measured in terms of the hop distance) shares the same color as $v$. The decision problem, indicating whether there exists a subset $V'$ of cardinality at most $l$ for some positive integer $l$, is known to be NP-complete even for planar graphs. In this paper, we establish that the MCS problem for trees, when the number of colors $c$ is considered an input parameter, is NP-complete. We propose a fixed-parameter tractable (FPT) algorithm for MCS on trees running in $O(2^{6c}n^6)$ time, significantly improving the currently best-known algorithm whose running time is $O(2^{4c}n^{2c+3})$. In an effort to comprehensively understand the computational complexity of the MCS problem across different graph classes, we extend our investigation to interval graphs. We show that it remains NP-complete for interval graphs, thus enriching graph classes where MCS remains intractable.

Submitted 23 April, 2024; originally announced April 2024.

arXiv:2404.15446 [pdf, other]

OffRAMPS: An FPGA-based Intermediary for Analysis and Modification of Additive Manufacturing Control Systems

Authors: Jason Blocklove, Md Raz, Prithwish Basu Roy, Hammond Pearce, Prashanth Krishnamurthy, Farshad Khorrami, Ramesh Karri

Abstract: Cybersecurity threats in Additive Manufacturing (AM) are an increasing concern as AM adoption continues to grow. AM is now being used for parts in the aerospace, transportation, and medical domains. Threat vectors which allow for part compromise are particularly concerning, as any failure in these domains would have life-threatening consequences. A major challenge to investigation of AM part-compromises comes from the difficulty in evaluating and benchmarking both identified threat vectors as well as methods for detecting adversarial actions. In this work, we introduce a generalized platform for systematic analysis of attacks against and defenses for 3D printers. Our "OFFRAMPS" platform is based on the open-source 3D printer control board "RAMPS." OFFRAMPS allows analysis, recording, and modification of all control signals and I/O for a 3D printer. We show the efficacy of OFFRAMPS by presenting a series of case studies based on several Trojans, including ones identified in the literature, and show that OFFRAMPS can both emulate and detect these attacks, i.e., it can both change and detect arbitrary changes to the g-code print commands.

Submitted 23 April, 2024; originally announced April 2024.

arXiv:2404.01087 [pdf, other]

$\mathbb{A}^1$-homotopy type of $\mathbb{A}^2 \setminus \left\{(0,0) \right\}$

Authors: Utsav Choudhury, Biman Roy

Abstract: In this article we prove that any $\mathbb{A}^1$-connected smooth $k$-variety is $\mathbb{A}^1$-uniruled for any algebraically closed field $k$. We establish that if a non empty open subscheme $X$ of a smooth affine $k$-scheme is $\mathbb{A}^1$-weakly equivalent to $\mathbb{A}^2_{k} \setminus \left\{(0,0) \right\}$, then $X \cong \mathbb{A}^2_{k} \setminus \left\{(0,0) \right\}$ as $k$-varieties for any field $k$ of characteristic $0$.

Submitted 1 April, 2024; originally announced April 2024.

MSC Class: 14F42; 19E15

arXiv:2403.14620 [pdf, other]

From Local Spin Nematicity to Altermagnets: Footprints of Band Topology

Authors: Sanjib Kumar Das, Bitan Roy

Abstract: Altermagnets are crystallographic rotational symmetry breaking spin-ordered states, possessing a net zero magnetization despite manifesting Kramers non-degenerate bands. Here, we show that momentum-independent local spin nematic orders in monolayer, Bernal bilayer and rhombohedral trilayer graphene give rise to $p$-wave, $d$-wave and $f$-wave altermagnets, respectively, thereby inheriting topology of linear, quadratic and cubic free fermion band dispersions that are also described in terms of angular momentum $\ell=1,\; 2$ and $3$ harmonics in the reciprocal space. The same conclusions also hold inside a spin-triplet nematic superconductor, featuring Majorana altermagnets. Altogether, these findings highlight the importance of electronic band structure in identifying such exotic magnetic orders in quantum materials. We depict the effects of in-plane magnetic fields on altermagnets, and propose novel spin-disordered alter-valleymagnets in these systems.

Submitted 8 April, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

Comments: 6 Pages and 1 Figure: Modified Title, Streamlined Presentation (Supplemental Material as ancillary file)

arXiv:2402.18549 [pdf, other]

Stabilizing topological superconductivity in disordered spin-orbit coupled semiconductor-superconductor heterostructures

Authors: Binayyak B. Roy, Rimika Jaiswal, Tudor D. Stanescu, Sumanta Tewari

Abstract: We investigate theoretically a one-dimensional semiconductor-superconductor (SM-SC) heterostructure with Rashba spin-orbit coupling and parallel Zeeman field in the presence of disorder generated by random charged impurities and identify the optimal regimes for realizing topological superconductivity and Majorana zero modes. Using a Green's function approach, we show that upon increasing the disorder strength the stable topological superconducting phase characterized by robust end-to-end Majorana correlations "migrates" toward larger values of the Zeeman field and can be stabilized by increasing the effective SM-SC coupling. Based on these findings, we propose a strategy for accessing a regime characterized by well-separated Majorana zero modes that is based on (a) enhancing the strength of the effective SM-SC coupling (e.g., through interface engineering) and (b) expanding the range of accessible Zeeman fields (e.g., by enhancing the gyromagnetic ratio or optimizing the parent superconductor, to enable the application of larger magnetic fields). While this strategy may still require some reduction of the disorder strength, this requirement is significantly less strict than the corresponding requirement in a strategy that focuses exclusively on disorder reduction.

Submitted 29 February, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

arXiv:2402.15835 [pdf, other]

Krylov Complexity in $2d$ CFTs with SL$(2,\mathbb{R})$ deformed Hamiltonians

Authors: Vinay Malvimat, Somnath Porey, Baishali Roy

Abstract: In this study, we analyze Krylov Complexity in two-dimensional conformal field theories subjected to deformed SL$(2,\mathbb{R})$ Hamiltonians. In the vacuum state, we find that the K-complexity exhibits a universal phase structure. The phase structure involves the K-complexity exhibiting an oscillatory behaviour in the non-heating phase, which contrasts with the exponential growth observed in the heating phase, while it displays polynomial growth at the phase boundary. Furthermore, we extend our analysis to compute the K-complexity of a light operator in excited states, considering both large-c CFT and free field theory. In the free field theory, we find a state-independent phase structure of K-complexity. However, in the large-c CFT, the behavior varies, with the K-Complexity once again displaying exponential growth in the heating phase and polynomial growth at the phase boundary. Notably, the precise exponent governing this growth depends on the heaviness of the state under examination. In the non-heating phase, we observe a transition in K-complexity behavior from oscillatory to exponential growth, akin to findings in [1], as it represents a special case within the non-heating phase.

Submitted 24 February, 2024; originally announced February 2024.

Comments: 26 pages, 13 figures

arXiv:2402.00396 [pdf, other]

Efficient Exploration for LLMs

Authors: Vikranth Dwaracherla, Seyed Mohammad Asghari, Botao Hao, Benjamin Van Roy

Abstract: We present evidence of substantial benefit from efficient exploration in gathering human feedback to improve large language models. In our experiments, an agent sequentially generates queries while fitting a reward model to the feedback received. Our best-performing agent generates queries using double Thompson sampling, with uncertainty represented by an epistemic neural network. Our results demonstrate that efficient exploration enables high levels of performance with far fewer queries. Further, both uncertainty estimation and the choice of exploration scheme play critical roles.

Submitted 4 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

Comments: Accepted at ICML 2024

arXiv:2401.15530 [pdf, ps, other]

An Information-Theoretic Analysis of In-Context Learning

Authors: Hong Jun Jeon, Jason D. Lee, Qi Lei, Benjamin Van Roy

Abstract: Previous theoretical results pertaining to meta-learning on sequences build on contrived assumptions and are somewhat convoluted. We introduce new information-theoretic tools that lead to an elegant and very general decomposition of error into three components: irreducible error, meta-learning error, and intra-task error. These tools unify analyses across many meta-learning challenges. To illustrate, we apply them to establish new results about in-context learning with transformers. Our theoretical results characterizes how error decays in both the number of training sequences and sequence lengths. Our results are very general; for example, they avoid contrived mixing time assumptions made by all prior results that establish decay of error with sequence length.

Submitted 27 January, 2024; originally announced January 2024.

arXiv:2401.13239 [pdf, other]

Adaptive Crowdsourcing Via Self-Supervised Learning

Authors: Anmol Kagrecha, Henrik Marklund, Benjamin Van Roy, Hong Jun Jeon, Richard Zeckhauser

Abstract: Common crowdsourcing systems average estimates of a latent quantity of interest provided by many crowdworkers to produce a group estimate. We develop a new approach -- predict-each-worker -- that leverages self-supervised learning and a novel aggregation scheme. This approach adapts weights assigned to crowdworkers based on estimates they provided for previous quantities. When skills vary across crowdworkers or their estimates correlate, the weighted sum offers a more accurate group estimate than the average. Existing algorithms such as expectation maximization can, at least in principle, produce similarly accurate group estimates. However, their computational requirements become onerous when complex models, such as neural networks, are required to express relationships among crowdworkers. Predict-each-worker accommodates such complexity as well as many other practical challenges. We analyze the efficacy of predict-each-worker through theoretical and computational studies. Among other things, we establish asymptotic optimality as the number of engagements per crowdworker grows.

Submitted 1 February, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

Comments: 33 pages, 3 figures

arXiv:2401.07628 [pdf, other]

doi 10.1103/PhysRevC.109.014610

Fusion of $^{7}$Li with $^{205}$Tl at near barrier energies

Authors: V. V. Parkar, Prasanna M., Ruchi Rathod, V. Jha, S. K. Pandit, A. Shrivastava, K. Mahata, K. Ramachandran, R. Palit, Md. S. R. Laskar, B. J. Roy, Bhushan Kanagalekar, B. G. Hegde

Abstract: The complete and incomplete fusion cross sections for the $^{7}$Li+$^{205}$Tl reaction were measured at near barrier energies by online characteristic $γ$ ray detection technique. The complete fusion (CF) cross sections at energies above the Coulomb barrier were found to be suppressed by $\sim$ 26 \% compared to the coupled channel calculations. Reduced fusion cross sections for the present system at energies normalised to the Coulomb barrier were also found to be systematically lower than those with strongly bound projectiles forming a similar compound nucleus. The suppression observed in CF cross sections is found to be commensurate with the measured total incomplete fusion (ICF) cross sections. In the ICF cross sections, t capture is found to be dominant than $α$ capture at all the measured energies. The systematic study of available CF, ICF and total fusion (TF) data with $^7$Li projectile is performed.

Submitted 15 January, 2024; originally announced January 2024.

Comments: 9 pages, 7 figures. arXiv admin note: text overlap with arXiv:1801.06996

Journal ref: Phys. Rev. C 109, 014610 (2024)

arXiv:2401.04318 [pdf, ps, other]

Contiguous Allocation of Indivisible Items on a Path

Authors: Yasushi Kawase, Bodhayan Roy, Mohammad Azharuddin Sanpui

Abstract: We study the problem of allocating indivisible items on a path among agents. The objective is to find a fair and efficient allocation in which each agent's bundle forms a contiguous block on the line. We demonstrate that, even when the valuations are binary additive, deciding whether every item can be allocated to an agent who wants it is NP-complete. Consequently, we provide two fixed-parameter tractable (FPT) algorithms for maximizing utilitarian social welfare, with respect to the number of agents and the number of items. Additionally, we present a 2-approximation algorithm for the special case when the valuations are binary additive and the maximum utility is equal to the number of items. Furthermore, we establish that deciding whether the maximum egalitarian social welfare is at least 2 or at most 1 is NP-complete, even when the valuations are binary additive. We also explore the case where the order of the blocks of items allocated to the agents is predetermined. In this case, we show that both maximum utilitarian social welfare and egalitarian social welfare can be computed in polynomial time. However, we determine that checking the existence of an EF1 allocation is NP-complete, even when the valuations are binary additive.

Submitted 8 January, 2024; originally announced January 2024.

Comments: A preliminary version was accepted at AAMAS 2024 as an extended abstract

arXiv:2401.00782 [pdf, other]

Dynamical stability and phase space analysis of an Emergent Universe with non-interacting and interacting fluids

Authors: Bikash Chandra Roy, Anirban Chanda, Bikash Chandra Paul

Abstract: We investigate the evolution of a flat Emergent Universe obtained with a non-linear equation of state (nEoS) in Einstein's general theory of Relativity. The nEoS is equivalent to three different types of barotropic cosmic fluids, which are found from the nEoS parameter. The EU began expanding initially with no interaction among the cosmic fluids. Assuming an interaction that sets in at a time $t \geq t_i$ in the fluid components, we study the evolution of the EU that leads to the present observed universe. We adopt a dynamical system analysis method to obtain the critical points of the autonomous system for studying the evolution of an EU with or without interaction in fluid components. We also study the stability of critical points and draw the phase portraits. The density parameters and the corresponding cosmological parameters are obtained for both the non-interacting and interacting phases of the evolution dynamics.

Submitted 5 January, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

Comments: 11 pages, 4 figures

arXiv:2312.01057 [pdf, other]

RLHF and IIA: Perverse Incentives

Authors: Wanqiao Xu, Shi Dong, Xiuyuan Lu, Grace Lam, Zheng Wen, Benjamin Van Roy

Abstract: Existing algorithms for reinforcement learning from human feedback (RLHF) can incentivize responses at odds with preferences because they are based on models that assume independence of irrelevant alternatives (IIA). The perverse incentives induced by IIA hinder innovations on query formats and learning algorithms.

Submitted 1 February, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

arXiv:2311.07331 [pdf, other]

Geometric Tracking Control of a Multi-rotor UAV for Partially Known Trajectories

Authors: Yogesh Kumar, S. B. Roy, P. B. Sujit

Abstract: This paper presents a trajectory-tracking controller for multi-rotor unmanned aerial vehicles (UAVs) in scenarios where only the desired position and heading are known without the higher-order derivatives. The proposed solution modifies the state-of-the-art geometric controller, effectively addressing challenges related to the non-existence of the desired attitude and ensuring positive total thrust input for all time. We tackle the additional challenge of the non-availability of the higher derivatives of the trajectory by introducing novel nonlinear filter structures. We formalize theoretically the effect of these filter structures on the system error dynamics. Subsequently, through a rigorous theoretical analysis, we demonstrate that the proposed controller leads to uniformly ultimately bounded system error dynamics.

Submitted 13 November, 2023; originally announced November 2023.

arXiv:2311.04581 [pdf, other]

KiD: A Hardware Design Framework Targeting Unified NTT Multiplication for CRYSTALS-Kyber and CRYSTALS-Dilithium on FPGA

Authors: Suraj Mandal, Debapriya Basu Roy

Abstract: Large-degree polynomial multiplication is an integral component of post-quantum secure lattice-based cryptographic algorithms like CRYSTALS-Kyber and Dilithium. The computational complexity of large-degree polynomial multiplication can be reduced significantly through Number Theoretic Transformation (NTT). In this paper, we aim to develop a unified and shared NTT architecture that can support polynomial multiplication for both CRYSTALS-Kyber and Dilithium. More specifically, in this paper, we have proposed three different unified architectures for NTT multiplication in CRYSTALS-Kyber and Dilithium with varying numbers of configurable radix-2 butterfly units. Additionally, the developed implementation is coupled with a conflict-free memory mapping scheme that allows the architecture to be fully pipelined. We have validated our implementation on Artix-7, Zynq-7000 and Zynq Ultrascale+ FPGAs. Our standalone implementations for NTT multiplication for CRYSTALS-Kyber and Dilithium perform better than the existing works, and our unified architecture shows excellent area and timing performance compared to both standalone and existing unified implementations. This architecture can potentially be used for compact and efficient implementation for CRYSTALS-Kyber and Dilithium.

Submitted 8 November, 2023; originally announced November 2023.

arXiv:2310.07786 [pdf, other]

Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling

Authors: Zheqing Zhu, Yueyang Liu, Xu Kuang, Benjamin Van Roy

Abstract: Real-world applications of contextual bandits often exhibit non-stationarity due to seasonality, serendipity, and evolving social trends. While a number of non-stationary contextual bandit learning algorithms have been proposed in the literature, they excessively explore due to a lack of prioritization for information of enduring value, or are designed in ways that do not scale in modern applications with high-dimensional user-specific features and large action set, or both. In this paper, we introduce a novel non-stationary contextual bandit algorithm that addresses these concerns. It combines a scalable, deep-neural-network-based architecture with a carefully designed exploration mechanism that strategically prioritizes collecting information with the most lasting value in a non-stationary environment. Through empirical evaluations on two real-world recommendation datasets, which exhibit pronounced non-stationarity, we demonstrate that our approach significantly outperforms the state-of-the-art baselines.

Submitted 14 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

arXiv:2309.12310 [pdf, other]

Model non-Hermitian topological operators without skin effect

Authors: Daniel J. Salib, Sanjib Kumar Das, Bitan Roy

Abstract: We propose a general principle of constructing non-Hermitian (NH) operators for insulating and gapless topological phases in any dimension ($d$) that over an extended NH parameter regime feature real eigenvalues and zero-energy topological boundary modes, when in particular their Hermitian cousins are also topological. However, the topological zero modes disappear when the NH operators accommodate complex eigenvalues. These systems are always devoid of NH skin effects, thereby extending the realm of the bulk-boundary correspondence to NH systems in terms of solely the left or right zero-energy boundary localized eigenmodes. We showcase these general and robust outcomes for NH topological insulators in $d=1,2$ and $3$, encompassing their higher-order incarnations, as well as for NH topological Dirac, Weyl and nodal-loop semimetals. Possible realizations of proposed NH topological phases in designer materials, optical lattices and classical metamaterials are highlighted.

Submitted 21 September, 2023; originally announced September 2023.

Comments: 8 Pages, 5 Figures

arXiv:2309.09158 [pdf, other]

Observational constraints on the Emergent Universe with interacting non-linear fluids and its stability analysis

Authors: Anirban Chanda, Bikash Chandra Roy, Kazuharu Bamba, Bikash Chandra Paul

Abstract: We investigate a flat Emergent Universe (EU) with a nonlinear equation of state which is equivalent to three different compositions of fluids. In the EU, initially, the evolution of the universe began with no interaction, but as time evolves, an interaction sets in among the three fluids leading to the observed universe. The characteristic of an EU is that it is a singularity-free universe that evolves with all the basic features of the early evolution. A given nonlinear equation of state parameter permits a universe with three different fluids. We get a universe with dark energy, cosmic string, and radiation domination to begin with, which at a later epoch transits into a universe with three different fluids with matter domination, dark matter, and dark energy for a given interaction strength among the cosmic fluids. Later the model parameters are constrained using the observed Hubble data and Type Ia Supernova (SnIa) data from the Pantheon data set. The classical stability analysis of the model is performed using the square speed of sound. It is found that a theoretically stable cosmological model can be obtained in this case, however, the model becomes classically unstable at the present epoch when the observational bounds on the model parameters are taken into account.

Submitted 17 September, 2023; originally announced September 2023.

Comments: 16 pages, 9 figures

Report number: FU-PCG-123

arXiv:2309.07916 [pdf, other]

doi 10.1007/JHEP01(2024)143

Quantum Electrodynamics of Non-Hermitian Dirac Fermions

Authors: Sk Asrap Murshed, Bitan Roy

Abstract: We develop an effective quantum electrodynamics for non-Hermitian (NH) Dirac materials interacting with photons. These systems are described by nonspatial symmetry protected Lorentz invariant NH Dirac operators, featuring two velocity parameters $v_{_{\rm H}}$ and $v_{_{\rm NH}}$ associated with the standard Hermitian and a masslike anti-Hermitian Dirac operators, respectively. They display linear energy-momentum relation, however, in terms of an effective Fermi velocity $v_{_{\rm F}}=\sqrt{v^2_{_{\rm H}}-v^2_{_{\rm NH}}}$ of NH Dirac fermions. Interaction with the fluctuating electromagnetic radiation then gives birth to an emergent Lorentz symmetry in this family of NH Dirac materials in the deep infrared regime, where the system possesses a unique terminal velocity $v_{_{\rm F}}=c$, with $c$ being the speed of light. While in two dimensions such a terminal velocity is set by the speed of light in the free space, dynamic screening in three spatial dimensions permits its nonuniversal values. Manifestations of such an emergent spacetime symmetry on the scale dependence of various physical observables in correlated NH Dirac materials are discussed.

Submitted 25 January, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

Comments: 19 Pages, 3 Figures: Published version in Journal of High Energy Physics

Journal ref: J. High Energ. Phys. 2024, 143 (2024)
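
The effective Fermi velocity quoted in the abstract above, $v_{_{\rm F}}=\sqrt{v^2_{_{\rm H}}-v^2_{_{\rm NH}}}$, can be evaluated numerically with a few lines of Python. This is only an illustrative sketch of that one formula, not code from the paper; the function name `effective_fermi_velocity` and the sample velocity values are assumptions chosen for the example.

```python
import cmath


def effective_fermi_velocity(v_h: float, v_nh: float) -> complex:
    """Evaluate v_F = sqrt(v_H^2 - v_NH^2) from the abstract.

    The function name and signature are hypothetical. A complex square
    root is used so that the case |v_NH| > |v_H| returns a purely
    imaginary value instead of raising an error.
    """
    return cmath.sqrt(v_h ** 2 - v_nh ** 2)


# Sample values in arbitrary units (illustrative only).
v_real = effective_fermi_velocity(1.0, 0.6)  # |v_NH| < |v_H|: real result
v_imag = effective_fermi_velocity(0.6, 1.0)  # |v_NH| > |v_H|: imaginary result
print(v_real, v_imag)
```

Returning a complex number keeps both regimes of the formula in one code path; a caller interested only in the real regime could check that the imaginary part is zero.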

arXiv:2309.07291 [pdf]

Reusability Challenges of Scientific Workflows: A Case Study for Galaxy

Authors: Khairul Alam, Banani Roy, Alexander Serebrenik

Abstract: Scientific workflow has become essential in software engineering because it provides a structured approach to designing, executing, and analyzing scientific experiments. Software developers and researchers have developed hundreds of scientific workflow management systems so scientists in various domains can benefit from them by automating repetitive tasks, enhancing collaboration, and ensuring the reproducibility of their results. However, even for expert users, workflow creation is a complex task due to the dramatic growth of tools and data heterogeneity. Thus, scientists attempt to reuse existing workflows shared in workflow repositories. Unfortunately, several challenges prevent scientists from reusing those workflows. In this study, we thus first attempted to identify those reusability challenges. We also offered an action list and evidence-based guidelines to promote the reusability of scientific workflows. Our intensive manual investigation examined the reusability of existing workflows and exposed several challenges. The challenges preventing reusability include tool upgrading, tool support unavailability, design flaws, incomplete workflows, failure to load a workflow, etc. Such challenges and our action list offered guidelines to future workflow composers to create better workflows with enhanced reusability. In the future, we plan to develop a recommender system using reusable workflows that can assist scientists in creating effective and error-free workflows.

Submitted 13 September, 2023; originally announced September 2023.

Comments: Accepted in APSEC 2023

arXiv:2309.06424 [pdf]

Unveiling the potential of large language models in generating semantic and cross-language clones

Authors: Palash R. Roy, Ajmain I. Alam, Farouq Al-omari, Banani Roy, Chanchal K. Roy, Kevin A. Schneider

Abstract: Semantic and Cross-language code clone generation may be useful for code reuse, code comprehension, refactoring and benchmarking. OpenAI's GPT model has potential in such clone generation as GPT is used for text generation. When developers copy/paste codes from Stack Overflow (SO) or within a system, there might be inconsistent changes leading to unexpected behaviours. Similarly, if someone possesses a code snippet in a particular programming language but seeks equivalent functionality in a different language, a semantic cross-language code clone generation approach could provide valuable assistance. In this study, using SemanticCloneBench as a vehicle, we evaluated how well the GPT-3 model could help generate semantic and cross-language clone variants for a given fragment. We have comprised a diverse set of code fragments and assessed GPT-3's performance in generating code variants. Through extensive experimentation and analysis, where 9 judges spent 158 hours to validate, we investigate the model's ability to produce accurate and semantically correct variants. Our findings shed light on GPT-3's strengths in code generation, offering insights into the potential applications and challenges of using advanced language models in software development. Our quantitative analysis yields compelling results. In the realm of semantic clones, GPT-3 attains an impressive accuracy of 62.14% and 0.55 BLEU score, achieved through few-shot prompt engineering. Furthermore, the model shines in transcending linguistic confines, boasting an exceptional 91.25% accuracy in generating cross-language clones.

Submitted 12 September, 2023; originally announced September 2023.

Comments: Accepted in IWSC

arXiv:2309.05550 [pdf, other]

Multiplierless Design of High-Speed Very Large Constant Multiplications

Authors: Levent Aksoy, Debapriya Basu Roy, Malik Imran, Samuel Pagliarini

Abstract: In cryptographic algorithms, the constants to be multiplied by a variable can be very large due to security requirements. Thus, the hardware complexity of such algorithms heavily depends on the design architecture handling large constants. In this paper, we introduce an electronic design automation tool, called LEIGER, which can automatically generate the realizations of very large constant multiplications for low-complexity and high-speed applications, targeting the ASIC design platform. LEIGER can utilize the shift-adds architecture and use 3-input operations, i.e., carry-save adders (CSAs), where the number of CSAs is reduced using a prominent optimization algorithm. It can also generate constant multiplications under a hybrid design architecture, where 2- and 3-input operations are used at different stages. Moreover, it can describe constant multiplications under a design architecture using compressor trees. As a case study, high-speed Montgomery multiplication, which is a fundamental operation in cryptographic algorithms, is designed with its constant multiplication block realized under the proposed architectures. Experimental results indicate that LEIGER enables a designer to explore the trade-off between area and delay of the very large constant and Montgomery multiplications and leads to designs with area-delay product, latency, and energy consumption values significantly better than those obtained by a recently proposed algorithm.

Submitted 12 September, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

arXiv:2308.16908 [pdf, other]

doi 10.1103/PhysRevB.109.195403

Quantized thermal and spin transports of dirty planar topological superconductors

Authors: Sanjib Kumar Das, Bitan Roy

Abstract: Nontrivial bulk topological invariants of quantum materials can leave their signatures on charge, thermal and spin transports. In two dimensions, their imprints can be experimentally measured from well-developed multiterminal Hall bar arrangements. Here, we numerically compute the low temperature ($T$) thermal ($κ_{xy}$) and zero temperature spin ($σ^{sp}_{xy}$) Hall conductivities, and longitudinal thermal conductance ($G^{th}_{xx}$) of various prominent two-dimensional fully gapped topological superconductors, belonging to distinct Altland-Zirnbauer symmetry classes, namely $p+ip$ (class D), $d+id$ (class C) and $p \pm ip$ (class DIII) paired states, in mesoscopic six-terminal Hall bar setups from the scattering matrix formalism using Kwant. In both clean and weak disorder limits, the time-reversal symmetry breaking $p+ip$ and $d+id$ pairings show half-quantized and quantized $κ_{xy}$ [in units of $κ_0=π^2 k^2_B T/(3h)$], respectively, while the latter one in addition accommodates a quantized $σ^{sp}_{xy}$ [in units of $σ^{sp}_0=\hbar/(8 π)$]. By contrast, the time-reversal invariant $p \pm ip$ pairing only displays a quantized $G^{th}_{xx}$ at low $T$ up to a moderate strength of disorder. In the strong disorder regime, all these topological responses ($κ_{xy}$, $σ^{sp}_{xy}$, and $G^{th}_{xx}$) vanish. Possible material platforms hosting such paired states and manifesting these robust topological thermal and spin responses are discussed.

Submitted 2 May, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

Comments: Published version in PRB: 13 pages, 4 figures, 1 Table

Journal ref: Phys. Rev. B 109, 195403 (2024)

arXiv:2308.16907 [pdf, other]

doi 10.1038/s42005-024-01629-2

Yukawa-Lorentz Symmetry in Non-Hermitian Dirac Materials

Authors: Vladimir Juricic, Bitan Roy

Abstract: Lorentz spacetime symmetry represents a unifying feature of the fundamental forces, typically manifest at sufficiently high energies, while in quantum materials it emerges in the deep low-energy regime. However, its fate in quantum materials coupled to an environment thus far remained unexplored. We here introduce a general framework of constructing symmetry-protected Lorentz invariant non-Hermitian (NH) Dirac semimetals (DSMs), realized by invoking masslike anti-Hermitian Dirac operators to its Hermitian counterpart. Such NH DSMs feature purely real or imaginary isotropic linear band dispersion, yielding a vanishing density of states. Dynamic mass orderings in NH DSMs thus take place for strong Hubbardlike local interactions through a quantum phase transition, hosting a non-Fermi liquid, beyond which the system becomes an insulator. We show that depending on the internal Clifford algebra between the NH Dirac operator and candidate mass order-parameter, the resulting quantum-critical fluid either remains coupled with the environment or recovers full Hermiticity by decoupling from the bath, while always enjoying an emergent Yukawa-Lorentz symmetry in terms of a unique terminal velocity. We showcase the competition between such mass orderings, their hallmarks on quasiparticle spectra in the ordered phases, and the relevance of our findings for correlated designer NH Dirac materials.

Submitted 28 May, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

Comments: Published Version: 10 Pages, 3 Figures & 1 Table (Supplemental Material as Ancillary File)

Journal ref: Communications Physics 7, 169 (2024)

arXiv:2308.13963 [pdf]

GPTCloneBench: A comprehensive benchmark of semantic clones and cross-language clones using GPT-3 model and SemanticCloneBench

Authors: Ajmain Inqiad Alam, Palash Ranjan Roy, Farouq Al-omari, Chanchal Kumar Roy, Banani Roy, Kevin Schneider

Abstract: With the emergence of Machine Learning, there has been a surge in leveraging its capabilities for problem-solving across various domains. In the code clone realm, the identification of type-4 or semantic clones has emerged as a crucial yet challenging task. Researchers aim to utilize Machine Learning to tackle this challenge, often relying on the BigCloneBench dataset. However, it's worth noting that BigCloneBench, originally not designed for semantic clone detection, presents several limitations that hinder its suitability as a comprehensive training dataset for this specific purpose. Furthermore, the CLCDSA dataset suffers from a lack of reusable examples aligning with real-world software systems, rendering it inadequate for cross-language clone detection approaches. In this work, we present a comprehensive semantic clone and cross-language clone benchmark, GPTCloneBench, by exploiting SemanticCloneBench and OpenAI's GPT-3 model. In particular, using code fragments from SemanticCloneBench as sample inputs along with appropriate prompt engineering for the GPT-3 model, we generate semantic and cross-language clones for these specific fragments and then conduct a combination of extensive manual analysis, tool-assisted filtering, functionality testing and automated validation in building the benchmark. From 79,928 clone pairs of GPT-3 output, we created a benchmark with 37,149 true semantic clone pairs, 19,288 false semantic pairs (Type-1/Type-2), and 20,770 cross-language clones across four languages (Java, C, C#, and Python). Our benchmark is 15-fold larger than SemanticCloneBench, has more functional code examples for software systems and programming language support than CLCDSA, and overcomes BigCloneBench's qualities, quantification, and language variety limitations.

Submitted 1 September, 2023; v1 submitted 26 August, 2023; originally announced August 2023.

Comments: Accepted in 39th IEEE International Conference on Software Maintenance and Evolution (ICSME 2023)

arXiv:2308.11958 [pdf, other]

Maintaining Plasticity in Continual Learning via Regenerative Regularization

Authors: Saurabh Kumar, Henrik Marklund, Benjamin Van Roy

Abstract: In continual learning, plasticity refers to the ability of an agent to quickly adapt to new information. Neural networks are known to lose plasticity when processing non-stationary data streams. In this paper, we propose L2 Init, a simple approach for maintaining plasticity by incorporating in the loss function L2 regularization toward initial parameters. This is very similar to standard L2 regularization (L2), the only difference being that L2 regularizes toward the origin. L2 Init is simple to implement and requires selecting only a single hyper-parameter. The motivation for this method is the same as that of methods that reset neurons or parameter values. Intuitively, when recent losses are insensitive to particular parameters, these parameters should drift toward their initial values. This prepares parameters to adapt quickly to new tasks. On problems representative of different types of nonstationarity in continual supervised learning, we demonstrate that L2 Init most consistently mitigates plasticity loss compared to previously proposed approaches.

Submitted 3 October, 2023; v1 submitted 23 August, 2023; originally announced August 2023.
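
The abstract above describes L2 Init as adding to the loss an L2 penalty that pulls parameters toward their initial values rather than toward the origin. A minimal sketch of that idea in plain Python follows. It is not the authors' implementation: the function names, the flat list-of-floats parameter layout, the sample numbers, and the exact scaling of the penalty (a single `strength` factor on the squared distance) are all assumptions made for illustration.

```python
def l2_penalty(params, strength):
    """Standard L2 regularization: squared distance to the origin."""
    return strength * sum(p ** 2 for p in params)


def l2_init_penalty(params, init_params, strength):
    """L2 Init-style penalty: squared distance to the initial parameters.

    `strength` is the single hyper-parameter mentioned in the abstract;
    its name and the absence of a 1/2 factor are assumptions.
    """
    return strength * sum((p - p0) ** 2 for p, p0 in zip(params, init_params))


def regularized_loss(task_loss, params, init_params, strength):
    """Task loss plus the penalty toward the initial parameters."""
    return task_loss + l2_init_penalty(params, init_params, strength)


# Hypothetical values: parameters at initialization and after some training.
init_params = [0.5, -0.2, 0.1]
params = [0.9, -0.2, 0.4]

total = regularized_loss(task_loss=1.0, params=params,
                         init_params=init_params, strength=0.01)
print(total)
```

In this sketch the penalty is zero exactly when the parameters sit at their initial values, whereas `l2_penalty` is zero only at the origin, which is the one difference between the two regularizers that the abstract points out.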

arXiv:2307.14231 [pdf, other]

Giant conductance of PSS:PEDOT micro-surfaces induced by microbubble lithography

Authors: Anand Dev Ranjan, Rakesh Sen, Sumeet Kumar, Rahul Vaippully, Soumya Dutta, Soumyajit Roy, Basudev Roy, Ayan Banerjee

Abstract: We provide direct evidence of the effects of interface engineering of various substrates by Microbubble lithography (MBL). We choose a model organic plastic (or polymer) poly(3,4-ethylenedioxythiophene) polystyrene sulfonate (PEDOT:PSS), with conductivity of 140 S/cm, as a representative organic system to showcase our technique. Thus, we fabricate permanent patterns of PEDOT:PSS on glass, followed by a flexible PDMS substrate, and observe conductivity enhancement of 5 times on the former (694 S/cm), and 20 times (2844 S/cm) on the latter, without the use of external doping agents or invasive chemical treatment. Probing the patterned interface, we observe that MBL is able to tune the conformational states of PEDOT:PSS from coils in the pristine form, to extended coils on glass, and almost linear structures in PDMS due to its more malleable liquid-like interface. This results in higher ordering and vanishing grain boundaries leading to the highest conductivity of PEDOT:PSS on PDMS substrates.

Submitted 26 July, 2023; originally announced July 2023.

arXiv:2307.14185 [pdf, other]

A comparison of machine learning surrogate models of street-scale flooding in Norfolk, Virginia

Authors: Diana McSpadden, Steven Goldenberg, Binata Roy, Malachi Schram, Jonathan L. Goodall, Heather Richter

Abstract: Low-lying coastal cities, exemplified by Norfolk, Virginia, face the challenge of street flooding caused by rainfall and tides, which strain transportation and sewer systems and can lead to property damage. While high-fidelity, physics-based simulations provide accurate predictions of urban pluvial flooding, their computational complexity renders them unsuitable for real-time applications. Using data from Norfolk rainfall events between 2016 and 2018, this study compares the performance of a previous surrogate model based on a random forest algorithm with two deep learning models: Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU). This investigation underscores the importance of using a model architecture that supports the communication of prediction uncertainty and the effective integration of relevant, multi-modal features.

Submitted 26 July, 2023; originally announced July 2023.

Comments: 10 pages, 8 figures

arXiv:2307.11046 [pdf, other]

A Definition of Continual Reinforcement Learning

Authors: David Abel, André Barreto, Benjamin Van Roy, Doina Precup, Hado van Hasselt, Satinder Singh

Abstract: In a standard view of the reinforcement learning problem, an agent's goal is to efficiently identify a policy that maximizes long-term reward. However, this perspective is based on a restricted view of learning as finding a solution, rather than treating learning as endless adaptation. In contrast, continual reinforcement learning refers to the setting in which the best agents never stop learning. Despite the importance of continual reinforcement learning, the community lacks a simple definition of the problem that highlights its commitments and makes its primary concepts precise and clear. To this end, this paper is dedicated to carefully defining the continual reinforcement learning problem. We formalize the notion of agents that "never stop learning" through a new mathematical language for analyzing and cataloging agents. Using this new language, we define a continual learning agent as one that can be understood as carrying out an implicit search process indefinitely, and continual reinforcement learning as the setting in which the best agents are all continual learning agents. We provide two motivating examples, illustrating that traditional views of multi-task reinforcement learning and continual supervised learning are special cases of our definition. Collectively, these definitions and perspectives formalize many intuitive concepts at the heart of learning, and open new research pathways surrounding continual learning agents.

Submitted 1 December, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

Comments: NeurIPS 2023

arXiv:2307.11044 [pdf, other]

On the Convergence of Bounded Agents

Authors: David Abel, André Barreto, Hado van Hasselt, Benjamin Van Roy, Doina Precup, Satinder Singh

Abstract: When has an agent converged? Standard models of the reinforcement learning problem give rise to a straightforward definition of convergence: An agent converges when its behavior or performance in each environment state stops changing. However, as we shift the focus of our learning problem from the environment's state to the agent's state, the concept of an agent's convergence becomes significantly less clear. In this paper, we propose two complementary accounts of agent convergence in a framing of the reinforcement learning problem that centers around bounded agents. The first view says that a bounded agent has converged when the minimal number of states needed to describe the agent's future behavior cannot decrease. The second view says that a bounded agent has converged just when the agent's performance only changes if the agent's internal state changes. We establish basic properties of these two definitions, show that they accommodate typical views of convergence in standard settings, and prove several facts about their nature and relationship. We take these perspectives, definitions, and analysis to bring clarity to a central idea of the field.

Submitted 20 July, 2023; originally announced July 2023.

arXiv:2307.08593 [pdf, other]

Artificial Intelligence for the Electron Ion Collider (AI4EIC)

Authors: C. Allaire, R. Ammendola, E. -C. Aschenauer, M. Balandat, M. Battaglieri, J. Bernauer, M. Bondì, N. Branson, T. Britton, A. Butter, I. Chahrour, P. Chatagnon, E. Cisbani, E. W. Cline, S. Dash, C. Dean, W. Deconinck, A. Deshpande, M. Diefenthaler, R. Ent, C. Fanelli, M. Finger, M. Finger, Jr., E. Fol, S. Furletov, et al. (70 additional authors not shown)

Abstract: The Electron-Ion Collider (EIC), a state-of-the-art facility for studying the strong force, is expected to begin commissioning its first experiments in 2028. This is an opportune time for artificial intelligence (AI) to be included from the start at this facility and in all phases that lead up to the experiments. The second annual workshop organized by the AI4EIC working group, which recently took place, centered on exploring all current and prospective application areas of AI for the EIC. This workshop is not only beneficial for the EIC, but also provides valuable insights for the newly established ePIC collaboration at EIC. This paper summarizes the different activities and R&D projects covered across the sessions of the workshop and provides an overview of the goals, approaches and strategies regarding AI/ML in the EIC community, as well as cutting-edge techniques currently studied in other experiments.

Submitted 17 July, 2023; originally announced July 2023.

Comments: 27 pages, 11 figures, AI4EIC workshop, tutorials and hackathon

arXiv:2307.04345 [pdf, other]

Continual Learning as Computationally Constrained Reinforcement Learning

Authors: Saurabh Kumar, Henrik Marklund, Ashish Rao, Yifan Zhu, Hong Jun Jeon, Yueyang Liu, Benjamin Van Roy

Abstract: An agent that efficiently accumulates knowledge to develop increasingly sophisticated skills over a long lifetime could advance the frontier of artificial intelligence capabilities. The design of such agents, which remains a long-standing challenge of artificial intelligence, is addressed by the subject of continual learning. This monograph clarifies and formalizes concepts of continual learning, introducing a framework and set of tools to stimulate further research.

Submitted 20 August, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

arXiv:2306.15236 [pdf, other]

doi 10.1088/1367-2630/acd94e

Towards Stirling engine using an optically confined particle subjected to asymmetric temperature profile

Authors: Gokul Nalupurackal, Muruga Lokesh, Sarangi Suresh, Srestha Roy, Snigdhadev Chakraborty, Jayesh Goswami, Arnab Pal, Basudev Roy

Abstract: The realization of microscopic heat engines has gained a surge of research interest in statistical physics, soft matter, and biological physics. A typical microscopic heat engine employs a colloidal particle trapped in a confining potential, which is modulated in time to mimic the cycle operations. Here, we use a lanthanide-doped upconverting particle (UCP) suspended in a passive aqueous bath, which is highly absorptive at 975 nm and converts NIR photons to visible, as the working substance of the engine. When a single UCP is optically trapped with a 975 nm laser, it behaves like an active particle by executing motion subjected to an asymmetric temperature profile along the direction of propagation of the laser. The strong absorption of 975 nm light by the particle introduces a temperature gradient and results in significant thermophoretic diffusion along the temperature gradient. However, the activity of the particle vanishes when the trapping wavelength is switched to 1064 nm. We carefully regulate the wavelength-dependent activity of the particle to engineer all four cycles of a Stirling engine by using a combination of 1064 nm and 975 nm wavelengths. Since the motion of the particle is stochastic, the work done on the particle due to the stiffness modulation per cycle is random. We provide statistical estimation for this work averaged over 5 cycles which can be extended towards several cycles to make a Stirling engine. Our experiment proposes a robust set-up to systematically harness temperature which is a crucial factor behind building microscopic engines.

Submitted 27 June, 2023; originally announced June 2023.

Comments: For published version, see https://iopscience.iop.org/article/10.1088/1367-2630/acd94e/meta

Journal ref: New J. Phys. 25 063001 (2023)

arXiv:2306.14834 [pdf, other]

Scalable Neural Contextual Bandit for Recommender Systems

Authors: Zheqing Zhu, Benjamin Van Roy

Abstract: High-quality recommender systems ought to deliver both innovative and relevant content through effective and exploratory interactions with users. Yet, supervised learning-based neural networks, which form the backbone of many existing recommender systems, only leverage recognized user interests, falling short when it comes to efficiently uncovering unknown user preferences. While there has been some progress with neural contextual bandit algorithms towards enabling online exploration through neural networks, their onerous computational demands hinder widespread adoption in real-world recommender systems. In this work, we propose a scalable sample-efficient neural contextual bandit algorithm for recommender systems. To do this, we design an epistemic neural network architecture, Epistemic Neural Recommendation (ENR), that enables Thompson sampling at a large scale. In two distinct large-scale experiments with real-world tasks, ENR significantly boosts click-through rates and user ratings by at least 9% and 6% respectively compared to state-of-the-art neural contextual bandit algorithms. Furthermore, it achieves equivalent performance with at least 29% fewer user interactions compared to the best-performing baseline algorithm. Remarkably, while accomplishing these improvements, ENR demands orders of magnitude fewer computational resources than neural contextual bandit baseline algorithms.

Submitted 18 August, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

Journal ref: 32nd ACM International Conference on Information and Knowledge Management (CIKM 2023)

arXiv:2306.13177 [pdf, other]

doi 10.1145/3581784.3607035

Toward Sustainable HPC: Carbon Footprint Estimation and Environmental Implications of HPC Systems

Authors: Baolin Li, Rohan Basu Roy, Daniel Wang, Siddharth Samsi, Vijay Gadepally, Devesh Tiwari

Abstract: The rapid growth in demand for HPC systems has led to a rise in carbon footprint, which requires urgent intervention. In this work, we present a comprehensive analysis of the carbon footprint of high-performance computing (HPC) systems, considering the carbon footprint during both the hardware manufacturing and system operational stages. Our work employs HPC hardware component carbon footprint modeling, regional carbon intensity analysis, and experimental characterization of the system life cycle to highlight the importance of quantifying the carbon footprint of HPC systems.

Submitted 18 November, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

arXiv:2306.11737 [pdf, other]

doi 10.1145/3588028.3603652

Neural ShDF: Reviving an Efficient and Consistent Mesh Segmentation Method

Authors: Bruno Roy

Abstract: Partitioning a polygonal mesh into meaningful parts can be challenging. Many applications require decomposing such structures for further processing in computer graphics. In the last decade, several methods were proposed to tackle this problem, at the cost of intensive computational times. Recently, machine learning has proven to be effective for the segmentation task on 3D structures. Nevertheless, these state-of-the-art methods are often hardly generalizable and require dividing the learned model into several specific classes of objects to avoid overfitting. We present a data-driven approach leveraging deep learning to encode a mapping function prior to mesh segmentation for multiple applications. Our network reproduces a neighborhood map using our knowledge of the \textsl{Shape Diameter Function} (SDF) method using similarities among vertex neighborhoods. Our approach is resolution-agnostic as we downsample the input meshes and query the full-resolution structure solely for neighborhood contributions. Using our predicted SDF values, we can inject the resulting structure into a graph-cut algorithm to generate an efficient and robust mesh segmentation while considerably reducing the required computation times.

Submitted 31 August, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

Comments: 9 pages, 13 figures, and 3 tables. Short paper and poster published and presented at SIGGRAPH 2023

arXiv:2306.03044 [pdf]

Light-activated memristor by Au-nanoparticle embedded HfO$_2$-bilayer/p-Si MOS device

Authors: Ankita Sengupta, Basudev Nag Chowdhury, Bodhishatwa Roy, Biswarup Satpati, Satyaban Bhunia, Sanatan Chattopadhyay

Abstract: The current work proposes a novel scheme for developing a light-activated non-filamentary memristor device by fabricating an Au-nanoparticle embedded HfO$_2$-bilayer/p-Si MOS structure. Under illumination, the electrons in such embedded Au-nanoparticles are excited from d-level to quantized s-p level and are swept out on application of an appropriate gate bias, leaving behind the holes without recombination. Such photogenerated holes are confined within the nanoparticles and thus screen the external field to lead to a memristive effect in the device. The phenomenon is experimentally observed in the fabricated Pt/HfO$_2$-(layer-II)/Au-NPs/HfO$_2$-(layer-I)/p-Si devices, where such memristive effect is activated/deactivated by light pulses. The memory window and high-to-low resistance ratio of the device are obtained to be ~1 V and ~10, respectively, which suggest the performance of a standard state-of-the-art memristor. Further, the present device offers a voltage-sweep-endurance up to at least 150 cycles and the memory retention up to ~10,000 s. Such a device concept can be extended for a combination of different nanoparticles with various dimensions and dielectric layers to optimize their memristive effect for achieving CMOS-compatible memory devices with superior reliability.

Submitted 27 May, 2023; originally announced June 2023.

arXiv:2305.16313 [pdf, other]

Hybrid symmetry class topological insulators

Authors: Sanjib Kumar Das, Bitan Roy

Abstract: Traditional topological materials belong to different Altland-Zirnbauer symmetry classes (AZSCs) depending on their non-spatial symmetries. Here we introduce the notion of hybrid symmetry class topological insulators (HSCTIs): A fusion of two different AZSC topological insulators (TIs) such that they occupy orthogonal Cartesian hyperplanes and their universal massive Dirac Hamiltonian mutually anticommute. The boundaries of HSCTIs can also harbor TIs, typically affiliated with an AZSC different from the parent ones. As such, a fusion between planar quantum spin Hall and vertical Su-Schrieffer-Heeger insulators gives birth to a three-dimensional HSCTI, accommodating quantum anomalous Hall insulators and quantized Hall conductivity on the top and bottom surfaces. We extend this construction to encompass crystalline HSCTI and topological superconductors, and beyond three dimensions. Possible (meta)material platforms to harness HSCTIs are discussed.

Submitted 25 May, 2023; originally announced May 2023.

Comments: 6 pages, 5 figures: Supplemental Material as Ancillary file

arXiv:2305.14321 [pdf, other]

ConGraT: Self-Supervised Contrastive Pretraining for Joint Graph and Text Embeddings

Authors: William Brannon, Wonjune Kang, Suyash Fulay, Hang Jiang, Brandon Roy, Deb Roy, Jad Kabbara

Abstract: Learning on text-attributed graphs (TAGs), in which nodes are associated with one or more texts, has been the subject of much recent work. However, most approaches tend to make strong assumptions about the downstream task of interest, are reliant on hand-labeled data, or fail to equally balance the importance of both text and graph representations. In this work, we propose Contrastive Graph-Text pretraining (ConGraT), a general, self-supervised approach for jointly learning separate representations of texts and nodes in a TAG. Our method trains a language model (LM) and a graph neural network (GNN) to align their representations in a common latent space using a batch-wise contrastive learning objective inspired by CLIP. We further propose an extension to the CLIP objective that leverages graph structure to incorporate information about inter-node similarity. Extensive experiments demonstrate that ConGraT outperforms baselines on various downstream tasks, including node and text category classification, link prediction, and language modeling. Finally, we present an application of our method to community detection in social graphs, which enables finding more textually grounded communities, rather than purely graph-based ones. Code and certain datasets are available at https://github.com/wwbrannon/congrat.

Submitted 9 July, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

Comments: New visualizations, added references, and an application to community detection. To appear at the TextGraphs workshop @ ACL 2024. 21 pages, 5 figures, 13 tables

arXiv:2305.11455 [pdf, other]

Shattering the Agent-Environment Interface for Fine-Tuning Inclusive Language Models

Authors: Wanqiao Xu, Shi Dong, Dilip Arumugam, Benjamin Van Roy

Abstract: A centerpiece of the ever-popular reinforcement learning from human feedback (RLHF) approach to fine-tuning autoregressive language models is the explicit training of a reward model to emulate human feedback, distinct from the language model itself. This reward model is then coupled with policy-gradient methods to dramatically improve the alignment between language model outputs and desired responses. In this work, we adopt a novel perspective wherein a pre-trained language model is itself simultaneously a policy, reward function, and transition function. An immediate consequence of this is that reward learning and language model fine-tuning can be performed jointly and directly, without requiring any further downstream policy optimization. While this perspective does indeed break the traditional agent-environment interface, we nevertheless maintain that there can be enormous statistical benefits afforded by bringing to bear traditional algorithmic concepts from reinforcement learning. Our experiments demonstrate one concrete instance of this through efficient exploration based on the representation and resolution of epistemic uncertainty. In order to illustrate these ideas in a transparent manner, we restrict attention to a simple didactic data generating process and leave for future work extension to systems of practical scale.

Submitted 19 May, 2023; originally announced May 2023.

arXiv:2305.11174 [pdf, other]

Magnetic catalysis in weakly interacting hyperbolic Dirac materials

Authors: Noble Gluscevich, Bitan Roy

Abstract: Due to linearly vanishing density of states, emergent massless Dirac quasiparticle resulting from the free fermion motion on a family of two-dimensional half-filled bipartite hyperbolic lattices feature dynamic mass generation through quantum phase transitions only for sufficiently strong finite-range Coulomb repulsion. As such, strong nearest-neighbor Coulomb repulsion ($V$) is conducive to the nucleation of a charge-density-wave (CDW) order with a staggered pattern of average fermionic density between two sublattices of bipartite hyperbolic lattices. Considering a collection of spinless fermions (for simplicity), here we show that application of strong external magnetic fields by virtue of producing a \emph{finite} density of states near the zero energy triggers the condensation of the CDW order even for \emph{infinitesimal} $V$. The proposed magnetic catalysis mechanism is operative for uniform as well as inhomogeneous (bell-shaped) magnetic fields. We present scaling of the CDW order with the total flux enclosed by hyperbolic Dirac materials for a wide range of (especially subcritical) $V$.

Submitted 18 May, 2023; originally announced May 2023.

Comments: 6 Pages, 4 Figures

arXiv:2305.05190 [pdf, ps, other]

Thermo-electric history effects and resistive switching in epitaxial thin film of Mott insulator V2O3

Authors: Binoy Krishna De, V. G. Sathe, S. B. Roy

Abstract: We report interesting thermo-electric history effects associated with an electric field-induced first order phase transition from Mott-insulator to the metallic state in the epitaxial thin film of V2O3. This phase transition results in tuneable resistive switching in V2O3. These findings are promising for novel technologies like optoelectronics and neuromorphic computing and may lead to highly energy-efficient switching applications of Mott insulators.

Submitted 9 May, 2023; originally announced May 2023.

Comments: 4 pages, 4 figures

Showing 1–50 of 464 results for author: Roy, B