Showing 1–28 of 28 results for author: Panigrahi, A

  1. arXiv:2405.20760  [pdf, ps, other]

    math.NT

    On $r$-primitive $k$-normal polynomials with two prescribed coefficients

    Authors: Avnish K. Sharma, Mamta Rani, Sharwan K. Tiwari, Anupama Panigrahi

    Abstract: This article investigates the existence of an $r$-primitive $k$-normal polynomial, defined as the minimal polynomial of an $r$-primitive $k$-normal element in $\mathbb{F}_{q^n}$, with a specified degree $n$ and two given coefficients over the finite field $\mathbb{F}_{q}$. Here, $q$ represents an odd prime power, and $n$ is an integer. The article establishes a sufficient condition to ensure the e…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 27 pages, 3 Tables

    MSC Class: 12E20; 11T23

  2. arXiv:2403.18817  [pdf, other]

    cond-mat.str-el cond-mat.mes-hall

    Signatures of electronic ordering in transport in graphene flat bands

    Authors: Archisman Panigrahi, Leonid Levitov

    Abstract: Recently, a wide family of electronic orders was unveiled in graphene flat bands, such as spin- and valley-polarized phases as well as nematic momentum-polarized phases, stabilized by exchange interactions via a generalized Stoner mechanism. Momentum polarization involves orbital degrees of freedom and is therefore expected to impact resistivity in a way which is uniquely sensitive to the ordering…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 12 pages, 4 figures

  3. arXiv:2403.17120  [pdf, other]

    cond-mat.mes-hall

    Spin-orbit proximity in MoS$_2$/bilayer graphene heterostructures

    Authors: M. Masseroni, M. Gull, A. Panigrahi, N. Jacobsen, F. Fischer, C. Tong, J. D. Gerber, M. Niese, T. Taniguchi, K. Watanabe, L. Levitov, T. Ihn, K. Ensslin, H. Duprez

    Abstract: Van der Waals heterostructures provide a versatile platform for tailoring electronic properties through the integration of two-dimensional materials. Among these combinations, the interaction between bilayer graphene and transition metal dichalcogenides (TMDs) stands out due to its potential for inducing spin-orbit coupling (SOC) in graphene. Future device concepts require understanding the p…

    Submitted 25 March, 2024; originally announced March 2024.

  4. arXiv:2402.05913  [pdf, other]

    cs.CL cs.LG

    Efficient Stagewise Pretraining via Progressive Subnetworks

    Authors: Abhishek Panigrahi, Nikunj Saunshi, Kaifeng Lyu, Sobhan Miryoosefi, Sashank Reddi, Satyen Kale, Sanjiv Kumar

    Abstract: Recent developments in large language models have sparked interest in efficient pretraining methods. A recent effective paradigm is to perform stage-wise training, where the size of the model is gradually increased over the course of training (e.g. gradual stacking (Reddi et al., 2023)). While the resource and wall-time savings are appealing, it has limitations, particularly the inability to evalu…

    Submitted 8 February, 2024; originally announced February 2024.

  5. arXiv:2312.05671  [pdf, other]

    cs.CL

    Hate Speech and Offensive Content Detection in Indo-Aryan Languages: A Battle of LSTM and Transformers

    Authors: Nikhil Narayan, Mrutyunjay Biswal, Pramod Goyal, Abhranta Panigrahi

    Abstract: Social media platforms serve as accessible outlets for individuals to express their thoughts and experiences, resulting in an influx of user-generated data spanning all age groups. While these platforms enable free expression, they also present significant challenges, including the proliferation of hate speech and offensive content. Such objectionable language disrupts objective discourse and can…

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: 14 pages, 3 figures. Accepted Working Notes at HASOC-FIRE 2023, to be published in CEUR Working Notes of FIRE

  6. arXiv:2307.01189  [pdf, other]

    cs.CL cs.LG

    Trainable Transformer in Transformer

    Authors: Abhishek Panigrahi, Sadhika Malladi, Mengzhou Xia, Sanjeev Arora

    Abstract: Recent works attribute the capability of in-context learning (ICL) in large pre-trained language models to implicitly simulating and fine-tuning an internal model (e.g., linear or 2-layer MLP) during inference. However, such constructions require large memory overhead, which makes simulation of more sophisticated internal models intractable. In this work, we propose an efficient construction, Tran…

    Submitted 8 February, 2024; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: Code base: https://github.com/abhishekpanigrahi1996/transformer_in_transformer

  7. arXiv:2303.08117  [pdf, other]

    cs.CL cs.LG

    Do Transformers Parse while Predicting the Masked Word?

    Authors: Haoyu Zhao, Abhishek Panigrahi, Rong Ge, Sanjeev Arora

    Abstract: Pre-trained language models have been shown to encode linguistic structures, e.g. dependency and constituency parse trees, in their embeddings while being trained on unsupervised loss functions like masked language modeling. Some doubts have been raised whether the models actually are doing parsing or only some computation weakly correlated with it. We study questions: (a) Is it possible to explic…

    Submitted 15 October, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: Accepted to EMNLP 2023, 30 pages

  8. Analytic calculation of the vison gap in the Kitaev spin liquid

    Authors: Aaditya Panigrahi, Piers Coleman, Alexei Tsvelik

    Abstract: Although the ground-state energy of the Kitaev spin liquid can be calculated exactly, the associated vison gap energy has to date only been calculated numerically from finite size diagonalization. Here we show that the phase shift for scattering Majorana fermions off a single bond-flip can be calculated analytically, leading to a closed-form expression for the vison gap energy $Δ= 0.2633J$. Genera…

    Submitted 28 February, 2023; originally announced March 2023.

    Comments: 7 pages, 6 figures

    Journal ref: Phys. Rev. B 108, 045151 (2023)

  9. arXiv:2302.06600  [pdf, other]

    cs.CL cs.LG

    Task-Specific Skill Localization in Fine-tuned Language Models

    Authors: Abhishek Panigrahi, Nikunj Saunshi, Haoyu Zhao, Sanjeev Arora

    Abstract: Pre-trained language models can be fine-tuned to solve diverse NLP tasks, including in few-shot settings. Thus fine-tuning allows the model to quickly pick up task-specific ``skills,'' but there has been limited study of where these newly-learnt skills reside inside the massive model. This paper introduces the term skill localization for this problem and proposes a solution. Given the downstream t…

    Submitted 1 July, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: Accepted at 40th International Conference on Machine Learning (ICML 2023)

  10. arXiv:2207.09127  [pdf, ps, other]

    cs.CR

    Smart Contract Assisted Blockchain based PKI System

    Authors: Amrutanshu Panigrahi, Ajit Kumar Nayak, Rourab Paul

    Abstract: The proposed smart contract can prevent seven cyber attacks, such as Denial of Service (DoS), Man in the Middle Attack (MITM), Distributed Denial of Service (DDoS), 51%, Injection attacks, Routing Attack, and Eclipse attack. The Delegated Proof of Stake (DPoS) consensus algorithm used in this model reduces the number of validators for each transaction which makes it suitable for lightweight appli…

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: manuscript

  11. arXiv:2205.10287  [pdf, other]

    cs.LG

    On the SDEs and Scaling Rules for Adaptive Gradient Algorithms

    Authors: Sadhika Malladi, Kaifeng Lyu, Abhishek Panigrahi, Sanjeev Arora

    Abstract: Approximating Stochastic Gradient Descent (SGD) as a Stochastic Differential Equation (SDE) has allowed researchers to enjoy the benefits of studying a continuous optimization trajectory while carefully preserving the stochasticity of SGD. Analogous study of adaptive gradient methods, such as RMSprop and Adam, has been challenging because there were no rigorously proven SDE approximations for thes…

    Submitted 13 February, 2023; v1 submitted 20 May, 2022; originally announced May 2022.

  12. arXiv:2205.09745  [pdf, other]

    cs.LG math.OC

    Understanding Gradient Descent on Edge of Stability in Deep Learning

    Authors: Sanjeev Arora, Zhiyuan Li, Abhishek Panigrahi

    Abstract: Deep learning experiments by Cohen et al. [2021] using deterministic Gradient Descent (GD) revealed an Edge of Stability (EoS) phase when learning rate (LR) and sharpness (i.e., the largest eigenvalue of Hessian) no longer behave as in traditional optimization. Sharpness stabilizes around $2/$LR and loss goes up and down across iterations, yet still with an overall downward trend. The current pape…

    Submitted 28 October, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: 63 pages. This paper has been accepted for conference proceedings in the 39th International Conference on Machine Learning (ICML), 2022

  13. arXiv:2203.04104  [pdf, other]

    cond-mat.str-el cond-mat.supr-con

    A solvable 3D Kondo lattice exhibiting odd-frequency pairing and order fractionalization

    Authors: Piers Coleman, Aaditya Panigrahi, Alexei Tsvelik

    Abstract: The Kondo lattice model plays a key role in our understanding of quantum materials, but a lack of small parameters has posed a long-standing problem. We present a 3-dimensional $S=1/2$ Kondo lattice model describing a spin liquid within an electron sea. Strong correlations in the spin liquid are treated exactly, enabling a controlled analytical approach. Like a Peierls or BCS phase, a logarithmical…

    Submitted 20 July, 2022; v1 submitted 8 March, 2022; originally announced March 2022.

    Comments: Contains Supplementary Material with calculation of divergent charge e pair susceptibility

    Journal ref: Phys. Rev. Lett. 129, 177601 (2022)

  14. arXiv:2201.11334  [pdf, ps, other]

    math.NT math.RA

    Inverses of $r$-primitive $k$-normal elements over finite fields

    Authors: Mamta Rani, Avnish K. Sharma, Sharwan K. Tiwari, Anupama Panigrahi

    Abstract: Let $r$, $n$ be positive integers, $k$ be a non-negative integer and $q$ be any prime power such that $r\mid q^n-1.$ An element $α$ of the finite field $\mathbb{F}_{q^n}$ is called an {\it $r$-primitive} element, if its multiplicative order is $(q^n-1)/r$, and it is called a {\it $k$-normal} element over $\mathbb{F}_q$, if the greatest common divisor of the polynomials…

    Submitted 27 January, 2022; originally announced January 2022.

    Comments: 30 pages

    MSC Class: 12E20; 11T23

  15. arXiv:2112.06911  [pdf, other]

    cond-mat.mes-hall cond-mat.mtrl-sci cond-mat.str-el hep-th math-ph

    Projected Topological Branes

    Authors: Archisman Panigrahi, Vladimir Juricic, Bitan Roy

    Abstract: Nature harbors crystals of dimensionality ($d$) only up to three. Here we introduce the notion of \emph{projected topological branes} (PTBs): Lower-dimensional branes embedded in higher-dimensional parent topological crystals, constructed via a geometric cut-and-project procedure on the Hilbert space of the parent lattice Hamiltonian. When such a brane is inclined at a rational or an irrational sl…

    Submitted 16 September, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: Published version: 10 Pages, 7 Figures (Supplementary Information as Ancillary file)

    Journal ref: Communications Physics 5, 230 (2022)

  16. Energy magnetization and transport in systems with a non-zero Berry curvature in a magnetic field

    Authors: Archisman Panigrahi, Subroto Mukerjee

    Abstract: We demonstrate that the well-known expression for the charge magnetization of a sample with a non-zero Berry curvature can be obtained by demanding that the Einstein relation holds for the electric transport current. We extend this formalism to the transport energy current and show that the energy magnetization must satisfy a particular condition. We provide a physical interpretation of this condi…

    Submitted 31 July, 2023; v1 submitted 15 November, 2021; originally announced November 2021.

    Comments: 22 pages, 2 figures

    Journal ref: SciPost Phys. Core 6, 052 (2023)

  17. arXiv:2106.00047  [pdf, other]

    cs.LG

    Learning and Generalization in RNNs

    Authors: Abhishek Panigrahi, Navin Goyal

    Abstract: Simple recurrent neural networks (RNNs) and their more advanced cousins LSTMs etc. have been very successful in sequence modeling. Their theoretical understanding, however, is lacking and has not kept pace with the progress for feedforward networks, where a reasonably complete understanding in the special case of highly overparametrized one-hidden-layer networks has emerged. In this paper, we make…

    Submitted 31 May, 2021; originally announced June 2021.

  18. arXiv:2105.05244  [pdf, other]

    cond-mat.mes-hall cond-mat.mtrl-sci quant-ph

    Non-Hermitian dislocation modes: Stability and melting across exceptional points

    Authors: Archisman Panigrahi, Roderich Moessner, Bitan Roy

    Abstract: The traditional bulk-boundary correspondence assuring robust gapless modes at the edges and surfaces of insulating and nodal topological materials gets masked in non-Hermitian (NH) systems by the skin effect, manifesting an accumulation of a macroscopic number of states near such interfaces. Here we show that dislocation lattice defects are immune to such skin effect or at most display a \emph{wea…

    Submitted 12 July, 2022; v1 submitted 11 May, 2021; originally announced May 2021.

    Comments: Published version: 6 Pages, 4 Figures (Supplemental Material: as Ancillary file)

    Journal ref: Phys. Rev. B 106, L041302 (2022)

  19. arXiv:1910.09626  [pdf, other]

    cs.LG stat.ML

    Non-Gaussianity of Stochastic Gradient Noise

    Authors: Abhishek Panigrahi, Raghav Somani, Navin Goyal, Praneeth Netrapalli

    Abstract: What enables Stochastic Gradient Descent (SGD) to achieve better generalization than Gradient Descent (GD) in Neural Network training? This question has attracted much attention. In this paper, we study the distribution of the Stochastic Gradient Noise (SGN) vectors during the training. We observe that for batch sizes 256 and above, the distribution is best described as Gaussian at least in the ea…

    Submitted 25 October, 2019; v1 submitted 21 October, 2019; originally announced October 2019.

  20. arXiv:1908.05660  [pdf, other]

    cs.LG stat.ML

    Effect of Activation Functions on the Training of Overparametrized Neural Nets

    Authors: Abhishek Panigrahi, Abhishek Shetty, Navin Goyal

    Abstract: It is well-known that overparametrized neural networks trained using gradient-based methods quickly achieve small training error with appropriate hyperparameter settings. Recent papers have proved this statement theoretically for highly overparametrized networks under reasonable assumptions. These results either assume that the activation function is ReLU or they crucially depend on the minimum ei…

    Submitted 10 April, 2020; v1 submitted 16 August, 2019; originally announced August 2019.

    Comments: Major update: Several new results, some reorganization and rewriting of previous results, new references

  21. arXiv:1903.03941  [pdf, other]

    cs.SI cs.IR

    DeepTagRec: A Content-cum-User based Tag Recommendation Framework for Stack Overflow

    Authors: Suman Kalyan Maity, Abhishek Panigrahi, Sayan Ghosh, Arundhati Banerjee, Pawan Goyal, Animesh Mukherjee

    Abstract: In this paper, we develop a content-cum-user based deep learning framework DeepTagRec to recommend appropriate question tags on Stack Overflow. The proposed system learns the content representation from question title and body. Subsequently, the learnt representation from heterogeneous relationship between user and tags is fused with the content representation for the final tag prediction. On a ve…

    Submitted 10 March, 2019; originally announced March 2019.

    Comments: 7 pages, 1 figure, 2 tables, In proceedings of ECIR 2019

  22. arXiv:1812.00342  [pdf, other]

    cs.LG stat.ML

    Analysis on Gradient Propagation in Batch Normalized Residual Networks

    Authors: Abhishek Panigrahi, Yueru Chen, C. -C. Jay Kuo

    Abstract: In this work, we conduct a mathematical analysis of the effect of batch normalization (BN) on gradient backpropagation in residual network training, which is believed to play a critical role in addressing the gradient vanishing/explosion problem. By analyzing the mean and variance behavior of the input and the gradient in the forward and backward passes through the BN and residual branches, respecti…

    Submitted 2 December, 2018; originally announced December 2018.

  23. arXiv:1811.04968  [pdf, other]

    quant-ph cs.ET cs.LG physics.comp-ph

    PennyLane: Automatic differentiation of hybrid quantum-classical computations

    Authors: Ville Bergholm, Josh Izaac, Maria Schuld, Christian Gogolin, Shahnawaz Ahmed, Vishnu Ajith, M. Sohaib Alam, Guillermo Alonso-Linaje, B. AkashNarayanan, Ali Asadi, Juan Miguel Arrazola, Utkarsh Azad, Sam Banning, Carsten Blank, Thomas R Bromley, Benjamin A. Cordier, Jack Ceroni, Alain Delgado, Olivia Di Matteo, Amintor Dusko, Tanya Garg, Diego Guala, Anthony Hayes, Ryan Hill, Aroosa Ijaz, et al. (43 additional authors not shown)

    Abstract: PennyLane is a Python 3 software framework for differentiable programming of quantum computers. The library provides a unified architecture for near-term quantum computing devices, supporting both qubit and continuous-variable paradigms. PennyLane's core feature is the ability to compute gradients of variational quantum circuits in a way that is compatible with classical techniques such as backpro…

    Submitted 29 July, 2022; v1 submitted 12 November, 2018; originally announced November 2018.

    Comments: Code available at https://github.com/XanaduAI/pennylane/ . Significant contributions to the code (new features, new plugins, etc.) will be recognized by the opportunity to be a co-author on this paper

  24. arXiv:1809.07354  [pdf, other]

    cs.SI

    Analyzing Social Book Reading Behavior on Goodreads and how it predicts Amazon Best Sellers

    Authors: Suman Kalyan Maity, Abhishek Panigrahi, Animesh Mukherjee

    Abstract: A book's success/popularity depends on various parameters - extrinsic and intrinsic. In this paper, we study how the book reading characteristics might influence the popularity of a book. Towards this objective, we perform a cross-platform study of Goodreads entities and attempt to establish the connection between various Goodreads entities and the popular books ("Amazon best sellers"). We analyze…

    Submitted 19 September, 2018; originally announced September 2018.

    Comments: 25 pages, 8 figures, 5 tables, Influence and Behavior Analysis in Social Networks and Social Media (Springer)

  25. arXiv:1602.05981  [pdf]

    q-bio.TO physics.bio-ph q-bio.QM

    HIV, Cardiovascular Diseases, and Chronic Arsenic Exposure co-exist in a Positive Synergy

    Authors: Arghya Panigrahi, Amit K Chattopadhyay, Goutam Paul, Soumya Panigrahi

    Abstract: Recent epidemiological evidence indicates that arsenic exposure increases the risk of atherosclerosis, cardiovascular diseases and microangiopathies in addition to the serious global health concern related to its carcinogenic effects. In experiments on animals, acute and chronic exposure to arsenic directly correlates with cardiac tachyarrhythmia and atherogenesis in a concentration and duration dependent…

    Submitted 9 February, 2016; originally announced February 2016.

    Comments: 15 pages, 4 figures; accepted in Bengal Heart Journal

  26. Determining the network throughput and flow rate using GSR And AAL2R

    Authors: Adyasha Behera, Amrutanshu Panigrahi

    Abstract: In multi-radio wireless mesh networks, one node is eligible to transmit packets over multiple channels to different destination nodes simultaneously. This feature of multi-radio wireless mesh networks yields high network throughput and increases the chance of multipath routing. This is because the availability of multiple channels for transmission decreases the probability of the most elegant p…

    Submitted 7 August, 2015; originally announced August 2015.

  27. arXiv:1008.2530  [pdf]

    nlin.CD nlin.CG nlin.PS

    2-Variable Boolean Operation -- its use in Pattern Formation

    Authors: Sudhakar Sahoo, Ipsita Mohanty, Garisha Chowdhary, Arpit Panigrahi

    Abstract: In this paper the theory of 2-Variable Boolean Operation (2-VBO) is discussed on a pair of n-bit strings. 2-VBO serves to bring out the relation between numbers which, when plotted on a 2-D surface, form interesting patterns; patterns that may be fixed, periodic, chaotic or complex. Some of these patterns represent natural fractals. This paper also provides mathematical analysis corresponding to…

    Submitted 15 August, 2010; originally announced August 2010.

    Comments: 8 pages, 20 figures and 2 tables

  28. arXiv:math/0503095  [pdf, ps, other]

    math.CO math.NT

    On the structure of $p$-zero-sum free sequences and its application to a variant of Erdos--Ginzburg--Ziv theorem

    Authors: W D Gao, A Panigrahi, R Thangadurai

    Abstract: Let $p$ be any odd prime number. Let $k$ be any positive integer such that $2\leq k\leq [\frac{p+1}3]+1$. Let $S = (a_1,a_2,...,a_{2p-k})$ be any sequence in ${\Bbb Z}_p$ such that there is no subsequence of length $p$ of $S$ whose sum is zero in ${\Bbb Z}_p$. Then we prove that we can arrange the sequence $S$ as follows:…

    Submitted 5 March, 2005; originally announced March 2005.

    Comments: 11 pages

    MSC Class: 20D60; 11B75

    Journal ref: Proc. Indian Acad. Sci. (Math. Sci.), Vol. 115, No. 1, February 2005, pp. 67-77