Skip to main content

Showing 1–16 of 16 results for author: Sahoo, S S

.
  1. arXiv:2406.07524  [pdf, other

    cs.CL cs.AI cs.LG

    Simple and Effective Masked Diffusion Language Models

    Authors: Subham Sekhar Sahoo, Marianne Arriola, Yair Schiff, Aaron Gokaslan, Edgar Marroquin, Justin T Chiu, Alexander Rush, Volodymyr Kuleshov

    Abstract: While diffusion models excel at generating high-quality images, prior work reports a significant performance gap between diffusion and autoregressive (AR) methods in language modeling. In this work, we show that simple masked discrete diffusion is more performant than previously thought. We apply an effective training recipe that improves the performance of masked diffusion models and derive a sim… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Report number: cr07

  2. arXiv:2312.13236  [pdf, other

    cs.LG cs.CV

    Diffusion Models With Learned Adaptive Noise

    Authors: Subham Sekhar Sahoo, Aaron Gokaslan, Chris De Sa, Volodymyr Kuleshov

    Abstract: Diffusion models have gained traction as powerful algorithms for synthesizing high-quality images. Central to these algorithms is the diffusion process, a set of equations which maps data to noise in a way that can significantly affect performance. In this paper, we explore whether the diffusion process can be learned from data. Our work is grounded in Bayesian inference and seeks to improve log-l… ▽ More

    Submitted 4 June, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

  3. arXiv:2309.13445  [pdf, other

    cs.AR cs.AI eess.SP

    AxOMaP: Designing FPGA-based Approximate Arithmetic Operators using Mathematical Programming

    Authors: Siva Satyendra Sahoo, Salim Ullah, Akash Kumar

    Abstract: With the increasing application of machine learning (ML) algorithms in embedded systems, there is a rising necessity to design low-cost computer arithmetic for these resource-constrained systems. As a result, emerging models of computation, such as approximate and stochastic computing, that leverage the inherent error-resilience of such algorithms are being actively explored for implementing ML in… ▽ More

    Submitted 23 September, 2023; originally announced September 2023.

    Comments: 23 pages, Under review at ACM TRETS

  4. arXiv:2309.12830  [pdf, other

    cs.AR cs.AI cs.LG eess.SP

    AxOCS: Scaling FPGA-based Approximate Operators using Configuration Supersampling

    Authors: Siva Satyendra Sahoo, Salim Ullah, Soumyo Bhattacharjee, Akash Kumar

    Abstract: The rising usage of AI and ML-based processing across application domains has exacerbated the need for low-cost ML implementation, specifically for resource-constrained embedded systems. To this end, approximate computing, an approach that explores the power, performance, area (PPA), and behavioral accuracy (BEHAV) trade-offs, has emerged as a possible solution for implementing embedded machine le… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

    Comments: 11 pages, under review with IEEE TCAS-I

    ACM Class: B.2.4; J.6; J.7; I.2.1

  5. arXiv:2209.04249  [pdf, other

    physics.atom-ph physics.ins-det physics.optics

    Single-beam room-temperature atomic magnetometer with large bandwidth and dynamic range

    Authors: K. K. George Kurian, Sushree S. Sahoo, P. K. Madhu, G. Rajalakshmi

    Abstract: We present a single-beam atomic magnetometer operating at room temperature for the measurement of ac magnetic fields. The magnetometer functions in the non-linear regime of magneto-optical rotation of $^{85}$Rb atomic vapour. We demonstrate a sensitivity of $\sim 0.9$ pT$/ \sqrt{Hz}$ at 2 kHz and a large bandwidth of 24 kHz. The dynamic range of measurement is $10^6$, making the sensor effective e… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

    Comments: 7 pages, 6 Figures

  6. arXiv:2206.06672  [pdf, other

    cs.LG stat.ML

    Semi-Autoregressive Energy Flows: Exploring Likelihood-Free Training of Normalizing Flows

    Authors: Phillip Si, Zeyi Chen, Subham Sekhar Sahoo, Yair Schiff, Volodymyr Kuleshov

    Abstract: Training normalizing flow generative models can be challenging due to the need to calculate computationally expensive determinants of Jacobians. This paper studies the likelihood-free training of flows and proposes the energy objective, an alternative sample-based loss based on proper scoring rules. The energy objective is determinant-free and supports flexible model architectures that are not eas… ▽ More

    Submitted 22 June, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: 9 pages, 3 figures, 8 tables, 11 pages appendix

    MSC Class: 68T37 (Primary) 68T07 (Secondary)

  7. arXiv:2206.04833  [pdf, other

    cs.LG

    Training Neural Networks using SAT solvers

    Authors: Subham S. Sahoo

    Abstract: We propose an algorithm to explore the global optimization method, using SAT solvers, for training a neural net. Deep Neural Networks have achieved great feats in tasks like-image recognition, speech recognition, etc. Much of their success can be attributed to the gradient-based optimisation methods, which scale well to huge datasets while still giving solutions, better than any other existing met… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

  8. arXiv:2205.15213  [pdf, other

    cs.LG

    Backpropagation through Combinatorial Algorithms: Identity with Projection Works

    Authors: Subham Sekhar Sahoo, Anselm Paulus, Marin Vlastelica, Vít Musil, Volodymyr Kuleshov, Georg Martius

    Abstract: Embedding discrete solvers as differentiable layers has given modern deep learning architectures combinatorial expressivity and discrete reasoning capabilities. The derivative of these solvers is zero or undefined, therefore a meaningful replacement is crucial for effective gradient-based learning. Prior works rely on smoothing the solver with input perturbations, relaxing the solver to continuous… ▽ More

    Submitted 17 March, 2023; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: ICLR 2023 conference paper. The first two authors contributed equally

  9. arXiv:2103.08168  [pdf, other

    physics.atom-ph physics.optics

    Nonlinear magnetoelectric effect in atomic vapor

    Authors: Sushree S. Sahoo, Soumya R. Mishra, G. Rajalakshmi, Ashok K. Mohapatra

    Abstract: Magnetoelectric (ME) effect refers to the coupling between electric and magnetic fields in a medium resulting in electric polarization induced by magnetic fields and magnetization induced by electric fields. The linear ME effect in certain magnetoelectric materials such as multiferroics has been of great interest due to its application in the fabrication of spintronics devices, memories, and magne… ▽ More

    Submitted 15 March, 2021; originally announced March 2021.

  10. arXiv:2010.12869  [pdf, other

    cs.AR cs.AI cs.ET cs.PF

    ExPAN(N)D: Exploring Posits for Efficient Artificial Neural Network Design in FPGA-based Systems

    Authors: Suresh Nambi, Salim Ullah, Aditya Lohana, Siva Satyendra Sahoo, Farhad Merchant, Akash Kumar

    Abstract: The recent advances in machine learning, in general, and Artificial Neural Networks (ANN), in particular, has made smart embedded systems an attractive option for a larger number of application areas. However, the high computational complexity, memory footprints, and energy requirements of machine learning models hinder their deployment on resource-constrained embedded systems. Most state-of-the-a… ▽ More

    Submitted 27 October, 2020; v1 submitted 24 October, 2020; originally announced October 2020.

  11. arXiv:2006.16322  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Scaling Symbolic Methods using Gradients for Neural Model Explanation

    Authors: Subham Sekhar Sahoo, Subhashini Venugopalan, Li Li, Rishabh Singh, Patrick Riley

    Abstract: Symbolic techniques based on Satisfiability Modulo Theory (SMT) solvers have been proposed for analyzing and verifying neural network properties, but their usage has been fairly limited owing to their poor scalability with larger networks. In this work, we propose a technique for combining gradient-based methods with symbolic techniques to scale such analyses and demonstrate its application for mo… ▽ More

    Submitted 5 May, 2021; v1 submitted 29 June, 2020; originally announced June 2020.

  12. arXiv:2001.00004  [pdf, ps, other

    cs.DS

    New Competitive Analysis Results of Online List Scheduling Algorithm

    Authors: Rakesh Mohanty, Debasis Dwibedy, Shreeya Swagatika Sahoo

    Abstract: Online algorithm has been an emerging area of interest for researchers in various domains of computer science. The online $m$-machine list scheduling problem introduced by Graham has gained theoretical as well as practical significance in the development of competitive analysis as a performance measure for online algorithms. In this paper, we study and explore the performance of Graham's online \t… ▽ More

    Submitted 28 December, 2019; originally announced January 2020.

    Comments: 9 pages, In Proceeding of the 14th Annual ADMA Conference 2018, India

  13. arXiv:1806.07259  [pdf, other

    cs.LG stat.ML

    Learning Equations for Extrapolation and Control

    Authors: Subham S. Sahoo, Christoph H. Lampert, Georg Martius

    Abstract: We present an approach to identify concise equations from data using a shallow neural network approach. In contrast to ordinary black-box regression, this approach allows understanding functional relations and generalizing them from observed data to unseen parts of the parameter space. We show how to extend the class of learnable equations for a recently proposed equation learning network to inclu… ▽ More

    Submitted 19 June, 2018; originally announced June 2018.

    Comments: 9 pages, 9 figures, ICML 2018

    MSC Class: 68T05; 68T30; 68T40; 62M20; 62J02; 65D15; 70E60; 93C40 ACM Class: I.2.6; I.2.8

  14. Mirrorless optical parametric oscillator inside an all-optical waveguide

    Authors: Sushree S Sahoo, Snigdha S Pati, Ashok K mohapatra

    Abstract: Mirrorless optical parametric oscillator (MOPO) is a consequence of intrinsic feedback provided by the nonlinearity in a medium due to the interaction of a pair of strong counter-propagating fields. As the name suggests, the device doesn't require a cavity for lasing other than the nonlinear medium. Here, we report the first demonstration of MOPO under the effect of an all-optical waveguide. The e… ▽ More

    Submitted 13 April, 2018; originally announced April 2018.

    Journal ref: Phys. Rev. A 98, 063838 (2018)

  15. arXiv:1606.08167  [pdf, other

    physics.optics quant-ph

    Study of optical nonlinearity of a highly dispersive medium using optical heterodyne detection technique

    Authors: Arup Bhowmick, Sushree S. Sahoo, Ashok K Mohapatra

    Abstract: We discuss the optical heterodyne detection technique to study the absorption and dispersion of a probe beam propagating through a medium with a narrow resonance. The technique has been demonstrated for Rydberg Electro-magnetically induced transparency (EIT) in rubidium thermal vapor and the optical non-linearity of a probe beam with variable intensity has been studied. A quantitative comparison o… ▽ More

    Submitted 27 June, 2016; originally announced June 2016.

    Journal ref: Phys. Rev. A 94, 023839 (2016)

  16. arXiv:1505.07768  [pdf, ps, other

    physics.ins-det hep-ex

    Characterizations of GEM detector prototype

    Authors: Rajendra Nath Patra, Amit Nanda, Sharmili Rudra, P. Bhattacharya, Sumanya Sekhar Sahoo, S. Biswas, B. Mohanty, T. K. Nayak, P. K. Sahu, S. Sahu

    Abstract: At NISER-IoP detector laboratory an initiative is taken to build and test Gas Electron Multiplier (GEM) detectors for ALICE experiment. The optimisation of the gas flow rate and the long-term stability test of the GEM detector are performed. The method and test results are presented.

    Submitted 26 May, 2015; originally announced May 2015.

    Comments: 3 Pages, 4 figures

    Journal ref: Nucl. Instrum. Meth. A 824 (2016) 501-503