Skip to main content

Showing 1–20 of 20 results for author: Shanbhag, N

.
  1. arXiv:2302.01375  [pdf, other

    cs.LG cs.AI

    On the Robustness of Randomized Ensembles to Adversarial Perturbations

    Authors: Hassan Dbouk, Naresh R. Shanbhag

    Abstract: Randomized ensemble classifiers (RECs), where one classifier is randomly selected during inference, have emerged as an attractive alternative to traditional ensembling methods for realizing adversarially robust classifiers with limited compute requirements. However, recent works have shown that existing methods for constructing RECs are more vulnerable than initially claimed, casting major doubts… ▽ More

    Submitted 28 May, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: Published as a conference paper in ICML 2023

  2. arXiv:2210.08974  [pdf

    cs.CY

    Coordinated Science Laboratory 70th Anniversary Symposium: The Future of Computing

    Authors: Klara Nahrstedt, Naresh Shanbhag, Vikram Adve, Nancy Amato, Romit Roy Choudhury, Carl Gunter, Nam Sung Kim, Olgica Milenkovic, Sayan Mitra, Lav Varshney, Yurii Vlasov, Sarita Adve, Rashid Bashir, Andreas Cangellaris, James DiCarlo, Katie Driggs-Campbell, Nick Feamster, Mattia Gazzola, Karrie Karahalios, Sanmi Koyejo, Paul Kwiat, Bo Li, Negar Mehr, Ravish Mehra, Andrew Miller , et al. (3 additional authors not shown)

    Abstract: In 2021, the Coordinated Science Laboratory CSL, an Interdisciplinary Research Unit at the University of Illinois Urbana-Champaign, hosted the Future of Computing Symposium to celebrate its 70th anniversary. CSL's research covers the full computing stack, computing's impact on society and the resulting need for social responsibility. In this white paper, we summarize the major technological points… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

  3. arXiv:2206.06737  [pdf, other

    cs.LG cs.CR cs.CV

    Adversarial Vulnerability of Randomized Ensembles

    Authors: Hassan Dbouk, Naresh R. Shanbhag

    Abstract: Despite the tremendous success of deep neural networks across various tasks, their vulnerability to imperceptible adversarial perturbations has hindered their deployment in the real world. Recently, works on randomized ensembles have empirically demonstrated significant improvements in adversarial robustness over standard adversarially trained (AT) models with minimal computational overhead, makin… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: Published as a conference paper in ICML 2022

  4. arXiv:2110.14871  [pdf, other

    cs.LG cs.CV

    Generalized Depthwise-Separable Convolutions for Adversarially Robust and Efficient Neural Networks

    Authors: Hassan Dbouk, Naresh R. Shanbhag

    Abstract: Despite their tremendous successes, convolutional neural networks (CNNs) incur high computational/storage costs and are vulnerable to adversarial perturbations. Recent works on robust model compression address these challenges by combining model compression techniques with adversarial training. But these methods are unable to improve throughput (frames-per-second) on real-life hardware while simul… ▽ More

    Submitted 6 November, 2021; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021 (Spotlight)

  5. arXiv:2105.14710  [pdf, other

    cs.LG stat.ML

    Robustifying $\ell_\infty$ Adversarial Training to the Union of Perturbation Models

    Authors: Ameya D. Patil, Michael Tuttle, Alexander G. Schwing, Naresh R. Shanbhag

    Abstract: Classical adversarial training (AT) frameworks are designed to achieve high adversarial accuracy against a single attack type, typically $\ell_\infty$ norm-bounded perturbations. Recent extensions in AT have focused on defending against the union of multiple perturbations but this benefit is obtained at the expense of a significant (up to $10\times$) increase in training complexity over single-att… ▽ More

    Submitted 11 June, 2021; v1 submitted 31 May, 2021; originally announced May 2021.

  6. arXiv:2012.13645  [pdf, other

    cs.AR eess.SP

    Fundamental Limits on Energy-Delay-Accuracy of In-memory Architectures in Inference Applications

    Authors: Sujan Kumar Gonugondla, Charbel Sakr, Hassan Dbouk, Naresh R. Shanbhag

    Abstract: This paper obtains fundamental limits on the computational precision of in-memory computing architectures (IMCs). An IMC noise model and associated SNR metrics are defined and their interrelationships analyzed to show that the accuracy of IMCs is fundamentally limited by the compute SNR ($\text{SNR}_{\text{a}}$) of its analog core, and that activation, weight and output precision needs to be assig… ▽ More

    Submitted 25 December, 2020; originally announced December 2020.

    Comments: 14 pages, 13 figures

  7. arXiv:2007.09818  [pdf, other

    cs.CV cs.LG stat.ML

    DBQ: A Differentiable Branch Quantizer for Lightweight Deep Neural Networks

    Authors: Hassan Dbouk, Hetul Sanghvi, Mahesh Mehendale, Naresh Shanbhag

    Abstract: Deep neural networks have achieved state-of-the art performance on various computer vision tasks. However, their deployment on resource-constrained devices has been hindered due to their high computational and storage complexity. While various complexity reduction techniques, such as lightweight network architecture design and parameter quantization, have been successful in reducing the cost of im… ▽ More

    Submitted 19 July, 2020; originally announced July 2020.

    Comments: Published as a conference paper in ECCV 2020

  8. arXiv:2005.02434  [pdf

    cs.CY cs.ET

    Nanotechnology-inspired Information Processing Systems of the Future

    Authors: Randy Bryant, Mark Hill, Tom Kazior, Daniel Lee, Jie Liu, Klara Nahrstedt, Vijay Narayanan, Jan Rabaey, Hava Siegelmann, Naresh Shanbhag, Naveen Verma, H. -S. Philip Wong

    Abstract: Nanoscale semiconductor technology has been a key enabler of the computing revolution. It has done so via advances in new materials and manufacturing processes that resulted in the size of the basic building block of computing systems - the logic switch and memory devices - being reduced into the nanoscale regime. Nanotechnology has provided increased computing functionality per unit volume, energ… ▽ More

    Submitted 5 May, 2020; originally announced May 2020.

    Comments: A Computing Community Consortium (CCC) workshop report, 18 pages

    Report number: ccc2016report_3

  9. arXiv:2002.09786  [pdf, other

    cs.LG cs.CV stat.ML

    HarDNN: Feature Map Vulnerability Evaluation in CNNs

    Authors: Abdulrahman Mahmoud, Siva Kumar Sastry Hari, Christopher W. Fletcher, Sarita V. Adve, Charbel Sakr, Naresh Shanbhag, Pavlo Molchanov, Michael B. Sullivan, Timothy Tsai, Stephen W. Keckler

    Abstract: As Convolutional Neural Networks (CNNs) are increasingly being employed in safety-critical applications, it is important that they behave reliably in the face of hardware errors. Transient hardware errors may percolate undesirable state during execution, resulting in software-manifested errors which can adversely affect high-level decision making. This paper presents HarDNN, a software-directed ap… ▽ More

    Submitted 25 February, 2020; v1 submitted 22 February, 2020; originally announced February 2020.

    Comments: 14 pages, 5 figures, a short version accepted for publication in First Workshop on Secure and Resilient Autonomy (SARA) co-located with MLSys2020

  10. arXiv:1901.06588  [pdf, other

    cs.LG stat.ML

    Accumulation Bit-Width Scaling For Ultra-Low Precision Training Of Deep Networks

    Authors: Charbel Sakr, Naigang Wang, Chia-Yu Chen, Jungwook Choi, Ankur Agrawal, Naresh Shanbhag, Kailash Gopalakrishnan

    Abstract: Efforts to reduce the numerical precision of computations in deep learning training have yielded systems that aggressively quantize weights and activations, yet employ wide high-precision accumulators for partial sums in inner-product operations to preserve the quality of convergence. The absence of any framework to analyze the precision requirements of partial sum accumulations results in conserv… ▽ More

    Submitted 19 January, 2019; originally announced January 2019.

    Comments: Published as a conference paper in ICLR 2019

  11. arXiv:1812.11732  [pdf, other

    cs.LG stat.ML

    Per-Tensor Fixed-Point Quantization of the Back-Propagation Algorithm

    Authors: Charbel Sakr, Naresh Shanbhag

    Abstract: The high computational and parameter complexity of neural networks makes their training very slow and difficult to deploy on energy and storage-constrained computing systems. Many network complexity reduction techniques have been proposed including fixed-point implementation. However, a systematic approach for designing full fixed-point training and inference of deep neural networks remains elusiv… ▽ More

    Submitted 31 December, 2018; originally announced December 2018.

    Comments: Published as a conference paper in ICLR 2019

  12. Generalized Water-filling for Source-aware Energy-efficient SRAMs

    Authors: Yongjune Kim, Mingu Kang, Lav R. Varshney, Naresh R. Shanbhag

    Abstract: Conventional low-power static random access memories (SRAMs) reduce read energy by decreasing the bit-line voltage swings uniformly across the bit-line columns. This is because the read energy is proportional to the bit-line swings. On the other hand, bit-line swings are limited by the need to avoid decision errors especially in the most significant bits. We propose an information-theoretic approa… ▽ More

    Submitted 29 November, 2017; v1 submitted 19 October, 2017; originally announced October 2017.

  13. arXiv:1702.06119  [pdf

    cs.ET

    Shannon-inspired Statistical Computing to Enable Spintronics

    Authors: Ameya D. Patil, Sasikanth Manipatruni, Dmitri Nikonov, Ian A. Young, Naresh R. Shanbhag

    Abstract: Modern computing systems based on the von Neumann architecture are built from silicon complementary metal oxide semiconductor (CMOS) transistors that need to operate under practically error free conditions with 1 error in $10^{15}$ switching events. The physical dimensions of CMOS transistors have scaled down over the past five decades leading to exponential increases in functional density and ene… ▽ More

    Submitted 19 February, 2017; originally announced February 2017.

  14. arXiv:1611.03109  [pdf, other

    cs.LG cs.AR

    Energy-efficient Machine Learning in Silicon: A Communications-inspired Approach

    Authors: Naresh R. Shanbhag

    Abstract: This position paper advocates a communications-inspired approach to the design of machine learning systems on energy-constrained embedded `always-on' platforms. The communications-inspired approach has two versions - 1) a deterministic version where existing low-power communication IC design methods are repurposed, and 2) a stochastic version referred to as Shannon-inspired statistical information… ▽ More

    Submitted 25 October, 2016; originally announced November 2016.

    Comments: This paper was presented at the 2016 ICML Workshop on On-Device Intelligence, June 24, 2016

  15. arXiv:1610.07501  [pdf, other

    cs.AR

    A 481pJ/decision 3.4M decision/s Multifunctional Deep In-memory Inference Processor using Standard 6T SRAM Array

    Authors: Mingu Kang, Sujan Gonugondla, Ameya Patil, Naresh Shanbhag

    Abstract: This paper describes a multi-functional deep in-memory processor for inference applications. Deep in-memory processing is achieved by embedding pitch-matched low-SNR analog processing into a standard 6T 16KB SRAM array in 65 nm CMOS. Four applications are demonstrated. The prototype achieves up to 5.6X (9.7X estimated for multi-bank scenario) energy savings with negligible (<1%) accuracy degradati… ▽ More

    Submitted 24 October, 2016; originally announced October 2016.

  16. arXiv:1607.07804  [pdf, other

    cs.LG

    Error-Resilient Machine Learning in Near Threshold Voltage via Classifier Ensemble

    Authors: Sai Zhang, Naresh Shanbhag

    Abstract: In this paper, we present the design of error-resilient machine learning architectures by employing a distributed machine learning framework referred to as classifier ensemble (CE). CE combines several simple classifiers to obtain a strong one. In contrast, centralized machine learning employs a single complex block. We compare the random forest (RF) and the support vector machine (SVM), which are… ▽ More

    Submitted 3 July, 2016; originally announced July 2016.

  17. arXiv:1607.00669  [pdf, other

    stat.ML cs.LG

    Understanding the Energy and Precision Requirements for Online Learning

    Authors: Charbel Sakr, Ameya Patil, Sai Zhang, Yongjune Kim, Naresh Shanbhag

    Abstract: It is well-known that the precision of data, hyperparameters, and internal representations employed in learning systems directly impacts its energy, throughput, and latency. The precision requirements for the training algorithm are also important for systems that learn on-the-fly. Prior work has shown that the data and hyperparameters can be quantized heavily without incurring much penalty in clas… ▽ More

    Submitted 26 August, 2016; v1 submitted 3 July, 2016; originally announced July 2016.

    Comments: 14 pages, 5 figures 4 of which have 2 subfigures

  18. arXiv:1607.00667  [pdf, other

    cs.AR cs.ET

    Reducing the Energy Cost of Inference via In-sensor Information Processing

    Authors: Sai Zhang, Mingu Kang, Charbel Sakr, Naresh Shanbhag

    Abstract: There is much interest in incorporating inference capabilities into sensor-rich embedded platforms such as autonomous vehicles, wearables, and others. A central problem in the design of such systems is the need to extract information locally from sensed data on a severely limited energy budget. This necessitates the design of energy-efficient sensory embedded system. A typical sensory embedded sys… ▽ More

    Submitted 3 July, 2016; originally announced July 2016.

  19. arXiv:1109.5600  [pdf, ps, other

    math.PR

    Some new approaches to infinite divisibility

    Authors: Theofanis Sapatinas, Damodar N. Shanbhag, Arjun K. Gupta

    Abstract: Using an approach based, amongst other things, on Proposition 1 of Kaluza (1928), Goldie (1967) and, using a different approach based especially on zeros of polynomials, Steutel (1967) have proved that each nondegenerate distribution function (d.f.) $F$ (on $\RR$, the real line), satisfying $F(0-) = 0$ and $F(x) = F(0) + (1-F(0)) G(x)$, $x > 0$, where $G$ is the d.f. corresponding to a mixture of… ▽ More

    Submitted 26 September, 2011; originally announced September 2011.

    Comments: 18 pages, no figures, To appear in the Electronic Journal of Probability

    MSC Class: Primary 60E05; Secondary 62E10

    Journal ref: Electronic Journal of Probability, Vol. 16, 2359-2374 (2011)

  20. arXiv:0909.5289  [pdf, other

    math.ST

    Moment properties of multivariate infinitely divisible laws and criteria for self-decomposability

    Authors: Theofanis Sapatinas, Damodar N. Shanbhag

    Abstract: Ramachandran (1969, Theorem 8) has shown that for any univariate infinitely divisible distribution and any positive real number $α$, an absolute moment of order $α$ relative to the distribution exists (as a finite number) if and only if this is so for a certain truncated version of the corresponding L$\acute{\rm e}$vy measure. A generalized version of this result in the case of multivariate infi… ▽ More

    Submitted 29 September, 2009; originally announced September 2009.

    Comments: 22 pages (To appear in: Journal of Multivariate Analysis)

    MSC Class: 60E07 (primary); 60E05; 60G51; 62H10 (secondary)

    Journal ref: Journal of Multivariate Analysis, Vol. 101, 500-511, (2010)