Skip to main content

Showing 1–2 of 2 results for author: Saikumar, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.03687  [pdf, other

    cs.LG cs.CV

    DRIVE: Dual Gradient-Based Rapid Iterative Pruning

    Authors: Dhananjay Saikumar, Blesson Varghese

    Abstract: Modern deep neural networks (DNNs) consist of millions of parameters, necessitating high-performance computing during training and inference. Pruning is one solution that significantly reduces the space and time complexities of DNNs. Traditional pruning methods that are applied post-training focus on streamlining inference, but there are recent efforts to leverage sparsity early on by pruning befo… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  2. arXiv:2402.14139  [pdf, other

    cs.LG

    NeuroFlux: Memory-Efficient CNN Training Using Adaptive Local Learning

    Authors: Dhananjay Saikumar, Blesson Varghese

    Abstract: Efficient on-device Convolutional Neural Network (CNN) training in resource-constrained mobile and edge environments is an open challenge. Backpropagation is the standard approach adopted, but it is GPU memory intensive due to its strong inter-layer dependencies that demand intermediate activations across the entire CNN model to be retained in GPU memory. This necessitates smaller batch sizes to m… ▽ More

    Submitted 4 March, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted to EuroSys 2024