Skip to main content

Showing 1–22 of 22 results for author: Pandey, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13175  [pdf, other

    cs.LG cs.AI

    Sparse High Rank Adapters

    Authors: Kartikeya Bhardwaj, Nilesh Prasad Pandey, Sweta Priyadarshi, Viswanath Ganapathy, Rafael Esteves, Shreya Kadambi, Shubhankar Borse, Paul Whatmough, Risheek Garrepalli, Mart Van Baalen, Harris Teague, Markus Nagel

    Abstract: Low Rank Adaptation (LoRA) has gained massive attention in the recent generative AI research. One of the main advantages of LoRA is its ability to be fused with pretrained models adding no overhead during inference. However, from a mobile deployment standpoint, we can either avoid inference overhead in the fused mode but lose the ability to switch adapters rapidly, or suffer significant (up to 30%… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:2406.08798  [pdf, other

    cs.CV

    FouRA: Fourier Low Rank Adaptation

    Authors: Shubhankar Borse, Shreya Kadambi, Nilesh Prasad Pandey, Kartikeya Bhardwaj, Viswanath Ganapathy, Sweta Priyadarshi, Risheek Garrepalli, Rafael Esteves, Munawar Hayat, Fatih Porikli

    Abstract: While Low-Rank Adaptation (LoRA) has proven beneficial for efficiently fine-tuning large models, LoRA fine-tuned text-to-image diffusion models lack diversity in the generated images, as the model tends to copy data from the observed training samples. This effect becomes more pronounced at higher values of adapter strength and for adapters with higher ranks which are fine-tuned on smaller datasets… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2406.02290  [pdf, other

    cs.LG

    A Study of Optimizations for Fine-tuning Large Language Models

    Authors: Arjun Singh, Nikhil Pandey, Anup Shirgaonkar, Pavan Manoj, Vijay Aski

    Abstract: Fine-tuning large language models is a popular choice among users trying to adapt them for specific applications. However, fine-tuning these models is a demanding task because the user has to examine several factors, such as resource budget, runtime, model size and context length among others. A specific challenge is that fine-tuning is memory intensive, imposing constraints on the required hardwa… ▽ More

    Submitted 6 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: 10 pages, 4 figures. Revised text for clarity, updated references

  4. arXiv:2403.18159  [pdf, other

    cs.LG cs.AI cs.CL

    Oh! We Freeze: Improving Quantized Knowledge Distillation via Signal Propagation Analysis for Large Language Models

    Authors: Kartikeya Bhardwaj, Nilesh Prasad Pandey, Sweta Priyadarshi, Kyunggeun Lee, Jun Ma, Harris Teague

    Abstract: Large generative models such as large language models (LLMs) and diffusion models have revolutionized the fields of NLP and computer vision respectively. However, their slow inference, high computation and memory requirement makes it challenging to deploy them on edge devices. In this study, we propose a light-weight quantization aware fine tuning technique using knowledge distillation (KD-QAT) to… ▽ More

    Submitted 28 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted at Practical ML for Low Resource Settings Workshop at ICLR 2024

  5. Feature Selection using the concept of Peafowl Mating in IDS

    Authors: Partha Ghosh, Joy Sharma, Nilesh Pandey

    Abstract: Cloud computing has high applicability as an Internet based service that relies on sharing computing resources. Cloud computing provides services that are Infrastructure based, Platform based and Software based. The popularity of this technology is due to its superb performance, high level of computing ability, low cost of services, scalability, availability and flexibility. The obtainability and… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

    Journal ref: International Journal of Computer Networks & Communications (IJCNC) Vol.16, No.1, January 2024

  6. arXiv:2402.00918  [pdf, other

    cs.CV cs.AI

    MUSTAN: Multi-scale Temporal Context as Attention for Robust Video Foreground Segmentation

    Authors: Praveen Kumar Pokala, Jaya Sai Kiran Patibandla, Naveen Kumar Pandey, Balakrishna Reddy Pailla

    Abstract: Video foreground segmentation (VFS) is an important computer vision task wherein one aims to segment the objects under motion from the background. Most of the current methods are image-based, i.e., rely only on spatial cues while ignoring motion cues. Therefore, they tend to overfit the training data and don't generalize well to out-of-domain (OOD) distribution. To solve the above problem, prior w… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 10 pages, 8 figures

  7. arXiv:2309.07230  [pdf, other

    cs.SE

    ESRO: Experience Assisted Service Reliability against Outages

    Authors: Sarthak Chakraborty, Shubham Agarwal, Shaddy Garg, Abhimanyu Sethia, Udit Narayan Pandey, Videh Aggarwal, Shiv Saini

    Abstract: Modern cloud services are prone to failures due to their complex architecture, making diagnosis a critical process. Site Reliability Engineers (SREs) spend hours leveraging multiple sources of data, including the alerts, error logs, and domain expertise through past experiences to locate the root cause(s). These experiences are documented as natural language text in outage reports for previous out… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: Accepted to 38th IEEE/ACM International Conference on Automated Software Engineering (ASE 2023)

  8. arXiv:2309.01729  [pdf, other

    cs.LG cs.AI cs.CV

    Softmax Bias Correction for Quantized Generative Models

    Authors: Nilesh Prasad Pandey, Marios Fournarakis, Chirag Patel, Markus Nagel

    Abstract: Post-training quantization (PTQ) is the go-to compression technique for large generative models, such as stable diffusion or large language models. PTQ methods commonly keep the softmax activation in higher precision as it has been shown to be very sensitive to quantization noise. However, this can lead to a significant runtime and power overhead during inference on resource-constraint edge device… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

  9. arXiv:2307.03589  [pdf, other

    math.NA cs.CE

    Nitsche method for Navier-Stokes equations with slip boundary conditions: Convergence analysis and VMS-LES stabilization

    Authors: Aparna Bansal, Nicolás Alejandro Barnafi, Dwijendra Narain Pandey

    Abstract: In this paper, we analyze the Nitsche's method for the stationary Navier-Stokes equations on Lipschitz domains under minimal regularity assumptions. Our analysis provides a robust formulation for implementing slip (i.e. Navier) boundary conditions in arbitrarily complex boundaries. The well-posedness of the discrete problem is established using the Banach Nečas Babuška and the Banach fixed point t… ▽ More

    Submitted 18 July, 2023; v1 submitted 7 July, 2023; originally announced July 2023.

    MSC Class: 65N30; 65N12; 65N15; 65J15; 80A20; 76D07

  10. arXiv:2302.05397  [pdf, other

    cs.LG

    A Practical Mixed Precision Algorithm for Post-Training Quantization

    Authors: Nilesh Prasad Pandey, Markus Nagel, Mart van Baalen, Yin Huang, Chirag Patel, Tijmen Blankevoort

    Abstract: Neural network quantization is frequently used to optimize model size, latency and power consumption for on-device deployment of neural networks. In many cases, a target bit-width is set for an entire network, meaning every layer get quantized to the same number of bits. However, for many networks some layers are significantly more robust to quantization noise than others, leaving an important axi… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

  11. arXiv:2212.14776  [pdf, ps, other

    cs.LG

    On the Interpretability of Attention Networks

    Authors: Lakshmi Narayan Pandey, Rahul Vashisht, Harish G. Ramaswamy

    Abstract: Attention mechanisms form a core component of several successful deep learning architectures, and are based on one key idea: ''The output depends only on a small (but unknown) segment of the input.'' In several practical applications like image captioning and language translation, this is mostly true. In trained models with an attention mechanism, the outputs of an intermediate module that encodes… ▽ More

    Submitted 14 May, 2023; v1 submitted 30 December, 2022; originally announced December 2022.

    Comments: ACML 2022,PMLR, Volume 189, https://proceedings.mlr.press/v189/pandey23a/pandey23a.pdf

    Journal ref: Proceedings of The 14th Asian Conference on Machine, 832--847, 2023, Volume:189; PMLR

  12. arXiv:2202.01828  [pdf, ps, other

    astro-ph.IM cs.DC

    Astronomical data organization, management and access in Scientific Data Lakes

    Authors: Y. G. Grange, V. N. Pandey, X. Espinal, R. Di Maria, A. P. Millar

    Abstract: The data volumes stored in telescope archives is constantly increasing due to the development and improvements in the instrumentation. Often the archives need to be stored over a distributed storage architecture, provided by independent compute centres. Such a distributed data archive requires overarching data management orchestration. Such orchestration comprises of tools which handle data storag… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

    Comments: 4 pages, 1 figure, to appear in the proceedings of Astronomical Data Analysis Software and Systems XXXI published by ASP

  13. arXiv:2105.06033  [pdf, other

    cs.CV

    Extreme Face Inpainting with Sketch-Guided Conditional GAN

    Authors: Nilesh Pandey, Andreas Savakis

    Abstract: Recovering badly damaged face images is a useful yet challenging task, especially in extreme cases where the masked or damaged region is very large. One of the major challenges is the ability of the system to generalize on faces outside the training dataset. We propose to tackle this extreme inpainting task with a conditional Generative Adversarial Network (GAN) that utilizes structural informatio… ▽ More

    Submitted 12 May, 2021; originally announced May 2021.

  14. arXiv:1909.02165  [pdf, other

    cs.CV cs.GR eess.IV

    Poly-GAN: Multi-Conditioned GAN for Fashion Synthesis

    Authors: Nilesh Pandey, Andreas Savakis

    Abstract: We present Poly-GAN, a novel conditional GAN architecture that is motivated by Fashion Synthesis, an application where garments are automatically placed on images of human models at an arbitrary pose. Poly-GAN allows conditioning on multiple inputs and is suitable for many tasks, including image alignment, image stitching, and inpainting. Existing methods have a similar pipeline where three differ… ▽ More

    Submitted 4 September, 2019; originally announced September 2019.

  15. arXiv:1907.02742  [pdf, other

    eess.IV cs.CV

    Adversarial Learning with Multiscale Features and Kernel Factorization for Retinal Blood Vessel Segmentation

    Authors: Farhan Akram, Vivek Kumar Singh, Hatem A. Rashwan, Mohamed Abdel-Nasser, Md. Mostafa Kamal Sarker, Nidhi Pandey, Domenec Puig

    Abstract: In this paper, we propose an efficient blood vessel segmentation method for the eye fundus images using adversarial learning with multiscale features and kernel factorization. In the generator network of the adversarial framework, spatial pyramid pooling, kernel factorization and squeeze excitation block are employed to enhance the feature representation in spatial domain on different scales with… ▽ More

    Submitted 5 July, 2019; originally announced July 2019.

    Comments: 9 pages, 4 figures

  16. arXiv:1907.00887  [pdf, other

    eess.IV cs.CV

    An Efficient Solution for Breast Tumor Segmentation and Classification in Ultrasound Images Using Deep Adversarial Learning

    Authors: Vivek Kumar Singh, Hatem A. Rashwan, Mohamed Abdel-Nasser, Md. Mostafa Kamal Sarker, Farhan Akram, Nidhi Pandey, Santiago Romani, Domenec Puig

    Abstract: This paper proposes an efficient solution for tumor segmentation and classification in breast ultrasound (BUS) images. We propose to add an atrous convolution layer to the conditional generative adversarial network (cGAN) segmentation model to learn tumor features at different resolutions of BUS images. To automatically re-balance the relative impact of each of the highest level encoded features,… ▽ More

    Submitted 1 July, 2019; originally announced July 2019.

    Comments: 9 pages

  17. arXiv:1809.01687  [pdf, other

    cs.CV

    Breast Tumor Segmentation and Shape Classification in Mammograms using Generative Adversarial and Convolutional Neural Network

    Authors: Vivek Kumar Singh, Hatem A. Rashwan, Santiago Romani, Farhan Akram, Nidhi Pandey, Md. Mostafa Kamal Sarker, Adel Saleh, Meritexell Arenas, Miguel Arquez, Domenec Puig, Jordina Torrents-Barrena

    Abstract: Mammogram inspection in search of breast tumors is a tough assignment that radiologists must carry out frequently. Therefore, image analysis methods are needed for the detection and delineation of breast masses, which portray crucial morphological information that will support reliable diagnosis. In this paper, we proposed a conditional Generative Adversarial Network (cGAN) devised to segment a br… ▽ More

    Submitted 23 October, 2018; v1 submitted 5 September, 2018; originally announced September 2018.

    Comments: 33 pages, Submitted to Expert Systems with Applications

  18. arXiv:1807.11433  [pdf, other

    cs.CV

    REFUGE CHALLENGE 2018-Task 2:Deep Optic Disc and Cup Segmentation in Fundus Images Using U-Net and Multi-scale Feature Matching Networks

    Authors: Vivek Kumar Singh, Hatem A. Rashwan, Adel Saleh, Farhan Akram, Md Mostafa Kamal Sarker, Nidhi Pandey, Saddam Abdulwahab

    Abstract: In this paper, an optic disc and cup segmentation method is proposed using U-Net followed by a multi-scale feature matching network. The proposed method targets task 2 of the REFUGE challenge 2018. In order to solve the segmentation problem of task 2, we firstly crop the input image using single shot multibox detector (SSD). The cropped image is then passed to an encoder-decoder network with skip… ▽ More

    Submitted 30 July, 2018; originally announced July 2018.

    Comments: EYE REFUGE CHALLENGE 2018, submitted 7 Pages

  19. arXiv:1806.03905  [pdf, other

    cs.CV

    Retinal Optic Disc Segmentation using Conditional Generative Adversarial Network

    Authors: Vivek Kumar Singh, Hatem Rashwan, Farhan Akram, Nidhi Pandey, Md. Mostaf Kamal Sarker, Adel Saleh, Saddam Abdulwahab, Najlaa Maaroof, Santiago Romani, Domenec Puig

    Abstract: This paper proposed a retinal image segmentation method based on conditional Generative Adversarial Network (cGAN) to segment optic disc. The proposed model consists of two successive networks: generator and discriminator. The generator learns to map information from the observing input (i.e., retinal fundus color image), to the output (i.e., binary mask). Then, the discriminator learns as a loss… ▽ More

    Submitted 11 June, 2018; originally announced June 2018.

    Comments: 8 pages, Submitted to 21st International Conference of the Catalan Association for Artificial Intelligence (CCIA 2018)

  20. arXiv:1805.10207  [pdf, other

    cs.CV

    Conditional Generative Adversarial and Convolutional Networks for X-ray Breast Mass Segmentation and Shape Classification

    Authors: Vivek Kumar Singh, Santiago Romani, Hatem A. Rashwan, Farhan Akram, Nidhi Pandey, Md. Mostafa Kamal Sarker, Jordina Torrents Barrena, Saddam Abdulwahab, Adel Saleh, Miguel Arquez, Meritxell Arenas, Domenec Puig

    Abstract: This paper proposes a novel approach based on conditional Generative Adversarial Networks (cGAN) for breast mass segmentation in mammography. We hypothesized that the cGAN structure is well-suited to accurately outline the mass area, especially when the training data is limited. The generative network learns intrinsic features of tumors while the adversarial network enforces segmentations to be si… ▽ More

    Submitted 10 June, 2018; v1 submitted 25 May, 2018; originally announced May 2018.

    Comments: 8 pages, Accepted at Medical Image Computing and Computer Assisted Intervention (MICCAI) 2018

  21. arXiv:1804.04226  [pdf

    cs.OH

    Increased Prediction Accuracy in the Game of Cricket using Machine Learning

    Authors: Kalpdrum Passi, Niravkumar Pandey

    Abstract: Player selection is one the most important tasks for any sport and cricket is no exception. The performance of the players depends on various factors such as the opposition team, the venue, his current form etc. The team management, the coach and the captain select 11 players for each match from a squad of 15 to 20 players. They analyze different characteristics and the statistics of the players t… ▽ More

    Submitted 9 April, 2018; originally announced April 2018.

    Journal ref: International Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.8, No.2, March 2018

  22. arXiv:1101.2573  [pdf

    cs.DC

    An Overview of Portable Distributed Techniques

    Authors: Sanjay Bansal, Nirved Pandey

    Abstract: In this paper, we reviewed of several portable parallel programming paradigms for use in a distributed programming environment. The Techniques reviewed here are portable. These are mainly distributing computing using MPI pure java based, MPI native java based (JNI) and PVM. We will discuss architecture and utilities of each technique based on our literature review. We explored these portable distr… ▽ More

    Submitted 13 January, 2011; originally announced January 2011.

    Comments: International Journal of Computer Science Issues online at http://www.ijcsi.org

    Journal ref: IJCSI, Volume 7, Issue 3, May 2010