Skip to main content

Showing 1–50 of 73 results for author: Dubey, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.10170  [pdf, other

    cs.AR

    A Mess of Memory System Benchmarking, Simulation and Application Profiling

    Authors: Pouya Esmaili-Dokht, Francesco Sgherzi, Valeria Soldera Girelli, Isaac Boixaderas, Mariana Carmin, Alireza Momeni, Adria Armejach, Estanislao Mercadal, German Llort, Petar Radojkovic, Miquel Moreto, Judit Gimenez, Xavier Martorell, Eduard Ayguade, Jesus Labarta, Emanuele Confalonieri, Rishabh Dubey, Jason Adlard

    Abstract: The Memory stress (Mess) framework provides a unified view of the memory system benchmarking, simulation and application profiling. The Mess benchmark provides a holistic and detailed memory system characterization. It is based on hundreds of measurements that are represented as a family of bandwidth--latency curves. The benchmark increases the coverage of all the previous tools and leads to new f… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 17 pages

  2. arXiv:2404.13252  [pdf, other

    cs.CV cs.LG eess.IV

    3D-Convolution Guided Spectral-Spatial Transformer for Hyperspectral Image Classification

    Authors: Shyam Varahagiri, Aryaman Sinha, Shiv Ram Dubey, Satish Kumar Singh

    Abstract: In recent years, Vision Transformers (ViTs) have shown promising classification performance over Convolutional Neural Networks (CNNs) due to their self-attention mechanism. Many researchers have incorporated ViTs for Hyperspectral Image (HSI) classification. HSIs are characterised by narrow contiguous spectral bands, providing rich spectral data. Although ViTs excel with sequential data, they cann… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: Accepted in IEEE Conference on Artificial Intelligence, 2024

  3. arXiv:2401.15366  [pdf, other

    cs.CV eess.IV

    Face to Cartoon Incremental Super-Resolution using Knowledge Distillation

    Authors: Trinetra Devkatte, Shiv Ram Dubey, Satish Kumar Singh, Abdenour Hadid

    Abstract: Facial super-resolution/hallucination is an important area of research that seeks to enhance low-resolution facial images for a variety of applications. While Generative Adversarial Networks (GANs) have shown promise in this area, their ability to adapt to new, unseen data remains a challenge. This paper addresses this problem by proposing an incremental super-resolution using GANs with knowledge… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

  4. arXiv:2401.15362  [pdf, other

    cs.CV

    Transformer-based Clipped Contrastive Quantization Learning for Unsupervised Image Retrieval

    Authors: Ayush Dubey, Shiv Ram Dubey, Satish Kumar Singh, Wei-Ta Chu

    Abstract: Unsupervised image retrieval aims to learn the important visual characteristics without any given level to retrieve the similar images for a given query image. The Convolutional Neural Network (CNN)-based approaches have been extensively exploited with self-supervised contrastive learning for image hashing. However, the existing approaches suffer due to lack of effective utilization of global feat… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

  5. arXiv:2312.01999  [pdf, other

    eess.IV cs.CV

    SRTransGAN: Image Super-Resolution using Transformer based Generative Adversarial Network

    Authors: Neeraj Baghel, Shiv Ram Dubey, Satish Kumar Singh

    Abstract: Image super-resolution aims to synthesize high-resolution image from a low-resolution image. It is an active area to overcome the resolution limitations in several applications like low-resolution object-recognition, medical image enhancement, etc. The generative adversarial network (GAN) based methods have been the state-of-the-art for image super-resolution by utilizing the convolutional neural… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  6. LocaliseBot: Multi-view 3D object localisation with differentiable rendering for robot gras**

    Authors: Sujal Vijayaraghavan, Redwan Alqasemi, Rajiv Dubey, Sudeep Sarkar

    Abstract: Robot grasp typically follows five stages: object detection, object localisation, object pose estimation, grasp pose estimation, and grasp planning. We focus on object pose estimation. Our approach relies on three pieces of information: multiple views of the object, the camera's extrinsic parameters at those viewpoints, and 3D CAD models of objects. The first step involves a standard deep learning… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  7. arXiv:2310.13216  [pdf, other

    eess.IV cs.CV

    PTSR: Patch Translator for Image Super-Resolution

    Authors: Neeraj Baghel, Shiv Ram Dubey, Satish Kumar Singh

    Abstract: Image super-resolution generation aims to generate a high-resolution image from its low-resolution image. However, more complex neural networks bring high computational costs and memory storage. It is still an active area for offering the promise of overcoming resolution limitations in many applications. In recent years, transformers have made significant progress in computer vision tasks as their… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  8. arXiv:2306.15318  [pdf

    cs.CV cs.AI

    Towards predicting Pedestrian Evacuation Time and Density from Floorplans using a Vision Transformer

    Authors: Patrick Berggold, Stavros Nousias, Rohit K. Dubey, André Borrmann

    Abstract: Conventional pedestrian simulators are inevitable tools in the design process of a building, as they enable project engineers to prevent overcrowding situations and plan escape routes for evacuation. However, simulation runtime and the multiple cumbersome steps in generating simulation results are potential bottlenecks during the building design process. Data-driven approaches have demonstrated th… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

  9. arXiv:2302.08641  [pdf, other

    cs.CV eess.IV

    Transformer-based Generative Adversarial Networks in Computer Vision: A Comprehensive Survey

    Authors: Shiv Ram Dubey, Satish Kumar Singh

    Abstract: Generative Adversarial Networks (GANs) have been very successful for synthesizing the images in a given dataset. The artificially generated images by GANs are very realistic. The GANs have shown potential usability in several computer vision applications, including image generation, image-to-image translation, video synthesis, and others. Conventionally, the generator network is the backbone of GA… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

  10. arXiv:2302.07245  [pdf, other

    cs.CV

    WSD: Wild Selfie Dataset for Face Recognition in Selfie Images

    Authors: Laxman Kumarapu, Shiv Ram Dubey, Snehasis Mukherjee, Parkhi Mohan, Sree Pragna Vinnakoti, Subhash Karthikeya

    Abstract: With the rise of handy smart phones in the recent years, the trend of capturing selfie images is observed. Hence efficient approaches are required to be developed for recognising faces in selfie images. Due to the short distance between the camera and face in selfie images, and the different visual effects offered by the selfie apps, face recognition becomes more challenging with existing approach… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

  11. arXiv:2212.13375  [pdf

    eess.SP cs.AI cs.LG

    Power Quality Event Recognition and Classification Using an Online Sequential Extreme Learning Machine Network based on Wavelets

    Authors: Rahul Kumar Dubey

    Abstract: Reduced system dependability and higher maintenance costs may be the consequence of poor electric power quality, which can disturb normal equipment performance, speed up aging, and even cause outright failures. This study implements and tests a prototype of an Online Sequential Extreme Learning Machine (OS-ELM) classifier based on wavelets for detecting power quality problems under transient condi… ▽ More

    Submitted 27 December, 2022; originally announced December 2022.

  12. arXiv:2210.06364  [pdf, other

    cs.CV

    AdaNorm: Adaptive Gradient Norm Correction based Optimizer for CNNs

    Authors: Shiv Ram Dubey, Satish Kumar Singh, Bidyut Baran Chaudhuri

    Abstract: The stochastic gradient descent (SGD) optimizers are generally used to train the convolutional neural networks (CNNs). In recent years, several adaptive momentum based SGD optimizers have been introduced, such as Adam, diffGrad, Radam and AdaBelief. However, the existing SGD optimizers do not exploit the gradient norm of past iterations and lead to poor convergence and performance. In this paper,… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2023

  13. arXiv:2210.03734  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    T2CI-GAN: Text to Compressed Image generation using Generative Adversarial Network

    Authors: Bulla Rajesh, Nandakishore Dusa, Mohammed Javed, Shiv Ram Dubey, P. Nagabhushan

    Abstract: The problem of generating textual descriptions for the visual data has gained research attention in the recent years. In contrast to that the problem of generating visual data from textual descriptions is still very challenging, because it requires the combination of both Natural Language Processing (NLP) and Computer Vision techniques. The existing methods utilize the Generative Adversarial Netwo… ▽ More

    Submitted 1 October, 2022; originally announced October 2022.

    Comments: Accepted for publication at IAPR's 6th CVIP 2022

  14. arXiv:2207.09070  [pdf, other

    cs.CV

    Context Unaware Knowledge Distillation for Image Retrieval

    Authors: Bytasandram Yaswanth Reddy, Shiv Ram Dubey, Rakesh Kumar Sanodiya, Ravi Ranjan Prasad Karn

    Abstract: Existing data-dependent hashing methods use large backbone networks with millions of parameters and are computationally complex. Existing knowledge distillation methods use logits and other features of the deep (teacher) model and as knowledge for the compact (student) model, which requires the teacher's network to be fine-tuned on the context in parallel with the student model on the context. Tra… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: Accepted in International Conference on Computer Vision and Machine Intelligence (CVMI), 2022

  15. arXiv:2207.09066  [pdf, other

    cs.CV

    Moment Centralization based Gradient Descent Optimizers for Convolutional Neural Networks

    Authors: Sumanth Sadu, Shiv Ram Dubey, SR Sreeja

    Abstract: Convolutional neural networks (CNNs) have shown very appealing performance for many computer vision applications. The training of CNNs is generally performed using stochastic gradient descent (SGD) based optimization techniques. The adaptive momentum-based SGD optimizers are the recent trends. However, the existing optimizers are not able to maintain a zero mean in the first-order moment and strug… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: Accepted in International Conference on Computer Vision and Machine Intelligence (CVMI), 2022

  16. arXiv:2206.15129  [pdf, other

    cs.AI cs.HC cs.LG

    Personalized Detection of Cognitive Biases in Actions of Users from Their Logs: Anchoring and Recency Biases

    Authors: Atanu R Sinha, Navita Goyal, Sunny Dhamnani, Tanay Asija, Raja K Dubey, M V Kaarthik Raja, Georgios Theocharous

    Abstract: Cognitive biases are mental shortcuts humans use in dealing with information and the environment, and which result in biased actions and behaviors (or, actions), unbeknownst to themselves. Biases take many forms, with cognitive biases occupying a central role that inflicts fairness, accountability, transparency, ethics, law, medicine, and discrimination. Detection of biases is considered a necessa… ▽ More

    Submitted 1 July, 2022; v1 submitted 30 June, 2022; originally announced June 2022.

  17. arXiv:2206.02518  [pdf

    cs.CE

    A Model for Predicting Ignition Potential of Complex Fuel in Diurnally Variable Environment

    Authors: Saurabh Saxena, Ritambhara Dubey, Neda Yaghoobian

    Abstract: Fuel ignition potential is one of the primary drivers influencing the extent of damage in wildland and wildland-urban interface fires. Determining fire and ember exposure of fuels that vary spatially and temporally will help to recognize necessary defensive actions and reduce damages. In this paper, the development of a new computational model, Temperature And Moisture Evolution predictor for comp… ▽ More

    Submitted 16 January, 2023; v1 submitted 8 May, 2022; originally announced June 2022.

  18. arXiv:2205.05967  [pdf, other

    cs.CV

    Target Aware Network Architecture Search and Compression for Efficient Knowledge Transfer

    Authors: S. H. Shabbeer Basha, Debapriya Tula, Sravan Kumar Vinakota, Shiv Ram Dubey

    Abstract: Transfer Learning enables Convolutional Neural Networks (CNN) to acquire knowledge from a source domain and transfer it to a target domain, where collecting large-scale annotated examples is time-consuming and expensive. Conventionally, while transferring the knowledge learned from one task to another task, the deeper layers of a pre-trained CNN are finetuned over the target dataset. However, thes… ▽ More

    Submitted 24 January, 2024; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: This paper is accepted for publication in Multimedia Systems Journal

  19. arXiv:2204.06608  [pdf, other

    cs.LG

    Modularity benefits reinforcement learning agents with competing homeostatic drives

    Authors: Zack Dulberg, Rachit Dubey, Isabel M. Berwian, Jonathan D. Cohen

    Abstract: The problem of balancing conflicting needs is fundamental to intelligence. Standard reinforcement learning algorithms maximize a scalar reward, which requires combining different objective-specific rewards into a single number. Alternatively, different objectives could also be combined at the level of action value, such that specialist modules responsible for different objectives submit different… ▽ More

    Submitted 13 April, 2022; originally announced April 2022.

    Comments: 4 pages, accepted paper at the Multi-disciplinary Conference on Reinforcement Learning and Decision Making (RLDM) 2022

  20. HRel: Filter Pruning based on High Relevance between Activation Maps and Class Labels

    Authors: CH Sarvani, Mrinmoy Ghorai, Shiv Ram Dubey, SH Shabbeer Basha

    Abstract: This paper proposes an Information Bottleneck theory based filter pruning method that uses a statistical measure called Mutual Information (MI). The MI between filters and class labels, also called \textit{Relevance}, is computed using the filter's activation maps and the annotations. The filters having High Relevance (HRel) are considered to be more important. Consequently, the least important fi… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

    Journal ref: "Neural Networks Volume 147, March 2022, Pages 186-197 " https://www.sciencedirect.com/science/article/abs/pii/S0893608021004962

  21. arXiv:2201.00947  [pdf, other

    cs.CV eess.IV

    HWRCNet: Handwritten Word Recognition in JPEG Compressed Domain using CNN-BiLSTM Network

    Authors: Bulla Rajesh, Abhishek Kumar Gupta, Ayush Raj, Mohammed Javed, Shiv Ram Dubey

    Abstract: Handwritten word recognition from document images using deep learning is an active research area in the field of Document Image Analysis and Recognition. In the present era of Big data, since more and more documents are being generated and archived in the compressed form to provide better storage and transmission efficiencies, the problem of word recognition in the respective compressed domain wit… ▽ More

    Submitted 17 February, 2023; v1 submitted 3 January, 2022; originally announced January 2022.

    Comments: Accepted in International Conference on Data Analytics and Learning, 2022

  22. arXiv:2112.01845  [pdf, other

    cs.CV eess.IV

    Semantic Map Injected GAN Training for Image-to-Image Translation

    Authors: Balaram Singh Kshatriya, Shiv Ram Dubey, Himangshu Sarma, Kunal Chaudhary, Meva Ram Gurjar, Rahul Rai, Sunny Manchanda

    Abstract: Image-to-image translation is the recent trend to transform images from one domain to another domain using generative adversarial network (GAN). The existing GAN models perform the training by only utilizing the input and output modalities of transformation. In this paper, we perform the semantic injected training of GAN models. Specifically, we train with original input and output modalities and… ▽ More

    Submitted 3 December, 2021; originally announced December 2021.

    Comments: Accepted in Fourth Workshop on Computer Vision Applications (WCVA) at ICVGIP 2021

  23. arXiv:2109.14545  [pdf, other

    cs.LG cs.NE

    Activation Functions in Deep Learning: A Comprehensive Survey and Benchmark

    Authors: Shiv Ram Dubey, Satish Kumar Singh, Bidyut Baran Chaudhuri

    Abstract: Neural networks have shown tremendous growth in recent years to solve numerous problems. Various types of neural networks have been introduced to deal with different types of problems. However, the main goal of any neural network is to transform the non-linearly separable input data into more linearly separable abstract features using a hierarchy of layers. These layers are combinations of linear… ▽ More

    Submitted 28 June, 2022; v1 submitted 29 September, 2021; originally announced September 2021.

    Comments: Accepted in Neurocomputing, Elsevier

  24. arXiv:2109.12564  [pdf, other

    cs.CV

    Vision Transformer Hashing for Image Retrieval

    Authors: Shiv Ram Dubey, Satish Kumar Singh, Wei-Ta Chu

    Abstract: Deep learning has shown a tremendous growth in hashing techniques for image retrieval. Recently, Transformer has emerged as a new architecture by utilizing self-attention without convolution. Transformer is also extended to Vision Transformer (ViT) for the visual recognition with a promising performance on ImageNet. In this paper, we propose a Vision Transformer based Hashing (VTS) for image retri… ▽ More

    Submitted 22 March, 2022; v1 submitted 26 September, 2021; originally announced September 2021.

    Comments: Accepted in IEEE International Conference on Multimedia and Expo (ICME), 2022

  25. arXiv:2109.12556  [pdf, other

    cs.CV

    Frequency Disentangled Residual Network

    Authors: Satya Rajendra Singh, Roshan Reddy Yedla, Shiv Ram Dubey, Rakesh Sanodiya, Wei-Ta Chu

    Abstract: Residual networks (ResNets) have been utilized for various computer vision and image processing applications. The residual connection improves the training of the network with better gradient flow. A residual block consists of few convolutional layers having trainable parameters, which leads to overfitting. Moreover, the present residual networks are not able to utilize the high and low frequency… ▽ More

    Submitted 30 January, 2022; v1 submitted 26 September, 2021; originally announced September 2021.

  26. arXiv:2109.12504  [pdf, other

    cs.LG math.OC

    AdaInject: Injection Based Adaptive Gradient Descent Optimizers for Convolutional Neural Networks

    Authors: Shiv Ram Dubey, S. H. Shabbeer Basha, Satish Kumar Singh, Bidyut Baran Chaudhuri

    Abstract: The convolutional neural networks (CNNs) are generally trained using stochastic gradient descent (SGD) based optimization techniques. The existing SGD optimizers generally suffer with the overshooting of the minimum and oscillation near minimum. In this paper, we propose a new approach, hereafter referred as AdaInject, for the gradient descent optimizers by injecting the second order moment into t… ▽ More

    Submitted 18 September, 2022; v1 submitted 26 September, 2021; originally announced September 2021.

    Comments: Accepted By IEEE Transactions on Artificial Intelligence

  27. arXiv:2109.00862  [pdf, other

    cs.RO cs.AI

    VORRT-COLREGs: A Hybrid Velocity Obstacles and RRT Based COLREGs-Compliant Path Planner for Autonomous Surface Vessels

    Authors: Rahul Dubey, Sushil J Louis

    Abstract: This paper presents VORRT-COLREGs, a hybrid technique that combines velocity obstacles (VO) and rapidly-exploring random trees (RRT) to generate safe trajectories for autonomous surface vessels (ASVs) while following nautical rules of the road. RRT generates a set of way points and the velocity obstacles method ensures safe travel between way points. We also ensure that the actions of ASVs do not… ▽ More

    Submitted 2 September, 2021; originally announced September 2021.

    Comments: Accepted in IEEE OCEANS 2021

  28. arXiv:2105.13067  [pdf, other

    eess.IV cs.CV

    Efficient High-Resolution Image-to-Image Translation using Multi-Scale Gradient U-Net

    Authors: Kumarapu Laxman, Shiv Ram Dubey, Baddam Kalyan, Satya Raj Vineel Kojjarapu

    Abstract: Recently, Conditional Generative Adversarial Network (Conditional GAN) have shown very promising performance in several image-to-image translation applications. However, the uses of these conditional GANs are quite limited to low-resolution images, such as 256X256.The Pix2Pix-HD is a recent attempt to utilize the conditional GAN for high-resolution image synthesis. In this paper, we propose a Mult… ▽ More

    Submitted 27 May, 2021; originally announced May 2021.

    Comments: 12 pages, 6 figurea

  29. arXiv:2105.10262  [pdf, other

    cs.CV

    Joint Triplet Autoencoder for Histopathological Colon Cancer Nuclei Retrieval

    Authors: Satya Rajendra Singh, Shiv Ram Dubey, Shruthi MS, Sairathan Ventrapragada, Saivamshi Salla Dasharatha

    Abstract: Deep learning has shown a great improvement in the performance of visual tasks. Image retrieval is the task of extracting the visually similar images from a database for a query image. The feature matching is performed to rank the images. Various hand-designed features have been derived in past to represent the images. Nowadays, the power of deep learning is being utilized for automatic feature le… ▽ More

    Submitted 24 May, 2021; v1 submitted 21 May, 2021; originally announced May 2021.

  30. arXiv:2105.10239  [pdf, other

    eess.IV cs.CV cs.LG

    AC-CovidNet: Attention Guided Contrastive CNN for Recognition of Covid-19 in Chest X-Ray Images

    Authors: Anirudh Ambati, Shiv Ram Dubey

    Abstract: Covid-19 global pandemic continues to devastate health care systems across the world. At present, the Covid-19 testing is costly and time-consuming. Chest X-Ray (CXR) testing can be a fast, scalable, and non-invasive method. The existing methods suffer due to the limited CXR samples available from Covid-19. Thus, inspired by the limitations of the open-source work in this field, we propose attenti… ▽ More

    Submitted 22 January, 2022; v1 submitted 21 May, 2021; originally announced May 2021.

    Comments: Accepted in Sixth IAPR International Conference on Computer Vision & Image Processing (CVIP2021)

  31. arXiv:2105.10190  [pdf, other

    cs.LG cs.NE stat.ML

    AngularGrad: A New Optimization Technique for Angular Convergence of Convolutional Neural Networks

    Authors: S. K. Roy, M. E. Paoletti, J. M. Haut, S. R. Dubey, P. Kar, A. Plaza, B. B. Chaudhuri

    Abstract: Convolutional neural networks (CNNs) are trained using stochastic gradient descent (SGD)-based optimizers. Recently, the adaptive moment estimation (Adam) optimizer has become very popular due to its adaptive momentum, which tackles the dying gradient problem of SGD. Nevertheless, existing optimizers are still unable to exploit the optimization curvature information efficiently. This paper propose… ▽ More

    Submitted 9 September, 2023; v1 submitted 21 May, 2021; originally announced May 2021.

  32. arXiv:2102.00160  [pdf, other

    cs.CV

    Deep Model Compression based on the Training History

    Authors: S. H. Shabbeer Basha, Mohammad Farazuddin, Viswanath Pulabaigari, Shiv Ram Dubey, Snehasis Mukherjee

    Abstract: Deep Convolutional Neural Networks (DCNNs) have shown promising performances in several visual recognition problems which motivated the researchers to propose popular architectures such as LeNet, AlexNet, VGGNet, ResNet, and many more. These architectures come at a cost of high computational complexity and parameter storage. To get rid of storage and computational complexity, deep model compressio… ▽ More

    Submitted 12 May, 2022; v1 submitted 30 January, 2021; originally announced February 2021.

  33. arXiv:2012.14456  [pdf, other

    cs.CV cs.AI

    Color Channel Perturbation Attacks for Fooling Convolutional Neural Networks and A Defense Against Such Attacks

    Authors: Jayendra Kantipudi, Shiv Ram Dubey, Soumendu Chakraborty

    Abstract: The Convolutional Neural Networks (CNNs) have emerged as a very powerful data dependent hierarchical feature extraction method. It is widely used in several computer vision problems. The CNNs learn the important visual features from training samples automatically. It is observed that the network overfits the training samples very easily. Several regularization methods have been proposed to avoid t… ▽ More

    Submitted 20 December, 2020; originally announced December 2020.

    Comments: Accepted in IEEE Transactions on Artificial Intelligence

  34. MERANet: Facial Micro-Expression Recognition using 3D Residual Attention Network

    Authors: Viswanatha Reddy Gajjala, Sai Prasanna Teja Reddy, Snehasis Mukherjee, Shiv Ram Dubey

    Abstract: Micro-expression has emerged as a promising modality in affective computing due to its high objectivity in emotion detection. Despite the higher recognition accuracy provided by the deep learning models, there are still significant scope for improvements in micro-expression recognition techniques. The presence of micro-expressions in small-local regions of the face, as well as the limited size of… ▽ More

    Submitted 23 January, 2022; v1 submitted 7 December, 2020; originally announced December 2020.

    Comments: Published in Twelfth Indian Conference on Computer Vision, Graphics and Image Processing (ICVGIP), 2021

  35. A Decade Survey of Content Based Image Retrieval using Deep Learning

    Authors: Shiv Ram Dubey

    Abstract: The content based image retrieval aims to find the similar images from a large scale dataset against a query image. Generally, the similarity between the representative features of the query image and dataset images is used to rank the images for retrieval. In early days, various hand designed feature descriptors have been investigated based on the visual cues such as color, texture, shape, etc. t… ▽ More

    Submitted 20 May, 2021; v1 submitted 22 November, 2020; originally announced December 2020.

    Comments: Published by IEEE Transactions on Circuits and Systems for Video Technology

  36. arXiv:2012.00633  [pdf

    cs.CL cs.AI cs.IR cs.LG

    Meta-Embeddings for Natural Language Inference and Semantic Similarity tasks

    Authors: Shree Charran R, Rahul Kumar Dubey

    Abstract: Word Representations form the core component for almost all advanced Natural Language Processing (NLP) applications such as text mining, question-answering, and text summarization, etc. Over the last two decades, immense research is conducted to come up with one single model to solve all major NLP tasks. The major problem currently is that there are a plethora of choices for different NLP tasks. T… ▽ More

    Submitted 1 December, 2020; originally announced December 2020.

  37. arXiv:2011.13782  [pdf, other

    cs.AI

    Connecting Context-specific Adaptation in Humans to Meta-learning

    Authors: Rachit Dubey, Erin Grant, Michael Luo, Karthik Narasimhan, Thomas Griffiths

    Abstract: Cognitive control, the ability of a system to adapt to the demands of a task, is an integral part of cognition. A widely accepted fact about cognitive control is that it is context-sensitive: Adults and children alike infer information about a task's demands from contextual cues and use these inferences to learn from ambiguous cues. However, the precise way in which people use contextual cues to g… ▽ More

    Submitted 30 November, 2020; v1 submitted 27 November, 2020; originally announced November 2020.

    Comments: 9 pages

    ACM Class: I.2.8

  38. arXiv:2011.06496  [pdf, other

    cs.CV cs.LG

    On the Performance of Convolutional Neural Networks under High and Low Frequency Information

    Authors: Roshan Reddy Yedla, Shiv Ram Dubey

    Abstract: Convolutional neural networks (CNNs) have shown very promising performance in recent years for different problems, including object recognition, face recognition, medical image analysis, etc. However, generally the trained CNN models are tested over the test set which is very similar to the trained set. The generalizability and robustness of the CNN models are very important aspects to make it to… ▽ More

    Submitted 30 October, 2020; originally announced November 2020.

    Comments: Accepted in Fifth IAPR International Conference on Computer Vision and Image Processing (CVIP), 2020

  39. arXiv:2009.02571  [pdf

    cs.SE cs.LG stat.ML

    The Integrity of Machine Learning Algorithms against Software Defect Prediction

    Authors: Param Khakhar and, Rahul Kumar Dubey

    Abstract: The increased computerization in recent years has resulted in the production of a variety of different software, however measures need to be taken to ensure that the produced software isn't defective. Many researchers have worked in this area and have developed different Machine Learning-based approaches that predict whether the software is defective or not. This issue can't be resolved simply by… ▽ More

    Submitted 5 September, 2020; originally announced September 2020.

    Comments: 7 pages, 4 figures

  40. arXiv:2006.02724  [pdf

    cs.LG stat.ML

    Characterizing the Weight Space for Different Learning Models

    Authors: Saurav Musunuru, Jay N. Paranjape, Rahul Kumar Dubey, Vijendran G. Venkoparao

    Abstract: Deep Learning has become one of the primary research areas in develo** intelligent machines. Most of the well-known applications (such as Speech Recognition, Image Processing and NLP) of AI are driven by Deep Learning. Deep Learning algorithms mimic human brain using artificial neural networks and progressively learn to accurately solve a given problem. But there are significant challenges in De… ▽ More

    Submitted 4 June, 2020; originally announced June 2020.

    Comments: 6 pages, 8 figures, CONF CDS 2020

  41. arXiv:2006.01408  [pdf

    cs.LG cs.CV stat.ML

    Exploring the role of Input and Output Layers of a Deep Neural Network in Adversarial Defense

    Authors: Jay N. Paranjape, Rahul Kumar Dubey, Vijendran V Gopalan

    Abstract: Deep neural networks are learning models having achieved state of the art performance in many fields like prediction, computer vision, language processing and so on. However, it has been shown that certain inputs exist which would not trick a human normally, but may mislead the model completely. These inputs are known as adversarial inputs. These inputs pose a high security threat when such models… ▽ More

    Submitted 2 June, 2020; originally announced June 2020.

    Comments: 5 pages, 7 figures, to be presented at CONF-CDS 2020

  42. arXiv:2005.04209  [pdf

    cs.HC cs.AI

    BCI-Controlled Hands-Free Wheelchair Navigation with Obstacle Avoidance

    Authors: Ramy Mounir, Redwan Alqasemi, Rajiv Dubey

    Abstract: Brain-Computer interfaces (BCI) are widely used in reading brain signals and converting them into real-world motion. However, the signals produced from the BCI are noisy and hard to analyze. This paper looks specifically towards combining the BCI's latest technology with ultrasonic sensors to provide a hands-free wheelchair that can efficiently navigate through crowded environments. This combinati… ▽ More

    Submitted 8 May, 2020; originally announced May 2020.

    Comments: Accepted by IROS 2018 workshop on "Haptic-enabled shared control of robotic systems: a compromise between teleoperation and autonomy"

  43. AutoTune: Automatically Tuning Convolutional Neural Networks for Improved Transfer Learning

    Authors: S. H. Shabbeer Basha, Sravan Kumar Vinakota, Viswanath Pulabaigari, Snehasis Mukherjee, Shiv Ram Dubey

    Abstract: Transfer learning enables solving a specific task having limited data by using the pre-trained deep networks trained on large-scale datasets. Typically, while transferring the learned knowledge from source task to the target task, the last few layers are fine-tuned (re-trained) over the target dataset. However, these layers are originally designed for the source task that might not be suitable for… ▽ More

    Submitted 3 December, 2020; v1 submitted 25 April, 2020; originally announced May 2020.

    Comments: This paper is published in Neural Networks journal

  44. arXiv:2002.07082  [pdf, other

    eess.IV cs.CV cs.LG cs.MM

    PCSGAN: Perceptual Cyclic-Synthesized Generative Adversarial Networks for Thermal and NIR to Visible Image Transformation

    Authors: Kancharagunta Kishan Babu, Shiv Ram Dubey

    Abstract: In many real world scenarios, it is difficult to capture the images in the visible light spectrum (VIS) due to bad lighting conditions. However, the images can be captured in such scenarios using Near-Infrared (NIR) and Thermal (THM) cameras. The NIR and THM images contain the limited details. Thus, there is a need to transform the images from THM/NIR to VIS for better understanding. However, it i… ▽ More

    Submitted 6 August, 2020; v1 submitted 13 February, 2020; originally announced February 2020.

    Comments: Published in Neurocomputing Journal, Elsevier

    Journal ref: Neurocomputing, 413:41-50, Nov 2020

  45. arXiv:2001.11951  [pdf, other

    cs.CV cs.LG eess.IV

    AutoFCL: Automatically Tuning Fully Connected Layers for Handling Small Dataset

    Authors: S. H. Shabbeer Basha, Sravan Kumar Vinakota, Shiv Ram Dubey, Viswanath Pulabaigari, Snehasis Mukherjee

    Abstract: Deep Convolutional Neural Networks (CNN) have evolved as popular machine learning models for image classification during the past few years, due to their ability to learn the problem-specific features directly from the input images. The success of deep learning models solicits architecture engineering rather than hand-engineering the features. However, designing state-of-the-art CNN for a given ta… ▽ More

    Submitted 28 January, 2021; v1 submitted 22 January, 2020; originally announced January 2020.

    Comments: This paper is published in Neural Computing & Applications Journal

  46. arXiv:2001.05489  [pdf, other

    cs.CV cs.LG eess.IV

    CDGAN: Cyclic Discriminative Generative Adversarial Networks for Image-to-Image Transformation

    Authors: Kancharagunta Kishan Babu, Shiv Ram Dubey

    Abstract: Generative Adversarial Networks (GANs) have facilitated a new direction to tackle the image-to-image transformation problem. Different GANs use generator and discriminator networks with different losses in the objective function. Still there is a gap to fill in terms of both the quality of the generated images and close to the ground truth images. In this work, we introduce a new Image-to-Image Tr… ▽ More

    Submitted 26 November, 2021; v1 submitted 15 January, 2020; originally announced January 2020.

    Comments: Journal of Visual Communication and Image Representation, 2022

  47. arXiv:1912.10946  [pdf, other

    cs.CV cs.LG eess.IV

    PSNet: Parametric Sigmoid Norm Based CNN for Face Recognition

    Authors: Yash Srivastava, Vaishnav Murali, Shiv Ram Dubey

    Abstract: The Convolutional Neural Networks (CNN) have become very popular recently due to its outstanding performance in various computer vision applications. It is also used over widely studied face recognition problem. However, the existing layers of CNN are unable to cope with the problem of hard examples which generally produce lower class scores. Thus, the existing methods become biased towards the ea… ▽ More

    Submitted 5 December, 2019; originally announced December 2019.

    Comments: Accepted in IEEE CICT 2019 Conference

  48. arXiv:1910.08665  [pdf, other

    cs.LG cs.CV stat.ML

    NASIB: Neural Architecture Search withIn Budget

    Authors: Abhishek Singh, Anubhav Garg, **an Zhou, Shiv Ram Dubey, Debo Dutta

    Abstract: Neural Architecture Search (NAS) represents a class of methods to generate the optimal neural network architecture and typically iterate over candidate architectures till convergence over some particular metric like validation loss. They are constrained by the available computation resources, especially in enterprise environments. In this paper, we propose a new approach for NAS, called NASIB, whi… ▽ More

    Submitted 18 October, 2019; originally announced October 2019.

  49. arXiv:1910.00767  [pdf

    cs.MA cs.AI

    Cognitive Agent Based Simulation Model For Improving Disaster Response Procedures

    Authors: Rohit K. Dubey, Samuel S. Sohn, Christoph Hoelscher, Mubbasir Kapadia

    Abstract: In the event of a disaster, saving human lives is of utmost importance. For develo** proper evacuation procedures and guidance systems, behavioural data on how people respond during panic and stress is crucial. In the absence of real human data on building evacuation, there is a need for a crowd simulator to model egress and decision-making under uncertainty. In this paper, we propose an agent-b… ▽ More

    Submitted 1 October, 2019; originally announced October 2019.

  50. arXiv:1909.11015  [pdf, other

    cs.LG cs.CV cs.NE math.OC

    diffGrad: An Optimization Method for Convolutional Neural Networks

    Authors: Shiv Ram Dubey, Soumendu Chakraborty, Swalpa Kumar Roy, Snehasis Mukherjee, Satish Kumar Singh, Bidyut Baran Chaudhuri

    Abstract: Stochastic Gradient Decent (SGD) is one of the core techniques behind the success of deep neural networks. The gradient provides information on the direction in which a function has the steepest rate of change. The main problem with basic SGD is to change by equal sized steps for all parameters, irrespective of gradient behavior. Hence, an efficient way of deep network optimization is to make adap… ▽ More

    Submitted 26 November, 2021; v1 submitted 12 September, 2019; originally announced September 2019.

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems, 2020