-
qPMS Sigma -- An Efficient and Exact Parallel Algorithm for the Planted $(l, d)$ Motif Search Problem
Authors:
Saurav Dhar,
Amlan Saha,
Dhiman Goswami,
Md. Abul Kashem Mia
Abstract:
Motif finding is an important step for the detection of rare events occurring in a set of DNA or protein sequences. Extraction of information about these rare events can lead to new biological discoveries. Motifs are important patterns with numerous applications, including the identification of transcription factors and their binding sites, composite regulatory patterns, and similarity between families of proteins. Although several flavors of motif searching algorithms have been studied in the literature, we study the version known as $(l, d)$-motif search or Planted Motif Search (PMS). In PMS, given two integers $l$ and $d$ and $n$ input sequences, we try to find all the patterns of length $l$ that appear in each of the $n$ input sequences with at most $d$ mismatches. We also discuss the quorum version of PMS, which finds motifs that are planted not in all the input sequences but in at least $q$ of them. Our algorithm is mainly based on the algorithms qPMSPrune, qPMS7, TraverStringRef and PMS8. We introduce techniques to compress the input strings and to speed up comparisons between strings using bitwise operations. Our algorithm performs a little better than the existing exact algorithms for solving the qPMS problem on DNA sequences. We also propose an idea for a parallel implementation of our algorithm.
Submitted 1 March, 2024;
originally announced March 2024.
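The problem statement in the abstract above can be made concrete with a brute-force sketch. This is not the qPMS Sigma algorithm; it only illustrates the $(l, d)$ quorum definition and one possible way to compare 2-bit-packed DNA strings with bitwise operations. All function names, the 2-bit encoding, and the sample sequences are assumptions made for illustration.

```python
from itertools import product

def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))

def occurs(pattern, seq, d):
    """True if some l-mer of seq is within d mismatches of pattern."""
    l = len(pattern)
    return any(hamming(pattern, seq[i:i + l]) <= d
               for i in range(len(seq) - l + 1))

def qpms_bruteforce(seqs, l, d, q=None, alphabet="ACGT"):
    """Enumerate all |alphabet|^l candidates and keep those that occur
    (with at most d mismatches) in at least q sequences.
    q=None means the plain PMS case, i.e. all sequences."""
    q = len(seqs) if q is None else q
    motifs = []
    for cand in product(alphabet, repeat=l):
        pattern = "".join(cand)
        if sum(occurs(pattern, s, d) for s in seqs) >= q:
            motifs.append(pattern)
    return motifs

# Hypothetical 2-bit encoding for compressing DNA strings into integers.
CODE = {"A": 0, "C": 1, "G": 2, "T": 3}

def pack(s):
    """Pack a DNA string into an integer, 2 bits per character."""
    value = 0
    for ch in s:
        value = (value << 2) | CODE[ch]
    return value

def packed_hamming(x, y, l):
    """Hamming distance of two packed l-mers via XOR and a bit count.
    A position differs if either of its two bits differs."""
    diff = x ^ y
    low_bits = int("01" * l, 2)  # selects the low bit of each 2-bit pair
    return bin((diff | (diff >> 1)) & low_bits).count("1")

seqs = ["ACGTAC", "TCGTTT", "GGCGTA"]
exact_in_all = qpms_bruteforce(seqs, l=3, d=0)       # plain PMS, d = 0
exact_in_two = qpms_bruteforce(seqs, l=3, d=0, q=2)  # quorum version
```

The enumeration costs $4^l$ candidates times a scan of every sequence, which is why exact algorithms for larger $(l, d)$ rely on pruning rather than on this direct approach.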
-
Differential Evolution Algorithm based Hyper-Parameters Selection of Convolutional Neural Network for Speech Command Recognition
Authors:
Sandipan Dhar,
Anuvab Sen,
Aritra Bandyopadhyay,
Nanda Dulal Jana,
Arjun Ghosh,
Zahra Sarayloo
Abstract:
Speech Command Recognition (SCR), which deals with identification of short uttered speech commands, is crucial for various applications, including IoT devices and assistive technology. Despite the promise shown by Convolutional Neural Networks (CNNs) in SCR tasks, their efficacy relies heavily on hyper-parameter selection, which is typically laborious and time-consuming when done manually. This paper introduces a hyper-parameter selection method for CNNs based on the Differential Evolution (DE) algorithm, aiming to enhance performance in SCR tasks. Training and testing with the Google Speech Command (GSC) dataset, the proposed approach showed effectiveness in classifying speech commands. Moreover, a comparative analysis with Genetic Algorithm based selections and other deep CNN (DCNN) models highlighted the efficiency of the proposed DE algorithm in hyper-parameter selection for CNNs in SCR tasks.
Submitted 13 October, 2023;
originally announced October 2023.
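For readers unfamiliar with Differential Evolution, the following is a minimal sketch of the classic DE/rand/1/bin scheme. The abstract does not say which DE variant, search space, or control parameters the authors used, so the function names, the bounds, `F`, `CR`, and the toy objective are all assumptions. In a real hyper-parameter search the objective would train a CNN with the decoded hyper-parameters and return its validation error; a cheap quadratic stands in for that here.

```python
import random

def differential_evolution(objective, bounds, pop_size=20, F=0.8, CR=0.9,
                           generations=200, seed=0):
    """Minimise objective over a box using DE/rand/1/bin."""
    rng = random.Random(seed)
    dim = len(bounds)
    pop = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    scores = [objective(ind) for ind in pop]
    for _ in range(generations):
        for i in range(pop_size):
            # Pick three distinct individuals, all different from i.
            a, b, c = rng.sample([j for j in range(pop_size) if j != i], 3)
            j_rand = rng.randrange(dim)  # guarantees at least one mutated gene
            trial = []
            for j, (lo, hi) in enumerate(bounds):
                if rng.random() < CR or j == j_rand:
                    v = pop[a][j] + F * (pop[b][j] - pop[c][j])  # mutation
                    v = min(max(v, lo), hi)                      # clip to bounds
                else:
                    v = pop[i][j]                                # inherit from parent
                trial.append(v)
            s = objective(trial)
            if s <= scores[i]:  # greedy one-to-one selection
                pop[i], scores[i] = trial, s
    best = min(range(pop_size), key=scores.__getitem__)
    return pop[best], scores[best]

def toy_objective(x):
    """Stand-in for a validation error; minimum 0 at (0.3, -1.0, 2.0)."""
    target = (0.3, -1.0, 2.0)
    return sum((xi - ti) ** 2 for xi, ti in zip(x, target))

bounds = [(-5.0, 5.0)] * 3
best_x, best_score = differential_evolution(toy_objective, bounds)
```

Integer or categorical hyper-parameters (for example the number of filters) would need an extra decoding step, such as rounding, before each evaluation.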
-
Comparative Evaluation of Metaheuristic Algorithms for Hyperparameter Selection in Short-Term Weather Forecasting
Authors:
Anuvab Sen,
Arul Rhik Mazumder,
Dibyarup Dutta,
Udayon Sen,
Pathikrit Syam,
Sandipan Dhar
Abstract:
Weather forecasting plays a vital role in numerous sectors, but accurately capturing the complex dynamics of weather systems remains a challenge for traditional statistical models. Apart from autoregressive forecasting models like ARIMA, deep learning techniques (vanilla ANNs, LSTM and GRU networks) have shown promise in improving forecasting accuracy by capturing temporal dependencies. This paper explores the application of metaheuristic algorithms, namely Genetic Algorithm (GA), Differential Evolution (DE), and Particle Swarm Optimization (PSO), to automate the search for optimal hyperparameters in these model architectures. Metaheuristic algorithms excel in global optimization, offering robustness, versatility, and scalability in handling non-linear problems. We present a comparative analysis of different model architectures integrated with metaheuristic optimization, evaluating their performance in weather forecasting based on metrics such as Mean Squared Error (MSE) and Mean Absolute Percentage Error (MAPE). The results demonstrate the potential of metaheuristic algorithms to enhance weather forecasting accuracy and to help determine the optimal set of hyperparameters for each model. The paper underscores the importance of harnessing advanced optimization techniques to select the most suitable metaheuristic algorithm for the given weather forecasting task.
Submitted 5 September, 2023;
originally announced September 2023.
-
Systematic biases when using deep neural networks for annotating large catalogs of astronomical images
Authors:
Sanchari Dhar,
Lior Shamir
Abstract:
Deep convolutional neural networks (DCNNs) have become the most common solution for automatic image annotation due to their non-parametric nature, good performance, and their accessibility through libraries such as TensorFlow. Among other fields, DCNNs are also a common approach to the annotation of large astronomical image databases acquired by digital sky surveys. One of the main downsides of DCNNs is the complex non-intuitive rules that make DCNNs act as a "black box", providing annotations in a manner that is unclear to the user. Therefore, the user is often not able to know what information is used by the DCNNs for the classification. Here we demonstrate that the training of a DCNN is sensitive to the context of the training data such as the location of the objects in the sky. We show that for basic classification of elliptical and spiral galaxies, the sky location of the galaxies used for training affects the behavior of the algorithm, and leads to a small but consistent and statistically significant bias. That bias exhibits itself in the form of cosmological-scale anisotropy in the distribution of basic galaxy morphology. Therefore, while DCNNs are powerful tools for annotating images of extended sources, the construction of training sets for galaxy morphology should take into consideration more aspects than the visual appearance of the object. In any case, catalogs created with deep neural networks that exhibit signs of cosmological anisotropy should be interpreted with the possibility of consistent bias.
Submitted 9 January, 2022;
originally announced January 2022.
-
Analysis of the Pennsylvania Additive Classification Tool: Biases and Important Features
Authors:
Swarup Dhar,
Vanessa Massaro,
Darakhshan Mir,
Nathan C. Ryan
Abstract:
The Pennsylvania Additive Classification Tool (PACT) is a carceral algorithm used by the Pennsylvania Department of Corrections in order to determine the security level for an incarcerated person in the state's prison system. For a newly incarcerated person it is used in their initial classification. The initial classification can be overridden both for discretionary and administrative reasons. An incarcerated person is reclassified annually using a variant of the PACT and this reclassification can be overridden, too, and for similar reasons. In this paper, for each of these four processes (the two classifications and their corresponding overrides), we develop several logistic models, both binary and multinomial, to replicate these processes with high accuracy. By examining these models, we both identify which features are most important in the model and quantify and describe biases that exist in the PACT, its overrides, and its use in reclassification. Because the details of how the PACT operates have been redacted from public documents, it is important to know how it works and what disparate impact it might have on different incarcerated people.
Submitted 10 December, 2021;
originally announced December 2021.
-
Analyzing a Carceral Algorithm used by the Pennsylvania Department of Corrections
Authors:
Vanessa Massaro,
Swarup Dhar,
Darakhshan Mir,
Nathan C. Ryan
Abstract:
Scholars have focused on algorithms used during sentencing, bail, and parole, but little work explores what we call carceral algorithms that are used during incarceration. This paper is focused on the Pennsylvania Additive Classification Tool (PACT) used to classify prisoners' custody levels while they are incarcerated. Algorithms that are used during incarceration warrant deeper attention by scholars because they have the power to enact the lived reality of the prisoner. The algorithm in this case determines the likelihood a person would endure additional disciplinary actions, can complete required programming, and gain experiences that, among other things, are distilled into variables feeding into the parole algorithm. Given such power, examining algorithms used on people currently incarcerated offers a unique analytic view to think about the dialectic relationship between data and algorithms. Our examination of the PACT is two-fold and complementary. First, our qualitative overview of the historical context surrounding PACT reveals that it is designed to prioritize incapacitation and control over rehabilitation. While it closely informs prisoner rehabilitation plans and parole considerations, it is rooted in population management for prison securitization. Second, on analyzing data for 146,793 incarcerated people in PA, along with associated metadata related to the PACT, we find it is replete with racial bias as well as errors, omissions, and inaccuracies. Our findings to date further caution against data-driven criminal justice reforms that rely on pre-existing data infrastructures and expansive, uncritical, data-collection routines.
Submitted 6 December, 2021;
originally announced December 2021.
-
Uncertainty in Criminal Justice Algorithms: simulation studies of the Pennsylvania Additive Classification Tool
Authors:
Swarup Dhar,
Vanessa Massaro,
Darakhshan Mir,
Nathan C. Ryan
Abstract:
Much attention has been paid to algorithms related to sentencing, the setting of bail, parole decisions and recidivism while less attention has been paid to carceral algorithms, those algorithms used to determine an incarcerated individual's lived experience. In this paper we study one such algorithm, the Pennsylvania Additive Classification Tool (PACT) that assigns custody levels to incarcerated individuals. We analyze the PACT in ways that criminal justice algorithms are often analyzed: namely, we train an accurate machine learning model for the PACT; we study its fairness across sex, age and race; and we determine which features are most important. In addition to these conventional computations, we propose and carry out some new ways to study such algorithms. Instead of focusing on the outcomes themselves, we propose shifting our attention to the variability in the outcomes, especially because many carceral algorithms are used repeatedly and there can be a propagation of uncertainty. By carrying out several simulations of assigning custody levels, we shine light on problematic aspects of tools like the PACT.
Submitted 1 December, 2021;
originally announced December 2021.
-
A Survey on Proactive Customer Care: Enabling Science and Steps to Realize it
Authors:
Viswanath Ganapathy,
Sauptik Dhar,
Olimpiya Saha,
Pelin Kurt Garberson,
Javad Heydari,
Mohak Shah
Abstract:
In recent times, advances in artificial intelligence (AI) and IoT have enabled seamless and viable maintenance of appliances in home and building environments. Several studies have shown that AI has the potential to provide personalized customer support which could predict and avoid errors more reliably than ever before. In this paper, we have analyzed the various building blocks needed to enable a successful AI-driven predictive maintenance use-case. Unlike existing surveys, which mostly provide a deep dive into the recent AI algorithms for Predictive Maintenance (PdM), our survey provides the complete view, starting from business impact to recent technology advancements in algorithms as well as systems research and model deployment. Furthermore, we provide exemplar use-cases on predictive maintenance of appliances using publicly available data sets. Our survey can serve as a template needed to design a successful predictive maintenance use-case. Finally, we touch upon existing public data sources and provide a step-wise breakdown of an AI-driven proactive customer care (PCC) use-case, starting from generic anomaly detection to fault prediction and finally root-cause analysis. We highlight how such a step-wise approach can be advantageous for accurate model building and helpful for gaining insights into predictive maintenance of electromechanical appliances.
Submitted 11 October, 2021;
originally announced October 2021.
-
Universum GANs: Improving GANs through contradictions
Authors:
Sauptik Dhar,
Javad Heydari,
Samarth Tripathi,
Unmesh Kurup,
Mohak Shah
Abstract:
Limited availability of labeled data makes any supervised learning problem challenging. Alternative learning settings like semi-supervised and universum learning alleviate the dependency on labeled data, but still require a large amount of unlabeled data, which may be unavailable or expensive to acquire. GAN-based data generation methods have recently shown promise by generating synthetic samples to improve learning. However, most existing GAN-based approaches either provide poor discriminator performance under limited labeled data settings or result in low-quality generated data. In this paper, we propose a Universum GAN game which provides improved discriminator accuracy under limited data settings, while generating high-quality realistic data. We further propose an evolving discriminator loss which improves its convergence and generalization performance. We derive the theoretical guarantees and provide empirical results in support of our approach.
Submitted 20 September, 2022; v1 submitted 18 June, 2021;
originally announced June 2021.
-
DOC3-Deep One Class Classification using Contradictions
Authors:
Sauptik Dhar,
Bernardo Gonzalez Torres
Abstract:
This paper introduces the notion of learning from contradictions (a.k.a. Universum learning) for deep one class classification problems. We formalize this notion for the widely adopted one class large-margin loss, and propose the Deep One Class Classification using Contradictions (DOC3) algorithm. We show that learning from contradictions incurs lower generalization error by comparing the Empirical Rademacher Complexity (ERC) of DOC3 against its traditional inductive learning counterpart. Our empirical results demonstrate the efficacy of DOC3 compared to popular baseline algorithms on several real-life data sets.
Submitted 23 May, 2022; v1 submitted 17 May, 2021;
originally announced May 2021.
-
An Adaptive Learning based Generative Adversarial Network for One-To-One Voice Conversion
Authors:
Sandipan Dhar,
Nanda Dulal Jana,
Swagatam Das
Abstract:
Voice Conversion (VC) has emerged as a significant domain of research in the field of speech synthesis in recent years due to its emerging applications in voice-assisting technology, automated movie dubbing, and speech-to-singing conversion, to name a few. VC deals with the conversion of the vocal style of one speaker to that of another speaker while keeping the linguistic contents unchanged. The VC task is performed through a three-stage pipeline consisting of speech analysis, speech feature mapping, and speech reconstruction. Nowadays, Generative Adversarial Network (GAN) models are widely used for speech feature mapping from source to target speaker. In this paper, we propose an adaptive learning-based GAN model called ALGAN-VC for efficient one-to-one VC of speakers. Our ALGAN-VC framework combines several approaches to improve the speech quality and voice similarity between source and target speakers. The model incorporates a Dense Residual Network (DRN)-like architecture into the generator network for efficient speech feature learning for source-to-target speech feature conversion. We also integrate an adaptive learning mechanism to compute the loss function for the proposed model. Moreover, we use a boosted learning rate approach to enhance the learning capability of the proposed model. The model is trained by using both forward and inverse mapping simultaneously for one-to-one VC. The proposed model is tested on the Voice Conversion Challenge (VCC) 2016, 2018, and 2020 datasets as well as on our self-prepared speech dataset, which has been recorded in Indian regional languages and in English. Subjective and objective evaluations of the generated speech samples indicated that the proposed model performed the voice conversion task well, achieving high speaker similarity and adequate speech quality.
Submitted 25 April, 2021;
originally announced April 2021.
-
Stabilizing Bi-Level Hyperparameter Optimization using Moreau-Yosida Regularization
Authors:
Sauptik Dhar,
Unmesh Kurup,
Mohak Shah
Abstract:
This research proposes to use the Moreau-Yosida envelope to stabilize the convergence behavior of bi-level hyperparameter optimization solvers, and introduces a new algorithm called Moreau-Yosida regularized Hyperparameter Optimization (MY-HPO). Theoretical analysis of the correctness of the MY-HPO solution and an initial convergence analysis are also provided. Our empirical results show significant improvement in loss values for a fixed computation budget, compared to the state-of-the-art bi-level HPO solvers.
Submitted 27 July, 2020;
originally announced July 2020.
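As background for the entry above, the Moreau-Yosida envelope of a function $f$ with parameter $\lambda > 0$ is $e_\lambda f(x) = \min_y \{ f(y) + \frac{1}{2\lambda}(x - y)^2 \}$, a smooth lower approximation of $f$. The sketch below is not the MY-HPO algorithm; it only checks the standard textbook fact that the envelope of $f(y) = |y|$ is the Huber function, with the minimiser given by soft-thresholding. The function names and the grid-based check are assumptions made for illustration.

```python
def soft_threshold(x, lam):
    """Proximal operator of |.|: the y that attains the envelope minimum."""
    if x > lam:
        return x - lam
    if x < -lam:
        return x + lam
    return 0.0

def moreau_envelope_abs(x, lam):
    """Moreau-Yosida envelope of f(y) = |y|, evaluated at the minimiser."""
    y = soft_threshold(x, lam)
    return abs(y) + (x - y) ** 2 / (2.0 * lam)

def huber(x, lam):
    """Huber function: quadratic near zero, linear in the tails."""
    return x * x / (2.0 * lam) if abs(x) <= lam else abs(x) - lam / 2.0

def envelope_by_grid(x, lam, lo=-5.0, hi=5.0, steps=20001):
    """Brute-force the minimisation over a uniform grid of y values."""
    best = float("inf")
    for k in range(steps):
        y = lo + (hi - lo) * k / (steps - 1)
        best = min(best, abs(y) + (x - y) ** 2 / (2.0 * lam))
    return best

lam = 0.5
points = [-2.0, -0.2, 0.0, 0.3, 2.0]
closed_form = [moreau_envelope_abs(x, lam) for x in points]
```

The envelope is differentiable even where $|y|$ is not, which is the kind of smoothing that makes it attractive for stabilising an optimisation loop.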
-
Lessons Learned Developing and Extending a Visual Analytics Solution for Investigative Analysis of Scamming Activities
Authors:
Ronak Tanna,
Shivam Dhar,
Ashwin Sudhir,
Shreyash Devan,
Shubham Verma
Abstract:
Cybersecurity analysts work on large communication data sets to perform investigative analysis by painstakingly going over thousands of email conversations to find potential scamming activities and the network of cyber scammers. Traditionally, experts used email clients, database systems and text editors to perform this investigation. With the advent of technology, elaborate tools that summarize data more efficiently by using cutting-edge data visualization techniques have come out. Beagle [1] is one such tool, which visualizes large communication data using different panels so that the inspector has a better chance of finding the scam network. This paper is a report on our work to implement and improve the work done by Jay Koven et al. [1]. We have proposed, and demonstrated via implementation, a few more visualizations that we feel would help in grouping and analyzing the e-mail data more efficiently. Lastly, we have also presented a case study that shows the potential use of our tool in a real-world scenario.
Submitted 7 February, 2020;
originally announced February 2020.
-
On-Device Machine Learning: An Algorithms and Learning Theory Perspective
Authors:
Sauptik Dhar,
Junyao Guo,
Jiayi Liu,
Samarth Tripathi,
Unmesh Kurup,
Mohak Shah
Abstract:
The predominant paradigm for using machine learning models on a device is to train a model in the cloud and perform inference using the trained model on the device. However, with increasing number of smart devices and improved hardware, there is interest in performing model training on the device. Given this surge in interest, a comprehensive survey of the field from a device-agnostic perspective sets the stage for both understanding the state-of-the-art and for identifying open challenges and future avenues of research. However, on-device learning is an expansive field with connections to a large number of related topics in AI and machine learning (including online learning, model adaptation, one/few-shot learning, etc.). Hence, covering such a large number of topics in a single survey is impractical. This survey finds a middle ground by reformulating the problem of on-device learning as resource constrained learning where the resources are compute and memory. This reformulation allows tools, techniques, and algorithms from a wide variety of research areas to be compared equitably. In addition to summarizing the state-of-the-art, the survey also identifies a number of challenges and next steps for both the algorithmic and theoretical aspects of on-device learning.
Submitted 24 July, 2020; v1 submitted 1 November, 2019;
originally announced November 2019.
-
Variable Metric Proximal Gradient Method with Diagonal Barzilai-Borwein Stepsize
Authors:
Youngsuk Park,
Sauptik Dhar,
Stephen Boyd,
Mohak Shah
Abstract:
Variable metric proximal gradient (VM-PG) is a widely used class of convex optimization methods. Lately, there has been a lot of research on the theoretical guarantees of VM-PG with different metric selections. However, most such metric selections are dependent on (an expensive) Hessian, or limited to scalar stepsizes like the Barzilai-Borwein (BB) stepsize with lots of safeguarding. Instead, in this paper we propose an adaptive metric selection strategy called the diagonal Barzilai-Borwein (BB) stepsize. The proposed diagonal selection better captures the local geometry of the problem while keeping per-step computation cost similar to the scalar BB stepsize, i.e., $O(n)$. Under this metric selection for VM-PG, the theoretical convergence is analyzed. Our empirical studies illustrate the improved convergence results under the proposed diagonal BB stepsize, specifically for ill-conditioned machine learning problems for both synthetic and real-world datasets.
Submitted 15 October, 2019;
originally announced October 2019.
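For context, the scalar Barzilai-Borwein stepsize that the abstract above uses as its baseline can be sketched as follows. This is the standard scalar rule $\alpha_k = s^\top s / s^\top y$ with $s = x_k - x_{k-1}$ and $y = \nabla f(x_k) - \nabla f(x_{k-1})$, applied to plain gradient descent without a proximal term; it is not the diagonal rule proposed in the paper, which the abstract does not spell out. The test problem, the initial step `alpha0`, and all names are assumptions.

```python
import math

def bb_gradient_descent(grad, x0, alpha0=1e-2, max_iter=500, tol=1e-10):
    """Gradient descent with the scalar BB stepsize; O(n) work per step
    beyond the gradient evaluation itself."""
    x = list(x0)
    g = grad(x)
    alpha = alpha0
    for k in range(max_iter):
        if math.sqrt(sum(gi * gi for gi in g)) < tol:
            return x, k
        x_new = [xi - alpha * gi for xi, gi in zip(x, g)]
        g_new = grad(x_new)
        s = [a - b for a, b in zip(x_new, x)]   # change in iterate
        y = [a - b for a, b in zip(g_new, g)]   # change in gradient
        sy = sum(si * yi for si, yi in zip(s, y))
        if sy > 0.0:                            # keep the old step otherwise
            alpha = sum(si * si for si in s) / sy
        x, g = x_new, g_new
    return x, max_iter

# Hypothetical ill-conditioned quadratic f(x) = 0.5 * sum(a_i * x_i^2).
curvatures = [1.0, 10.0, 100.0]

def quad_grad(x):
    return [a * xi for a, xi in zip(curvatures, x)]

x_final, iterations = bb_gradient_descent(quad_grad, [1.0, 1.0, 1.0])
```

A diagonal variant would replace the single scalar `alpha` with one value per coordinate, which is how it can adapt to differing curvature along each axis at the same $O(n)$ cost.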
-
Single Class Universum-SVM
Authors:
Sauptik Dhar,
Vladimir Cherkassky
Abstract:
This paper extends the idea of Universum learning [1, 2] to single-class learning problems. We propose a Single Class Universum-SVM setting that incorporates a priori knowledge (in the form of additional data samples) into the single class estimation problem. These additional data samples, or Universum, belong to the same application domain as the (positive) data samples from a single class (of interest), but they follow a different distribution. The proposed methodology for single class U-SVM is based on the known connection between binary classification and single class learning formulations [3]. Several empirical comparisons are presented to illustrate the utility of the proposed approach.
Submitted 21 September, 2019;
originally announced September 2019.
-
Improving Model Training by Periodic Sampling over Weight Distributions
Authors:
Samarth Tripathi,
Jiayi Liu,
Unmesh Kurup,
Mohak Shah,
Sauptik Dhar
Abstract:
In this paper, we explore techniques centered around periodic sampling of model weights that provide convergence improvements on gradient update methods (vanilla SGD, Momentum, Adam) for a variety of vision problems (classification, detection, segmentation). Importantly, our algorithms provide better, faster and more robust convergence and training performance with only a slight increase in computation time. Our techniques are independent of the neural network model, gradient optimization methods or existing optimal training policies and converge in a less volatile fashion with performance improvements that are approximately monotonic. We conduct a variety of experiments to quantify these improvements and identify scenarios where these techniques could be more useful.
Submitted 19 March, 2020; v1 submitted 14 May, 2019;
originally announced May 2019.
-
Multiclass Universum SVM
Authors:
Sauptik Dhar,
Vladimir Cherkassky,
Mohak Shah
Abstract:
We introduce Universum learning for multiclass problems and propose a novel formulation for multiclass Universum SVM (MU-SVM). We also propose an analytic span bound for model selection with almost 2-4x faster computation times than standard resampling techniques. We empirically demonstrate the efficacy of the proposed MU-SVM formulation on several real-world datasets, achieving > 20% improvement in test accuracies compared to multi-class SVM.
Submitted 23 August, 2018;
originally announced August 2018.
-
A Novel Approach for Fast and Accurate Mean Error Distance Computation in Approximate Adders
Authors:
Avishek Sinha Roy,
Anindya Sundar Dhar
Abstract:
In error-tolerant applications, approximate adders have been exploited extensively to achieve energy-efficient system designs. Mean error distance is one of the important error metrics used as a performance measure of approximate adders. In this work, a fast and efficient methodology is proposed to determine the exact mean error distance in approximate lower significant bit adders. A detailed description of the proposed algorithm, along with an example, is given in this paper. Experimental analysis shows that the proposed method performs better than the existing Monte Carlo simulation approach in terms of both accuracy and execution time.
Submitted 6 March, 2018;
originally announced March 2018.
-
Algorithm/Architecture Co-design of Proportionate-type LMS Adaptive Filters for Sparse System Identification
Authors:
Subrahmanyam Mula,
Vinay Chakravarthi Gogineni,
Anindya Sundar Dhar
Abstract:
This paper investigates the problem of implementing the proportionate-type LMS family of algorithms in hardware for sparse adaptive filtering applications, especially network echo cancellation. We derive a re-formulated proportionate-type algorithm through an algorithm-architecture co-design methodology that can be pipelined and has an efficient architecture for hardware implementation. We study the convergence, steady-state and tracking performances of these re-formulated algorithms for white, colored and speech inputs before implementing them in hardware. To the best of our knowledge, this is the first attempt to implement proportionate-type algorithms in hardware. We show that the Delayed $μ$-law Proportionate LMS (DMPLMS) algorithm for white input and the Delayed Wavelet MPLMS (DWMPLMS) algorithm for colored input are robust VLSI solutions for network echo cancellation where the sparsity of the echo paths can vary with time. We implemented all the designs in hardware using a $16$-bit fixed-point representation and synthesized them. The synthesis results show that the DMPLMS algorithm, with an $\approx25\%$ increase in hardware over the conventional DLMS architecture, achieves a $3X$ improvement in convergence rate for white input, and that the DWMPLMS algorithm, with an $\approx58\%$ increase in hardware, achieves a $15X$ improvement in convergence rate for correlated input conditions.
Submitted 17 March, 2017;
originally announced March 2017.
-
Universum Learning for Multiclass SVM
Authors:
Sauptik Dhar,
Naveen Ramakrishnan,
Vladimir Cherkassky,
Mohak Shah
Abstract:
We introduce Universum learning for multiclass problems and propose a novel formulation for multiclass universum SVM (MU-SVM). We also propose a span bound for MU-SVM that can be used for model selection thereby avoiding resampling. Empirical results demonstrate the effectiveness of MU-SVM and the proposed bound.
Submitted 28 September, 2016;
originally announced September 2016.
-
On Data-Independent Properties for Density-Based Dissimilarity Measures in Hybrid Clustering
Authors:
Kajsa Møllersen,
Subhra S. Dhar,
Fred Godtliebsen
Abstract:
Hybrid clustering combines partitional and hierarchical clustering for computational effectiveness and versatility in cluster shape. In such clustering, a dissimilarity measure plays a crucial role in the hierarchical merging. The dissimilarity measure has great impact on the final clustering, and data-independent properties are needed to choose the right dissimilarity measure for the problem at hand. Properties for distance-based dissimilarity measures have been studied for decades, but properties for density-based dissimilarity measures have so far received little attention. Here, we propose six data-independent properties to evaluate density-based dissimilarity measures associated with hybrid clustering, regarding equality, orthogonality, symmetry, outlier and noise observations, and light-tailed models for heavy-tailed clusters. The significance of the properties is investigated, and we study some well-known dissimilarity measures based on Shannon entropy, misclassification rate, Bhattacharyya distance and Kullback-Leibler divergence with respect to the proposed properties. As none of them satisfy all the proposed properties, we introduce a new dissimilarity measure based on the Kullback-Leibler information and show that it satisfies all proposed properties. The effect of the proposed properties is also illustrated on several real and simulated data sets.
Submitted 21 September, 2016;
originally announced September 2016.
-
Universum Learning for SVM Regression
Authors:
Sauptik Dhar,
Vladimir Cherkassky
Abstract:
This paper extends the idea of Universum learning [18, 19] to regression problems. We propose a new Universum-SVM formulation for regression problems that incorporates a priori knowledge in the form of additional data samples. These additional data samples, or Universum, belong to the same application domain as the training samples, but they follow a different distribution. Several empirical comparisons are presented to illustrate the utility of the proposed approach.
Submitted 26 May, 2016;
originally announced May 2016.
-
Accelerating SystemVerilog UVM Based VIP to Improve Methodology for Verification of Image Signal Processing Designs Using HW Emulator
Authors:
Abhishek Jain,
Piyush Kumar Gupta,
Dr. Hima Gupta,
Sachish Dhar
Abstract:
In this paper we present the development of Acceleratable UVCs from standard UVCs in SystemVerilog and their usage in the UVM-based Verification Environment of Image Signal Processing designs to increase run time performance. This paper covers the development of Acceleratable UVCs from standard UVCs for the internal control and data buses of the ST imaging group by partitioning transaction-level components and cycle-accurate signal-level components between the software simulator and the hardware accelerator, respectively. A Standard Co-Emulation API: Modeling Interface (SCE-MI) compliant, transaction-level communications link is established between test benches running on a host system and the emulation machine. Accelerated Verification IPs are used in the UVM-based Verification Environment of Image Signal Processing designs with both the simulator and the emulator, since UVM acceleration is an extension of the standard simulation-only UVM and is fully backward compatible with it. Acceleratable UVCs significantly reduce development schedule risks while leveraging the transaction models used during simulation.
In this paper, we discuss, step by step, our experiences with adopting the UVM-based methodology on TestBench-Xpress (TBX) based technology. We also compare the run time performance results of the earlier simulator-only environment with those of the new, hardware-accelerated environment. Although this paper focuses on the development of Acceleratable UVCs and their usage for image signal processing designs, the same concept can be extended to non-image signal processing designs.
KEYWORDS- SystemVerilog, Universal Verification Methodology (UVM), TestBench-Xpress (TBX), Universal Verification Component (UVC), Standard Co-Emulation API: Modeling Interface (SCE-MI), Acceleratable UVC, Emulator, XRTL Tasks/Functions (xtf), Transactor interface (tif), Verification IP (VIP).
Submitted 15 January, 2014;
originally announced January 2014.
-
A Mobile Message Scheduling and Delivery System using m-Learning framework
Authors:
Moumita Majumder,
Sumit Dhar
Abstract:
Wireless data communications in the form of Short Message Service (SMS) and Wireless Application Protocol (WAP) browsers have gained global popularity, yet not much has been done to extend the usage of these devices in electronic learning (e-learning) and information sharing. This project explores the extension of e-learning into wireless/handheld (W/H) computing devices with the help of a mobile learning (m-learning) framework. This framework provides the requirements for developing an m-learning application that can be used to share academic and administrative information among people within the university campus. A prototype application has been developed to demonstrate the important functionality of the proposed system in a simulated environment. The system is designed to work in both bulk SMS and interactive SMS delivery modes. Here we have combined both SMS and WAP browsers. SMS is used for short, timely information delivery, and WAP is used for detailed information delivery such as course content, training material, interactive evaluation tests, etc. The push model is used for sending personalized multicast messages to a group of mobile users with a common profile, thereby improving the effectiveness and usefulness of the content delivered. A pull mechanism can also be applied to send information as SMS when requested by the end user in interactive SMS delivery mode. The main strength of the system is that the actual SMS delivery application can be hosted on a mobile device, which can operate even when the device is on the move.
Submitted 29 March, 2010;
originally announced March 2010.
-
Automated Vehicle Location (AVL) Using Global Positioning System (GPS)
Authors:
Victor Dutta,
R. Bera,
Sourav Dhar,
Jaydeep Chakravorty,
Nishant Bagehel
Abstract:
This is a review paper. It describes how DGPS is helpful for lane detection and collision avoidance.
Submitted 18 March, 2009;
originally announced March 2009.
-
Smart Antenna Based Broadband communication in Intelligent Transportation system
Authors:
Sourav Dhar,
Debdatta Kandar,
Tanushree Bose,
Rabindranath Bera
Abstract:
This paper presents a review of the development of Intelligent Transportation Systems (ITS) worldwide and the use of smart antennas in ITS. This review also discusses the usual problems in ITS and proposes solutions to such problems using smart antennas.
Submitted 18 March, 2009;
originally announced March 2009.
-
MIMO Based Multimedia Communication System
Authors:
D. Kandar,
Sourav Dhar,
Rabindranath Bera,
C. K. Sarkar
Abstract:
A high data rate is required for multimedia communication, but communication at a high data rate is always challenging. In this work we have successfully performed data chatting, voice chatting and high-quality video transmission between two distant units using a MIMO adapter, a direct sequence spread spectrum system and the MATLAB/SIMULINK platform.
Submitted 9 March, 2009;
originally announced March 2009.
-
Digital Radar for Collision Avoidance and Automatic Cruise Control in Transportation
Authors:
Rabindranath Bera,
Sourav Dhar,
Debdatta Kandar
Abstract:
A proper remote sensing device is required for automatic cruise control (ACC) to avoid collisions in transportation systems. In this paper we propose a direct sequence spread spectrum (DSSS) radar for remote sensing in intelligent transportation systems (ITS). We have successfully detected a single target, and through 1D radar imaging we are able to separate multiple targets. We have also implemented the DSSS radar using software defined radio (SDR) and successfully detected a single target.
Submitted 9 March, 2009;
originally announced March 2009.
-
Wi-Fi, WiMax and WCDMA A comparative study based on Channel Impairments and Equalization method used
Authors:
Rabindranath Bera,
Sanjib Sil,
Sourav Dhar,
Subir K. Sarkar
Abstract:
In this paper we describe the channel impairments and equalization methods currently used in WiFi, WiMax and WCDMA. After reviewing the channel model for Intelligent Transportation Systems (ITS), we propose an equalization method that will be useful for estimating a strong multipath channel at high velocity.
Submitted 9 March, 2009;
originally announced March 2009.
-
An Experimental Investigation of Secure Communication With Chaos Masking
Authors:
Sourav Dhar,
Kabir Chakraborty
Abstract:
The most exciting recent development in nonlinear dynamics is the realization that chaos can be useful. One application involves "Secure Communication". Two piecewise linear systems with switching nonlinearities have been taken as chaos generators. In the present work, the phenomenon of secure communication with chaos masking has been investigated experimentally. In this investigation, chaos generated from two chaos generators is masked with the message signal to be transmitted, thus making communication more secure.
Submitted 20 August, 2007;
originally announced August 2007.
-
RADAR Imaging in the Open field At 300 MHz-3000 MHz Radio Band
Authors:
Rabindranath Bera,
Jitendranath Bera,
Sanjib Sil,
Sourav Dhar,
Debdatta Kandar,
Dipak Mondal
Abstract:
With the technological growth of broadband wireless technologies like CDMA and UWB, a lot of development effort toward wireless communication systems and imaging radar systems is well justified. Efforts are also being made toward a convergence technology: the convergence of communication and radar technology, which will result in ITS (Intelligent Transport System) and other applications. This encourages the present authors in this development. They are trying to utilize, or converge, communication technologies toward radar and to achieve interference-free and clutter-free quality remote images of targets using DS-UWB wireless technology.
Submitted 15 May, 2007;
originally announced May 2007.
-
CDMA Technology for Intelligent Transportation Systems
Authors:
Rabindranath Bera,
Jitendranath Bera,
Sanjib Sil,
Dipak Mondal,
Sourav Dhar,
Debdatta Kandar
Abstract:
Scientists and technologists involved in the development of radar and remote sensing systems all over the world are now trying to contribute to the saving of manpower by developing a new application of their ideas in Intelligent Transport Systems (ITS). World statistics show that incorporating such a wireless radar system in cars would decrease road accidents worldwide by 8-10% yearly. The wireless technology has to be chosen properly so that it is capable of tackling the severe interference present on the open road. A combined digital technology such as spread spectrum along with diversity reception will help a lot in this regard. Accordingly, the choice is an FHSS-based space diversity system that uses a carrier frequency around the 5.8 GHz ISM band, with an available bandwidth of 80 MHz and no license requirement. For efficient design, the radio channel on which the design is based is characterized. Of the two available modes, communication and radar, the radar mode provides the conditional measurement of the range of the nearest car after authentication of the received code, thus ensuring the reliability and accuracy of the measurement. To make the system operational in simultaneous mode, we have started on a Software Defined Radio approach for best speed and flexibility.
Submitted 15 May, 2007;
originally announced May 2007.
-
Wireless Networking to Support Data and Voice Communication Using Spread Spectrum Technology in The Physical Layer
Authors:
Sourav Dhar,
Rabindranath Bera
Abstract:
Wireless networking is growing rapidly and has become an inexpensive technology that allows multiple users to simultaneously access the network and the internet while roaming about the campus. In the present work, the software development of a wireless LAN (WLAN) is highlighted. This WLAN utilizes direct sequence spread spectrum (DSSS) technology at a 902 MHz RF carrier frequency in its physical layer. Cost-effective installation and the anti-jamming property of spread spectrum technology are the major advantages of this work.
Submitted 11 May, 2007;
originally announced May 2007.
-
Wireless Lan to Support Multimedia Communication Using Spread Spectrum Technology
Authors:
Sourav Dhar,
Rabindranath Bera,
K. Mal
Abstract:
Wireless LAN is currently enjoying rapid deployment in university departments, business offices, hospitals and homes. It has become an inexpensive technology and allows multiple members of a household to simultaneously access the internet while roaming about the house. In the present work, the design and development of a wireless LAN is highlighted which utilizes direct sequence spread spectrum (DSSS) technology at a 900 MHz RF carrier frequency in its physical layer. This provides enormous security in the physical layer, and hence it is very difficult to hack or jam the network. The installation cost is also lower due to the use of the 900 MHz RF carrier frequency.
Submitted 22 March, 2007;
originally announced March 2007.