Skip to main content

Showing 1–22 of 22 results for author: Chakraborty, T

Searching in archive stat. Search in all archives.
.
  1. arXiv:2407.04465  [pdf, ps, other

    stat.AP cs.SI physics.data-an

    Learning Patterns from Biological Networks: A Compounded Burr Probability Model

    Authors: Tanujit Chakraborty, Shraddha M. Naik, Swarup Chattopadhyay, Suchismita Das

    Abstract: Complex biological networks, comprising metabolic reactions, gene interactions, and protein interactions, often exhibit scale-free characteristics with power-law degree distributions. However, empirical studies have revealed discrepancies between observed biological network data and ideal power-law fits, highlighting the need for improved modeling approaches. To address this challenge, we propose… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  2. arXiv:2312.05878  [pdf, other

    stat.ML cs.LG

    Skew Probabilistic Neural Networks for Learning from Imbalanced Data

    Authors: Shraddha M. Naik, Tanujit Chakraborty, Abdenour Hadid, Bibhas Chakraborty

    Abstract: Real-world datasets often exhibit imbalanced data distribution, where certain class levels are severely underrepresented. In such cases, traditional pattern classifiers have shown a bias towards the majority class, impeding accurate predictions for the minority class. This paper introduces an imbalanced data-oriented approach using probabilistic neural networks (PNNs) with a skew normal probabilit… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

  3. arXiv:2312.04502  [pdf, other

    physics.ao-ph stat.AP

    Pattern change of precipitation extremes in Bear Island

    Authors: Arnob Ray, Tanujit Chakraborty, Athulya Radhakrishnan, Chittaranjan Hens, Syamal K. Dana, Dibakar Ghosh, Nuncio Murukesh

    Abstract: Extreme precipitation in the Arctic region plays a crucial role in global weather and climate patterns. Bear Island (Bjørnøya) is located in the Norwegian Svalbard archipelago, which is, therefore, selected for our study on extreme precipitation. The island occupies a unique geographic position at the intersection of the high and low Arctic, characterized by a flat and lake-filled northern region… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  4. arXiv:2311.14359  [pdf, other

    stat.ML cs.LG stat.AP

    Thompson sampling for zero-inflated count outcomes with an application to the Drink Less mobile health study

    Authors: Xueqing Liu, Nina Deliu, Tanujit Chakraborty, Lauren Bell, Bibhas Chakraborty

    Abstract: Mobile health (mHealth) technologies aim to improve distal outcomes, such as clinical conditions, by optimizing proximal outcomes through just-in-time adaptive interventions. Contextual bandits provide a suitable framework for customizing such interventions according to individual time-varying contexts, intending to maximize cumulative proximal outcomes. However, unique challenges such as modeling… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

  5. arXiv:2306.05951  [pdf, other

    cs.LG physics.geo-ph stat.AP

    Prediction of Transportation Index for Urban Patterns in Small and Medium-sized Indian Cities using Hybrid RidgeGAN Model

    Authors: Rahisha Thottolil, Uttam Kumar, Tanujit Chakraborty

    Abstract: The rapid urbanization trend in most develo** countries including India is creating a plethora of civic concerns such as loss of green space, degradation of environmental health, clean water availability, air pollution, traffic congestion leading to delays in vehicular transportation, etc. Transportation and network modeling through transportation indices have been widely used to understand tran… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Journal ref: Scientific Reports, 2023, Vol. 13

  6. arXiv:2209.04259  [pdf, other

    cs.LG physics.comp-ph stat.ML

    Knowledge-based Deep Learning for Modeling Chaotic Systems

    Authors: Zakaria Elabid, Tanujit Chakraborty, Abdenour Hadid

    Abstract: Deep Learning has received increased attention due to its unbeatable success in many fields, such as computer vision, natural language processing, recommendation systems, and most recently in simulating multiphysics problems and predicting nonlinear dynamical systems. However, modeling and forecasting the dynamics of chaotic systems remains an open research problem since training deep learning mod… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

    Report number: December 2022, Pages 1203-1209

    Journal ref: 2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)

  7. arXiv:2209.03945  [pdf, other

    cs.LG econ.EM eess.SP stat.ML

    W-Transformers : A Wavelet-based Transformer Framework for Univariate Time Series Forecasting

    Authors: Lena Sasal, Tanujit Chakraborty, Abdenour Hadid

    Abstract: Deep learning utilizing transformers has recently achieved a lot of success in many vital areas such as natural language processing, computer vision, anomaly detection, and recommendation systems, among many others. Among several merits of transformers, the ability to capture long-range temporal dependencies and interactions is desirable for time series forecasting, leading to its progress in vari… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

    Report number: Pages 671-676

    Journal ref: 2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)

  8. arXiv:2206.10696  [pdf, other

    cs.LG nlin.CD q-bio.PE q-bio.QM stat.AP

    Epicasting: An Ensemble Wavelet Neural Network (EWNet) for Forecasting Epidemics

    Authors: Madhurima Panja, Tanujit Chakraborty, Uttam Kumar, Nan Liu

    Abstract: Infectious diseases remain among the top contributors to human illness and death worldwide, among which many diseases produce epidemic waves of infection. The unavailability of specific drugs and ready-to-use vaccines to prevent most of these epidemics makes the situation worse. These force public health officials and policymakers to rely on early warning systems generated by reliable and accurate… ▽ More

    Submitted 14 March, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

    Report number: Volume 165, August 2023, Pages 185-212

    Journal ref: Neural Networks. 2023

  9. Probabilistic AutoRegressive Neural Networks for Accurate Long-range Forecasting

    Authors: Madhurima Panja, Tanujit Chakraborty, Uttam Kumar, Abdenour Hadid

    Abstract: Forecasting time series data is a critical area of research with applications spanning from stock prices to early epidemic prediction. While numerous statistical and machine learning methods have been proposed, real-life prediction problems often require hybrid solutions that bridge classical forecasting approaches and modern neural network models. In this study, we introduce the Probabilistic Aut… ▽ More

    Submitted 27 June, 2023; v1 submitted 1 April, 2022; originally announced April 2022.

    Report number: December 2023, Pages 457--477

    Journal ref: International Conference on Neural Information Processing 2023

  10. arXiv:2010.05079  [pdf, ps, other

    q-bio.PE stat.AP

    Nowcasting of COVID-19 confirmed cases: Foundations, trends, and challenges

    Authors: Tanujit Chakraborty, Indrajit Ghosh, Tirna Mahajan, Tejasvi Arora

    Abstract: The coronavirus disease 2019 (COVID-19) has become a public health emergency of international concern affecting more than 200 countries and territories worldwide. As of September 30, 2020, it has caused a pandemic outbreak with more than 33 million confirmed infections and more than 1 million reported deaths worldwide. Several statistical, machine learning, and hybrid models have previously tried… ▽ More

    Submitted 10 October, 2020; originally announced October 2020.

  11. Real-time forecasts and risk assessment of novel coronavirus (COVID-19) cases: A data-driven analysis

    Authors: Tanujit Chakraborty, Indrajit Ghosh

    Abstract: The coronavirus disease 2019 (COVID-19) has become a public health emergency of international concern affecting 201 countries and territories around the globe. As of April 4, 2020, it has caused a pandemic outbreak with more than 11,16,643 confirmed infections and more than 59,170 reported deaths worldwide. The main focus of this paper is two-fold: (a) generating short term (real-time) forecasts o… ▽ More

    Submitted 9 April, 2020; originally announced April 2020.

    Journal ref: Chaos, Solitons & Fractals Volume 135, June 2020, 109850

  12. arXiv:2004.07859  [pdf

    q-bio.PE q-bio.QM stat.CO stat.OT

    Coronavirus (COVID-19): ARIMA based time-series analysis to forecast near future

    Authors: Hiteshi Tandon, Prabhat Ranjan, Tanmoy Chakraborty, Vandana Suhag

    Abstract: COVID-19, a novel coronavirus, is currently a major worldwide threat. It has infected more than a million people globally leading to hundred-thousands of deaths. In such grave circumstances, it is very important to predict the future infected cases to support prevention of the disease and aid in the healthcare service preparation. Following that notion, we have developed a model and then employed… ▽ More

    Submitted 16 April, 2020; originally announced April 2020.

    Comments: Pages: 11, Tables: 4, Figures: 6; Author Contributions: H.T. and T.C. conceptualized the project. H.T. designed the study, performed the computations and investigations, contributed to data analysis and wrote the manuscript. P.R. provided the resources. T.C. and V.S. supervised the study and reviewed the manuscript

    Journal ref: Coronavirus (COVID-19): ARIMA-based Time-series Analysis to Forecast near Future and the Effect of School Reopening in India, Journal of Health Management, 2022, 24 (3), 373-388

  13. Bayesian Neural Tree Models for Nonparametric Regression

    Authors: Tanujit Chakraborty, Gauri Kamat, Ashis Kumar Chakraborty

    Abstract: Frequentist and Bayesian methods differ in many aspects, but share some basic optimal properties. In real-life classification and regression problems, situations exist in which a model based on one of the methods is preferable based on some subjective criterion. Nonparametric classification and regression techniques, such as decision trees and neural networks, have frequentist (classification and… ▽ More

    Submitted 27 July, 2020; v1 submitted 1 September, 2019; originally announced September 2019.

    Journal ref: Australian and New Zealand Journal of Statistics, 2023

  14. arXiv:1906.00145  [pdf, other

    cs.LG cs.SI stat.ML

    DiffQue: Estimating Relative Difficulty of Questions in Community Question Answering Services

    Authors: Deepak Thukral, Adesh Pandey, Rishabh Gupta, Vikram Goyal, Tanmoy Chakraborty

    Abstract: Automatic estimation of relative difficulty of a pair of questions is an important and challenging problem in community question answering (CQA) services. There are limited studies which addressed this problem. Past studies mostly leveraged expertise of users answering the questions and barely considered other properties of CQA services such as metadata of users and posts, temporal information and… ▽ More

    Submitted 31 May, 2019; originally announced June 2019.

    Comments: 25 pages, 7 figures, ACM Transactions on Intelligent Systems and Technology (TIST) 2019

  15. arXiv:1902.03124  [pdf, other

    cs.SI cs.LG stat.ML

    Heterogeneous Edge Embeddings for Friend Recommendation

    Authors: Janu Verma, Srishti Gupta, Debdoot Mukherjee, Tanmoy Chakraborty

    Abstract: We propose a friend recommendation system (an application of link prediction) using edge embeddings on social networks. Most real-world social networks are multi-graphs, where different kinds of relationships (e.g. chat, friendship) are possible between a pair of users. Existing network embedding techniques do not leverage signals from different edge types and thus perform inadequately on link pre… ▽ More

    Submitted 7 February, 2019; originally announced February 2019.

    Comments: To appear in ECIR, 2019

  16. arXiv:1902.02930  [pdf, other

    cs.LG stat.ML

    Multi-task Learning for Target-dependent Sentiment Classification

    Authors: Divam Gupta, Kushagra Singh, Soumen Chakrabarti, Tanmoy Chakraborty

    Abstract: Detecting and aggregating sentiments toward people, organizations, and events expressed in unstructured social media have become critical text mining operations. Early systems detected sentiments over whole passages, whereas more recently, target-specific sentiments have been of greater interest. In this paper, we present MTTDSC, a multi-task target-dependent sentiment classification system that i… ▽ More

    Submitted 7 February, 2019; originally announced February 2019.

    Comments: PAKDD 2019

  17. arXiv:1811.11456  [pdf, ps, other

    cs.LG cs.CL stat.ML

    GIRNet: Interleaved Multi-Task Recurrent State Sequence Models

    Authors: Divam Gupta, Tanmoy Chakraborty, Soumen Chakrabarti

    Abstract: In several natural language tasks, labeled sequences are available in separate domains (say, languages), but the goal is to label sequences with mixed domain (such as code-switched text). Or, we may have available models for labeling whole passages (say, with sentiments), which we would like to exploit toward better position-specific label inference (say, target-dependent sentiment annotation). A… ▽ More

    Submitted 25 December, 2018; v1 submitted 28 November, 2018; originally announced November 2018.

    Comments: Accepted at AAAI 2019

  18. Superensemble Classifier for Improving Predictions in Imbalanced Datasets

    Authors: Tanujit Chakraborty, Ashis Kumar Chakraborty

    Abstract: Learning from an imbalanced dataset is a tricky proposition. Because these datasets are biased towards one class, most existing classifiers tend not to perform well on minority class examples. Conventional classifiers usually aim to optimize the overall accuracy without considering the relative distribution of each class. This article presents a superensemble classifier, to tackle and improve pred… ▽ More

    Submitted 25 October, 2018; originally announced October 2018.

    Comments: arXiv admin note: text overlap with arXiv:1805.12381

    Journal ref: Communications in Statistics: Case Studies, Data Analysis and Applications Volume 6, 2020 - Issue 2

  19. Imbalanced Ensemble Classifier for learning from imbalanced business school data set

    Authors: Tanujit Chakraborty

    Abstract: Private business schools in India face a common problem of selecting quality students for their MBA programs to achieve the desired placement percentage. Generally, such data sets are biased towards one class, i.e., imbalanced in nature. And learning from the imbalanced dataset is a difficult proposition. This paper proposes an imbalanced ensemble classifier which can handle the imbalanced nature… ▽ More

    Submitted 17 October, 2018; v1 submitted 31 May, 2018; originally announced May 2018.

    Journal ref: International Journal of Mathematical, Engineering and Management Sciences Vol. 4, No. 4, 861-869, 2019

  20. A Nonparametric Ensemble Binary Classifier and its Statistical Properties

    Authors: Tanujit Chakraborty, Ashis Kumar Chakraborty, C. A. Murthy

    Abstract: In this work, we propose an ensemble of classification trees (CT) and artificial neural networks (ANN). Several statistical properties including universal consistency and upper bound of an important parameter of the proposed classifier are shown. Numerical evidence is also provided using various real life data sets to assess the performance of the model. Our proposed nonparametric ensemble classif… ▽ More

    Submitted 18 September, 2018; v1 submitted 29 April, 2018; originally announced April 2018.

    Journal ref: Statistics & Probability Letters Volume 149, June 2019, Pages 16-23

  21. A novel distribution-free hybrid regression model for manufacturing process efficiency improvement

    Authors: Tanujit Chakraborty, Ashis Kumar Chakraborty, Swarup Chattopadhyay

    Abstract: This work is motivated by a particular problem of a modern paper manufacturing industry, in which maximum efficiency of the fiber-filler recovery process is desired. A lot of unwanted materials along with valuable fibers and fillers come out as a by-product of the paper manufacturing process and mostly goes as waste. The job of an efficient Krofta supracell is to separate the unwanted materials fr… ▽ More

    Submitted 29 October, 2018; v1 submitted 23 April, 2018; originally announced April 2018.

    Journal ref: Journal of Computational and Applied Mathematics. 2019 Dec 15;362:130-42

  22. arXiv:1708.08591  [pdf, other

    cs.LG stat.ML

    EC3: Combining Clustering and Classification for Ensemble Learning

    Authors: Tanmoy Chakraborty

    Abstract: Classification and clustering algorithms have been proved to be successful individually in different contexts. Both of them have their own advantages and limitations. For instance, although classification algorithms are more powerful than clustering methods in predicting class labels of objects, they do not perform well when there is a lack of sufficient manually labeled reliable data. On the othe… ▽ More

    Submitted 29 August, 2017; originally announced August 2017.

    Comments: 14 pages, 7 figures, 11 tables