Skip to main content

Showing 1–17 of 17 results for author: Kalagnanam, J

.
  1. arXiv:2401.03955  [pdf, other

    cs.LG cs.AI

    Tiny Time Mixers (TTMs): Fast Pre-trained Models for Enhanced Zero/Few-Shot Forecasting of Multivariate Time Series

    Authors: Vijay Ekambaram, Arindam Jati, Pankaj Dayama, Sumanta Mukherjee, Nam H. Nguyen, Wesley M. Gifford, Chandra Reddy, Jayant Kalagnanam

    Abstract: Large pre-trained models excel in zero/few-shot learning for language and vision tasks but face challenges in multivariate time series (TS) forecasting due to diverse data characteristics. Consequently, recent research efforts have focused on develo** pre-trained TS forecasting models. These models, whether built from scratch or adapted from large language models (LLMs), excel in zero/few-shot f… ▽ More

    Submitted 5 June, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

  2. arXiv:2310.20280  [pdf, other

    cs.LG cs.AI

    AutoMixer for Improved Multivariate Time-Series Forecasting on Business and IT Observability Data

    Authors: Santosh Palaskar, Vijay Ekambaram, Arindam Jati, Neelamadhav Gantayat, Avirup Saha, Seema Nagar, Nam H. Nguyen, Pankaj Dayama, Renuka Sindhgatta, Prateeti Mohapatra, Harshit Kumar, Jayant Kalagnanam, Nandyala Hemachandra, Narayan Rangaraj

    Abstract: The efficiency of business processes relies on business key performance indicators (Biz-KPIs), that can be negatively impacted by IT failures. Business and IT Observability (BizITObs) data fuses both Biz-KPIs and IT event channels together as multivariate time series data. Forecasting Biz-KPIs in advance can enhance efficiency and revenue through proactive corrective measures. However, BizITObs da… ▽ More

    Submitted 2 November, 2023; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: Accepted in the Thirty-Sixth Annual Conference on Innovative Applications of Artificial Intelligence (IAAI-24)

  3. TSMixer: Lightweight MLP-Mixer Model for Multivariate Time Series Forecasting

    Authors: Vijay Ekambaram, Arindam Jati, Nam Nguyen, Phanwadee Sinthong, Jayant Kalagnanam

    Abstract: Transformers have gained popularity in time series forecasting for their ability to capture long-sequence interactions. However, their high memory and computing requirements pose a critical bottleneck for long-term forecasting. To address this, we propose TSMixer, a lightweight neural architecture exclusively composed of multi-layer perceptron (MLP) modules for multivariate forecasting and represe… ▽ More

    Submitted 11 December, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: Accepted in the Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 23), Research Track. Delayed release in arXiv to comply with the conference policies on the double-blind review process. This paper has been submitted to the KDD peer-review process on Feb 02, 2023

    ACM Class: I.2

  4. arXiv:2306.00778  [pdf, other

    cs.LG stat.ML

    An End-to-End Time Series Model for Simultaneous Imputation and Forecast

    Authors: Trang H. Tran, Lam M. Nguyen, Kyongmin Yeo, Nam Nguyen, Dzung Phan, Roman Vaculin, Jayant Kalagnanam

    Abstract: Time series forecasting using historical data has been an interesting and challenging topic, especially when the data is corrupted by missing values. In many industrial problem, it is important to learn the inference function between the auxiliary observations and target variables as it provides additional knowledge when the data is not fully observed. We develop an end-to-end time series model th… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

  5. arXiv:2211.14730  [pdf, other

    cs.LG cs.AI

    A Time Series is Worth 64 Words: Long-term Forecasting with Transformers

    Authors: Yuqi Nie, Nam H. Nguyen, Phanwadee Sinthong, Jayant Kalagnanam

    Abstract: We propose an efficient design of Transformer-based models for multivariate time series forecasting and self-supervised representation learning. It is based on two key components: (i) segmentation of time series into subseries-level patches which are served as input tokens to Transformer; (ii) channel-independence where each channel contains a single univariate time series that shares the same emb… ▽ More

    Submitted 5 March, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: Accepted by ICLR 2023

  6. arXiv:2112.05653  [pdf, other

    cs.LG math.OC

    Interpretable Clustering via Multi-Polytope Machines

    Authors: Connor Lawless, Jayant Kalagnanam, Lam M. Nguyen, Dzung Phan, Chandra Reddy

    Abstract: Clustering is a popular unsupervised learning tool often used to discover groups within a larger population such as customer segments, or patient subtypes. However, despite its use as a tool for subgroup discovery and description - few state-of-the-art algorithms provide any rationale or description behind the clusters found. We propose a novel approach for interpretable clustering that both clust… ▽ More

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: Accepted to the 36th AAAI Conference on Artificial Intelligence (AAAI 2022)

  7. arXiv:2112.02215  [pdf, other

    cs.LG cs.AI math.OC

    Deep Policy Iteration with Integer Programming for Inventory Management

    Authors: Pavithra Harsha, Ashish Jagmohan, Jayant R. Kalagnanam, Brian Quanz, Divya Singhvi

    Abstract: We present a Reinforcement Learning (RL) based framework for optimizing long-term discounted reward problems with large combinatorial action space and state dependent constraints. These characteristics are common to many operations management problems, e.g., network inventory replenishment, where managers have to deal with uncertain demand, lost sales, and capacity constraints that results in more… ▽ More

    Submitted 14 October, 2022; v1 submitted 3 December, 2021; originally announced December 2021.

    Comments: Prior shorter version accepted to NeurIPS 2021 Deep RL Workshop. Authors are listed in alphabetical order

    ACM Class: I.2.6; I.2.1; I.2.8; J.7; I.5.1; G.3

  8. arXiv:2011.03375  [pdf, other

    cs.LG math.OC

    A Scalable MIP-based Method for Learning Optimal Multivariate Decision Trees

    Authors: Haoran Zhu, Pavankumar Murali, Dzung T. Phan, Lam M. Nguyen, Jayant R. Kalagnanam

    Abstract: Several recent publications report advances in training optimal decision trees (ODT) using mixed-integer programs (MIP), due to algorithmic advances in integer programming and a growing interest in addressing the inherent suboptimality of heuristic approaches such as CART. In this paper, we propose a novel MIP formulation, based on a 1-norm support vector machine model, to train a multivariate ODT… ▽ More

    Submitted 6 November, 2020; originally announced November 2020.

  9. arXiv:2003.01184  [pdf, other

    cs.LG cs.NE physics.comp-ph

    Variational inference formulation for a model-free simulation of a dynamical system with unknown parameters by a recurrent neural network

    Authors: Kyongmin Yeo, Dylan E. C. Grullon, Fan-Keng Sun, Duane S. Boning, Jayant R. Kalagnanam

    Abstract: We propose a recurrent neural network for a "model-free" simulation of a dynamical system with unknown parameters without prior knowledge. The deep learning model aims to jointly learn the nonlinear time marching operator and the effects of the unknown parameters from a time series dataset. We assume that the time series data set consists of an ensemble of trajectories for a range of the parameter… ▽ More

    Submitted 26 February, 2021; v1 submitted 2 March, 2020; originally announced March 2020.

  10. arXiv:1901.07648  [pdf, other

    math.OC cs.LG stat.ML

    Finite-Sum Smooth Optimization with SARAH

    Authors: Lam M. Nguyen, Marten van Dijk, Dzung T. Phan, Phuong Ha Nguyen, Tsui-Wei Weng, Jayant R. Kalagnanam

    Abstract: The total complexity (measured as the total number of gradient computations) of a stochastic first-order optimization algorithm that finds a first-order stationary point of a finite-sum smooth nonconvex objective function $F(w)=\frac{1}{n} \sum_{i=1}^n f_i(w)$ has been proven to be at least $Ω(\sqrt{n}/ε)$ for $n \leq \mathcal{O}(ε^{-2})$ where $ε$ denotes the attained accuracy… ▽ More

    Submitted 22 April, 2019; v1 submitted 22 January, 2019; originally announced January 2019.

  11. arXiv:1901.07634   

    cs.LG math.OC stat.ML

    DTN: A Learning Rate Scheme with Convergence Rate of $\mathcal{O}(1/t)$ for SGD

    Authors: Lam M. Nguyen, Phuong Ha Nguyen, Dzung T. Phan, Jayant R. Kalagnanam, Marten van Dijk

    Abstract: This paper has some inconsistent results, i.e., we made some failed claims because we did some mistakes for using the test criterion for a series. Precisely, our claims on the convergence rate of $\mathcal{O}(1/t)$ of SGD presented in Theorem 1, Corollary 1, Theorem 2 and Corollary 2 are wrongly derived because they are based on Lemma 5. In Lemma 5, we do not correctly use the test criterion for a… ▽ More

    Submitted 27 February, 2019; v1 submitted 22 January, 2019; originally announced January 2019.

    Comments: This paper has inconsistent results, i.e., we made some failed claims because we did some mistakes for using the test criterion for a series

  12. arXiv:1801.06159  [pdf, other

    stat.ML cs.LG math.OC

    When Does Stochastic Gradient Algorithm Work Well?

    Authors: Lam M. Nguyen, Nam H. Nguyen, Dzung T. Phan, Jayant R. Kalagnanam, Katya Scheinberg

    Abstract: In this paper, we consider a general stochastic optimization problem which is often at the core of supervised learning, such as deep learning and linear classification. We consider a standard stochastic gradient descent (SGD) method with a fixed, large step size and propose a novel assumption on the objective function, under which this method has the improved convergence rates (to a neighborhood o… ▽ More

    Submitted 25 December, 2018; v1 submitted 18 January, 2018; originally announced January 2018.

  13. arXiv:1801.03009  [pdf, other

    physics.ao-ph cs.CE physics.data-an

    Development of hp-inverse model by using generalized polynomial chaos

    Authors: Kyongmin Yeo, Youngdeok Hwang, Xiao Liu, Jayant Kalagnanam

    Abstract: We present a hp-inverse model to estimate a smooth, non-negative source function from a limited number of observations for a two-dimensional linear source inversion problem. A standard least-square inverse model is formulated by using a set of Gaussian radial basis functions (GRBF) on a rectangular mesh system with a uniform grid space. Here, the choice of the mesh system is modeled as a random va… ▽ More

    Submitted 14 December, 2018; v1 submitted 9 January, 2018; originally announced January 2018.

  14. arXiv:1612.03225  [pdf, ps, other

    cs.LG math.OC stat.ML

    Optimal Generalized Decision Trees via Integer Programming

    Authors: Oktay Gunluk, Jayant Kalagnanam, Minhan Li, Matt Menickelly, Katya Scheinberg

    Abstract: Decision trees have been a very popular class of predictive models for decades due to their interpretability and good performance on categorical features. However, they are not always robust and tend to overfit the data. Additionally, if allowed to grow large, they lose interpretability. In this paper, we present a mixed integer programming formulation to construct optimal decision trees of a pres… ▽ More

    Submitted 13 August, 2019; v1 submitted 9 December, 2016; originally announced December 2016.

    MSC Class: 90C10

  15. arXiv:1609.09816  [pdf, other

    stat.ME stat.AP

    A Spatio-Temporal Modeling Approach for Weather Radar Reflectivity Data and Its Applications in Tropical Southeast Asia

    Authors: Xiao Liu, Viknesswaran Gopal, Jayant Kalagnanam

    Abstract: Weather radar echoes, correlated in both space and time, are the most important input data for short-term precipitation forecast. Motivated by real datasets, this paper is concerned with the spatio-temporal modeling of two-dimensional radar reflectivity fields from a sequence of radar images. Under a Lagrangian integration scheme, we model the radar reflectivity data by a spatio-temporal condition… ▽ More

    Submitted 30 September, 2016; originally announced September 2016.

    Comments: 31 pages, 9 figures

  16. arXiv:1609.07217  [pdf, other

    stat.ME stat.AP

    Statistical Modeling for Spatio-Temporal Degradation Data

    Authors: Xiao Liu, Kyongmin Yeo, Jayant Kalagnanam

    Abstract: This paper investigates the modeling of an important class of degradation data, which are collected from a spatial domain over time; for example, the surface quality degradation. Like many existing time-dependent stochastic degradation models, a special random field is constructed for modeling the spatio-temporal degradation process. In particular, we express the degradation at any spatial locatio… ▽ More

    Submitted 27 December, 2017; v1 submitted 22 September, 2016; originally announced September 2016.

    Comments: 30 pages, 7 figures. Manuscript prepared for submission

  17. arXiv:1304.2362  [pdf

    cs.AI

    A Comparison of Decision Analysis and Expert Rules for Sequential Diagnosis

    Authors: Jayant Kalagnanam, Max Henrion

    Abstract: There has long been debate about the relative merits of decision theoretic methods and heuristic rule-based approaches for reasoning under uncertainty. We report an experimental comparison of the performance of the two approaches to troubleshooting, specifically to test selection for fault diagnosis. We use as experimental testbed the problem of diagnosing motorcycle engines. The first approach… ▽ More

    Submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Fourth Conference on Uncertainty in Artificial Intelligence (UAI1988)

    Report number: UAI-P-1988-PG-205-212