Showing 1–50 of 93 results for author: Klabjan, D

  1. arXiv:2404.09443  [pdf, other]

    cs.LG cs.DC

    Hybrid FedGraph: An efficient hybrid federated learning algorithm using graph convolutional neural network

    Authors: Jaeyeon Jang, Diego Klabjan, Veena Mendiratta, Fanfei Meng

    Abstract: Federated learning is an emerging paradigm for decentralized training of machine learning models on distributed clients, without revealing the data to the central server. Most existing works have focused on horizontal or vertical data distributions, where each client possesses different samples with shared features, or each client fully shares only sample indices, respectively. However, the hybrid…

    Submitted 15 April, 2024; originally announced April 2024.

  2. arXiv:2402.04417  [pdf, ps, other]

    cs.LG cs.MA

    Decentralized Blockchain-based Robust Multi-agent Multi-armed Bandit

    Authors: Mengfan Xu, Diego Klabjan

    Abstract: We study a robust multi-agent multi-armed bandit problem where multiple clients or participants are distributed on a fully decentralized blockchain, with the possibility of some being malicious. The rewards of arms are homogeneous among the clients, following time-invariant stochastic distributions that are revealed to the participants only when the system is secure enough. The system's objective…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 16 pages

  3. arXiv:2311.16135  [pdf, other]

    cond-mat.mtrl-sci cs.LG

    Use of Deep Neural Networks for Uncertain Stress Functions with Extensions to Impact Mechanics

    Authors: Garrett Blum, Ryan Doris, Diego Klabjan, Horacio Espinosa, Ron Szalkowski

    Abstract: Stress-strain curves, or more generally, stress functions, are an extremely important characterization of a material's mechanical properties. However, stress functions are often difficult to derive and are narrowly tailored to a specific material. Further, large deformations, high strain-rates, temperature sensitivity, and effect of material parameters compound modeling challenges. We propose a ge…

    Submitted 19 December, 2023; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: Index Terms: Stress, Uncertainty, Impact Mechanics, Deep Learning, Neural Network. 10 pages, 9 figures, 6 tables

  4. arXiv:2311.07027  [pdf, other]

    cs.CR

    Robust softmax aggregation on blockchain based federated learning with convergence guarantee

    Authors: Huiyu Wu, Diego Klabjan

    Abstract: Blockchain based federated learning is a distributed learning scheme that allows model training without participants sharing their local data sets, where the blockchain components eliminate the need for a trusted central server compared to traditional Federated Learning algorithms. In this paper we propose a softmax aggregation blockchain based federated learning framework. First, we propose a new…

    Submitted 28 December, 2023; v1 submitted 12 November, 2023; originally announced November 2023.

  5. arXiv:2311.03745  [pdf, other]

    cs.CV cs.LG

    Unsupervised Video Summarization

    Authors: Hanqing Li, Diego Klabjan, Jean Utke

    Abstract: This paper introduces a new, unsupervised method for automatic video summarization using ideas from generative adversarial networks but eliminating the discriminator, having a simple loss function, and separating training of different parts of the model. An iterative training strategy is also applied by alternately training the reconstructor and the frame selector for multiple iterations. Furtherm…

    Submitted 7 November, 2023; originally announced November 2023.

  6. arXiv:2311.02546  [pdf, ps, other]

    cs.LG

    On the Second-Order Convergence of Biased Policy Gradient Algorithms

    Authors: Siqiao Mu, Diego Klabjan

    Abstract: Since the objective functions of reinforcement learning problems are typically highly nonconvex, it is desirable that policy gradient, the most popular algorithm, escapes saddle points and arrives at second-order stationary points. Existing results only consider vanilla policy gradient algorithms with unbiased gradient estimators, but practical implementations under the infinite-horizon discounted…

    Submitted 13 May, 2024; v1 submitted 4 November, 2023; originally announced November 2023.

  7. arXiv:2310.10611  [pdf, other]

    cs.LG stat.ML

    IW-GAE: Importance weighted group accuracy estimation for improved calibration and model selection in unsupervised domain adaptation

    Authors: Taejong Joo, Diego Klabjan

    Abstract: Reasoning about a model's accuracy on a test sample from its confidence is a central problem in machine learning, being connected to important applications such as uncertainty representation, model selection, and exploration. While these connections have been well-studied in the i.i.d. settings, distribution shifts pose significant challenges to the traditional methods. Therefore, model calibratio…

    Submitted 16 October, 2023; originally announced October 2023.

  8. arXiv:2309.01063  [pdf, other]

    cs.CV cs.LG

    Semi-supervised 3D Video Information Retrieval with Deep Neural Network and Bi-directional Dynamic-time Warping Algorithm

    Authors: Yintai Ma, Diego Klabjan

    Abstract: This paper presents a novel semi-supervised deep learning algorithm for retrieving similar 2D and 3D videos based on visual content. The proposed approach combines the power of deep convolutional and recurrent neural networks with dynamic time warping as a similarity measure. The proposed algorithm is designed to handle large video datasets and retrieve the most related videos to a given inquiry v…

    Submitted 2 September, 2023; originally announced September 2023.

    Comments: 10 pages, submitted to IEEE Conference Big Data 2023

  9. arXiv:2309.00626  [pdf, other]

    q-fin.TR cs.LG

    An Ensemble Method of Deep Reinforcement Learning for Automated Cryptocurrency Trading

    Authors: Shuyang Wang, Diego Klabjan

    Abstract: We propose an ensemble method to improve the generalization performance of trading strategies trained by deep reinforcement learning algorithms in a highly stochastic environment of intraday cryptocurrency portfolio trading. We adopt a model selection method that evaluates on multiple validation periods, and propose a novel mixture distribution policy to effectively ensemble the selected models. W…

    Submitted 27 July, 2023; originally announced September 2023.

  10. arXiv:2308.08046  [pdf, ps, other]

    cs.LG stat.ML

    Regret Lower Bounds in Multi-agent Multi-armed Bandit

    Authors: Mengfan Xu, Diego Klabjan

    Abstract: Multi-armed Bandit motivates methods with provable upper bounds on regret and also the counterpart lower bounds have been extensively studied in this context. Recently, Multi-agent Multi-armed Bandit has gained significant traction in various domains, where individual clients face bandit problems in a distributed manner and the objective is the overall system performance, typically measured by reg…

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: 10 pages

  11. arXiv:2307.07529  [pdf, other]

    cs.LG cs.AI cs.MA

    Learning Multiple Coordinated Agents under Directed Acyclic Graph Constraints

    Authors: Jaeyeon Jang, Diego Klabjan, Han Liu, Nital S. Patel, Xiuqi Li, Balakrishnan Ananthanarayanan, Husam Dauod, Tzung-Han Juang

    Abstract: This paper proposes a novel multi-agent reinforcement learning (MARL) method to learn multiple coordinated agents under directed acyclic graph (DAG) constraints. Unlike existing MARL approaches, our method explicitly exploits the DAG structure between agents to achieve more effective learning performance. Theoretically, we propose a novel surrogate value function based on a MARL model with synthet…

    Submitted 13 July, 2023; originally announced July 2023.

  12. arXiv:2307.00226  [pdf, other]

    cs.CV cs.LG

    S-Omninet: Structured Data Enhanced Universal Multimodal Learning Architecture

    Authors: Ye Xue, Diego Klabjan, Jean Utke

    Abstract: Multimodal multitask learning has attracted an increasing interest in recent years. Singlemodal models have been advancing rapidly and have achieved astonishing results on various tasks across multiple domains. Multimodal learning offers opportunities for further improvements by integrating data from multiple modalities. Many methods are proposed to learn on a specific type of multimodal data, suc…

    Submitted 1 July, 2023; originally announced July 2023.

  13. arXiv:2306.05579  [pdf, other]

    cs.LG stat.ML

    Decentralized Randomly Distributed Multi-agent Multi-armed Bandit with Heterogeneous Rewards

    Authors: Mengfan Xu, Diego Klabjan

    Abstract: We study a decentralized multi-agent multi-armed bandit problem in which multiple clients are connected by time dependent random graphs provided by an environment. The reward distributions of each arm vary across clients and rewards are generated independently over time by an environment based on distributions that include both sub-exponential and sub-gaussian distributions. Each client pulls an a…

    Submitted 17 October, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: 58 pages, to appear at Advances in Neural Information Processing Systems (NeurIPS 2023 Spotlight)

  14. arXiv:2305.01151  [pdf, ps, other]

    cs.LG

    Early Classifying Multimodal Sequences

    Authors: Alexander Cao, Jean Utke, Diego Klabjan

    Abstract: Often pieces of information are received sequentially over time. When did one collect enough such pieces to classify? Trading wait time for decision certainty leads to early classification problems that have recently gained attention as a means of adapting classification to more dynamic environments. However, so far results have been limited to unimodal sequences. In this pilot study, we expand in…

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: 7 pages, 5 figures

  15. arXiv:2304.11268  [pdf, other]

    math.OC

    Stochastic Scale Invariant Power Iteration for KL-divergence Nonnegative Matrix Factorization

    Authors: Cheolmin Kim, Youngseok Kim, Diego Klabjan

    Abstract: We introduce a mini-batch stochastic variance-reduced algorithm to solve finite-sum scale invariant problems which cover several examples in machine learning and statistics such as principal component analysis (PCA) and estimation of mixture proportions. The algorithm is a stochastic generalization of scale invariant power iteration, specializing to power iteration when full-batch is used for the…

    Submitted 21 April, 2023; originally announced April 2023.

  16. arXiv:2304.03463  [pdf, ps, other]

    cs.LG

    A Policy for Early Sequence Classification

    Authors: Alexander Cao, Jean Utke, Diego Klabjan

    Abstract: Sequences are often not received in their entirety at once, but instead, received incrementally over time, element by element. Early predictions yielding a higher benefit, one aims to classify a sequence as accurately as possible, as soon as possible, without having to wait for the last element. For this early sequence classification, we introduce our novel classifier-induced stopping. While previ…

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: 12 pages, 6 figures

  17. arXiv:2302.14299  [pdf, other]

    cs.LG cs.AI

    Gradient-Boosted Based Structured and Unstructured Learning

    Authors: Andrea Treviño Gavito, Diego Klabjan, Jean Utke

    Abstract: We propose two frameworks to deal with problem settings in which both structured and unstructured data are available. Structured data problems are best solved by traditional machine learning models such as boosting and tree-based algorithms, whereas deep learning has been widely applied to problems dealing with images, text, audio, and other unstructured data sources. However, for the setting in w…

    Submitted 27 February, 2023; originally announced February 2023.

  18. arXiv:2302.14278  [pdf, other]

    cs.LG cs.AI

    Multi-Layer Attention-Based Explainability via Transformers for Tabular Data

    Authors: Andrea Treviño Gavito, Diego Klabjan, Jean Utke

    Abstract: We propose a graph-oriented attention-based explainability method for tabular data. Tasks involving tabular data have been solved mostly using traditional tree-based machine learning models which have the challenges of feature selection and engineering. With that in mind, we consider a transformer architecture for tabular data, which is amenable to explainability, and present a novel way to levera…

    Submitted 3 June, 2024; v1 submitted 27 February, 2023; originally announced February 2023.

  19. arXiv:2212.11360  [pdf, other]

    cs.LG

    Feature Acquisition using Monte Carlo Tree Search

    Authors: Sungsoo Lim, Diego Klabjan, Mark Shapiro

    Abstract: Feature acquisition algorithms address the problem of acquiring informative features while balancing the costs of acquisition to improve the learning performances of ML models. Previous approaches have focused on calculating the expected utility values of features to determine the acquisition sequences. Other approaches formulated the problem as a Markov Decision Process (MDP) and applied reinforc…

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: 13 pages, 7 figures

  20. arXiv:2212.00884  [pdf, other]

    cs.LG stat.ML

    Pareto Regret Analyses in Multi-objective Multi-armed Bandit

    Authors: Mengfan Xu, Diego Klabjan

    Abstract: We study Pareto optimality in multi-objective multi-armed bandit by providing a formulation of adversarial multi-objective multi-armed bandit and defining its Pareto regrets that can be applied to both stochastic and adversarial settings. The regrets do not rely on any scalarization functions and reflect Pareto optimality compared to scalarized regrets. We also present new algorithms assuming both…

    Submitted 30 May, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: 19 pages; accepted at ICML 2023 and to be published in Proceedings of Machine Learning Research (PMLR)

  21. arXiv:2210.08106  [pdf, other]

    cs.LG

    A Primal-Dual Algorithm for Hybrid Federated Learning

    Authors: Tom Overman, Garrett Blum, Diego Klabjan

    Abstract: Very few methods for hybrid federated learning, where clients only hold subsets of both features and samples, exist. Yet, this scenario is extremely important in practical settings. We provide a fast, robust algorithm for hybrid federated learning that hinges on Fenchel Duality. We prove the convergence of the algorithm to the same solution as if the model is trained centrally in a variety of prac…

    Submitted 9 February, 2024; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: Accepted by AAAI 2024. To appear in AAAI proceedings

  22. arXiv:2210.05607  [pdf, other]

    cs.LG math.OC

    Divergence Results and Convergence of a Variance Reduced Version of ADAM

    Authors: Ruiqi Wang, Diego Klabjan

    Abstract: Stochastic optimization algorithms using exponential moving averages of the past gradients, such as ADAM, RMSProp and AdaGrad, have been having great successes in many applications, especially in training deep neural networks. ADAM in particular stands out as efficient and robust. Despite of its outstanding performance, ADAM has been proved to be divergent for some specific problems. We revisit th…

    Submitted 11 October, 2022; originally announced October 2022.

  23. arXiv:2205.00548  [pdf, other]

    cs.CL cs.IR

    Large-Scale Multi-Document Summarization with Information Extraction and Compression

    Authors: Ning Wang, Han Liu, Diego Klabjan

    Abstract: We develop an abstractive summarization framework independent of labeled data for multiple heterogeneous documents. Unlike existing multi-document summarization methods, our framework processes documents telling different stories instead of documents on the same topic. We also enhance an existing sentence fusion method with a uni-directional language model to prioritize fused sentences with higher…

    Submitted 1 May, 2022; originally announced May 2022.

  24. arXiv:2203.00762  [pdf, other]

    cs.LG cs.CL cs.IR

    Topic Analysis for Text with Side Data

    Authors: Biyi Fang, Kripa Rajshekhar, Diego Klabjan

    Abstract: Although latent factor models (e.g., matrix factorization) obtain good performance in predictions, they suffer from several problems including cold-start, non-transparency, and suboptimal recommendations. In this paper, we employ text with side data to tackle these limitations. We introduce a hybrid generative probabilistic model that combines a neural network with a latent topic model, which is a…

    Submitted 1 March, 2022; originally announced March 2022.

  25. arXiv:2203.00761  [pdf, other]

    cs.LG cs.CV

    Tricks and Plugins to GBM on Images and Sequences

    Authors: Biyi Fang, Jean Utke, Diego Klabjan

    Abstract: Convolutional neural networks (CNNs) and transformers, which are composed of multiple processing layers and blocks to learn the representations of data with multiple abstract levels, are the most successful machine learning models in recent years. However, millions of parameters and many blocks make them difficult to be trained, and sometimes several days or weeks are required to find an ideal arc…

    Submitted 1 March, 2022; originally announced March 2022.

  26. arXiv:2201.02923  [pdf, ps, other]

    cs.LG

    Open-Set Recognition of Breast Cancer Treatments

    Authors: Alexander Cao, Diego Klabjan, Yuan Luo

    Abstract: Open-set recognition generalizes a classification task by classifying test samples as one of the known classes from training or "unknown." As novel cancer drug cocktails with improved treatment are continually discovered, predicting cancer treatments can naturally be formulated in terms of an open-set recognition problem. Drawbacks, due to modeling unknown samples during training, arise from strai…

    Submitted 8 January, 2022; originally announced January 2022.

    Comments: 22 pages, 9 figures and 9 tables

  27. arXiv:2111.08577  [pdf, other]

    cs.LG

    Neuron-based Pruning of Deep Neural Networks with Better Generalization using Kronecker Factored Curvature Approximation

    Authors: Abdolghani Ebrahimi, Diego Klabjan

    Abstract: Existing methods of pruning deep neural networks focus on removing unnecessary parameters of the trained network and fine tuning the model afterwards to find a good solution that recovers the initial performance of the trained model. Unlike other works, our method pays special attention to the quality of the solution in the compressed model and inference computation time by pruning neurons. The pr…

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: 15 pages, 5 figures

  28. arXiv:2108.07433  [pdf, other]

    cs.LG cs.DC

    Aggregation Delayed Federated Learning

    Authors: Ye Xue, Diego Klabjan, Yuan Luo

    Abstract: Federated learning is a distributed machine learning paradigm where multiple data owners (clients) collaboratively train one machine learning model while keeping data on their own devices. The heterogeneity of client datasets is one of the most important challenges of federated learning algorithms. Studies have found performance reduction with standard federated algorithms, such as FedAvg, on non-…

    Submitted 17 August, 2021; originally announced August 2021.

  29. arXiv:2107.02845  [pdf, other]

    cs.LG

    Logit-based Uncertainty Measure in Classification

    Authors: Huiyu Wu, Diego Klabjan

    Abstract: We introduce a new, reliable, and agnostic uncertainty measure for classification tasks called logit uncertainty. It is based on logit outputs of neural networks. We in particular show that this new uncertainty measure yields a superior performance compared to existing uncertainty measures on different tasks, including out of sample detection and finding erroneous predictions. We analyze theoretic…

    Submitted 6 July, 2021; originally announced July 2021.

  30. arXiv:2105.10065  [pdf, other]

    cs.LG

    A Probabilistic Approach to Neural Network Pruning

    Authors: Xin Qian, Diego Klabjan

    Abstract: Neural network pruning techniques reduce the number of parameters without compromising predicting ability of a network. Many algorithms have been developed for pruning both over-parameterized fully-connected networks (FCNs) and convolutional neural networks (CNNs), but analytical studies of capabilities and compression ratios of such pruned sub-networks are lacking. We theoretically study the perf…

    Submitted 20 May, 2021; originally announced May 2021.

  31. arXiv:2102.11210  [pdf, other]

    cs.LG

    Non-Convex Optimization with Spectral Radius Regularization

    Authors: Adam Sandler, Diego Klabjan, Yuan Luo

    Abstract: We develop a regularization method which finds flat minima during the training of deep neural networks and other machine learning models. These minima generalize better than sharp minima, allowing models to better generalize to real word test data, which may be distributed differently from the training data. Specifically, we propose a method of regularized optimization to reduce the spectral radiu…

    Submitted 22 February, 2021; originally announced February 2021.

    Comments: 12 pages

  32. arXiv:2102.00380  [pdf, other]

    cs.LG stat.ML

    Classification Models for Partially Ordered Sequences

    Authors: Stephanie Ger, Diego Klabjan, Jean Utke

    Abstract: Many models such as Long Short Term Memory (LSTMs), Gated Recurrent Units (GRUs) and transformers have been developed to classify time series data with the assumption that events in a sequence are ordered. On the other hand, fewer models have been developed for set based inputs, where order does not matter. There are several use cases where data is given as partially-ordered sequences because of t…

    Submitted 31 January, 2021; originally announced February 2021.

  33. arXiv:2101.02561  [pdf, other]

    stat.ML cs.AI cs.LG

    Open Set Domain Adaptation by Extreme Value Theory

    Authors: Yiming Xu, Diego Klabjan

    Abstract: Common domain adaptation techniques assume that the source domain and the target domain share an identical label space, which is problematic since when target samples are unlabeled we have no knowledge on whether the two domains share the same label space. When this is not the case, the existing methods fail to perform well because the additional unknown classes are also matched with the source do…

    Submitted 22 December, 2020; originally announced January 2021.

  34. arXiv:2012.04759  [pdf, other]

    cs.AI

    Concept Drift and Covariate Shift Detection Ensemble with Lagged Labels

    Authors: Yiming Xu, Diego Klabjan

    Abstract: In model serving, having one fixed model during the entire often life-long inference process is usually detrimental to model performance, as data distribution evolves over time, resulting in lack of reliability of the model trained on historical data. It is important to detect changes and retrain the model in time. The existing methods generally have three weaknesses: 1) using only classification…

    Submitted 14 December, 2020; v1 submitted 8 December, 2020; originally announced December 2020.

  35. arXiv:2009.14111  [pdf, other]

    cs.LG stat.ML

    Inverse Classification with Limited Budget and Maximum Number of Perturbed Samples

    Authors: Jaehoon Koo, Diego Klabjan, Jean Utke

    Abstract: Most recent machine learning research focuses on developing new classifiers for the sake of improving classification accuracy. With many well-performing state-of-the-art classifiers available, there is a growing need for understanding interpretability of a classifier necessitated by practical purposes such as to find the best diet recommendation for a diabetes patient. Inverse classification is a…

    Submitted 29 September, 2020; originally announced September 2020.

  36. arXiv:2009.09538  [pdf, other]

    cs.LG cs.AI stat.ML

    Regret Bounds and Reinforcement Learning Exploration of EXP-based Algorithms

    Authors: Mengfan Xu, Diego Klabjan

    Abstract: We study the challenging exploration incentive problem in both bandit and reinforcement learning, where the rewards are scale-free and potentially unbounded, driven by real-world scenarios and differing from existing work. Past works in reinforcement learning either assume costly interactions with an environment or propose algorithms finding potentially low quality local maxima. Motivated by EXP-t…

    Submitted 3 May, 2024; v1 submitted 20 September, 2020; originally announced September 2020.

    Comments: 40 pages, 8 figures

  37. arXiv:2006.04027  [pdf, ps, other]

    cs.LG cs.AI stat.ML

    Efficient Architecture Search for Continual Learning

    Authors: Qiang Gao, Zhipeng Luo, Diego Klabjan

    Abstract: Continual learning with neural networks is an important learning framework in AI that aims to learn a sequence of tasks well. However, it is often confronted with three challenges: (1) overcome the catastrophic forgetting problem, (2) adapt the current network to new tasks, and meanwhile (3) control its model complexity. To reach these goals, we propose a novel approach named as Continual Learning…

    Submitted 9 June, 2020; v1 submitted 6 June, 2020; originally announced June 2020.

    Comments: 12 pages, 11 figures

  38. arXiv:2006.02003  [pdf, other]

    cs.LG cs.CV cs.NE stat.ML

    Open-Set Recognition with Gaussian Mixture Variational Autoencoders

    Authors: Alexander Cao, Yuan Luo, Diego Klabjan

    Abstract: In inference, open-set classification is to either classify a sample into a known class from training or reject it as an unknown class. Existing deep open-set classifiers train explicit closed-set classifiers, in some cases disjointly utilizing reconstruction, which we find dilutes the latent representation's ability to distinguish unknown classes. In contrast, we train our model to cooperatively…

    Submitted 2 June, 2020; originally announced June 2020.

    Comments: 12 pages including 8 figures and 4 tables, plus 6 pages of supplementary material

  39. arXiv:2004.14203  [pdf, other]

    cs.LG stat.ML

    Neural Network Retraining for Model Serving

    Authors: Diego Klabjan, Xiaofeng Zhu

    Abstract: We propose incremental (re)training of a neural network model to cope with a continuous flow of new data in inference during model serving. As such, this is a life-long learning process. We address two challenges of life-long retraining: catastrophic forgetting and efficient retraining. If we combine all past and new data it can easily become intractable to retrain the neural network model. On the…

    Submitted 29 April, 2020; originally announced April 2020.

  40. arXiv:2004.13146  [pdf, other]

    math.OC cs.LG

    The Impact of the Mini-batch Size on the Variance of Gradients in Stochastic Gradient Descent

    Authors: Xin Qian, Diego Klabjan

    Abstract: The mini-batch stochastic gradient descent (SGD) algorithm is widely used in training machine learning models, in particular deep learning models. We study SGD dynamics under linear regression and two-layer linear networks, with an easy extension to deeper linear networks, by focusing on the variance of the gradients, which is the first study of this nature. In the linear regression case, we show…

    Submitted 27 April, 2020; originally announced April 2020.

  41. arXiv:2001.07866  [pdf, other]

    stat.ML cs.IR cs.LG

    Keyword-based Topic Modeling and Keyword Selection

    Authors: Xingyu Wang, Lida Zhang, Diego Klabjan

    Abstract: Certain type of documents such as tweets are collected by specifying a set of keywords. As topics of interest change with time it is beneficial to adjust keywords dynamically. The challenge is that these need to be specified ahead of knowing the forthcoming documents and the underlying topics. The future topics should mimic past topics of interest yet there should be some novelty in them. We devel…

    Submitted 21 January, 2020; originally announced January 2020.

  42. arXiv:2001.01828  [pdf, other]

    cs.IR cs.LG stat.ML

    Listwise Learning to Rank by Exploring Unique Ratings

    Authors: Xiaofeng Zhu, Diego Klabjan

    Abstract: In this paper, we propose new listwise learning-to-rank models that mitigate the shortcomings of existing ones. Existing listwise learning-to-rank models are generally derived from the classical Plackett-Luce model, which has three major limitations. (1) Its permutation probabilities overlook ties, i.e., a situation when more than one document has the same rating with respect to a query. This can…

    Submitted 22 January, 2020; v1 submitted 6 January, 2020; originally announced January 2020.

    Journal ref: WSDM 2020

  43. arXiv:1911.12426  [pdf, other]

    cs.LG stat.ME stat.ML

    Conditional Hierarchical Bayesian Tucker Decomposition for Genetic Data Analysis

    Authors: Adam Sandler, Diego Klabjan, Yuan Luo

    Abstract: We develop methods for reducing the dimensionality of large data sets, common in biomedical applications. Learning about patients using genetic data often includes more features than observations, which makes direct supervised learning difficult. One method of reducing the feature space is to use latent Dirichlet allocation to group genetic variants in an unsupervised manner. Latent Dirichlet allo…

    Submitted 27 December, 2022; v1 submitted 27 November, 2019; originally announced November 2019.

    Comments: 38 pages, 8 figures, 5 tables

  44. Mixture-based Multiple Imputation Model for Clinical Data with a Temporal Dimension

    Authors: Ye Xue, Diego Klabjan, Yuan Luo

    Abstract: The problem of missing values in multivariable time series is a key challenge in many applications such as clinical data mining. Although many imputation methods show their effectiveness in many applications, few of them are designed to accommodate clinical multivariable time series. In this work, we propose a multiple imputation model that capture both cross-sectional information and temporal cor…

    Submitted 2 March, 2020; v1 submitted 12 August, 2019; originally announced August 2019.

  45. arXiv:1906.11906  [pdf, other]

    cs.CV cs.CL cs.LG

    Data Extraction from Charts via Single Deep Neural Network

    Authors: Xiaoyi Liu, Diego Klabjan, Patrick NBless

    Abstract: Automatic data extraction from charts is challenging for two reasons: there exist many relations among objects in a chart, which is not a common consideration in general computer vision problems; and different types of charts may not be processed by the same model. To address these problems, we propose a framework of a single deep neural network, which consists of object detection, text recognitio…

    Submitted 6 June, 2019; originally announced June 2019.

  46. arXiv:1905.10540  [pdf, other]

    cs.LG cs.NE stat.ML

    Dynamic Cell Structure via Recursive-Recurrent Neural Networks

    Authors: Xin Qian, Matthew Kennedy, Diego Klabjan

    Abstract: In a recurrent setting, conventional approaches to neural architecture search find and fix a general model for all data samples and time steps. We propose a novel algorithm that can dynamically search for the structure of cells in a recurrent neural network model. Based on a combination of recurrent and recursive neural networks, our algorithm is able to construct customized cell structures for ea…

    Submitted 25 May, 2019; originally announced May 2019.

  47. arXiv:1905.09882  [pdf, other]

    math.OC cs.LG stat.ML

    Scale Invariant Power Iteration

    Authors: Cheolmin Kim, Youngseok Kim, Diego Klabjan

    Abstract: Power iteration has been generalized to solve many interesting problems in machine learning and statistics. Despite its striking success, theoretical understanding of when and how such an algorithm enjoys good convergence property is limited. In this work, we introduce a new class of optimization problems called scale invariant problems and prove that they can be efficiently solved by scale invari…

    Submitted 11 June, 2020; v1 submitted 23 May, 2019; originally announced May 2019.

  48. arXiv:1905.09356  [pdf, other]

    cs.LG cs.DS stat.ML

    Convergence Analyses of Online ADAM Algorithm in Convex Setting and Two-Layer ReLU Neural Network

    Authors: Biyi Fang, Diego Klabjan

    Abstract: Nowadays, online learning is an appealing learning paradigm, which is of great interest in practice due to the recent emergence of large scale applications such as online advertising placement and online web ranking. Standard online learning assumes a finite number of samples while in practice data is streamed infinitely. In such a setting gradient descent with a diminishing learning rate does not…

    Submitted 25 November, 2019; v1 submitted 22 May, 2019; originally announced May 2019.

  49. arXiv:1903.04360  [pdf, other]

    cs.IR cs.LG stat.ML

    Automatic Ontology Learning from Domain-Specific Short Unstructured Text Data

    Authors: Yiming Xu, Dnyanesh Rajpathak, Ian Gibbs, Diego Klabjan

    Abstract: Ontology learning is a critical task in industry, dealing with identifying and extracting concepts captured in text data such that these concepts can be used in different tasks, e.g. information retrieval. Ontology learning is non-trivial due to several reasons with limited amount of prior research work that automatically learns a domain specific ontology from data. In our work, we propose a two-s…

    Submitted 7 March, 2019; originally announced March 2019.

  50. arXiv:1901.08179  [pdf, ps, other]

    math.OC

    Stochastic Variance-Reduced Heavy Ball Power Iteration

    Authors: Cheolmin Kim, Diego Klabjan

    Abstract: We present a stochastic variance-reduced heavy ball power iteration algorithm for solving PCA and provide a convergence analysis for it. The algorithm is an extension of heavy ball power iteration, incorporating a step size so that progress can be controlled depending on the magnitude of the variance of stochastic gradients. The algorithm works with any size of the mini-batch, and if the step size…

    Submitted 23 January, 2019; originally announced January 2019.