Skip to main content

Showing 1–19 of 19 results for author: Phan, D T

.
  1. arXiv:2404.01270  [pdf, other

    cs.LG cs.CR cs.DC

    Decentralized Collaborative Learning Framework with External Privacy Leakage Analysis

    Authors: Tsuyoshi Idé, Dzung T. Phan, Rudy Raymond

    Abstract: This paper presents two methodological advancements in decentralized multi-task learning under privacy constraints, aiming to pave the way for future developments in next-generation Blockchain platforms. First, we expand the existing framework for collaborative dictionary learning (CollabDict), which has previously been limited to Gaussian mixture models, by incorporating deep variational autoenco… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: To appear in Proceeding of 2023 International workshop Blockchain Kaigi (BCK 23), JPS Conference Proceedings, 2024

  2. arXiv:2403.03611  [pdf

    eess.AS cs.SD

    Comparison Performance of Spectrogram and Scalogram as Input of Acoustic Recognition Task

    Authors: Dang Thoai Phan

    Abstract: Acoustic recognition is a common task for deep learning in recent researches, with the employment of spectral feature extraction such as Short-time Fourier transform and Wavelet transform. However, not many researches have found that discuss the advantages and drawbacks, as well as performance comparison of them. In this consideration, this paper aims to comparing the attributes of these two trans… ▽ More

    Submitted 26 April, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  3. arXiv:2208.10671  [pdf, other

    cs.LG cs.AI stat.ME

    Cardinality-Regularized Hawkes-Granger Model

    Authors: Tsuyoshi Idé, Georgios Kollias, Dzung T. Phan, Naoki Abe

    Abstract: We propose a new sparse Granger-causal learning framework for temporal event data. We focus on a specific class of point processes called the Hawkes process. We begin by pointing out that most of the existing sparse causal learning algorithms for the Hawkes process suffer from a singularity in maximum likelihood estimation. As a result, their sparse solutions can appear only as numerical artifacts… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: 17 pages, 9 figures

  4. arXiv:2202.00865  [pdf, other

    math.OC

    StepDIRECT -- A Derivative-Free Optimization Method for Stepwise Functions

    Authors: Dzung T. Phan, Hongsheng Liu, Lam M. Nguyen

    Abstract: In this paper, we propose the StepDIRECT algorithm for derivative-free optimization (DFO), in which the black-box objective function has a stepwise landscape. Our framework is based on the well-known DIRECT algorithm. By incorporating the local variability to explore the flatness, we provide a new criterion to select the potentially optimal hyper-rectangles. In addition, we introduce a stochastic… ▽ More

    Submitted 1 February, 2022; originally announced February 2022.

  5. arXiv:2110.09131  [pdf, other

    cs.CL cs.AI

    Ensembling Graph Predictions for AMR Parsing

    Authors: Hoang Thanh Lam, Gabriele Picco, Yufang Hou, Young-Suk Lee, Lam M. Nguyen, Dzung T. Phan, Vanessa López, Ramon Fernandez Astudillo

    Abstract: In many machine learning tasks, models are trained to predict structure data such as graphs. For example, in natural language processing, it is very common to parse texts into dependency trees or abstract meaning representation (AMR) graphs. On the other hand, ensemble methods combine predictions from multiple models to create a new one that is more robust and accurate than individual predictions.… ▽ More

    Submitted 24 January, 2022; v1 submitted 18 October, 2021; originally announced October 2021.

    Comments: Published at NeurIPS 2021

  6. arXiv:2104.02278  [pdf, other

    cs.LG cs.MA

    A novel activity pattern generation incorporating deep learning for transport demand models

    Authors: Danh T. Phan, Hai L. Vu

    Abstract: Activity generation plays an important role in activity-based demand modelling systems. While machine learning, especially deep learning, has been increasingly used for mode choice and traffic flow prediction, much less research exploiting the advantage of deep learning for activity generation tasks. This paper proposes a novel activity pattern generation framework by incorporating deep learning w… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

    Comments: 21 pages, 12 figures

  7. arXiv:2103.03452  [pdf, other

    stat.ML cs.DC cs.LG

    FedDR -- Randomized Douglas-Rachford Splitting Algorithms for Nonconvex Federated Composite Optimization

    Authors: Quoc Tran-Dinh, Nhan H. Pham, Dzung T. Phan, Lam M. Nguyen

    Abstract: We develop two new algorithms, called, FedDR and asyncFedDR, for solving a fundamental nonconvex composite optimization problem in federated learning. Our algorithms rely on a novel combination between a nonconvex Douglas-Rachford splitting method, randomized block-coordinate strategies, and asynchronous implementation. They can also handle convex regularizers. Unlike recent methods in the literat… ▽ More

    Submitted 28 October, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

    Comments: 39 pages, and 12 figures

    Report number: UNC-STOR-June 2021

    Journal ref: NeurIPs 2021

  8. arXiv:2011.03375  [pdf, other

    cs.LG math.OC

    A Scalable MIP-based Method for Learning Optimal Multivariate Decision Trees

    Authors: Haoran Zhu, Pavankumar Murali, Dzung T. Phan, Lam M. Nguyen, Jayant R. Kalagnanam

    Abstract: Several recent publications report advances in training optimal decision trees (ODT) using mixed-integer programs (MIP), due to algorithmic advances in integer programming and a growing interest in addressing the inherent suboptimality of heuristic approaches such as CART. In this paper, we propose a novel MIP formulation, based on a 1-norm support vector machine model, to train a multivariate ODT… ▽ More

    Submitted 6 November, 2020; originally announced November 2020.

  9. arXiv:2003.00430  [pdf, other

    cs.LG math.OC

    A Hybrid Stochastic Policy Gradient Algorithm for Reinforcement Learning

    Authors: Nhan H. Pham, Lam M. Nguyen, Dzung T. Phan, Phuong Ha Nguyen, Marten van Dijk, Quoc Tran-Dinh

    Abstract: We propose a novel hybrid stochastic policy gradient estimator by combining an unbiased policy gradient estimator, the REINFORCE estimator, with another biased one, an adapted SARAH estimator for policy optimization. The hybrid policy gradient estimator is shown to be biased, but has variance reduced property. Using this estimator, we develop a new Proximal Hybrid Stochastic Policy Gradient Algori… ▽ More

    Submitted 21 September, 2020; v1 submitted 1 March, 2020; originally announced March 2020.

    Comments: Accepted for publication at the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS 2020)

    Journal ref: Proceedings of the International Conference on Artificial Intelligence and Statistics, PMLR 108:374-385, 2020

  10. arXiv:2002.08246  [pdf, other

    math.OC cs.LG stat.ML

    A Unified Convergence Analysis for Shuffling-Type Gradient Methods

    Authors: Lam M. Nguyen, Quoc Tran-Dinh, Dzung T. Phan, Phuong Ha Nguyen, Marten van Dijk

    Abstract: In this paper, we propose a unified convergence analysis for a class of generic shuffling-type gradient methods for solving finite-sum optimization problems. Our analysis works with any sampling without replacement strategy and covers many known variants such as randomized reshuffling, deterministic or randomized single permutation, and cyclic and incremental gradient schemes. We focus on two diff… ▽ More

    Submitted 19 September, 2021; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: Journal of Machine Learning Research, 2021

  11. arXiv:1908.00528  [pdf, other

    cs.AI eess.SY

    Neural Simplex Architecture

    Authors: Dung T. Phan, Radu Grosu, Nils Jansen, Nicola Paoletti, Scott A. Smolka, Scott D. Stoller

    Abstract: We present the Neural Simplex Architecture (NSA), a new approach to runtime assurance that provides safety guarantees for neural controllers (obtained e.g. using reinforcement learning) of autonomous and other complex systems without unduly sacrificing performance. NSA is inspired by the Simplex control architecture of Sha et al., but with some significant differences. In the traditional approach,… ▽ More

    Submitted 24 March, 2020; v1 submitted 1 August, 2019; originally announced August 2019.

    Comments: 12th NASA Formal Methods Symposium (NFM 2020)

  12. arXiv:1907.03793  [pdf, other

    math.OC cs.LG stat.ML

    A Hybrid Stochastic Optimization Framework for Stochastic Composite Nonconvex Optimization

    Authors: Quoc Tran-Dinh, Nhan H. Pham, Dzung T. Phan, Lam M. Nguyen

    Abstract: We introduce a new approach to develop stochastic optimization algorithms for a class of stochastic composite and possibly nonconvex optimization problems. The main idea is to combine two stochastic estimators to create a new hybrid one. We first introduce our hybrid estimator and then investigate its fundamental properties to form a foundational theory for algorithmic development. Next, we apply… ▽ More

    Submitted 2 May, 2020; v1 submitted 8 July, 2019; originally announced July 2019.

    Comments: 49 pages, 2 tables, 9 figures

    Report number: UNC-STOR-2019.07.V1-03

  13. arXiv:1905.05920  [pdf, other

    math.OC stat.ML

    Hybrid Stochastic Gradient Descent Algorithms for Stochastic Nonconvex Optimization

    Authors: Quoc Tran-Dinh, Nhan H. Pham, Dzung T. Phan, Lam M. Nguyen

    Abstract: We introduce a hybrid stochastic estimator to design stochastic gradient algorithms for solving stochastic optimization problems. Such a hybrid estimator is a convex combination of two existing biased and unbiased estimators and leads to some useful property on its variance. We limit our consideration to a hybrid SARAH-SGD for nonconvex expectation problems. However, our idea can be extended to ha… ▽ More

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: 41 pages and 18 figures

    Report number: UNC-STOR-May-2019-03

  14. arXiv:1902.05679  [pdf, other

    math.OC cs.LG stat.ML

    ProxSARAH: An Efficient Algorithmic Framework for Stochastic Composite Nonconvex Optimization

    Authors: Nhan H. Pham, Lam M. Nguyen, Dzung T. Phan, Quoc Tran-Dinh

    Abstract: We propose a new stochastic first-order algorithmic framework to solve stochastic composite nonconvex optimization problems that covers both finite-sum and expectation settings. Our algorithms rely on the SARAH estimator introduced in (Nguyen et al, 2017) and consist of two steps: a proximal gradient and an averaging step making them different from existing nonconvex proximal-type algorithms. The… ▽ More

    Submitted 28 March, 2019; v1 submitted 14 February, 2019; originally announced February 2019.

    Comments: 45 pages, 8 figures, and 2 table

    Report number: STOR-UNC-Feb14.2019

  15. arXiv:1901.07648  [pdf, other

    math.OC cs.LG stat.ML

    Finite-Sum Smooth Optimization with SARAH

    Authors: Lam M. Nguyen, Marten van Dijk, Dzung T. Phan, Phuong Ha Nguyen, Tsui-Wei Weng, Jayant R. Kalagnanam

    Abstract: The total complexity (measured as the total number of gradient computations) of a stochastic first-order optimization algorithm that finds a first-order stationary point of a finite-sum smooth nonconvex objective function $F(w)=\frac{1}{n} \sum_{i=1}^n f_i(w)$ has been proven to be at least $Ω(\sqrt{n}/ε)$ for $n \leq \mathcal{O}(ε^{-2})$ where $ε$ denotes the attained accuracy… ▽ More

    Submitted 22 April, 2019; v1 submitted 22 January, 2019; originally announced January 2019.

  16. arXiv:1901.07634   

    cs.LG math.OC stat.ML

    DTN: A Learning Rate Scheme with Convergence Rate of $\mathcal{O}(1/t)$ for SGD

    Authors: Lam M. Nguyen, Phuong Ha Nguyen, Dzung T. Phan, Jayant R. Kalagnanam, Marten van Dijk

    Abstract: This paper has some inconsistent results, i.e., we made some failed claims because we did some mistakes for using the test criterion for a series. Precisely, our claims on the convergence rate of $\mathcal{O}(1/t)$ of SGD presented in Theorem 1, Corollary 1, Theorem 2 and Corollary 2 are wrongly derived because they are based on Lemma 5. In Lemma 5, we do not correctly use the test criterion for a… ▽ More

    Submitted 27 February, 2019; v1 submitted 22 January, 2019; originally announced January 2019.

    Comments: This paper has inconsistent results, i.e., we made some failed claims because we did some mistakes for using the test criterion for a series

  17. arXiv:1810.04100  [pdf, other

    math.OC cs.LG

    Characterization of Convex Objective Functions and Optimal Expected Convergence Rates for SGD

    Authors: Marten van Dijk, Lam M. Nguyen, Phuong Ha Nguyen, Dzung T. Phan

    Abstract: We study Stochastic Gradient Descent (SGD) with diminishing step sizes for convex objective functions. We introduce a definitional framework and theory that defines and characterizes a core property, called curvature, of convex objective functions. In terms of curvature we can derive a new inequality that can be used to compute an optimal sequence of diminishing step sizes by solving a differentia… ▽ More

    Submitted 13 May, 2019; v1 submitted 9 October, 2018; originally announced October 2018.

    Journal ref: Proceedings of the 36th International Conference on Machine Learning, PMLR 97, 2019

  18. arXiv:1801.06159  [pdf, other

    stat.ML cs.LG math.OC

    When Does Stochastic Gradient Algorithm Work Well?

    Authors: Lam M. Nguyen, Nam H. Nguyen, Dzung T. Phan, Jayant R. Kalagnanam, Katya Scheinberg

    Abstract: In this paper, we consider a general stochastic optimization problem which is often at the core of supervised learning, such as deep learning and linear classification. We consider a standard stochastic gradient descent (SGD) method with a fixed, large step size and propose a novel assumption on the objective function, under which this method has the improved convergence rates (to a neighborhood o… ▽ More

    Submitted 25 December, 2018; v1 submitted 18 January, 2018; originally announced January 2018.

  19. arXiv:1404.4132  [pdf, ps, other

    math.NA math.OC

    Projection Algorithms for Non-Convex Minimization with Application to Sparse Principal Component Analysis

    Authors: William W. Hager, Dzung T. Phan, Jia-Jie Zhu

    Abstract: We consider concave minimization problems over non-convex sets.Optimization problems with this structure arise in sparse principal component analysis. We analyze both a gradient projection algorithm and an approximate Newton algorithm where the Hessian approximation is a multiple of the identity. Convergence results are established. In numerical experiments arising in sparse principal component an… ▽ More

    Submitted 6 April, 2019; v1 submitted 15 April, 2014; originally announced April 2014.