Showing 1–8 of 8 results for author: Pham, N H

Searching in archive cs.

  1. arXiv:2301.10439  [pdf, other]

    cs.CL cs.LG

    ViDeBERTa: A powerful pre-trained language model for Vietnamese

    Authors: Cong Dao Tran, Nhut Huy Pham, Anh Nguyen, Truong Son Hy, Tu Vu

    Abstract: This paper presents ViDeBERTa, a new pre-trained monolingual language model for Vietnamese, with three versions - ViDeBERTa_xsmall, ViDeBERTa_base, and ViDeBERTa_large, which are pre-trained on a large-scale corpus of high-quality and diverse Vietnamese texts using DeBERTa architecture. Although many successful pre-trained language models based on Transformer have been widely proposed for the Engl…

    Submitted 10 February, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

  2. arXiv:2202.03558  [pdf, other]

    cs.LG cs.AI

    Attacking c-MARL More Effectively: A Data Driven Approach

    Authors: Nhan H. Pham, Lam M. Nguyen, Jie Chen, Hoang Thanh Lam, Subhro Das, Tsui-Wei Weng

    Abstract: In recent years, a proliferation of methods were developed for cooperative multi-agent reinforcement learning (c-MARL). However, the robustness of c-MARL agents against adversarial attacks has been rarely explored. In this paper, we propose to evaluate the robustness of c-MARL agents via a model-based approach, named c-MBA. Our proposed formulation can craft much stronger adversarial state perturb…

    Submitted 10 September, 2023; v1 submitted 7 February, 2022; originally announced February 2022.

  3. arXiv:2109.08860   

    cs.GT

    Groups Influence with Minimum Cost in Social Networks

    Authors: Phuong N. H. Pham, Canh V. Pham, Hieu V. Duong, Thanh T. Nguyen, My T. Thai

    Abstract: This paper studies the Group Influence with Minimum cost problem, which aims to find a seed set with the smallest cost that can influence all target groups, where each user is associated with a cost and a group is influenced if the total score of the influenced users belonging to the group is at least a certain threshold. As the group-influence function is neither submodular nor supermodular, theoretical bounds…

    Submitted 14 December, 2022; v1 submitted 18 September, 2021; originally announced September 2021.

    Comments: The paper contains some errors

  4. arXiv:2103.03452  [pdf, other]

    stat.ML cs.DC cs.LG

    FedDR -- Randomized Douglas-Rachford Splitting Algorithms for Nonconvex Federated Composite Optimization

    Authors: Quoc Tran-Dinh, Nhan H. Pham, Dzung T. Phan, Lam M. Nguyen

    Abstract: We develop two new algorithms, called FedDR and asyncFedDR, for solving a fundamental nonconvex composite optimization problem in federated learning. Our algorithms rely on a novel combination of a nonconvex Douglas-Rachford splitting method, randomized block-coordinate strategies, and asynchronous implementation. They can also handle convex regularizers. Unlike recent methods in the literat…

    Submitted 28 October, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

    Comments: 39 pages and 12 figures

    Report number: UNC-STOR-June 2021

    Journal ref: NeurIPS 2021

  5. arXiv:2003.10973  [pdf, ps, other]

    math.OC cs.LG

    Finite-Time Analysis of Stochastic Gradient Descent under Markov Randomness

    Authors: Thinh T. Doan, Lam M. Nguyen, Nhan H. Pham, Justin Romberg

    Abstract: Motivated by broad applications in reinforcement learning and machine learning, this paper considers the popular stochastic gradient descent (SGD) when the gradients of the underlying objective function are sampled from Markov processes. This Markov sampling leads to the gradient samples being biased and not independent. The existing results for the convergence of SGD under Markov randomness are o…

    Submitted 1 April, 2020; v1 submitted 24 March, 2020; originally announced March 2020.

  6. arXiv:2003.00430  [pdf, other]

    cs.LG math.OC

    A Hybrid Stochastic Policy Gradient Algorithm for Reinforcement Learning

    Authors: Nhan H. Pham, Lam M. Nguyen, Dzung T. Phan, Phuong Ha Nguyen, Marten van Dijk, Quoc Tran-Dinh

    Abstract: We propose a novel hybrid stochastic policy gradient estimator by combining an unbiased policy gradient estimator, the REINFORCE estimator, with another biased one, an adapted SARAH estimator for policy optimization. The hybrid policy gradient estimator is shown to be biased, but has a variance-reduced property. Using this estimator, we develop a new Proximal Hybrid Stochastic Policy Gradient Algori…

    Submitted 21 September, 2020; v1 submitted 1 March, 2020; originally announced March 2020.

    Comments: Accepted for publication at the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS 2020)

    Journal ref: Proceedings of the International Conference on Artificial Intelligence and Statistics, PMLR 108:374-385, 2020

  7. arXiv:1907.03793  [pdf, other]

    math.OC cs.LG stat.ML

    A Hybrid Stochastic Optimization Framework for Stochastic Composite Nonconvex Optimization

    Authors: Quoc Tran-Dinh, Nhan H. Pham, Dzung T. Phan, Lam M. Nguyen

    Abstract: We introduce a new approach to develop stochastic optimization algorithms for a class of stochastic composite and possibly nonconvex optimization problems. The main idea is to combine two stochastic estimators to create a new hybrid one. We first introduce our hybrid estimator and then investigate its fundamental properties to form a foundational theory for algorithmic development. Next, we apply…

    Submitted 2 May, 2020; v1 submitted 8 July, 2019; originally announced July 2019.

    Comments: 49 pages, 2 tables, 9 figures

    Report number: UNC-STOR-2019.07.V1-03

  8. arXiv:1902.05679  [pdf, other]

    math.OC cs.LG stat.ML

    ProxSARAH: An Efficient Algorithmic Framework for Stochastic Composite Nonconvex Optimization

    Authors: Nhan H. Pham, Lam M. Nguyen, Dzung T. Phan, Quoc Tran-Dinh

    Abstract: We propose a new stochastic first-order algorithmic framework to solve stochastic composite nonconvex optimization problems that covers both finite-sum and expectation settings. Our algorithms rely on the SARAH estimator introduced in (Nguyen et al., 2017) and consist of two steps: a proximal gradient and an averaging step, making them different from existing nonconvex proximal-type algorithms. The…

    Submitted 28 March, 2019; v1 submitted 14 February, 2019; originally announced February 2019.

    Comments: 45 pages, 8 figures, and 2 tables

    Report number: STOR-UNC-Feb14.2019