Skip to main content

Showing 1–27 of 27 results for author: Hoang, T N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.05349  [pdf, other

    cs.LG cs.AI

    Offline Model-Based Optimization via Policy-Guided Gradient Search

    Authors: Yassine Chemingui, Aryan Deshwal, Trong Nghia Hoang, Janardhan Rao Doppa

    Abstract: Offline optimization is an emerging problem in many experimental engineering domains including protein, drug or aircraft design, where online experimentation to collect evaluation data is too expensive or dangerous. To avoid that, one has to optimize an unknown function given only its offline evaluation at a fixed set of inputs. A naive solution to this problem is to learn a surrogate model of the… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: Published at AAAI Conference on Artificial Intelligence, 2024

  2. arXiv:2404.01676  [pdf, other

    cs.LG

    Incentives in Private Collaborative Machine Learning

    Authors: Rachael Hwee Ling Sim, Yehong Zhang, Trong Nghia Hoang, Xinyi Xu, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Collaborative machine learning involves training models on data from multiple parties but must incentivize their participation. Existing data valuation methods fairly value and reward each party based on shared data or model parameters but neglect the privacy risks involved. To address this, we introduce differential privacy (DP) as an incentive. Each party can select its required DP guarantee and… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted to NeurIPS 2023

  3. Effective Multi-Stage Training Model For Edge Computing Devices In Intrusion Detection

    Authors: Thua Huynh Trong, Thanh Nguyen Hoang

    Abstract: Intrusion detection poses a significant challenge within expansive and persistently interconnected environments. As malicious code continues to advance and sophisticated attack methodologies proliferate, various advanced deep learning-based detection approaches have been proposed. Nevertheless, the complexity and accuracy of intrusion detection models still need further enhancement to render them… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  4. arXiv:2310.07780  [pdf, other

    cs.LG

    Promoting Robustness of Randomized Smoothing: Two Cost-Effective Approaches

    Authors: Linbo Liu, Trong Nghia Hoang, Lam M. Nguyen, Tsui-Wei Weng

    Abstract: Randomized smoothing has recently attracted attentions in the field of adversarial robustness to provide provable robustness guarantees on smoothed neural network classifiers. However, existing works show that vanilla randomized smoothing usually does not provide good robustness performance and often requires (re)training techniques on the base classifier in order to boost the robustness of the re… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  5. arXiv:2308.13722  [pdf, other

    cs.LG

    Time-to-Pattern: Information-Theoretic Unsupervised Learning for Scalable Time Series Summarization

    Authors: Alireza Ghods, Trong Nghia Hoang, Diane Cook

    Abstract: Data summarization is the process of generating interpretable and representative subsets from a dataset. Existing time series summarization approaches often search for recurring subsequences using a set of manually devised similarity functions to summarize the data. However, such approaches are fraught with limitations stemming from an exhaustive search coupled with a heuristic definition of serie… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

  6. arXiv:2306.03191  [pdf, other

    cs.IR cs.LG

    Personalized Federated Domain Adaptation for Item-to-Item Recommendation

    Authors: Ziwei Fan, Hao Ding, Anoop Deoras, Trong Nghia Hoang

    Abstract: Item-to-Item (I2I) recommendation is an important function in most recommendation systems, which generates replacement or complement suggestions for a particular item based on its semantic similarities to other cataloged items. Given that subsets of items in a recommendation system might be co-interacted with by the same set of customers, graph-based models, such as graph neural networks (GNNs), p… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 14 pages, Accepted by UAI 2023

  7. arXiv:2306.01240  [pdf, other

    cs.LG

    Federated Learning of Models Pre-Trained on Different Features with Consensus Graphs

    Authors: Tengfei Ma, Trong Nghia Hoang, Jie Chen

    Abstract: Learning an effective global model on private and decentralized datasets has become an increasingly important challenge of machine learning when applied in practice. Existing distributed learning paradigms, such as Federated Learning, enable this via model aggregation which enforces a strong form of modeling homogeneity and synchronicity across clients. This is however not suitable to many practic… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

  8. arXiv:2207.09572  [pdf, other

    cs.LG cs.CR

    Robust Multivariate Time-Series Forecasting: Adversarial Attacks and Defense Mechanisms

    Authors: Linbo Liu, Youngsuk Park, Trong Nghia Hoang, Hilaf Hasson, Jun Huan

    Abstract: This work studies the threats of adversarial attack on multivariate probabilistic forecasting models and viable defense mechanisms. Our studies discover a new attack pattern that negatively impact the forecasting of a target time series via making strategic, sparse (imperceptible) modifications to the past observations of a small number of other time series. To mitigate the impact of such attack,… ▽ More

    Submitted 14 April, 2023; v1 submitted 19 July, 2022; originally announced July 2022.

  9. arXiv:2203.12598  [pdf, other

    cs.IR cs.LG

    Learning Personalized Item-to-Item Recommendation Metric via Implicit Feedback

    Authors: Trong Nghia Hoang, Anoop Deoras, Tong Zhao, ** Li, George Karypis

    Abstract: This paper studies the item-to-item recommendation problem in recommender systems from a new perspective of metric learning via implicit feedback. We develop and investigate a personalizable deep metric model that captures both the internal contents of items and how they were interacted with by users. There are two key challenges in learning such model. First, there is no explicit similarity annot… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: AISTATS-22

  10. arXiv:2106.00456  [pdf, other

    stat.ME cs.AI cs.CR cs.LG

    Federated Estimation of Causal Effects from Observational Data

    Authors: Thanh Vinh Vo, Trong Nghia Hoang, Young Lee, Tze-Yun Leong

    Abstract: Many modern applications collect data that comes in federated spirit, with data kept locally and undisclosed. Till date, most insight into the causal inference requires data to be stored in a central repository. We present a novel framework for causal inference with federated data sources. We assess and integrate local causal effects from different private data sources without centralizing them. T… ▽ More

    Submitted 31 May, 2021; originally announced June 2021.

    Comments: Preprint

  11. arXiv:2105.14877  [pdf, other

    cs.LG cs.AI stat.ME

    Adaptive Multi-Source Causal Inference

    Authors: Thanh Vinh Vo, Pengfei Wei, Trong Nghia Hoang, Tze-Yun Leong

    Abstract: Data scarcity is a tremendous challenge in causal effect estimation. In this paper, we propose to exploit additional data sources to facilitate estimating causal effects in the target population. Specifically, we leverage additional source datasets which share similar causal mechanisms with the target observations to help infer causal effects of the target population. We propose three levels of kn… ▽ More

    Submitted 31 May, 2021; originally announced May 2021.

    Comments: Preprint

  12. arXiv:2011.08432  [pdf, other

    cs.LG stat.ML

    Revisiting the Sample Complexity of Sparse Spectrum Approximation of Gaussian Processes

    Authors: Quang Minh Hoang, Trong Nghia Hoang, Hai Pham, David P. Woodruff

    Abstract: We introduce a new scalable approximation for Gaussian processes with provable guarantees which hold simultaneously over its entire parameter space. Our approximation is obtained from an improved sample complexity analysis for sparse spectrum Gaussian processes (SSGPs). In particular, our analysis shows that under a certain data disentangling condition, an SSGP's prediction and model evidence (for… ▽ More

    Submitted 17 November, 2020; originally announced November 2020.

  13. arXiv:2005.10918  [pdf, other

    cs.LG stat.ML

    CHEER: Rich Model Helps Poor Model via Knowledge Infusion

    Authors: Cao Xiao, Trong Nghia Hoang, Shenda Hong, Tengfei Ma, Jimeng Sun

    Abstract: There is a growing interest in applying deep learning (DL) to healthcare, driven by the availability of data with multiple feature channels in rich-data environments (e.g., intensive care units). However, in many other practical situations, we can only access data with much fewer feature channels in a poor-data environments (e.g., at home), which often results in predictive models with poor perfor… ▽ More

    Submitted 21 May, 2020; originally announced May 2020.

    Comments: Published in TKDE

  14. arXiv:1911.06446  [pdf, other

    cs.LG q-bio.QM stat.ML

    CASTER: Predicting Drug Interactions with Chemical Substructure Representation

    Authors: Kexin Huang, Cao Xiao, Trong Nghia Hoang, Lucas M. Glass, Jimeng Sun

    Abstract: Adverse drug-drug interactions (DDIs) remain a leading cause of morbidity and mortality. Identifying potential DDIs during the drug design process is critical for patients and society. Although several computational models have been proposed for DDI prediction, there are still limitations: (1) specialized design of drug representation for DDI predictions is lacking; (2) predictions are based on li… ▽ More

    Submitted 19 November, 2019; v1 submitted 14 November, 2019; originally announced November 2019.

    Comments: Accepted by AAAI 2020

  15. arXiv:1911.00218  [pdf, other

    stat.ML cs.LG

    Statistical Model Aggregation via Parameter Matching

    Authors: Mikhail Yurochkin, Mayank Agarwal, Soumya Ghosh, Kristjan Greenewald, Trong Nghia Hoang

    Abstract: We consider the problem of aggregating models learned from sequestered, possibly heterogeneous datasets. Exploiting tools from Bayesian nonparametrics, we develop a general meta-modeling framework that learns shared global latent structures by identifying correspondences among local model parameterizations. Our proposed framework is model-independent and is applicable to a wide range of model type… ▽ More

    Submitted 1 November, 2019; originally announced November 2019.

    Comments: NeurIPS 2019

  16. arXiv:1905.12022  [pdf, other

    stat.ML cs.LG

    Bayesian Nonparametric Federated Learning of Neural Networks

    Authors: Mikhail Yurochkin, Mayank Agarwal, Soumya Ghosh, Kristjan Greenewald, Trong Nghia Hoang, Yasaman Khazaeni

    Abstract: In federated learning problems, data is scattered across different servers and exchanging or pooling it is often impractical or prohibited. We develop a Bayesian nonparametric framework for federated learning with neural networks. Each data server is assumed to provide local neural network weights, which are modeled through our framework. We then develop an inference approach that allows us to syn… ▽ More

    Submitted 28 May, 2019; originally announced May 2019.

    Comments: ICML 2019

  17. arXiv:1809.01921  [pdf, other

    cs.LG stat.ML

    RDPD: Rich Data Helps Poor Data via Imitation

    Authors: Shenda Hong, Cao Xiao, Trong Nghia Hoang, Tengfei Ma, Hongyan Li, Jimeng Sun

    Abstract: In many situations, we need to build and deploy separate models in related environments with different data qualities. For example, an environment with strong observation equipments (e.g., intensive care units) often provides high-quality multi-modal data, which are acquired from multiple sensory devices and have rich-feature representations. On the other hand, an environment with poor observation… ▽ More

    Submitted 24 August, 2019; v1 submitted 6 September, 2018; originally announced September 2018.

    Comments: Published in IJCAI 2019

  18. arXiv:1805.09266  [pdf, ps, other

    cs.LG cs.DC stat.ML

    Collective Online Learning of Gaussian Processes in Massive Multi-Agent Systems

    Authors: Trong Nghia Hoang, Quang Minh Hoang, Kian Hsiang Low, Jonathan How

    Abstract: Distributed machine learning (ML) is a modern computation paradigm that divides its workload into independent tasks that can be simultaneously achieved by multiple machines (i.e., agents) for better scalability. However, a typical distributed system is usually implemented with a central server that collects data statistics from multiple independent machines operating on different subsets of data t… ▽ More

    Submitted 12 November, 2018; v1 submitted 23 May, 2018; originally announced May 2018.

    Comments: Extended version with proofs

  19. arXiv:1711.07033  [pdf, other

    stat.ML cs.DC cs.LG cs.MA

    Decentralized High-Dimensional Bayesian Optimization with Factor Graphs

    Authors: Trong Nghia Hoang, Quang Minh Hoang, Ruofei Ouyang, Kian Hsiang Low

    Abstract: This paper presents a novel decentralized high-dimensional Bayesian optimization (DEC-HBO) algorithm that, in contrast to existing HBO algorithms, can exploit the interdependent effects of various input components on the output of the unknown objective function f for boosting the BO performance and still preserve scalability in the number of input dimensions without requiring prior knowledge or th… ▽ More

    Submitted 24 January, 2018; v1 submitted 19 November, 2017; originally announced November 2017.

    Comments: 32nd AAAI Conference on Artificial Intelligence (AAAI 2018), Extended version with proofs, 13 pages

  20. arXiv:1711.00221  [pdf, other

    cs.LG stat.ML

    Stochastic Variational Inference for Bayesian Sparse Gaussian Process Regression

    Authors: Haibin Yu, Trong Nghia Hoang, Kian Hsiang Low, Patrick Jaillet

    Abstract: This paper presents a novel variational inference framework for deriving a family of Bayesian sparse Gaussian process regression (SGPR) models whose approximations are variationally optimal with respect to the full-rank GPR model enriched with various corresponding correlation structures of the observation noises. Our variational Bayesian SGPR (VBSGPR) models jointly treat both the distributions o… ▽ More

    Submitted 22 March, 2019; v1 submitted 1 November, 2017; originally announced November 2017.

    Comments: To appear in Proceedings of the International Joint Conference on Neural Networks 2019 (IJCNN'19)

  21. arXiv:1710.06525  [pdf, other

    cs.AI cs.MA

    Near-Optimal Adversarial Policy Switching for Decentralized Asynchronous Multi-Agent Systems

    Authors: Trong Nghia Hoang, Yuchen Xiao, Kavinayan Sivakumar, Christopher Amato, Jonathan How

    Abstract: A key challenge in multi-robot and multi-agent systems is generating solutions that are robust to other self-interested or even adversarial parties who actively try to prevent the agents from achieving their goals. The practicality of existing works addressing this challenge is limited to only small-scale synchronous decision-making scenarios or a single agent planning its best response against a… ▽ More

    Submitted 17 October, 2017; originally announced October 2017.

  22. Expert Opinion Extraction from a Biomedical Database

    Authors: Ahmed Samet, Thomas Guyet, Benjamin Negrevergne, Tien-Tuan Dao, Tuan Nha Hoang, Marie-Christine Ho Ba Tho

    Abstract: In this paper, we tackle the problem of extracting frequent opinions from uncertain databases. We introduce the foundation of an opinion mining approach with the definition of pattern and support measure. The support measure is derived from the commitment definition. A new algorithm called OpMiner that extracts the set of frequent opinions modelled as a mass functions is detailed. Finally, we appl… ▽ More

    Submitted 11 September, 2017; originally announced September 2017.

    Journal ref: Conference on Symbolic and Quantitative Approaches to Reasoning with Uncertainty (ECSQARU), Jul 2017, Lugano, Switzerland. Springer, 31 (LNCS 10369), pp.1 - 12, 2017, Proceedings of 14th European Conference on Symbolic and Quantitative Approaches to Reasoning with Uncertainty

  23. arXiv:1611.06080  [pdf, other

    stat.ML cs.LG

    A Generalized Stochastic Variational Bayesian Hyperparameter Learning Framework for Sparse Spectrum Gaussian Process Regression

    Authors: Quang Minh Hoang, Trong Nghia Hoang, Kian Hsiang Low

    Abstract: While much research effort has been dedicated to scaling up sparse Gaussian process (GP) models based on inducing variables for big data, little attention is afforded to the other less explored class of low-rank GP approximations that exploit the sparse spectral representation of a GP kernel. This paper presents such an effort to advance the state of the art of sparse spectrum GP models to achieve… ▽ More

    Submitted 18 November, 2016; originally announced November 2016.

    Comments: 31st AAAI Conference on Artificial Intelligence (AAAI 2017), Extended version with proofs, 11 pages

  24. arXiv:1511.06891  [pdf, other

    stat.ML cs.AI cs.LG

    Near-Optimal Active Learning of Multi-Output Gaussian Processes

    Authors: Yehong Zhang, Trong Nghia Hoang, Kian Hsiang Low, Mohan Kankanhalli

    Abstract: This paper addresses the problem of active learning of a multi-output Gaussian process (MOGP) model representing multiple types of coexisting correlated environmental phenomena. In contrast to existing works, our active learning problem involves selecting not just the most informative sampling locations to be observed but also the types of measurements at each selected location for minimizing the… ▽ More

    Submitted 24 November, 2015; v1 submitted 21 November, 2015; originally announced November 2015.

    Comments: 30th AAAI Conference on Artificial Intelligence (AAAI 2016), Extended version with proofs, 13 pages

  25. arXiv:1304.5159  [pdf, other

    cs.AI cs.MA

    Interactive POMDP Lite: Towards Practical Planning to Predict and Exploit Intentions for Interacting with Self-Interested Agents

    Authors: Trong Nghia Hoang, Kian Hsiang Low

    Abstract: A key challenge in non-cooperative multi-agent systems is that of develo** efficient planning algorithms for intelligent agents to interact and perform effectively among boundedly rational, self-interested agents (e.g., humans). The practicality of existing works addressing this challenge is being undermined due to either the restrictive assumptions of the other agents' behavior, the failure in… ▽ More

    Submitted 18 April, 2013; originally announced April 2013.

    Comments: 23rd International Joint Conference on Artificial Intelligence (IJCAI 2013), Extended version with proofs, 24 pages

  26. arXiv:1304.2024  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    A General Framework for Interacting Bayes-Optimally with Self-Interested Agents using Arbitrary Parametric Model and Model Prior

    Authors: Trong Nghia Hoang, Kian Hsiang Low

    Abstract: Recent advances in Bayesian reinforcement learning (BRL) have shown that Bayes-optimality is theoretically achievable by modeling the environment's latent dynamics using Flat-Dirichlet-Multinomial (FDM) prior. In self-interested multi-agent environments, the transition dynamics are mainly controlled by the other agent's stochastic behavior for which FDM's independence and modeling assumptions do n… ▽ More

    Submitted 16 March, 2014; v1 submitted 7 April, 2013; originally announced April 2013.

    Comments: 23rd International Joint Conference on Artificial Intelligence (IJCAI 2013), Extended version with proofs, 10 pages

  27. arXiv:1209.4275  [pdf, other

    cs.AI cs.MA cs.MM cs.RO

    Decision-Theoretic Coordination and Control for Active Multi-Camera Surveillance in Uncertain, Partially Observable Environments

    Authors: Prabhu Natarajan, Trong Nghia Hoang, Kian Hsiang Low, Mohan Kankanhalli

    Abstract: A central problem of surveillance is to monitor multiple targets moving in a large-scale, obstacle-ridden environment with occlusions. This paper presents a novel principled Partially Observable Markov Decision Process-based approach to coordinating and controlling a network of active cameras for tracking and observing multiple mobile targets at high resolution in such surveillance environments. O… ▽ More

    Submitted 2 October, 2012; v1 submitted 19 September, 2012; originally announced September 2012.

    Comments: 6th ACM/IEEE International Conference on Distributed Smart Cameras (ICDSC 2012), Extended version with proofs, 8 pages