Skip to main content

Showing 1–8 of 8 results for author: Tran-The, H

Searching in archive stat. Search in all archives.
.
  1. arXiv:2306.06844  [pdf, other

    stat.ML cs.LG

    Provably Efficient Bayesian Optimization with Unknown Gaussian Process Hyperparameter Estimation

    Authors: Huong Ha, Vu Nguyen, Hung Tran-The, Hongyu Zhang, Xiuzhen Zhang, Anton van den Hengel

    Abstract: Gaussian process (GP) based Bayesian optimization (BO) is a powerful method for optimizing black-box functions efficiently. The practical performance and theoretical guarantees of this approach depend on having the correct GP hyperparameter values, which are usually unknown in advance and need to be estimated from the observed data. However, in practice, these estimations could be incorrect due to… ▽ More

    Submitted 6 June, 2024; v1 submitted 11 June, 2023; originally announced June 2023.

    Comments: 25 pages, 5 figures

  2. arXiv:2301.00437  [pdf, other

    cs.LG stat.ML

    Neural Collapse in Deep Linear Networks: From Balanced to Imbalanced Data

    Authors: Hien Dang, Tho Tran, Stanley Osher, Hung Tran-The, Nhat Ho, Tan Nguyen

    Abstract: Modern deep neural networks have achieved impressive performance on tasks from image classification to natural language processing. Surprisingly, these complex systems with massive amounts of parameters exhibit the same structural properties in their last-layer features and classifiers across canonical datasets when training until convergence. In particular, it has been observed that the last-laye… ▽ More

    Submitted 18 June, 2023; v1 submitted 1 January, 2023; originally announced January 2023.

    Comments: 75 pages, 20 figures, 4 tables. Hien Dang and Tho Tran contributed equally to this work

  3. arXiv:2107.11533  [pdf, other

    stat.ML cs.LG

    Combining Online Learning and Offline Learning for Contextual Bandits with Deficient Support

    Authors: Hung Tran-The, Sunil Gupta, Thanh Nguyen-Tang, Santu Rana, Svetha Venkatesh

    Abstract: We address policy learning with logged data in contextual bandits. Current offline-policy learning algorithms are mostly based on inverse propensity score (IPS) weighting requiring the logging policy to have \emph{full support} i.e. a non-zero probability for any context/action of the evaluation policy. However, many real-world systems do not guarantee such logging policies, especially when the ac… ▽ More

    Submitted 24 July, 2021; originally announced July 2021.

  4. arXiv:2105.04332  [pdf, other

    cs.LG stat.ML

    Bayesian Optimistic Optimisation with Exponentially Decaying Regret

    Authors: Hung Tran-The, Sunil Gupta, Santu Rana, Svetha Venkatesh

    Abstract: Bayesian optimisation (BO) is a well-known efficient algorithm for finding the global optimum of expensive, black-box functions. The current practical BO algorithms have regret bounds ranging from $\mathcal{O}(\frac{logN}{\sqrt{N}})$ to $\mathcal O(e^{-\sqrt{N}})$, where $N$ is the number of evaluations. This paper explores the possibility of improving the regret bound in the noiseless setting by… ▽ More

    Submitted 10 May, 2021; originally announced May 2021.

    Comments: To appear at ICML 2021 (21 pages)

  5. arXiv:2103.06671  [pdf, ps, other

    stat.ML cs.LG

    Sample Complexity of Offline Reinforcement Learning with Deep ReLU Networks

    Authors: Thanh Nguyen-Tang, Sunil Gupta, Hung Tran-The, Svetha Venkatesh

    Abstract: Offline reinforcement learning (RL) leverages previously collected data for policy optimization without any further active exploration. Despite the recent interest in this problem, its theoretical results in neural network function approximation settings remain elusive. In this paper, we study the statistical theory of offline RL with deep ReLU network function approximation. In particular, we est… ▽ More

    Submitted 13 December, 2022; v1 submitted 11 March, 2021; originally announced March 2021.

    Comments: https://openreview.net/forum?id=LdEm0umNcv

    Journal ref: Transactions on Machine Learning Research, 2022

  6. arXiv:2009.02539  [pdf, other

    stat.ML cs.IT cs.LG

    Sub-linear Regret Bounds for Bayesian Optimisation in Unknown Search Spaces

    Authors: Hung Tran-The, Sunil Gupta, Santu Rana, Huong Ha, Svetha Venkatesh

    Abstract: Bayesian optimisation is a popular method for efficient optimisation of expensive black-box functions. Traditionally, BO assumes that the search space is known. However, in many problems, this assumption does not hold. To this end, we propose a novel BO algorithm which expands (and shifts) the search space over iterations based on controlling the expansion rate thought a hyperharmonic series. Furt… ▽ More

    Submitted 1 November, 2020; v1 submitted 5 September, 2020; originally announced September 2020.

    Comments: 34th Conference on Neural Information Processing Systems (NeurIPS 2020)

  7. Trading Convergence Rate with Computational Budget in High Dimensional Bayesian Optimization

    Authors: Hung Tran-The, Sunil Gupta, Santu Rana, Svetha Venkatesh

    Abstract: Scaling Bayesian optimisation (BO) to high-dimensional search spaces is a active and open research problems particularly when no assumptions are made on function structure. The main reason is that at each iteration, BO requires to find global maximisation of acquisition function, which itself is a non-convex optimization problem in the original search space. With growing dimensions, the computatio… ▽ More

    Submitted 28 August, 2020; v1 submitted 26 November, 2019; originally announced November 2019.

    Comments: Our accepted paper (with Supplementary Material) at AAAI 2020

  8. arXiv:1910.13092  [pdf, other

    stat.ML cs.LG math.OC

    Bayesian Optimization with Unknown Search Space

    Authors: Huong Ha, Santu Rana, Sunil Gupta, Thanh Nguyen, Hung Tran-The, Svetha Venkatesh

    Abstract: Applying Bayesian optimization in problems wherein the search space is unknown is challenging. To address this problem, we propose a systematic volume expansion strategy for the Bayesian optimization. We devise a strategy to guarantee that in iterative expansions of the search space, our method can find a point whose function value within epsilon of the objective function maximum. Without the need… ▽ More

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

    Journal ref: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada