Skip to main content

Showing 1–50 of 86 results for author: Lam, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14071  [pdf, other

    stat.ML cs.LG

    Bayesian Bandit Algorithms with Approximate Inference in Stochastic Linear Bandits

    Authors: Ziyi Huang, Henry Lam, Haofeng Zhang

    Abstract: Bayesian bandit algorithms with approximate Bayesian inference have been widely used in real-world applications. Nevertheless, their theoretical justification is less investigated in the literature, especially for contextual bandit problems. To fill this gap, we propose a general theoretical framework to analyze stochastic linear bandits in the presence of approximate inference and conduct regret… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2406.09837  [pdf, other

    cs.LG

    TabularFM: An Open Framework For Tabular Foundational Models

    Authors: Quan M. Tran, Suong N. Hoang, Lam M. Nguyen, Dzung Phan, Hoang Thanh Lam

    Abstract: Foundational models (FMs), pretrained on extensive datasets using self-supervised techniques, are capable of learning generalized patterns from large amounts of data. This reduces the need for extensive labeled datasets for each new task, saving both time and resources by leveraging the broad knowledge base established during pretraining. Most research on FMs has primarily focused on unstructured… ▽ More

    Submitted 17 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

  3. arXiv:2406.05893  [pdf, other

    cs.LG

    Event prediction and causality inference despite incomplete information

    Authors: Harrison Lam, Yuanjie Chen, Noboru Kanazawa, Mohammad Chowdhury, Anna Battista, Stephan Waldert

    Abstract: We explored the challenge of predicting and explaining the occurrence of events within sequences of data points. Our focus was particularly on scenarios in which unknown triggers causing the occurrence of events may consist of non-consecutive, masked, noisy data points. This scenario is akin to an agent tasked with learning to predict and explain the occurrence of events without understanding the… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 16 pages, 8 figures, 1 table

  4. arXiv:2406.02245  [pdf, other

    cs.CL cs.IR cs.LG

    Description Boosting for Zero-Shot Entity and Relation Classification

    Authors: Gabriele Picco, Leopold Fuchs, Marcos Martínez Galindo, Alberto Purpura, Vanessa López, Hoang Thanh Lam

    Abstract: Zero-shot entity and relation classification models leverage available external information of unseen classes -- e.g., textual descriptions -- to annotate input text data. Thanks to the minimum data requirement, Zero-Shot Learning (ZSL) methods have high value in practice, especially in applications where labeled data is scarce. Even though recent research in ZSL has demonstrated significant resul… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  5. arXiv:2406.00284  [pdf, other

    cs.CL

    A Closer Look at Logical Reasoning with LLMs: The Choice of Tool Matters

    Authors: Long Hei Matthew Lam, Ehsan Shareghi

    Abstract: Logical reasoning serves as a cornerstone for human cognition. Recently, the emergence of Large Language Models (LLMs) has demonstrated promising progress in solving logical reasoning tasks effectively. To improve this capability, recent studies have delved into integrating LLMs with various symbolic solvers using diverse techniques and methodologies. While some combinations excel on specific data… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: Code and data are publicly available at: https://github.com/Mattylam/Logic_Symbolic_Solvers_Experiment

  6. arXiv:2405.14953  [pdf, other

    cs.LG cs.AI stat.ML

    Mallows-DPO: Fine-Tune Your LLM with Preference Dispersions

    Authors: Haoxian Chen, Hanyang Zhao, Henry Lam, David Yao, Wenpin Tang

    Abstract: Direct Preference Optimization (DPO) has recently emerged as a popular approach to improve reinforcement learning with human feedback (RLHF), leading to better techniques to fine-tune large language models (LLM). A weakness of DPO, however, lies in its lack of capability to characterize the diversity of human preferences. Inspired by Mallows' theory of preference ranking, we develop in this paper… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  7. arXiv:2405.14741  [pdf, other

    math.OC cs.LG stat.ML

    Bagging Improves Generalization Exponentially

    Authors: Huajie Qian, Donghao Ying, Henry Lam, Wotao Yin

    Abstract: Bagging is a popular ensemble technique to improve the accuracy of machine learning models. It hinges on the well-established rationale that, by repeatedly retraining on resampled data, the aggregated model exhibits lower variance and hence higher stability, especially for discontinuous base learners. In this paper, we provide a new perspective on bagging: By suitably aggregating the base learners… ▽ More

    Submitted 29 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: Correct author list typo

  8. arXiv:2403.11807  [pdf, other

    cs.AI cs.CL

    How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments

    Authors: Jen-tse Huang, Eric John Li, Man Ho Lam, Tian Liang, Wenxuan Wang, Youliang Yuan, Wenxiang Jiao, Xing Wang, Zhaopeng Tu, Michael R. Lyu

    Abstract: Decision-making, a complicated task requiring various types of abilities, presents an excellent framework for assessing Large Language Models (LLMs). Our research investigates LLMs' decision-making capabilities through the lens of a well-established field, Game Theory. We focus specifically on games that support the participation of more than two agents simultaneously. Subsequently, we introduce o… ▽ More

    Submitted 25 April, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: 16 pages of main text. 11 pages of appendices. 15 figures, 9 tables. Updated scoring scheme

  9. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  10. arXiv:2402.16184  [pdf, other

    cs.LG

    Deep Neural Network Initialization with Sparsity Inducing Activations

    Authors: Ilan Price, Nicholas Daultry Ball, Samuel C. H. Lam, Adam C. Jones, Jared Tanner

    Abstract: Inducing and leveraging sparse activations during training and inference is a promising avenue for improving the computational efficiency of deep networks, which is increasingly important as network sizes continue to grow and their application becomes more widespread. Here we use the large width Gaussian process limit to analyze the behaviour, at random initialization, of nonlinear activations tha… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: Published in the International Conference on Learning Representations (ICLR) 2024

  11. arXiv:2401.08819  [pdf, other

    cs.LG cs.AI

    Learning from Sparse Offline Datasets via Conservative Density Estimation

    Authors: Zhepeng Cen, Zuxin Liu, Zitong Wang, Yihang Yao, Henry Lam, Ding Zhao

    Abstract: Offline reinforcement learning (RL) offers a promising direction for learning policies from pre-collected datasets without requiring further interactions with the environment. However, existing methods struggle to handle out-of-distribution (OOD) extrapolation errors, especially in sparse reward or scarce data settings. In this paper, we propose a novel training algorithm called Conservative Densi… ▽ More

    Submitted 11 March, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: ICLR 2024

  12. arXiv:2401.02789  [pdf

    q-bio.GN cs.CL

    Large Language Models in Plant Biology

    Authors: Hilbert Yuen In Lam, Xing Er Ong, Marek Mutwil

    Abstract: Large Language Models (LLMs), such as ChatGPT, have taken the world by storm and have passed certain forms of the Turing test. However, LLMs are not limited to human language and analyze sequential data, such as DNA, protein, and gene expression. The resulting foundation models can be repurposed to identify the complex patterns within the data, resulting in powerful, multi-purpose prediction tools… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

  13. arXiv:2310.11065  [pdf, other

    stat.ML cs.LG

    Resampling Stochastic Gradient Descent Cheaply for Efficient Uncertainty Quantification

    Authors: Henry Lam, Zitong Wang

    Abstract: Stochastic gradient descent (SGD) or stochastic approximation has been widely used in model training and stochastic optimization. While there is a huge literature on analyzing its convergence, inference on the obtained solutions from SGD has only been recently studied, yet is important due to the growing need for uncertainty quantification. We investigate two computationally cheap resampling-based… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  14. arXiv:2310.09766  [pdf, other

    stat.ML cs.LG

    Pseudo-Bayesian Optimization

    Authors: Haoxian Chen, Henry Lam

    Abstract: Bayesian Optimization is a popular approach for optimizing expensive black-box functions. Its key idea is to use a surrogate model to approximate the objective and, importantly, quantify the associated uncertainty that allows a sequential search of query points that balance exploitation-exploration. Gaussian process (GP) has been a primary candidate for the surrogate model, thanks to its Bayesian-… ▽ More

    Submitted 20 June, 2024; v1 submitted 15 October, 2023; originally announced October 2023.

  15. arXiv:2310.02932  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Assessing Large Language Models on Climate Information

    Authors: Jannis Bulian, Mike S. Schäfer, Afra Amini, Heidi Lam, Massimiliano Ciaramita, Ben Gaiarin, Michelle Chen Hübscher, Christian Buck, Niels G. Mede, Markus Leippold, Nadine Strauß

    Abstract: As Large Language Models (LLMs) rise in popularity, it is necessary to assess their capability in critically relevant domains. We present a comprehensive evaluation framework, grounded in science communication research, to assess LLM responses to questions about climate change. Our framework emphasizes both presentational and epistemological adequacy, offering a fine-grained analysis of LLM genera… ▽ More

    Submitted 28 May, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning (ICML), 2024

  16. arXiv:2310.01386  [pdf, other

    cs.CL

    Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench

    Authors: Jen-tse Huang, Wenxuan Wang, Eric John Li, Man Ho Lam, Shujie Ren, Youliang Yuan, Wenxiang Jiao, Zhaopeng Tu, Michael R. Lyu

    Abstract: Large Language Models (LLMs) have recently showcased their remarkable capacities, not only in natural language processing tasks but also across diverse domains such as clinical medicine, legal consultation, and education. LLMs become more than mere applications, evolving into assistants capable of addressing diverse user requests. This narrows the distinction between human beings and artificial in… ▽ More

    Submitted 22 January, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted for ICLR 2024 Oral Presentation. 15 pages (main text) and 5 pages (appendix)

  17. arXiv:2308.16122  [pdf, other

    cs.LG

    Spatial Graph Coarsening: Weather and Weekday Prediction with London's Bike-Sharing Service using GNN

    Authors: Yuta Sato, Pak Hei Lam, Shruti Gupta, Fareesah Hussain

    Abstract: This study introduced the use of Graph Neural Network (GNN) for predicting the weather and weekday of a day in London, from the dataset of Santander Cycles bike-sharing system as a graph classification task. The proposed GNN models newly introduced (i) a concatenation operator of graph features with trained node embeddings and (ii) a graph coarsening operator based on geographical contiguity, name… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

  18. arXiv:2308.03656  [pdf, other

    cs.CL

    Emotionally Numb or Empathetic? Evaluating How LLMs Feel Using EmotionBench

    Authors: Jen-tse Huang, Man Ho Lam, Eric John Li, Shujie Ren, Wenxuan Wang, Wenxiang Jiao, Zhaopeng Tu, Michael R. Lyu

    Abstract: Evaluating Large Language Models' (LLMs) anthropomorphic capabilities has become increasingly important in contemporary discourse. Utilizing the emotion appraisal theory from psychology, we propose to evaluate the empathy ability of LLMs, i.e., how their feelings change when presented with specific situations. After a careful and comprehensive survey, we collect a dataset containing over 400 situa… ▽ More

    Submitted 24 April, 2024; v1 submitted 7 August, 2023; originally announced August 2023.

    Comments: 12 pages of main text; 9 pages of appendices

  19. arXiv:2307.13497  [pdf, other

    cs.CL cs.AI cs.LG

    Zshot: An Open-source Framework for Zero-Shot Named Entity Recognition and Relation Extraction

    Authors: Gabriele Picco, Marcos Martínez Galindo, Alberto Purpura, Leopold Fuchs, Vanessa López, Hoang Thanh Lam

    Abstract: The Zero-Shot Learning (ZSL) task pertains to the identification of entities or relations in texts that were not seen during training. ZSL has emerged as a critical research area due to the scarcity of labeled data in specific domains, and its applications have grown significantly in recent years. With the advent of large pretrained language models, several novel methods have been proposed, result… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: Accepted at ACL 2023

    Journal ref: Association for Computational Linguistics. 3 (2023) 357-368

  20. arXiv:2306.14041  [pdf, other

    math.OC cs.LG stat.ML

    Smoothed $f$-Divergence Distributionally Robust Optimization

    Authors: Zhenyuan Liu, Bart P. G. Van Parys, Henry Lam

    Abstract: In data-driven optimization, sample average approximation (SAA) is known to suffer from the so-called optimizer's curse that causes an over-optimistic evaluation of the solution performance. We argue that a special type of distributionallly robust optimization (DRO) formulation offers theoretical advantages in correcting for this optimizer's curse compared to simple ``margin'' adjustments to SAA a… ▽ More

    Submitted 12 October, 2023; v1 submitted 24 June, 2023; originally announced June 2023.

    MSC Class: 90C15; 90C17; 90C25

  21. arXiv:2306.12802  [pdf, other

    cs.LG cs.AI q-bio.BM

    Otter-Knowledge: benchmarks of multimodal knowledge graph representation learning from different sources for drug discovery

    Authors: Hoang Thanh Lam, Marco Luca Sbodio, Marcos Martínez Galindo, Mykhaylo Zayats, Raúl Fernández-Díaz, Víctor Valls, Gabriele Picco, Cesar Berrospi Ramis, Vanessa López

    Abstract: Recent research on predicting the binding affinity between drug molecules and proteins use representations learned, through unsupervised learning techniques, from large databases of molecule SMILES and protein sequences. While these representations have significantly enhanced the predictions, they are usually based on a limited set of modalities, and they do not exploit available knowledge about e… ▽ More

    Submitted 19 October, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

  22. arXiv:2306.10081  [pdf, other

    cs.LG math.OC

    Optimizer's Information Criterion: Dissecting and Correcting Bias in Data-Driven Optimization

    Authors: Garud Iyengar, Henry Lam, Tianyu Wang

    Abstract: In data-driven optimization, the sample performance of the obtained decision typically incurs an optimistic bias against the true performance, a phenomenon commonly known as the Optimizer's Curse and intimately related to overfitting in machine learning. Common techniques to correct this bias, such as cross-validation, require repeatedly solving additional optimization problems and are therefore c… ▽ More

    Submitted 16 October, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

  23. arXiv:2306.05674  [pdf, other

    stat.ML cs.LG

    Efficient Uncertainty Quantification and Reduction for Over-Parameterized Neural Networks

    Authors: Ziyi Huang, Henry Lam, Haofeng Zhang

    Abstract: Uncertainty quantification (UQ) is important for reliability assessment and enhancement of machine learning models. In deep learning, uncertainties arise not only from data, but also from the training procedure that often injects substantial noises and biases. These hinder the attainment of statistical guarantees and, moreover, impose computational challenges on UQ due to the need for repeated net… ▽ More

    Submitted 9 November, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

  24. arXiv:2305.19926  [pdf, other

    cs.CL

    Revisiting the Reliability of Psychological Scales on Large Language Models

    Authors: Jen-tse Huang, Wenxuan Wang, Man Ho Lam, Eric John Li, Wenxiang Jiao, Michael R. Lyu

    Abstract: Recent research has extended beyond assessing the performance of Large Language Models (LLMs) to examining their characteristics from a psychological standpoint, acknowledging the necessity of understanding their behavioral characteristics. The administration of personality tests to LLMs has emerged as a noteworthy area in this context. However, the suitability of employing psychological scales, i… ▽ More

    Submitted 28 December, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: 10 pages. Added more comprehensive experiments and analysis

  25. arXiv:2305.18412  [pdf, other

    stat.AP cs.LG

    Short-term Temporal Dependency Detection under Heterogeneous Event Dynamic with Hawkes Processes

    Authors: Yu Chen, Fengpei Li, Anderson Schneider, Yuriy Nevmyvaka, Asohan Amarasingham, Henry Lam

    Abstract: Many event sequence data exhibit mutually exciting or inhibiting patterns. Reliable detection of such temporal dependency is crucial for scientific investigation. The de facto model is the Multivariate Hawkes Process (MHP), whose impact function naturally encodes a causal structure in Granger causality. However, the vast majority of existing methods use direct or nonlinear transform of standard MH… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: Conference on Uncertainty in Artificial Intelligence 2023

  26. arXiv:2305.14216  [pdf, other

    cs.LG

    Constrained Proximal Policy Optimization

    Authors: Chengbin Xuan, Feng Zhang, Faliang Yin, Hak-Keung Lam

    Abstract: The problem of constrained reinforcement learning (CRL) holds significant importance as it provides a framework for addressing critical safety satisfaction concerns in the field of reinforcement learning (RL). However, with the introduction of constraint satisfaction, the current CRL methods necessitate the utilization of second-order optimization or primal-dual frameworks with additional Lagrangi… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  27. arXiv:2305.11948  [pdf, other

    cs.CL

    Eye-SpatialNet: Spatial Information Extraction from Ophthalmology Notes

    Authors: Surabhi Datta, Tasneem Kaochar, Hio Cheng Lam, Nelly Nwosu, Luca Giancardo, Alice Z. Chuang, Robert M. Feldman, Kirk Roberts

    Abstract: We introduce an annotated corpus of 600 ophthalmology notes labeled with detailed spatial and contextual information of ophthalmic entities. We extend our previously proposed frame semantics-based spatial representation schema, Rad-SpatialNet, to represent spatial language in ophthalmology text, resulting in the Eye-SpatialNet schema. The spatially-grounded entities are findings, procedures, and d… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

  28. arXiv:2304.06833  [pdf, other

    stat.ML cs.LG stat.ME

    Estimate-Then-Optimize versus Integrated-Estimation-Optimization versus Sample Average Approximation: A Stochastic Dominance Perspective

    Authors: Adam N. Elmachtoub, Henry Lam, Haofeng Zhang, Yunfan Zhao

    Abstract: In data-driven stochastic optimization, model parameters of the underlying distribution need to be estimated from data in addition to the optimization task. Recent literature considers integrating the estimation and optimization processes by selecting model parameters that lead to the best empirical objective performance. This integrated approach, which we call integrated-estimation-optimization (… ▽ More

    Submitted 6 August, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

  29. arXiv:2302.12458  [pdf, other

    cs.RO eess.SY

    Design and Mechanics of Cable-Driven Rolling Diaphragm Transmission for High-Transparency Robotic Motion

    Authors: Hoi Man Lam, W. Jared Walker, Lucas Jonasch, Dimitri Schreiber, Michael C. Yip

    Abstract: Applications of rolling diaphragm transmissions for medical and teleoperated robotics are of great interest, due to the low friction of rolling diaphragms combined with the power density and stiffness of hydraulic transmissions. However, the stiffness-enabling pressure preloads can form a tradeoff against bearing loading in some rolling diaphragm layouts, and transmission setup can be difficult. U… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: 7 pages, 13 figures

  30. arXiv:2302.10757  [pdf, other

    cs.LG cs.DC

    Distributed Learning in Heterogeneous Environment: federated learning with adaptive aggregation and computation reduction

    Authors: **gxin Li, Toktam Mahmoodi, Hak-Keung Lam

    Abstract: Although federated learning has achieved many breakthroughs recently, the heterogeneous nature of the learning environment greatly limits its performance and hinders its real-world applications. The heterogeneous data, time-varying wireless conditions and computing-limited devices are three main challenges, which often result in an unstable training process and degraded accuracy. Herein, we propos… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

  31. arXiv:2302.03525  [pdf, other

    cs.IR

    Multi-Task Deep Recommender Systems: A Survey

    Authors: Yuhao Wang, Ha Tsz Lam, Yi Wong, Ziru Liu, Xiangyu Zhao, Yichao Wang, Bo Chen, Huifeng Guo, Ruiming Tang

    Abstract: Multi-task learning (MTL) aims at learning related tasks in a unified model to achieve mutual improvement among tasks considering their shared knowledge. It is an important topic in recommendation due to the demand for multi-task prediction considering performance and efficiency. Although MTL has been well studied and developed, there is still a lack of systematic review in the recommendation comm… ▽ More

    Submitted 8 February, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

  32. arXiv:2212.01518  [pdf, other

    math.OC cs.LG

    Hedging Complexity in Generalization via a Parametric Distributionally Robust Optimization Framework

    Authors: Garud Iyengar, Henry Lam, Tianyu Wang

    Abstract: Empirical risk minimization (ERM) and distributionally robust optimization (DRO) are popular approaches for solving stochastic optimization problems that appear in operations management and machine learning. Existing generalization error bounds for these methods depend on either the complexity of the cost function or dimension of the random perturbations. Consequently, the performance of these met… ▽ More

    Submitted 24 September, 2023; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: Preliminary version appeared in AISTATS 2023

  33. arXiv:2210.12334  [pdf, other

    stat.ML cs.LG math.OC

    Adaptive Data Fusion for Multi-task Non-smooth Optimization

    Authors: Henry Lam, Kaizheng Wang, Yuhang Wu, Yichen Zhang

    Abstract: We study the problem of multi-task non-smooth optimization that arises ubiquitously in statistical learning, decision-making and risk management. We develop a data fusion approach that adaptively leverages commonalities among a large number of objectives to improve sample efficiency while tackling their unknown heterogeneities. We provide sharp statistical guarantees for our approach. Numerical ex… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: 25 pages

  34. arXiv:2210.12262  [pdf, other

    cs.LG

    Group Distributionally Robust Reinforcement Learning with Hierarchical Latent Variables

    Authors: Mengdi Xu, Peide Huang, Yaru Niu, Visak Kumar, Jielin Qiu, Chao Fang, Kuan-Hui Lee, Xuewei Qi, Henry Lam, Bo Li, Ding Zhao

    Abstract: One key challenge for multi-task Reinforcement learning (RL) in practice is the absence of task indicators. Robust RL has been applied to deal with task ambiguity, but may result in over-conservative policies. To balance the worst-case (robustness) and average performance, we propose Group Distributionally Robust Markov Decision Process (GDR-MDP), a flexible hierarchical MDP formulation that encod… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: 27 pages, 10 figures

  35. arXiv:2210.04615  [pdf, other

    cs.CV

    EmbryosFormer: Deformable Transformer and Collaborative Encoding-Decoding for Embryos Stage Development Classification

    Authors: Tien-Phat Nguyen, Trong-Thang Pham, Tri Nguyen, Hieu Le, Dung Nguyen, Hau Lam, Phong Nguyen, Jennifer Fowler, Minh-Triet Tran, Ngan Le

    Abstract: The timing of cell divisions in early embryos during the In-Vitro Fertilization (IVF) process is a key predictor of embryo viability. However, observing cell divisions in Time-Lapse Monitoring (TLM) is a time-consuming process and highly depends on experts. In this paper, we propose EmbryosFormer, a computational model to automatically detect and classify cell divisions from original time-lapse im… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: Accepted at WACV 2023

  36. arXiv:2209.08110  [pdf, other

    cs.SI cs.LG nlin.AO physics.soc-ph

    Detecting Political Biases of Named Entities and Hashtags on Twitter

    Authors: Jeffrey Zhu, Yining Wang, Pei Zhou, Wen Hong Lam, Mason A. Porter, Yizhou Sun

    Abstract: Ideological divisions in the United States have become increasingly prominent in daily communication. Accordingly, there has been much research on political polarization, including many recent efforts that take a computational perspective. By detecting political biases in a corpus of text, one can attempt to describe and discern the polarity of that text. Intuitively, the named entities (i.e., the… ▽ More

    Submitted 17 March, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: Submitted to EPJ -- Data Science, under review

    MSC Class: 68T09 (Primary) 68T07 (Secondary)

  37. arXiv:2209.05726  [pdf, other

    eess.SY cs.LG math.DS math.OC

    Data efficient reinforcement learning and adaptive optimal perimeter control of network traffic dynamics

    Authors: C. Chen, Y. P. Huang, W. H. K. Lam, T. L. Pan, S. C. Hsu, A. Sumalee, R. X. Zhong

    Abstract: Existing data-driven and feedback traffic control strategies do not consider the heterogeneity of real-time data measurements. Besides, traditional reinforcement learning (RL) methods for traffic control usually converge slowly for lacking data efficiency. Moreover, conventional optimal perimeter control schemes require exact knowledge of the system dynamics and thus would be fragile to endogenous… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

  38. Improving COVID-19 CT Classification of CNNs by Learning Parameter-Efficient Representation

    Authors: Yujia Xu, Hak-Keung Lam, Guangyu Jia, Jian Jiang, Junkai Liao, Xinqi Bao

    Abstract: COVID-19 pandemic continues to spread rapidly over the world and causes a tremendous crisis in global human health and the economy. Its early detection and diagnosis are crucial for controlling the further spread. Many deep learning-based methods have been proposed to assist clinicians in automatic COVID-19 diagnosis based on computed tomography imaging. However, challenges still remain, including… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.

  39. arXiv:2208.03128  [pdf, other

    eess.SP cs.SD eess.AS

    Time-Frequency Distributions of Heart Sound Signals: A Comparative Study using Convolutional Neural Networks

    Authors: Xinqi Bao, Yujia Xu, Hak-Keung Lam, Mohamed Trabelsi, Ines Chihi, Lilia Sidhom, Ernest N. Kamavuako

    Abstract: Time-Frequency Distributions (TFDs) support the heart sound characterisation and classification in early cardiac screening. However, despite the frequent use of TFDs in signal analysis, no study comprehensively compared their performances on deep learning for automatic diagnosis. Furthermore, the combination of signal processing methods as inputs for Convolutional Neural Networks (CNNs) has been p… ▽ More

    Submitted 5 August, 2022; originally announced August 2022.

  40. arXiv:2206.04287  [pdf, other

    cs.LG stat.ML

    Evaluating Aleatoric Uncertainty via Conditional Generative Models

    Authors: Ziyi Huang, Henry Lam, Haofeng Zhang

    Abstract: Aleatoric uncertainty quantification seeks for distributional knowledge of random responses, which is important for reliability analysis and robustness improvement in machine learning applications. Previous research on aleatoric uncertainty estimation mainly targets closed-formed conditional densities or variances, which requires strong restrictions on the data distribution or dimensionality. To o… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

  41. arXiv:2206.03931  [pdf, other

    cs.CL cs.AI cs.LG

    Learning to Generate Prompts for Dialogue Generation through Reinforcement Learning

    Authors: Hsuan Su, Pohan Chi, Shih-Cheng Huang, Chung Ho Lam, Saurav Sahay, Shang-Tse Chen, Hung-yi Lee

    Abstract: Much literature has shown that prompt-based learning is an efficient method to make use of the large pre-trained language model. Recent works also exhibit the possibility of steering a chatbot's output by plugging in an appropriate prompt. Gradient-based methods are often used to perturb the prompts. However, some language models are not even available to the public. In this work, we first explore… ▽ More

    Submitted 13 October, 2022; v1 submitted 8 June, 2022; originally announced June 2022.

  42. arXiv:2204.02351  [pdf, other

    cs.LG cs.RO stat.ME

    Test Against High-Dimensional Uncertainties: Accelerated Evaluation of Autonomous Vehicles with Deep Importance Sampling

    Authors: Mansur Arief, Zhepeng Cen, Zhenyuan Liu, Zhiyuang Huang, Henry Lam, Bo Li, Ding Zhao

    Abstract: Evaluating the performance of autonomous vehicles (AV) and their complex subsystems to high precision under naturalistic circumstances remains a challenge, especially when failure or dangerous cases are rare. Rarity does not only require an enormous sample size for a naive method to achieve high confidence estimation, but it also causes dangerous underestimation of the true failure rate and it is… ▽ More

    Submitted 5 April, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

  43. Domain Adversarial Spatial-Temporal Network: A Transferable Framework for Short-term Traffic Forecasting across Cities

    Authors: Yihong Tang, Ao Qu, Andy H. F. Chow, William H. K. Lam, S. C. Wong, Wei Ma

    Abstract: Accurate real-time traffic forecast is critical for intelligent transportation systems (ITS) and it serves as the cornerstone of various smart mobility applications. Though this research area is dominated by deep learning, recent studies indicate that the accuracy improvement by develo** new model structures is becoming marginal. Instead, we envision that the improvement can be achieved by trans… ▽ More

    Submitted 19 August, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

  44. arXiv:2202.03558  [pdf, other

    cs.LG cs.AI

    Attacking c-MARL More Effectively: A Data Driven Approach

    Authors: Nhan H. Pham, Lam M. Nguyen, Jie Chen, Hoang Thanh Lam, Subhro Das, Tsui-Wei Weng

    Abstract: In recent years, a proliferation of methods were developed for cooperative multi-agent reinforcement learning (c-MARL). However, the robustness of c-MARL agents against adversarial attacks has been rarely explored. In this paper, we propose to evaluate the robustness of c-MARL agents via a model-based approach, named c-MBA. Our proposed formulation can craft much stronger adversarial state perturb… ▽ More

    Submitted 10 September, 2023; v1 submitted 7 February, 2022; originally announced February 2022.

  45. arXiv:2201.12955  [pdf, other

    cs.LG stat.ML

    Optimal Regret Is Achievable with Bounded Approximate Inference Error: An Enhanced Bayesian Upper Confidence Bound Framework

    Authors: Ziyi Huang, Henry Lam, Amirhossein Meisami, Haofeng Zhang

    Abstract: Bayesian bandit algorithms with approximate Bayesian inference have been widely used in real-world applications. However, there is a large discrepancy between the superior practical performance of these approaches and their theoretical justification. Previous research only indicates a negative theoretical result: Thompson sampling could have a worst-case linear regret $Ω(T)$ with a constant thresh… ▽ More

    Submitted 9 November, 2023; v1 submitted 30 January, 2022; originally announced January 2022.

  46. arXiv:2112.03874  [pdf, other

    q-fin.ST cs.AI cs.CE cs.LG cs.MA stat.ME

    Efficient Calibration of Multi-Agent Simulation Models from Output Series with Bayesian Optimization

    Authors: Yuanlu Bai, Henry Lam, Svitlana Vyetrenko, Tucker Balch

    Abstract: Multi-agent simulation is commonly used across multiple disciplines, specifically in artificial intelligence in recent years, which creates an environment for downstream machine learning or reinforcement learning tasks. In many practical scenarios, however, only the output series that result from the interactions of simulation agents are observable. Therefore, simulators need to be calibrated so t… ▽ More

    Submitted 20 September, 2022; v1 submitted 3 December, 2021; originally announced December 2021.

    Comments: This paper has been accepted and will be published in ICAIF 2022 proceedings

  47. arXiv:2111.00941  [pdf, other

    cs.CV cs.AI

    Turning Traffic Monitoring Cameras into Intelligent Sensors for Traffic Density Estimation

    Authors: Zijian Hu, William H. K. Lam, S. C. Wong, Andy H. F. Chow, Wei Ma

    Abstract: Accurate traffic state information plays a pivotal role in the Intelligent Transportation Systems (ITS), and it is an essential input to various smart mobility applications such as signal coordination and traffic flow prediction. The current practice to obtain the traffic state information is through specialized sensors such as loop detectors and speed cameras. In most metropolitan areas, traffic… ▽ More

    Submitted 29 October, 2021; originally announced November 2021.

  48. arXiv:2110.12122  [pdf, other

    cs.LG stat.ME stat.ML

    Quantifying Epistemic Uncertainty in Deep Learning

    Authors: Ziyi Huang, Henry Lam, Haofeng Zhang

    Abstract: Uncertainty quantification is at the core of the reliability and robustness of machine learning. In this paper, we provide a theoretical framework to dissect the uncertainty, especially the \textit{epistemic} component, in deep learning into \textit{procedural variability} (from the training procedure) and \textit{data variability} (from the training data), which is the first such attempt in the l… ▽ More

    Submitted 18 June, 2023; v1 submitted 22 October, 2021; originally announced October 2021.

  49. arXiv:2110.09131  [pdf, other

    cs.CL cs.AI

    Ensembling Graph Predictions for AMR Parsing

    Authors: Hoang Thanh Lam, Gabriele Picco, Yufang Hou, Young-Suk Lee, Lam M. Nguyen, Dzung T. Phan, Vanessa López, Ramon Fernandez Astudillo

    Abstract: In many machine learning tasks, models are trained to predict structure data such as graphs. For example, in natural language processing, it is very common to parse texts into dependency trees or abstract meaning representation (AMR) graphs. On the other hand, ensemble methods combine predictions from multiple models to create a new one that is more robust and accurate than individual predictions.… ▽ More

    Submitted 24 January, 2022; v1 submitted 18 October, 2021; originally announced October 2021.

    Comments: Published at NeurIPS 2021

  50. arXiv:2109.08460  [pdf, other

    cs.CL

    Neural Unification for Logic Reasoning over Natural Language

    Authors: Gabriele Picco, Hoang Thanh Lam, Marco Luca Sbodio, Vanessa Lopez Garcia

    Abstract: Automated Theorem Proving (ATP) deals with the development of computer programs being able to show that some conjectures (queries) are a logical consequence of a set of axioms (facts and rules). There exists several successful ATPs where conjectures and axioms are formally provided (e.g. formalised as First Order Logic formulas). Recent approaches, such as (Clark et al., 2020), have proposed trans… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: Accepted at EMNLP2021 Findings