Skip to main content

Showing 1–17 of 17 results for author: Duong, T

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.17500  [pdf, other

    stat.AP cs.CE

    Using iterated local alignment to aggregate GPS trajectories into a traffic flow map

    Authors: Tarn Duong

    Abstract: Desire line maps are widely deployed for traffic flow analysis by virtue of their ease of interpretation and computation. They can be considered to be simplified traffic flow maps, whereas the computational challenges in aggregating small scale traffic flows prevent the wider dissemination of high resolution flow maps. GPS trajectories are a promising data source to solve this challenging problem.… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    MSC Class: 62P30

  2. arXiv:2203.01686  [pdf, other

    stat.CO stat.AP

    Statistical visualisation for tidy and geospatial data in R via kernel smoothing methods in the eks package

    Authors: Tarn Duong

    Abstract: Kernel smoothers are essential tools for data analysis due to their ability to convey complex statistical information with concise graphical visualisations. Their inclusion in the base distribution and in the many user-contributed add-on packages of the R statistical analysis environment caters well to many practitioners. Though there remain some important gaps for specialised data types, most not… ▽ More

    Submitted 24 March, 2023; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: 19 pages, 10 figures

    MSC Class: 62G07; 62G10; 62H12

  3. arXiv:2202.13001  [pdf, other

    cs.LG stat.ML

    Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms

    Authors: MohammadJavad Azizi, Thang Duong, Yasin Abbasi-Yadkori, András György, Claire Vernade, Mohammad Ghavamzadeh

    Abstract: We study a sequential decision problem where the learner faces a sequence of $K$-armed bandit tasks. The task boundaries might be known (the bandit meta-learning setting), or unknown (the non-stationary bandit setting). For a given integer $M\le K$, the learner aims to compete with the best subset of arms of size $M$. We design an algorithm based on a reduction to bandit submodular maximizati… ▽ More

    Submitted 18 October, 2022; v1 submitted 25 February, 2022; originally announced February 2022.

  4. arXiv:2105.12898  [pdf, other

    cs.AI cs.LG stat.ML

    Stochastic Intervention for Causal Effect Estimation

    Authors: Tri Dung Duong, Qian Li, Guandong Xu

    Abstract: Causal inference methods are widely applied in various decision-making domains such as precision medicine, optimal policy and economics. Central to these applications is the treatment effect estimation of intervention strategies. Current estimation methods are mostly restricted to the deterministic treatment, which however, is unable to address the stochastic space treatment policies. Moreover, pr… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

    Comments: Accepted in IJCNN 21

  5. Relaxing door-to-door matching reduces passenger waiting times: a workflow for the analysis of driver GPS traces in a stochastic carpooling service

    Authors: Panayotis Papoutsis, Safa Fennia, Constant Bridon, Tarn Duong

    Abstract: Carpooling has the potential to transform itself into a mass transportation mode by abandoning its adherence to deterministic passenger-driver matching for door-to-door journeys, and by adopting instead stochastic matching on a network of fixed meeting points. Stochastic matching is where a passenger sends out a carpooling request at a meeting point, and then waits for the arrival of a self-select… ▽ More

    Submitted 12 February, 2021; originally announced February 2021.

  6. Bayesian hierarchical models for the prediction of the driver flow and passenger waiting times in a stochastic carpooling service

    Authors: Panayotis Papoutsis, Bertrand Michel, Anne Philippe, Tarn Duong

    Abstract: Carpooling is an integral component in smart carbon-neutral cities, in particular to facilitate homework commuting. We study an innovative carpooling service developed by the start-up Ecov which specialises in homework commutes in peri-urban and rural regions. When a passenger makes a carpooling request, a designated driver is not assigned as in a traditional carpooling service; rather the passeng… ▽ More

    Submitted 17 July, 2020; originally announced July 2020.

  7. arXiv:2006.16789  [pdf, other

    cs.LG cs.AI stat.ML

    Causality Learning: A New Perspective for Interpretable Machine Learning

    Authors: Guandong Xu, Tri Dung Duong, Qian Li, Shaowu Liu, Xianzhi Wang

    Abstract: Recent years have witnessed the rapid growth of machine learning in a wide range of fields such as image recognition, text classification, credit scoring prediction, recommendation system, etc. In spite of their great performance in different sectors, researchers still concern about the mechanism under any machine learning (ML) techniques that are inherently black-box and becoming more complex to… ▽ More

    Submitted 17 September, 2021; v1 submitted 27 June, 2020; originally announced June 2020.

    Comments: 8 Pages

  8. arXiv:2005.11856  [pdf, other

    eess.IV cs.LG q-bio.QM stat.AP

    Predicting COVID-19 Pneumonia Severity on Chest X-ray with Deep Learning

    Authors: Joseph Paul Cohen, Lan Dao, Paul Morrison, Karsten Roth, Yoshua Bengio, Beiyi Shen, Almas Abbasi, Mahsa Hoshmand-Kochi, Marzyeh Ghassemi, Haifang Li, Tim Q Duong

    Abstract: Purpose: The need to streamline patient management for COVID-19 has become more pressing than ever. Chest X-rays provide a non-invasive (potentially bedside) tool to monitor the progression of the disease. In this study, we present a severity score prediction model for COVID-19 pneumonia for frontal chest X-ray images. Such a tool can gauge severity of COVID-19 lung infections (and pneumonia in ge… ▽ More

    Submitted 30 June, 2020; v1 submitted 24 May, 2020; originally announced May 2020.

  9. arXiv:1911.08795  [pdf

    cs.LG stat.ML

    On Node Features for Graph Neural Networks

    Authors: Chi Thang Duong, Thanh Dat Hoang, Ha The Hien Dang, Quoc Viet Hung Nguyen, Karl Aberer

    Abstract: Graph neural network (GNN) is a deep model for graph representation learning. One advantage of graph neural network is its ability to incorporate node features into the learning process. However, this prevents graph neural network from being applied into featureless graphs. In this paper, we first analyze the effects of node features on the performance of graph neural network. We show that GNNs wo… ▽ More

    Submitted 20 November, 2019; originally announced November 2019.

  10. arXiv:1909.02977  [pdf, other

    cs.LG cs.SI stat.ML

    Parallel Computation of Graph Embeddings

    Authors: Chi Thang Duong, Hongzhi Yin, Thanh Dat Hoang, Truong Giang Le Ba, Matthias Weidlich, Quoc Viet Hung Nguyen, Karl Aberer

    Abstract: Graph embedding aims at learning a vector-based representation of vertices that incorporates the structure of the graph. This representation then enables inference of graph properties. Existing graph embedding techniques, however, do not scale well to large graphs. We therefore propose a framework for parallel computation of a graph embedding using a cluster of compute nodes with resource constrai… ▽ More

    Submitted 6 September, 2019; originally announced September 2019.

  11. arXiv:1902.04181  [pdf, other

    cs.LG cs.AI stat.ML

    Nearest Neighbor Median Shift Clustering for Binary Data

    Authors: Gaël Beck, Tarn Duong, Mustapha Lebbah, Hanane Azzag

    Abstract: We describe in this paper the theory and practice behind a new modal clustering method for binary data. Our approach (BinNNMS) is based on the nearest neighbor median shift. The median shift is an extension of the well-known mean shift, which was designed for continuous data, to handle binary data. We demonstrate that BinNNMS can discover accurately the location of clusters in binary data with the… ▽ More

    Submitted 11 February, 2019; originally announced February 2019.

    Comments: Algorithms are available at https://github.com/Clustering4Ever/Clustering4Ever

  12. arXiv:1902.03833  [pdf, other

    cs.LG cs.AI cs.DC stat.ML

    A Distributed and Approximated Nearest Neighbors Algorithm for an Efficient Large Scale Mean Shift Clustering

    Authors: Gaël Beck, Tarn Duong, Mustapha Lebbah, Hanane Azzag, Christophe Cérin

    Abstract: In this paper we target the class of modal clustering methods where clusters are defined in terms of the local modes of the probability density function which generates the data. The most well-known modal clustering method is the k-means clustering. Mean Shift clustering is a generalization of the k-means clustering which computes arbitrarily shaped clusters as defined as the basins of attraction… ▽ More

    Submitted 11 February, 2019; originally announced February 2019.

    Comments: Algorithms are available at https://github.com/Clustering4Ever/Clustering4Ever

  13. arXiv:1806.05769  [pdf, other

    cond-mat.mtrl-sci stat.AP

    Bayesian Uncertainty Quantification and Information Fusion in CALPHAD-based Thermodynamic Modeling

    Authors: Pejman Honarmandi, Thien Chi Duong, Seyede Fatemeh Ghoreishi, Douglas Allaire, Raymundo Arroyave

    Abstract: Calculation of phase diagrams is one of the fundamental tools in alloy design---more specifically under the framework of Integrated Computational Materials Engineering. Uncertainty quantification of phase diagrams is the first step required to provide confidence for decision making in property- or performance-based design. As a manner of illustration, a thorough probabilistic assessment of the CAL… ▽ More

    Submitted 18 July, 2018; v1 submitted 12 June, 2018; originally announced June 2018.

    Comments: 22 pages, 8 Figures

  14. Exploratory data analysis for moderate extreme values using non-parametric kernel methods

    Authors: Boris Beranger, Tarn Duong, Sarah E. Perkins-Kirkpatrick, Scott A. Sisson

    Abstract: In many settings it is critical to accurately model the extreme tail behaviour of a random process. Non-parametric density estimation methods are commonly implemented as exploratory data analysis techniques for this purpose as they possess excellent visualisation properties, and can naturally avoid the model specification biases implied by using parametric estimators. In particular, kernel-based e… ▽ More

    Submitted 6 December, 2017; v1 submitted 28 February, 2016; originally announced February 2016.

  15. Efficient recursive algorithms for functionals based on higher order derivatives of the multivariate Gaussian density

    Authors: José E. Chacón, Tarn Duong

    Abstract: Many developments in Mathematics involve the computation of higher order derivatives of Gaussian density functions. The analysis of univariate Gaussian random variables is a well-established field whereas the analysis of their multivariate counterparts consists of a body of results which are more dispersed. These latter results generally fall into two main categories: theoretical expressions which… ▽ More

    Submitted 23 March, 2014; v1 submitted 9 October, 2013; originally announced October 2013.

    Comments: 30 pages, 1 figure

    MSC Class: 15A24; 65F30; 62E10; 62G05; 62H05

  16. Joint Modeling and Registration of Cell Populations in Cohorts of High-Dimensional Flow Cytometric Data

    Authors: Saumyadipta Pyne, Kui Wang, Jonathan Irish, Pablo Tamayo, Marc-Danie Nazaire, Tarn Duong, Sharon Lee, Shu-Kay Ng, David Hafler, Ronald Levy, Garry Nolan, Jill Mesirov, Geoffrey J. McLachlan

    Abstract: In systems biomedicine, an experimenter encounters different potential sources of variation in data such as individual samples, multiple experimental conditions, and multi-variable network-level responses. In multiparametric cytometry, which is often used for analyzing patient samples, such issues are critical. While computational methods can identify cell populations in individual samples, withou… ▽ More

    Submitted 31 May, 2013; originally announced May 2013.

  17. arXiv:1204.6160  [pdf, other

    math.ST stat.ME stat.ML

    Data-driven density derivative estimation, with applications to nonparametric clustering and bump hunting

    Authors: José E. Chacón, Tarn Duong

    Abstract: Important information concerning a multivariate data set, such as clusters and modal regions, is contained in the derivatives of the probability density function. Despite this importance, nonparametric estimation of higher order derivatives of the density functions have received only relatively scant attention. Kernel estimators of density functions are widely used as they exhibit excellent theore… ▽ More

    Submitted 19 February, 2013; v1 submitted 27 April, 2012; originally announced April 2012.

    Comments: 36 pages, 5 figures

    MSC Class: 62G05; 62H30