Skip to main content

Showing 1–9 of 9 results for author: Do, H

Searching in archive stat. Search in all archives.
.
  1. arXiv:2310.05892  [pdf, ps, other

    stat.ML cs.LG

    A Generalization Bound of Deep Neural Networks for Dependent Data

    Authors: Quan Huu Do, Binh T. Nguyen, Lam Si Tung Ho

    Abstract: Existing generalization bounds for deep neural networks require data to be independent and identically distributed (iid). This assumption may not hold in real-life applications such as evolutionary biology, infectious disease epidemiology, and stock price prediction. This work establishes a generalization bound of feed-forward neural networks for non-stationary $φ$-mixing data.

    Submitted 9 October, 2023; originally announced October 2023.

  2. arXiv:2206.09076  [pdf, other

    stat.ML cs.LG stat.ME

    Fair Generalized Linear Models with a Convex Penalty

    Authors: Hyungrok Do, Preston Putzel, Axel Martin, Padhraic Smyth, Judy Zhong

    Abstract: Despite recent advances in algorithmic fairness, methodologies for achieving fairness with generalized linear models (GLMs) have yet to be explored in general, despite GLMs being widely used in practice. In this paper we introduce two fairness criteria for GLMs based on equalizing expected outcomes or log-likelihoods. We prove that for GLMs both criteria can be achieved via a convex penalty term b… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: Accepted for publication in ICML 2022

  3. arXiv:2105.04648  [pdf, other

    stat.AP stat.ME

    Joint Fairness Model with Applications to Risk Predictions for Under-represented Populations

    Authors: Hyungrok Do, Shin**i Nandi, Preston Putzel, Padhraic Smyth, Judy Zhong

    Abstract: In data collection for predictive modeling, under-representation of certain groups, based on gender, race/ethnicity, or age, may yield less-accurate predictions for these groups. Recently, this issue of fairness in predictions has attracted significant attention, as data-driven models are increasingly utilized to perform crucial decision-making tasks. Existing methods to achieve fairness in the ma… ▽ More

    Submitted 23 February, 2022; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: 34 pages, 4 figures, 1 table

  4. Graph Convolutional Neural Networks with Node Transition Probability-based Message Passing and DropNode Regularization

    Authors: Tien Huu Do, Duc Minh Nguyen, Giannis Bekoulis, Adrian Munteanu, Nikos Deligiannis

    Abstract: Graph convolutional neural networks (GCNNs) have received much attention recently, owing to their capability in handling graph-structured data. Among the existing GCNNs, many methods can be viewed as instances of a neural message passing motif; features of nodes are passed around their neighbors, aggregated and transformed to produce better nodes' representations. Nevertheless, these methods seldo… ▽ More

    Submitted 18 March, 2021; v1 submitted 28 August, 2020; originally announced August 2020.

    Comments: Expert Systems with Applications, graph-based deep learning, graph neural networks, document classification

    Journal ref: Expert Systems with Applications, 174 (2021), Elsevier

  5. arXiv:1905.03042  [pdf, other

    cs.SI cs.CL cs.LG stat.ML

    Rumour Detection via News Propagation Dynamics and User Representation Learning

    Authors: Tien Huu Do, Xiao Luo, Duc Minh Nguyen, Nikos Deligiannis

    Abstract: Rumours have existed for a long time and have been known for serious consequences. The rapid growth of social media platforms has multiplied the negative impact of rumours; it thus becomes important to early detect them. Many methods have been introduced to detect rumours using the content or the social context of news. However, most existing methods ignore or do not explore effectively the propag… ▽ More

    Submitted 18 April, 2019; originally announced May 2019.

  6. arXiv:1811.01662  [pdf, other

    cs.LG cs.AI stat.ML

    Matrix Completion With Variational Graph Autoencoders: Application in Hyperlocal Air Quality Inference

    Authors: Tien Huu Do, Duc Minh Nguyen, Evaggelia Tsiligianni, Angel Lopez Aguirre, Valerio Panzica La Manna, Frank Pasveer, Wilfried Philips, Nikos Deligiannis

    Abstract: Inferring air quality from a limited number of observations is an essential task for monitoring and controlling air pollution. Existing inference methods typically use low spatial resolution data collected by fixed monitoring stations and infer the concentration of air pollutants using additional types of data, e.g., meteorological and traffic information. In this work, we focus on street-level ai… ▽ More

    Submitted 5 November, 2018; originally announced November 2018.

  7. arXiv:1712.08091  [pdf, other

    cs.LG cs.IR cs.SI stat.ML

    Multiview Deep Learning for Predicting Twitter Users' Location

    Authors: Tien Huu Do, Duc Minh Nguyen, Evaggelia Tsiligianni, Bruno Cornelis, Nikos Deligiannis

    Abstract: The problem of predicting the location of users on large social networks like Twitter has emerged from real-life applications such as social unrest detection and online marketing. Twitter user geolocation is a difficult and active research topic with a vast literature. Most of the proposed methods follow either a content-based or a network-based approach. The former exploits user-generated content… ▽ More

    Submitted 21 December, 2017; originally announced December 2017.

    Comments: Submitted to the IEEE Transactions on Big Data

  8. arXiv:1501.07506  [pdf, other

    stat.ME

    Accuracy of areal interpolation methods for count data

    Authors: Van Huyen Do, Christine Thomas-Agnan, Anne Vanhems

    Abstract: The combination of several socio-economic data bases originating from different administrative sources collected on several different partitions of a geographic zone of interest into administrative units induces the so called areal interpolation problem. This problem is that of allocating the data from a set of source spatial units to a set of target spatial units. A particular case of that proble… ▽ More

    Submitted 29 January, 2015; originally announced January 2015.

  9. arXiv:1201.4714  [pdf, other

    cs.LG stat.ML

    A metric learning perspective of SVM: on the relation of SVM and LMNN

    Authors: Huyen Do, Alexandros Kalousis, Jun Wang, Adam Woznica

    Abstract: Support Vector Machines, SVMs, and the Large Margin Nearest Neighbor algorithm, LMNN, are two very popular learning algorithms with quite different learning biases. In this paper we bring them into a unified view and show that they have a much stronger relation than what is commonly thought. We analyze SVMs from a metric learning perspective and cast them as a metric learning problem, a view which… ▽ More

    Submitted 23 January, 2012; originally announced January 2012.

    Comments: To appear in AISTATS 2012