Showing 1–28 of 28 results for author: Hui, S C

  1. arXiv:2303.09041  [pdf, other]

    cs.LG

    A Multimodal Data-driven Framework for Anxiety Screening

    Authors: Haimiao Mo, Shuai Ding, Siu Cheung Hui

    Abstract: Early screening for anxiety and appropriate interventions are essential to reduce the incidence of self-harm and suicide in patients. Due to limited medical resources, traditional methods that overly rely on physician expertise and specialized equipment cannot simultaneously meet the needs for high accuracy and model interpretability. Multimodal data can provide more objective evidence for anxiety…

    Submitted 15 March, 2023; originally announced March 2023.

  2. arXiv:2302.08220  [pdf, other]

    cs.CL

    Dialogue State Distillation Network with Inter-slot Contrastive Learning for Dialogue State Tracking

    Authors: Jing Xu, Dandan Song, Chong Liu, Siu Cheung Hui, Fei Li, Qiang Ju, Xiaonan He, Jian Xie

    Abstract: In task-oriented dialogue systems, Dialogue State Tracking (DST) aims to extract users' intentions from the dialogue history. Currently, most existing approaches suffer from error propagation and are unable to dynamically select relevant information when utilizing previous dialogue states. Moreover, the relations between the updates of different slots provide vital clues for DST. However, the exis…

    Submitted 7 March, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: Accepted by AAAI 2023

  3. arXiv:2211.00709  [pdf, other]

    cs.CL

    Semantic Pivoting Model for Effective Event Detection

    Authors: Anran Hao, Siu Cheung Hui, Jian Su

    Abstract: Event Detection, which aims to identify and classify mentions of event instances from unstructured articles, is an important task in Natural Language Processing (NLP). Existing techniques for event detection only use homogeneous one-hot vectors to represent the event type classes, ignoring the fact that the semantic meaning of the types is important to the task. Such an approach is inefficient and…

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 11 pages, 4 figures; Accepted to ACIIDS 2022

  4. arXiv:2201.10978  [pdf, other]

    cs.IR cs.CL cs.LG

    Machine Learning for Food Review and Recommendation

    Authors: Tan Khang Le, Siu Cheung Hui

    Abstract: Food reviews and recommendations have always been important for online food service websites. However, reviewing and recommending food is not simple as it is likely to be overwhelmed by disparate contexts and meanings. In this paper, we use different deep learning approaches to address the problems of sentiment analysis, automatic review tag generation, and retrieval of food reviews. We propose to…

    Submitted 14 January, 2022; originally announced January 2022.

    Comments: Accepted paper to International Student Conference on Artificial Intelligence (STCAI) 2021

  5. arXiv:2102.08597  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV

    Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications with $1/n$ Parameters

    Authors: Aston Zhang, Yi Tay, Shuai Zhang, Alvin Chan, Anh Tuan Luu, Siu Cheung Hui, Jie Fu

    Abstract: Recent works have demonstrated reasonable success of representation learning in hypercomplex space. Specifically, "fully-connected layers with Quaternions" (4D hypercomplex numbers), which replace real-valued matrix multiplications in fully-connected layers with Hamilton products of Quaternions, both enjoy parameter savings with only 1/4 learnable parameters and achieve comparable performance in v…

    Submitted 17 February, 2021; originally announced February 2021.

    Comments: Published as a conference paper at the 9th International Conference on Learning Representations (ICLR 2021)

  6. arXiv:1907.00782  [pdf, other]

    cs.CR cs.CY cs.DB cs.LG

    Collecting and Analyzing Multidimensional Data with Local Differential Privacy

    Authors: Ning Wang, Xiaokui Xiao, Yin Yang, Jun Zhao, Siu Cheung Hui, Hyejin Shin, Junbum Shin, Ge Yu

    Abstract: Local differential privacy (LDP) is a recently proposed privacy standard for collecting and analyzing data, which has been used, e.g., in the Chrome browser, iOS and macOS. In LDP, each user perturbs her information locally, and only sends the randomized version to an aggregator who performs analyses, which protects both the users and the aggregator against private information leaks. Although LDP…

    Submitted 28 June, 2019; originally announced July 2019.

    Comments: 12-Page Full Paper in Proceedings of the 2019 IEEE International Conference on Data Engineering (ICDE). arXiv admin note: text overlap with arXiv:1606.05053

    MSC Class: Local differential privacy; multidimensional data; stochastic gradient descent

  7. arXiv:1906.04393  [pdf, other]

    cs.CL cs.LG

    Lightweight and Efficient Neural Natural Language Processing with Quaternion Networks

    Authors: Yi Tay, Aston Zhang, Luu Anh Tuan, Jinfeng Rao, Shuai Zhang, Shuohang Wang, Jie Fu, Siu Cheung Hui

    Abstract: Many state-of-the-art neural models for NLP are heavily parameterized and thus memory inefficient. This paper proposes a series of lightweight and memory efficient neural architectures for a potpourri of natural language processing (NLP) tasks. To this end, our models exploit computation using Quaternion algebra and hypercomplex spaces, enabling not only expressive inter-component interactions but…

    Submitted 11 June, 2019; originally announced June 2019.

    Comments: ACL 2019

  8. arXiv:1905.10847  [pdf, other]

    cs.CL cs.AI cs.IR

    Simple and Effective Curriculum Pointer-Generator Networks for Reading Comprehension over Long Narratives

    Authors: Yi Tay, Shuohang Wang, Luu Anh Tuan, Jie Fu, Minh C. Phan, Xingdi Yuan, Jinfeng Rao, Siu Cheung Hui, Aston Zhang

    Abstract: This paper tackles the problem of reading comprehension over long narratives where documents easily span over thousands of tokens. We propose a curriculum learning (CL) based Pointer-Generator framework for reading/sampling over large documents, enabling diverse training of the neural model based on the notion of alternating contextual difficulty. This can be interpreted as a form of domain random…

    Submitted 26 May, 2019; originally announced May 2019.

    Comments: Accepted to ACL 2019

  9. arXiv:1811.09786  [pdf, other]

    cs.CL cs.AI cs.IR cs.NE

    Recurrently Controlled Recurrent Networks

    Authors: Yi Tay, Luu Anh Tuan, Siu Cheung Hui

    Abstract: Recurrent neural networks (RNNs) such as long short-term memory and gated recurrent units are pivotal building blocks across a broad spectrum of sequence modeling problems. This paper proposes a recurrently controlled recurrent network (RCRN) for expressive and powerful sequence encoding. More concretely, the key idea behind our approach is to learn the recurrent gating functions using recurrent n…

    Submitted 24 November, 2018; originally announced November 2018.

    Comments: NIPS 2018

  10. arXiv:1811.04210  [pdf, other]

    cs.CL cs.AI cs.IR cs.NE

    Densely Connected Attention Propagation for Reading Comprehension

    Authors: Yi Tay, Luu Anh Tuan, Siu Cheung Hui, Jian Su

    Abstract: We propose DecaProp (Densely Connected Attention Propagation), a new densely connected neural architecture for reading comprehension (RC). There are two distinct characteristics of our model. Firstly, our model densely connects all pairwise layers of the network, modeling relationships between passage and query across all hierarchical levels. Secondly, the dense connectors in our network are learn…

    Submitted 2 April, 2019; v1 submitted 10 November, 2018; originally announced November 2018.

    Comments: NIPS 2018

  11. arXiv:1810.02938  [pdf, other]

    cs.CL cs.AI cs.IR

    Co-Stack Residual Affinity Networks with Multi-level Attention Refinement for Matching Text Sequences

    Authors: Yi Tay, Luu Anh Tuan, Siu Cheung Hui

    Abstract: Learning a matching function between two text sequences is a long-standing problem in NLP research. This task enables many potential applications such as question answering and paraphrase identification. This paper proposes Co-Stack Residual Affinity Networks (CSRAN), a new and universal neural architecture for this problem. CSRAN is a deep architecture, involving stacked (multi-layered) recurrent…

    Submitted 6 October, 2018; originally announced October 2018.

    Comments: EMNLP 2018

  12. arXiv:1806.06446   

    cs.IR cs.AI cs.LG cs.NE

    Self-Attentive Neural Collaborative Filtering

    Authors: Yi Tay, Shuai Zhang, Luu Anh Tuan, Siu Cheung Hui

    Abstract: This paper has been withdrawn as we discovered a bug in our TensorFlow implementation that involved accidental mixing of vectors across batches. This led to different inference results given different batch sizes, which is completely strange. The performance scores still remain the same but we concluded that it was not the self-attention that contributed to the performance. We are withdrawing the…

    Submitted 19 July, 2018; v1 submitted 17 June, 2018; originally announced June 2018.

    Comments: We discovered a bug in our tensorflow implementation that involved accidental mixing of vectors across batches, rendering the main claim of the paper incorrect. We are withdrawing this paper until we find out why

  13. arXiv:1806.00778  [pdf, other]

    cs.CL cs.AI cs.IR

    Multi-Cast Attention Networks for Retrieval-based Question Answering and Response Prediction

    Authors: Yi Tay, Luu Anh Tuan, Siu Cheung Hui

    Abstract: Attention is typically used to select informative sub-phrases that are used for prediction. This paper investigates the novel use of attention as a form of feature augmentation, i.e., casted attention. We propose Multi-Cast Attention Networks (MCAN), a new attention mechanism and general model architecture for a potpourri of ranking tasks in the conversational modeling and question answering domain…

    Submitted 3 June, 2018; originally announced June 2018.

    Comments: Accepted to KDD 2018 (Paper titled only "Multi-Cast Attention Networks" in KDD version)

  14. arXiv:1805.11535  [pdf, other]

    cs.CL cs.AI cs.IR cs.NE

    CoupleNet: Paying Attention to Couples with Coupled Attention for Relationship Recommendation

    Authors: Yi Tay, Anh Tuan Luu, Siu Cheung Hui

    Abstract: Dating and romantic relationships not only play a huge role in our personal lives but also collectively influence and shape society. Today, many romantic partnerships originate from the Internet, signifying the importance of technology and the web in modern dating. In this paper, we present a text-based computational approach for estimating the relationship compatibility of two users on social med…

    Submitted 29 May, 2018; originally announced May 2018.

    Comments: Accepted at ICWSM 2018

  15. arXiv:1805.02856  [pdf, other]

    cs.CL cs.AI cs.IR

    Reasoning with Sarcasm by Reading In-between

    Authors: Yi Tay, Luu Anh Tuan, Siu Cheung Hui, Jian Su

    Abstract: Sarcasm is a sophisticated speech act which commonly manifests on social communities such as Twitter and Reddit. The prevalence of sarcasm on the social web is highly disruptive to opinion mining systems due to not only its tendency of polarity flipping but also usage of figurative language. Sarcasm commonly manifests with a contrastive theme either between positive-negative sentiments or between…

    Submitted 8 May, 2018; originally announced May 2018.

    Comments: Accepted to ACL2018

  16. arXiv:1803.09074  [pdf, other]

    cs.CL cs.AI cs.NE

    Multi-range Reasoning for Machine Comprehension

    Authors: Yi Tay, Luu Anh Tuan, Siu Cheung Hui

    Abstract: We propose MRU (Multi-Range Reasoning Units), a new fast compositional encoder for machine comprehension (MC). Our proposed MRU encoders are characterized by multi-ranged gating, executing a series of parameterized contract-and-expand layers for learning gating vectors that benefit from long and short-term dependencies. The aims of our approach are as follows: (1) learning representations that are…

    Submitted 24 March, 2018; originally announced March 2018.

  17. arXiv:1801.09251  [pdf, other]

    cs.CL cs.AI cs.IR

    Multi-Pointer Co-Attention Networks for Recommendation

    Authors: Yi Tay, Luu Anh Tuan, Siu Cheung Hui

    Abstract: Many recent state-of-the-art recommender systems such as D-ATT, TransNet and DeepCoNN exploit reviews for representation learning. This paper proposes a new neural architecture for recommendation with reviews. Our model operates on a multi-hierarchical paradigm and is based on the intuition that not all reviews are created equal, i.e., only a select few are important. The importance, however, shou…

    Submitted 21 June, 2018; v1 submitted 28 January, 2018; originally announced January 2018.

    Comments: Accepted to KDD 2018 (Research Track)

  18. arXiv:1801.00102  [pdf, other]

    cs.CL cs.AI

    Compare, Compress and Propagate: Enhancing Neural Architectures with Alignment Factorization for Natural Language Inference

    Authors: Yi Tay, Luu Anh Tuan, Siu Cheung Hui

    Abstract: This paper presents a new deep learning architecture for Natural Language Inference (NLI). Firstly, we introduce a new architecture where alignment pairs are compared, compressed and then propagated to upper layers for enhanced representation learning. Secondly, we adopt factorization layers for efficient and expressive compression of alignment vectors into scalar features, which are then used to…

    Submitted 10 September, 2018; v1 submitted 30 December, 2017; originally announced January 2018.

    Comments: EMNLP 2018 CRC and Update CAFE + ELMo result on SNLI

  19. arXiv:1712.05403  [pdf, other]

    cs.CL cs.AI cs.IR

    Learning to Attend via Word-Aspect Associative Fusion for Aspect-based Sentiment Analysis

    Authors: Yi Tay, Anh Tuan Luu, Siu Cheung Hui

    Abstract: Aspect-based sentiment analysis (ABSA) tries to predict the polarity of a given document with respect to a given aspect entity. While neural network architectures have been successful in predicting the overall polarity of sentences, aspect-specific sentiment analysis remains an open problem. In this paper, we propose a novel method for integrating aspect information into the neural model.…

    Submitted 14 December, 2017; originally announced December 2017.

    Comments: Accepted to AAAI2018

  20. arXiv:1711.07656  [pdf, other]

    cs.CL cs.AI cs.IR

    Cross Temporal Recurrent Networks for Ranking Question Answer Pairs

    Authors: Yi Tay, Luu Anh Tuan, Siu Cheung Hui

    Abstract: Temporal gates play a significant role in modern recurrent-based neural encoders, enabling fine-grained control over recursive compositional operations over time. In recurrent models such as the long short-term memory (LSTM), temporal gates control the amount of information retained or discarded over time, not only playing an important role in influencing the learned representations but also servi…

    Submitted 21 November, 2017; originally announced November 2017.

    Comments: Accepted to AAAI2018

  21. arXiv:1711.04981  [pdf, other]

    cs.AI cs.CL

    SkipFlow: Incorporating Neural Coherence Features for End-to-End Automatic Text Scoring

    Authors: Yi Tay, Minh C. Phan, Luu Anh Tuan, Siu Cheung Hui

    Abstract: Deep learning has demonstrated tremendous potential for Automatic Text Scoring (ATS) tasks. In this paper, we describe a new neural architecture that enhances vanilla neural network models with auxiliary neural coherence features. Our new method proposes a new SkipFlow mechanism that models relationships between snapshots of the hidden representations of a long short-term memory (LSTM) ne…

    Submitted 14 November, 2017; originally announced November 2017.

    Comments: Accepted to AAAI 2018

  22. arXiv:1708.07436  [pdf, other]

    cs.LG cs.CR cs.DB

    Differentially Private Regression for Discrete-Time Survival Analysis

    Authors: Thông T. Nguyên, Siu Cheung Hui

    Abstract: In survival analysis, regression models are used to understand the effects of explanatory variables (e.g., age, sex, weight, etc.) on the survival probability. However, for sensitive survival data such as medical data, there are serious concerns about the privacy of individuals in the data set when medical data is used to fit the regression models. The closest work addressing such privacy concerns…

    Submitted 24 August, 2017; v1 submitted 24 August, 2017; originally announced August 2017.

    Comments: 19 pages, CIKM17

  23. arXiv:1708.04828  [pdf, other]

    cs.AI cs.IR

    Multi-task Neural Network for Non-discrete Attribute Prediction in Knowledge Graphs

    Authors: Yi Tay, Luu Anh Tuan, Minh C. Phan, Siu Cheung Hui

    Abstract: Many popular knowledge graphs such as Freebase, YAGO or DBPedia maintain a list of non-discrete attributes for each entity. Intuitively, these attributes such as height, price or population count are able to richly characterize entities in knowledge graphs. This additional source of information may help to alleviate the inherent sparsity and incompleteness problems that are prevalent in knowledge g…

    Submitted 16 August, 2017; originally announced August 2017.

    Comments: Accepted at CIKM 2017

  24. arXiv:1708.04517  [pdf, other]

    cs.CR cs.DB

    Privacy-Preserving Mechanisms for Parametric Survival Analysis with Weibull Distribution

    Authors: Thông T. Nguyên, Siu Cheung Hui

    Abstract: Survival analysis studies the statistical properties of the time until an event of interest occurs. It has been commonly used to study the effectiveness of medical treatments or the lifespan of a population. However, survival analysis can potentially leak confidential information of individuals in the dataset. The state-of-the-art techniques apply ad-hoc privacy-preserving mechanisms on publishing…

    Submitted 24 August, 2017; v1 submitted 1 July, 2017; originally announced August 2017.

    Comments: 8 pages, Trustcom17

  25. Hyperbolic Representation Learning for Fast and Efficient Neural Question Answering

    Authors: Yi Tay, Luu Anh Tuan, Siu Cheung Hui

    Abstract: The dominant neural architectures in question answer retrieval are based on recurrent or convolutional encoders configured with complex word matching layers. Given that recent architectural innovations are mostly new word interaction layers or attention-based matching mechanisms, it seems to be a well-established fact that these components are mandatory for good performance. Unfortunately, the mem…

    Submitted 23 November, 2017; v1 submitted 25 July, 2017; originally announced July 2017.

    Comments: Accepted at WSDM 2018

  26. Learning to Rank Question Answer Pairs with Holographic Dual LSTM Architecture

    Authors: Yi Tay, Minh C. Phan, Luu Anh Tuan, Siu Cheung Hui

    Abstract: We describe a new deep learning architecture for learning to rank question answer pairs. Our approach extends the long short-term memory (LSTM) network with holographic composition to model the relationship between question and answer representations. As opposed to the neural tensor layer that has been adopted recently, the holographic composition provides the benefits of scalable and rich represe…

    Submitted 20 July, 2017; originally announced July 2017.

    Comments: SIGIR 2017 Full Paper

  27. Latent Relational Metric Learning via Memory-based Attention for Collaborative Ranking

    Authors: Yi Tay, Anh Tuan Luu, Siu Cheung Hui

    Abstract: This paper proposes a new neural architecture for collaborative ranking with implicit feedback. Our model, LRML (Latent Relational Metric Learning), is a novel metric learning approach for recommendation. More specifically, instead of simple push-pull mechanisms between user and item pairs, we propose to learn latent relations that describe each user item interaction. This helps to allevia…

    Submitted 13 February, 2018; v1 submitted 17 July, 2017; originally announced July 2017.

    Comments: WWW 2018

  28. arXiv:1606.05053  [pdf, ps, other]

    cs.DB

    Collecting and Analyzing Data from Smart Device Users with Local Differential Privacy

    Authors: Thông T. Nguyên, Xiaokui Xiao, Yin Yang, Siu Cheung Hui, Hyejin Shin, Junbum Shin

    Abstract: Organizations with a large user base, such as Samsung and Google, can potentially benefit from collecting and mining users' data. However, doing so raises privacy concerns, and risks accidental privacy breaches with serious consequences. Local differential privacy (LDP) techniques address this problem by only collecting randomized answers from each user, with guarantees of plausible deniability; m…

    Submitted 16 June, 2016; originally announced June 2016.

    Comments: 11 pages