Skip to main content

Showing 1–14 of 14 results for author: Chua, T

Searching in archive stat. Search in all archives.
.
  1. Discovering Dynamic Causal Space for DAG Structure Learning

    Authors: Fangfu Liu, Wenchang Ma, An Zhang, Xiang Wang, Yueqi Duan, Tat-Seng Chua

    Abstract: Discovering causal structure from purely observational data (i.e., causal discovery), aiming to identify causal relationships among variables, is a fundamental task in machine learning. The recent invention of differentiable score-based DAG learners is a crucial enabler, which reframes the combinatorial optimization problem into a differentiable optimization with a DAG constraint over directed gra… ▽ More

    Submitted 11 December, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted by KDD 2023. Our codes are available at https://github.com/liuff19/CASPER

  2. arXiv:2303.03187  [pdf, other

    cs.LG stat.ML

    Boosting Differentiable Causal Discovery via Adaptive Sample Reweighting

    Authors: An Zhang, Fangfu Liu, Wenchang Ma, Zhibo Cai, Xiang Wang, Tat-seng Chua

    Abstract: Under stringent model type and variable distribution assumptions, differentiable score-based causal discovery methods learn a directed acyclic graph (DAG) from observational data by evaluating candidate graphs over an average score function. Despite great success in low-dimensional linear systems, it has been observed that these approaches overly exploit easier-to-fit samples, thus inevitably lear… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: In proceedings of ICLR 2023

  3. arXiv:2205.15638  [pdf, other

    cs.LG cs.DS stat.ME

    Differentiable Invariant Causal Discovery

    Authors: Yu Wang, An Zhang, Xiang Wang, Yancheng Yuan, Xiangnan He, Tat-Seng Chua

    Abstract: Learning causal structure from observational data is a fundamental challenge in machine learning. However, the majority of commonly used differentiable causal discovery methods are non-identifiable, turning this problem into a continuous optimization task prone to data biases. In many real-life situations, data is collected from different environments, in which the functional relations remain cons… ▽ More

    Submitted 29 September, 2022; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: 22 pages, 11 figures

  4. Should Graph Convolution Trust Neighbors? A Simple Causal Inference Method

    Authors: Fuli Feng, Weiran Huang, Xiangnan He, Xin Xin, Qifan Wang, Tat-Seng Chua

    Abstract: Graph Convolutional Network (GCN) is an emerging technique for information retrieval (IR) applications. While GCN assumes the homophily property of a graph, real-world graphs are never perfect: the local structure of a node may contain discrepancy, e.g., the labels of a node's neighbors could vary. This pushes us to consider the discrepancy of local structure in GCN modeling. Existing work approac… ▽ More

    Submitted 6 June, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: Accepted by SIGIR'21

  5. arXiv:2005.12979  [pdf, other

    cs.IR cs.LG cs.SI stat.ML

    Seamlessly Unifying Attributes and Items: Conversational Recommendation for Cold-Start Users

    Authors: Shijun Li, Wenqiang Lei, Qingyun Wu, Xiangnan He, Peng Jiang, Tat-Seng Chua

    Abstract: Static recommendation methods like collaborative filtering suffer from the inherent limitation of performing real-time personalization for cold-start users. Online recommendation, e.g., multi-armed bandit approach, addresses this limitation by interactively exploring user preference online and pursuing the exploration-exploitation (EE) trade-off. However, existing bandit-based methods model recomm… ▽ More

    Submitted 5 October, 2022; v1 submitted 23 May, 2020; originally announced May 2020.

    Comments: TOIS 2021

    ACM Class: I.2.6

  6. arXiv:2003.02587  [pdf, other

    cs.LG stat.ML

    Cross-GCN: Enhancing Graph Convolutional Network with $k$-Order Feature Interactions

    Authors: Fuli Feng, Xiangnan He, Hanwang Zhang, Tat-Seng Chua

    Abstract: Graph Convolutional Network (GCN) is an emerging technique that performs learning and reasoning on graph data. It operates feature learning on the graph structure, through aggregating the features of the neighbor nodes to obtain the embedding of each target node. Owing to the strong representation power, recent research shows that GCN achieves state-of-the-art performance on several tasks such as… ▽ More

    Submitted 5 March, 2020; originally announced March 2020.

    Comments: Submitted to TKDE

  7. arXiv:1910.03648  [pdf, ps, other

    cs.CV cs.LG stat.ML

    Meta-Transfer Learning through Hard Tasks

    Authors: Qianru Sun, Yaoyao Liu, Zhaozheng Chen, Tat-Seng Chua, Bernt Schiele

    Abstract: Meta-learning has been proposed as a framework to address the challenging few-shot learning setting. The key idea is to leverage a large number of similar few-shot tasks in order to learn how to adapt a base-learner to a new task for which only a few labeled samples are available. As deep neural networks (DNNs) tend to overfit using a few samples only, typical meta-learning models use shallow neur… ▽ More

    Submitted 7 October, 2019; originally announced October 2019.

    Comments: An extended version of a paper published in CVPR2019. Under review. arXiv admin note: substantial text overlap with arXiv:1812.02391

  8. arXiv:1906.00562  [pdf, ps, other

    cs.CV cs.LG stat.ML

    Learning to Self-Train for Semi-Supervised Few-Shot Classification

    Authors: Xinzhe Li, Qianru Sun, Yaoyao Liu, Shibao Zheng, Qin Zhou, Tat-Seng Chua, Bernt Schiele

    Abstract: Few-shot classification (FSC) is challenging due to the scarcity of labeled training data (e.g. only one labeled data point per class). Meta-learning has shown to achieve promising results by learning to initialize a classification model for FSC. In this paper we propose a novel semi-supervised meta-learning method called learning to self-train (LST) that leverages unlabeled data and specifically… ▽ More

    Submitted 29 September, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  9. arXiv:1905.07854  [pdf, other

    cs.LG cs.IR stat.ML

    KGAT: Knowledge Graph Attention Network for Recommendation

    Authors: Xiang Wang, Xiangnan He, Yixin Cao, Meng Liu, Tat-Seng Chua

    Abstract: To provide more accurate, diverse, and explainable recommendation, it is compulsory to go beyond modeling user-item interactions and take side information into account. Traditional methods like factorization machine (FM) cast it as a supervised learning problem, which assumes each interaction as an independent instance with side information encoded. Due to the overlook of the relations among insta… ▽ More

    Submitted 7 June, 2019; v1 submitted 19 May, 2019; originally announced May 2019.

    Comments: KDD 2019 research track

  10. arXiv:1902.08226  [pdf, ps, other

    cs.LG cs.SI stat.ML

    Graph Adversarial Training: Dynamically Regularizing Based on Graph Structure

    Authors: Fuli Feng, Xiangnan He, Jie Tang, Tat-Seng Chua

    Abstract: Recent efforts show that neural networks are vulnerable to small but intentional perturbations on input features in visual classification tasks. Due to the additional consideration of connections between examples (\eg articles with citation link tend to be in the same class), graph neural networks could be more sensitive to the perturbations, since the perturbations from connected examples exacerb… ▽ More

    Submitted 14 December, 2019; v1 submitted 20 February, 2019; originally announced February 2019.

    Comments: Accepted by TKDE

  11. arXiv:1811.04411  [pdf, other

    cs.IR cs.LG stat.ML

    Fast Matrix Factorization with Non-Uniform Weights on Missing Data

    Authors: Xiangnan He, **hui Tang, Xiaoyu Du, Richang Hong, Tongwei Ren, Tat-Seng Chua

    Abstract: Matrix factorization (MF) has been widely used to discover the low-rank structure and to predict the missing entries of data matrix. In many real-world learning systems, the data matrix can be very high-dimensional but sparse. This poses an imbalanced learning problem, since the scale of missing entries is usually much larger than that of observed entries, but they cannot be ignored due to the val… ▽ More

    Submitted 7 January, 2019; v1 submitted 11 November, 2018; originally announced November 2018.

    Comments: IEEE Transactions on Neural Networks and Learning Systems (TNNLS)

  12. arXiv:1808.03912  [pdf, other

    cs.IR cs.LG stat.ML

    Outer Product-based Neural Collaborative Filtering

    Authors: Xiangnan He, Xiaoyu Du, Xiang Wang, Feng Tian, **hui Tang, Tat-Seng Chua

    Abstract: In this work, we contribute a new multi-layer neural network architecture named ONCF to perform collaborative filtering. The idea is to use an outer product to explicitly model the pairwise correlations between the dimensions of the embedding space. In contrast to existing neural recommender models that combine user embedding and item embedding via a simple concatenation or element-wise product, o… ▽ More

    Submitted 12 August, 2018; originally announced August 2018.

    Comments: IJCAI 2018

  13. arXiv:1808.03908  [pdf, other

    cs.IR cs.LG stat.ML

    Adversarial Personalized Ranking for Recommendation

    Authors: Xiangnan He, Zhankui He, Xiaoyu Du, Tat-Seng Chua

    Abstract: Item recommendation is a personalized ranking task. To this end, many recommender systems optimize models with pairwise ranking objectives, such as the Bayesian Personalized Ranking (BPR). Using matrix Factorization (MF) --- the most widely used model in recommendation --- as a demonstration, we show that optimizing it with BPR leads to a recommender model that is not robust. In particular, we fin… ▽ More

    Submitted 12 August, 2018; originally announced August 2018.

    Comments: SIGIR 2018

  14. arXiv:1606.02979  [pdf, other

    cs.CL cs.AI cs.IR cs.LG stat.ML

    Generative Topic Embedding: a Continuous Representation of Documents (Extended Version with Proofs)

    Authors: Shaohua Li, Tat-Seng Chua, Jun Zhu, Chunyan Miao

    Abstract: Word embedding maps words into a low-dimensional continuous embedding space by exploiting the local word collocation patterns in a small context window. On the other hand, topic modeling maps documents onto a low-dimensional topic space, by utilizing the global word collocation patterns in the same document. These two types of patterns are complementary. In this paper, we propose a generative topi… ▽ More

    Submitted 8 August, 2016; v1 submitted 9 June, 2016; originally announced June 2016.

    Comments: 13 pages. The original version has been accepted in ACL 2016 as a long paper