Skip to main content

Showing 1–14 of 14 results for author: Kao, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.04989  [pdf, other

    cs.SE cs.CR

    Profile of Vulnerability Remediations in Dependencies Using Graph Analysis

    Authors: Fernando Vera, Palina Pauliuchenka, Ethan Oh, Bai Chien Kao, Louis DiValentin, David A. Bader

    Abstract: This research introduces graph analysis methods and a modified Graph Attention Convolutional Neural Network (GAT) to the critical challenge of open source package vulnerability remediation by analyzing control flow graphs to profile breaking changes in applications occurring from dependency upgrades intended to remediate vulnerabilities. Our approach uniquely applies node centrality metrics -- deg… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  2. arXiv:2403.01121  [pdf, other

    cs.LG cs.AI cs.SI

    OpenGraph: Towards Open Graph Foundation Models

    Authors: Lianghao Xia, Ben Kao, Chao Huang

    Abstract: Graph learning has become indispensable for interpreting and harnessing relational data in diverse fields, ranging from recommendation systems to social network analysis. In this context, a variety of GNNs have emerged as promising methodologies for encoding the structural information of graphs. By effectively capturing the graph's underlying structure, these GNNs have shown great potential in enh… ▽ More

    Submitted 28 March, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

  3. Debiasing Recommendation with Personal Popularity

    Authors: Wentao Ning, Reynold Cheng, Xiao Yan, Ben Kao, Nan Huo, Nur AI Hasan Haldar, Bo Tang

    Abstract: Global popularity (GP) bias is the phenomenon that popular items are recommended much more frequently than they should be, which goes against the goal of providing personalized recommendations and harms user experience and recommendation accuracy. Many methods have been proposed to reduce GP bias but they fail to notice the fundamental problem of GP, i.e., it considers popularity from a \textit{gl… ▽ More

    Submitted 21 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted by WWW'24 as a research full paper

  4. arXiv:2310.04069  [pdf, other

    cs.DB

    Spatio-temporal flow patterns

    Authors: Chrysanthi Kosyfaki, Nikos Mamoulis, Reynold Cheng, Ben Kao

    Abstract: Transportation companies and organizations routinely collect huge volumes of passenger transportation data. By aggregating these data (e.g., counting the number of passengers going from a place to another in every 30 minute interval), it becomes possible to analyze the movement behavior of passengers in a metropolitan area. In this paper, we study the problem of finding important trends in passeng… ▽ More

    Submitted 12 October, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

  5. Automated Self-Supervised Learning for Recommendation

    Authors: Lianghao Xia, Chao Huang, Chunzhen Huang, Kangyi Lin, Tao Yu, Ben Kao

    Abstract: Graph neural networks (GNNs) have emerged as the state-of-the-art paradigm for collaborative filtering (CF). To improve the representation quality over limited labeled data, contrastive learning has attracted attention in recommendation and benefited graph-based CF model recently. However, the success of most contrastive methods heavily relies on manually generating effective contrastive views for… ▽ More

    Submitted 20 March, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: Accepted by ACM The Web Conference, 2023

  6. arXiv:2205.01941  [pdf, other

    cs.CL

    Lexical Knowledge Internalization for Neural Dialog Generation

    Authors: Zhiyong Wu, Wei Bi, Xiang Li, Lingpeng Kong, Ben Kao

    Abstract: We propose knowledge internalization (KI), which aims to complement the lexical knowledge into neural dialog models. Instead of further conditioning the knowledge-grounded dialog (KGD) models on externally retrieved knowledge, we seek to integrate knowledge about each input token internally into the model's parameters. To tackle the challenge due to the large scale of lexical knowledge, we adopt t… ▽ More

    Submitted 4 May, 2022; originally announced May 2022.

    Comments: To appear at ACL 2022 main conference

  7. Automatic Meta-Path Discovery for Effective Graph-Based Recommendation

    Authors: Wentao Ning, Reynold Cheng, Jiajun Shen, Nur Al Hasan Haldar, Ben Kao, Xiao Yan, Nan Huo, Wai Kit Lam, Tian Li, Bo Tang

    Abstract: Heterogeneous Information Networks (HINs) are labeled graphs that depict relationships among different types of entities (e.g., users, movies and directors). For HINs, meta-path-based recommenders (MPRs) utilize meta-paths (i.e., abstract paths consisting of node and link types) to predict user preference, and have attracted a lot of attention due to their explainability and performance. We observ… ▽ More

    Submitted 7 September, 2022; v1 submitted 23 December, 2021; originally announced December 2021.

    Comments: This paper is accepted as a full research paper by CIKM 2022

  8. arXiv:2105.14462  [pdf, other

    cs.CL cs.AI

    Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation

    Authors: Zhiyong Wu, Lingpeng Kong, Wei Bi, Xiang Li, Ben Kao

    Abstract: A neural multimodal machine translation (MMT) system is one that aims to perform better translation by extending conventional text-only translation models with multimodal information. Many recent studies report improvements when equip** their models with the multimodal module, despite the controversy of whether such improvements indeed come from the multimodal part. We revisit the contribution o… ▽ More

    Submitted 30 May, 2021; originally announced May 2021.

    Comments: To appear at ACL 2021 main conference

  9. arXiv:2012.10024  [pdf, other

    cs.LG cs.SI

    Leveraging Meta-path Contexts for Classification in Heterogeneous Information Networks

    Authors: Xiang Li, Danhao Ding, Ben Kao, Yizhou Sun, Nikos Mamoulis

    Abstract: A heterogeneous information network (HIN) has as vertices objects of different types and as edges the relations between objects, which are also of various types. We study the problem of classifying objects in HINs. Most existing methods perform poorly when given scarce labeled objects as training sets, and methods that improve classification accuracy under such scenarios are often computationally… ▽ More

    Submitted 20 February, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: Accepted by ICDE 2021

  10. arXiv:2006.04435  [pdf, other

    cs.LG cs.AI stat.ML

    CAST: A Correlation-based Adaptive Spectral Clustering Algorithm on Multi-scale Data

    Authors: Xiang Li, Ben Kao, Caihua Shan, Dawei Yin, Martin Ester

    Abstract: We study the problem of applying spectral clustering to cluster multi-scale data, which is data whose clusters are of various sizes and densities. Traditional spectral clustering techniques discover clusters by processing a similarity matrix that reflects the proximity of objects. For multi-scale data, distance-based similarity is not effective because objects of a sparse cluster could be far apar… ▽ More

    Submitted 8 June, 2020; originally announced June 2020.

  11. arXiv:2004.14786  [pdf, other

    cs.CL

    Perturbed Masking: Parameter-free Probing for Analyzing and Interpreting BERT

    Authors: Zhiyong Wu, Yun Chen, Ben Kao, Qun Liu

    Abstract: By introducing a small set of additional parameters, a probe learns to solve specific linguistic tasks (e.g., dependency parsing) in a supervised manner using feature representations (e.g., contextualized embeddings). The effectiveness of such probing tasks is taken as evidence that the pre-trained model encodes linguistic knowledge. However, this approach of evaluating a language model is undermi… ▽ More

    Submitted 28 May, 2021; v1 submitted 30 April, 2020; originally announced April 2020.

    Comments: ACL2020

  12. arXiv:2003.00773  [pdf, other

    cs.DB

    Top-K Deep Video Analytics: A Probabilistic Approach

    Authors: Ziliang Lai, Chenxia Han, Chris Liu, Pengfei Zhang, Eric Lo, Ben Kao

    Abstract: The impressive accuracy of deep neural networks (DNNs) has created great demands on practical analytics over video data. Although efficient and accurate, the latest video analytic systems have not supported analytics beyond selection and aggregation queries. In data analytics, Top-K is a very important analytical operation that enables analysts to focus on the most important entities. In this pape… ▽ More

    Submitted 28 March, 2021; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: 14 pages, 9 figures, 8 tables

  13. arXiv:1809.00939  [pdf, other

    cs.IR cs.CR cs.DC

    Decentralized Search on Decentralized Web

    Authors: Ziliang Lai, Chris Liu, Eric Lo, Ben Kao, Siu-Ming Yiu

    Abstract: Decentralized Web, or DWeb, is envisioned as a promising future of the Web. Being decentralized, there are no dedicated web servers in DWeb; Devices that retrieve web contents also serve their cached data to peer devices with straight privacy-preserving mechanisms. The fact that contents in DWeb are distributed, replicated, and decentralized lead to a number of key advantages over the conventional… ▽ More

    Submitted 18 August, 2018; originally announced September 2018.

  14. arXiv:1512.00965  [pdf, other

    cs.AI cs.CL cs.LG cs.NE

    Neural Enquirer: Learning to Query Tables with Natural Language

    Authors: Pengcheng Yin, Zhengdong Lu, Hang Li, Ben Kao

    Abstract: We proposed Neural Enquirer as a neural network architecture to execute a natural language (NL) query on a knowledge-base (KB) for answers. Basically, Neural Enquirer finds the distributed representation of a query and then executes it on knowledge-base tables to obtain the answer as one of the values in the tables. Unlike similar efforts in end-to-end training of semantic parsers, Neural Enquirer… ▽ More

    Submitted 20 January, 2016; v1 submitted 3 December, 2015; originally announced December 2015.