Skip to main content

Showing 1–7 of 7 results for author: Koh, H Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.05800  [pdf, other

    cs.LG cs.AI

    Graph Spatiotemporal Process for Multivariate Time Series Anomaly Detection with Missing Values

    Authors: Yu Zheng, Huan Yee Koh, Ming **, Lianhua Chi, Haishuai Wang, Khoa T. Phan, Yi-** Phoebe Chen, Shirui Pan, Wei Xiang

    Abstract: The detection of anomalies in multivariate time series data is crucial for various practical applications, including smart power grids, traffic flow forecasting, and industrial process control. However, real-world time series data is usually not well-structured, posting significant challenges to existing approaches: (1) The existence of missing values in multivariate time series data along variabl… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: Accepted by Information Fusion

  2. arXiv:2310.07984  [pdf

    cs.AI cs.CE

    Large Language Models for Scientific Synthesis, Inference and Explanation

    Authors: Yizhen Zheng, Huan Yee Koh, Jiaxin Ju, Anh T. N. Nguyen, Lauren T. May, Geoffrey I. Webb, Shirui Pan

    Abstract: Large language models are a form of artificial intelligence systems whose primary knowledge consists of the statistical patterns, semantic relationships, and syntactical structures of language1. Despite their limited forms of "knowledge", these systems are adept at numerous complex tasks including creative writing, storytelling, translation, question-answering, summarization, and computer code gen… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: Supplementary Information: https://drive.google.com/file/d/1KrpUpzuFTeMx6a6zl18lqdo8vV-UUa1Z/view?usp=sharing Github Repo: https://github.com/zyzisastudyreallyhardguy/LLM4SD

  3. Correlation-aware Spatial-Temporal Graph Learning for Multivariate Time-series Anomaly Detection

    Authors: Yu Zheng, Huan Yee Koh, Ming **, Lianhua Chi, Khoa T. Phan, Shirui Pan, Yi-** Phoebe Chen, Wei Xiang

    Abstract: Multivariate time-series anomaly detection is critically important in many applications, including retail, transportation, power grid, and water treatment plants. Existing approaches for this problem mostly employ either statistical models which cannot capture the non-linear relations well or conventional deep learning models (e.g., CNN and LSTM) that do not explicitly learn the pairwise correlati… ▽ More

    Submitted 16 November, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: 17 pages, double columns, 10 tables, 3 figures. Accepted to IEEE Transactions on Neural Networks and Learning Systems (TNNLS)

  4. arXiv:2307.03759  [pdf, other

    cs.LG cs.AI

    A Survey on Graph Neural Networks for Time Series: Forecasting, Classification, Imputation, and Anomaly Detection

    Authors: Ming **, Huan Yee Koh, Qingsong Wen, Daniele Zambon, Cesare Alippi, Geoffrey I. Webb, Irwin King, Shirui Pan

    Abstract: Time series are the primary data type used to record dynamic system measurements and generated in great volume by both physical sensors and online processes (virtual sensors). Time series analytics is therefore crucial to unlocking the wealth of information implicit in available data. With the recent advancements in graph neural networks (GNNs), there has been a surge in GNN-based approaches for t… ▽ More

    Submitted 9 August, 2023; v1 submitted 7 July, 2023; originally announced July 2023.

    Comments: Ongoing work; 27 pages, 6 figures, 5 tables; Github page: https://github.com/KimMeen/Awesome-GNN4TS

  5. arXiv:2210.16732  [pdf, other

    cs.CL

    How Far are We from Robust Long Abstractive Summarization?

    Authors: Huan Yee Koh, Jiaxin Ju, He Zhang, Ming Liu, Shirui Pan

    Abstract: Abstractive summarization has made tremendous progress in recent years. In this work, we perform fine-grained human annotations to evaluate long document abstractive summarization systems (i.e., models and metrics) with the aim of implementing them to generate reliable summaries. For long document abstractive models, we show that the constant strive for state-of-the-art ROUGE results can lead us t… ▽ More

    Submitted 29 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022

  6. An Empirical Survey on Long Document Summarization: Datasets, Models and Metrics

    Authors: Huan Yee Koh, Jiaxin Ju, Ming Liu, Shirui Pan

    Abstract: Long documents such as academic articles and business reports have been the standard format to detail out important issues and complicated subjects that require extra attention. An automatic summarization system that can effectively condense long documents into short and concise texts to encapsulate the most important information would thus be significant in aiding the reader's comprehension. Rece… ▽ More

    Submitted 2 July, 2022; originally announced July 2022.

    Comments: Accepted for publication by ACM Computing Surveys

  7. arXiv:2110.01280  [pdf, other

    cs.CL

    Leveraging Information Bottleneck for Scientific Document Summarization

    Authors: Jiaxin Ju, Ming Liu, Huan Yee Koh, Yuan **, Lan Du, Shirui Pan

    Abstract: This paper presents an unsupervised extractive approach to summarize scientific long documents based on the Information Bottleneck principle. Inspired by previous work which uses the Information Bottleneck principle for sentence compression, we extend it to document level summarization with two separate steps. In the first step, we use signal(s) as queries to retrieve the key content from the sour… ▽ More

    Submitted 4 October, 2021; originally announced October 2021.

    Comments: Accepted at EMNLP 2021 Findings