Skip to main content

Showing 1–29 of 29 results for author: Dong, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.12439  [pdf, other

    cs.LG

    A data-centric approach for assessing progress of Graph Neural Networks

    Authors: Tianqi Zhao, Ngan Thi Dong, Alan Hanjalic, Megha Khosla

    Abstract: Graph Neural Networks (GNNs) have achieved state-of-the-art results in node classification tasks. However, most improvements are in multi-class classification, with less focus on the cases where each node could have multiple labels. The first challenge in studying multi-label node classification is the scarcity of publicly available datasets. To address this, we collected and released three real-w… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Journal ref: Published in Data-centric Machine Learning Research Worshop @ ICML 2024

  2. arXiv:2404.08501  [pdf, ps, other

    cs.NE cs.AI

    Analyzing and Overcoming Local Optima in Complex Multi-Objective Optimization by Decomposition-Based Evolutionary Algorithms

    Authors: Ting Dong, Haoxin Wang, Hengxi Zhang, Wenbo Ding

    Abstract: When addressing the challenge of complex multi-objective optimization problems, particularly those with non-convex and non-uniform Pareto fronts, Decomposition-based Multi-Objective Evolutionary Algorithms (MOEADs) often converge to local optima, thereby limiting solution diversity. Despite its significance, this issue has received limited theoretical exploration. Through a comprehensive geometric… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

  3. arXiv:2404.03233  [pdf, other

    cs.CR

    Learn What You Want to Unlearn: Unlearning Inversion Attacks against Machine Unlearning

    Authors: Hongsheng Hu, Shuo Wang, Tian Dong, Minhui Xue

    Abstract: Machine unlearning has become a promising solution for fulfilling the "right to be forgotten", under which individuals can request the deletion of their data from machine learning models. However, existing studies of machine unlearning mainly focus on the efficacy and efficiency of unlearning methods, while neglecting the investigation of the privacy vulnerability during the unlearning process. Wi… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: To Appear in the 45th IEEE Symposium on Security and Privacy, May 20-23, 2024

  4. arXiv:2403.15297  [pdf, other

    cs.AI

    Sphere Neural-Networks for Rational Reasoning

    Authors: Tiansi Dong, Mateja Jamnik, Pietro Liò

    Abstract: The success of Large Language Models (LLMs), e.g., ChatGPT, is witnessed by their planetary popularity, their capability of human-like communication, and also by their steadily improved reasoning performance. However, it remains unclear whether LLMs reason. It is an open problem how traditional neural networks can be qualitatively extended to go beyond the statistic paradigm and achieve high-level… ▽ More

    Submitted 24 June, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

  5. arXiv:2312.00374  [pdf, other

    cs.CR

    The Philosopher's Stone: Trojaning Plugins of Large Language Models

    Authors: Tian Dong, Minhui Xue, Guoxing Chen, Rayne Holland, Shaofeng Li, Yan Meng, Zhen Liu, Hao** Zhu

    Abstract: Open-source Large Language Models (LLMs) have recently gained popularity because of their comparable performance to proprietary LLMs. To efficiently fulfill domain-specialized tasks, open-source LLMs can be refined, without expensive accelerators, using low-rank adapters. However, it is still unknown whether low-rank adapters can be exploited to control LLMs. To address this gap, we demonstrate th… ▽ More

    Submitted 13 March, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

  6. arXiv:2311.07766  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Vision-Language Integration in Multimodal Video Transformers (Partially) Aligns with the Brain

    Authors: Dota Tianai Dong, Mariya Toneva

    Abstract: Integrating information from multiple modalities is arguably one of the essential prerequisites for grounding artificial intelligence systems with an understanding of the real world. Recent advances in video transformers that jointly learn from vision, text, and sound over time have made some progress toward this goal, but the degree to which these models integrate information from modalities stil… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  7. arXiv:2307.16663  [pdf, other

    cs.CL cs.AI cs.LG

    Word Sense Disambiguation as a Game of Neurosymbolic Darts

    Authors: Tiansi Dong, Rafet Sifa

    Abstract: Word Sense Disambiguation (WSD) is one of the hardest tasks in natural language understanding and knowledge engineering. The glass ceiling of 80% F1 score is recently achieved through supervised deep-learning, enriched by a variety of knowledge graphs. Here, we propose a novel neurosymbolic methodology that is able to push the F1 score above 90%. The core of our methodology is a neurosymbolic sens… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

  8. arXiv:2305.10263  [pdf, other

    cs.CL

    M3KE: A Massive Multi-Level Multi-Subject Knowledge Evaluation Benchmark for Chinese Large Language Models

    Authors: Chuang Liu, Renren **, Yuqi Ren, Linhao Yu, Tianyu Dong, Xiaohan Peng, Shuting Zhang, Jianxiang Peng, Peiyi Zhang, Qingqing Lyu, Xiaowen Su, Qun Liu, Deyi Xiong

    Abstract: Large language models have recently made tremendous progress in a variety of aspects, e.g., cross-task generalization, instruction following. Comprehensively evaluating the capability of large language models in multiple tasks is of great importance. In this paper, we propose M3KE, a Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark, which is developed to measure knowledge acquired… ▽ More

    Submitted 20 May, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  9. arXiv:2304.10398  [pdf, other

    cs.LG

    Multi-label Node Classification On Graph-Structured Data

    Authors: Tianqi Zhao, Ngan Thi Dong, Alan Hanjalic, Megha Khosla

    Abstract: Graph Neural Networks (GNNs) have shown state-of-the-art improvements in node classification tasks on graphs. While these improvements have been largely demonstrated in a multi-class classification scenario, a more general and realistic scenario in which each node could have multiple labels has so far received little attention. The first challenge in conducting focused studies on multi-label node… ▽ More

    Submitted 29 February, 2024; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: Published in TMLR 2023. Link: https://openreview.net/forum?id=EZhkV2BjDP

    Journal ref: Transaction Of Machine Learning Research, 2835-8856, 2023

  10. arXiv:2303.09158  [pdf, other

    cs.CV

    Facial Affect Recognition based on Transformer Encoder and Audiovisual Fusion for the ABAW5 Challenge

    Authors: Ziyang Zhang, Liuwei An, Zishun Cui, Ao xu, Tengteng Dong, Yueqi Jiang, **gyi Shi, Xin Liu, Xiao Sun, Meng Wang

    Abstract: In this paper, we present our solutions for the 5th Workshop and Competition on Affective Behavior Analysis in-the-wild (ABAW), which includes four sub-challenges of Valence-Arousal (VA) Estimation, Expression (Expr) Classification, Action Unit (AU) Detection and Emotional Reaction Intensity (ERI) Estimation. The 5th ABAW competition focuses on facial affect recognition utilizing different modalit… ▽ More

    Submitted 20 March, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

  11. arXiv:2212.11751  [pdf, other

    cs.CR cs.LG

    Mind Your Heart: Stealthy Backdoor Attack on Dynamic Deep Neural Network in Edge Computing

    Authors: Tian Dong, Ziyuan Zhang, Han Qiu, Tianwei Zhang, Hewu Li, Terry Wang

    Abstract: Transforming off-the-shelf deep neural network (DNN) models into dynamic multi-exit architectures can achieve inference and transmission efficiency by fragmenting and distributing a large DNN model in edge computing scenarios (e.g., edge devices and cloud servers). In this paper, we propose a novel backdoor attack specifically on the dynamic multi-exit DNN models. Particularly, we inject a backdoo… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

    Comments: Accepted to IEEE INFOCOM 2023

  12. arXiv:2206.00240  [pdf, other

    cs.CR cs.LG

    Privacy for Free: How does Dataset Condensation Help Privacy?

    Authors: Tian Dong, Bo Zhao, Lingjuan Lyu

    Abstract: To prevent unintentional data leakage, research community has resorted to data generators that can produce differentially private data for model training. However, for the sake of the data privacy, existing solutions suffer from either expensive training cost or poor generalization performance. Therefore, we raise the question whether training efficiency and privacy can be achieved simultaneously.… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

    Comments: Accepted by ICML 2022 as Oral

  13. arXiv:2202.02139  [pdf, other

    cs.NI cs.AI

    Multi Objective Resource Optimization of Wireless Network Based on Cross Domain Virtual Network Embedding

    Authors: Chao Wang, Tao Dong, Youxiang Duan, Qifeng Sun, Peiying Zhang

    Abstract: The rapid development of virtual network architecture makes it possible for wireless network to be widely used. With the popularity of artificial intelligence (AI) industry in daily life, efficient resource allocation of wireless network has become a problem. Especially when network users request wireless network resources from different management domains, they still face many practical problems.… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

  14. arXiv:2201.03134  [pdf, other

    cs.CR cs.LG

    An Interpretable Federated Learning-based Network Intrusion Detection Framework

    Authors: Tian Dong, Song Li, Han Qiu, Jialiang Lu

    Abstract: Learning-based Network Intrusion Detection Systems (NIDSs) are widely deployed for defending various cyberattacks. Existing learning-based NIDS mainly uses Neural Network (NN) as a classifier that relies on the quality and quantity of cyberattack data. Such NN-based approaches are also hard to interpret for improving efficiency and scalability. In this paper, we design a new local-global computati… ▽ More

    Submitted 9 January, 2022; originally announced January 2022.

    Comments: 12 pages, draft

  15. A multitask transfer learning framework for the prediction of virus-human protein-protein interactions

    Authors: Thi Ngan Dong, Graham Brogden, Gisa Gerold, Megha Khosla

    Abstract: Viral infections are causing significant morbidity and mortality worldwide. Understanding the interaction patterns between a particular virus and human proteins plays a crucial role in unveiling the underlying mechanism of viral infection and pathogenesis. This could further help in the prevention and treatment of virus-related diseases. However, the task of predicting protein-protein interactions… ▽ More

    Submitted 26 November, 2021; originally announced November 2021.

    Journal ref: BMC Bioinformatics 2021

  16. arXiv:2111.10085  [pdf, other

    cs.CR cs.LG

    Mate! Are You Really Aware? An Explainability-Guided Testing Framework for Robustness of Malware Detectors

    Authors: Ruoxi Sun, Minhui Xue, Gareth Tyson, Tian Dong, Shaofeng Li, Shuo Wang, Hao** Zhu, Seyit Camtepe, Surya Nepal

    Abstract: Numerous open-source and commercial malware detectors are available. However, their efficacy is threatened by new adversarial attacks, whereby malware attempts to evade detection, e.g., by performing feature-space manipulation. In this work, we propose an explainability-guided and model-agnostic testing framework for robustness of malware detectors when confronted with adversarial attacks. The fra… ▽ More

    Submitted 27 November, 2023; v1 submitted 19 November, 2021; originally announced November 2021.

    Comments: Accepted at ESEC/FSE 2023. https://doi.org/10.1145/3611643.3616309

  17. arXiv:2110.03175  [pdf, other

    cs.CR cs.AI

    Fingerprinting Multi-exit Deep Neural Network Models via Inference Time

    Authors: Tian Dong, Han Qiu, Tianwei Zhang, Jiwei Li, Hewu Li, Jialiang Lu

    Abstract: Transforming large deep neural network (DNN) models into the multi-exit architectures can overcome the overthinking issue and distribute a large DNN model on resource-constrained scenarios (e.g. IoT frontend devices and backend servers) for inference and transmission efficiency. Nevertheless, intellectual property (IP) protection for the multi-exit models in the wild is still an unsolved challenge… ▽ More

    Submitted 7 October, 2021; originally announced October 2021.

  18. arXiv:2108.04820  [pdf, other

    q-bio.QM cs.LG

    MuCoMiD: A Multitask Convolutional Learning Framework for miRNA-Disease Association Prediction

    Authors: Thi Ngan Dong, Megha Khosla

    Abstract: Growing evidence from recent studies implies that microRNA or miRNA could serve as biomarkers in various complex human diseases. Since wet-lab experiments are expensive and time-consuming, computational techniques for miRNA-disease association prediction have attracted a lot of attention in recent years. Data scarcity is one of the major challenges in building reliable machine learning models. Dat… ▽ More

    Submitted 29 November, 2021; v1 submitted 8 August, 2021; originally announced August 2021.

  19. arXiv:2106.04174  [pdf, other

    cs.CL cs.AI

    Interpretable and Low-Resource Entity Matching via Decoupling Feature Learning from Decision Making

    Authors: Zijun Yao, Chengjiang Li, Tiansi Dong, Xin Lv, Jifan Yu, Lei Hou, Juanzi Li, Yichi Zhang, Zelin Dai

    Abstract: Entity Matching (EM) aims at recognizing entity records that denote the same real-world object. Neural EM models learn vector representation of entity descriptions and match entities end-to-end. Though robust, these methods require many resources for training, and lack of interpretability. In this paper, we propose a novel EM framework that consists of Heterogeneous Information Fusion (HIF) and Ke… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

  20. arXiv:2105.00164  [pdf, other

    cs.CL cs.CR

    Hidden Backdoors in Human-Centric Language Models

    Authors: Shaofeng Li, Hui Liu, Tian Dong, Benjamin Zi Hao Zhao, Minhui Xue, Hao** Zhu, Jialiang Lu

    Abstract: Natural language processing (NLP) systems have been proven to be vulnerable to backdoor attacks, whereby hidden features (backdoors) are trained into a language model and may only be activated by specific inputs (called triggers), to trick the model into producing unexpected behaviors. In this paper, we create covert and natural triggers for textual backdoor attacks, \textit{hidden backdoors}, whe… ▽ More

    Submitted 28 September, 2021; v1 submitted 1 May, 2021; originally announced May 2021.

  21. arXiv:2010.15015  [pdf, ps, other

    cs.HC

    Towards Supporting Programming Education at Scale via Live Streaming

    Authors: Yan Chen, Walter S. Lasecki, Tao Dong

    Abstract: Live streaming, which allows streamers to broadcast their work to live viewers, is an emerging practice for teaching and learning computer programming. Participation in live streaming is growing rapidly, despite several apparent challenges, such as a general lack of training in pedagogy among streamers and scarce signals about a stream's characteristics (e.g., difficulty, style, and usefulness) to… ▽ More

    Submitted 28 October, 2020; originally announced October 2020.

    Comments: Accepted to ACM CSCW 2020

    ACM Class: H.5.1; J.4

  22. arXiv:2007.07320  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Syllogism with Euler Neural-Networks

    Authors: Tiansi Dong, Chengjiang Li, Christian Bauckhage, Juanzi Li, Stefan Wrobel, Armin B. Cremers

    Abstract: Traditional neural networks represent everything as a vector, and are able to approximate a subset of logical reasoning to a certain degree. As basic logic relations are better represented by topological relations between regions, we propose a novel neural network that represents everything as a ball and is able to learn topological configuration as an Euler diagram. So comes the name Euler Neural… ▽ More

    Submitted 20 July, 2020; v1 submitted 14 July, 2020; originally announced July 2020.

    Comments: 16 pages, 6 figures

  23. arXiv:1910.06732  [pdf, other

    physics.soc-ph cs.SI

    Cross-sectional Urban Scaling Fails in Predicting Temporal Growth of Cities

    Authors: Gang Xu, Zhengzi Zhou, Limin Jiao, Ting Dong, Ruiqi Li

    Abstract: Numerous urban indicators scale with population in a power law across cities, but whether the cross-sectional scaling law is applicable to the temporal growth of individual cities is unclear. Here we first find two paradoxical scaling relationships that urban built-up area sub-linearly scales with population across cities, but super-linearly scales with population over time in most individual citi… ▽ More

    Submitted 17 October, 2019; v1 submitted 13 October, 2019; originally announced October 2019.

  24. arXiv:1905.09483  [pdf, other

    cs.RO

    Towards Generation and Evaluation of Comprehensive Map** Robot Datasets

    Authors: Hongyu Chen, Xiting Zhao, Jianwen Luo, Zhijie Yang, Zehao Zhao, Haochuan Wan, Xiaoya Ye, Guangyuan Weng, Zhenpeng He, Tian Dong, Sören Schwertfeger

    Abstract: This paper presents a fully hardware synchronized map** robot with support for a hardware synchronized external tracking system, for super-precise timing and localization. We also employ a professional, static 3D scanner for ground truth map collection. Three datasets are generated to evaluate the performance of map** algorithms within a room and between rooms. Based on these datasets we gener… ▽ More

    Submitted 24 August, 2019; v1 submitted 23 May, 2019; originally announced May 2019.

  25. arXiv:1811.10776  [pdf

    cs.CL

    Joint Representation Learning of Cross-lingual Words and Entities via Attentive Distant Supervision

    Authors: Yixin Cao, Lei Hou, Juanzi Li, Zhiyuan Liu, Chengjiang Li, Xu Chen, Tiansi Dong

    Abstract: Joint representation learning of words and entities benefits many NLP tasks, but has not been well explored in cross-lingual settings. In this paper, we propose a novel method for joint representation learning of cross-lingual words and entities. It captures mutually complementary knowledge, and enables cross-lingual inferences among knowledge bases and texts. Our method does not require parallel… ▽ More

    Submitted 26 November, 2018; originally announced November 2018.

    Comments: 11 pages, EMNLP2018

  26. arXiv:1807.04191  [pdf, other

    cs.HC cs.CV

    A Computational Method for Evaluating UI Patterns

    Authors: Bardia Doosti, Tao Dong, Biplab Deka, Jeffrey Nichols

    Abstract: UI design languages, such as Google's Material Design, make applications both easier to develop and easier to learn by providing a set of standard UI components. Nonetheless, it is hard to assess the impact of design languages in the wild. Moreover, designers often get stranded by strong-opinionated debates around the merit of certain UI components, such as the Floating Action Button and the Navig… ▽ More

    Submitted 11 July, 2018; originally announced July 2018.

  27. arXiv:1806.01743  [pdf, other

    q-fin.PM cs.LG stat.ML

    A Machine Learning Framework for Stock Selection

    Authors: XingYu Fu, **Hong Du, YiFeng Guo, MingWen Liu, Tao Dong, XiuWen Duan

    Abstract: This paper demonstrates how to apply machine learning algorithms to distinguish good stocks from the bad stocks. To this end, we construct 244 technical and fundamental features to characterize each stock, and label stocks according to their ranking with respect to the return-to-volatility ratio. Algorithms ranging from traditional statistical learning methods to recently popular deep learning met… ▽ More

    Submitted 8 August, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

  28. arXiv:1711.07835  [pdf

    cs.CV

    Robust Object Tracking Based on Self-adaptive Search Area

    Authors: Taihang Dong, Sheng Zhong

    Abstract: Discriminative correlation filter (DCF) based trackers have recently achieved excellent performance with great computational efficiency. However, DCF based trackers suffer boundary effects, which result in unstable performance in challenging situations exhibiting fast motion. In this paper, we propose a novel method to mitigate this side-effect in DCF based trackers. We change the search area acco… ▽ More

    Submitted 15 November, 2020; v1 submitted 21 November, 2017; originally announced November 2017.

    Comments: 10 pages, 4 figures, 3 tables, SPIE 10th International Symposium on Multispectral Image Processing and Pattern Recognition

  29. arXiv:1711.07829  [pdf

    cs.CV

    Discussion among Different Methods of Updating Model Filter in Object Tracking

    Authors: Taihang Dong, Sheng Zhong

    Abstract: Discriminative correlation filters (DCF) have recently shown excellent performance in visual object tracking area. In this paper, we summarize the methods of updating model filter from discriminative correlation filter (DCF) based tracking algorithms and analyzes similarities and differences among these methods. We deduce the relationship between updating coefficient in high dimension (kernel tric… ▽ More

    Submitted 15 November, 2020; v1 submitted 21 November, 2017; originally announced November 2017.

    Comments: 8 pages, 3 figures, SPIE 10th International Symposium on Multispectral Image Processing and Pattern Recognition