Skip to main content

Showing 1–50 of 61 results for author: Aggarwal, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.11685  [pdf, other

    cs.LG cs.SI

    Edge Classification on Graphs: New Directions in Topological Imbalance

    Authors: Xueqi Cheng, Yu Wang, Yunchao Liu, Yuying Zhao, Charu C. Aggarwal, Tyler Derr

    Abstract: Recent years have witnessed the remarkable success of applying Graph machine learning (GML) to node/graph classification and link prediction. However, edge classification task that enjoys numerous real-world applications such as social network analysis and cybersecurity, has not seen significant advancement. To address this gap, our study pioneers a comprehensive approach to edge classification. W… ▽ More

    Submitted 17 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2405.16879  [pdf, other

    cs.LG cs.AI

    Unsupervised Generative Feature Transformation via Graph Contrastive Pre-training and Multi-objective Fine-tuning

    Authors: Wangyang Ying, Dongjie Wang, Xuanming Hu, Yuanchun Zhou, Charu C. Aggarwal, Yanjie Fu

    Abstract: Feature transformation is to derive a new feature set from original features to augment the AI power of data. In many science domains such as material performance screening, while feature transformation can model material formula interactions and compositions and discover performance drivers, supervised labels are collected from expensive and lengthy experiments. This issue motivates an Unsupervis… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  3. arXiv:2405.09591  [pdf, other

    cs.LG cs.AI

    A Comprehensive Survey on Data Augmentation

    Authors: Zaitian Wang, Pengfei Wang, Kunpeng Liu, Pengyang Wang, Yanjie Fu, Chang-Tien Lu, Charu C. Aggarwal, Jian Pei, Yuanchun Zhou

    Abstract: Data augmentation is a series of techniques that generate high-quality artificial data by manipulating existing data samples. By leveraging data augmentation techniques, AI models can achieve significantly improved applicability in tasks involving scarce or imbalanced datasets, thereby substantially enhancing AI models' generalization capabilities. Existing literature surveys only focus on a certa… ▽ More

    Submitted 17 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  4. InsightNet: Structured Insight Mining from Customer Feedback

    Authors: Sandeep Sricharan Mukku, Manan Soni, Jitenkumar Rana, Chetan Aggarwal, Promod Yenigalla, Rashmi Patange, Shyam Mohan

    Abstract: We propose InsightNet, a novel approach for the automated extraction of structured insights from customer reviews. Our end-to-end machine learning framework is designed to overcome the limitations of current solutions, including the absence of structure for identified topics, non-standard aspect names, and lack of abundant training data. The proposed solution builds a semi-supervised multi-level t… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: EMNLP 2023

  5. arXiv:2402.12541  [pdf, other

    cs.IR

    Leveraging Opposite Gender Interaction Ratio as a Path towards Fairness in Online Dating Recommendations Based on User Sexual Orientation

    Authors: Yuying Zhao, Yu Wang, Yi Zhang, Pamela Wisniewski, Charu Aggarwal, Tyler Derr

    Abstract: Online dating platforms have gained widespread popularity as a means for individuals to seek potential romantic relationships. While recommender systems have been designed to improve the user experience in dating platforms by providing personalized recommendations, increasing concerns about fairness have encouraged the development of fairness-aware recommender systems from various perspectives (e.… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: Accepted by AAAI 2024

  6. arXiv:2402.08241  [pdf, other

    cs.IR

    Causal Learning for Trustworthy Recommender Systems: A Survey

    Authors: ** Li, Shou** Wang, Qi Zhang, Longbing Cao, Fang Chen, Xiuzhen Zhang, Dietmar Jannach, Charu C. Aggarwal

    Abstract: Recommender Systems (RS) have significantly advanced online content discovery and personalized decision-making. However, emerging vulnerabilities in RS have catalyzed a paradigm shift towards Trustworthy RS (TRS). Despite numerous progress on TRS, most of them focus on data correlations while overlooking the fundamental causal nature in recommendation. This drawback hinders TRS from identifying th… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  7. arXiv:2402.01943  [pdf, other

    cs.LG

    Precedence-Constrained Winter Value for Effective Graph Data Valuation

    Authors: Hongliang Chi, Wei **, Charu Aggarwal, Yao Ma

    Abstract: Data valuation is essential for quantifying data's worth, aiding in assessing data quality and determining fair compensation. While existing data valuation methods have proven effective in evaluating the value of Euclidean data, they face limitations when applied to the increasingly popular graph-structured data. Particularly, graph data valuation introduces unique challenges, primarily stemming f… ▽ More

    Submitted 8 March, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 17 pages in total

  8. arXiv:2311.01024  [pdf, other

    cs.LG cs.AI

    Distance-Based Propagation for Efficient Knowledge Graph Reasoning

    Authors: Harry Shomer, Yao Ma, Juanhui Li, Bo Wu, Charu C. Aggarwal, Jiliang Tang

    Abstract: Knowledge graph completion (KGC) aims to predict unseen edges in knowledge graphs (KGs), resulting in the discovery of new facts. A new class of methods have been proposed to tackle this problem by aggregating path information. These methods have shown tremendous ability in the task of KGC. However they are plagued by efficiency issues. Though there are a few recent attempts to address this throug… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  9. arXiv:2308.09296  [pdf, other

    cs.LG cs.NE

    CARLA: Self-supervised Contrastive Representation Learning for Time Series Anomaly Detection

    Authors: Zahra Zamanzadeh Darban, Geoffrey I. Webb, Shirui Pan, Charu C. Aggarwal, Mahsa Salehi

    Abstract: One main challenge in time series anomaly detection (TSAD) is the lack of labelled data in many real-life scenarios. Most of the existing anomaly detection methods focus on learning the normal behaviour of unlabelled time series in an unsupervised manner. The normal boundary is often defined tightly, resulting in slight deviations being classified as anomalies, consequently leading to a high false… ▽ More

    Submitted 7 April, 2024; v1 submitted 18 August, 2023; originally announced August 2023.

    Comments: 35 pages, 9 figures, 10 tables

  10. arXiv:2307.04644  [pdf, other

    cs.IR

    Fairness and Diversity in Recommender Systems: A Survey

    Authors: Yuying Zhao, Yu Wang, Yunchao Liu, Xueqi Cheng, Charu Aggarwal, Tyler Derr

    Abstract: Recommender systems are effective tools for mitigating information overload and have seen extensive applications across various domains. However, the single focus on utility goals proves to be inadequate in addressing real-world concerns, leading to increasing attention to fairness-aware and diversity-aware recommender systems. While most existing studies explore fairness and diversity independent… ▽ More

    Submitted 1 March, 2024; v1 submitted 10 July, 2023; originally announced July 2023.

  11. arXiv:2306.02002  [pdf, other

    cs.LG cs.AI cs.CR

    Can Directed Graph Neural Networks be Adversarially Robust?

    Authors: Zhichao Hou, Xitong Zhang, Wei Wang, Charu C. Aggarwal, Xiaorui Liu

    Abstract: The existing research on robust Graph Neural Networks (GNNs) fails to acknowledge the significance of directed graphs in providing rich information about networks' inherent structure. This work presents the first investigation into the robustness of GNNs in the context of directed graphs, aiming to harness the profound trust implications offered by directed graphs to bolster the robustness and res… ▽ More

    Submitted 3 June, 2023; originally announced June 2023.

  12. arXiv:2306.01958  [pdf, other

    cs.LG cs.AI

    A Survey on Explainability of Graph Neural Networks

    Authors: Jaykumar Kakkad, Jaspal Jannu, Kartik Sharma, Charu Aggarwal, Sourav Medya

    Abstract: Graph neural networks (GNNs) are powerful graph-based deep-learning models that have gained significant attention and demonstrated remarkable performance in various domains, including natural language processing, drug discovery, and recommendation systems. However, combining feature information and combinatorial graph structures has led to complex non-linear GNN models. Consequently, this has incr… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: submitted to Bulletin of the IEEE Computer Society Technical Committee on Data Engineering

  13. arXiv:2305.15822  [pdf, other

    cs.LG

    Towards Label Position Bias in Graph Neural Networks

    Authors: Haoyu Han, Xiaorui Liu, Feng Shi, MohamadAli Torkamani, Charu C. Aggarwal, Jiliang Tang

    Abstract: Graph Neural Networks (GNNs) have emerged as a powerful tool for semi-supervised node classification tasks. However, recent studies have revealed various biases in GNNs stemming from both node features and graph topology. In this work, we uncover a new bias - label position bias, which indicates that the node closer to the labeled nodes tends to perform better. We introduce a new metric, the Label… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

  14. arXiv:2305.14851  [pdf, other

    cs.CR

    Sharpness-Aware Data Poisoning Attack

    Authors: Pengfei He, Han Xu, Jie Ren, Yingqian Cui, Hui Liu, Charu C. Aggarwal, Jiliang Tang

    Abstract: Recent research has highlighted the vulnerability of Deep Neural Networks (DNNs) against data poisoning attacks. These attacks aim to inject poisoning samples into the models' training dataset such that the trained models have inference failures. While previous studies have executed different types of attacks, one major challenge that greatly limits their effectiveness is the uncertainty of the re… ▽ More

    Submitted 7 May, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

  15. arXiv:2304.01391  [pdf, other

    cs.LG

    Counterfactual Learning on Graphs: A Survey

    Authors: Zhimeng Guo, Teng Xiao, Zongyu Wu, Charu Aggarwal, Hui Liu, Suhang Wang

    Abstract: Graph-structured data are pervasive in the real-world such as social networks, molecular graphs and transaction networks. Graph neural networks (GNNs) have achieved great success in representation learning on graphs, facilitating various downstream tasks. However, GNNs have several drawbacks such as lacking interpretability, can easily inherit the bias of data and cannot model casual relations. Re… ▽ More

    Submitted 24 March, 2024; v1 submitted 3 April, 2023; originally announced April 2023.

  16. Heterogeneous Social Event Detection via Hyperbolic Graph Representations

    Authors: Zitai Qiu, Jia Wu, Jian Yang, Xing Su, Charu C. Aggarwal

    Abstract: Social events reflect the dynamics of society and, here, natural disasters and emergencies receive significant attention. The timely detection of these events can provide organisations and individuals with valuable information to reduce or avoid losses. However, due to the complex heterogeneities of the content and structure of social media, existing models can only learn limited information; larg… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

  17. arXiv:2301.05860  [pdf, other

    cs.LG cs.AI

    State of the Art and Potentialities of Graph-level Learning

    Authors: Zhenyu Yang, Ge Zhang, Jia Wu, Jian Yang, Quan Z. Sheng, Shan Xue, Chuan Zhou, Charu Aggarwal, Hao Peng, Wenbin Hu, Edwin Hancock, Pietro Liò

    Abstract: Graphs have a superior ability to represent relational data, like chemical compounds, proteins, and social networks. Hence, graph-level learning, which takes a set of graphs as input, has been applied to many tasks including comparison, regression, classification, and more. Traditional approaches to learning a set of graphs heavily rely on hand-crafted features, such as substructures. But while th… ▽ More

    Submitted 25 May, 2023; v1 submitted 14 January, 2023; originally announced January 2023.

  18. arXiv:2212.05532  [pdf, other

    cs.LG cs.SI

    Graph Learning for Anomaly Analytics: Algorithms, Applications, and Challenges

    Authors: **g Ren, Feng Xia, Azadeh Noori Hoshyar, Charu C. Aggarwal

    Abstract: Anomaly analytics is a popular and vital task in various research contexts, which has been studied for several decades. At the same time, deep learning has shown its capacity in solving many graph-based tasks like, node classification, link prediction, and graph classification. Recently, many studies are extending graph learning models for solving anomaly analytics problems, resulting in beneficia… ▽ More

    Submitted 11 December, 2022; originally announced December 2022.

  19. arXiv:2211.05244  [pdf, other

    cs.LG cs.AI

    Deep Learning for Time Series Anomaly Detection: A Survey

    Authors: Zahra Zamanzadeh Darban, Geoffrey I. Webb, Shirui Pan, Charu C. Aggarwal, Mahsa Salehi

    Abstract: Time series anomaly detection has applications in a wide range of research fields and applications, including manufacturing and healthcare. The presence of anomalies can indicate novel or unexpected events, such as production faults, system defects, or heart fluttering, and is therefore of particular interest. The large size and complex patterns of time series have led researchers to develop speci… ▽ More

    Submitted 28 May, 2024; v1 submitted 9 November, 2022; originally announced November 2022.

    Comments: 42 pages, 12 figures, 5 tables

  20. arXiv:2210.09766  [pdf, other

    cs.LG

    DAGAD: Data Augmentation for Graph Anomaly Detection

    Authors: Fanzhen Liu, Xiaoxiao Ma, Jia Wu, Jian Yang, Shan Xue, Amin Beheshti, Chuan Zhou, Hao Peng, Quan Z. Sheng, Charu C. Aggarwal

    Abstract: Graph anomaly detection in this paper aims to distinguish abnormal nodes that behave differently from the benign ones accounting for the majority of graph-structured instances. Receiving increasing attention from both academia and industry, yet existing research on this task still suffers from two critical issues when learning informative anomalous behavior from graph data. For one thing, anomalie… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: Regular paper accepted by the 22nd IEEE International Conference on Data Mining (ICDM 2022)

  21. arXiv:2209.12618  [pdf, ps, other

    cs.AI cs.SC

    Survey on Applications of Neurosymbolic Artificial Intelligence

    Authors: Djallel Bouneffouf, Charu C. Aggarwal

    Abstract: In recent years, the Neurosymbolic framework has attracted a lot of attention in various applications, from recommender systems and information retrieval to healthcare and finance. This success is due to its stellar performance combined with attractive properties, such as learning and reasoning. The new emerging Neurosymbolic field is currently experiencing a renaissance, as novel frameworks and a… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

  22. arXiv:2208.01820  [pdf, other

    cs.LG cs.SI

    Link Prediction on Heterophilic Graphs via Disentangled Representation Learning

    Authors: Shijie Zhou, Zhimeng Guo, Charu Aggarwal, Xiang Zhang, Suhang Wang

    Abstract: Link prediction is an important task that has wide applications in various domains. However, the majority of existing link prediction approaches assume the given graph follows homophily assumption, and designs similarity-based heuristics or representation learning approaches to predict links. However, many real-world graphs are heterophilic graphs, where the homophily assumption does not hold, whi… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

  23. Feature Overcorrelation in Deep Graph Neural Networks: A New Perspective

    Authors: Wei **, Xiaorui Liu, Yao Ma, Charu Aggarwal, Jiliang Tang

    Abstract: Recent years have witnessed remarkable success achieved by graph neural networks (GNNs) in many real-world applications such as recommendation and drug discovery. Despite the success, oversmoothing has been identified as one of the key issues which limit the performance of deep GNNs. It indicates that the learned node representations are highly indistinguishable due to the stacked aggregators. In… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

    Comments: Accepted by KDD 2022

  24. arXiv:2205.15555  [pdf, other

    cs.LG

    Graph-level Neural Networks: Current Progress and Future Directions

    Authors: Ge Zhang, Jia Wu, Jian Yang, Shan Xue, Wenbin Hu, Chuan Zhou, Hao Peng, Quan Z. Sheng, Charu Aggarwal

    Abstract: Graph-structured data consisting of objects (i.e., nodes) and relationships among objects (i.e., edges) are ubiquitous. Graph-level learning is a matter of studying a collection of graphs instead of a single graph. Traditional graph-level learning methods used to be the mainstream. However, with the increasing scale and complexity of graphs, Graph-level Neural Networks (GLNNs, deep learning-based… ▽ More

    Submitted 31 May, 2022; originally announced May 2022.

  25. arXiv:2205.10759  [pdf, other

    cs.IR cs.AI cs.LG

    Sequential/Session-based Recommendations: Challenges, Approaches, Applications and Opportunities

    Authors: Shou** Wang, Qi Zhang, Liang Hu, Xiuzhen Zhang, Yan Wang, Charu Aggarwal

    Abstract: In recent years, sequential recommender systems (SRSs) and session-based recommender systems (SBRSs) have emerged as a new paradigm of RSs to capture users' short-term but dynamic preferences for enabling more timely and accurate recommendations. Although SRSs and SBRSs have been extensively studied, there are many inconsistencies in this area caused by the diverse descriptions, settings, assumpti… ▽ More

    Submitted 22 May, 2022; originally announced May 2022.

    Journal ref: SIGIR 2022

  26. arXiv:2202.10688  [pdf, other

    cs.LG cs.AI

    Graph Lifelong Learning: A Survey

    Authors: Falih Gozi Febrinanto, Feng Xia, Kristen Moore, Chandra Thapa, Charu Aggarwal

    Abstract: Graph learning is a popular approach for performing machine learning on graph-structured data. It has revolutionized the machine learning ability to model graph data to address downstream tasks. Its application is wide due to the availability of graph data ranging from all types of networks to information systems. Most graph learning methods assume that the graph is static and its complete structu… ▽ More

    Submitted 3 November, 2022; v1 submitted 22 February, 2022; originally announced February 2022.

    Comments: 19 pages, 4 figures

    MSC Class: 68T07; 68T05 ACM Class: I.2.6

    Journal ref: IEEE Computational Intelligence Magazine 2022

  27. arXiv:2110.12035  [pdf, other

    cs.LG cs.SI

    Distance-wise Prototypical Graph Neural Network in Node Imbalance Classification

    Authors: Yu Wang, Charu Aggarwal, Tyler Derr

    Abstract: Recent years have witnessed the significant success of applying graph neural networks (GNNs) in learning effective node representations for classification. However, current GNNs are mostly built under the balanced data-splitting, which is inconsistent with many real-world networks where the number of training nodes can be extremely imbalanced among the classes. Thus, directly utilizing current GNN… ▽ More

    Submitted 22 October, 2021; originally announced October 2021.

  28. arXiv:2108.05869  [pdf, ps, other

    cs.CL

    Syntax Matters! Syntax-Controlled in Text Style Transfer

    Authors: Zhiqiang Hu, Roy Ka-Wei Lee, Charu C. Aggarwal

    Abstract: Existing text style transfer (TST) methods rely on style classifiers to disentangle the text's content and style attributes for text style transfer. While the style classifier plays a critical role in existing TST methods, there is no known investigation on its effect on the TST methods. In this paper, we conduct an empirical study on the limitations of the style classifiers used in existing TST m… ▽ More

    Submitted 12 August, 2021; originally announced August 2021.

  29. arXiv:2108.03388  [pdf, other

    cs.LG cs.AI cs.CR cs.SI

    Jointly Attacking Graph Neural Network and its Explanations

    Authors: Wenqi Fan, Wei **, Xiaorui Liu, Han Xu, Xianfeng Tang, Suhang Wang, Qing Li, Jiliang Tang, Jian** Wang, Charu Aggarwal

    Abstract: Graph Neural Networks (GNNs) have boosted the performance for many graph-related tasks. Despite the great success, recent studies have shown that GNNs are highly vulnerable to adversarial attacks, where adversaries can mislead the GNNs' prediction by modifying graphs. On the other hand, the explanation of GNNs (GNNExplainer) provides a better understanding of a trained GNN model by generating a sm… ▽ More

    Submitted 22 November, 2022; v1 submitted 7 August, 2021; originally announced August 2021.

    Comments: Accepted by ICDE 2023 (39th IEEE International Conference on Data Engineering)

  30. arXiv:2107.10234  [pdf, other

    cs.LG cs.AI

    Bridging the Gap between Spatial and Spectral Domains: A Unified Framework for Graph Neural Networks

    Authors: Zhiqian Chen, Fanglan Chen, Lei Zhang, Taoran Ji, Kaiqun Fu, Liang Zhao, Feng Chen, Lingfei Wu, Charu Aggarwal, Chang-Tien Lu

    Abstract: Deep learning's performance has been extensively recognized recently. Graph neural networks (GNNs) are designed to deal with graph-structural data that classical deep learning does not easily manage. Since most GNNs were created using distinct theories, direct comparisons are impossible. Prior research has primarily concentrated on categorizing existing models, with little attention paid to their… ▽ More

    Submitted 18 September, 2023; v1 submitted 21 July, 2021; originally announced July 2021.

    Comments: ACM Computing Survey, to appear

  31. arXiv:2106.04714  [pdf, other

    cs.LG

    NRGNN: Learning a Label Noise-Resistant Graph Neural Network on Sparsely and Noisily Labeled Graphs

    Authors: Enyan Dai, Charu Aggarwal, Suhang Wang

    Abstract: Graph Neural Networks (GNNs) have achieved promising results for semi-supervised learning tasks on graphs such as node classification. Despite the great success of GNNs, many real-world graphs are often sparsely and noisily labeled, which could significantly degrade the performance of GNNs, as the noisy information could propagate to unlabeled nodes via graph structure. Thus, it is important to de… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

  32. arXiv:2105.04493  [pdf, other

    cs.LG cs.AI

    Graph Feature Gating Networks

    Authors: Wei **, Xiaorui Liu, Yao Ma, Tyler Derr, Charu Aggarwal, Jiliang Tang

    Abstract: Graph neural networks (GNNs) have received tremendous attention due to their power in learning effective representations for graphs. Most GNNs follow a message-passing scheme where the node representations are updated by aggregating and transforming the information from the neighborhood. Meanwhile, they adopt the same strategy in aggregating the information from different feature dimensions. Howev… ▽ More

    Submitted 10 May, 2021; originally announced May 2021.

  33. arXiv:2104.06313  [pdf, other

    cs.IR cs.AI cs.LG

    SetConv: A New Approach for Learning from Imbalanced Data

    Authors: Yang Gao, Yi-Fan Li, Yu Lin, Charu Aggarwal, Latifur Khan

    Abstract: For many real-world classification problems, e.g., sentiment classification, most existing machine learning methods are biased towards the majority class when the Imbalance Ratio (IR) is high. To address this problem, we propose a set convolution (SetConv) operation and an episodic training strategy to extract a single representative for each class, so that classifiers can later be trained on a ba… ▽ More

    Submitted 3 April, 2021; originally announced April 2021.

    Comments: Accepted by EMNLP 2020 (11 pages, 9 figures)

  34. arXiv:2103.00137  [pdf, other

    cs.LG cs.AI

    Meta-Learning with Graph Neural Networks: Methods and Applications

    Authors: Debmalya Mandal, Sourav Medya, Brian Uzzi, Charu Aggarwal

    Abstract: Graph Neural Networks (GNNs), a generalization of deep neural networks on graph data have been widely used in various domains, ranging from drug discovery to recommender systems. However, GNNs on such applications are limited when there are few available samples. Meta-learning has been an important framework to address the lack of samples in machine learning, and in recent years, researchers have… ▽ More

    Submitted 6 November, 2021; v1 submitted 27 February, 2021; originally announced March 2021.

  35. arXiv:2102.05571  [pdf, other

    cs.CR cs.AI cs.IR cs.LG

    TINKER: A framework for Open source Cyberthreat Intelligence

    Authors: Nidhi Rastogi, Sharmishtha Dutta, Mohammed J. Zaki, Alex Gittens, Charu Aggarwal

    Abstract: Threat intelligence on malware attacks and campaigns is increasingly being shared with other security experts for a cost or for free. Other security analysts use this intelligence to inform them of indicators of compromise, attack techniques, and preventative actions. Security analysts prepare threat analysis reports after investigating an attack, an emerging cyber threat, or a recently discovered… ▽ More

    Submitted 19 January, 2023; v1 submitted 10 February, 2021; originally announced February 2021.

    Comments: 9 pages

  36. arXiv:2010.12742  [pdf, ps, other

    cs.CL

    Text Style Transfer: A Review and Experimental Evaluation

    Authors: Zhiqiang Hu, Roy Ka-Wei Lee, Charu C. Aggarwal, Aston Zhang

    Abstract: The stylistic properties of text have intrigued computational linguistics researchers in recent years. Specifically, researchers have investigated the Text Style Transfer (TST) task, which aims to change the stylistic properties of the text while retaining its style independent content. Over the last few years, many novel TST algorithms have been developed, while the industry has leveraged these a… ▽ More

    Submitted 1 January, 2023; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: We fixed the issue that the references are not associated with any [number] in the bibliography section

    Journal ref: KDD Explorations 24 (2022) 14-45

  37. Investigating and Mitigating Degree-Related Biases in Graph Convolutional Networks

    Authors: Xianfeng Tang, Huaxiu Yao, Yiwei Sun, Yiqi Wang, Jiliang Tang, Charu Aggarwal, Prasenjit Mitra, Suhang Wang

    Abstract: Graph Convolutional Networks (GCNs) show promising results for semi-supervised learning tasks on graphs, thus become favorable comparing with other approaches. Despite the remarkable success of GCNs, it is difficult to train GCNs with insufficient supervision. When labeled data are limited, the performance of GCNs becomes unsatisfying for low-degree nodes. While some prior work analyze successes a… ▽ More

    Submitted 13 August, 2020; v1 submitted 28 June, 2020; originally announced June 2020.

    Comments: Accepted to CIKM 2020

  38. MALOnt: An Ontology for Malware Threat Intelligence

    Authors: Nidhi Rastogi, Sharmishtha Dutta, Mohammed J. Zaki, Alex Gittens, Charu Aggarwal

    Abstract: Malware threat intelligence uncovers deep information about malware, threat actors, and their tactics, Indicators of Compromise(IoC), and vulnerabilities in different platforms from scattered threat sources. This collective information can guide decision making in cyber defense applications utilized by security operation centers(SoCs). In this paper, we introduce an open-source malware ontology -… ▽ More

    Submitted 19 June, 2020; originally announced June 2020.

  39. arXiv:2005.12386  [pdf, other

    cs.LG stat.ML

    Customized Graph Neural Networks

    Authors: Yiqi Wang, Yao Ma, Wei **, Chaozhuo Li, Charu Aggarwal, Jiliang Tang

    Abstract: Recently, Graph Neural Networks (GNNs) have greatly advanced the task of graph classification. Typically, we first build a unified GNN model with graphs in a given training set and then use this unified model to predict labels of all the unseen graphs in the test set. However, graphs in the same dataset often have dramatically distinct structures, which indicates that a unified model may be sub-op… ▽ More

    Submitted 14 December, 2021; v1 submitted 22 May, 2020; originally announced May 2020.

  40. arXiv:2003.00653  [pdf, other

    cs.LG cs.CR stat.ML

    Adversarial Attacks and Defenses on Graphs: A Review, A Tool and Empirical Studies

    Authors: Wei **, Yaxin Li, Han Xu, Yiqi Wang, Shuiwang Ji, Charu Aggarwal, Jiliang Tang

    Abstract: Deep neural networks (DNNs) have achieved significant performance in various tasks. However, recent studies have shown that DNNs can be easily fooled by small perturbation on the input, called adversarial attacks. As the extensions of DNNs to graphs, Graph Neural Networks (GNNs) have been demonstrated to inherit this vulnerability. Adversary can mislead GNNs to give wrong predictions by modifying… ▽ More

    Submitted 12 December, 2020; v1 submitted 1 March, 2020; originally announced March 2020.

    Comments: Accepted by SIGKDD Explorations

  41. arXiv:2002.11867  [pdf, other

    cs.LG cs.AI cs.CG stat.ML

    Bridging the Gap between Spatial and Spectral Domains: A Survey on Graph Neural Networks

    Authors: Zhiqian Chen, Fanglan Chen, Lei Zhang, Taoran Ji, Kaiqun Fu, Liang Zhao, Feng Chen, Lingfei Wu, Charu Aggarwal, Chang-Tien Lu

    Abstract: Deep learning's success has been widely recognized in a variety of machine learning tasks, including image classification, audio recognition, and natural language processing. As an extension of deep learning beyond these domains, graph neural networks (GNNs) are designed to handle the non-Euclidean graph-structure which is intractable to previous deep learning techniques. Existing GNNs are present… ▽ More

    Submitted 21 July, 2021; v1 submitted 26 February, 2020; originally announced February 2020.

  42. arXiv:1911.11121  [pdf, other

    cs.LG stat.ML

    Efficient Global String Kernel with Random Features: Beyond Counting Substructures

    Authors: Lingfei Wu, Ian En-Hsu Yen, Siyu Huo, Liang Zhao, Kun Xu, Liang Ma, Shouling Ji, Charu Aggarwal

    Abstract: Analysis of large-scale sequential data has been one of the most crucial tasks in areas such as bioinformatics, text, and audio mining. Existing string kernels, however, either (i) rely on local features of short substructures in the string, which hardly capture long discriminative patterns, (ii) sum over too many substructures, such as all possible subsequences, which leads to diagonal dominance… ▽ More

    Submitted 25 November, 2019; originally announced November 2019.

    Comments: KDD'19 Oral Paper, Data and Code link available in the paper

  43. arXiv:1911.11119  [pdf, other

    cs.LG stat.ML

    Scalable Global Alignment Graph Kernel Using Random Features: From Node Embedding to Graph Embedding

    Authors: Lingfei Wu, Ian En-Hsu Yen, Zhen Zhang, Kun Xu, Liang Zhao, Xi Peng, Yinglong Xia, Charu Aggarwal

    Abstract: Graph kernels are widely used for measuring the similarity between graphs. Many existing graph kernels, which focus on local patterns within graphs rather than their global properties, suffer from significant structure information loss when representing graphs. Some recent global graph kernels, which utilizes the alignment of geometric node embeddings of graphs, yield state-of-the-art performance.… ▽ More

    Submitted 25 November, 2019; originally announced November 2019.

    Comments: KDD'19, Oral Paper, Data and Code link available in the paper

  44. arXiv:1911.10273  [pdf, other

    cs.LG stat.ML

    Joint Modeling of Local and Global Temporal Dynamics for Multivariate Time Series Forecasting with Missing Values

    Authors: Xianfeng Tang, Huaxiu Yao, Yiwei Sun, Charu Aggarwal, Prasenjit Mitra, Suhang Wang

    Abstract: Multivariate time series (MTS) forecasting is widely used in various domains, such as meteorology and traffic. Due to limitations on data collection, transmission, and storage, real-world MTS data usually contains missing values, making it infeasible to apply existing MTS forecasting models such as linear regression and recurrent neural networks. Though many efforts have been devoted to this probl… ▽ More

    Submitted 22 November, 2019; originally announced November 2019.

    Comments: Accepted by AAAI 2020

  45. arXiv:1910.14436  [pdf, other

    cs.AI cs.LG

    How can AI Automate End-to-End Data Science?

    Authors: Charu Aggarwal, Djallel Bouneffouf, Horst Samulowitz, Beat Buesser, Thanh Hoang, Udayan Khurana, Sijia Liu, Tejaswini Pedapati, Parikshit Ram, Ambrish Rawat, Martin Wistuba, Alexander Gray

    Abstract: Data science is labor-intensive and human experts are scarce but heavily involved in every aspect of it. This makes data science time consuming and restricted to experts with the resulting quality heavily dependent on their experience and skills. To make data science more accessible and scalable, we need its democratization. Automated Data Science (AutoDS) is aimed towards that goal and is emergin… ▽ More

    Submitted 22 October, 2019; originally announced October 2019.

  46. arXiv:1904.13107  [pdf, other

    cs.LG stat.ML

    Graph Convolutional Networks with EigenPooling

    Authors: Yao Ma, Suhang Wang, Charu C. Aggarwal, Jiliang Tang

    Abstract: Graph neural networks, which generalize deep neural network models to graph structured data, have attracted increasing attention in recent years. They usually learn node representations by transforming, propagating and aggregating node features and have been proven to improve the performance of many graph related tasks such as node classification and link prediction. To apply graph neural networks… ▽ More

    Submitted 18 May, 2019; v1 submitted 30 April, 2019; originally announced April 2019.

  47. Meta Diagram based Active Social Networks Alignment

    Authors: Yuxiang Ren, Charu C. Aggarwal, Jiawei Zhang

    Abstract: Network alignment aims at inferring a set of anchor links matching the shared entities between different information networks, which has become a prerequisite step for effective fusion of multiple information networks. In this paper, we will study the network alignment problem to fuse online social networks specifically. Social network alignment is extremely challenging to address due to several r… ▽ More

    Submitted 4 July, 2020; v1 submitted 11 February, 2019; originally announced February 2019.

    Comments: Published at ICDE 2019

  48. arXiv:1808.06099  [pdf, other

    cs.SI

    Multi-dimensional Graph Convolutional Networks

    Authors: Yao Ma, Suhang Wang, Charu C. Aggarwal, Dawei Yin, Jiliang Tang

    Abstract: Convolutional neural networks (CNNs) leverage the great power in representation learning on regular grid data such as image and video. Recently, increasing attention has been paid on generalizing CNNs to graph or network data which is highly irregular. Some focus on graph-level representation learning while others aim to learn node-level representations. These methods have been shown to boost the… ▽ More

    Submitted 18 August, 2018; originally announced August 2018.

  49. arXiv:1807.06560  [pdf, other

    cs.LG cs.SI stat.ML

    Using link and content over time for embedding generation in Dynamic Attributed Networks

    Authors: Ana Paula Appel, Renato L. F. Cunha, Charu C. Aggarwal, Marcela Megumi Terakado

    Abstract: In this work, we consider the problem of combining link, content and temporal analysis for community detection and prediction in evolving networks. Such temporal and content-rich networks occur in many real-life settings, such as bibliographic networks and question answering forums. Most of the work in the literature (that uses both content and structure) deals with static snapshots of networks, a… ▽ More

    Submitted 22 November, 2019; v1 submitted 17 July, 2018; originally announced July 2018.

    Comments: 10 pages, 4 figures, published at ECML-PKDD 2018

  50. arXiv:1805.11048  [pdf, other

    cs.LG stat.ML

    Scalable Spectral Clustering Using Random Binning Features

    Authors: Lingfei Wu, Pin-Yu Chen, Ian En-Hsu Yen, Fangli Xu, Yinglong Xia, Charu Aggarwal

    Abstract: Spectral clustering is one of the most effective clustering approaches that capture hidden cluster structures in the data. However, it does not scale well to large-scale problems due to its quadratic complexity in constructing similarity graphs and computing subsequent eigendecomposition. Although a number of methods have been proposed to accelerate spectral clustering, most of them compromise con… ▽ More

    Submitted 25 November, 2019; v1 submitted 25 May, 2018; originally announced May 2018.

    Comments: KDD'18, Oral Paper, Data and Code link available in the paper