Skip to main content

Showing 1–24 of 24 results for author: Pan, S J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.17493  [pdf, other

    cs.LG

    Overcoming Negative Transfer by Online Selection: Distant Domain Adaptation for Fault Diagnosis

    Authors: Ziyan Wang, Mohamed Ragab, Wenmian Yang, Min Wu, Sinno Jialin Pan, Jie Zhang, Zhenghua Chen

    Abstract: Unsupervised domain adaptation (UDA) has achieved remarkable success in fault diagnosis, bringing significant benefits to diverse industrial applications. While most UDA methods focus on cross-working condition scenarios where the source and target domains are notably similar, real-world applications often grapple with severe domain shifts. We coin the term `distant domain adaptation problem' to d… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 8 pages, 5 figures

  2. arXiv:2404.07546  [pdf, other

    cs.CL

    Decomposing Label Space, Format and Discrimination: Rethinking How LLMs Respond and Solve Tasks via In-Context Learning

    Authors: Quanyu Long, Yin Wu, Wenya Wang, Sinno Jialin Pan

    Abstract: In-context Learning (ICL) has emerged as a powerful capability alongside the development of scaled-up large language models (LLMs). By instructing LLMs using few-shot demonstrative examples, ICL enables them to perform a wide range of tasks without updating millions of parameters. However, the precise contributions of demonstrations towards improving end-task performance have not been thoroughly i… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 36 pages, 8 figures

  3. arXiv:2402.13532  [pdf, other

    cs.CL

    Backdoor Attacks on Dense Passage Retrievers for Disseminating Misinformation

    Authors: Quanyu Long, Yue Deng, LeiLei Gan, Wenya Wang, Sinno Jialin Pan

    Abstract: Dense retrievers and retrieval-augmented language models have been widely used in various NLP applications. Despite being designed to deliver reliable and secure outcomes, the vulnerability of retrievers to potential attacks remains unclear, raising concerns about their security. In this paper, we introduce a novel scenario where the attackers aim to covertly disseminate targeted misinformation, s… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  4. arXiv:2311.11551  [pdf, other

    cs.CL

    Adapt in Contexts: Retrieval-Augmented Domain Adaptation via In-Context Learning

    Authors: Quanyu Long, Wenya Wang, Sinno Jialin Pan

    Abstract: Large language models (LLMs) have showcased their capability with few-shot inference known as in-context learning. However, in-domain demonstrations are not always readily available in real scenarios, leading to cross-domain in-context learning. Besides, LLMs are still facing challenges in long-tail knowledge in unseen and unfamiliar domains. The above limitations demonstrate the necessity of Unsu… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023

  5. arXiv:2310.17924  [pdf, other

    cs.CL

    SOUL: Towards Sentiment and Opinion Understanding of Language

    Authors: Yue Deng, Wenxuan Zhang, Sinno Jialin Pan, Lidong Bing

    Abstract: Sentiment analysis is a well-established natural language processing task, with sentiment polarity classification being one of its most popular and representative tasks. However, despite the success of pre-trained language models in this area, they often fall short of capturing the broader complexities of sentiment analysis. To address this issue, we propose a new task called Sentiment and Opinion… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Main Conference, Short Paper

  6. arXiv:2310.06474  [pdf, other

    cs.CL

    Multilingual Jailbreak Challenges in Large Language Models

    Authors: Yue Deng, Wenxuan Zhang, Sinno Jialin Pan, Lidong Bing

    Abstract: While large language models (LLMs) exhibit remarkable capabilities across a wide range of tasks, they pose potential safety concerns, such as the ``jailbreak'' problem, wherein malicious instructions can manipulate LLMs to exhibit undesirable behavior. Although several preventive measures have been developed to mitigate the potential risks associated with LLMs, they have primarily focused on Engli… ▽ More

    Submitted 3 March, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  7. arXiv:2305.15005  [pdf, other

    cs.CL

    Sentiment Analysis in the Era of Large Language Models: A Reality Check

    Authors: Wenxuan Zhang, Yue Deng, Bing Liu, Sinno Jialin Pan, Lidong Bing

    Abstract: Sentiment analysis (SA) has been a long-standing research area in natural language processing. It can offer rich insights into human sentiments and opinions and has thus seen considerable interest from both academia and industry. With the advent of large language models (LLMs) such as ChatGPT, there is a great potential for their employment on SA problems. However, the extent to which existing LLM… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  8. arXiv:2305.09509  [pdf, other

    cs.CL

    Bidirectional Generative Framework for Cross-domain Aspect-based Sentiment Analysis

    Authors: Yue Deng, Wenxuan Zhang, Sinno Jialin Pan, Lidong Bing

    Abstract: Cross-domain aspect-based sentiment analysis (ABSA) aims to perform various fine-grained sentiment analysis tasks on a target domain by transferring knowledge from a source domain. Since labeled data only exists in the source domain, a model is expected to bridge the domain gap for tackling cross-domain ABSA. Though domain adaptation methods have proven to be effective, most of them are based on a… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: ACL 2023 main conference

  9. arXiv:2212.13088  [pdf, other

    cs.LG

    Learning Generalizable Representations for Reinforcement Learning via Adaptive Meta-learner of Behavioral Similarities

    Authors: Jianda Chen, Sinno Jialin Pan

    Abstract: How to learn an effective reinforcement learning-based model for control tasks from high-level visual observations is a practical and challenging problem. A key to solving this problem is to learn low-dimensional state representations from observations, from which an effective policy can be learned. In order to boost the learning of state encoding, recent works are focused on capturing behavioral… ▽ More

    Submitted 26 December, 2022; originally announced December 2022.

  10. Fast Graph Generation via Spectral Diffusion

    Authors: Tianze Luo, Zhanfeng Mo, Sinno Jialin Pan

    Abstract: Generating graph-structured data is a challenging problem, which requires learning the underlying distribution of graphs. Various models such as graph VAE, graph GANs, and graph diffusion models have been proposed to generate meaningful and reliable graphs, among which the diffusion models have achieved state-of-the-art performance. In this paper, we argue that running full-rank diffusion SDEs on… ▽ More

    Submitted 19 November, 2022; v1 submitted 16 November, 2022; originally announced November 2022.

  11. arXiv:2209.14742  [pdf, other

    cs.LG

    Learning Gradient-based Mixup towards Flatter Minima for Domain Generalization

    Authors: Danni Peng, Sinno Jialin Pan

    Abstract: To address the distribution shifts between training and test data, domain generalization (DG) leverages multiple source domains to learn a model that generalizes well to unseen domains. However, existing DG methods generally suffer from overfitting to the source domains, partly due to the limited coverage of the expected region in feature space. Motivated by this, we propose to perform mixup with… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: 22 pages, 14 figures

  12. arXiv:2207.04564  [pdf, other

    cs.CL cs.LG

    Domain Confused Contrastive Learning for Unsupervised Domain Adaptation

    Authors: Quanyu Long, Tianze Luo, Wenya Wang, Sinno Jialin Pan

    Abstract: In this work, we study Unsupervised Domain Adaptation (UDA) in a challenging self-supervised approach. One of the difficulties is how to learn task discrimination in the absence of target labels. Unlike previous literature which directly aligns cross-domain distributions or leverages reverse gradient, we propose Domain Confused Contrastive Learning (DCCL) to bridge the source and the target domain… ▽ More

    Submitted 10 July, 2022; originally announced July 2022.

    Comments: 14 pages, 7 figures, NAACL 2022

  13. arXiv:2206.06629  [pdf, other

    cs.AI

    Semantic-Discriminative Mixup for Generalizable Sensor-based Cross-domain Activity Recognition

    Authors: Wang Lu, **dong Wang, Yiqiang Chen, Sinno Jialin Pan, Chunyu Hu, Xin Qin

    Abstract: It is expensive and time-consuming to collect sufficient labeled data to build human activity recognition (HAR) models. Training on existing data often makes the model biased towards the distribution of the training data, thus the model might perform terribly on test data with different distributions. Although existing efforts on transfer learning and domain adaptation try to solve the above probl… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: To be presented at UbiComp 2022; Accepted by Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT)

  14. Learning an Adaptive Meta Model-Generator for Incrementally Updating Recommender Systems

    Authors: Danni Peng, Sinno Jialin Pan, Jie Zhang, Anxiang Zeng

    Abstract: Recommender Systems (RSs) in real-world applications often deal with billions of user interactions daily. To capture the most recent trends effectively, it is common to update the model incrementally using only the newly arrived data. However, this may impede the model's ability to retain long-term information due to the potential overfitting and forgetting issues. To address this problem, we prop… ▽ More

    Submitted 8 November, 2021; originally announced November 2021.

    Comments: 11 pages, 6 figures, accepted by RecSys 2021

  15. arXiv:2107.03354  [pdf, other

    cs.LG cs.AI stat.ML

    Mitigating Performance Saturation in Neural Marked Point Processes: Architectures and Loss Functions

    Authors: Tianbo Li, Tianze Luo, Yi** Ke, Sinno Jialin Pan

    Abstract: Attributed event sequences are commonly encountered in practice. A recent research line focuses on incorporating neural networks with the statistical model -- marked point processes, which is the conventional tool for dealing with attributed event sequences. Neural marked point processes possess good interpretability of probabilistic models as well as the representational power of neural networks.… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: 9 pages, 4 figures, accepted by KDD-21 research track. The source code is available at https://github.com/ltz0120/Graph-Convolutional- Hawkes-Processes-GCHP

  16. arXiv:2011.00825  [pdf, other

    cs.LG cs.AI

    Reinforcement Learning with Efficient Active Feature Acquisition

    Authors: Haiyan Yin, Yingzhen Li, Sinno Jialin Pan, Cheng Zhang, Sebastian Tschiatschek

    Abstract: Solving real-life sequential decision making problems under partial observability involves an exploration-exploitation problem. To be successful, an agent needs to efficiently gather valuable information about the state of the world for making rewarding decisions. However, in real-life, acquiring valuable information is often highly costly, e.g., in the medical domain, information acquisition migh… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

  17. arXiv:1912.03041  [pdf, ps, other

    cs.CL cs.IR cs.LG

    Integrating Deep Learning with Logic Fusion for Information Extraction

    Authors: Wenya Wang, Sinno Jialin Pan

    Abstract: Information extraction (IE) aims to produce structured information from an input text, e.g., Named Entity Recognition and Relation Extraction. Various attempts have been proposed for IE via feature engineering or deep learning. However, most of them fail to associate the complex relationships inherent in the task itself, which has proven to be especially crucial. For example, the relation between… ▽ More

    Submitted 6 December, 2019; originally announced December 2019.

    Comments: Accepted in AAAI 2020

  18. arXiv:1911.05701  [pdf, other

    cs.LG cs.AI

    Transfer Value Iteration Networks

    Authors: Junyi Shen, Hankz Hankui Zhuo, ** Xu, Bin Zhong, Sinno Jialin Pan

    Abstract: Value iteration networks (VINs) have been demonstrated to have a good generalization ability for reinforcement learning tasks across similar domains. However, based on our experiments, a policy learned by VINs still fail to generalize well on the domain whose action space and feature space are not identical to those in the domain where it is trained. In this paper, we propose a transfer learning a… ▽ More

    Submitted 26 November, 2019; v1 submitted 11 November, 2019; originally announced November 2019.

  19. arXiv:1707.00524  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Hashing over Predicted Future Frames for Informed Exploration of Deep Reinforcement Learning

    Authors: Haiyan Yin, Jianda Chen, Sinno Jialin Pan

    Abstract: In deep reinforcement learning (RL) tasks, an efficient exploration mechanism should be able to encourage an agent to take actions that lead to less frequent states which may yield higher accumulative future return. However, both knowing about the future and evaluating the frequentness of states are non-trivial tasks, especially for deep RL domains, where a state is represented by high-dimensional… ▽ More

    Submitted 27 April, 2018; v1 submitted 3 July, 2017; originally announced July 2017.

  20. arXiv:1705.07565  [pdf, other

    cs.NE cs.CV cs.LG

    Learning to Prune Deep Neural Networks via Layer-wise Optimal Brain Surgeon

    Authors: Xin Dong, Shangyu Chen, Sinno Jialin Pan

    Abstract: How to develop slim and accurate deep neural networks has become crucial for real- world applications, especially for those employed in embedded systems. Though previous work along this research line has shown some promising results, most existing methods either fail to significantly compress a well-trained deep network or require a heavy retraining process for the pruned deep network to re-boost… ▽ More

    Submitted 9 November, 2017; v1 submitted 22 May, 2017; originally announced May 2017.

  21. arXiv:1702.01776  [pdf, ps, other

    cs.CL

    Multi-task memory networks for category-specific aspect and opinion terms co-extraction

    Authors: Wenya Wang, Sinno Jialin Pan, Daniel Dahlmeier

    Abstract: In aspect-based sentiment analysis, most existing methods either focus on aspect/opinion terms extraction or aspect terms categorization. However, each task by itself only provides partial information to end users. To generate more detailed and structured opinion analysis, we propose a finer-grained problem, which we call category-specific aspect and opinion terms extraction. This problem involves… ▽ More

    Submitted 5 June, 2017; v1 submitted 6 February, 2017; originally announced February 2017.

  22. arXiv:1612.04022  [pdf, ps, other

    cs.LG stat.ML

    Distributed Multi-Task Relationship Learning

    Authors: Sulin Liu, Sinno Jialin Pan, Qirong Ho

    Abstract: Multi-task learning aims to learn multiple tasks jointly by exploiting their relatedness to improve the generalization performance for each task. Traditionally, to perform multi-task learning, one needs to centralize data from all the tasks to a single machine. However, in many real-world applications, data of different tasks may be geo-distributed over different local machines. Due to heavy commu… ▽ More

    Submitted 20 June, 2017; v1 submitted 12 December, 2016; originally announced December 2016.

    Comments: To appear in KDD 2017

  23. arXiv:1605.04034  [pdf, other

    cs.LG stat.ML

    Transfer Hashing with Privileged Information

    Authors: Joey Tianyi Zhou, Xinxing Xu, Sinno Jialin Pan, Ivor W. Tsang, Zheng Qin, Rick Siow Mong Goh

    Abstract: Most existing learning to hash methods assume that there are sufficient data, either labeled or unlabeled, on the domain of interest (i.e., the target domain) for training. However, this assumption cannot be satisfied in some real-world applications. To address this data sparsity issue in hashing, inspired by transfer learning, we propose a new framework named Transfer Hashing with Privileged Info… ▽ More

    Submitted 12 May, 2016; originally announced May 2016.

    Comments: Accepted by IJCAI-2016

  24. arXiv:1603.06679  [pdf, ps, other

    cs.CL cs.IR cs.LG

    Recursive Neural Conditional Random Fields for Aspect-based Sentiment Analysis

    Authors: Wenya Wang, Sinno Jialin Pan, Daniel Dahlmeier, Xiaokui Xiao

    Abstract: In aspect-based sentiment analysis, extracting aspect terms along with the opinions being expressed from user-generated content is one of the most important subtasks. Previous studies have shown that exploiting connections between aspect and opinion terms is promising for this task. In this paper, we propose a novel joint model that integrates recursive neural networks and conditional random field… ▽ More

    Submitted 19 September, 2016; v1 submitted 22 March, 2016; originally announced March 2016.