Skip to main content

Showing 1–50 of 97 results for author: Pechenizkiy, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18373  [pdf, other

    cs.CL cs.SD eess.AS

    Dynamic Data Pruning for Automatic Speech Recognition

    Authors: Qiao Xiao, **chuan Ma, Adriana Fernandez-Lopez, Boqian Wu, Lu Yin, Stavros Petridis, Mykola Pechenizkiy, Maja Pantic, Decebal Constantin Mocanu, Shiwei Liu

    Abstract: The recent success of Automatic Speech Recognition (ASR) is largely attributed to the ever-growing amount of training data. However, this trend has made model training prohibitively costly and imposed computational demands. While data pruning has been proposed to mitigate this issue by identifying a small subset of relevant data, its application in ASR has been barely explored, and existing works… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted to Interspeech 2024

  2. arXiv:2406.06495  [pdf, other

    cs.LG

    Boosting Robustness in Preference-Based Reinforcement Learning with Dynamic Sparsity

    Authors: Calarina Muslimani, Bram Grooten, Deepak Ranganatha Sastry Mamillapalli, Mykola Pechenizkiy, Decebal Constantin Mocanu, Matthew E. Taylor

    Abstract: For autonomous agents to successfully integrate into human-centered environments, agents should be able to learn from and adapt to humans in their native settings. Preference-based reinforcement learning (PbRL) is a promising approach that learns reward functions from human preferences. This enables RL agents to adapt their behavior based on human desires. However, humans live in a world full of d… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  3. arXiv:2406.02177  [pdf, other

    cs.LG cs.AI cs.DC stat.ML

    One-Shot Federated Learning with Bayesian Pseudocoresets

    Authors: Tim d'Hondt, Mykola Pechenizkiy, Robert Peharz

    Abstract: Optimization-based techniques for federated learning (FL) often come with prohibitive communication cost, as high dimensional model parameters need to be communicated repeatedly between server and clients. In this paper, we follow a Bayesian approach allowing to perform FL with one-shot communication, by solving the global inference problem as a product of local client posteriors. For models with… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 10 pages

  4. arXiv:2405.19017  [pdf, other

    cs.LG

    Efficient Exploration in Average-Reward Constrained Reinforcement Learning: Achieving Near-Optimal Regret With Posterior Sampling

    Authors: Danil Provodin, Maurits Kaptein, Mykola Pechenizkiy

    Abstract: We present a new algorithm based on posterior sampling for learning in Constrained Markov Decision Processes (CMDP) in the infinite-horizon undiscounted setting. The algorithm achieves near-optimal regret bounds while being advantageous empirically compared to the existing algorithms. Our main theoretical result is a Bayesian regret bound for each cost component of $\tilde{O} (DS\sqrt{AT})$ for an… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: To appear at ICML'24

  5. The Neutrality Fallacy: When Algorithmic Fairness Interventions are (Not) Positive Action

    Authors: Hilde Weerts, Raphaële Xenidis, Fabien Tarissan, Henrik Palmer Olsen, Mykola Pechenizkiy

    Abstract: Various metrics and interventions have been developed to identify and mitigate unfair outputs of machine learning systems. While individuals and organizations have an obligation to avoid discrimination, the use of fairness-aware machine learning interventions has also been described as amounting to 'algorithmic positive action' under European Union (EU) non-discrimination law. As the Court of Just… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Journal ref: 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT '24)

  6. arXiv:2404.08006  [pdf, other

    cs.RO cs.AI cs.LG math.OC

    Learning Efficient and Fair Policies for Uncertainty-Aware Collaborative Human-Robot Order Picking

    Authors: Igor G. Smit, Zaharah Bukhsh, Mykola Pechenizkiy, Kostas Alogariastos, Kasper Hendriks, Yingqian Zhang

    Abstract: In collaborative human-robot order picking systems, human pickers and Autonomous Mobile Robots (AMRs) travel independently through a warehouse and meet at pick locations where pickers load items onto the AMRs. In this paper, we consider an optimization problem in such systems where we allocate pickers to AMRs in a stochastic environment. We propose a novel multi-objective Deep Reinforcement Learni… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  7. arXiv:2402.19226  [pdf, other

    cs.LG cs.CY

    Investigating Gender Fairness in Machine Learning-driven Personalized Care for Chronic Pain

    Authors: Pratik Gajane, Sean Newman, Mykola Pechenizkiy, John D. Piette

    Abstract: Chronic pain significantly diminishes the quality of life for millions worldwide. While psychoeducation and therapy can improve pain outcomes, many individuals experiencing pain lack access to evidence-based treatments or fail to complete the necessary number of sessions to achieve benefit. Reinforcement learning (RL) shows potential in tailoring personalized pain management interventions accordin… ▽ More

    Submitted 14 June, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

  8. arXiv:2401.09334  [pdf, other

    cs.CL cs.AI

    Large Language Models Are Neurosymbolic Reasoners

    Authors: Meng Fang, Shilong Deng, Yudi Zhang, Zi**g Shi, Ling Chen, Mykola Pechenizkiy, Jun Wang

    Abstract: A wide range of real-world applications is characterized by their symbolic nature, necessitating a strong capability for symbolic reasoning. This paper investigates the potential application of Large Language Models (LLMs) as symbolic reasoners. We focus on text-based games, significant benchmarks for agents with natural language capabilities, particularly in symbolic tasks like math, map reading,… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: Accepted by AAAI 2024

  9. arXiv:2312.15339  [pdf, other

    cs.LG cs.AI cs.CV cs.RO

    MaDi: Learning to Mask Distractions for Generalization in Visual Deep Reinforcement Learning

    Authors: Bram Grooten, Tristan Tomilin, Gautham Vasan, Matthew E. Taylor, A. Rupam Mahmood, Meng Fang, Mykola Pechenizkiy, Decebal Constantin Mocanu

    Abstract: The visual world provides an abundance of information, but many input pixels received by agents often contain distracting stimuli. Autonomous agents need the ability to distinguish useful information from task-irrelevant perceptions, enabling them to generalize to unseen environments with new distractions. Existing works approach this problem using data augmentation or large auxiliary networks wit… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

    Comments: Accepted as full-paper (oral) at AAMAS 2024. Code is available at https://github.com/bramgrooten/mask-distractions and see our 40-second video at https://youtu.be/2oImF0h1k48

  10. arXiv:2312.06315  [pdf, other

    cs.CL cs.CY cs.LG

    GPTBIAS: A Comprehensive Framework for Evaluating Bias in Large Language Models

    Authors: Jiaxu Zhao, Meng Fang, Shirui Pan, Wenpeng Yin, Mykola Pechenizkiy

    Abstract: Warning: This paper contains content that may be offensive or upsetting. There has been a significant increase in the usage of large language models (LLMs) in various applications, both in their original form and through fine-tuned adaptations. As a result, LLMs have gained popularity and are being widely adopted by a large user community. However, one of the concerns with LLMs is the potential ge… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  11. arXiv:2312.04727  [pdf, other

    cs.CV

    E2ENet: Dynamic Sparse Feature Fusion for Accurate and Efficient 3D Medical Image Segmentation

    Authors: Boqian Wu, Qiao Xiao, Shiwei Liu, Lu Yin, Mykola Pechenizkiy, Decebal Constantin Mocanu, Maurice Van Keulen, Elena Mocanu

    Abstract: Deep neural networks have evolved as the leading approach in 3D medical image segmentation due to their outstanding performance. However, the ever-increasing model size and computation cost of deep neural networks have become the primary barrier to deploying them on real-world resource-limited hardware. In pursuit of improving performance and efficiency, we propose a 3D medical image segmentation… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  12. arXiv:2312.04307  [pdf, other

    cs.LG

    A Structural-Clustering Based Active Learning for Graph Neural Networks

    Authors: Ricky Maulana Fajri, Yulong Pei, Lu Yin, Mykola Pechenizkiy

    Abstract: In active learning for graph-structured data, Graph Neural Networks (GNNs) have shown effectiveness. However, a common challenge in these applications is the underutilization of crucial structural information. To address this problem, we propose the Structural-Clustering PageRank method for improved Active learning (SPA) specifically designed for graph-structured data. SPA integrates community det… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  13. arXiv:2312.03044  [pdf, other

    cs.LG

    REST: Enhancing Group Robustness in DNNs through Reweighted Sparse Training

    Authors: Jiaxu Zhao, Lu Yin, Shiwei Liu, Meng Fang, Mykola Pechenizkiy

    Abstract: The deep neural network (DNN) has been proven effective in various domains. However, they often struggle to perform well on certain minority groups during inference, despite showing strong performance on the majority of data groups. This is because over-parameterized models learned \textit{bias attributes} from a large number of \textit{bias-aligned} training samples. These bias attributes are str… ▽ More

    Submitted 8 December, 2023; v1 submitted 5 December, 2023; originally announced December 2023.

  14. arXiv:2312.01397  [pdf, other

    cs.CV cs.LG

    Visual Prompting Upgrades Neural Network Sparsification: A Data-Model Perspective

    Authors: Can **, Tian** Huang, Yihua Zhang, Mykola Pechenizkiy, Sijia Liu, Shiwei Liu, Tianlong Chen

    Abstract: The rapid development of large-scale deep learning models questions the affordability of hardware platforms, which necessitates the pruning to reduce their computational and memory footprints. Sparse neural networks as the product, have demonstrated numerous favorable benefits like low complexity, undamaged generalization, etc. Most of the prominent pruning strategies are invented from a model-cen… ▽ More

    Submitted 14 December, 2023; v1 submitted 3 December, 2023; originally announced December 2023.

  15. arXiv:2310.19650  [pdf, other

    cs.CL

    KeyGen2Vec: Learning Document Embedding via Multi-label Keyword Generation in Question-Answering

    Authors: Iftitahu Ni'mah, Samaneh Khoshrou, Vlado Menkovski, Mykola Pechenizkiy

    Abstract: Representing documents into high dimensional embedding space while preserving the structural similarity between document sources has been an ultimate goal for many works on text representation learning. Current embedding models, however, mainly rely on the availability of label supervision to increase the expressiveness of the resulting embeddings. In contrast, unsupervised embeddings are cheap, b… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Arxiv preprint

  16. arXiv:2310.08725  [pdf, ps, other

    cs.LG

    Heterophily-Based Graph Neural Network for Imbalanced Classification

    Authors: Zirui Liang, Yuntao Li, Tian** Huang, Akrati Saxena, Yulong Pei, Mykola Pechenizkiy

    Abstract: Graph neural networks (GNNs) have shown promise in addressing graph-related problems, including node classification. However, conventional GNNs assume an even distribution of data across classes, which is often not the case in real-world scenarios, where certain classes are severely underrepresented. This leads to suboptimal performance of standard GNNs on imbalanced graphs. In this paper, we intr… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: Accepted by Twelfth International Conference on Complex Networks & Their Applications

  17. arXiv:2310.05175  [pdf, other

    cs.LG

    Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity

    Authors: Lu Yin, You Wu, Zhenyu Zhang, Cheng-Yu Hsieh, Yaqing Wang, Yiling Jia, Gen Li, Ajay Jaiswal, Mykola Pechenizkiy, Yi Liang, Michael Bendersky, Zhangyang Wang, Shiwei Liu

    Abstract: Large Language Models (LLMs), renowned for their remarkable performance across diverse domains, present a challenge when it comes to practical deployment due to their colossal model size. In response to this challenge, efforts have been directed toward the application of traditional network pruning techniques to LLMs, uncovering a massive number of parameters that can be pruned in one-shot without… ▽ More

    Submitted 6 May, 2024; v1 submitted 8 October, 2023; originally announced October 2023.

  18. arXiv:2309.15737  [pdf, other

    cs.LG

    Provably Efficient Exploration in Constrained Reinforcement Learning:Posterior Sampling Is All You Need

    Authors: Danil Provodin, Pratik Gajane, Mykola Pechenizkiy, Maurits Kaptein

    Abstract: We present a new algorithm based on posterior sampling for learning in constrained Markov decision processes (CMDP) in the infinite-horizon undiscounted setting. The algorithm achieves near-optimal regret bounds while being advantageous empirically compared to the existing algorithms. Our main theoretical result is a Bayesian regret bound for each cost component of \tilde{O} (HS \sqrt{AT}) for any… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  19. arXiv:2306.14275  [pdf, other

    cs.LG cs.AI

    Enhancing Adversarial Training via Reweighting Optimization Trajectory

    Authors: Tian** Huang, Shiwei Liu, Tianlong Chen, Meng Fang, Li Shen, Vlaod Menkovski, Lu Yin, Yulong Pei, Mykola Pechenizkiy

    Abstract: Despite the fact that adversarial training has become the de facto method for improving the robustness of deep neural networks, it is well-known that vanilla adversarial training suffers from daunting robust overfitting, resulting in unsatisfactory robust generalization. A number of approaches have been proposed to address these drawbacks such as extra regularization, adversarial weights perturbat… ▽ More

    Submitted 4 February, 2024; v1 submitted 25 June, 2023; originally announced June 2023.

    Comments: Accepted by ECML 2023

    Journal ref: ECML 2023

  20. arXiv:2305.19454  [pdf, other

    cs.LG cs.AI cs.CV

    Dynamic Sparsity Is Channel-Level Sparsity Learner

    Authors: Lu Yin, Gen Li, Meng Fang, Li Shen, Tian** Huang, Zhangyang Wang, Vlado Menkovski, Xiaolong Ma, Mykola Pechenizkiy, Shiwei Liu

    Abstract: Sparse training has received an upsurging interest in machine learning due to its tantalizing saving potential for the entire training process as well as inference. Dynamic sparse training (DST), as a leading sparse training approach, can train deep neural networks at high sparsity from scratch to match the performance of their dense counterparts. However, most if not all DST prior arts demonstrat… ▽ More

    Submitted 10 November, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: Accepted by the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  21. arXiv:2305.19412  [pdf, other

    cs.CV cs.AI

    Are Large Kernels Better Teachers than Transformers for ConvNets?

    Authors: Tian** Huang, Lu Yin, Zhenyu Zhang, Li Shen, Meng Fang, Mykola Pechenizkiy, Zhangyang Wang, Shiwei Liu

    Abstract: This paper reveals a new appeal of the recently emerged large-kernel Convolutional Neural Networks (ConvNets): as the teacher in Knowledge Distillation (KD) for small-kernel ConvNets. While Transformers have led state-of-the-art (SOTA) performance in various fields with ever-larger models and labeled data, small-kernel ConvNets are considered more suitable for resource-limited applications due to… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: Accepted by ICML 2023

    Journal ref: ICML 2023

  22. arXiv:2305.18427  [pdf, other

    cs.LG

    Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach

    Authors: Yudi Zhang, Yali Du, Biwei Huang, Ziyan Wang, Jun Wang, Meng Fang, Mykola Pechenizkiy

    Abstract: A major challenge in reinforcement learning is to determine which state-action pairs are responsible for future rewards that are delayed. Reward redistribution serves as a solution to re-assign credits for each time step from observed sequences. While the majority of current approaches construct the reward redistribution in an uninterpretable manner, we propose to explicitly model the contribution… ▽ More

    Submitted 10 November, 2023; v1 submitted 28 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023 camera-ready version

  23. arXiv:2305.18382  [pdf, other

    cs.LG

    Adaptive Sparsity Level during Training for Efficient Time Series Forecasting with Transformers

    Authors: Zahra Atashgahi, Mykola Pechenizkiy, Raymond Veldhuis, Decebal Constantin Mocanu

    Abstract: Efficient time series forecasting has become critical for real-world applications, particularly with deep neural networks (DNNs). Efficiency in DNNs can be achieved through sparse connectivity and reducing the model size. However, finding the sparsity level automatically during training remains challenging due to the heterogeneity in the loss-sparsity tradeoffs across the datasets. In this paper,… ▽ More

    Submitted 12 June, 2024; v1 submitted 28 May, 2023; originally announced May 2023.

  24. arXiv:2305.13938  [pdf, other

    cs.CY cs.AI cs.LG

    Algorithmic Unfairness through the Lens of EU Non-Discrimination Law: Or Why the Law is not a Decision Tree

    Authors: Hilde Weerts, Raphaële Xenidis, Fabien Tarissan, Henrik Palmer Olsen, Mykola Pechenizkiy

    Abstract: Concerns regarding unfairness and discrimination in the context of artificial intelligence (AI) systems have recently received increased attention from both legal and computer science scholars. Yet, the degree of overlap between notions of algorithmic bias and fairness on the one hand, and legal notions of discrimination and equality on the other, is often unclear, leading to misunderstandings bet… ▽ More

    Submitted 24 May, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Journal ref: 2023 ACM Conference on Fairness, Accountability, and Transparency (FAccT '23)

  25. arXiv:2305.11262  [pdf, other

    cs.CL

    CHBias: Bias Evaluation and Mitigation of Chinese Conversational Language Models

    Authors: Jiaxu Zhao, Meng Fang, Zi**g Shi, Yitong Li, Ling Chen, Mykola Pechenizkiy

    Abstract: \textit{\textbf{\textcolor{red}{Warning}:} This paper contains content that may be offensive or upsetting.} Pretrained conversational agents have been exposed to safety issues, exhibiting a range of stereotypical human biases such as gender bias. However, there are still limited bias categories in current research, and most of them only focus on English. In this paper, we introduce a new Chinese d… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: Accepted by ACL 2023

  26. arXiv:2305.08566  [pdf, other

    cs.CL

    NLG Evaluation Metrics Beyond Correlation Analysis: An Empirical Metric Preference Checklist

    Authors: Iftitahu Ni'mah, Meng Fang, Vlado Menkovski, Mykola Pechenizkiy

    Abstract: In this study, we analyze automatic evaluation metrics for Natural Language Generation (NLG), specifically task-agnostic metrics and human-aligned metrics. Task-agnostic metrics, such as Perplexity, BLEU, BERTScore, are cost-effective and highly adaptable to diverse NLG tasks, yet they have a weak correlation with human. Human-aligned metrics (CTC, CtrlEval, UniEval) improves correlation level by… ▽ More

    Submitted 26 May, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: To appear at ACL 2023 Toronto (main conference). 9 pages (main), 1 page for Limitations and Ethics, 11 pages for Appendix

  27. Can Fairness be Automated? Guidelines and Opportunities for Fairness-aware AutoML

    Authors: Hilde Weerts, Florian Pfisterer, Matthias Feurer, Katharina Eggensperger, Edward Bergman, Noor Awad, Joaquin Vanschoren, Mykola Pechenizkiy, Bernd Bischl, Frank Hutter

    Abstract: The field of automated machine learning (AutoML) introduces techniques that automate parts of the development of machine learning (ML) systems, accelerating the process and reducing barriers for novices. However, decisions derived from ML models can reproduce, amplify, or even introduce unfairness in our societies, causing harm to (groups of) individuals. In response, researchers have started to p… ▽ More

    Submitted 20 February, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Journal ref: Journal of Artificial Intelligence Research 79 (2024) 639-677

  28. arXiv:2303.07200  [pdf, other

    cs.NE cs.AI cs.LG

    Supervised Feature Selection with Neuron Evolution in Sparse Neural Networks

    Authors: Zahra Atashgahi, Xuhao Zhang, Neil Kichler, Shiwei Liu, Lu Yin, Mykola Pechenizkiy, Raymond Veldhuis, Decebal Constantin Mocanu

    Abstract: Feature selection that selects an informative subset of variables from data not only enhances the model interpretability and performance but also alleviates the resource demands. Recently, there has been growing attention on feature selection using neural networks. However, existing methods usually suffer from high computational costs when applied to high-dimensional datasets. In this paper, inspi… ▽ More

    Submitted 14 March, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

  29. arXiv:2302.06548  [pdf, other

    cs.LG cs.AI

    Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning

    Authors: Bram Grooten, Ghada Sokar, Shibhansh Dohare, Elena Mocanu, Matthew E. Taylor, Mykola Pechenizkiy, Decebal Constantin Mocanu

    Abstract: Tomorrow's robots will need to distinguish useful information from noise when performing different tasks. A household robot for instance may continuously receive a plethora of information about the home, but needs to focus on just a small subset to successfully execute its current chore. Filtering distracting inputs that contain irrelevant data has received little attention in the reinforcement le… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

    Comments: Accepted as full-paper at AAMAS 2023

  30. arXiv:2212.09840  [pdf, other

    cs.LG cs.AI

    Dynamic Sparse Network for Time Series Classification: Learning What to "see''

    Authors: Qiao Xiao, Boqian Wu, Yu Zhang, Shiwei Liu, Mykola Pechenizkiy, Elena Mocanu, Decebal Constantin Mocanu

    Abstract: The receptive field (RF), which determines the region of time series to be ``seen'' and used, is critical to improve the performance for time series classification (TSC). However, the variation of signal scales across and within time series data, makes it challenging to decide on proper RF sizes for TSC. In this paper, we propose a dynamic sparse network (DSN) with sparse connections for TSC, whic… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: Accepted at Neural Information Processing Systems (NeurIPS 2022)

  31. arXiv:2211.15335  [pdf, other

    cs.LG

    You Can Have Better Graph Neural Networks by Not Training Weights at All: Finding Untrained GNNs Tickets

    Authors: Tian** Huang, Tianlong Chen, Meng Fang, Vlado Menkovski, Jiaxu Zhao, Lu Yin, Yulong Pei, Decebal Constantin Mocanu, Zhangyang Wang, Mykola Pechenizkiy, Shiwei Liu

    Abstract: Recent works have impressively demonstrated that there exists a subnetwork in randomly initialized convolutional neural networks (CNNs) that can match the performance of the fully trained dense networks at initialization, without any optimization of the weights of the network (i.e., untrained networks). However, the presence of such untrained subnetworks in graph neural networks (GNNs) still remai… ▽ More

    Submitted 4 February, 2024; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: Accepted by the LoG conference 2022 as a spotlight

    Journal ref: LoG 2022 (Oral & Best Paper Award)

  32. arXiv:2211.14627  [pdf, other

    cs.LG cs.CV

    Where to Pay Attention in Sparse Training for Feature Selection?

    Authors: Ghada Sokar, Zahra Atashgahi, Mykola Pechenizkiy, Decebal Constantin Mocanu

    Abstract: A new line of research for feature selection based on neural networks has recently emerged. Despite its superiority to classical methods, it requires many training iterations to converge and detect informative features. The computational time becomes prohibitively long for datasets with a large number of samples or a very high dimensional feature space. In this paper, we present a new efficient un… ▽ More

    Submitted 26 November, 2022; originally announced November 2022.

    Comments: Accepted at Neural Information Processing Systems (NeurIPS) 2022

  33. arXiv:2209.12756  [pdf, other

    cs.LG

    FAL-CUR: Fair Active Learning using Uncertainty and Representativeness on Fair Clustering

    Authors: Ricky Fajri, Akrati Saxena, Yulong Pei, Mykola Pechenizkiy

    Abstract: Active Learning (AL) techniques have proven to be highly effective in reducing data labeling costs across a range of machine learning tasks. Nevertheless, one known challenge of these methods is their potential to introduce unfairness towards sensitive attributes. Although recent approaches have focused on enhancing fairness in AL, they tend to reduce the model's accuracy. To address this issue, w… ▽ More

    Submitted 19 December, 2023; v1 submitted 21 September, 2022; originally announced September 2022.

  34. arXiv:2209.03596  [pdf, other

    cs.LG

    An Empirical Evaluation of Posterior Sampling for Constrained Reinforcement Learning

    Authors: Danil Provodin, Pratik Gajane, Mykola Pechenizkiy, Maurits Kaptein

    Abstract: We study a posterior sampling approach to efficient exploration in constrained reinforcement learning. Alternatively to existing algorithms, we propose two simple algorithms that are more efficient statistically, simpler to implement and computationally cheaper. The first algorithm is based on a linear formulation of CMDP, and the second algorithm leverages the saddle-point formulation of CMDP. Ou… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

  35. arXiv:2209.01678  [pdf, other

    cs.SI cs.CY

    FairSNA: Algorithmic Fairness in Social Network Analysis

    Authors: Akrati Saxena, George Fletcher, Mykola Pechenizkiy

    Abstract: In recent years, designing fairness-aware methods has received much attention in various domains, including machine learning, natural language processing, and information retrieval. However, understanding structural bias and inequalities in social networks and designing fairness-aware methods for various research problems in social network analysis (SNA) have not received much attention. In this w… ▽ More

    Submitted 20 March, 2024; v1 submitted 4 September, 2022; originally announced September 2022.

  36. arXiv:2208.10842  [pdf, other

    cs.LG cs.AI

    Lottery Pools: Winning More by Interpolating Tickets without Increasing Training or Inference Cost

    Authors: Lu Yin, Shiwei Liu, Meng Fang, Tian** Huang, Vlado Menkovski, Mykola Pechenizkiy

    Abstract: Lottery tickets (LTs) is able to discover accurate and sparse subnetworks that could be trained in isolation to match the performance of dense networks. Ensemble, in parallel, is one of the oldest time-proven tricks in machine learning to improve performance by combining the output of multiple independent models. However, the benefits of ensemble in the context of LTs will be diluted since ensembl… ▽ More

    Submitted 3 April, 2023; v1 submitted 23 August, 2022; originally announced August 2022.

    Comments: Published in AAAI 2023. Code can be found at https://github.com/luuyin/Lottery-pools

  37. arXiv:2207.03932  [pdf, other

    cs.LG cs.AI

    Memory-free Online Change-point Detection: A Novel Neural Network Approach

    Authors: Zahra Atashgahi, Decebal Constantin Mocanu, Raymond Veldhuis, Mykola Pechenizkiy

    Abstract: Change-point detection (CPD), which detects abrupt changes in the data distribution, is recognized as one of the most significant tasks in time series analysis. Despite the extensive literature on offline CPD, unsupervised online CPD still suffers from major challenges, including scalability, hyperparameter tuning, and learning constraints. To mitigate some of these challenges, in this paper, we p… ▽ More

    Submitted 6 December, 2023; v1 submitted 8 July, 2022; originally announced July 2022.

  38. arXiv:2207.03620  [pdf, other

    cs.CV

    More ConvNets in the 2020s: Scaling up Kernels Beyond 51x51 using Sparsity

    Authors: Shiwei Liu, Tianlong Chen, Xiaohan Chen, Xuxi Chen, Qiao Xiao, Boqian Wu, Tommi Kärkkäinen, Mykola Pechenizkiy, Decebal Mocanu, Zhangyang Wang

    Abstract: Transformers have quickly shined in the computer vision world since the emergence of Vision Transformers (ViTs). The dominant role of convolutional neural networks (CNNs) seems to be challenged by increasingly effective transformer-based models. Very recently, a couple of advanced convolutional models strike back with large kernels motivated by the local-window attention mechanism, showing appeali… ▽ More

    Submitted 3 March, 2023; v1 submitted 7 July, 2022; originally announced July 2022.

    Comments: Preprint

  39. arXiv:2205.15322  [pdf, other

    cs.LG cs.AI

    Superposing Many Tickets into One: A Performance Booster for Sparse Neural Network Training

    Authors: Lu Yin, Vlado Menkovski, Meng Fang, Tian** Huang, Yulong Pei, Mykola Pechenizkiy, Decebal Constantin Mocanu, Shiwei Liu

    Abstract: Recent works on sparse neural network training (sparse training) have shown that a compelling trade-off between performance and efficiency can be achieved by training intrinsically sparse neural networks from scratch. Existing sparse training methods usually strive to find the best sparse subnetwork possible in one single run, without involving any expensive dense or pre-training steps. For instan… ▽ More

    Submitted 18 August, 2022; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: 17 pages, 5 figures, accepted by the 38th Conference on Uncertainty in Artificial Intelligence (UAI)

  40. arXiv:2205.10710  [pdf, other

    cs.CL cs.AI

    Phrase-level Textual Adversarial Attack with Label Preservation

    Authors: Yibin Lei, Yu Cao, Dianqi Li, Tianyi Zhou, Meng Fang, Mykola Pechenizkiy

    Abstract: Generating high-quality textual adversarial examples is critical for investigating the pitfalls of natural language processing (NLP) models and further promoting their robustness. Existing attacks are usually realized through word-level or sentence-level perturbations, which either limit the perturbation space or sacrifice fluency and textual quality, both affecting the attack effectiveness. In th… ▽ More

    Submitted 24 May, 2022; v1 submitted 21 May, 2022; originally announced May 2022.

    Comments: NAACL-HLT 2022 Findings (Long), 9 pages + 2 pages references + 8 pages appendix

  41. arXiv:2205.10032  [pdf, ps, other

    cs.LG

    Survey on Fair Reinforcement Learning: Theory and Practice

    Authors: Pratik Gajane, Akrati Saxena, Maryam Tavakol, George Fletcher, Mykola Pechenizkiy

    Abstract: Fairness-aware learning aims at satisfying various fairness constraints in addition to the usual performance criteria via data-driven machine learning techniques. Most of the research in fairness-aware learning employs the setting of fair-supervised learning. However, many dynamic real-world applications can be better modeled using sequential decision-making problems and fair reinforcement learnin… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

  42. arXiv:2202.08536  [pdf, other

    cs.LG cs.AI cs.CY

    Does the End Justify the Means? On the Moral Justification of Fairness-Aware Machine Learning

    Authors: Hilde Weerts, Lambèr Royakkers, Mykola Pechenizkiy

    Abstract: Fairness-aware machine learning (fair-ml) techniques are algorithmic interventions designed to ensure that individuals who are affected by the predictions of a machine learning model are treated fairly, typically measured in terms of a quantitative fairness metric. Despite the multitude of fairness metrics and fair-ml algorithms, there is still little guidance on the suitability of different appro… ▽ More

    Submitted 8 February, 2023; v1 submitted 17 February, 2022; originally announced February 2022.

  43. arXiv:2202.06657  [pdf, other

    cs.LG

    The Impact of Batch Learning in Stochastic Linear Bandits

    Authors: Danil Provodin, Pratik Gajane, Mykola Pechenizkiy, Maurits Kaptein

    Abstract: We consider a special case of bandit problems, named batched bandits, in which an agent observes batches of responses over a certain time period. Unlike previous work, we consider a more practically relevant batch-centric scenario of batch learning. That is to say, we provide a policy-agnostic regret analysis and demonstrate upper and lower bounds for the regret of a candidate policy. Our main the… ▽ More

    Submitted 1 September, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: This is a longer version of the paper published at ICDM'22. arXiv admin note: text overlap with arXiv:2111.02071

  44. arXiv:2202.02643  [pdf, other

    cs.LG cs.AI cs.CV

    The Unreasonable Effectiveness of Random Pruning: Return of the Most Naive Baseline for Sparse Training

    Authors: Shiwei Liu, Tianlong Chen, Xiaohan Chen, Li Shen, Decebal Constantin Mocanu, Zhangyang Wang, Mykola Pechenizkiy

    Abstract: Random pruning is arguably the most naive way to attain sparsity in neural networks, but has been deemed uncompetitive by either post-training pruning or sparse training. In this paper, we focus on sparse training and highlight a perhaps counter-intuitive finding, that random pruning at initialization can be quite powerful for the sparse training of modern neural networks. Without any delicate pru… ▽ More

    Submitted 5 February, 2022; originally announced February 2022.

    Comments: Published as a conference paper at ICLR 2022. Code is available at https://github.com/VITA-Group/Random_Pruning

  45. arXiv:2112.09201  [pdf, other

    cs.CV cs.AI

    Semantic-Based Few-Shot Learning by Interactive Psychometric Testing

    Authors: Lu Yin, Vlado Menkovski, Yulong Pei, Mykola Pechenizkiy

    Abstract: Few-shot classification tasks aim to classify images in query sets based on only a few labeled examples in support sets. Most studies usually assume that each image in a task has a single and unique class association. Under these assumptions, these algorithms may not be able to identify the proper class assignment when there is no exact matching between support and query classes. For example, give… ▽ More

    Submitted 27 February, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: Accepted at the AAAI-22 Workshop on Interactive Machine Learning (IML@AAAI'22)

  46. arXiv:2111.02071  [pdf, other

    cs.LG stat.ML

    The Impact of Batch Learning in Stochastic Bandits

    Authors: Danil Provodin, Pratik Gajane, Mykola Pechenizkiy, Maurits Kaptein

    Abstract: We consider a special case of bandit problems, namely batched bandits. Motivated by natural restrictions of recommender systems and e-commerce platforms, we assume that a learning agent observes responses batched in groups over a certain time period. Unlike previous work, we consider a more practically relevant batch-centric scenario of batch learning. We provide a policy-agnostic regret analysis… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: To appear at the workshop on the Ecological Theory of Reinforcement Learning, NeurIPS 2021

  47. arXiv:2110.05329  [pdf, other

    cs.LG cs.AI

    Avoiding Forgetting and Allowing Forward Transfer in Continual Learning via Sparse Networks

    Authors: Ghada Sokar, Decebal Constantin Mocanu, Mykola Pechenizkiy

    Abstract: Using task-specific components within a neural network in continual learning (CL) is a compelling strategy to address the stability-plasticity dilemma in fixed-capacity models without access to past data. Current methods focus only on selecting a sub-network for a new task that reduces forgetting of past tasks. However, this selection could limit the forward transfer of relevant past knowledge tha… ▽ More

    Submitted 6 July, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: Accepted at European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2022)

  48. arXiv:2110.00623  [pdf, other

    cs.LG

    Calibrated Adversarial Training

    Authors: Tian** Huang, Vlado Menkovski, Yulong Pei, Mykola Pechenizkiy

    Abstract: Adversarial training is an approach of increasing the robustness of models to adversarial attacks by including adversarial examples in the training set. One major challenge of producing adversarial examples is to contain sufficient perturbation in the example to flip the model's output while not making severe changes in the example's semantical content. Exuberant change in the semantical content c… ▽ More

    Submitted 11 October, 2021; v1 submitted 1 October, 2021; originally announced October 2021.

    Comments: ACML 2021 accepted,24 pages

  49. arXiv:2109.10703  [pdf, other

    cs.SI physics.soc-ph

    The Banking Transactions Dataset and its Comparative Analysis with Scale-free Networks

    Authors: Akrati Saxena, Yulong Pei, Jan Veldsink, Werner van Ipenburg, George Fletcher, Mykola Pechenizkiy

    Abstract: We construct a network of 1.6 million nodes from banking transactions of users of Rabobank. We assign two weights on each edge, which are the aggregate transferred amount and the total number of transactions between the users from the year 2010 to 2020. We present a detailed analysis of the unweighted and both weighted networks by examining their degree, strength, and weight distributions, as well… ▽ More

    Submitted 22 September, 2021; originally announced September 2021.

  50. arXiv:2109.10432  [pdf, other

    cs.LG cs.AI

    Beyond Discriminant Patterns: On the Robustness of Decision Rule Ensembles

    Authors: Xin Du, Subramanian Ramamoorthy, Wouter Duivesteijn, ** Tian, Mykola Pechenizkiy

    Abstract: Local decision rules are commonly understood to be more explainable, due to the local nature of the patterns involved. With numerical optimization methods such as gradient boosting, ensembles of local decision rules can gain good predictive performance on data involving global structure. Meanwhile, machine learning models are being increasingly used to solve problems in high-stake domains includin… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.