Skip to main content

Showing 1–50 of 50 results for author: Matwin, S

.
  1. arXiv:2406.16975  [pdf, other

    cs.LG cs.AI

    A Review of Global Sensitivity Analysis Methods and a comparative case study on Digit Classification

    Authors: Zahra Sadeghi, Stan Matwin

    Abstract: Global sensitivity analysis (GSA) aims to detect influential input factors that lead a model to arrive at a certain decision and is a significant approach for mitigating the computational burden of processing high dimensional data. In this paper, we provide a comprehensive review and a comparison on global sensitivity analysis methods. Additionally, we propose a methodology for evaluating the effi… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

  2. arXiv:2401.13098  [pdf, other

    cs.LG cs.AI cs.SI stat.AP

    Gravity-Informed Deep Learning Framework for Predicting Ship Traffic Flow and Invasion Risk of Non-Indigenous Species via Ballast Water Discharge

    Authors: Ruixin Song, Gabriel Spadon, Ronald Pelot, Stan Matwin, Amilcar Soares

    Abstract: Invasive species in water bodies pose a major threat to the environment and biodiversity globally. Due to increased transportation and trade, non-native species have been introduced to new environments, causing damage to ecosystems and leading to economic losses in agriculture, forestry, and fisheries. Therefore, there is a pressing need for risk assessment and management techniques to mitigate th… ▽ More

    Submitted 29 January, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: 26 pages, 7 figures, under review

  3. arXiv:2401.11394  [pdf, other

    cs.LG

    Causal Generative Explainers using Counterfactual Inference: A Case Study on the Morpho-MNIST Dataset

    Authors: Will Taylor-Melanson, Zahra Sadeghi, Stan Matwin

    Abstract: In this paper, we propose leveraging causal generative learning as an interpretable tool for explaining image classifiers. Specifically, we present a generative counterfactual inference approach to study the influence of visual features (i.e., pixels) as well as causal factors through generative learning. To this end, we first uncover the most influential pixels on a classifier's decision by varyi… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

  4. arXiv:2401.03406  [pdf, other

    cs.RO cs.AI cs.LG

    Improving Dribbling, Passing, and Marking Actions in Soccer Simulation 2D Games Using Machine Learning

    Authors: Nader Zare, Omid Amini, Aref Sayareh, Mahtab Sarvmaili, Arad Firouzkouhi, Stan Matwin, Amilcar Soares

    Abstract: The RoboCup competition was started in 1997, and is known as the oldest RoboCup league. The RoboCup 2D Soccer Simulation League is a stochastic, partially observable soccer environment in which 24 autonomous agents play on two opposing teams. In this paper, we detail the main strategies and functionalities of CYRUS, the RoboCup 2021 2D Soccer Simulation League champions. The new functionalities pr… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

  5. arXiv:2310.18948  [pdf, other

    cs.LG cs.AI cs.DM math.PR

    Probabilistic Feature Augmentation for AIS-Based Multi-Path Long-Term Vessel Trajectory Forecasting

    Authors: Gabriel Spadon, Jay Kumar, Derek Eden, Josh van Berkel, Tom Foster, Amilcar Soares, Ronan Fablet, Stan Matwin, Ronald Pelot

    Abstract: Maritime transportation is paramount in achieving global economic growth, entailing concurrent ecological obligations in sustainability and safeguarding endangered marine species, most notably preserving large whale populations. In this regard, the Automatic Identification System (AIS) data plays a significant role by offering real-time streaming data on vessel movement, allowing enhanced traffic… ▽ More

    Submitted 2 May, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

  6. arXiv:2307.16875  [pdf, other

    cs.RO cs.AI

    Pyrus Base: An Open Source Python Framework for the RoboCup 2D Soccer Simulation

    Authors: Nader Zare, Aref Sayareh, Omid Amini, Mahtab Sarvmaili, Arad Firouzkouhi, Stan Matwin, Amilcar Soares

    Abstract: Soccer, also known as football in some parts of the world, involves two teams of eleven players whose objective is to score more goals than the opposing team. To simulate this game and attract scientists from all over the world to conduct research and participate in an annual computer-based soccer world cup, Soccer Simulation 2D (SS2D) was one of the leagues initiated in the RoboCup competition. I… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

  7. arXiv:2305.19283  [pdf, other

    cs.AI cs.RO

    Observation Denoising in CYRUS Soccer Simulation 2D Team For RoboCup 2023

    Authors: Aref Sayareh, Nader Zare, Omid Amini, Arad Firouzkouhi, Mahtab Sarvmaili, Stan Matwin

    Abstract: The RoboCup competitions hold various leagues, and the Soccer Simulation 2D League is a major one among them. Soccer Simulation 2D (SS2D) match involves two teams, including 11 players and a coach, competing against each other. The players can only communicate with the Soccer Simulation Server during the game. This paper presents the latest research of the CYRUS soccer simulation 2D team, the cham… ▽ More

    Submitted 27 May, 2023; originally announced May 2023.

  8. arXiv:2303.01584  [pdf, other

    cs.CV cs.AI cs.LG cs.NE

    Evolutionary Augmentation Policy Optimization for Self-supervised Learning

    Authors: Noah Barrett, Zahra Sadeghi, Stan Matwin

    Abstract: Self-supervised Learning (SSL) is a machine learning algorithm for pretraining Deep Neural Networks (DNNs) without requiring manually labeled data. The central idea of this learning technique is based on an auxiliary stage aka pretext task in which labeled data are created automatically through data augmentation and exploited for pretraining the DNN. However, the effect of each pretext task is not… ▽ More

    Submitted 2 August, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

  9. arXiv:2211.08585  [pdf, other

    cs.RO

    Cyrus2D base: Source Code Base for RoboCup 2D Soccer Simulation League

    Authors: Nader Zare, Omid Amini, Aref Sayareh, Mahtab Sarvmaili, Arad Firouzkouhi, Saba Ramezani Rad, Stan Matwin, Amilcar Soares

    Abstract: Soccer Simulation 2D League is one of the major leagues of RoboCup competitions. In a Soccer Simulation 2D (SS2D) game, two teams of 11 players and one coach compete against each other. Several base codes have been released for the RoboCup soccer simulation 2D (RCSS2D) community that have promoted the application of multi-agent and AI algorithms in this field. In this paper, we introduce "Cyrus2D… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  10. A semi-supervised methodology for fishing activity detection using the geometry behind the trajectory of multiple vessels

    Authors: Martha Dais Ferreira, Gabriel Spadon, Amilcar Soares, Stan Matwin

    Abstract: Automatic Identification System (AIS) messages are useful for tracking vessel activity across oceans worldwide using radio links and satellite transceivers. Such data plays a significant role in tracking vessel activity and map** mobility patterns such as those found in fishing. Accordingly, this paper proposes a geometric-driven semi-supervised approach for fishing activity detection from AIS d… ▽ More

    Submitted 22 August, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: Ferreira, M.D.; Spadon, G.; Soares, A.; Matwin, S. A Semi-Supervised Methodology for Fishing Activity Detection Using the Geometry behind the Trajectory of Multiple Vessels. Sensors 2022, 22, 6063. https://doi.org/10.3390/s22166063

  11. arXiv:2206.02310  [pdf, other

    cs.RO

    CYRUS Soccer Simulation 2D Team Description Paper 2021

    Authors: Nader Zare, Aref Sayareh, Mahtab Sarvmaili, Omid Amini, Amilcar Soares, Stan Matwin

    Abstract: In this report, we briefly present the technical procedure and simulation steps for the 2D soccer simulation of team Cyrus. We emphasize on this document on how the prediction of teammates' behavior is performed. In our proposed method, the agent receives the noisy inputs from the server, and predicts the ball holder full state behavior. Taking advantage of this approach for choosing the optimal v… ▽ More

    Submitted 5 June, 2022; originally announced June 2022.

  12. arXiv:2205.10953  [pdf, other

    cs.AI cs.LG cs.RO

    CYRUS Soccer Simulation 2D Team Description Paper 2022

    Authors: Nader Zare, Arad Firouzkouhi, Omid Amini, Mahtab Sarvmaili, Aref Sayareh, Saba Ramezani Rad, Stan Matwin, Amilcar Soares

    Abstract: Soccer Simulation 2D League is one of the major leagues of RoboCup competitions. In a Soccer Simulation 2D (SS2D) game, two teams of 11 players and one coach compete against each other. The players are only allowed to communicate with the server that is called Soccer Simulation Server. This paper introduces the previous and current research of the CYRUS soccer simulation team, the champion of Robo… ▽ More

    Submitted 22 May, 2022; originally announced May 2022.

  13. arXiv:2204.13170  [pdf, other

    cs.LG cs.AI cs.CV cs.DC cs.MA

    AdaBest: Minimizing Client Drift in Federated Learning via Adaptive Bias Estimation

    Authors: Farshid Varno, Marzie Saghayi, Laya Rafiee Sevyeri, Sharut Gupta, Stan Matwin, Mohammad Havaei

    Abstract: In Federated Learning (FL), a number of clients or devices collaborate to train a model without sharing their data. Models are optimized locally at each client and further communicated to a central hub for aggregation. While FL is an appealing decentralized training paradigm, heterogeneity among data from different clients can cause the local optimization to drift away from the global objective. I… ▽ More

    Submitted 24 July, 2023; v1 submitted 27 April, 2022; originally announced April 2022.

    Comments: Published as a conference paper at ECCV 2022; Corrected some typos in the text and a baseline algorithm

    ACM Class: I.2; I.4; I.5

  14. arXiv:2202.13867  [pdf, other

    cs.LG cs.AI

    Unfolding AIS transmission behavior for vessel movement modeling on noisy data leveraging machine learning

    Authors: Gabriel Spadon, Martha D. Ferreira, Amilcar Soares, Stan Matwin

    Abstract: The oceans are a source of an impressive mixture of complex data that could be used to uncover relationships yet to be discovered. Such data comes from the oceans and their surface, such as Automatic Identification System (AIS) messages used for tracking vessels' trajectories. AIS messages are transmitted over radio or satellite at ideally periodic time intervals but vary irregularly over time. As… ▽ More

    Submitted 5 July, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

  15. arXiv:2112.07041  [pdf, other

    cs.SI cs.LG

    Survey of Generative Methods for Social Media Analysis

    Authors: Stan Matwin, Aristides Milios, Paweł Prałat, Amilcar Soares, François Théberge

    Abstract: This survey draws a broad-stroke, panoramic picture of the State of the Art (SoTA) of the research in generative methods for the analysis of social media data. It fills a void, as the existing survey articles are either much narrower in their scope or are dated. We included two important aspects that currently gain importance in mining and modeling social media: dynamics and networks. Social dynam… ▽ More

    Submitted 13 December, 2021; originally announced December 2021.

  16. arXiv:2106.14130  [pdf, other

    cs.AI cs.RO

    Continuous Control with Deep Reinforcement Learning for Autonomous Vessels

    Authors: Nader Zare, Bruno Brandoli, Mahtab Sarvmaili, Amilcar Soares, Stan Matwin

    Abstract: Maritime autonomous transportation has played a crucial role in the globalization of the world economy. Deep Reinforcement Learning (DRL) has been applied to automatic path planning to simulate vessel collision avoidance situations in open seas. End-to-end approaches that learn complex map**s directly from the input have poor generalization to reach the targets in different environments. In this… ▽ More

    Submitted 26 June, 2021; originally announced June 2021.

  17. arXiv:2101.06484  [pdf, other

    cs.AI cs.IR cs.SI

    Artificial Intelligence for Emotion-Semantic Trending and People Emotion Detection During COVID-19 Social Isolation

    Authors: Hamed Jelodar, Rita Orji, Stan Matwin, Swarna Weerasinghe, Oladapo Oyebode, Yongli Wang

    Abstract: Taking advantage of social media platforms, such as Twitter, this paper provides an effective framework for emotion detection among those who are quarantined. Early detection of emotional feelings and their trends help implement timely intervention strategies. Given the limitations of medical diagnosis of early emotional change signs during the quarantine period, artificial intelligence models pro… ▽ More

    Submitted 16 January, 2021; originally announced January 2021.

  18. arXiv:2008.12833  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Pay Attention to Evolution: Time Series Forecasting with Deep Graph-Evolution Learning

    Authors: Gabriel Spadon, Shenda Hong, Bruno Brandoli, Stan Matwin, Jose F. Rodrigues-Jr, Jimeng Sun

    Abstract: Time-series forecasting is one of the most active research topics in artificial intelligence. Applications in real-world time series should consider two factors for achieving reliable predictions: modeling dynamic dependencies among multiple variables and adjusting the model's intrinsic hyperparameters. A still open gap in that literature is that statistical and ensemble learning approaches system… ▽ More

    Submitted 26 May, 2021; v1 submitted 28 August, 2020; originally announced August 2020.

    Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021

    MSC Class: 37M10; 68T07; 68T05; 68T37; 82C32 ACM Class: I.2; I.5; I.2.4; I.2.6; I.5.1

  19. arXiv:2008.10022  [pdf

    cs.CL cs.CY cs.IR cs.SI

    COVID-19 Pandemic: Identifying Key Issues using Social Media and Natural Language Processing

    Authors: Oladapo Oyebode, Chinenye Ndulue, Dinesh Mulchandani, Banuchitra Suruliraj, Ashfaq Adib, Fidelia Anulika Orji, Evangelos Milios, Stan Matwin, Rita Orji

    Abstract: The COVID-19 pandemic has affected people's lives in many ways. Social media data can reveal public perceptions and experience with respect to the pandemic, and also reveal factors that hamper or support efforts to curb global spread of the disease. In this paper, we analyzed COVID-19-related comments collected from six social media platforms using Natural Language Processing (NLP) techniques. We… ▽ More

    Submitted 23 August, 2020; originally announced August 2020.

    Comments: 12 pages, 7 figures, 3 tables

    Journal ref: Journal of Healthcare Informatics Research. 2022

  20. arXiv:2008.00563  [pdf, other

    cs.CL

    SemEval-2020 Task 5: Counterfactual Recognition

    Authors: Xiaoyu Yang, Stephen Obadinma, Huasha Zhao, Qiong Zhang, Stan Matwin, Xiaodan Zhu

    Abstract: We present a counterfactual recognition (CR) task, the shared Task 5 of SemEval-2020. Counterfactuals describe potential outcomes (consequents) produced by actions or circumstances that did not happen or cannot happen and are counter to the facts (antecedent). Counterfactual thinking is an important characteristic of the human cognitive system; it connects antecedents and consequents with causal r… ▽ More

    Submitted 2 August, 2020; originally announced August 2020.

    Comments: Task description paper of SemEval-2020 Task 5: Modelling Causal Reasoning in Language: Detecting Counterfactuals

  21. arXiv:2007.01388  [pdf, other

    cs.NE cs.CV cs.LG stat.ML

    Learn Faster and Forget Slower via Fast and Stable Task Adaptation

    Authors: Farshid Varno, Lucas May Petry, Lisa Di Jorio, Stan Matwin

    Abstract: Training Deep Neural Networks (DNNs) is still highly time-consuming and compute-intensive. It has been shown that adapting a pretrained model may significantly accelerate this process. With a focus on classification, we show that current fine-tuning techniques make the pretrained models catastrophically forget the transferred knowledge even before anything about the new task is learned. Such rapid… ▽ More

    Submitted 29 November, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: 52 pages, 15 figures, 1 table

  22. arXiv:2006.07516  [pdf, other

    cs.CY cs.LG cs.SI stat.ML

    Analyzing the Impact of Foursquare and Streetlight Data with Human Demographics on Future Crime Prediction

    Authors: Fateha Khanam Bappee, Lucas May Petry, Amilcar Soares, Stan Matwin

    Abstract: Finding the factors contributing to criminal activities and their consequences is essential to improve quantitative crime research. To respond to this concern, we examine an extensive set of features from different perspectives and explanations. Our study aims to build data-driven models for predicting future crime occurrences. In this paper, we propose the use of streetlight infrastructure and Fo… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

  23. arXiv:2006.04996  [pdf, other

    cs.LG cs.CV stat.ML

    Implicit Class-Conditioned Domain Alignment for Unsupervised Domain Adaptation

    Authors: Xiang Jiang, Qicheng Lao, Stan Matwin, Mohammad Havaei

    Abstract: We present an approach for unsupervised domain adaptation---with a strong focus on practical considerations of within-domain class imbalance and between-domain class distribution shift---from a class-conditioned domain alignment perspective. Current methods for class-conditioned domain alignment aim to explicitly minimize a loss function based on pseudo-label estimations of the target domain. Howe… ▽ More

    Submitted 8 June, 2020; originally announced June 2020.

    Comments: Accepted at ICML2020. For code, see https://github.com/xiangdal/implicit_alignment

    MSC Class: 68T07

  24. arXiv:2004.05222  [pdf

    cs.CY cs.SI

    Give more data, awareness and control to individual citizens, and they will help COVID-19 containment

    Authors: Mirco Nanni, Gennady Andrienko, Albert-László Barabási, Chiara Boldrini, Francesco Bonchi, Ciro Cattuto, Francesca Chiaromonte, Giovanni Comandé, Marco Conti, Mark Coté, Frank Dignum, Virginia Dignum, Josep Domingo-Ferrer, Paolo Ferragina, Fosca Giannotti, Riccardo Guidotti, Dirk Helbing, Kimmo Kaski, Janos Kertesz, Sune Lehmann, Bruno Lepri, Paul Lukowicz, Stan Matwin, David Megías Jiménez, Anna Monreale , et al. (14 additional authors not shown)

    Abstract: The rapid dynamics of COVID-19 calls for quick and effective tracking of virus transmission chains and early detection of outbreaks, especially in the phase 2 of the pandemic, when lockdown and other restriction measures are progressively withdrawn, in order to avoid or minimize contagion resurgence. For this purpose, contact-tracing apps are being proposed for large scale adoption by many countri… ▽ More

    Submitted 16 April, 2020; v1 submitted 10 April, 2020; originally announced April 2020.

    Comments: Revised text. Additional authors

    Journal ref: Transactions on Data Privacy 13(1): 61-66 (2020), http://www.tdp.cat/issues16/abs.a389a20.php

  25. arXiv:2004.03722  [pdf, other

    cs.LG stat.ML

    Challenges in Vessel Behavior and Anomaly Detection: From Classical Machine Learning to Deep Learning

    Authors: Lucas May Petry, Amilcar Soares, Vania Bogorny, Bruno Brandoli, Stan Matwin

    Abstract: The global expansion of maritime activities and the development of the Automatic Identification System (AIS) have driven the advances in maritime monitoring systems in the last decade. Monitoring vessel behavior is fundamental to safeguard maritime operations, protecting other vessels sailing the ocean and the marine fauna and flora. Given the enormous volume of vessel data continually being gener… ▽ More

    Submitted 7 April, 2020; originally announced April 2020.

    Comments: This is an extended version of the article Challenges in Vessel Behavior and Anomaly Detection: From Classical Machine Learning to Deep Learning, to be published by Springer in the proceedings of the 33rd Canadian Conference on Artificial Intelligence

  26. arXiv:2003.10249  [pdf, other

    cs.LG cs.AI

    Using Deep Reinforcement Learning Methods for Autonomous Vessels in 2D Environments

    Authors: Mohammad Etemad, Nader Zare, Mahtab Sarvmaili, Amilcar Soares, Bruno Brandoli Machado, Stan Matwin

    Abstract: Unmanned Surface Vehicles technology (USVs) is an exciting topic that essentially deploys an algorithm to safely and efficiently performs a mission. Although reinforcement learning is a well-known approach to modeling such a task, instability and divergence may occur when combining off-policy and function approximation. In this work, we used deep reinforcement learning combining Q-learning with a… ▽ More

    Submitted 23 March, 2020; originally announced March 2020.

  27. arXiv:2003.10248  [pdf, other

    cs.LG stat.ML

    Wise Sliding Window Segmentation: A classification-aided approach for trajectory segmentation

    Authors: Mohammad Etemad, Zahra Etemad, Amilcar Soares, Vania Bogorny, Stan Matwin, Luis Torgo

    Abstract: Large amounts of mobility data are being generated from many different sources, and several data mining methods have been proposed for this data. One of the most critical steps for trajectory data mining is segmentation. This task can be seen as a pre-processing step in which a trajectory is divided into several meaningful consecutive sub-sequences. This process is necessary because trajectory pat… ▽ More

    Submitted 23 March, 2020; originally announced March 2020.

  28. arXiv:2002.03746  [pdf, other

    cs.CV cs.LG

    Black Box Explanation by Learning Image Exemplars in the Latent Feature Space

    Authors: Riccardo Guidotti, Anna Monreale, Stan Matwin, Dino Pedreschi

    Abstract: We present an approach to explain the decisions of black box models for image classification. While using the black box to label images, our explanation method exploits the latent feature space learned through an adversarial autoencoder. The proposed method first generates exemplar images in the latent feature space and learns a decision tree classifier. Then, it selects and decodes exemplars resp… ▽ More

    Submitted 27 January, 2020; originally announced February 2020.

  29. arXiv:2001.09127  [pdf, other

    eess.AS cs.LG cs.SD

    Performance of a Deep Neural Network at Detecting North Atlantic Right Whale Upcalls

    Authors: Oliver S. Kirsebom, Fabio Frazao, Yvan Simard, Nathalie Roy, Stan Matwin, Samuel Giard

    Abstract: Passive acoustics provides a powerful tool for monitoring the endangered North Atlantic right whale ($Eubalaena$ $glacialis$), but robust detection algorithms are needed to handle diverse and variable acoustic conditions and differences in recording techniques and equipment. Here, we investigate the potential of deep neural networks for addressing this need. ResNet, an architecture commonly used f… ▽ More

    Submitted 29 February, 2020; v1 submitted 24 January, 2020; originally announced January 2020.

    Comments: 11 pages, 9 figures, 2 tables, submitted to JASA on Dec 22, 2019, as part of a special issue on The Effects of Noise on Aquatic Life; resubmitted on Feb 29, 2020, upon minor revisions and improved SNR estimates

  30. arXiv:1909.01067  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Multimodal Deep Learning for Mental Disorders Prediction from Audio Speech Samples

    Authors: Habibeh Naderi, Behrouz Haji Soleimani, Stan Matwin

    Abstract: Key features of mental illnesses are reflected in speech. Our research focuses on designing a multimodal deep learning structure that automatically extracts salient features from recorded speech samples for predicting various mental disorders including depression, bipolar, and schizophrenia. We adopt a variety of pre-trained models to extract embeddings from both audio and text segments. We use se… ▽ More

    Submitted 13 April, 2020; v1 submitted 3 September, 2019; originally announced September 2019.

    Comments: arXiv admin note: text overlap with arXiv:1811.09362 by other authors

  31. arXiv:1908.05103  [pdf, other

    cs.LG cs.AI stat.ML

    Unsupervised Behavior Change Detection in Multidimensional Data Streams for Maritime Traffic Monitoring

    Authors: Lucas May Petry, Amilcar Soares, Vania Bogorny, Stan Matwin

    Abstract: The worldwide growth of maritime traffic and the development of the Automatic Identification System (AIS) has led to advances in monitoring systems for preventing vessel accidents and detecting illegal activities. In this work, we describe research gaps and challenges in machine learning for vessel behavior change and event detection, considering several constraints imposed by real-time data strea… ▽ More

    Submitted 14 August, 2019; originally announced August 2019.

    Comments: Extended abstract submitted to the 2019 Montreal Artificial Intelligence Symposium (MAIS)

  32. arXiv:1907.13188  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Marine Mammal Species Classification using Convolutional Neural Networks and a Novel Acoustic Representation

    Authors: Mark Thomas, Bruce Martin, Katie Kowarski, Briand Gaudet, Stan Matwin

    Abstract: Research into automated systems for detecting and classifying marine mammals in acoustic recordings is expanding internationally due to the necessity to analyze large collections of data for conservation purposes. In this work, we present a Convolutional Neural Network that is capable of classifying the vocalizations of three species of whales, non-biological sources of noise, and a fifth class pe… ▽ More

    Submitted 30 July, 2019; originally announced July 2019.

    Comments: 16 pages, To appear in ECML-PKDD 2019

  33. arXiv:1905.10698  [pdf, other

    cs.LG cs.CV cs.NE

    Efficient Neural Task Adaptation by Maximum Entropy Initialization

    Authors: Farshid Varno, Behrouz Haji Soleimani, Marzie Saghayi, Lisa Di Jorio, Stan Matwin

    Abstract: Transferring knowledge from one neural network to another has been shown to be helpful for learning tasks with few training examples. Prevailing fine-tuning methods could potentially contaminate pre-trained features by comparably high energy random noise. This noise is mainly delivered from a careless replacement of task-specific parameters. We analyze theoretically such knowledge contamination fo… ▽ More

    Submitted 11 July, 2019; v1 submitted 25 May, 2019; originally announced May 2019.

  34. arXiv:1902.10584  [pdf

    cs.CL

    When a Tweet is Actually Sexist. A more Comprehensive Classification of Different Online Harassment Categories and The Challenges in NLP

    Authors: Sima Sharifirad, Stan Matwin

    Abstract: Sexism is very common in social media and makes the boundaries of freedom tighter for feminist and female users. There is still no comprehensive classification of sexism attracting natural language processing techniques. Categorizing sexism in social media in the categories of hostile or benevolent sexism are so general that simply ignores the other types of sexism happening in these media. This p… ▽ More

    Submitted 27 February, 2019; originally announced February 2019.

  35. arXiv:1902.04980  [pdf, other

    eess.AS cs.LG cs.SD

    Recurrent Neural Networks with Stochastic Layers for Acoustic Novelty Detection

    Authors: Duong Nguyen, Oliver S. Kirsebom, Fábio Frazão, Ronan Fablet, Stan Matwin

    Abstract: In this paper, we adapt Recurrent Neural Networks with Stochastic Layers, which are the state-of-the-art for generating text, music and speech, to the problem of acoustic novelty detection. By integrating uncertainty into the hidden states, this type of network is able to learn the distribution of complex sequences. Because the learned distribution can be calculated explicitly in terms of probabil… ▽ More

    Submitted 13 February, 2019; originally announced February 2019.

    Comments: Accepted to ICASSP 2019

  36. arXiv:1902.03089  [pdf

    cs.SI cs.CL cs.LG stat.ML

    How is Your Mood When Writing Sexist tweets? Detecting the Emotion Type and Intensity of Emotion Using Natural Language Processing Techniques

    Authors: Sima Sharifirad, Borna Jafarpour, Stan Matwin

    Abstract: Online social platforms have been the battlefield of users with different emotions and attitudes toward each other in recent years. While sexism has been considered as a category of hateful speech in the literature, there is no comprehensive definition and category of sexism attracting natural language processing techniques. Categorizing sexism as either benevolent or hostile sexism is so broad th… ▽ More

    Submitted 28 January, 2019; originally announced February 2019.

  37. arXiv:1902.01108  [pdf, other

    cs.LG stat.ML

    2-D Embedding of Large and High-dimensional Data with Minimal Memory and Computational Time Requirements

    Authors: Witold Dzwinel, Rafal Wcislo, Stan Matwin

    Abstract: In the advent of big data era, interactive visualization of large data sets consisting of M*10^5+ high-dimensional feature vectors of length N (N ~ 10^3+), is an indispensable tool for data exploratory analysis. The state-of-the-art data embedding (DE) methods of N-D data into 2-D (3-D) visually perceptible space (e.g., based on t-SNE concept) are too demanding computationally to be efficiently em… ▽ More

    Submitted 4 February, 2019; originally announced February 2019.

  38. arXiv:1812.10924  [pdf, other

    cs.LG stat.ML

    Improving the Interpretability of Deep Neural Networks with Knowledge Distillation

    Authors: Xuan Liu, Xiaoguang Wang, Stan Matwin

    Abstract: Deep Neural Networks have achieved huge success at a wide spectrum of applications from language modeling, computer vision to speech recognition. However, nowadays, good performance alone is not sufficient to satisfy the needs of practical deployment where interpretability is demanded for cases involving ethics and mission critical applications. The complex models of Deep Neural Networks make it h… ▽ More

    Submitted 28 December, 2018; originally announced December 2018.

    Comments: 2018 IEEE International Conference on Data Mining (ICDM), in press

  39. arXiv:1808.03096  [pdf, other

    cs.AI cs.LG stat.ML

    On feature selection and evaluation of transportation mode prediction strategies

    Authors: Mohammad Etemad, Amilcar Soares Junior, Stan Matwin

    Abstract: Transportation modes prediction is a fundamental task for decision making in smart cities and traffic management systems. Traffic policies designed based on trajectory mining can save money and time for authorities and the public. It may reduce the fuel consumption and commute time and moreover, may provide more pleasant moments for residents and tourists. Since the number of features that may be… ▽ More

    Submitted 5 September, 2018; v1 submitted 9 August, 2018; originally announced August 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1807.10876

  40. arXiv:1806.00852  [pdf, other

    cs.LG cs.AI stat.ML

    On the Importance of Attention in Meta-Learning for Few-Shot Text Classification

    Authors: Xiang Jiang, Mohammad Havaei, Gabriel Chartrand, Hassan Chouaib, Thomas Vincent, Andrew Jesson, Nicolas Chapados, Stan Matwin

    Abstract: Current deep learning based text classification methods are limited by their ability to achieve fast learning and generalization when the data is scarce. We address this problem by integrating a meta-learning procedure that uses the knowledge learned across many tasks as an inductive bias towards better natural language understanding. Based on the Model-Agnostic Meta-Learning framework (MAML), we… ▽ More

    Submitted 3 June, 2018; originally announced June 2018.

    Comments: 13 pages, 4 figures, submitted to NIPS

  41. Predicting Crime Using Spatial Features

    Authors: Fateha Khanam Bappee, Amilcar Soares Junior, Stan Matwin

    Abstract: Our study aims to build a machine learning model for crime prediction using geospatial features for different categories of crime. The reverse geocoding technique is applied to retrieve open street map (OSM) spatial data. This study also proposes finding hotpoints extracted from crime hotspots area found by Hierarchical Density-Based Spatial Clustering of Applications with Noise (HDBSCAN). A spati… ▽ More

    Submitted 12 March, 2018; originally announced March 2018.

    Comments: Paper accepted to 31st Canadian Conference in Artificial Intelligence, 2018

  42. Predicting Transportation Modes of GPS Trajectories using Feature Engineering and Noise Removal

    Authors: Mohammad Etemad, Amilcar Soares Junior, Stan Matwin

    Abstract: Understanding transportation mode from GPS (Global Positioning System) traces is an essential topic in the data mobility domain. In this paper, a framework is proposed to predict transportation modes. This framework follows a sequence of five steps: (i) data preparation, where GPS points are grouped in trajectory samples; (ii) point features generation; (iii) trajectory features extraction; (iv) n… ▽ More

    Submitted 27 February, 2018; originally announced February 2018.

    Comments: 6 pages

  43. arXiv:1802.09059  [pdf, other

    cs.LG cs.CL cs.IR stat.ML

    One Single Deep Bidirectional LSTM Network for Word Sense Disambiguation of Text Data

    Authors: Ahmad Pesaranghader, Ali Pesaranghader, Stan Matwin, Marina Sokolova

    Abstract: Due to recent technical and scientific advances, we have a wealth of information hidden in unstructured text data such as offline/online narratives, research articles, and clinical reports. To mine these data properly, attributable to their innate ambiguity, a Word Sense Disambiguation (WSD) algorithm can avoid numbers of difficulties in Natural Language Processing (NLP) pipeline. However, conside… ▽ More

    Submitted 25 February, 2018; originally announced February 2018.

    Comments: 12 pages, 1 figure, to appear in the Proceedings of the 31st Canadian Conference on Artificial Intelligence, 8-11 May, 2018, Toronto, Canada

  44. arXiv:1802.00560  [pdf, other

    cs.LG cs.AI stat.ML

    Interpretable Deep Convolutional Neural Networks via Meta-learning

    Authors: Xuan Liu, Xiaoguang Wang, Stan Matwin

    Abstract: Model interpretability is a requirement in many applications in which crucial decisions are made by users relying on a model's outputs. The recent movement for "algorithmic fairness" also stipulates explainability, and therefore interpretability of learning models. And yet the most successful contemporary Machine Learning approaches, the Deep Neural Networks, produce models that are highly non-int… ▽ More

    Submitted 18 August, 2018; v1 submitted 2 February, 2018; originally announced February 2018.

    Comments: 9 pages, 9 figures, 2018 International Joint Conference on Neural Networks, in press

  45. arXiv:1705.02636  [pdf, other

    cs.CV cs.AI cs.LG

    TrajectoryNet: An Embedded GPS Trajectory Representation for Point-based Classification Using Recurrent Neural Networks

    Authors: Xiang Jiang, Erico N de Souza, Ahmad Pesaranghader, Baifan Hu, Daniel L. Silver, Stan Matwin

    Abstract: Understanding and discovering knowledge from GPS (Global Positioning System) traces of human activities is an essential topic in mobility-based urban computing. We propose TrajectoryNet-a neural network architecture for point-based trajectory classification to infer real world human transportation modes from GPS traces. To overcome the challenge of capturing the underlying latent factors in the lo… ▽ More

    Submitted 30 August, 2017; v1 submitted 7 May, 2017; originally announced May 2017.

    ACM Class: I.2.6; H.2.8; I.2.1

  46. arXiv:1702.08866  [pdf

    cs.CL

    Studying Positive Speech on Twitter

    Authors: Marina Sokolova, Vera Sazonova, Kanyi Huang, Rudraneel Chakraboty, Stan Matwin

    Abstract: We present results of empirical studies on positive speech on Twitter. By positive speech we understand speech that works for the betterment of a given situation, in this case relations between different communities in a conflict-prone country. We worked with four Twitter data sets. Through semi-manual opinion mining, we found that positive speech accounted for < 1% of the data . In fully automate… ▽ More

    Submitted 24 February, 2017; originally announced February 2017.

    Comments: 13 pages, 6 tables

    ACM Class: I.2.6; I.2.7

  47. arXiv:1702.04956  [pdf, other

    cs.LG cs.AI stat.ML

    Reflexive Regular Equivalence for Bipartite Data

    Authors: Aaron Gerow, Mingyang Zhou, Stan Matwin, Feng Shi

    Abstract: Bipartite data is common in data engineering and brings unique challenges, particularly when it comes to clustering tasks that impose on strong structural assumptions. This work presents an unsupervised method for assessing similarity in bipartite data. Similar to some co-clustering methods, the method is based on regular equivalence in graphs. The algorithm uses spectral properties of a bipartite… ▽ More

    Submitted 16 February, 2017; originally announced February 2017.

    Comments: A condensed version of this paper will appear in Proceedings of the 30th Canadian Conference on Artificial Intelligence, Edmonton, Alberta, Canada

  48. arXiv:1608.02519  [pdf

    cs.SI cs.CL

    Topic Modelling and Event Identification from Twitter Textual Data

    Authors: Marina Sokolova, Kanyi Huang, Stan Matwin, Joshua Ramisch, Vera Sazonova, Renee Black, Chris Orwa, Sidney Ochieng, Nanjira Sambuli

    Abstract: The tremendous growth of social media content on the Internet has inspired the development of the text analytics to understand and solve real-life problems. Leveraging statistical topic modelling helps researchers and practitioners in better comprehension of textual content as well as provides useful information for further analysis. Statistical topic modelling becomes especially important when we… ▽ More

    Submitted 8 August, 2016; originally announced August 2016.

    Comments: 17 pages, 2 figures, 5 tables

    ACM Class: D.4.8; H.1.2; H.2.8; I.2.7

  49. arXiv:1602.01937  [pdf

    cs.CR cs.CY cs.IR cs.SI

    YOURPRIVACYPROTECTOR, A recommender system for privacy settings in social networks

    Authors: Kambiz Ghazinour, Stan Matwin, Marina Sokolova

    Abstract: Ensuring privacy of users of social networks is probably an unsolvable conundrum. At the same time, an informed use of the existing privacy options by the social network participants may alleviate - or even prevent - some of the more drastic privacy-averse incidents. Unfortunately, recent surveys show that an average user is either not aware of these options or does not use them, probably due to t… ▽ More

    Submitted 5 February, 2016; originally announced February 2016.

    Comments: 15 pages, International journal of security, privacy and trust management. (IJSPTM) Volume 2, No 4, Aug. 2013

    Journal ref: International journal of security, privacy and trust management. (IJSPTM) Volume 2, No 4, Aug. 2013

  50. arXiv:1412.8412   

    cs.CR

    Sanitization of Call Detail Records via Differentially-private Summaries

    Authors: Mohammad Alaggan, Sébastien Gambs, Stan Matwin, Eriko Souza, Mohammed Tuhin

    Abstract: In this work, we initiate the study of human mobility from sanitized call detail records (CDRs). Such data can be extremely valuable to solve important societal issues such as the improvement of urban transportation or the understanding on the spread of diseases. One of the fundamental building block for such study is the computation of mobility patterns summarizing how individuals move during a g… ▽ More

    Submitted 31 December, 2014; v1 submitted 29 December, 2014; originally announced December 2014.

    Comments: Withdrawn due to some possible agreement issues