Showing 1–42 of 42 results for author: Zaiane, O

  1. arXiv:2407.05248  [pdf, other]

    cs.CV

    Self-Paced Sample Selection for Barely-Supervised Medical Image Segmentation

    Authors: Junming Su, Zhiqiang Shen, Peng Cao, Jinzhu Yang, Osmar R. Zaiane

    Abstract: The existing barely-supervised medical image segmentation (BSS) methods, adopting a registration-segmentation paradigm, aim to learn from data with very few annotations to mitigate the extreme label scarcity problem. However, this paradigm poses a challenge: pseudo-labels generated by image registration come with significant noise. To address this issue, we propose a self-paced sample selection fr…

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: Accepted to MICCAI 2024

  2. arXiv:2406.01919  [pdf, other]

    cs.CL

    OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors Detection

    Authors: Chenyang Huang, Abbas Ghaddar, Ivan Kobyzev, Mehdi Rezagholizadeh, Osmar R. Zaiane, Boxing Chen

    Abstract: Recently, there has been considerable attention on detecting hallucinations and omissions in Machine Translation (MT) systems. The two dominant approaches to tackle this task involve analyzing the MT system's internal states or relying on the output of external tools, such as sentence similarity or MT quality estimators. In this work, we introduce OTTAWA, a novel Optimal Transport (OT)-based word…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted by ACL 2024 Findings

  3. arXiv:2405.09777  [pdf, other]

    cs.CV

    Rethinking Barely-Supervised Segmentation from an Unsupervised Domain Adaptation Perspective

    Authors: Zhiqiang Shen, Peng Cao, Junming Su, Jinzhu Yang, Osmar R. Zaiane

    Abstract: This paper investigates an extremely challenging problem, barely-supervised medical image segmentation (BSS), where the training dataset comprises limited labeled data with only single-slice annotations and numerous unlabeled images. Currently, state-of-the-art (SOTA) BSS methods utilize a registration-based paradigm, depending on image registration to propagate single-slice annotations into volum…

    Submitted 15 May, 2024; originally announced May 2024.

  4. arXiv:2404.04887  [pdf, other]

    cs.CV

    A Clinical-oriented Multi-level Contrastive Learning Method for Disease Diagnosis in Low-quality Medical Images

    Authors: Qingshan Hou, Shuai Cheng, Peng Cao, Jinzhu Yang, Xiaoli Liu, Osmar R. Zaiane, Yih Chung Tham

    Abstract: Representation learning offers a conduit to elucidate distinctive features within the latent space and interpret the deep models. However, the randomness of lesion distribution and the complexity of low-quality factors in medical images pose great challenges for models to extract key lesion features. Disease diagnosis methods guided by contrastive learning (CL) have shown significant advantages in…

    Submitted 7 April, 2024; originally announced April 2024.

  5. arXiv:2312.15182  [pdf, other]

    eess.IV cs.CV cs.LG

    Narrowing the semantic gaps in U-Net with learnable skip connections: The case of medical image segmentation

    Authors: Haonan Wang, Peng Cao, Xiaoli Liu, Jinzhu Yang, Osmar Zaiane

    Abstract: Most state-of-the-art methods for medical image segmentation adopt the encoder-decoder architecture. However, this U-shaped framework still has limitations in capturing the non-local multi-scale information with a simple skip connection. To solve the problem, we firstly explore the potential weakness of skip connections in U-Net on multiple segmentation tasks, and find that i) not all skip connect…

    Submitted 23 December, 2023; originally announced December 2023.

  6. arXiv:2311.04229  [pdf, other]

    eess.SP cs.LG

    Exploring Best Practices for ECG Signal Processing in Machine Learning

    Authors: Amir Salimi, Sunil Vasu Kalmady, Abram Hindle, Osmar Zaiane, Padma Kaul

    Abstract: In this work we search for best practices in pre-processing of Electrocardiogram (ECG) signals in order to train better classifiers for the diagnosis of heart conditions. State of the art machine learning algorithms have achieved remarkable results in classification of some heart conditions using ECG data, yet there appears to be no consensus on pre-processing best practices. Is this lack of conse…

    Submitted 2 November, 2023; originally announced November 2023.

  7. arXiv:2301.06943  [pdf, other]

    eess.IV cs.CV

    Self-supervised Domain Adaptation for Breaking the Limits of Low-quality Fundus Image Quality Enhancement

    Authors: Qingshan Hou, Peng Cao, Jiaqi Wang, Xiaoli Liu, Jinzhu Yang, Osmar R. Zaiane

    Abstract: Retinal fundus images have been applied for the diagnosis and screening of eye diseases, such as Diabetic Retinopathy (DR) or Diabetic Macular Edema (DME). However, both low-quality fundus images and style inconsistency potentially increase uncertainty in the diagnosis of fundus disease and even lead to misdiagnosis by ophthalmologists. Most of the existing image enhancement methods mainly focus o…

    Submitted 17 January, 2023; originally announced January 2023.

  8. arXiv:2301.04465  [pdf, ps, other]

    cs.CV

    Co-training with High-Confidence Pseudo Labels for Semi-supervised Medical Image Segmentation

    Authors: Zhiqiang Shen, Peng Cao, Hua Yang, Xiaoli Liu, Jinzhu Yang, Osmar R. Zaiane

    Abstract: Consistency regularization and pseudo labeling-based semi-supervised methods perform co-training using the pseudo labels from multi-view inputs. However, such co-training models tend to converge early to a consensus, degenerating to the self-training ones, and produce low-confidence pseudo labels from the perturbed inputs during training. To address these issues, we propose an Uncertainty-guided C…

    Submitted 26 May, 2023; v1 submitted 11 January, 2023; originally announced January 2023.

  9. arXiv:2211.07717  [pdf, other]

    cs.CL cs.LG

    Deep Temporal Modelling of Clinical Depression through Social Media Text

    Authors: Nawshad Farruque, Randy Goebel, Sudhakar Sivapalan, Osmar R. Zaïane

    Abstract: We describe the development of a model to detect user-level clinical depression based on a user's temporal social media posts. Our model uses a Depression Symptoms Detection (DSD) classifier, which is trained on the largest existing samples of clinician annotated tweets for clinical depression symptoms. We subsequently use our DSD model to extract clinically relevant features, e.g., depression sco…

    Submitted 30 March, 2023; v1 submitted 28 October, 2022; originally announced November 2022.

    Comments: Tables are properly oriented and some more typos were fixed

  10. arXiv:2209.02765  [pdf, other]

    cs.CL cs.AI cs.LG

    Depression Symptoms Modelling from Social Media Text: A Semi-supervised Learning Approach

    Authors: Nawshad Farruque, Randy Goebel, Sudhakar Sivapalan, Osmar Zaiane

    Abstract: A fundamental component of user-level social media language based clinical depression modelling is depression symptoms detection (DSD). Unfortunately, there does not exist any DSD dataset that reflects both the clinical insights and the distribution of depression symptoms from the samples of self-disclosed depressed population. In our work, we describe a Semi-supervised Learning (SSL) framework wh…

    Submitted 28 September, 2022; v1 submitted 6 September, 2022; originally announced September 2022.

    Comments: Title and relevant changes are made

  11. arXiv:2204.10757  [pdf, other]

    cs.CL

    FaithDial: A Faithful Benchmark for Information-Seeking Dialogue

    Authors: Nouha Dziri, Ehsan Kamalloo, Sivan Milton, Osmar Zaiane, Mo Yu, Edoardo M. Ponti, Siva Reddy

    Abstract: The goal of information-seeking dialogue is to respond to seeker queries with natural language utterances that are grounded on knowledge sources. However, dialogue systems often produce unsupported utterances, a phenomenon known as hallucination. To mitigate this behavior, we adopt a data-centric solution and create FaithDial, a new benchmark for hallucination-free dialogues, by editing hallucinat…

    Submitted 23 October, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: TACL 2022 (20 pages, 3 figures, 10 tables)

  12. arXiv:2204.09081  [pdf, other]

    cs.CL

    Named Entity Recognition for Partially Annotated Datasets

    Authors: Michael Strobl, Amine Trabelsi, Osmar Zaiane

    Abstract: The most common Named Entity Recognizers are usually sequence taggers trained on fully annotated corpora, i.e. the class of all words for all entities is known. Partially annotated corpora, i.e. some but not all entities of some types are annotated, are too noisy for training sequence taggers since the same entity may be annotated one time with its true type but not another time, misleading the ta…

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: Long version of our short paper accepted at NLDB 2022

  13. arXiv:2204.07931  [pdf, other]

    cs.CL

    On the Origin of Hallucinations in Conversational Models: Is it the Datasets or the Models?

    Authors: Nouha Dziri, Sivan Milton, Mo Yu, Osmar Zaiane, Siva Reddy

    Abstract: Knowledge-grounded conversational models are known to suffer from producing factually invalid statements, a phenomenon commonly called hallucination. In this work, we investigate the underlying causes of this phenomenon: is hallucination due to the training data, or to the models? We conduct a comprehensive human study on both existing knowledge-grounded conversational benchmarks and several state…

    Submitted 17 April, 2022; originally announced April 2022.

    Comments: NAACL 2022, 14 pages

  14. arXiv:2204.07150  [pdf, other]

    cs.CL

    FREDA: Flexible Relation Extraction Data Annotation

    Authors: Michael Strobl, Amine Trabelsi, Osmar Zaiane

    Abstract: To effectively train accurate Relation Extraction models, sufficient and properly labeled data is required. Adequately labeled data is difficult to obtain and annotating such data is a tricky undertaking. Previous works have shown that either accuracy has to be sacrificed or the task is extremely time-consuming, if done accurately. We are proposing an approach in order to produce high-quality data…

    Submitted 14 December, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted at ACM SAC 2023 Knowledge and Natural Language Processing track

  15. arXiv:2203.07990  [pdf]

    cs.MM cs.AI cs.CL

    UofA-Truth at Factify 2022 : Transformer And Transfer Learning Based Multi-Modal Fact-Checking

    Authors: Abhishek Dhankar, Osmar R. Zaïane, Francois Bolduc

    Abstract: Identifying fake news is a very difficult task, especially when considering the multiple modes of conveying information through text, image, video and/or audio. We attempted to tackle the problem of automated misinformation/disinformation detection in multi-modal news sources (including text and images) through our simple, yet effective, approach in the FACTIFY shared task at De-Factify@AAAI2022.…

    Submitted 28 January, 2022; originally announced March 2022.

  16. arXiv:2110.07515  [pdf, other]

    cs.CL

    Non-Autoregressive Translation with Layer-Wise Prediction and Deep Supervision

    Authors: Chenyang Huang, Hao Zhou, Osmar R. Zaïane, Lili Mou, Lei Li

    Abstract: How do we perform efficient inference while retaining high translation quality? Existing neural machine translation models, such as Transformer, achieve high performance, but they decode words one by one, which is inefficient. Recent non-autoregressive translation models speed up the inference, but their quality is still inferior. In this work, we propose DSLP, a highly efficient and high-performa…

    Submitted 14 October, 2021; originally announced October 2021.

  17. Simulated Annealing for Emotional Dialogue Systems

    Authors: Chengzhang Dong, Chenyang Huang, Osmar Zaïane, Lili Mou

    Abstract: Explicitly modeling emotions in dialogue generation has important applications, such as building empathetic personal companions. In this study, we consider the task of expressing a specific emotion for dialogue generation. Previous approaches take the emotion as an input signal, which may be ignored during inference. We instead propose a search-based emotional dialogue system by simulated annealin…

    Submitted 22 September, 2021; originally announced September 2021.

    MSC Class: 68T50

  18. arXiv:2109.04335  [pdf, other]

    cs.CV cs.LG eess.IV

    UCTransNet: Rethinking the Skip Connections in U-Net from a Channel-wise Perspective with Transformer

    Authors: Haonan Wang, Peng Cao, Jiaqi Wang, Osmar R. Zaiane

    Abstract: Most recent semantic segmentation methods adopt a U-Net framework with an encoder-decoder architecture. It is still challenging for U-Net with a simple skip connection scheme to model the global multi-scale context: 1) Not each skip connection setting is effective due to the issue of incompatible feature sets of encoder and decoder stage, even some skip connection negatively influence the segmenta…

    Submitted 24 January, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: Accepted by AAAI 2022. Code is available at https://github.com/McGregorWwww/UCTransNet

  19. arXiv:2106.12797  [pdf, other]

    cs.CL cs.AI cs.LG

    A comprehensive empirical analysis on cross-domain semantic enrichment for detection of depressive language

    Authors: Nawshad Farruque, Randy Goebel, Osmar Zaiane

    Abstract: We analyze the process of creating word embedding feature representations designed for a learning task when annotated data is scarce, for example, in depressive language detection from Tweets. We start with a rich word embedding pre-trained from a large general dataset, which is then augmented with embeddings learned from a much smaller and more specific domain dataset through a simple non-linear…

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: This is an extension over ECML-PKDD, 2019 paper "Augmenting Semantic Representation of Depressive Language: from Forums to Microblogs", with more embedding mapping/augmentation methods and data ablation tests. These experiments were done in the year 2019

  20. arXiv:2106.10928  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    STEP-EZ: Syntax Tree guided semantic ExPlanation for Explainable Zero-shot modeling of clinical depression symptoms from text

    Authors: Nawshad Farruque, Randy Goebel, Osmar Zaiane, Sudhakar Sivapalan

    Abstract: We focus on exploring various approaches of Zero-Shot Learning (ZSL) and their explainability for a challenging yet important supervised learning task notorious for training data scarcity, i.e. Depression Symptoms Detection (DSD) from text. We start with a comprehensive synthesis of different components of our ZSL modeling and analysis of our ground truth samples and Depression symptom clues curat…

    Submitted 23 June, 2021; v1 submitted 21 June, 2021; originally announced June 2021.

    Comments: Fixed an algorithm

  21. arXiv:2106.03376  [pdf, other]

    cs.CL

    A Globally Normalized Neural Model for Semantic Parsing

    Authors: Chenyang Huang, Wei Yang, Yanshuai Cao, Osmar Zaïane, Lili Mou

    Abstract: In this paper, we propose a globally normalized model for context-free grammar (CFG)-based semantic parsing. Instead of predicting a probability, our model predicts a real-valued score at each step and does not suffer from the label bias problem. Experiments show that our approach outperforms locally normalized models on small datasets, but it does not yield improvement on a large dataset.

    Submitted 7 June, 2021; originally announced June 2021.

  22. arXiv:2105.12364  [pdf, other]

    cs.LG cs.AI

    Basic and Depression Specific Emotion Identification in Tweets: Multi-label Classification Experiments

    Authors: Nawshad Farruque, Chenyang Huang, Osmar Zaiane, Randy Goebel

    Abstract: In this paper, we present empirical analysis on basic and depression specific multi-emotion mining in Tweets with the help of state of the art multi-label classifiers. We choose our basic emotions from a hybrid emotion model consisting of the common emotions from four highly regarded psychological models of emotions. Moreover, we augment that emotion model with new emotion categories because of th…

    Submitted 21 June, 2021; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: Accepted at CICLing, 2019

  23. arXiv:2104.08455  [pdf, other]

    cs.CL

    Neural Path Hunter: Reducing Hallucination in Dialogue Systems via Path Grounding

    Authors: Nouha Dziri, Andrea Madotto, Osmar Zaiane, Avishek Joey Bose

    Abstract: Dialogue systems powered by large pre-trained language models (LM) exhibit an innate ability to deliver fluent and natural-looking responses. Despite their impressive generation performance, these models can often generate factually incorrect statements impeding their widespread adoption. In this paper, we focus on the task of improving the faithfulness -- and thus reduce hallucination -- of Neura…

    Submitted 14 September, 2021; v1 submitted 17 April, 2021; originally announced April 2021.

    Comments: EMNLP 2021 18 pages

  24. arXiv:2007.01972  [pdf, other]

    cs.LG stat.ML

    Building a Competitive Associative Classifier

    Authors: Nitakshi Sood, Osmar Zaiane

    Abstract: With the huge success of deep learning, other machine learning paradigms have had to take back seat. Yet other models, particularly rule-based, are more readable and explainable and can even be competitive when labelled data is not abundant. However, most of the existing rule-based classifiers suffer from the production of a large number of classification rules, affecting the model readability. Th…

    Submitted 3 July, 2020; originally announced July 2020.

    Comments: To be published in - The 22nd International Conference on Big Data Analytics and Knowledge Discovery - DaWaK2020, Bratislava, Slovakia, September 14-17, 2020

  25. arXiv:2006.16403  [pdf, other]

    cs.CL

    ANA at SemEval-2020 Task 4: mUlti-task learNIng for cOmmonsense reasoNing (UNION)

    Authors: Anandh Perumal, Chenyang Huang, Amine Trabelsi, Osmar R. Zaïane

    Abstract: In this paper, we describe our mUlti-task learNIng for cOmmonsense reasoNing (UNION) system submitted for Task C of the SemEval2020 Task 4, which is to generate a reason explaining why a given false statement is non-sensical. However, we found in the early experiments that simple adaptations such as fine-tuning GPT2 often yield dull and non-informative generations (e.g. simple negations). In order…

    Submitted 29 June, 2020; originally announced June 2020.

    Comments: 7 pages, 1 figure, 3 tables, SemEval 2020

  26. U$^2$-Net: Going Deeper with Nested U-Structure for Salient Object Detection

    Authors: Xuebin Qin, Zichen Zhang, Chenyang Huang, Masood Dehghan, Osmar R. Zaiane, Martin Jagersand

    Abstract: In this paper, we design a simple yet powerful deep network architecture, U$^2$-Net, for salient object detection (SOD). The architecture of our U$^2$-Net is a two-level nested U-structure. The design has the following advantages: (1) it is able to capture more contextual information from different scales thanks to the mixture of receptive fields of different sizes in our proposed ReSidual U-block…

    Submitted 8 March, 2022; v1 submitted 18 May, 2020; originally announced May 2020.

    Comments: Accepted in Pattern Recognition 2020

  27. arXiv:2005.01908  [pdf, other]

    cs.AI cs.LG

    A multi-component framework for the analysis and design of explainable artificial intelligence

    Authors: S. Atakishiyev, H. Babiker, N. Farruque, R. Goebel, M-Y. Kim, M. H. Motallebi, J. Rabelo, T. Syed, O. R. Zaïane

    Abstract: The rapid growth of research in explainable artificial intelligence (XAI) follows on two substantial developments. First, the enormous application success of modern machine learning methods, especially deep and reinforcement learning, which have created high expectations for industrial, commercial and social value. Second, the emergence of concern for creating trusted AI systems, including the cre…

    Submitted 4 May, 2020; originally announced May 2020.

    Comments: 39 pages

  28. arXiv:2001.09403  [pdf, other]

    cs.AI

    Sentiment and Knowledge Based Algorithmic Trading with Deep Reinforcement Learning

    Authors: Abhishek Nan, Anandh Perumal, Osmar R. Zaiane

    Abstract: Algorithmic trading, due to its inherent nature, is a difficult problem to tackle; there are too many variables involved in the real world which make it almost impossible to have reliable algorithms for automated stock trading. The lack of reliable labelled data that considers physical and physiological factors that dictate the ups and downs of the market, has hindered the supervised learning atte…

    Submitted 26 January, 2020; originally announced January 2020.

  29. arXiv:1911.02147  [pdf, other]

    cs.CL cs.LG

    Seq2Emo for Multi-label Emotion Classification Based on Latent Variable Chains Transformation

    Authors: Chenyang Huang, Amine Trabelsi, Xuebin Qin, Nawshad Farruque, Osmar R. Zaïane

    Abstract: Emotion detection in text is an important task in NLP and is essential in many applications. Most of the existing methods treat this task as a problem of single-label multi-class text classification. To predict multiple emotions for one instance, most of the existing works regard it as a general Multi-label Classification (MLC) problem, where they usually either apply a manually determined thresho…

    Submitted 7 November, 2019; v1 submitted 5 November, 2019; originally announced November 2019.

    Comments: 10 pages, 2 figures, 5 tables

  30. arXiv:1909.05246  [pdf, ps, other]

    cs.CL cs.LG stat.ML

    Self-Attentional Models Application in Task-Oriented Dialogue Generation Systems

    Authors: Mansour Saffar Mehrjardi, Amine Trabelsi, Osmar R. Zaiane

    Abstract: Self-attentional models are a new paradigm for sequence modelling tasks which differ from common sequence modelling methods, such as recurrence-based and convolution-based sequence learning, in the way that their architecture is only based on the attention mechanism. Self-attentional models have been used in the creation of the state-of-the-art models in many NLP tasks such as neural machine trans…

    Submitted 10 September, 2019; originally announced September 2019.

    Comments: Appeared in proceedings of Recent Advances in Natural Language Processing (RANLP) Conference, 2019

  31. arXiv:1908.00648  [pdf, ps, other]

    cs.CL cs.AI cs.IR cs.LG cs.SI

    Contrastive Reasons Detection and Clustering from Online Polarized Debate

    Authors: Amine Trabelsi, Osmar R. Zaiane

    Abstract: This work tackles the problem of unsupervised modeling and extraction of the main contrastive sentential reasons conveyed by divergent viewpoints on polarized issues. It proposes a pipeline approach centered around the detection and clustering of phrases, assimilated to argument facets using a novel Phrase Author Interaction Topic-Viewpoint model. The evaluation is based on the informativeness, th…

    Submitted 1 August, 2019; originally announced August 2019.

    Comments: Best paper award in CICLing 2019: International Conference on Computational Linguistics and Intelligent Text Processing

  32. arXiv:1904.03371  [pdf, other]

    cs.CL cs.LG

    Evaluating Coherence in Dialogue Systems using Entailment

    Authors: Nouha Dziri, Ehsan Kamalloo, Kory W. Mathewson, Osmar Zaiane

    Abstract: Evaluating open-domain dialogue systems is difficult due to the diversity of possible correct answers. Automatic metrics such as BLEU correlate weakly with human annotations, resulting in a significant bias across different models and datasets. Some researchers resort to human judgment experimentation for assessing response quality, which is expensive, time consuming, and not scalable. Moreover, j…

    Submitted 31 March, 2020; v1 submitted 6 April, 2019; originally announced April 2019.

    Comments: 5 pages, 2 figures; NAACL-HLT 2019

  33. arXiv:1904.00132  [pdf, other]

    cs.CL cs.IR cs.LG

    ANA at SemEval-2019 Task 3: Contextual Emotion detection in Conversations through hierarchical LSTMs and BERT

    Authors: Chenyang Huang, Amine Trabelsi, Osmar R. Zaïane

    Abstract: This paper describes the system submitted by ANA Team for the SemEval-2019 Task 3: EmoContext. We propose a novel Hierarchical LSTMs for Contextual Emotion Detection (HRLCE) model. It classifies the emotion of an utterance given its conversational context. The results show that, in this task, our HRLCE outperforms the most recent state-of-the-art text classification framework: BERT. We combine the…

    Submitted 31 May, 2019; v1 submitted 29 March, 2019; originally announced April 2019.

    Comments: Accepted at the SemEval-2019 International Workshop on Semantic Evaluation

  34. arXiv:1811.10990  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    Generating Responses Expressing Emotion in an Open-domain Dialogue System

    Authors: Chenyang Huang, Osmar R. Zaïane

    Abstract: Neural network-based Open-ended conversational agents automatically generate responses based on predictive models learned from a large number of pairs of utterances. The generated responses are typically acceptable as a sentence but are often dull, generic, and certainly devoid of any emotion. In this paper, we present neural models that learn to express a given emotion in the generated response.…

    Submitted 15 November, 2018; originally announced November 2018.

  35. arXiv:1811.06596  [pdf, ps, other]

    cs.CL cs.AI

    On Generality and Knowledge Transferability in Cross-Domain Duplicate Question Detection for Heterogeneous Community Question Answering

    Authors: Mohomed Shazan Mohomed Jabbar, Luke Kumar, Hamman Samuel, Mi-Young Kim, Sankalp Prabhakar, Randy Goebel, Osmar Zaïane

    Abstract: Duplicate question detection is an ongoing challenge in community question answering because semantically equivalent questions can have significantly different words and structures. In addition, the identification of duplicate questions can reduce the resources required for retrieval, when the same questions are not repeated. This study compares the performance of deep neural networks and gradient…

    Submitted 15 November, 2018; originally announced November 2018.

  36. arXiv:1811.01063  [pdf, other]

    cs.CL

    Augmenting Neural Response Generation with Context-Aware Topical Attention

    Authors: Nouha Dziri, Ehsan Kamalloo, Kory W. Mathewson, Osmar Zaiane

    Abstract: Sequence-to-Sequence (Seq2Seq) models have witnessed a notable success in generating natural conversational exchanges. Notwithstanding the syntactically well-formed responses generated by these neural network models, they are prone to be acontextual, short and generic. In this work, we introduce a Topical Hierarchical Recurrent Encoder Decoder (THRED), a novel, fully data-driven, multi-turn respon…

    Submitted 4 June, 2019; v1 submitted 2 November, 2018; originally announced November 2018.

    Comments: Accepted at ACL 2019 Workshop on NLP for ConvAI (NLP4ConvAI). 8 pages + 4 appendix pages, 6 figures, 9 tables

  37. arXiv:1801.01229  [pdf, other]

    cs.SI physics.soc-ph

    Modular Networks for Validating Community Detection Algorithms

    Authors: Justin Fagnan, Afra Abnar, Reihaneh Rabbany, Osmar R. Zaiane

    Abstract: How can we accurately compare different community detection algorithms? These algorithms cluster nodes in a given network, and their performance is often validated on benchmark networks with explicit ground-truth communities. Given the lack of cluster labels in real-world networks, a model that generates realistic networks is required for accurate evaluation of these algorithm. In this paper, we p…

    Submitted 3 January, 2018; originally announced January 2018.

  38. arXiv:1712.00006  [pdf, other]

    cs.LG cs.AI

    Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control

    Authors: Shangtong Zhang, Osmar R. Zaiane

    Abstract: Reinforcement Learning and the Evolutionary Strategy are two major approaches in addressing complicated control problems. Both are strong contenders and have their own devotee communities. Both groups have been very active in developing new advances in their own domain and devising, in recent years, leading-edge techniques to address complex continuous control tasks. Here, in the context of Deep R…

    Submitted 7 March, 2018; v1 submitted 29 November, 2017; originally announced December 2017.

    Comments: NIPS 2017 Deep Reinforcement Learning Symposium

  39. Complexity Analysis Approach for Prefabricated Construction Products Using Uncertain Data Clustering

    Authors: Wenying Ji, Simaan M. AbouRizk, Osmar R. Zaiane, Yitong Li

    Abstract: This paper proposes an uncertain data clustering approach to quantitatively analyze the complexity of prefabricated construction components through the integration of quality performance-based measures with associated engineering design information. The proposed model is constructed in three steps, which (1) measure prefabricated construction product complexity (hereafter referred to as product co…

    Submitted 21 December, 2017; v1 submitted 28 October, 2017; originally announced October 2017.

  40. arXiv:1707.00331  [pdf, other]

    cs.IR

    Reciprocal Recommender System for Learners in Massive Open Online Courses (MOOCs)

    Authors: Sankalp Prabhakar, Gerasimos Spanakis, Osmar Zaiane

    Abstract: Massive open online courses (MOOC) describe platforms where users with completely different backgrounds subscribe to various courses on offer. MOOC forums and discussion boards offer learners a medium to communicate with each other and maximize their learning outcomes. However, oftentimes learners are hesitant to approach each other for different reasons (being shy, don't know the right match, etc…

    Submitted 2 July, 2017; originally announced July 2017.

    Comments: 10 pages, accepted as full paper @ ICWL 2017

  41. On Discovering Co-Location Patterns in Datasets: A Case Study of Pollutants and Child Cancers

    Authors: Jundong Li, Aibek Adilmagambetov, Mohomed Shazan Mohomed Jabbar, Osmar R. Zaiane, Alvaro Osornio-Vargas, Osnat Wine

    Abstract: We intend to identify relationships between cancer cases and pollutant emissions and attempt to understand whether cancer in children is typically located together with some specific chemical combinations or is independent. Co-location pattern analysis seems to be the appropriate investigation to perform. Co-location mining is one of the tasks of spatial data mining which focuses on the detection…

    Submitted 1 April, 2016; v1 submitted 23 December, 2014; originally announced December 2014.

    Comments: In GeoInformatica, 2016

    Journal ref: GeoInformatica 2016

  42. arXiv:1412.2601  [pdf, other]

    cs.SI physics.soc-ph

    Generalization of Clustering Agreements and Distances for Overlapping Clusters and Network Communities

    Authors: Reihaneh Rabbany, Osmar R. Zaïane

    Abstract: A measure of distance between two clusterings has important applications, including clustering validation and ensemble clustering. Generally, such distance measure provides navigation through the space of possible clusterings. Mostly used in cluster validation, a normalized clustering distance, a.k.a. agreement measure, compares a given clustering result against the ground-truth clustering. Cluste…

    Submitted 5 March, 2015; v1 submitted 8 December, 2014; originally announced December 2014.

    Journal ref: Data Mining and Knowledge Discovery: Volume 29, Issue 5 (2015)