Skip to main content

Showing 1–38 of 38 results for author: Baktashmotlagh, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14924  [pdf, other

    cs.CV

    DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection

    Authors: Jia Syuen Lim, Zhuoxiao Chen, Mahsa Baktashmotlagh, Zhi Chen, Xin Yu, Zi Huang, Yadan Luo

    Abstract: Class-agnostic object detection (OD) can be a cornerstone or a bottleneck for many downstream vision tasks. Despite considerable advancements in bottom-up and multi-object discovery methods that leverage basic visual cues to identify salient objects, consistently achieving a high recall rate remains difficult due to the diversity of object types and their contextual complexity. In this work, we in… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 19 pages

  2. arXiv:2406.14878  [pdf, other

    cs.CV cs.LG eess.IV

    MOS: Model Synergy for Test-Time Adaptation on LiDAR-Based 3D Object Detection

    Authors: Zhuoxiao Chen, Junjie Meng, Mahsa Baktashmotlagh, Zi Huang, Yadan Luo

    Abstract: LiDAR-based 3D object detection is pivotal across many applications, yet the performance of such detection systems often degrades after deployment, especially when faced with unseen test point clouds originating from diverse locations or subjected to corruption. In this work, we introduce a new online adaptation framework for detectors named Model Synergy (MOS). Specifically, MOS dynamically assem… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  3. Leveraging LLMs for Unsupervised Dense Retriever Ranking

    Authors: Ekaterina Khramtsova, Shengyao Zhuang, Mahsa Baktashmotlagh, Guido Zuccon

    Abstract: In this paper we present Large Language Model Assisted Retrieval Model Ranking (LARMOR), an effective unsupervised approach that leverages LLMs for selecting which dense retriever to use on a test corpus (target). Dense retriever selection is crucial for many IR applications that rely on using dense retrievers trained on public corpora to encode or search a new, private target corpus. This is beca… ▽ More

    Submitted 23 May, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: SIGIR2024 full paper

  4. arXiv:2310.07361  [pdf, other

    cs.CV

    Domain Generalization Guided by Gradient Signal to Noise Ratio of Parameters

    Authors: Mateusz Michalkiewicz, Masoud Faraki, Xiang Yu, Manmohan Chandraker, Mahsa Baktashmotlagh

    Abstract: Overfitting to the source domain is a common issue in gradient-based training of deep neural networks. To compensate for the over-parameterized models, numerous regularization techniques have been introduced such as those based on dropout. While these methods achieve significant improvements on classical benchmarks such as ImageNet, their performance diminishes with the introduction of domain shif… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: Paper was accepted to ICCV 2023

  5. arXiv:2309.09403  [pdf, ps, other

    cs.IR cs.AI

    Selecting which Dense Retriever to use for Zero-Shot Search

    Authors: Ekaterina Khramtsova, Shengyao Zhuang, Mahsa Baktashmotlagh, Xi Wang, Guido Zuccon

    Abstract: We propose the new problem of choosing which dense retrieval model to use when searching on a new collection for which no labels are available, i.e. in a zero-shot setting. Many dense retrieval models are readily available. Each model however is characterized by very differing search effectiveness -- not just on the test portion of the datasets in which the dense representations have been learned… ▽ More

    Submitted 17 September, 2023; originally announced September 2023.

  6. arXiv:2307.07944  [pdf, other

    cs.CV cs.AI cs.LG

    Revisiting Domain-Adaptive 3D Object Detection by Reliable, Diverse and Class-balanced Pseudo-Labeling

    Authors: Zhuoxiao Chen, Yadan Luo, Zheng Wang, Mahsa Baktashmotlagh, Zi Huang

    Abstract: Unsupervised domain adaptation (DA) with the aid of pseudo labeling techniques has emerged as a crucial approach for domain-adaptive 3D object detection. While effective, existing DA methods suffer from a substantial drop in performance when applied to a multi-class training setting, due to the co-existence of low-quality pseudo labels and class imbalance issues. In this paper, we address this cha… ▽ More

    Submitted 16 August, 2023; v1 submitted 16 July, 2023; originally announced July 2023.

    Comments: Accepted by ICCV 2023, camera-ready

  7. arXiv:2307.07942  [pdf, other

    cs.CV cs.AI cs.LG

    KECOR: Kernel Coding Rate Maximization for Active 3D Object Detection

    Authors: Yadan Luo, Zhuoxiao Chen, Zhen Fang, Zheng Zhang, Zi Huang, Mahsa Baktashmotlagh

    Abstract: Achieving a reliable LiDAR-based object detector in autonomous driving is paramount, but its success hinges on obtaining large amounts of precise 3D annotations. Active learning (AL) seeks to mitigate the annotation burden through algorithms that use fewer labels and can attain performance comparable to fully supervised learning. Although AL has shown promise, current approaches prioritize the sel… ▽ More

    Submitted 16 July, 2023; originally announced July 2023.

    Comments: To appear in ICCV 2023

  8. arXiv:2301.09249  [pdf, other

    cs.CV cs.AI

    Exploring Active 3D Object Detection from a Generalization Perspective

    Authors: Yadan Luo, Zhuoxiao Chen, Zijian Wang, Xin Yu, Zi Huang, Mahsa Baktashmotlagh

    Abstract: To alleviate the high annotation cost in LiDAR-based 3D object detection, active learning is a promising solution that learns to select only a small portion of unlabeled data to annotate, without compromising model performance. Our empirical study, however, suggests that mainstream uncertainty-based and diversity-based active learning policies are not effective when applied in the 3D detection tas… ▽ More

    Submitted 8 February, 2023; v1 submitted 22 January, 2023; originally announced January 2023.

    Comments: To appear in ICLR 2023

  9. DI-NIDS: Domain Invariant Network Intrusion Detection System

    Authors: Siamak Layeghy, Mahsa Baktashmotlagh, Marius Portmann

    Abstract: The performance of machine learning based network intrusion detection systems (NIDSs) severely degrades when deployed on a network with significantly different feature distributions from the ones of the training dataset. In various applications, such as computer vision, domain adaptation techniques have been successful in mitigating the gap between the distributions of the training and test data.… ▽ More

    Submitted 15 October, 2022; originally announced October 2022.

  10. arXiv:2207.04220  [pdf, other

    cs.CV

    Rethinking Persistent Homology for Visual Recognition

    Authors: Ekaterina Khramtsova, Guido Zuccon, Xi Wang, Mahsa Baktashmotlagh

    Abstract: Persistent topological properties of an image serve as an additional descriptor providing an insight that might not be discovered by traditional neural networks. The existing research in this area focuses primarily on efficiently integrating topological properties of the data in the learning process in order to enhance the performance. However, there is no existing study to demonstrate all possibl… ▽ More

    Submitted 5 March, 2023; v1 submitted 9 July, 2022; originally announced July 2022.

    Comments: ICML 2022 Workshop on Topology, Algebra, and Geometry in Machine Learning

  11. arXiv:2202.06174  [pdf, other

    cs.CV

    Source-Free Progressive Graph Learning for Open-Set Domain Adaptation

    Authors: Yadan Luo, Zijian Wang, Zhuoxiao Chen, Zi Huang, Mahsa Baktashmotlagh

    Abstract: Open-set domain adaptation (OSDA) has gained considerable attention in many visual recognition tasks. However, most existing OSDA approaches are limited due to three main reasons, including: (1) the lack of essential theoretical analysis of generalization bound, (2) the reliance on the coexistence of source and target data during adaptation, and (3) failing to accurately estimate the uncertainty o… ▽ More

    Submitted 22 January, 2023; v1 submitted 12 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2006.12087

  12. arXiv:2109.00522  [pdf, other

    cs.CV cs.LG cs.MM

    Conditional Extreme Value Theory for Open Set Video Domain Adaptation

    Authors: Zhuoxiao Chen, Yadan Luo, Mahsa Baktashmotlagh

    Abstract: With the advent of media streaming, video action recognition has become progressively important for various applications, yet at the high expense of requiring large-scale data labelling. To overcome the problem of expensive data labelling, domain adaptation techniques have been proposed that transfers knowledge from fully labelled data (i.e., source domain) to unlabelled data (i.e., target domain)… ▽ More

    Submitted 15 October, 2021; v1 submitted 1 September, 2021; originally announced September 2021.

    Comments: Camera-ready. Accepted by ACM International Conference on Multimedia in Asia 2021 (MMAsia 2021)

  13. arXiv:2108.11726  [pdf, other

    cs.CV

    Learning to Diversify for Single Domain Generalization

    Authors: Zijian Wang, Yadan Luo, Ruihong Qiu, Zi Huang, Mahsa Baktashmotlagh

    Abstract: Domain generalization (DG) aims to generalize a model trained on multiple source (i.e., training) domains to a distributionally different target (i.e., test) domain. In contrast to the conventional DG that strictly requires the availability of multiple source domains, this paper considers a more realistic yet challenging scenario, namely Single Domain Generalization (Single-DG), where only one sou… ▽ More

    Submitted 22 March, 2023; v1 submitted 26 August, 2021; originally announced August 2021.

    Comments: ICCV 2021

  14. arXiv:2107.11566  [pdf, other

    cs.CV

    Going Deeper into Semi-supervised Person Re-identification

    Authors: Olga Moskvyak, Frederic Maire, Feras Dayoub, Mahsa Baktashmotlagh

    Abstract: Person re-identification is the challenging task of identifying a person across different camera views. Training a convolutional neural network (CNN) for this task requires annotating a large dataset, and hence, it involves the time-consuming manual matching of people across cameras. To reduce the need for labeled data, we focus on a semi-supervised approach that requires only a subset of the trai… ▽ More

    Submitted 24 July, 2021; originally announced July 2021.

  15. arXiv:2106.06440  [pdf, other

    cs.CV cs.LG

    Learning Compositional Shape Priors for Few-Shot 3D Reconstruction

    Authors: Mateusz Michalkiewicz, Stavros Tsogkas, Sarah Parisot, Mahsa Baktashmotlagh, Anders Eriksson, Eugene Belilovsky

    Abstract: The impressive performance of deep convolutional neural networks in single-view 3D reconstruction suggests that these models perform non-trivial reasoning about the 3D structure of the output space. Recent work has challenged this belief, showing that, on standard benchmarks, complex encoder-decoder architectures perform similarly to nearest-neighbor baselines or simple linear decoder models that… ▽ More

    Submitted 16 June, 2021; v1 submitted 11 June, 2021; originally announced June 2021.

    Comments: 13 pages, 12 figures. arXiv admin note: substantial text overlap with arXiv:2004.06302

  16. arXiv:2105.06717  [pdf, other

    cs.AI cs.CL

    Neural-Symbolic Commonsense Reasoner with Relation Predictors

    Authors: Farhad Moghimifar, Lizhen Qu, Yue Zhuo, Gholamreza Haffari, Mahsa Baktashmotlagh

    Abstract: Commonsense reasoning aims to incorporate sets of commonsense facts, retrieved from Commonsense Knowledge Graphs (CKG), to draw conclusion about ordinary situations. The dynamic nature of commonsense knowledge postulates models capable of performing multi-hop reasoning over new situations. This feature also results in having large-scale sparse Knowledge Graphs, where such reasoning process is need… ▽ More

    Submitted 14 May, 2021; originally announced May 2021.

    Comments: ACL2021

  17. arXiv:2101.07988  [pdf, other

    cs.CV

    Semi-supervised Keypoint Localization

    Authors: Olga Moskvyak, Frederic Maire, Feras Dayoub, Mahsa Baktashmotlagh

    Abstract: Knowledge about the locations of keypoints of an object in an image can assist in fine-grained classification and identification tasks, particularly for the case of objects that exhibit large variations in poses that greatly influence their visual appearance, such as wild animals. However, supervised training of a keypoint detection network requires annotating a large image dataset for each animal… ▽ More

    Submitted 20 January, 2021; originally announced January 2021.

    Comments: accepted to ICLR 2021

  18. arXiv:2011.13549  [pdf, other

    cs.CL

    Domain Adaptative Causality Encoder

    Authors: Farhad Moghimifar, Gholamreza Haffari, Mahsa Baktashmotlagh

    Abstract: Current approaches which are mainly based on the extraction of low-level relations among individual events are limited by the shortage of publicly available labelled data. Therefore, the resulting models perform poorly when applied to a distributionally different domain for which labelled data did not exist at the time of training. To overcome this limitation, in this paper, we leverage the charac… ▽ More

    Submitted 26 November, 2020; originally announced November 2020.

    Comments: ALTA2020

  19. arXiv:2011.13115  [pdf, ps, other

    cs.CL

    Learning Causal Bayesian Networks from Text

    Authors: Farhad Moghimifar, Afshin Rahimi, Mahsa Baktashmotlagh, Xue Li

    Abstract: Causal relationships form the basis for reasoning and decision-making in Artificial Intelligence systems. To exploit the large volume of textual data available today, the automatic discovery of causal relationships from text has emerged as a significant challenge in recent years. Existing approaches in this realm are limited to the extraction of low-level relations among individual events. To over… ▽ More

    Submitted 25 November, 2020; originally announced November 2020.

    Comments: ALTA2020

  20. arXiv:2011.12517  [pdf, other

    cs.SI cs.AI

    Interpretable Signed Link Prediction with Signed Infomax Hyperbolic Graph

    Authors: Yadan Luo, Zi Huang, Hongxu Chen, Yang Yang, Mahsa Baktashmotlagh

    Abstract: Signed link prediction in social networks aims to reveal the underlying relationships (i.e. links) among users (i.e. nodes) given their existing positive and negative interactions observed. Most of the prior efforts are devoted to learning node embeddings with graph neural networks (GNNs), which preserve the signed network topology by message-passing along edges to facilitate the downstream link p… ▽ More

    Submitted 22 June, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

  21. arXiv:2011.00777  [pdf, other

    cs.CL

    COSMO: Conditional SEQ2SEQ-based Mixture Model for Zero-Shot Commonsense Question Answering

    Authors: Farhad Moghimifar, Lizhen Qu, Yue Zhuo, Mahsa Baktashmotlagh, Gholamreza Haffari

    Abstract: Commonsense reasoning refers to the ability of evaluating a social situation and acting accordingly. Identification of the implicit causes and effects of a social context is the driving capability which can enable machines to perform commonsense reasoning. The dynamic world of social interactions requires context-dependent on-demand systems to infer such underlying information. However, current ap… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: COLING2020

  22. arXiv:2008.11368  [pdf, other

    cs.CV

    Keypoint-Aligned Embeddings for Image Retrieval and Re-identification

    Authors: Olga Moskvyak, Frederic Maire, Feras Dayoub, Mahsa Baktashmotlagh

    Abstract: Learning embeddings that are invariant to the pose of the object is crucial in visual image retrieval and re-identification. The existing approaches for person, vehicle, or animal re-identification tasks suffer from high intra-class variance due to deformable shapes and different camera viewpoints. To overcome this limitation, we propose to align the image embedding with a predefined order of the… ▽ More

    Submitted 25 August, 2020; originally announced August 2020.

    Comments: 8 pages, 7 figures, accepted to WACV 2021

  23. Adversarial Bipartite Graph Learning for Video Domain Adaptation

    Authors: Yadan Luo, Zi Huang, Zijian Wang, Zheng Zhang, Mahsa Baktashmotlagh

    Abstract: Domain adaptation techniques, which focus on adapting models between distributionally different domains, are rarely explored in the video recognition area due to the significant spatial and temporal shifts across the source (i.e. training) and target (i.e. test) domains. As such, recent works on visual domain adaptation which leverage adversarial learning to unify the source and target video repre… ▽ More

    Submitted 30 July, 2020; originally announced July 2020.

    Comments: Proceedings of the 28th ACM International Conference on Multimedia (MM '20)

  24. arXiv:2006.12087  [pdf, other

    cs.CV cs.LG stat.ML

    Progressive Graph Learning for Open-Set Domain Adaptation

    Authors: Yadan Luo, Zijian Wang, Zi Huang, Mahsa Baktashmotlagh

    Abstract: Domain shift is a fundamental problem in visual recognition which typically arises when the source and target data follow different distributions. The existing domain adaptation approaches which tackle this problem work in the closed-set setting with the assumption that the source and the target data share exactly the same classes of objects. In this paper, we tackle a more realistic problem of op… ▽ More

    Submitted 29 June, 2020; v1 submitted 22 June, 2020; originally announced June 2020.

    Journal ref: International Conference on Machine Learning (ICML 2020)

  25. arXiv:2005.04623  [pdf, other

    cs.CV

    A Simple and Scalable Shape Representation for 3D Reconstruction

    Authors: Mateusz Michalkiewicz, Eugene Belilovsky, Mahsa Baktashmotlagh, Anders Eriksson

    Abstract: Deep learning applied to the reconstruction of 3D shapes has seen growing interest. A popular approach to 3D reconstruction and generation in recent years has been the CNN encoder-decoder model usually applied in voxel space. However, this often scales very poorly with the resolution limiting the effectiveness of these models. Several sophisticated alternatives for decoding to 3D shapes have been… ▽ More

    Submitted 10 May, 2020; originally announced May 2020.

    Comments: 9 pages plus 3 pages of references. 4 figures

    MSC Class: 65D19

  26. arXiv:2004.06302  [pdf, other

    cs.CV cs.LG

    Few-Shot Single-View 3-D Object Reconstruction with Compositional Priors

    Authors: Mateusz Michalkiewicz, Sarah Parisot, Stavros Tsogkas, Mahsa Baktashmotlagh, Anders Eriksson, Eugene Belilovsky

    Abstract: The impressive performance of deep convolutional neural networks in single-view 3D reconstruction suggests that these models perform non-trivial reasoning about the 3D structure of the output space. However, recent work has challenged this belief, showing that complex encoder-decoder architectures perform similarly to nearest-neighbor baselines or simple linear decoder models that exploit large am… ▽ More

    Submitted 2 May, 2020; v1 submitted 14 April, 2020; originally announced April 2020.

  27. arXiv:2003.01822  [pdf, other

    cs.CV

    Implicitly Defined Layers in Neural Networks

    Authors: Qianggong Zhang, Yanyang Gu, Michalkiewicz Mateusz, Mahsa Baktashmotlagh, Anders Eriksson

    Abstract: In conventional formulations of multilayer feedforward neural networks, the individual layers are customarily defined by explicit functions. In this paper we demonstrate that defining individual layers in a neural network \emph{implicitly} provide much richer representations over the standard explicit one, consequently enabling a vastly broader class of end-to-end trainable architectures. We prese… ▽ More

    Submitted 2 June, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

  28. arXiv:2001.02801  [pdf, other

    cs.CV

    Learning landmark guided embeddings for animal re-identification

    Authors: Olga Moskvyak, Frederic Maire, Feras Dayoub, Mahsa Baktashmotlagh

    Abstract: Re-identification of individual animals in images can be ambiguous due to subtle variations in body markings between different individuals and no constraints on the poses of animals in the wild. Person re-identification is a similar task and it has been approached with a deep convolutional neural network (CNN) that learns discriminative embeddings for images of people. However, learning discrimina… ▽ More

    Submitted 8 January, 2020; originally announced January 2020.

    Comments: 7 pages, 7 figures

  29. arXiv:1911.12983  [pdf, other

    cs.CV cs.LG

    Correlation-aware Adversarial Domain Adaptation and Generalization

    Authors: Mohammad Mahfujur Rahman, Clinton Fookes, Mahsa Baktashmotlagh, Sridha Sridharan

    Abstract: Domain adaptation (DA) and domain generalization (DG) have emerged as a solution to the domain shift problem where the distribution of the source and target data is different. The task of DG is more challenging than DA as the target data is totally unseen during the training phase in DG scenarios. The current state-of-the-art employs adversarial techniques, however, these are rarely considered for… ▽ More

    Submitted 29 November, 2019; originally announced November 2019.

    Comments: Preprint submitted to Pattern Recognition, Accepted in Pattern Recognition

  30. arXiv:1911.04695  [pdf, other

    cs.LG stat.ML

    Learning from the Past: Continual Meta-Learning via Bayesian Graph Modeling

    Authors: Yadan Luo, Zi Huang, Zheng Zhang, Ziwei Wang, Mahsa Baktashmotlagh, Yang Yang

    Abstract: Meta-learning for few-shot learning allows a machine to leverage previously acquired knowledge as a prior, thus improving the performance on novel tasks with only small amounts of data. However, most mainstream models suffer from catastrophic forgetting and insufficient robustness issues, thereby failing to fully retain or exploit long-term knowledge while being prone to cause severe error accumul… ▽ More

    Submitted 12 November, 2019; originally announced November 2019.

  31. arXiv:1902.10847  [pdf, other

    cs.CV

    Robust Re-identification of Manta Rays from Natural Markings by Learning Pose Invariant Embeddings

    Authors: Olga Moskvyak, Frederic Maire, Asia O. Armstrong, Feras Dayoub, Mahsa Baktashmotlagh

    Abstract: Visual identification of individual animals that bear unique natural body markings is an important task in wildlife conservation. The photo databases of animal markings grow larger and each new observation has to be matched against thousands of images. Existing photo-identification solutions have constraints on image quality and appearance of the pattern of interest in the image. These constraints… ▽ More

    Submitted 27 February, 2019; originally announced February 2019.

    Comments: 12 pages, 15 figures

  32. arXiv:1901.06802  [pdf, other

    cs.CV

    Deep Level Sets: Implicit Surface Representations for 3D Shape Inference

    Authors: Mateusz Michalkiewicz, Jhony K. Pontes, Dominic Jack, Mahsa Baktashmotlagh, Anders Eriksson

    Abstract: Existing 3D surface representation approaches are unable to accurately classify pixels and their orientation lying on the boundary of an object. Thus resulting in coarse representations which usually require post-processing steps to extract 3D surface meshes. To overcome this limitation, we propose an end-to-end trainable model that directly predicts implicit surface representations of arbitrary t… ▽ More

    Submitted 21 January, 2019; originally announced January 2019.

  33. arXiv:1901.00282  [pdf, other

    cs.CV

    On Minimum Discrepancy Estimation for Deep Domain Adaptation

    Authors: Mohammad Mahfujur Rahman, Clinton Fookes, Mahsa Baktashmotlagh, Sridha Sridharan

    Abstract: In the presence of large sets of labeled data, Deep Learning (DL) has accomplished extraordinary triumphs in the avenue of computer vision, particularly in object classification and recognition tasks. However, DL cannot always perform well when the training and testing images come from different distributions or in the presence of domain shift between training and testing images. They also suffer… ▽ More

    Submitted 2 January, 2019; originally announced January 2019.

    Comments: Accepted in Joint IJCAI/ECAI/AAMAS/ICML 2018 Workshop

  34. arXiv:1812.08974  [pdf, other

    cs.CV cs.AI

    Multi-component Image Translation for Deep Domain Generalization

    Authors: Mohammad Mahfujur Rahman, Clinton Fookes, Mahsa Baktashmotlagh, Sridha Sridharan

    Abstract: Domain adaption (DA) and domain generalization (DG) are two closely related methods which are both concerned with the task of assigning labels to an unlabeled data set. The only dissimilarity between these approaches is that DA can access the target data during the training phase, while the target data is totally unseen during the training phase in DG. The task of DG is challenging as we have no e… ▽ More

    Submitted 21 December, 2018; originally announced December 2018.

    Comments: Accepted in WACV 2019

  35. arXiv:1805.12277  [pdf, other

    cs.CV

    Learning Factorized Representations for Open-set Domain Adaptation

    Authors: Mahsa Baktashmotlagh, Masoud Faraki, Tom Drummond, Mathieu Salzmann

    Abstract: Domain adaptation for visual recognition has undergone great progress in the past few years. Nevertheless, most existing methods work in the so-called closed-set scenario, assuming that the classes depicted by the target images are exactly the same as those of the source domain. In this paper, we tackle the more challenging, yet more realistic case of open-set domain adaptation, where new, unknown… ▽ More

    Submitted 30 May, 2018; originally announced May 2018.

  36. arXiv:1709.07894  [pdf

    cs.CV

    On Encoding Temporal Evolution for Real-time Action Prediction

    Authors: Fahimeh Rezazadegan, Sareh Shirazi, Mahsa Baktashmotlagh, Larry S. Davis

    Abstract: Anticipating future actions is a key component of intelligence, specifically when it applies to real-time systems, such as robots or autonomous cars. While recent works have addressed prediction of raw RGB pixel values, we focus on anticipating the motion evolution in future video frames. To this end, we construct dynamic images (DIs) by summarising moving pixels through a sequence of future frame… ▽ More

    Submitted 7 February, 2018; v1 submitted 22 September, 2017; originally announced September 2017.

    Comments: Submitted Version

  37. arXiv:1709.00813  [pdf, other

    cs.CL

    From Review to Rating: Exploring Dependency Measures for Text Classification

    Authors: Samuel Cunningham-Nelson, Mahsa Baktashmotlagh, Wageeh Boles

    Abstract: Various text analysis techniques exist, which attempt to uncover unstructured information from text. In this work, we explore using statistical dependence measures for textual classification, representing text as word vectors. Student satisfaction scores on a 3-point scale and their free text comments written about university subjects are used as the dataset. We have compared two textual represent… ▽ More

    Submitted 4 September, 2017; originally announced September 2017.

    Comments: 8 pages

    Journal ref: Under Consideration by Pattern Recognition Letters (PRL) 2018

  38. arXiv:1507.08711  [pdf, other

    cs.CV

    Beyond Gauss: Image-Set Matching on the Riemannian Manifold of PDFs

    Authors: Mehrtash Harandi, Mathieu Salzmann, Mahsa Baktashmotlagh

    Abstract: State-of-the-art image-set matching techniques typically implicitly model each image-set with a Gaussian distribution. Here, we propose to go beyond these representations and model image-sets as probability distribution functions (PDFs) using kernel density estimators. To compare and match image-sets, we exploit Csiszar f-divergences, which bear strong connections to the geodesic distance defined… ▽ More

    Submitted 30 July, 2015; originally announced July 2015.