Skip to main content

Showing 1–18 of 18 results for author: Bastan, M

.
  1. arXiv:2311.06654  [pdf, other

    cs.CV

    Unsupervised and semi-supervised co-salient object detection via segmentation frequency statistics

    Authors: Souradeep Chakraborty, Shujon Naha, Muhammet Bastan, Amit Kumar K C, Dimitris Samaras

    Abstract: In this paper, we address the detection of co-occurring salient objects (CoSOD) in an image group using frequency statistics in an unsupervised manner, which further enable us to develop a semi-supervised method. While previous works have mostly focused on fully supervised CoSOD, less attention has been allocated to detecting co-salient objects when limited segmentation annotations are available f… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: Accepted at IEEE WACV 2024

  2. arXiv:2311.00668  [pdf, other

    cs.CV

    ProcSim: Proxy-based Confidence for Robust Similarity Learning

    Authors: Oriol Barbany, Xiaofan Lin, Muhammet Bastan, Arnab Dhua

    Abstract: Deep Metric Learning (DML) methods aim at learning an embedding space in which distances are closely related to the inherent semantic similarity of the inputs. Previous studies have shown that popular benchmark datasets often contain numerous wrong labels, and DML methods are susceptible to them. Intending to study the effect of realistic noise, we create an ontology of the classes in a dataset an… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: Accepted to the algorithms track of WACV 2024

  3. arXiv:2210.14814  [pdf, other

    cs.CL cs.IR cs.LG

    BioNLI: Generating a Biomedical NLI Dataset Using Lexico-semantic Constraints for Adversarial Examples

    Authors: Mohaddeseh Bastan, Mihai Surdeanu, Niranjan Balasubramanian

    Abstract: Natural language inference (NLI) is critical for complex decision-making in biomedical domain. One key question, for example, is whether a given biomedical mechanism is supported by experimental evidence. This can be seen as an NLI problem but there are no directly usable datasets to address this. The main challenge is that manually creating informative negative examples for this task is difficult… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: Accepted to Findings of EMNLP 2022, Data and evaluation suite available at https://stonybrooknlp.github.io/BioNLI/

  4. arXiv:2205.04652  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    SuMe: A Dataset Towards Summarizing Biomedical Mechanisms

    Authors: Mohaddeseh Bastan, Nishant Shankar, Mihai Surdeanu, Niranjan Balasubramanian

    Abstract: Can language models read biomedical texts and explain the biomedical mechanisms discussed? In this work we introduce a biomedical mechanism summarization task. Biomedical studies often investigate the mechanisms behind how one entity (e.g., a protein or a chemical) affects another in a biological context. The abstracts of these publications often include a focused set of sentences that present rel… ▽ More

    Submitted 9 May, 2022; originally announced May 2022.

    Comments: Accepter at LREC2022

  5. arXiv:2112.13960  [pdf, ps, other

    cs.CL

    A Preordered RNN Layer Boosts Neural Machine Translation in Low Resource Settings

    Authors: Mohaddeseh Bastan, Shahram Khadivi

    Abstract: Neural Machine Translation (NMT) models are strong enough to convey semantic and syntactic information from the source language to the target language. However, these models are suffering from the need for a large amount of data to learn the parameters. As a result, for languages with scarce data, these models are at risk of underperforming. We propose to augment attention based neural network wit… ▽ More

    Submitted 27 December, 2021; originally announced December 2021.

  6. arXiv:2103.13538  [pdf, other

    cs.CV

    Hierarchical Proxy-based Loss for Deep Metric Learning

    Authors: Zhibo Yang, Muhammet Bastan, Xinliang Zhu, Doug Gray, Dimitris Samaras

    Abstract: Proxy-based metric learning losses are superior to pair-based losses due to their fast convergence and low training complexity. However, existing proxy-based losses focus on learning class-discriminative features while overlooking the commonalities shared across classes which are potentially useful in describing and matching samples. Moreover, they ignore the implicit hierarchy of categories in re… ▽ More

    Submitted 17 October, 2021; v1 submitted 24 March, 2021; originally announced March 2021.

    Comments: Accepted to WACV2022

  7. arXiv:2011.06128  [pdf, other

    cs.CL cs.LG

    Author's Sentiment Prediction

    Authors: Mohaddeseh Bastan, Mahnaz Koupaee, Youngseo Son, Richard Sicoli, Niranjan Balasubramanian

    Abstract: We introduce PerSenT, a dataset of crowd-sourced annotations of the sentiment expressed by the authors towards the main entities in news articles. The dataset also includes paragraph-level sentiment annotations to provide more fine-grained supervision for the task. Our benchmarks of multiple strong baselines show that this is a difficult classification task. The results also suggest that simply fi… ▽ More

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: 12 pages, 5 figures, Accepted in COLING2020

  8. arXiv:2006.05489  [pdf, other

    cs.CL

    Modeling Label Semantics for Predicting Emotional Reactions

    Authors: Radhika Gaonkar, Heeyoung Kwon, Mohaddeseh Bastan, Niranjan Balasubramanian, Nathanael Chambers

    Abstract: Predicting how events induce emotions in the characters of a story is typically seen as a standard multi-label classification task, which usually treats labels as anonymous classes to predict. They ignore information that may be conveyed by the emotion labels themselves. We propose that the semantics of emotion labels can guide a model's attention when representing the input story. Further, we obs… ▽ More

    Submitted 28 June, 2020; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: 6 pages, 2 figures, published in Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics

  9. arXiv:2005.08399  [pdf, other

    cs.CV

    T-VSE: Transformer-Based Visual Semantic Embedding

    Authors: Muhammet Bastan, Arnau Ramisa, Mehmet Tek

    Abstract: Transformer models have recently achieved impressive performance on NLP tasks, owing to new algorithms for self-supervised pre-training on very large text corpora. In contrast, recent literature suggests that simple average word models outperform more complicated language models, e.g., RNNs and Transformers, on cross-modal image/text search tasks on standard benchmarks, like MS COCO. In this paper… ▽ More

    Submitted 17 May, 2020; originally announced May 2020.

    Comments: To appear: CVPR 2020 Workshop on Computer Vision for Fashion, Art and Design (CVFAD 2020)

  10. arXiv:1911.07440  [pdf, other

    cs.CV

    Large Scale Open-Set Deep Logo Detection

    Authors: Muhammet Bastan, Hao-Yu Wu, Tian Cao, Bhargava Kota, Mehmet Tek

    Abstract: We present an open-set logo detection (OSLD) system, which can detect (localize and recognize) any number of unseen logo classes without re-training; it only requires a small set of canonical logo images for each logo class. We achieve this using a two-stage approach: (1) Generic logo detection to detect candidate logo regions in an image. (2) Logo matching for matching the detected logo regions t… ▽ More

    Submitted 12 March, 2022; v1 submitted 18 November, 2019; originally announced November 2019.

    Comments: Open Set Logo Detection (OSLD) dataset available at https://github.com/mubastan/osld

  11. arXiv:1911.06047  [pdf, other

    cs.CV cs.LG

    Semantic Granularity Metric Learning for Visual Search

    Authors: Dipu Manandhar, Muhammet Bastan, Kim-Hui Yap

    Abstract: Deep metric learning applied to various applications has shown promising results in identification, retrieval and recognition. Existing methods often do not consider different granularity in visual similarity. However, in many domain applications, images exhibit similarity at multiple granularities with visual semantic concepts, e.g. fashion demonstrates similarity ranging from clothing of the exa… ▽ More

    Submitted 14 November, 2019; originally announced November 2019.

    Comments: 10 pages, 10 figures

  12. arXiv:1804.10805  [pdf, ps, other

    cs.CV

    Remote Detection of Idling Cars Using Infrared Imaging and Deep Networks

    Authors: Muhammet Bastan, Kim-Hui Yap, Lap-Pui Chau

    Abstract: Idling vehicles waste energy and pollute the environment through exhaust emission. In some countries, idling a vehicle for more than a predefined duration is prohibited and automatic idling vehicle detection is desirable for law enforcement. We propose the first automatic system to detect idling cars, using infrared (IR) imaging and deep networks. We rely on the differences in spatio-temporal he… ▽ More

    Submitted 28 April, 2018; originally announced April 2018.

    Comments: Neural Computing and Applications

  13. arXiv:1701.01854  [pdf

    cs.CL

    Neural Machine Translation on Scarce-Resource Condition: A case-study on Persian-English

    Authors: Mohaddeseh Bastan, Shahram Khadivi, Mohammad Mehdi Homayounpour

    Abstract: Neural Machine Translation (NMT) is a new approach for Machine Translation (MT), and due to its success, it has absorbed the attention of many researchers in the field. In this paper, we study NMT model on Persian-English language pairs, to analyze the model and investigate the appropriateness of the model for scarce-resourced scenarios, the situation that exists for Persian-centered translation s… ▽ More

    Submitted 7 January, 2017; originally announced January 2017.

    Comments: 6 pages, Submitted in ICEE 2017

  14. arXiv:1609.03415  [pdf, ps, other

    cs.CV cs.MM

    Active Canny: Edge Detection and Recovery with Open Active Contour Models

    Authors: Muhammet Bastan, S. Saqib Bukhari, Thomas M. Breuel

    Abstract: We introduce an edge detection and recovery framework based on open active contour models (snakelets). This is motivated by the noisy or broken edges output by standard edge detection algorithms, like Canny. The idea is to utilize the local continuity and smoothness cues provided by strong edges and grow them to recover the missing edges. This way, the strong edges are used to recover weak or miss… ▽ More

    Submitted 12 September, 2016; originally announced September 2016.

  15. arXiv:1608.05054  [pdf, ps, other

    cs.MM

    MT3S: Mobile Turkish Scene Text-to-Speech System for the Visually Impaired

    Authors: Muhammet Bastan, Hilal Kandemir, Busra Canturk

    Abstract: Reading text is one of the essential needs of the visually impaired people. We developed a mobile system that can read Turkish scene and book text, using a fast gradient-based multi-scale text detection algorithm for real-time operation and Tesseract OCR engine for character recognition. We evaluated the OCR accuracy and running time of our system on a new, publicly available mobile Turkish scene… ▽ More

    Submitted 17 August, 2016; originally announced August 2016.

  16. arXiv:1608.03462  [pdf, ps, other

    cs.CV cs.MM

    Multi-View Product Image Search Using Deep ConvNets Representations

    Authors: Muhammet Bastan, Ozgur Yilmaz

    Abstract: Multi-view product image queries can improve retrieval performance over single view queries significantly. In this paper, we investigated the performance of deep convolutional neural networks (ConvNets) on multi-view product image search. First, we trained a VGG-like network to learn deep ConvNets representations of product images. Then, we computed the deep ConvNets representations of database an… ▽ More

    Submitted 1 May, 2017; v1 submitted 11 August, 2016; originally announced August 2016.

    Comments: 13 pages, 16 figures

  17. arXiv:1507.08861  [pdf, ps, other

    cs.MM cs.CV

    Mobile Multi-View Object Image Search

    Authors: Fatih Calisir, Muhammet Bastan, Ozgur Ulusoy, Ugur Gudukbay

    Abstract: High user interaction capability of mobile devices can help improve the accuracy of mobile visual search systems. At query time, it is possible to capture multiple views of an object from different viewing angles and at different scales with the mobile device camera to obtain richer information about the object compared to a single view and hence return more accurate results. Motivated by this, we… ▽ More

    Submitted 30 April, 2018; v1 submitted 31 July, 2015; originally announced July 2015.

    Comments: Multimedia Tools and Applications, 2017

  18. arXiv:1506.00406  [pdf

    cs.CL

    Monolingually Derived Phrase Scores for Phrase Based SMT Using Neural Networks Vector Representations

    Authors: Amir Pouya Aghasadeghi, Mohadeseh Bastan

    Abstract: In this paper, we propose two new features for estimating phrase-based machine translation parameters from mainly monolingual data. Our method is based on two recently introduced neural network vector representation models for words and sentences. It is the first time that these models have been used in an end to end phrase-based machine translation system. Scores obtained from our method can reco… ▽ More

    Submitted 24 May, 2016; v1 submitted 1 June, 2015; originally announced June 2015.