Skip to main content

Showing 1–8 of 8 results for author: Chasmai, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2302.14163  [pdf, other

    cs.CV

    A Language-Guided Benchmark for Weakly Supervised Open Vocabulary Semantic Segmentation

    Authors: Prashant Pandey, Mustafa Chasmai, Monish Natarajan, Brejesh Lall

    Abstract: Increasing attention is being diverted to data-efficient problem settings like Open Vocabulary Semantic Segmentation (OVSS) which deals with segmenting an arbitrary object that may or may not be seen during training. The closest standard problems related to OVSS are Zero-Shot and Few-Shot Segmentation (ZSS, FSS) and their Cross-dataset variants where zero to few annotations are needed to segment n… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

  2. arXiv:2211.16200  [pdf, other

    cs.CV cs.AI

    From Forks to Forceps: A New Framework for Instance Segmentation of Surgical Instruments

    Authors: Britty Baby, Daksh Thapar, Mustafa Chasmai, Tamajit Banerjee, Kunal Dargan, Ashish Suri, Subhashis Banerjee, Chetan Arora

    Abstract: Minimally invasive surgeries and related applications demand surgical tool classification and segmentation at the instance level. Surgical tools are similar in appearance and are long, thin, and handled at an angle. The fine-tuning of state-of-the-art (SOTA) instance segmentation models trained on natural images for instrument segmentation has difficulty discriminating instrument classes. Our rese… ▽ More

    Submitted 11 March, 2023; v1 submitted 26 November, 2022; originally announced November 2022.

    Comments: WACV 2023

  3. arXiv:2210.03429  [pdf, other

    cs.CV

    Adversarially Robust Prototypical Few-shot Segmentation with Neural-ODEs

    Authors: Prashant Pandey, Aleti Vardhan, Mustafa Chasmai, Tanuj Sur, Brejesh Lall

    Abstract: Few-shot Learning (FSL) methods are being adopted in settings where data is not abundantly available. This is especially seen in medical domains where the annotations are expensive to obtain. Deep Neural Networks have been shown to be vulnerable to adversarial attacks. This is even more severe in the case of FSL due to the lack of a large number of training examples. In this paper, we provide a fr… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: MICCAI 2022. arXiv admin note: substantial text overlap with arXiv:2208.12428

  4. arXiv:2208.12428  [pdf, other

    cs.CV cs.AI

    Robust Prototypical Few-Shot Organ Segmentation with Regularized Neural-ODEs

    Authors: Prashant Pandey, Mustafa Chasmai, Tanuj Sur, Brejesh Lall

    Abstract: Despite the tremendous progress made by deep learning models in image semantic segmentation, they typically require large annotated examples, and increasing attention is being diverted to problem settings like Few-Shot Learning (FSL) where only a small amount of annotation is needed for generalisation to novel classes. This is especially seen in medical domains where dense pixel-level annotations… ▽ More

    Submitted 1 March, 2023; v1 submitted 25 August, 2022; originally announced August 2022.

  5. arXiv:2206.13577  [pdf, other

    cs.CV cs.AI cs.LG

    A View Independent Classification Framework for Yoga Postures

    Authors: Mustafa Chasmai, Nirjhar Das, Aman Bhardwaj, Rahul Garg

    Abstract: Yoga is a globally acclaimed and widely recommended practice for a healthy living. Maintaining correct posture while performing a Yogasana is of utmost importance. In this work, we employ transfer learning from Human Pose Estimation models for extracting 136 key-points spread all over the body to train a Random Forest classifier which is used for estimation of the Yogasanas. The results are evalua… ▽ More

    Submitted 14 August, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

  6. arXiv:2204.13158  [pdf, other

    cs.CV

    Person Re-Identification

    Authors: Mustafa Ebrahim Chasmai, Tamajit Banerjee

    Abstract: Person Re-Identification (Re-ID) is an important problem in computer vision-based surveillance applications, in which one aims to identify a person across different surveillance photographs taken from different cameras having varying orientations and field of views. Due to the increasing demand for intelligent video surveillance, Re-ID has gained significant interest in the computer vision communi… ▽ More

    Submitted 27 April, 2022; originally announced April 2022.

  7. arXiv:2111.06036   

    cs.LG cs.AI

    CubeTR: Learning to Solve The Rubiks Cube Using Transformers

    Authors: Mustafa Ebrahim Chasmai

    Abstract: Since its first appearance, transformers have been successfully used in wide ranging domains from computer vision to natural language processing. Application of transformers in Reinforcement Learning by reformulating it as a sequence modelling problem was proposed only recently. Compared to other commonly explored reinforcement learning problems, the Rubiks cube poses a unique set of challenges. T… ▽ More

    Submitted 29 October, 2023; v1 submitted 10 November, 2021; originally announced November 2021.

    Comments: It has untested ideas without supporting experimentation. Discontinued work in this direction

  8. arXiv:2106.09756  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    PyKale: Knowledge-Aware Machine Learning from Multiple Sources in Python

    Authors: Hai** Lu, Xianyuan Liu, Robert Turner, Peizhen Bai, Raivo E Koot, Shuo Zhou, Mustafa Chasmai, Lawrence Schobs

    Abstract: Machine learning is a general-purpose technology holding promises for many interdisciplinary research problems. However, significant barriers exist in crossing disciplinary boundaries when most machine learning tools are developed in different areas separately. We present Pykale - a Python library for knowledge-aware machine learning on graphs, images, texts, and videos to enable and accelerate in… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: This library is available at https://github.com/pykale/pykale