Skip to main content

Showing 1–3 of 3 results for author: Olimov, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2206.02203  [pdf

    cs.CV

    3D Convolutional with Attention for Action Recognition

    Authors: Labina Shrestha, Shikha Dubey, Farrukh Olimov, Muhammad Aasim Rafique, Moongu Jeon

    Abstract: Human action recognition is one of the challenging tasks in computer vision. The current action recognition methods use computationally expensive models for learning spatio-temporal dependencies of the action. Models utilizing RGB channels and optical flow separately, models using a two-stream fusion technique, and models consisting of both convolutional neural network (CNN) and long-short term me… ▽ More

    Submitted 5 June, 2022; originally announced June 2022.

  2. arXiv:2109.07799  [pdf, other

    cs.CV cs.AI

    Label-Attention Transformer with Geometrically Coherent Objects for Image Captioning

    Authors: Shikha Dubey, Farrukh Olimov, Muhammad Aasim Rafique, Joonmo Kim, Moongu Jeon

    Abstract: Automatic transcription of scene understanding in images and videos is a step towards artificial general intelligence. Image captioning is a nomenclature for describing meaningful information in an image using computer vision techniques. Automated image captioning techniques utilize encoder and decoder architecture, where the encoder extracts features from an image and the decoder generates a tran… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

  3. arXiv:2103.05103  [pdf

    cs.CV cs.AI cs.CL

    Image Captioning using Multiple Transformers for Self-Attention Mechanism

    Authors: Farrukh Olimov, Shikha Dubey, Labina Shrestha, Tran Trung Tin, Moongu Jeon

    Abstract: Real-time image captioning, along with adequate precision, is the main challenge of this research field. The present work, Multiple Transformers for Self-Attention Mechanism (MTSM), utilizes multiple transformers to address these problems. The proposed algorithm, MTSM, acquires region proposals using a transformer detector (DETR). Consequently, MTSM achieves the self-attention mechanism by transfe… ▽ More

    Submitted 14 February, 2021; originally announced March 2021.