Showing 1–1 of 1 results for author: Nayak, J S

  1. arXiv:2311.16161 [pdf, other]

    cs.CV cs.AI

    Vision Encoder-Decoder Models for AI Coaching

    Authors: Jyothi S Nayak, Afifah Khan Mohammed Ajmal Khan, Chirag Manjeshwar, Imadh Ajaz Banday

    Abstract: This research paper introduces an innovative AI coaching approach by integrating vision-encoder-decoder models. The feasibility of this method is demonstrated using a Vision Transformer as the encoder and GPT-2 as the decoder, achieving a seamless integration of visual input and textual interaction. Departing from conventional practices of employing distinct models for image recognition and text-b…

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: 6 pages, 2 figures

    ACM Class: I.2.1