Showing 1–11 of 11 results for author: Litake, O

  1. arXiv:2401.13085  [pdf, other]

    cs.CL cs.AI cs.LG

    IndiText Boost: Text Augmentation for Low Resource India Languages

    Authors: Onkar Litake, Niraj Yagnik, Shreyas Labhsetwar

    Abstract: Text Augmentation is an important task for low-resource languages. It helps deal with the problem of data scarcity. A data augmentation strategy is used to deal with the problem of data scarcity. Through the years, much work has been done on data augmentation for the English language. In contrast, very little work has been done on Indian languages. This is contrary to the fact that data augmentation…

    Submitted 23 January, 2024; originally announced January 2024.

  2. arXiv:2308.09862  [pdf, other]

    cs.CL

    Breaking Language Barriers: A Question Answering Dataset for Hindi and Marathi

    Authors: Maithili Sabane, Onkar Litake, Aman Chadha

    Abstract: The recent advances in deep learning have led to the development of highly sophisticated systems with an unquenchable appetite for data. On the other hand, building good deep-learning models for low-resource languages remains a challenging task. This paper focuses on developing a Question Answering dataset for two such languages: Hindi and Marathi. Despite Hindi being the 3rd most spoken language…

    Submitted 17 February, 2024; v1 submitted 18 August, 2023; originally announced August 2023.

  3. Enhancing Low Resource NER Using Assisting Language And Transfer Learning

    Authors: Maithili Sabane, Aparna Ranade, Onkar Litake, Parth Patil, Raviraj Joshi, Dipali Kadam

    Abstract: Named Entity Recognition (NER) is a fundamental task in NLP that is used to locate the key information in text and is primarily applied in conversational and search systems. In commercial applications, NER or comparable slot-filling methods have been widely deployed for popular languages. NER is used in applications such as human resources, customer service, search engines, content classification,…

    Submitted 10 June, 2023; originally announced June 2023.

    Comments: Accepted at International Conference on Applied Artificial Intelligence and Computing (ICAAIC) 2023

  4. arXiv:2204.12069  [pdf]

    cs.CL

    Suggesting Relevant Questions for a Query Using Statistical Natural Language Processing Technique

    Authors: Shriniwas Nayak, Anuj Kanetkar, Hrushabh Hirudkar, Archana Ghotkar, Sheetal Sonawane, Onkar Litake

    Abstract: Suggesting similar questions for a user query has many applications ranging from reducing search time of users on e-commerce websites, training of employees in companies to holistic learning for students. The use of Natural Language Processing techniques for suggesting similar questions is prevalent over the existing architecture. Mainly two approaches are studied for finding text similarity namel…

    Submitted 26 April, 2022; originally announced April 2022.

  5. arXiv:2204.09675  [pdf]

    cs.CL

    Optimize_Prime@DravidianLangTech-ACL2022: Abusive Comment Detection in Tamil

    Authors: Shantanu Patankar, Omkar Gokhale, Onkar Litake, Aditya Mandke, Dipali Kadam

    Abstract: This paper tries to address the problem of abusive comment detection in low-resource Indic languages. Abusive comments are statements that are offensive to a person or a group of people. These comments are targeted toward individuals belonging to specific ethnicities, genders, caste, race, sexuality, etc. Abusive Comment Detection is a significant problem, especially with the recent rise in social…

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: arXiv admin note: text overlap with arXiv:2204.09087

  6. arXiv:2204.09098  [pdf]

    cs.CL

    PICT@DravidianLangTech-ACL2022: Neural Machine Translation On Dravidian Languages

    Authors: Aditya Vyawahare, Rahul Tangsali, Aditya Mandke, Onkar Litake, Dipali Kadam

    Abstract: This paper presents a summary of the findings that we obtained based on the shared task on machine translation of Dravidian languages. We stood first in three of the five sub-tasks which were assigned to us for the main shared task. We carried out neural machine translation for the following five language pairs: Kannada to Tamil, Kannada to Telugu, Kannada to Malayalam, Kannada to Sanskrit, and Ka…

    Submitted 19 April, 2022; originally announced April 2022.

  7. arXiv:2204.09087  [pdf]

    cs.CL

    Optimize_Prime@DravidianLangTech-ACL2022: Emotion Analysis in Tamil

    Authors: Omkar Gokhale, Shantanu Patankar, Onkar Litake, Aditya Mandke, Dipali Kadam

    Abstract: This paper aims to perform an emotion analysis of social media comments in Tamil. Emotion analysis is the process of identifying the emotional context of the text. In this paper, we present the findings obtained by Team Optimize_Prime in the ACL 2022 shared task "Emotion Analysis in Tamil." The task aimed to classify social media comments into categories of emotion like Joy, Anger, Trust, Disgust,…

    Submitted 19 April, 2022; originally announced April 2022.

  8. arXiv:2204.06029  [pdf, other]

    cs.CL cs.LG

    L3Cube-MahaNER: A Marathi Named Entity Recognition Dataset and BERT models

    Authors: Parth Patil, Aparna Ranade, Maithili Sabane, Onkar Litake, Raviraj Joshi

    Abstract: Named Entity Recognition (NER) is a basic NLP task and finds major applications in conversational and search systems. It helps us identify key entities in a sentence used for the downstream application. NER or similar slot filling systems for popular languages have been heavily used in commercial applications. In this work, we focus on Marathi, an Indian language, spoken prominently by the people…

    Submitted 12 April, 2022; originally announced April 2022.

  9. Mono vs Multilingual BERT: A Case Study in Hindi and Marathi Named Entity Recognition

    Authors: Onkar Litake, Maithili Sabane, Parth Patil, Aparna Ranade, Raviraj Joshi

    Abstract: Named entity recognition (NER) is the process of recognising and classifying important information (entities) in text. Proper nouns, such as a person's name, an organization's name, or a location's name, are examples of entities. NER is one of the important modules in applications like human resources, customer support, search engines, content classification, and academia. In this work, we con…

    Submitted 24 March, 2022; originally announced March 2022.

    Comments: Accepted at ICMISC 2022

  10. Analyzing Architectures for Neural Machine Translation Using Low Computational Resources

    Authors: Aditya Mandke, Onkar Litake, Dipali Kadam

    Abstract: With the recent developments in the field of Natural Language Processing, there has been a rise in the use of different architectures for Neural Machine Translation. Transformer architectures are used to achieve state-of-the-art accuracy, but they are very computationally expensive to train. Not everyone has access to such setups consisting of high-end GPUs and other resources. We train our models on lo…

    Submitted 6 November, 2021; originally announced November 2021.

  11. arXiv:2110.05270  [pdf]

    cs.CV cs.AI

    Investigating Transfer Learning Capabilities of Vision Transformers and CNNs by Fine-Tuning a Single Trainable Block

    Authors: Durvesh Malpure, Onkar Litake, Rajesh Ingle

    Abstract: In recent developments in the field of Computer Vision, a rise is seen in the use of transformer-based architectures. They are surpassing the state-of-the-art set by CNN architectures in accuracy but on the other hand, they are computationally very expensive to train from scratch. As these models are quite recent in the Computer Vision field, there is a need to study their transfer learning capabil…

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: 8 pages, 4 figures