Skip to main content

Showing 1–30 of 30 results for author: Chakravarthi, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.10540  [pdf, other

    cs.CV cs.LG

    SEVD: Synthetic Event-based Vision Dataset for Ego and Fixed Traffic Perception

    Authors: Manideep Reddy Aliminati, Bharatesh Chakravarthi, Aayush Atul Verma, Arpitsinh Vaghela, Hua Wei, Xuesong Zhou, Yezhou Yang

    Abstract: Recently, event-based vision sensors have gained attention for autonomous driving applications, as conventional RGB cameras face limitations in handling challenging dynamic conditions. However, the availability of real-world and synthetic event-based vision datasets remains limited. In response to this gap, we present SEVD, a first-of-its-kind multi-view ego, and fixed perception synthetic event-b… ▽ More

    Submitted 19 April, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

  2. arXiv:2403.19976  [pdf, other

    cs.CV

    eTraM: Event-based Traffic Monitoring Dataset

    Authors: Aayush Atul Verma, Bharatesh Chakravarthi, Arpitsinh Vaghela, Hua Wei, Yezhou Yang

    Abstract: Event cameras, with their high temporal and dynamic range and minimal memory usage, have found applications in various fields. However, their potential in static traffic monitoring remains largely unexplored. To facilitate this exploration, we present eTraM - a first-of-its-kind, fully event-based traffic monitoring dataset. eTraM offers 10 hr of data from different traffic scenarios in various li… ▽ More

    Submitted 2 April, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

  3. A Comprehensive Review of Leap Motion Controller-based Hand Gesture Datasets

    Authors: Bharatesh Chakravarthi, Prabhu Prasad B M, Pavan Kumar B N

    Abstract: This paper comprehensively reviews hand gesture datasets based on Ultraleap's leap motion controller, a popular device for capturing and tracking hand gestures in real-time. The aim is to offer researchers and practitioners a valuable resource for develo** and evaluating gesture recognition algorithms. The review compares various datasets found in the literature, considering factors such as targ… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  4. arXiv:2309.01324  [pdf, other

    cs.CV

    SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras

    Authors: Himanshu Pahadia, Duo Lu, Bharatesh Chakravarthi, Yezhou Yang

    Abstract: Intelligent transportation systems (ITS) have revolutionized modern road infrastructure, providing essential functionalities such as traffic monitoring, road safety assessment, congestion reduction, and law enforcement. Effective vehicle detection and accurate vehicle pose estimation are crucial for ITS, particularly using monocular cameras installed on the road infrastructure. One fundamental cha… ▽ More

    Submitted 12 March, 2024; v1 submitted 3 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE ITSC 2023

  5. arXiv:2205.06119  [pdf, other

    cs.CL cs.LG

    Zero-shot Code-Mixed Offensive Span Identification through Rationale Extraction

    Authors: Manikandan Ravikiran, Bharathi Raja Chakravarthi

    Abstract: This paper investigates the effectiveness of sentence-level transformers for zero-shot offensive span identification on a code-mixed Tamil dataset. More specifically, we evaluate rationale extraction methods of Local Interpretable Model Agnostic Explanations (LIME) \cite{DBLP:conf/kdd/Ribeiro0G16} and Integrated Gradients (IG) \cite{DBLP:conf/icml/SundararajanTY17} for adapting transformer based o… ▽ More

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: Submission to https://dravidianlangtech.github.io/2022/

  6. arXiv:2205.06118  [pdf, other

    cs.CL

    Findings of the Shared Task on Offensive Span Identification from Code-Mixed Tamil-English Comments

    Authors: Manikandan Ravikiran, Bharathi Raja Chakravarthi, Anand Kumar Madasamy, Sangeetha Sivanesan, Ratnavel Rajalakshmi, Sajeetha Thavareesan, Rahul Ponnusamy, Shankar Mahadevan

    Abstract: Offensive content moderation is vital in social media platforms to support healthy online discussions. However, their prevalence in codemixed Dravidian languages is limited to classifying whole comments without identifying part of it contributing to offensiveness. Such limitation is primarily due to the lack of annotated data for offensive spans. Accordingly, in this shared task, we provide Tamil-… ▽ More

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: System Description of Shared Task https://competitions.codalab.org/competitions/36395

  7. arXiv:2204.10196  [pdf, other

    cs.CL cs.AI

    Multimodal Hate Speech Detection from Bengali Memes and Texts

    Authors: Md. Rezaul Karim, Sumon Kanti Dey, Tanhim Islam, Md. Shajalal, Bharathi Raja Chakravarthi

    Abstract: Numerous machine learning (ML) and deep learning (DL)-based approaches have been proposed to utilize textual data from social media for anti-social behavior analysis like cyberbullying, fake news detection, and identification of hate speech mainly for highly-resourced languages such as English. However, despite having a lot of diversity and millions of native speakers, some languages like Bengali… ▽ More

    Submitted 21 December, 2022; v1 submitted 19 April, 2022; originally announced April 2022.

    Comments: arXiv admin note: text overlap with arXiv:2107.00648 by other authors

    Journal ref: Pre-print for our paper at International Conference on Speech & Language Technology for Low-resource Languages (SPELLL'2022)

  8. Overlap** Word Removal is All You Need: Revisiting Data Imbalance in Hope Speech Detection

    Authors: Hariharan RamakrishnaIyer LekshmiAmmal, Manikandan Ravikiran, Gayathri Nisha, Navyasree Balamuralidhar, Adithya Madhusoodanan, Anand Kumar Madasamy, Bharathi Raja Chakravarthi

    Abstract: Hope Speech Detection, a task of recognizing positive expressions, has made significant strides recently. However, much of the current works focus on model development without considering the issue of inherent imbalance in the data. Our work revisits this issue in hope-speech detection by introducing focal loss, data augmentation, and pre-processing strategies. Accordingly, we find that introducin… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

  9. arXiv:2202.04725  [pdf

    cs.CL

    TamilEmo: Finegrained Emotion Detection Dataset for Tamil

    Authors: Charangan Vasantharajan, Sean Benhur, Prasanna Kumar Kumarasen, Rahul Ponnusamy, Sathiyaraj Thangasamy, Ruba Priyadharshini, Thenmozhi Durairaj, Kanchana Sivanraju, Anbukkarasi Sampath, Bharathi Raja Chakravarthi, John Phillip McCrae

    Abstract: Emotional Analysis from textual input has been considered both a challenging and interesting task in Natural Language Processing. However, due to the lack of datasets in low-resource languages (i.e. Tamil), it is difficult to conduct research of high standard in this area. Therefore we introduce this labelled dataset (a largest manually annotated dataset of more than 42k Tamil YouTube comments, la… ▽ More

    Submitted 9 February, 2022; originally announced February 2022.

    Comments: 11 pages, 4 figures

  10. arXiv:2112.15417  [pdf, ps, other

    cs.CL

    Hypers at ComMA@ICON: Modelling Aggressiveness, Gender Bias and Communal Bias Identification

    Authors: Sean Benhur, Roshan Nayak, Kanchana Sivanraju, Adeep Hande, Subalalitha Chinnaudayar Navaneethakrishnan, Ruba Priyadharshini, Bharathi Raja Chakravarthi

    Abstract: Due to the exponentially increasing reach of social media, it is essential to focus on its negative aspects as it can potentially divide society and incite people into violence. In this paper, we present our system description of work on the shared task ComMA@ICON, where we have to classify how aggressive the sentence is and if the sentence is gender-biased or communal biased. These three could be… ▽ More

    Submitted 13 January, 2022; v1 submitted 31 December, 2021; originally announced December 2021.

    Comments: 5 pages

  11. arXiv:2111.09811  [pdf, other

    cs.CL

    Findings of the Sentiment Analysis of Dravidian Languages in Code-Mixed Text

    Authors: Bharathi Raja Chakravarthi, Ruba Priyadharshini, Sajeetha Thavareesan, Dhivya Chinnappa, Durairaj Thenmozhi, Elizabeth Sherly, John P. McCrae, Adeep Hande, Rahul Ponnusamy, Shubhanker Banerjee, Charangan Vasantharajan

    Abstract: We present the results of the Dravidian-CodeMix shared task held at FIRE 2021, a track on sentiment analysis for Dravidian Languages in Code-Mixed Text. We describe the task, its organization, and the submitted systems. This shared task is the continuation of last year's Dravidian-CodeMix shared task held at FIRE 2020. This year's tasks included code-mixing at the intra-token and inter-token level… ▽ More

    Submitted 18 November, 2021; originally announced November 2021.

  12. arXiv:2111.03375  [pdf

    cs.CL

    Develo** Successful Shared Tasks on Offensive Language Identification for Dravidian Languages

    Authors: Bharathi Raja Chakravarthi, Dhivya Chinnappa, Ruba Priyadharshini, Anand Kumar Madasamy, Sangeetha Sivanesan, Subalalitha Chinnaudayar Navaneethakrishnan, Sajeetha Thavareesan, Dhanalakshmi Vadivel, Rahul Ponnusamy, Prasanna Kumar Kumaresan

    Abstract: With the fast growth of mobile computing and Web technologies, offensive language has become more prevalent on social networking platforms. Since offensive language identification in local languages is essential to moderate the social media content, in this paper we work with three Dravidian languages, namely Malayalam, Tamil, and Kannada, that are under-resourced. We present an evaluation task at… ▽ More

    Submitted 5 November, 2021; originally announced November 2021.

    Comments: 23

  13. arXiv:2109.03571  [pdf, other

    cs.SI cs.CL cs.MM

    TrollsWithOpinion: A Dataset for Predicting Domain-specific Opinion Manipulation in Troll Memes

    Authors: Shardul Suryawanshi, Bharathi Raja Chakravarthi, Mihael Arcan, Suzanne Little, Paul Buitelaar

    Abstract: Research into the classification of Image with Text (IWT) troll memes has recently become popular. Since the online community utilizes the refuge of memes to express themselves, there is an abundance of data in the form of memes. These memes have the potential to demean, harras, or bully targeted individuals. Moreover, the targeted individual could fall prey to opinion manipulation. To comprehend… ▽ More

    Submitted 10 May, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

  14. arXiv:2109.00227  [pdf

    cs.CL

    Dataset for Identification of Homophobia and Transophobia in Multilingual YouTube Comments

    Authors: Bharathi Raja Chakravarthi, Ruba Priyadharshini, Rahul Ponnusamy, Prasanna Kumar Kumaresan, Kayalvizhi Sampath, Durairaj Thenmozhi, Sathiyaraj Thangasamy, Rajendran Nallathambi, John Phillip McCrae

    Abstract: The increased proliferation of abusive content on social media platforms has a negative impact on online users. The dread, dislike, discomfort, or mistrust of lesbian, gay, transgender or bisexual persons is defined as homophobia/transphobia. Homophobic/transphobic speech is a type of offensive language that may be summarized as hate speech directed toward LGBT+ people, and it has been a growing c… ▽ More

    Submitted 1 September, 2021; originally announced September 2021.

    Comments: 44 Pages

  15. arXiv:2108.12177  [pdf, other

    cs.CL

    Offensive Language Identification in Low-resourced Code-mixed Dravidian languages using Pseudo-labeling

    Authors: Adeep Hande, Karthik Puranik, Konthala Yasaswini, Ruba Priyadharshini, Sajeetha Thavareesan, Anbukkarasi Sampath, Kogilavani Shanmugavadivel, Durairaj Thenmozhi, Bharathi Raja Chakravarthi

    Abstract: Social media has effectively become the prime hub of communication and digital marketing. As these platforms enable the free manifestation of thoughts and facts in text, images and video, there is an extensive need to screen them to protect individuals and groups from offensive content targeted at them. Our work intends to classify codemixed social media comments/posts in the Dravidian languages o… ▽ More

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: 27 pages, 12 figures, 10 tables

  16. arXiv:2108.08556  [pdf, other

    cs.CL

    Attentive fine-tuning of Transformers for Translation of low-resourced languages @LoResMT 2021

    Authors: Karthik Puranik, Adeep Hande, Ruba Priyadharshini, Thenmozhi Durairaj, Anbukkarasi Sampath, Kingston Pal Thamburaj, Bharathi Raja Chakravarthi

    Abstract: This paper reports the Machine Translation (MT) systems submitted by the IIITT team for the English->Marathi and English->Irish language pairs LoResMT 2021 shared task. The task focuses on getting exceptional translations for rather low-resourced languages like Irish and Marathi. We fine-tune IndicTrans, a pretrained multilingual NMT model for English->Marathi, using external parallel corpus as in… ▽ More

    Submitted 31 August, 2021; v1 submitted 19 August, 2021; originally announced August 2021.

    Comments: 10 pages

  17. arXiv:2108.04616  [pdf

    cs.CL

    Hope Speech detection in under-resourced Kannada language

    Authors: Adeep Hande, Ruba Priyadharshini, Anbukkarasi Sampath, Kingston Pal Thamburaj, Prabakaran Chandran, Bharathi Raja Chakravarthi

    Abstract: Numerous methods have been developed to monitor the spread of negativity in modern years by eliminating vulgar, offensive, and fierce comments from social media platforms. However, there are relatively lesser amounts of study that converges on embracing positivity, reinforcing supportive and reassuring content in online forums. Consequently, we propose creating an English-Kannada Hope speech datas… ▽ More

    Submitted 5 December, 2021; v1 submitted 10 August, 2021; originally announced August 2021.

  18. arXiv:2108.03886  [pdf, other

    cs.CL

    Do Images really do the Talking? Analysing the significance of Images in Tamil Troll meme classification

    Authors: Siddhanth U Hegde, Adeep Hande, Ruba Priyadharshini, Sajeetha Thavareesan, Ratnasingam Sakuntharaj, Sathiyaraj Thangasamy, B Bharathi, Bharathi Raja Chakravarthi

    Abstract: A meme is an part of media created to share an opinion or emotion across the internet. Due to its popularity, memes have become the new forms of communication on social media. However, due to its nature, they are being used in harmful ways such as trolling and cyberbullying progressively. Various data modelling methods create different possibilities in feature extraction and turning them into bene… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

    Comments: 12 pages

  19. arXiv:2108.03867  [pdf, other

    cs.CC

    Benchmarking Multi-Task Learning for Sentiment Analysis and Offensive Language Identification in Under-Resourced Dravidian Languages

    Authors: Adeep Hande, Siddhanth U Hegde, Ruba Priyadharshini, Rahul Ponnusamy, Prasanna Kumar Kumaresan, Sajeetha Thavareesan, Bharathi Raja Chakravarthi

    Abstract: To obtain extensive annotated data for under-resourced languages is challenging, so in this research, we have investigated whether it is beneficial to train models using multi-task learning. Sentiment analysis and offensive language identification share similar discourse properties. The selection of these tasks is motivated by the lack of large labelled data for user-generated code-mixed datasets.… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

    Comments: 29 pages

  20. DravidianCodeMix: Sentiment Analysis and Offensive Language Identification Dataset for Dravidian Languages in Code-Mixed Text

    Authors: Bharathi Raja Chakravarthi, Ruba Priyadharshini, Vigneshwaran Muralidaran, Navya Jose, Shardul Suryawanshi, Elizabeth Sherly, John P. McCrae

    Abstract: This paper describes the development of a multilingual, manually annotated dataset for three under-resourced Dravidian languages generated from social media comments. The dataset was annotated for sentiment analysis and offensive language identification for a total of more than 60,000 YouTube comments. The dataset consists of around 44,000 comments in Tamil-English, around 7,000 comments in Kannad… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: 36 pages

  21. arXiv:2106.04853  [pdf, other

    cs.CL

    DravidianMultiModality: A Dataset for Multi-modal Sentiment Analysis in Tamil and Malayalam

    Authors: Bharathi Raja Chakravarthi, Jishnu Parameswaran P. K, Premjith B, K. P Soman, Rahul Ponnusamy, Prasanna Kumar Kumaresan, Kingston Pal Thamburaj, John P. McCrae

    Abstract: Human communication is inherently multimodal and asynchronous. Analyzing human emotions and sentiment is an emerging field of artificial intelligence. We are witnessing an increasing amount of multimodal content in local languages on social media about products and other topics. However, there are not many multimodal resources available for under-resourced Dravidian languages. Our study aims to cr… ▽ More

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: 31

  22. arXiv:2104.09081  [pdf, other

    cs.CL

    UVCE-IIITT@DravidianLangTech-EACL2021: Tamil Troll Meme Classification: You need to Pay more Attention

    Authors: Siddhanth U Hegde, Adeep Hande, Ruba Priyadharshini, Sajeetha Thavareesan, Bharathi Raja Chakravarthi

    Abstract: Tamil is a Dravidian language that is commonly used and spoken in the southern part of Asia. In the era of social media, memes have been a fun moment in the day-to-day life of people. Here, we try to analyze the true meaning of Tamil memes by categorizing them as troll and non-troll. We propose an ingenious model comprising of a transformer-transformer architecture that tries to attain state-of-th… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

  23. arXiv:2104.09066  [pdf, other

    cs.CL

    IIITT@LT-EDI-EACL2021-Hope Speech Detection: There is always Hope in Transformers

    Authors: Karthik Puranik, Adeep Hande, Ruba Priyadharshini, Sajeetha Thavareesan, Bharathi Raja Chakravarthi

    Abstract: In a world filled with serious challenges like climate change, religious and political conflicts, global pandemics, terrorism, and racial discrimination, an internet full of hate speech, abusive and offensive content is the last thing we desire for. In this paper, we work to identify and promote positive and supportive content on these platforms. We work with several transformer-based models to cl… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

  24. arXiv:2012.14353  [pdf, other

    cs.CL cs.LG

    DeepHateExplainer: Explainable Hate Speech Detection in Under-resourced Bengali Language

    Authors: Md. Rezaul Karim, Sumon Kanti Dey, Tanhim Islam, Sagor Sarker, Mehadi Hasan Menon, Kabir Hossain, Bharathi Raja Chakravarthi, Md. Azam Hossain, Stefan Decker

    Abstract: The exponential growths of social media and micro-blogging sites not only provide platforms for empowering freedom of expressions and individual voices, but also enables people to express anti-social behaviour like online harassment, cyberbullying, and hate speech. Numerous works have been proposed to utilize textual data for social and anti-social behaviour analysis, by predicting the contexts mo… ▽ More

    Submitted 6 August, 2021; v1 submitted 28 December, 2020; originally announced December 2020.

    Comments: Proceeding of IEEE International Conference on Data Science and Advanced Analytics (DSAA'2021), October 6-9, 2021, Porto, Portugal

  25. arXiv:2009.13398  [pdf, ps, other

    cs.CL

    Aspects of Terminological and Named Entity Knowledge within Rule-Based Machine Translation Models for Under-Resourced Neural Machine Translation Scenarios

    Authors: Daniel Torregrosa, Nivranshu Pasricha, Maraim Masoud, Bharathi Raja Chakravarthi, Juan Alonso, Noe Casas, Mihael Arcan

    Abstract: Rule-based machine translation is a machine translation paradigm where linguistic knowledge is encoded by an expert in the form of rules that translate text from source to target language. While this approach grants extensive control over the output of the system, the cost of formalising the needed linguistic knowledge is much higher than training a corpus-based system, where a machine learning ap… ▽ More

    Submitted 28 September, 2020; originally announced September 2020.

  26. arXiv:2008.01545  [pdf, other

    cs.CL

    ULD@NUIG at SemEval-2020 Task 9: Generative Morphemes with an Attention Model for Sentiment Analysis in Code-Mixed Text

    Authors: Koustava Goswami, Priya Rani, Bharathi Raja Chakravarthi, Theodorus Fransen, John P. McCrae

    Abstract: Code mixing is a common phenomena in multilingual societies where people switch from one language to another for various reasons. Recent advances in public communication over different social media sites have led to an increase in the frequency of code-mixed usage in written language. In this paper, we present the Generative Morphemes with Attention (GenMA) Model sentiment analysis system contribu… ▽ More

    Submitted 27 July, 2020; originally announced August 2020.

    Comments: To be published in 14th International Workshop on Semantic Evaluation SemEval-2020

  27. A Survey of Orthographic Information in Machine Translation

    Authors: Bharathi Raja Chakravarthi, Priya Rani, Mihael Arcan, John P. McCrae

    Abstract: Machine translation is one of the applications of natural language processing which has been explored in different languages. Recently researchers started paying attention towards machine translation for resource-poor languages and closely related languages. A widespread and underlying problem for these machine translation systems is the variation in orthographic conventions which causes many issu… ▽ More

    Submitted 4 August, 2020; originally announced August 2020.

    Comments: 18 pages

    Journal ref: SN Computer Science (2021) 2:330

  28. arXiv:2006.00210  [pdf, other

    cs.CL

    A Sentiment Analysis Dataset for Code-Mixed Malayalam-English

    Authors: Bharathi Raja Chakravarthi, Navya Jose, Shardul Suryawanshi, Elizabeth Sherly, John P. McCrae

    Abstract: There is an increasing demand for sentiment analysis of text from social media which are mostly code-mixed. Systems trained on monolingual data fail for code-mixed data due to the complexity of mixing at different levels of the text. However, very few resources are available for code-mixed data to create models specific for this data. Although much research in multilingual and cross-lingual sentim… ▽ More

    Submitted 30 May, 2020; originally announced June 2020.

    Report number: 2020.sltu-1.25

    Journal ref: Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages (SLTU) and Collaboration and Computing for Under-Resourced Languages (CCURL) 2020

  29. arXiv:2006.00206  [pdf, other

    cs.CL

    Corpus Creation for Sentiment Analysis in Code-Mixed Tamil-English Text

    Authors: Bharathi Raja Chakravarthi, Vigneshwaran Muralidaran, Ruba Priyadharshini, John P. McCrae

    Abstract: Understanding the sentiment of a comment from a video or an image is an essential task in many applications. Sentiment analysis of a text can be useful for various decision-making processes. One such application is to analyse the popular sentiments of videos on social media based on viewer comments. However, comments from social media do not follow strict rules of grammar, and they contain mixing… ▽ More

    Submitted 30 May, 2020; originally announced June 2020.

    Journal ref: Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages (SLTU) and Collaboration and Computing for Under-Resourced Languages (CCURL) 2020

  30. arXiv:2004.07807  [pdf, other

    cs.CL cs.LG stat.ML

    Classification Benchmarks for Under-resourced Bengali Language based on Multichannel Convolutional-LSTM Network

    Authors: Md. Rezaul Karim, Bharathi Raja Chakravarthi, John P. McCrae, Michael Cochez

    Abstract: Exponential growths of social media and micro-blogging sites not only provide platforms for empowering freedom of expressions and individual voices but also enables people to express anti-social behaviour like online harassment, cyberbullying, and hate speech. Numerous works have been proposed to utilize these data for social and anti-social behaviours analysis, document characterization, and sent… ▽ More

    Submitted 19 April, 2020; v1 submitted 11 April, 2020; originally announced April 2020.

    Comments: This paper is under review in the Journal of Natural Language Engineering