Skip to main content

Showing 1–7 of 7 results for author: Atapattu, T

.
  1. arXiv:2406.07759  [pdf, other

    cs.CL

    LT4SG@SMM4H24: Tweets Classification for Digital Epidemiology of Childhood Health Outcomes Using Pre-Trained Language Models

    Authors: Dasun Athukoralage, Thushari Atapattu, Menasha Thilakaratne, Katrina Falkner

    Abstract: This paper presents our approaches for the SMM4H24 Shared Task 5 on the binary classification of English tweets reporting children's medical disorders. Our first approach involves fine-tuning a single RoBERTa-large model, while the second approach entails ensembling the results of three fine-tuned BERTweet-large models. We demonstrate that although both approaches exhibit identical performance on… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Submitted for the 9th Social Media Mining for Health Research and Applications Workshop and Shared Tasks- Large Language Models (LLMs) and Generalizability for Social Media NLP

  2. arXiv:2208.08486  [pdf, other

    cs.CL

    EmoMent: An Emotion Annotated Mental Health Corpus from two South Asian Countries

    Authors: Thushari Atapattu, Mahen Herath, Charitha Elvitigala, Piyanjali de Zoysa, Kasun Gunawardana, Menasha Thilakaratne, Kasun de Zoysa, Katrina Falkner

    Abstract: People often utilise online media (e.g., Facebook, Reddit) as a platform to express their psychological distress and seek support. State-of-the-art NLP techniques demonstrate strong potential to automatically detect mental health issues from text. Research suggests that mental health issues are reflected in emotions (e.g., sadness) indicated in a person's choice of language. Therefore, we develope… ▽ More

    Submitted 17 August, 2022; originally announced August 2022.

    Comments: This work has been accepted to appear at COLING 2022 Conference

  3. arXiv:2012.02565  [pdf, other

    cs.CL

    Automated Detection of Cyberbullying Against Women and Immigrants and Cross-domain Adaptability

    Authors: Thushari Atapattu, Mahen Herath, Georgia Zhang, Katrina Falkner

    Abstract: Cyberbullying is a prevalent and growing social problem due to the surge of social media technology usage. Minorities, women, and adolescents are among the common victims of cyberbullying. Despite the advancement of NLP technologies, the automated cyberbullying detection remains challenging. This paper focuses on advancing the technology using state-of-the-art NLP techniques. We use a Twitter data… ▽ More

    Submitted 4 December, 2020; originally announced December 2020.

  4. arXiv:2010.06640  [pdf, other

    cs.CL

    Enhancing the Identification of Cyberbullying through Participant Roles

    Authors: Gathika Ratnayaka, Thushari Atapattu, Mahen Herath, Georgia Zhang, Katrina Falkner

    Abstract: Cyberbullying is a prevalent social problem that inflicts detrimental consequences to the health and safety of victims such as psychological distress, anti-social behaviour, and suicide. The automation of cyberbullying detection is a recent but widely researched problem, with current research having a strong focus on a binary classification of bullying versus non-bullying. This paper proposes a no… ▽ More

    Submitted 22 October, 2020; v1 submitted 13 October, 2020; originally announced October 2020.

  5. arXiv:2007.10744  [pdf, ps, other

    cs.SE

    Beyond Accuracy: Assessing Software Documentation Quality

    Authors: Christoph Treude, Justin Middleton, Thushari Atapattu

    Abstract: Good software documentation encourages good software engineering, but the meaning of "good" documentation is vaguely defined in the software engineering literature. To clarify this ambiguity, we draw on work from the data and information quality community to propose a framework that decomposes documentation quality into ten dimensions of structure, content, and style. To demonstrate its applicatio… ▽ More

    Submitted 8 September, 2020; v1 submitted 21 July, 2020; originally announced July 2020.

    Comments: to appear in the Visions and Reflections Track of the ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering 2020

  6. arXiv:1903.03286  [pdf

    cs.CY

    An Identification of Learners' Confusion through Language and Discourse Analysis

    Authors: Thushari Atapattu, Katrina Falkner, Menasha Thilakaratne, Lavendini Sivaneasharajah, Rangana Jayashanka

    Abstract: The substantial growth of online learning, in particular, Massively Open Online Courses (MOOCs), supports research into the development of better models for effective learning. Learner 'confusion' is among one of the identified aspects which impacts the overall learning process, and ultimately, course attrition. Confusion for a learner is an individual state of bewilderment and uncertainty of how… ▽ More

    Submitted 8 March, 2019; originally announced March 2019.

  7. arXiv:1802.06997  [pdf, other

    cs.SE

    Categorizing the Content of GitHub README Files

    Authors: Gede Artha Azriadi Prana, Christoph Treude, Ferdian Thung, Thushari Atapattu, David Lo

    Abstract: README files play an essential role in sha** a developer's first impression of a software repository and in documenting the software project that the repository hosts. Yet, we lack a systematic understanding of the content of a typical README file as well as tools that can process these files automatically. To close this gap, we conduct a qualitative study involving the manual annotation of 4,22… ▽ More

    Submitted 30 July, 2018; v1 submitted 20 February, 2018; originally announced February 2018.