Skip to main content

Showing 1–15 of 15 results for author: Falkner, K

.
  1. arXiv:2406.07759  [pdf, other

    cs.CL

    LT4SG@SMM4H24: Tweets Classification for Digital Epidemiology of Childhood Health Outcomes Using Pre-Trained Language Models

    Authors: Dasun Athukoralage, Thushari Atapattu, Menasha Thilakaratne, Katrina Falkner

    Abstract: This paper presents our approaches for the SMM4H24 Shared Task 5 on the binary classification of English tweets reporting children's medical disorders. Our first approach involves fine-tuning a single RoBERTa-large model, while the second approach entails ensembling the results of three fine-tuned BERTweet-large models. We demonstrate that although both approaches exhibit identical performance on… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Submitted for the 9th Social Media Mining for Health Research and Applications Workshop and Shared Tasks- Large Language Models (LLMs) and Generalizability for Social Media NLP

  2. arXiv:2310.04140  [pdf, other

    cs.LG

    Routing Arena: A Benchmark Suite for Neural Routing Solvers

    Authors: Daniela Thyssens, Tim Dernedde, Jonas K. Falkner, Lars Schmidt-Thieme

    Abstract: Neural Combinatorial Optimization has been researched actively in the last eight years. Even though many of the proposed Machine Learning based approaches are compared on the same datasets, the evaluation protocol exhibits essential flaws and the selection of baselines often neglects State-of-the-Art Operations Research approaches. To improve on both of these shortcomings, we propose the Routing A… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

  3. arXiv:2309.17089  [pdf, other

    cs.LG

    Too Big, so Fail? -- Enabling Neural Construction Methods to Solve Large-Scale Routing Problems

    Authors: Jonas K. Falkner, Lars Schmidt-Thieme

    Abstract: In recent years new deep learning approaches to solve combinatorial optimization problems, in particular NP-hard Vehicle Routing Problems (VRP), have been proposed. The most impactful of these methods are sequential neural construction approaches which are usually trained via reinforcement learning. Due to the high training costs of these models, they usually are trained on limited instance sizes… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

  4. ConservationBots: Autonomous Aerial Robot for Fast Robust Wildlife Tracking in Complex Terrains

    Authors: Fei Chen, Hoa Van Nguyen, David A. Taggart, Katrina Falkner, S. Hamid Rezatofighi, Damith C. Ranasinghe

    Abstract: Today, the most widespread, widely applicable technology for gathering data relies on experienced scientists armed with handheld radio telemetry equipment to locate low-power radio transmitters attached to wildlife from the ground. Although aerial robots can transform labor-intensive conservation tasks, the realization of autonomous systems for tackling task complexities under real-world condition… ▽ More

    Submitted 12 November, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: Accepted to The Journal of Field Robotics

  5. arXiv:2302.05134  [pdf, other

    cs.LG

    Neural Capacitated Clustering

    Authors: Jonas K. Falkner, Lars Schmidt-Thieme

    Abstract: Recent work on deep clustering has found new promising methods also for constrained clustering problems. Their typically pairwise constraints often can be used to guide the partitioning of the data. Many problems however, feature cluster-level constraints, e.g. the Capacitated Clustering Problem (CCP), where each point has a weight and the total weight sum of all points in each cluster is bounded… ▽ More

    Submitted 19 May, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

    Comments: Accepted at the 32nd International Joint Conference on Artificial Intelligence (IJCAI) 2023

  6. arXiv:2208.08486  [pdf, other

    cs.CL

    EmoMent: An Emotion Annotated Mental Health Corpus from two South Asian Countries

    Authors: Thushari Atapattu, Mahen Herath, Charitha Elvitigala, Piyanjali de Zoysa, Kasun Gunawardana, Menasha Thilakaratne, Kasun de Zoysa, Katrina Falkner

    Abstract: People often utilise online media (e.g., Facebook, Reddit) as a platform to express their psychological distress and seek support. State-of-the-art NLP techniques demonstrate strong potential to automatically detect mental health issues from text. Research suggests that mental health issues are reflected in emotions (e.g., sadness) indicated in a person's choice of language. Therefore, we develope… ▽ More

    Submitted 17 August, 2022; originally announced August 2022.

    Comments: This work has been accepted to appear at COLING 2022 Conference

  7. arXiv:2207.07212  [pdf, other

    cs.LG

    Attention, Filling in The Gaps for Generalization in Routing Problems

    Authors: Ahmad Bdeir, Jonas K. Falkner, Lars Schmidt-Thieme

    Abstract: Machine Learning (ML) methods have become a useful tool for tackling vehicle routing problems, either in combination with popular heuristics or as standalone models. However, current methods suffer from poor generalization when tackling problems of different sizes or different distributions. As a result, ML in vehicle routing has witnessed an expansion phase with new methodologies being created fo… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: Accepted at ECML-PKDD 2022

  8. Solving the Traveling Salesperson Problem with Precedence Constraints by Deep Reinforcement Learning

    Authors: Christian Löwens, Inaam Ashraf, Alexander Gembus, Genesis Cuizon, Jonas K. Falkner, Lars Schmidt-Thieme

    Abstract: This work presents solutions to the Traveling Salesperson Problem with precedence constraints (TSPPC) using Deep Reinforcement Learning (DRL) by adapting recent approaches that work well for regular TSPs. Common to these approaches is the use of graph models based on multi-head attention (MHA) layers. One idea for solving the pickup and delivery problem (PDP) is using heterogeneous attentions to e… ▽ More

    Submitted 19 September, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: This preprint has not undergone peer review or any post-submission improvements or corrections. The Version of Record of this contribution is published in KI 2022: Advances in Artificial Intelligence, and is available online at https://doi.org/10.1007/978-3-031-15791-2_14

    Journal ref: KI 2022: Advances in Artificial Intelligence pp 160-172

  9. Learning to Control Local Search for Combinatorial Optimization

    Authors: Jonas K. Falkner, Daniela Thyssens, Ahmad Bdeir, Lars Schmidt-Thieme

    Abstract: Combinatorial optimization problems are encountered in many practical contexts such as logistics and production, but exact solutions are particularly difficult to find and usually NP-hard for considerable problem sizes. To compute approximate solutions, a zoo of generic as well as problem-specific variants of local search is commonly used. However, which variant to apply to which particular proble… ▽ More

    Submitted 13 July, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: Accepted at ECML-PKDD 2022

    Journal ref: In: Amini, MR., Canu, S., Fischer, A., Guns, T., Kralj Novak, P., Tsoumakas, G. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2022. Lecture Notes in Computer Science(), vol 13717. Springer, Cham

  10. arXiv:2205.00772  [pdf, ps, other

    cs.LG cs.AI

    Large Neighborhood Search based on Neural Construction Heuristics

    Authors: Jonas K. Falkner, Daniela Thyssens, Lars Schmidt-Thieme

    Abstract: We propose a Large Neighborhood Search (LNS) approach utilizing a learned construction heuristic based on neural networks as repair operator to solve the vehicle routing problem with time windows (VRPTW). Our method uses graph neural networks to encode the problem and auto-regressively decodes a solution and is trained with reinforcement learning on the construction task without requiring any labe… ▽ More

    Submitted 10 May, 2022; v1 submitted 2 May, 2022; originally announced May 2022.

  11. arXiv:2104.12226  [pdf, other

    cs.LG

    RP-DQN: An application of Q-Learning to Vehicle Routing Problems

    Authors: Ahmad Bdeir, Simon Boeder, Tim Dernedde, Kirill Tkachuk, Jonas K. Falkner, Lars Schmidt-Thieme

    Abstract: In this paper we present a new approach to tackle complex routing problems with an improved state representation that utilizes the model complexity better than previous methods. We enable this by training from temporal differences. Specifically Q-Learning is employed. We show that our approach achieves state-of-the-art performance for autoregressive policies that sequentially insert nodes to const… ▽ More

    Submitted 25 April, 2021; originally announced April 2021.

    Comments: 14 pages, 4 figures

  12. arXiv:2012.02565  [pdf, other

    cs.CL

    Automated Detection of Cyberbullying Against Women and Immigrants and Cross-domain Adaptability

    Authors: Thushari Atapattu, Mahen Herath, Georgia Zhang, Katrina Falkner

    Abstract: Cyberbullying is a prevalent and growing social problem due to the surge of social media technology usage. Minorities, women, and adolescents are among the common victims of cyberbullying. Despite the advancement of NLP technologies, the automated cyberbullying detection remains challenging. This paper focuses on advancing the technology using state-of-the-art NLP techniques. We use a Twitter data… ▽ More

    Submitted 4 December, 2020; originally announced December 2020.

  13. arXiv:2010.06640  [pdf, other

    cs.CL

    Enhancing the Identification of Cyberbullying through Participant Roles

    Authors: Gathika Ratnayaka, Thushari Atapattu, Mahen Herath, Georgia Zhang, Katrina Falkner

    Abstract: Cyberbullying is a prevalent social problem that inflicts detrimental consequences to the health and safety of victims such as psychological distress, anti-social behaviour, and suicide. The automation of cyberbullying detection is a recent but widely researched problem, with current research having a strong focus on a binary classification of bullying versus non-bullying. This paper proposes a no… ▽ More

    Submitted 22 October, 2020; v1 submitted 13 October, 2020; originally announced October 2020.

  14. arXiv:2006.09100  [pdf, other

    cs.LG stat.ML

    Learning to Solve Vehicle Routing Problems with Time Windows through Joint Attention

    Authors: Jonas K. Falkner, Lars Schmidt-Thieme

    Abstract: Many real-world vehicle routing problems involve rich sets of constraints with respect to the capacities of the vehicles, time windows for customers etc. While in recent years first machine learning models have been developed to solve basic vehicle routing problems faster than optimization heuristics, complex constraints rarely are taken into consideration. Due to their general procedure to constr… ▽ More

    Submitted 16 June, 2020; originally announced June 2020.

  15. arXiv:1903.03286  [pdf

    cs.CY

    An Identification of Learners' Confusion through Language and Discourse Analysis

    Authors: Thushari Atapattu, Katrina Falkner, Menasha Thilakaratne, Lavendini Sivaneasharajah, Rangana Jayashanka

    Abstract: The substantial growth of online learning, in particular, Massively Open Online Courses (MOOCs), supports research into the development of better models for effective learning. Learner 'confusion' is among one of the identified aspects which impacts the overall learning process, and ultimately, course attrition. Confusion for a learner is an individual state of bewilderment and uncertainty of how… ▽ More

    Submitted 8 March, 2019; originally announced March 2019.