Showing 1–24 of 24 results for author: Thambawita, V

  1. arXiv:2405.07354  [pdf, other]

    cs.SD cs.IR cs.LG cs.MM eess.AS

    SoccerNet-Echoes: A Soccer Game Audio Commentary Dataset

    Authors: Sushant Gautam, Mehdi Houshmand Sarkhoosh, Jan Held, Cise Midoglu, Anthony Cioppa, Silvio Giancola, Vajira Thambawita, Michael A. Riegler, Pål Halvorsen, Mubarak Shah

    Abstract: The application of Automatic Speech Recognition (ASR) technology in soccer offers numerous opportunities for sports analytics. Specifically, extracting audio commentaries with ASR provides valuable insights into the events of the game, and opens the door to several downstream applications such as automatic highlight generation. This paper presents SoccerNet-Echoes, an augmentation of the SoccerNet…

    Submitted 12 May, 2024; originally announced May 2024.

    ACM Class: I.2.7; I.7

  2. arXiv:2402.17601  [pdf, other]

    cs.LG

    Advancing sleep detection by modelling weak label sets: A novel weakly supervised learning approach

    Authors: Matthias Boeker, Vajira Thambawita, Michael Riegler, Pål Halvorsen, Hugo L. Hammer

    Abstract: Understanding sleep and activity patterns plays a crucial role in physical and mental health. This study introduces a novel approach for sleep detection using weakly supervised learning for scenarios where reliable ground truth labels are unavailable. The proposed method relies on a set of weak labels, derived from the predictions generated by conventional sleep detection algorithms. Introducing a…

    Submitted 27 February, 2024; originally announced February 2024.

    Report number: NCAA-D-24-00134R1

  3. arXiv:2307.16262  [pdf, other]

    eess.IV cs.CV

    Validating polyp and instrument segmentation methods in colonoscopy through Medico 2020 and MedAI 2021 Challenges

    Authors: Debesh Jha, Vanshali Sharma, Debapriya Banik, Debayan Bhattacharya, Kaushiki Roy, Steven A. Hicks, Nikhil Kumar Tomar, Vajira Thambawita, Adrian Krenzer, Ge-Peng Ji, Sahadev Poudel, George Batchkala, Saruar Alam, Awadelrahman M. A. Ahmed, Quoc-Huy Trinh, Zeshan Khan, Tien-Phat Nguyen, Shruti Shrestha, Sabari Nathan, Jeonghwan Gwak, Ritika K. Jha, Zheyuan Zhang, Alexander Schlaefer, Debotosh Bhattacharjee, M. K. Bhuyan, et al. (8 additional authors not shown)

    Abstract: Automatic analysis of colonoscopy images has been an active field of research motivated by the importance of early detection of precancerous polyps. However, detecting polyps during the live examination can be challenging due to various factors such as variation of skills and experience among the endoscopists, lack of attentiveness, and fatigue leading to a high polyp miss-rate. Deep learning has…

    Submitted 6 May, 2024; v1 submitted 30 July, 2023; originally announced July 2023.

  4. arXiv:2304.05233  [pdf, other]

    eess.IV cs.CV cs.LG

    Mask-conditioned latent diffusion for generating gastrointestinal polyp images

    Authors: Roman Macháček, Leila Mozaffari, Zahra Sepasdar, Sravanthi Parasa, Pål Halvorsen, Michael A. Riegler, Vajira Thambawita

    Abstract: In order to take advantage of AI solutions in endoscopy diagnostics, we must overcome the issue of limited annotations. These limitations are caused by the high privacy concerns in the medical field and the requirement of getting aid from experts for the time-consuming and costly medical data annotation process. In computer vision, image synthesis has made a significant contribution in recent year…

    Submitted 11 April, 2023; originally announced April 2023.

  5. arXiv:2212.08568  [pdf, other]

    cs.CV cs.LG

    Biomedical image analysis competitions: The state of current participation practice

    Authors: Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu Dietlinde Tizabi, Fabian Isensee, Tim J. Adler, Patrick Godau, Veronika Cheplygina, Michal Kozubek, Sharib Ali, Anubha Gupta, Jan Kybic, Alison Noble, Carlos Ortiz de Solórzano, Samiksha Pachade, Caroline Petitjean, Daniel Sage, Donglai Wei, Elizabeth Wilden, Deepak Alapatt, Vincent Andrearczyk, Ujjwal Baid, Spyridon Bakas, Niranjan Balu, Sophia Bano, et al. (331 additional authors not shown)

    Abstract: The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis,…

    Submitted 12 September, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

  6. VISEM-Tracking, a human spermatozoa tracking dataset

    Authors: Vajira Thambawita, Steven A. Hicks, Andrea M. Storås, Thu Nguyen, Jorunn M. Andersen, Oliwia Witczak, Trine B. Haugen, Hugo L. Hammer, Pål Halvorsen, Michael A. Riegler

    Abstract: A manual assessment of sperm motility requires microscopy observation, which is challenging due to the fast-moving spermatozoa in the field of view. To obtain correct results, manual evaluation requires extensive training. Therefore, computer-assisted sperm analysis (CASA) has become increasingly used in clinics. Despite this, more data is needed to train supervised machine learning approaches in…

    Submitted 10 May, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Report number: Scientific Data volume 10

    Journal ref: Sci Data 10, 260 (2023)

  7. arXiv:2211.16834  [pdf, other]

    eess.IV cs.CV cs.LG

    MLC at HECKTOR 2022: The Effect and Importance of Training Data when Analyzing Cases of Head and Neck Tumors using Machine Learning

    Authors: Vajira Thambawita, Andrea M. Storås, Steven A. Hicks, Pål Halvorsen, Michael A. Riegler

    Abstract: Head and neck cancers are the fifth most common cancer worldwide, and recently, analysis of Positron Emission Tomography (PET) and Computed Tomography (CT) images has been proposed to identify patients with a prognosis. Even though the results look promising, more research is needed to further validate and improve the results. This paper presents the work done by team MLC for the 2022 version of t…

    Submitted 30 November, 2022; originally announced November 2022.

    Comments: Submitted to https://hecktor.grand-challenge.org/

  8. arXiv:2205.15428  [pdf, other]

    cs.CV cs.LG

    Segmentation Consistency Training: Out-of-Distribution Generalization for Medical Image Segmentation

    Authors: Birk Torpmann-Hagen, Vajira Thambawita, Kyrre Glette, Pål Halvorsen, Michael A. Riegler

    Abstract: Generalizability is seen as one of the major challenges in deep learning, in particular in the domain of medical imaging, where a change of hospital or in imaging routines can lead to a complete failure of a model. To tackle this, we introduce Consistency Training, a training procedure and alternative to data augmentation based on maximizing models' prediction consistency across augmented and unau…

    Submitted 30 May, 2022; originally announced May 2022.

    Comments: 15 pages

  9. arXiv:2205.15413  [pdf, other]

    eess.IV cs.CV cs.LG

    PolypConnect: Image inpainting for generating realistic gastrointestinal tract images with polyps

    Authors: Jan Andre Fagereng, Vajira Thambawita, Andrea M. Storås, Sravanthi Parasa, Thomas de Lange, Pål Halvorsen, Michael A. Riegler

    Abstract: Early identification of a polyp in the lower gastrointestinal (GI) tract can lead to prevention of life-threatening colorectal cancer. Developing computer-aided diagnosis (CAD) systems to detect polyps can improve detection accuracy and efficiency and save the time of the domain experts called endoscopists. Lack of annotated data is a common challenge when building CAD systems. Generating syntheti…

    Submitted 30 May, 2022; originally announced May 2022.

    Comments: 6 pages

  10. arXiv:2205.15407  [pdf, other]

    cs.CV cs.LG

    Grid HTM: Hierarchical Temporal Memory for Anomaly Detection in Videos

    Authors: Vladimir Monakhov, Vajira Thambawita, Pål Halvorsen, Michael A. Riegler

    Abstract: The interest for video anomaly detection systems has gained traction for the past few years. The current approaches use deep learning to perform anomaly detection in videos, but this approach has multiple problems. For starters, deep learning in general has issues with noise, concept drift, explainability, and training data volumes. Additionally, anomaly detection in itself is a complex task and f…

    Submitted 30 May, 2022; originally announced May 2022.

    Comments: 7 pages

  11. arXiv:2202.12031  [pdf, other]

    cs.CV cs.AI cs.LG

    Assessing generalisability of deep learning-based polyp detection and segmentation methods through a computer vision challenge

    Authors: Sharib Ali, Noha Ghatwary, Debesh Jha, Ece Isik-Polat, Gorkem Polat, Chen Yang, Wuyang Li, Adrian Galdran, Miguel-Ángel González Ballester, Vajira Thambawita, Steven Hicks, Sahadev Poudel, Sang-Woong Lee, Ziyi Jin, Tianyuan Gan, ChengHui Yu, JiangPeng Yan, Doyeob Yeo, Hyunseok Lee, Nikhil Kumar Tomar, Mahmood Haithmi, Amr Ahmed, Michael A. Riegler, Christian Daul, Pål Halvorsen, et al. (7 additional authors not shown)

    Abstract: Polyps are well-known cancer precursors identified by colonoscopy. However, variability in their size, location, and surface largely affect identification, localisation, and characterisation. Moreover, colonoscopic surveillance and removal of polyps (referred to as polypectomy) are highly operator-dependent procedures. There exist a high missed detection rate and incomplete removal of colonic pol…

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: 26 pages

  12. arXiv:2202.01031  [pdf, other]

    cs.CV cs.MM

    MMSys'22 Grand Challenge on AI-based Video Production for Soccer

    Authors: Cise Midoglu, Steven A. Hicks, Vajira Thambawita, Tomas Kupka, Pål Halvorsen

    Abstract: Soccer has a considerable market share of the global sports industry, and the interest in viewing videos from soccer games continues to grow. In this respect, it is important to provide game summaries and highlights of the main game events. However, annotating and producing events and summaries often require expensive equipment and a lot of tedious, cumbersome, manual labor. Therefore, automating…

    Submitted 2 February, 2022; originally announced February 2022.

  13. SinGAN-Seg: Synthetic training data generation for medical image segmentation

    Authors: Vajira Thambawita, Pegah Salehi, Sajad Amouei Sheshkal, Steven A. Hicks, Hugo L. Hammer, Sravanthi Parasa, Thomas de Lange, Pål Halvorsen, Michael A. Riegler

    Abstract: Analyzing medical data to find abnormalities is a time-consuming and costly task, particularly for rare abnormalities, requiring tremendous efforts from medical experts. Artificial intelligence has become a popular tool for the automatic processing of medical data, acting as a supportive tool for doctors. However, the machine learning models used to build these tools are highly dependent on the da…

    Submitted 25 April, 2022; v1 submitted 29 June, 2021; originally announced July 2021.

  14. arXiv:2107.00283  [pdf, other]

    eess.IV cs.CV cs.LG

    DivergentNets: Medical Image Segmentation by Network Ensemble

    Authors: Vajira Thambawita, Steven A. Hicks, Pål Halvorsen, Michael A. Riegler

    Abstract: Detection of colon polyps has become a trending topic in the intersecting fields of machine learning and gastrointestinal endoscopy. The focus has mainly been on per-frame classification. More recently, polyp segmentation has gained attention in the medical community. Segmentation has the advantage of being more accurate than per-frame classification or object detection as it can show the affected…

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: the winning model of the segmentation generalization challenge at EndoCV 2021

    Journal ref: Proceedings of the 3rd International Workshop and Challenge on Computer Vision in Endoscopy (EndoCV 2021) colocated with the 17th IEEE International Symposium on Biomedical Imaging (ISBI 2021)

  15. arXiv:2106.03223  [pdf, other]

    cs.CV

    Meta-learning with implicit gradients in a few-shot setting for medical image segmentation

    Authors: Rabindra Khadga, Debesh Jha, Steven Hicks, Vajira Thambawita, Michael A. Riegler, Sharib Ali, Pål Halvorsen

    Abstract: Widely used traditional supervised deep learning methods require a large number of training samples but often fail to generalize on unseen datasets. Therefore, a more general application of any trained model is quite limited for medical imaging for clinical practice. Using separately trained models for each unique lesion category or a unique patient population will require sufficiently large curat…

    Submitted 30 January, 2022; v1 submitted 6 June, 2021; originally announced June 2021.

    Journal ref: Computers in Biology and Medicine, 2022

  16. arXiv:2012.07430  [pdf, other]

    cs.CV cs.AI cs.LG cs.MM

    Pyramid-Focus-Augmentation: Medical Image Segmentation with Step-Wise Focus

    Authors: Vajira Thambawita, Steven Hicks, Pål Halvorsen, Michael A. Riegler

    Abstract: Segmentation of findings in the gastrointestinal tract is a challenging but also an important task which is an important building stone for sufficient automatic decision support systems. In this work, we present our solution for the Medico 2020 task, which focused on the problem of colon polyp segmentation. We present our simple but efficient idea of using an augmentation method that uses grids in…

    Submitted 14 December, 2020; originally announced December 2020.

  17. arXiv:2005.03912  [pdf, other]

    cs.LG cs.MM stat.ML

    An Extensive Study on Cross-Dataset Bias and Evaluation Metrics Interpretation for Machine Learning applied to Gastrointestinal Tract Abnormality Classification

    Authors: Vajira Thambawita, Debesh Jha, Hugo Lewi Hammer, Håvard D. Johansen, Dag Johansen, Pål Halvorsen, Michael A. Riegler

    Abstract: Precise and efficient automated identification of Gastrointestinal (GI) tract diseases can help doctors treat more patients and improve the rate of disease detection and identification. Currently, automatic analysis of diseases in the GI tract is a hot topic in both computer science and medical-related journals. Nevertheless, the evaluation of such an automatic analysis is often incomplete or simp…

    Submitted 8 May, 2020; originally announced May 2020.

    Comments: 30 pages, 12 figures, 8 tables, Accepted for ACM Transactions on Computing for Healthcare

  18. arXiv:1911.03100  [pdf, other]

    cs.CV cs.LG cs.MM eess.IV

    Extracting temporal features into a spatial domain using autoencoders for sperm video analysis

    Authors: Vajira Thambawita, Pål Halvorsen, Hugo Hammer, Michael Riegler, Trine B. Haugen

    Abstract: In this paper, we present a two-step deep learning method that is used to predict sperm motility and morphology-based on video recordings of human spermatozoa. First, we use an autoencoder to extract temporal features from a given semen video and plot these into image-space, which we call feature-images. Second, these feature-images are used to perform transfer learning to predict the motility and…

    Submitted 8 November, 2019; originally announced November 2019.

    Comments: 3 pages, 1 figure, MediaEval 19, 27-29 October 2019, Sophia Antipolis, France

  19. arXiv:1911.03086  [pdf, other]

    eess.IV cs.CV cs.LG

    Stacked dense optical flows and dropout layers to predict sperm motility and morphology

    Authors: Vajira Thambawita, Pål Halvorsen, Hugo Hammer, Michael Riegler, Trine B. Haugen

    Abstract: In this paper, we analyse two deep learning methods to predict sperm motility and sperm morphology from sperm videos. We use two different inputs: stacked pure frames of videos and dense optical flows of video frames. To solve this regression task of predicting motility and morphology, stacked dense optical flows and extracted original frames from sperm videos were used with the modified state of…

    Submitted 8 November, 2019; originally announced November 2019.

    Comments: 3 pages, 2 figures, MediaEval 19, 27-29 October 2019, Sophia Antipolis, France

  20. arXiv:1910.13327  [pdf, other]

    cs.LG cs.CV eess.IV stat.ML

    Machine Learning-Based Analysis of Sperm Videos and Participant Data for Male Fertility Prediction

    Authors: Steven A. Hicks, Jorunn M. Andersen, Oliwia Witczak, Vajira Thambawita, Pål Halvorsen, Hugo L. Hammer, Trine B. Haugen, Michael A. Riegler

    Abstract: Methods for automatic analysis of clinical data are usually targeted towards a specific modality and do not make use of all relevant data available. In the field of male human reproduction, clinical and biological data are not used to its fullest potential. Manual evaluation of a semen sample using a microscope is time-consuming and requires extensive training. Furthermore, the validity of manual…

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: Preprint, accepted by Nature Scientific Reports for publication 24.10.2019

  21. An optimized Parallel Failure-less Aho-Corasick algorithm for DNA sequence matching

    Authors: Vajira Thambawita, Roshan G. Ragel, Dhammike Elkaduwe

    Abstract: The Aho-Corasick algorithm is multiple patterns searching algorithm running sequentially in various applications like network intrusion detection and bioinformatics for finding several input strings within a given large input string. The parallel version of the Aho-Corasick algorithm is called as Parallel Failure-less Aho-Corasick algorithm because it doesn't need failure links like in the origina…

    Submitted 26 November, 2018; originally announced November 2018.

    Comments: 6 pages, 3 figures, 4 tables, 5 graphs, 2016 IEEE International Conference on Information and Automation for Sustainability (ICIAfS)

  22. arXiv:1810.13278  [pdf, other]

    cs.LG stat.ML

    The Medico-Task 2018: Disease Detection in the Gastrointestinal Tract using Global Features and Deep Learning

    Authors: Vajira Thambawita, Debesh Jha, Michael Riegler, Pål Halvorsen, Hugo Lewi Hammer, Håvard D. Johansen, Dag Johansen

    Abstract: In this paper, we present our approach for the 2018 Medico Task classifying diseases in the gastrointestinal tract. We have proposed a system based on global features and deep neural networks. The best approach combines two neural networks, and the reproducible experimental results signify the efficiency of the proposed model with an accuracy rate of 95.80%, a precision of 95.87%, and an F1-score…

    Submitted 31 October, 2018; originally announced October 2018.

    Comments: 2 pages + 1 page for references, 1 figure, Conference paper

    Journal ref: MediaEval 2018

  23. To Use or Not to Use: CPUs' Cache Optimization Techniques on GPGPUs

    Authors: Vajira Thambawita, Roshan G. Ragel, Dhammike Elkaduwe

    Abstract: General Purpose Graphic Processing Unit (GPGPU) is used widely for achieving high performance or high throughput in parallel programming. This capability of GPGPUs is very famous in the new era and mostly used for scientific computing which requires more processing power than normal personal computers. Therefore, most of the programmers, researchers and industry use this new concept for their work.…

    Submitted 9 October, 2018; originally announced October 2018.

    Comments: 6 pages, 15 Figures

    Journal ref: ICIAfS 2016- IEEE International Conference on Information and Automation for Sustainability

  24. arXiv:1412.7789  [pdf]

    cs.DC cs.PF

    To Use or Not to Use: Graphics Processing Units for Pattern Matching Algorithms

    Authors: Vajira Thambawita, Roshan Ragel, Dhammika Elkaduwe

    Abstract: String matching is an important part in today's computer applications and Aho-Corasick algorithm is one of the main string matching algorithms used to accomplish this. This paper discusses that when can the GPUs be used for string matching applications using the Aho-Corasick algorithm as a benchmark. We have to identify the best unit to run our string matching algorithm according to the performanc…

    Submitted 25 December, 2014; originally announced December 2014.

    Comments: appears in The 7th International Conference on Information and Automation for Sustainability (ICIAfS) 2014