Skip to main content

Showing 1–20 of 20 results for author: Kovács, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.06464  [pdf, other

    cs.AI cs.CL

    Transforming Wearable Data into Health Insights using Large Language Model Agents

    Authors: Mike A. Merrill, Akshay Paruchuri, Naghmeh Rezaei, Geza Kovacs, Javier Perez, Yun Liu, Erik Schenck, Nova Hammerquist, Jake Sunshine, Shyam Tailor, Kumar Ayush, Hao-Wei Su, Qian He, Cory Y. McLean, Mark Malhotra, Shwetak Patel, Jiening Zhan, Tim Althoff, Daniel McDuff, Xin Liu

    Abstract: Despite the proliferation of wearable health trackers and the importance of sleep and exercise to health, deriving actionable personalized insights from wearable data remains a challenge because doing so requires non-trivial open-ended analysis of these data. The recent rise of large language model (LLM) agents, which can use tools to reason about and interact with the world, presents a promising… ▽ More

    Submitted 11 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: 38 pages

  2. arXiv:2401.13843  [pdf, ps, other

    cs.LG

    Enumerating the k-fold configurations in multi-class classification problems

    Authors: Attila Fazekas, Gyorgy Kovacs

    Abstract: K-fold cross-validation is a widely used tool for assessing classifier performance. The reproducibility crisis faced by artificial intelligence partly results from the irreproducibility of reported k-fold cross-validation-based performance scores. Recently, we introduced numerical techniques to test the consistency of claimed performance scores and experimental setups. In a crucial use case, the m… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  3. arXiv:2312.10708  [pdf, other

    cs.LG

    The Conditioning Bias in Binary Decision Trees and Random Forests and Its Elimination

    Authors: Gábor Timár, György Kovács

    Abstract: Decision tree and random forest classification and regression are some of the most widely used in machine learning approaches. Binary decision tree implementations commonly use conditioning in the form 'feature $\leq$ (or $<$) threshold', with the threshold being the midpoint between two observed feature values. In this paper, we investigate the bias introduced by the choice of conditioning operat… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

  4. arXiv:2311.14024  [pdf, other

    cs.CV cs.LG

    Creating and Leveraging a Synthetic Dataset of Cloud Optical Thickness Measures for Cloud Detection in MSI

    Authors: Aleksis Pirinen, Nosheen Abid, Nuria Agues Paszkowsky, Thomas Ohlson Timoudas, Ronald Scheirer, Chiara Ceccobello, György Kovács, Anders Persson

    Abstract: Cloud formations often obscure optical satellite-based monitoring of the Earth's surface, thus limiting Earth observation (EO) activities such as land cover map**, ocean color analysis, and cropland monitoring. The integration of machine learning (ML) methods within the remote sensing domain has significantly improved performance on a wide range of EO tasks, including cloud detection and filteri… ▽ More

    Submitted 15 March, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: Published in the journal Remote Sensing (2024). Code, data and models available at https://github.com/aleksispi/ml-cloud-opt-thick

  5. arXiv:2311.07541  [pdf, ps, other

    cs.LG

    mlscorecheck: Testing the consistency of reported performance scores and experiments in machine learning

    Authors: György Kovács, Attila Fazekas

    Abstract: Addressing the reproducibility crisis in artificial intelligence through the validation of reported experimental results is a challenging task. It necessitates either the reimplementation of techniques or a meticulous assessment of papers for deviations from the scientific method and best statistical practices. To facilitate the validation of reported results, we have developed numerical technique… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    MSC Class: 68T01 ACM Class: I.2.0

  6. arXiv:2310.12527  [pdf, other

    cs.LG cs.AI

    Testing the Consistency of Performance Scores Reported for Binary Classification Problems

    Authors: Attila Fazekas, György Kovács

    Abstract: Binary classification is a fundamental task in machine learning, with applications spanning various scientific domains. Whether scientists are conducting fundamental research or refining practical applications, they typically assess and rank classification techniques based on performance metrics such as accuracy, sensitivity, and specificity. However, reported performance scores may not always ser… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    MSC Class: 68T01 ACM Class: I.2.1

  7. arXiv:2305.15525  [pdf, other

    cs.CL cs.LG

    Large Language Models are Few-Shot Health Learners

    Authors: Xin Liu, Daniel McDuff, Geza Kovacs, Isaac Galatzer-Levy, Jacob Sunshine, Jiening Zhan, Ming-Zher Poh, Shun Liao, Paolo Di Achille, Shwetak Patel

    Abstract: Large language models (LLMs) can capture rich representations of concepts that are useful for real-world tasks. However, language alone is limited. While existing LLMs excel at text-based inferences, health applications require that models be grounded in numerical data (e.g., vital signs, laboratory values in clinical domains; steps, movement in the wellness domain) that is not easily or readily e… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  8. arXiv:2304.12847  [pdf, other

    cs.CL

    NLP-LTU at SemEval-2023 Task 10: The Impact of Data Augmentation and Semi-Supervised Learning Techniques on Text Classification Performance on an Imbalanced Dataset

    Authors: Sana Sabah Al-Azzawi, György Kovács, Filip Nilsson, Tosin Adewumi, Marcus Liwicki

    Abstract: In this paper, we propose a methodology for task 10 of SemEval23, focusing on detecting and classifying online sexism in social media posts. The task is tackling a serious issue, as detecting harmful content on social media platforms is crucial for mitigating the harm of these posts on users. Our solution for this task is based on an ensemble of fine-tuned transformer-based models (BERTweet, RoBER… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: 6 pages, 5 figures , This paper has beed accepted in SemEval workshop at ACL 2023 conference

  9. arXiv:2206.08593  [pdf, other

    cs.CL cs.LG

    Automatic Correction of Human Translations

    Authors: Jessy Lin, Geza Kovacs, Aditya Shastry, Joern Wuebker, John DeNero

    Abstract: We introduce translation error correction (TEC), the task of automatically correcting human-generated translations. Imperfections in machine translations (MT) have long motivated systems for improving translations post-hoc with automatic post-editing. In contrast, little attention has been devoted to the problem of automatically correcting human translations, despite the intuition that humans make… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: NAACL 2022. Dataset available at: https://github.com/lilt/tec

  10. arXiv:2202.05690  [pdf, other

    cs.CL

    HaT5: Hate Language Identification using Text-to-Text Transfer Transformer

    Authors: Sana Sabah Sabry, Tosin Adewumi, Nosheen Abid, György Kovacs, Foteini Liwicki, Marcus Liwicki

    Abstract: We investigate the performance of a state-of-the art (SoTA) architecture T5 (available on the SuperGLUE) and compare with it 3 other previous SoTA architectures across 5 different tasks from 2 relatively diverse datasets. The datasets are diverse in terms of the number and types of tasks they have. To improve performance, we augment the training data by using an autoregressive model. We achieve ne… ▽ More

    Submitted 11 February, 2022; originally announced February 2022.

    Comments: 7 pages, 3 figures , conference

    MSC Class: 68

  11. arXiv:2112.15095  [pdf, other

    cs.CV cs.LG

    A general technique for the estimation of farm animal body part weights from CT scans and its applications in a rabbit breeding program

    Authors: Ádám Csóka, György Kovács, Virág Ács, Zsolt Matics, Zsolt Gerencsér, Zsolt Szendrő, István Nagy, Örs Petneházy, Imre Repa, Mariann Moizs, Tamás Donkó

    Abstract: Various applications of farm animal imaging are based on the estimation of weights of certain body parts and cuts from the CT images of animals. In many cases, the complexity of the problem is increased by the enormous variability of postures in CT images due to the scanning of non-sedated, living animals. In this paper, we propose a general and robust approach for the estimation of the weights of… ▽ More

    Submitted 30 December, 2021; originally announced December 2021.

  12. arXiv:2111.03853  [pdf, other

    eess.IV cs.CV cs.LG

    A new baseline for retinal vessel segmentation: Numerical identification and correction of methodological inconsistencies affecting 100+ papers

    Authors: György Kovács, Attila Fazekas

    Abstract: In the last 15 years, the segmentation of vessels in retinal images has become an intensively researched problem in medical imaging, with hundreds of algorithms published. One of the de facto benchmarking data sets of vessel segmentation techniques is the DRIVE data set. Since DRIVE contains a predefined split of training and test images, the published performance results of the various segmentati… ▽ More

    Submitted 6 November, 2021; originally announced November 2021.

  13. arXiv:2102.03742  [pdf, other

    cs.HC

    Reconstructing Detailed Browsing Activities from Browser History

    Authors: Geza Kovacs

    Abstract: Users' detailed browsing activity - such as what sites they are spending time on and for how long, and what tabs they have open and which one is focused at any given time - is useful for a number of research and practical applications. Gathering such data, however, requires that users install and use a monitoring tool over long periods of time. In contrast, browser extensions can gain instantaneou… ▽ More

    Submitted 7 February, 2021; originally announced February 2021.

    ACM Class: H.5.m

  14. arXiv:2102.01865  [pdf, other

    cs.HC

    Edvertisements: Adding Microlearning to Social News Feeds and Websites

    Authors: Geza Kovacs

    Abstract: Many long-term goals, such as learning a language, require people to regularly practice every day to achieve mastery. At the same time, people regularly surf the web and read social news feeds in their spare time. We have built a browser extension that teaches vocabulary to users in the context of Facebook feeds and arbitrary websites, by showing users interactive quizzes they can answer without l… ▽ More

    Submitted 2 February, 2021; originally announced February 2021.

    Comments: Extended version of "FeedLearn: Using facebook feeds for microlearning", which was published as an extended abstract at CHI 2015

    ACM Class: H.5.2

  15. arXiv:2102.01864  [pdf, other

    cs.HC

    QuizCram: A Quiz-Driven Lecture Viewing Interface

    Authors: Geza Kovacs, Darren Edge

    Abstract: QuizCram is an interface for navigating lecture videos that uses quizzes to help users determine what they should view. We developed it in response to observing peaks in video seeking behaviors centered around Coursera's in-video quizzes. QuizCram shows users a question to answer, with an associated video segment. Users can use these questions to navigate through video segments, and find video seg… ▽ More

    Submitted 2 February, 2021; originally announced February 2021.

    Comments: Extended version of "QuizCram: A Quiz-Driven Lecture Viewing Interface", which was published as part of the Student Research Competition at CHI 2015

    ACM Class: H.5.2

  16. arXiv:2101.11743  [pdf, other

    cs.HC

    Not Now, Ask Later: Users Weaken Their Behavior Change Regimen Over Time, But Expect To Re-Strengthen It Imminently

    Authors: Geza Kovacs, Zhengxuan Wu, Michael S. Bernstein

    Abstract: How effectively do we adhere to nudges and interventions that help us control our online browsing habits? If we have a temporary lapse and disable the behavior change system, do we later resume our adherence, or has the dam broken? In this paper, we investigate these questions through log analyses of 8,000+ users on HabitLab, a behavior change platform that helps users reduce their time online. We… ▽ More

    Submitted 27 January, 2021; originally announced January 2021.

    Comments: To appear in ACM CHI Conference on Human Factors in Computing Systems (CHI '21), May 8-13, 2021, Yokohama, Japan

    ACM Class: H.5.2

  17. arXiv:2011.05978  [pdf, other

    cs.CL cs.HC

    The Impact of Text Presentation on Translator Performance

    Authors: Samuel Läubli, Patrick Simianer, Joern Wuebker, Geza Kovacs, Rico Sennrich, Spence Green

    Abstract: Widely used computer-aided translation (CAT) tools divide documents into segments such as sentences and arrange them in a side-by-side, spreadsheet-like view. We present the first controlled evaluation of these design choices on translator performance, measuring speed and accuracy in three experimental text processing tasks. We find significant evidence that sentence-by-sentence presentation enabl… ▽ More

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: Accepted for publication in Target

  18. arXiv:2007.12463  [pdf, other

    cs.LG cs.CV stat.ML

    Approximately Optimal Binning for the Piecewise Constant Approximation of the Normalized Unexplained Variance (nUV) Dissimilarity Measure

    Authors: Attila Fazekas, György Kovács

    Abstract: The recently introduced Matching by Tone Map** (MTM) dissimilarity measure enables template matching under smooth non-linear distortions and also has a well-established mathematical background. MTM operates by binning the template, but the ideal binning for a particular problem is an open question. By pointing out an important analogy between the well known mutual information (MI) and MTM, we in… ▽ More

    Submitted 24 July, 2020; originally announced July 2020.

    MSC Class: 65D18

  19. arXiv:2001.06296  [pdf, other

    eess.SP cs.LG stat.ML

    Overly Optimistic Prediction Results on Imbalanced Data: a Case Study of Flaws and Benefits when Applying Over-sampling

    Authors: Gilles Vandewiele, Isabelle Dehaene, György Kovács, Lucas Sterckx, Olivier Janssens, Femke Ongenae, Femke De Backere, Filip De Turck, Kristien Roelens, Johan Decruyenaere, Sofie Van Hoecke, Thomas Demeester

    Abstract: Information extracted from electrohysterography recordings could potentially prove to be an interesting additional source of information to estimate the risk on preterm birth. Recently, a large number of studies have reported near-perfect results to distinguish between recordings of patients that will deliver term or preterm using a public resource, called the Term/Preterm Electrohysterogram datab… ▽ More

    Submitted 28 November, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

    Journal ref: Artificial Intelligence in Medicine. 111 (2021). 101987

  20. Subword Semantic Hashing for Intent Classification on Small Datasets

    Authors: Kumar Shridhar, Ayushman Dash, Amit Sahu, Gustav Grund Pihlgren, Pedro Alonso, Vinaychandran Pondenkandath, Gyorgy Kovacs, Foteini Simistira, Marcus Liwicki

    Abstract: In this paper, we introduce the use of Semantic Hashing as embedding for the task of Intent Classification and achieve state-of-the-art performance on three frequently used benchmarks. Intent Classification on a small dataset is a challenging task for data-hungry state-of-the-art Deep Learning based systems. Semantic Hashing is an attempt to overcome such a challenge and learn robust text classifi… ▽ More

    Submitted 14 September, 2019; v1 submitted 16 October, 2018; originally announced October 2018.

    Comments: Accepted at IJCNN 2019 (Oral Presentation)