Skip to main content

Showing 1–20 of 20 results for author: Hindle, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.00725  [pdf, other

    eess.SP cs.CR cs.LG

    Federated Learning and Differential Privacy Techniques on Multi-hospital Population-scale Electrocardiogram Data

    Authors: Vikhyat Agrawal, Sunil Vasu Kalmady, Venkataseetharam Manoj Malipeddi, Manisimha Varma Manthena, Weijie Sun, Saiful Islam, Abram Hindle, Padma Kaul, Russell Greiner

    Abstract: This research paper explores ways to apply Federated Learning (FL) and Differential Privacy (DP) techniques to population-scale Electrocardiogram (ECG) data. The study learns a multi-label ECG classification model using FL and DP based on 1,565,849 ECG tracings from 7 hospitals in Alberta, Canada. The FL approach allowed collaborative model training without sharing raw data between hospitals while… ▽ More

    Submitted 15 May, 2024; v1 submitted 26 April, 2024; originally announced May 2024.

    Comments: Accepted for ICMHI 2024

  2. arXiv:2311.04229  [pdf, other

    eess.SP cs.LG

    Exploring Best Practices for ECG Signal Processing in Machine Learning

    Authors: Amir Salimi, Sunil Vasu Kalmady, Abram Hindle, Osmar Zaiane, Padma Kaul

    Abstract: In this work we search for best practices in pre-processing of Electrocardiogram (ECG) signals in order to train better classifiers for the diagnosis of heart conditions. State of the art machine learning algorithms have achieved remarkable results in classification of some heart conditions using ECG data, yet there appears to be no consensus on pre-processing best practices. Is this lack of conse… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  3. arXiv:2309.03414  [pdf, other

    cs.SE

    Predicting Defective Visual Code Changes in a Multi-Language AAA Video Game Project

    Authors: Kalvin Eng, Abram Hindle, Alexander Senchenko

    Abstract: Video game development increasingly relies on using visual programming languages as the primary way to build video game features. The aim of using visual programming is to move game logic into the hands of game designers, who may not be as well versed in textual coding. In this paper, we empirically observe that there are more defect-inducing commits containing visual code than textual code in a A… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: ICSME 2023

  4. arXiv:2309.03411  [pdf, other

    cs.SE

    Identifying Defect-Inducing Changes in Visual Code

    Authors: Kalvin Eng, Abram Hindle, Alexander Senchenko

    Abstract: Defects, or bugs, often form during software development. Identifying the root cause of defects is essential to improve code quality, evaluate testing methods, and support defect prediction. Examples of defect-inducing changes can be found using the SZZ algorithm to trace the textual history of defect-fixing changes back to the defect-inducing changes that they fix in line-based code. The line-bas… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: ICSME 2023

  5. Patterns of Multi-Container Composition for Service Orchestration with Docker Compose

    Authors: Kalvin Eng, Abram Hindle, Eleni Stroulia

    Abstract: Software design patterns present general code solutions to common software design problems. Modern software systems rely heavily on containers for running their constituent service components. Yet, despite the prevalence of ready-to-use Docker service images ready to participate in multi-container service compositions of applications, developers do not have much guidance on how to compose their ow… ▽ More

    Submitted 8 May, 2024; v1 submitted 18 May, 2023; originally announced May 2023.

  6. arXiv:2211.10431  [pdf, other

    eess.SP cs.LG

    Improving ECG-based COVID-19 diagnosis and mortality predictions using pre-pandemic medical records at population-scale

    Authors: Weijie Sun, Sunil Vasu Kalmady, Nariman Sepehrvand, Luan Manh Chu, Zihan Wang, Amir Salimi, Abram Hindle, Russell Greiner, Padma Kaul

    Abstract: Pandemic outbreaks such as COVID-19 occur unexpectedly, and need immediate action due to their potential devastating consequences on global health. Point-of-care routine assessments such as electrocardiogram (ECG), can be used to develop prediction models for identifying individuals at risk. However, there is often too little clinically-annotated medical data, especially in early phases of a pande… ▽ More

    Submitted 11 January, 2023; v1 submitted 13 November, 2022; originally announced November 2022.

    Comments: Accepted for NeurIPS 2022 TS4H workshop

  7. arXiv:2210.06291  [pdf, other

    eess.SP cs.LG

    ECG for high-throughput screening of multiple diseases: Proof-of-concept using multi-diagnosis deep learning from population-based datasets

    Authors: Weijie Sun, Sunil Vasu Kalmady, Amir Salimi, Nariman Sepehrvand, Eric Ly, Abram Hindle, Russell Greiner, Padma Kaul

    Abstract: Electrocardiogram (ECG) abnormalities are linked to cardiovascular diseases, but may also occur in other non-cardiovascular conditions such as mental, neurological, metabolic and infectious conditions. However, most of the recent success of deep learning (DL) based diagnostic predictions in selected patient cohorts have been limited to a small set of cardiac diseases. In this study, we use a popul… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: Accepted in Medical Imaging meets NeurIPS 2021 https://www.cse.cuhk.edu.hk/~qdou/public/medneurips2021/88_ECG_for_high-throughput_screening_of_multiple_diseases_final_version.pdf

  8. arXiv:2210.02435  [pdf, other

    cs.SE

    IRJIT: A Simple, Online, Information Retrieval Approach for Just-In-Time Software Defect Prediction

    Authors: Hareem Sahar, Abdul Ali Bangash, Abram Hindle, Denilson Barbosa

    Abstract: Just-in-Time software defect prediction (JIT-SDP) prevents the introduction of defects into the software by identifying them at commit check-in time. Current software defect prediction approaches rely on manually crafted features such as change metrics and involve expensive to train machine learning or deep learning models. These models typically involve extensive training processes that may requi… ▽ More

    Submitted 12 June, 2024; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: Accepted for publication in EMSE 2024

  9. arXiv:2106.10789  [pdf, other

    cs.SE

    An empirical evaluation of the usefulness of Tree Kernels for Commit-time Defect Detection in large software systems

    Authors: Hareem Sahar, Yuxin Liu, Abram Hindle, Denilson Barbosa

    Abstract: Defect detection at commit check-in time prevents the introduction of defects into software systems. Current defect detection approaches rely on metric-based models which are not very accurate and whose results are not directly useful for developers. We propose a method to detect bug-inducing commits by comparing the incoming changes with all past commits in the project, considering both those tha… ▽ More

    Submitted 20 June, 2021; originally announced June 2021.

  10. arXiv:2103.12298  [pdf, other

    cs.SE

    Revisiting Dockerfiles in Open Source Software Over Time

    Authors: Kalvin Eng, Abram Hindle

    Abstract: Docker is becoming ubiquitous with containerization for develo** and deploying applications. Previous studies have analyzed Dockerfiles that are used to create container images in order to better understand how to improve Docker tooling. These studies obtain Dockerfiles using either Docker Hub or Github. In this paper, we revisit the findings of previous studies using the largest set of Dockerfi… ▽ More

    Submitted 23 March, 2021; originally announced March 2021.

  11. arXiv:2012.08053  [pdf, other

    cs.SE

    A Quantitative Study of Security Bug Fixes of GitHub Repositories

    Authors: Daito Nakano, Mingyang Yin, Ryosuke Sato, Abram Hindle, Yasutaka Kamei, Naoyasu Ubayashi

    Abstract: Software is prone to bugs and failures. Security bugs are those that expose or share privileged information and access in violation of the software's requirements. Given the seriousness of security bugs, there are centralized mechanisms for supporting and tracking these bugs across multiple products, one such mechanism is the Common Vulnerabilities and Exposures (CVE) ID description. When a bug ge… ▽ More

    Submitted 14 December, 2020; originally announced December 2020.

  12. arXiv:2010.13712  [pdf, other

    eess.SP cs.LG

    Multilabel 12-Lead Electrocardiogram Classification Using Gradient Boosting Tree Ensemble

    Authors: Alexander William Wong, Weijie Sun, Sunil Vasu Kalmady, Padma Kaul, Abram Hindle

    Abstract: The 12-lead electrocardiogram (ECG) is a commonly used tool for detecting cardiac abnormalities such as atrial fibrillation, blocks, and irregular complexes. For the PhysioNet/CinC 2020 Challenge, we built an algorithm using gradient boosted tree ensembles fitted on morphology and signal processing features to classify ECG diagnosis. For each lead, we derive features from heart rate variability,… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

    Comments: 4 pages, 4 figures, 2020 Computing in Cardiology (CinC) PhysioNet Challenge

  13. arXiv:1911.06348  [pdf, other

    cs.SE

    On the Time-Based Conclusion Stability of Cross-Project Defect Prediction Models

    Authors: Abdul Ali Bangash, Hareem Sahar, Abram Hindle, Karim Ali

    Abstract: Researchers in empirical software engineering often make claims based on observable data such as defect reports. Unfortunately, in many cases, these claims are generalized beyond the data sets that have been evaluated. Will the researcher's conclusions hold a year from now for the same software projects? Perhaps not. Recent studies show that in the area of Software Analytics, conclusions over diff… ▽ More

    Submitted 7 August, 2020; v1 submitted 14 November, 2019; originally announced November 2019.

    Comments: Accepted for publication in Empirical Software Engineering 2020

  14. arXiv:1907.07803  [pdf, other

    cs.SE

    Syntax and Stack Overflow: A methodology for extracting a corpus of syntax errors and fixes

    Authors: Alexander William Wong, Amir Salimi, Shaiful Chowdhury, Abram Hindle

    Abstract: One problem when studying how to find and fix syntax errors is how to get natural and representative examples of syntax errors. Most syntax error datasets are not free, open, and public, or they are extracted from novice programmers and do not represent syntax errors that the general population of developers would make. Programmers of all skill levels post questions and answers to Stack Overflow w… ▽ More

    Submitted 17 July, 2019; originally announced July 2019.

    Comments: 5 pages, ICSME 2019

  15. arXiv:1907.04908  [pdf, other

    cs.SE

    Executability of Python Snippets in Stack Overflow

    Authors: Md Monir Hossain, Nima Mahmoudi, Changyuan Lin, Hamzeh Khazaei, Abram Hindle

    Abstract: Online resources today contain an abundant amount of code snippets for documentation, collaboration, learning, and problem-solving purposes. Their executability in a "plug and play" manner enables us to confirm their quality and use them directly in projects. But, in practice that is often not the case due to several requirements violations or incompleteness. However, it is a difficult task to inv… ▽ More

    Submitted 10 July, 2019; originally announced July 2019.

  16. arXiv:1904.07307  [pdf, other

    cs.IR cs.CL cs.CY

    Tracing Forum Posts to MOOC Content using Topic Analysis

    Authors: Alexander William Wong, Ken Wong, Abram Hindle

    Abstract: Massive Open Online Courses are educational programs that are open and accessible to a large number of people through the internet. To facilitate learning, MOOC discussion forums exist where students and instructors communicate questions, answers, and thoughts related to the course. The primary objective of this paper is to investigate tracing discussion forum posts back to course lecture videos… ▽ More

    Submitted 15 April, 2019; originally announced April 2019.

    Comments: 6 pages, 4 figures, Course project for UofA CMPUT 660, Winter 2019

  17. arXiv:1808.00106  [pdf, other

    cs.SE

    Sourcerer's Apprentice and the study of code snippet migration

    Authors: Stephen Romansky, Cheng Chen, Baljeet Malhotra, Abram Hindle

    Abstract: On the worldwide web, not only are webpages connected but source code is too. Software development is becoming more accessible to everyone and the licensing for software remains complicated. We need to know if software licenses are being maintained properly throughout their reuse and evolution. This motivated the development of the Sourcerer's Apprentice, a webservice that helps track clone relice… ▽ More

    Submitted 31 July, 2018; originally announced August 2018.

    Comments: 10 pages, 2 figures

  18. arXiv:1709.04916  [pdf, other

    cs.CY

    An App Performance Optimization Advisor for Mobile Device App Marketplaces

    Authors: Rubén Saborido, Foutse Khomh, Abram Hindle, Enrique Alba

    Abstract: On mobile phones, users and developers use apps official marketplaces serving as repositories of apps. The Google Play Store and Apple Store are the official marketplaces of Android and Apple products which offer more than a million apps. Although both repositories offer description of apps, information concerning performance is not available. Due to the constrained hardware of mobile devices, use… ▽ More

    Submitted 20 May, 2018; v1 submitted 13 September, 2017; originally announced September 2017.

    Comments: 18 pages, 8 figures

  19. arXiv:1705.01176  [pdf, other

    cs.DC cs.PF

    How does Docker affect energy consumption? Evaluating workloads in and out of Docker containers

    Authors: Eddie Antonio Santos, Carson McLean, Christopher Solinas, Abram Hindle

    Abstract: Context: Virtual machines provide isolation of services at the cost of hypervisors and more resource usage. This spurred the growth of systems like Docker that enable single hosts to isolate several applications, similar to VMs, within a low-overhead abstraction called containers. Motivation: Although containers tout low overhead performance, do they still have low energy consumption? Methodol… ▽ More

    Submitted 2 May, 2017; originally announced May 2017.

    Comments: 12 pages (minus references), 10 figures

    ACM Class: H.3.4

  20. arXiv:1310.7469  [pdf, other

    cs.SE

    Mining the Temporal Evolution of the Android Bug Reporting Community via Sliding Windows

    Authors: Feng Jiang, Jiemin Wang, Abram Hindle, Mario A. Nascimento

    Abstract: The open source development community consists of both paid and volunteer developers as well as new and experienced users. Previous work has applied social network analysis (SNA) to open source communities and has demonstrated value in expertise discovery and triaging. One problem with applying SNA directly to the data of the entire project lifetime is that the impact of local activities will be d… ▽ More

    Submitted 28 October, 2013; originally announced October 2013.

    Report number: TR13-07