Skip to main content

Showing 1–14 of 14 results for author: Harrison, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.10795  [pdf, other

    cs.LG cs.CY cs.HC

    Diversified Ensembling: An Experiment in Crowdsourced Machine Learning

    Authors: Ira Globus-Harris, Declan Harrison, Michael Kearns, Pietro Perona, Aaron Roth

    Abstract: Crowdsourced machine learning on competition platforms such as Kaggle is a popular and often effective method for generating accurate models. Typically, teams vie for the most accurate model, as measured by overall error on a holdout set, and it is common towards the end of such competitions for teams at the top of the leaderboard to ensemble or average their models outside the platform mechanism… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  2. arXiv:2311.11772  [pdf, other

    cs.CV cs.LG

    Benchmarking Pathology Feature Extractors for Whole Slide Image Classification

    Authors: Georg Wölflein, Dyke Ferber, Asier R. Meneghetti, Omar S. M. El Nahhas, Daniel Truhn, Zunamys I. Carrero, David J. Harrison, Ognjen Arandjelović, Jakob Nikolas Kather

    Abstract: Weakly supervised whole slide image classification is a key task in computational pathology, which involves predicting a slide-level label from a set of image patches constituting the slide. Constructing models to solve this task involves multiple design choices, often made without robust empirical or conclusive theoretical justification. To address this, we conduct a comprehensive benchmarking of… ▽ More

    Submitted 21 June, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: For the conference version see: arXiv:2311.11772v4. For the longer journal version with additional experiments see arXiv:2311.11772v5

  3. arXiv:2305.10552  [pdf, other

    cs.CV cs.LG

    Deep Multiple Instance Learning with Distance-Aware Self-Attention

    Authors: Georg Wölflein, Lucie Charlotte Magister, Pietro Liò, David J. Harrison, Ognjen Arandjelović

    Abstract: Traditional supervised learning tasks require a label for every instance in the training set, but in many real-world applications, labels are only available for collections (bags) of instances. This problem setting, known as multiple instance learning (MIL), is particularly relevant in the medical domain, where high-resolution images are split into smaller patches, but labels apply to the image as… ▽ More

    Submitted 20 May, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  4. arXiv:2301.13767  [pdf, other

    cs.LG cs.DS

    Multicalibration as Boosting for Regression

    Authors: Ira Globus-Harris, Declan Harrison, Michael Kearns, Aaron Roth, Jessica Sorrell

    Abstract: We study the connection between multicalibration and boosting for squared error regression. First we prove a useful characterization of multicalibration in terms of a ``swap regret'' like condition on squared error. Using this characterization, we give an exceedingly simple algorithm that can be analyzed both as a boosting algorithm for regression and as a multicalibration algorithm for a class H… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

    Comments: Code available here: https://github.com/Declancharrison/Level-Set-Boosting

  5. arXiv:2211.00147  [pdf, other

    cs.LG cs.CV physics.ao-ph

    A Machine Learning Tutorial for Operational Meteorology, Part II: Neural Networks and Deep Learning

    Authors: Randy J. Chase, David R. Harrison, Gary Lackmann, Amy McGovern

    Abstract: Over the past decade the use of machine learning in meteorology has grown rapidly. Specifically neural networks and deep learning have been used at an unprecedented rate. In order to fill the dearth of resources covering neural networks with a meteorological lens, this paper discusses machine learning methods in a plain language format that is targeted for the operational meteorological community.… ▽ More

    Submitted 12 March, 2023; v1 submitted 31 October, 2022; originally announced November 2022.

  6. arXiv:2210.06909  [pdf, other

    cs.CV cs.LG q-bio.QM

    HoechstGAN: Virtual Lymphocyte Staining Using Generative Adversarial Networks

    Authors: Georg Wölflein, In Hwa Um, David J Harrison, Ognjen Arandjelović

    Abstract: The presence and density of specific types of immune cells are important to understand a patient's immune response to cancer. However, immunofluorescence staining required to identify T cell subtypes is expensive, time-consuming, and rarely performed in clinical settings. We present a framework to virtually stain Hoechst images (which are cheap and widespread) with both CD3 and CD8 to identify T c… ▽ More

    Submitted 17 October, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted at IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023

  7. arXiv:2204.09782  [pdf, other

    eess.IV cs.CV

    MultiPathGAN: Structure Preserving Stain Normalization using Unsupervised Multi-domain Adversarial Network with Perception Loss

    Authors: Haseeb Nazki, Ognjen Arandjelović, InHwa Um, David Harrison

    Abstract: Histopathology relies on the analysis of microscopic tissue images to diagnose disease. A crucial part of tissue preparation is staining whereby a dye is used to make the salient tissue components more distinguishable. However, differences in laboratory protocols and scanning devices result in significant confounding appearance variation in the corresponding images. This variation increases both h… ▽ More

    Submitted 2 August, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

  8. arXiv:2204.07492  [pdf, other

    physics.ao-ph cs.LG

    A Machine Learning Tutorial for Operational Meteorology, Part I: Traditional Machine Learning

    Authors: Randy J. Chase, David R. Harrison, Amanda Burke, Gary M. Lackmann, Amy McGovern

    Abstract: Recently, the use of machine learning in meteorology has increased greatly. While many machine learning methods are not new, university classes on machine learning are largely unavailable to meteorology students and are not required to become a meteorologist. The lack of formal instruction has contributed to perception that machine learning methods are 'black boxes' and thus end-users are hesitant… ▽ More

    Submitted 7 June, 2022; v1 submitted 15 April, 2022; originally announced April 2022.

    Journal ref: Weather and Forecasting 37 (2022) 1509-1529

  9. arXiv:2110.11164  [pdf, other

    cs.CL cs.AI

    Modeling Performance in Open-Domain Dialogue with PARADISE

    Authors: Marilyn Walker, Colin Harmon, James Graupera, Davan Harrison, Steve Whittaker

    Abstract: There has recently been an explosion of work on spoken dialogue systems, along with an increased interest in open-domain systems that engage in casual conversations on popular topics such as movies, books and music. These systems aim to socially engage, entertain, and even empathize with their users. Since the achievement of such social goals is hard to measure, recent research has used dialogue l… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: The 12th International Workshop on Spoken Dialog System Technology, November 2021

  10. arXiv:2110.06209  [pdf, other

    cs.LG cs.CL cs.MS cs.PL

    A Brief Introduction to Automatic Differentiation for Machine Learning

    Authors: Davan Harrison

    Abstract: Machine learning and neural network models in particular have been improving the state of the art performance on many artificial intelligence related tasks. Neural network models are typically implemented using frameworks that perform gradient based optimization methods to fit a model to a dataset. These frameworks use a technique of calculating derivatives called automatic differentiation (AD) wh… ▽ More

    Submitted 14 October, 2021; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: 8 pages

  11. arXiv:2107.04388  [pdf, other

    cs.CV cs.AI cs.LG

    Hoechst Is All You Need: Lymphocyte Classification with Deep Learning

    Authors: Jessica Cooper, In Hwa Um, Ognjen Arandjelović, David J Harrison

    Abstract: Multiplex immunofluorescence and immunohistochemistry benefit patients by allowing cancer pathologists to identify several proteins expressed on the surface of cells, enabling cell classification, better understanding of the tumour micro-environment, more accurate diagnoses, prognoses, and tailored immunotherapy based on the immune status of individual patients. However, they are expensive and tim… ▽ More

    Submitted 16 July, 2021; v1 submitted 9 July, 2021; originally announced July 2021.

    Comments: 15 pages, 4 figures

  12. arXiv:2103.05108  [pdf, other

    cs.CV cs.AI cs.LG

    Believe The HiPe: Hierarchical Perturbation for Fast, Robust, and Model-Agnostic Saliency Map**

    Authors: Jessica Cooper, Ognjen Arandjelović, David J Harrison

    Abstract: Understanding the predictions made by Artificial Intelligence (AI) systems is becoming more and more important as deep learning models are used for increasingly complex and high-stakes tasks. Saliency map** -- a popular visual attribution method -- is one important tool for this, but existing formulations are limited by either computational cost or architectural constraints. We therefore propose… ▽ More

    Submitted 11 April, 2022; v1 submitted 22 February, 2021; originally announced March 2021.

    Comments: github.com/jessicamarycooper/Hierarchical-Perturbation

  13. arXiv:1803.03759  [pdf, other

    stat.ML cs.LG

    Speech Recognition: Keyword Spotting Through Image Recognition

    Authors: Sanjay Krishna Gouda, Salil Kanetkar, David Harrison, Manfred K Warmuth

    Abstract: The problem of identifying voice commands has always been a challenge due to the presence of noise and variability in speed, pitch, etc. We will compare the efficacies of several neural network architectures for the speech recognition problem. In particular, we will build a model to determine whether a one second audio clip contains a particular word (out of a set of 10), an unknown word, or silen… ▽ More

    Submitted 24 November, 2020; v1 submitted 10 March, 2018; originally announced March 2018.

  14. Evaluation of Formal IDEs for Human-Machine Interface Design and Analysis: The Case of CIRCUS and PVSio-web

    Authors: Camille Fayollas, Célia Martinie, Philippe Palanque, Paolo Masci, Michael D. Harrison, José C. Campos, Saulo Rodrigues e Silva

    Abstract: Critical human-machine interfaces are present in many systems including avionics systems and medical devices. Use error is a concern in these systems both in terms of hardware panels and input devices, and the software that drives the interfaces. Guaranteeing safe usability, in terms of buttons, knobs and displays is now a key element in the overall safety of the system. New integrated developmen… ▽ More

    Submitted 29 January, 2017; originally announced January 2017.

    Comments: In Proceedings F-IDE 2016, arXiv:1701.07925

    Journal ref: EPTCS 240, 2017, pp. 1-19