
Showing 1–7 of 7 results for author: Szeto, J

  1. arXiv:2311.02709  [pdf, other]

    cs.CV

    Benchmarking a Benchmark: How Reliable is MS-COCO?

    Authors: Eric Zimmermann, Justin Szeto, Jerome Pasquero, Frederic Ratle

    Abstract: Benchmark datasets are used to profile and compare algorithms across a variety of tasks, ranging from image classification to segmentation, and also play a large role in image pretraining algorithms. Emphasis is placed on results with little regard to the actual content within the dataset. It is important to question what kind of information is being learned from these datasets and what are the nu…

    Submitted 5 November, 2023; originally announced November 2023.

    Comments: Accepted at ICCV 2023 DataComp Workshop

  2. arXiv:2311.02707  [pdf, other]

    cs.CV

    An Empirical Study of Uncertainty in Polygon Annotation and the Impact of Quality Assurance

    Authors: Eric Zimmermann, Justin Szeto, Frederic Ratle

    Abstract: Polygons are a common annotation format used for quickly annotating objects in instance segmentation tasks. However, many real-world annotation projects request near pixel-perfect labels. While strict pixel guidelines may appear to be the solution to a successful project, practitioners often fail to assess the feasibility of the work requested, and overlook common factors that may challenge the no…

    Submitted 5 November, 2023; originally announced November 2023.

    Comments: Accepted at ICCV 2023 DataComp Workshop

  3. arXiv:2307.01738  [pdf, other]

    eess.IV cs.CV

    Mitigating Calibration Bias Without Fixed Attribute Grouping for Improved Fairness in Medical Imaging Analysis

    Authors: Changjian Shui, Justin Szeto, Raghav Mehta, Douglas L. Arnold, Tal Arbel

    Abstract: Trustworthy deployment of deep learning medical imaging models into real-world clinical practice requires that they be calibrated. However, models that are well calibrated overall can still be poorly calibrated for a sub-population, potentially resulting in a clinician unwittingly making poor decisions for this group based on the recommendations of the model. Although methods have been shown to su…

    Submitted 20 July, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

  4. arXiv:2210.17398  [pdf, other]

    cs.CV eess.IV

    Rethinking Generalization: The Impact of Annotation Style on Medical Image Segmentation

    Authors: Brennan Nichyporuk, Jillian Cardinell, Justin Szeto, Raghav Mehta, Jean-Pierre R. Falet, Douglas L. Arnold, Sotirios A. Tsaftaris, Tal Arbel

    Abstract: Generalization is an important attribute of machine learning models, particularly for those that are to be deployed in a medical context, where unreliable predictions can have real world consequences. While the failure of models to generalize across datasets is typically attributed to a mismatch in the data distributions, performance gaps are often a consequence of biases in the 'ground-truth' lab…

    Submitted 13 December, 2022; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) https://www.melba-journal.org/papers/2022:029.html

    Journal ref: Machine.Learning.for.Biomedical.Imaging. 1 (2022)

  5. arXiv:2108.00713  [pdf, other]

    eess.IV cs.CV cs.LG

    Cohort Bias Adaptation in Aggregated Datasets for Lesion Segmentation

    Authors: Brennan Nichyporuk, Jillian Cardinell, Justin Szeto, Raghav Mehta, Sotirios Tsaftaris, Douglas L. Arnold, Tal Arbel

    Abstract: Many automatic machine learning models developed for focal pathology (e.g. lesions, tumours) detection and segmentation perform well, but do not generalize as well to new patient cohorts, impeding their widespread adoption into real clinical contexts. One strategy to create a more diverse, generalizable training set is to naively pool datasets from different cohorts. Surprisingly, training on this…

    Submitted 18 May, 2022; v1 submitted 2 August, 2021; originally announced August 2021.

    Comments: Accepted at DART 2021

  6. arXiv:2107.12978  [pdf, other]

    eess.IV cs.CV cs.LG

    Optimizing Operating Points for High Performance Lesion Detection and Segmentation Using Lesion Size Reweighting

    Authors: Brennan Nichyporuk, Justin Szeto, Douglas L. Arnold, Tal Arbel

    Abstract: There are many clinical contexts which require accurate detection and segmentation of all focal pathologies (e.g. lesions, tumours) in patient images. In cases where there are a mix of small and large lesions, standard binary cross entropy loss will result in better segmentation of large lesions at the expense of missing small ones. Adjusting the operating point to accurately detect all lesions ge…

    Submitted 18 May, 2022; v1 submitted 27 July, 2021; originally announced July 2021.

    Comments: Accepted at MIDL 2021

  7. arXiv:2103.03098  [pdf, other]

    cs.LG stat.ML

    Accounting for Variance in Machine Learning Benchmarks

    Authors: Xavier Bouthillier, Pierre Delaunay, Mirko Bronzi, Assya Trofimov, Brennan Nichyporuk, Justin Szeto, Naz Sepah, Edward Raff, Kanika Madan, Vikram Voleti, Samira Ebrahimi Kahou, Vincent Michalski, Dmitriy Serdyuk, Tal Arbel, Chris Pal, Gaël Varoquaux, Pascal Vincent

    Abstract: Strong empirical evidence that one machine-learning algorithm A outperforms another one B ideally calls for multiple trials optimizing the learning pipeline over sources of variation such as data sampling, data augmentation, parameter initialization, and hyperparameters choices. This is prohibitively expensive, and corners are cut to reach conclusions. We model the whole benchmarking process, reve…

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: Submitted to MLSys2021