
Showing 1–50 of 73 results for author: Blaschko, M

Searching in archive cs.
  1. arXiv:2407.00945  [pdf, other]

    cs.LG

    Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs

    Authors: Enshu Liu, Junyi Zhu, Zinan Lin, Xuefei Ning, Matthew B. Blaschko, Shengen Yan, Guohao Dai, Huazhong Yang, Yu Wang

    Abstract: The rapid advancement of large language models (LLMs) has led to architectures with billions to trillions of parameters, posing significant deployment challenges due to their substantial demands on memory, processing power, and energy consumption. Sparse Mixture-of-Experts (SMoE) architectures have emerged as a solution, activating only a subset of parameters per token, thereby achieving faster in…

    Submitted 30 June, 2024; originally announced July 2024.

  2. arXiv:2406.16069  [pdf, other]

    cs.CL cs.AI

    FastMem: Fast Memorization of Prompt Improves Context Awareness of Large Language Models

    Authors: Junyi Zhu, Shuochen Liu, Yu Yu, Bo Tang, Yibo Yan, Zhiyu Li, Feiyu Xiong, Tong Xu, Matthew B. Blaschko

    Abstract: Large language models (LLMs) excel in generating coherent text, but they often struggle with context awareness, leading to inaccuracies in tasks requiring faithful adherence to provided information. We introduce FastMem, a novel method designed to enhance instruction fine-tuned LLMs' context awareness through fast memorization of the prompt. FastMem maximizes the likelihood of the prompt before in…

    Submitted 23 June, 2024; originally announced June 2024.

  3. arXiv:2406.14629  [pdf, other]

    cs.CL cs.AI

    Can LLMs Learn by Teaching? A Preliminary Study

    Authors: Xuefei Ning, Zifu Wang, Shiyao Li, Zinan Lin, Peiran Yao, Tianyu Fu, Matthew B. Blaschko, Guohao Dai, Huazhong Yang, Yu Wang

    Abstract: Teaching to improve student models (e.g., knowledge distillation) is an extensively studied methodology in LLMs. However, for humans, teaching not only improves students but also improves teachers. We ask: Can LLMs also learn by teaching (LbT)? If yes, we can potentially unlock the possibility of continuously advancing the models without solely relying on human-produced data or stronger models. In…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Under review

  4. arXiv:2406.13103  [pdf, other]

    cs.AI cs.LG

    A Generic Method for Fine-grained Category Discovery in Natural Language Texts

    Authors: Chang Tian, Matthew B. Blaschko, Wenpeng Yin, Mingzhe Xing, Yinliang Yue, Marie-Francine Moens

    Abstract: Fine-grained category discovery using only coarse-grained supervision is a cost-effective yet challenging task. Previous training methods focus on aligning query samples with positive samples and distancing them from negatives. They often neglect intra-category and inter-category semantic similarities of fine-grained categories when navigating sample distributions in the embedding space. Furthermo…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: preprint

  5. arXiv:2406.08226  [pdf, other]

    cs.CV cs.AI cs.LG

    DistilDoc: Knowledge Distillation for Visually-Rich Document Applications

    Authors: Jordy Van Landeghem, Subhajit Maity, Ayan Banerjee, Matthew Blaschko, Marie-Francine Moens, Josep Lladós, Sanket Biswas

    Abstract: This work explores knowledge distillation (KD) for visually-rich document (VRD) applications such as document layout analysis (DLA) and document image classification (DIC). While VRD research is dependent on increasingly sophisticated and cumbersome models, the field has neglected to study efficiency via model compression. Here, we design a KD experimentation methodology for more lean, performant…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted to ICDAR 2024 (Athens, Greece)

  6. arXiv:2405.12705  [pdf, other]

    cs.CV cs.CL cs.LG

    Multimodal Adaptive Inference for Document Image Classification with Anytime Early Exiting

    Authors: Omar Hamed, Souhail Bakkali, Marie-Francine Moens, Matthew Blaschko, Jordy Van Landeghem

    Abstract: This work addresses the need for a balanced approach between performance and efficiency in scalable production environments for visually-rich document understanding (VDU) tasks. Currently, there is a reliance on large document foundation models that offer advanced capabilities but come with a heavy computational burden. In this paper, we propose a multimodal early exit (EE) model design that incor…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: Accepted at ICDAR 2024

  7. arXiv:2405.07930  [pdf, other]

    cs.MM cs.CV cs.LG cs.SD eess.AS

    Improving Multimodal Learning with Multi-Loss Gradient Modulation

    Authors: Konstantinos Kontras, Christos Chatzichristos, Matthew Blaschko, Maarten De Vos

    Abstract: Learning from multiple modalities, such as audio and video, offers opportunities for leveraging complementary information, enhancing robustness, and improving contextual understanding and performance. However, combining such modalities presents challenges, especially when modalities differ in data structure, predictive contribution, and the complexity of their learning processes. It has been obser…

    Submitted 13 May, 2024; originally announced May 2024.

  8. arXiv:2405.02509  [pdf, other]

    cs.CV cs.AI

    Implicit Neural Representations for Robust Joint Sparse-View CT Reconstruction

    Authors: Jiayang Shi, Junyi Zhu, Daniel M. Pelt, K. Joost Batenburg, Matthew B. Blaschko

    Abstract: Computed Tomography (CT) is pivotal in industrial quality control and medical diagnostics. Sparse-view CT, offering reduced ionizing radiation, faces challenges due to its under-sampled nature, leading to ill-posed reconstruction problems. Recent advancements in Implicit Neural Representations (INRs) have shown promise in addressing sparse-view CT reconstruction. Recognizing that CT often involves…

    Submitted 3 May, 2024; originally announced May 2024.

  9. arXiv:2404.02241  [pdf, other]

    cs.CV

    Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better

    Authors: Enshu Liu, Junyi Zhu, Zinan Lin, Xuefei Ning, Matthew B. Blaschko, Sergey Yekhanin, Shengen Yan, Guohao Dai, Huazhong Yang, Yu Wang

    Abstract: Diffusion Models (DM) and Consistency Models (CM) are two types of popular generative models with good generation quality on various tasks. When training DM and CM, intermediate weight checkpoints are not fully utilized and only the last converged checkpoint is used. In this work, we find that high-quality model weights often lie in a basin which cannot be reached by SGD but can be obtained by pro…

    Submitted 7 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  10. arXiv:2402.14957  [pdf, other]

    cs.CV cs.LG

    The Common Stability Mechanism behind most Self-Supervised Learning Approaches

    Authors: Abhishek Jha, Matthew B. Blaschko, Yuki M. Asano, Tinne Tuytelaars

    Abstract: Last couple of years have witnessed a tremendous progress in self-supervised learning (SSL), the success of which can be attributed to the introduction of useful inductive biases in the learning process to learn meaningful visual representations while avoiding collapse. These inductive biases and constraints manifest themselves in the form of different optimization formulations in the SSL techniqu…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: Additional visualizations (.gif): https://github.com/abskjha/CenterVectorSSL

  11. arXiv:2401.15223  [pdf, other]

    cs.CV cs.LG

    Biological Valuation Map of Flanders: A Sentinel-2 Imagery Analysis

    Authors: Mingshi Li, Dusan Grujicic, Steven De Saeger, Stien Heremans, Ben Somers, Matthew B. Blaschko

    Abstract: In recent years, machine learning has become crucial in remote sensing analysis, particularly in the domain of Land-use/Land-cover (LULC). The synergy of machine learning and satellite imagery analysis has demonstrated significant productivity in this field, as evidenced by several studies. A notable challenge within this area is the semantic segmentation mapping of land usage over extensive terri…

    Submitted 26 January, 2024; originally announced January 2024.

  12. arXiv:2312.08589  [pdf, other]

    cs.LG stat.ML

    Consistent and Asymptotically Unbiased Estimation of Proper Calibration Errors

    Authors: Teodora Popordanoska, Sebastian G. Gruber, Aleksei Tiulpin, Florian Buettner, Matthew B. Blaschko

    Abstract: Proper scoring rules evaluate the quality of probabilistic predictions, playing an essential role in the pursuit of accurate and well-calibrated models. Every proper score decomposes into two fundamental components -- proper calibration error and refinement -- utilizing a Bregman divergence. While uncertainty calibration has gained significant attention, current literature lacks a general estimato…

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: Preprint

  13. arXiv:2312.08586  [pdf, other]

    cs.LG cs.CV stat.ML

    Estimating calibration error under label shift without labels

    Authors: Teodora Popordanoska, Gorjan Radevski, Tinne Tuytelaars, Matthew B. Blaschko

    Abstract: In the face of dataset shift, model calibration plays a pivotal role in ensuring the reliability of machine learning systems. Calibration error (CE) is an indicator of the alignment between the predicted probabilities and the classifier accuracy. While prior works have delved into the implications of dataset shift on calibration, existing CE estimators assume access to labels from the target domai…

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: Preprint

  14. arXiv:2312.06645  [pdf, other]

    cs.CV

    Beyond Classification: Definition and Density-based Estimation of Calibration in Object Detection

    Authors: Teodora Popordanoska, Aleksei Tiulpin, Matthew B. Blaschko

    Abstract: Despite their impressive predictive performance in various computer vision tasks, deep neural networks (DNNs) tend to make overly confident predictions, which hinders their widespread use in safety-critical applications. While there have been recent attempts to calibrate DNNs, most of these efforts have primarily been focused on classification tasks, thus neglecting DNN-based object detectors. Alt…

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: To appear at WACV 2024

  15. arXiv:2310.19252  [pdf, other]

    cs.CV cs.AI cs.LG

    Revisiting Evaluation Metrics for Semantic Segmentation: Optimization and Evaluation of Fine-grained Intersection over Union

    Authors: Zifu Wang, Maxim Berman, Amal Rannen-Triki, Philip H. S. Torr, Devis Tuia, Tinne Tuytelaars, Luc Van Gool, Jiaqian Yu, Matthew B. Blaschko

    Abstract: Semantic segmentation datasets often exhibit two types of imbalance: \textit{class imbalance}, where some classes appear more frequently than others and \textit{size imbalance}, where some objects occupy more pixels than others. This causes traditional evaluation metrics to be biased towards \textit{majority classes} (e.g. overall pixel-wise accuracy) and \textit{large objects} (e.g. mean pixel-wi…

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  16. arXiv:2310.05166  [pdf, other]

    cs.LG stat.ML

    A Corrected Expected Improvement Acquisition Function Under Noisy Observations

    Authors: Han Zhou, Xingchen Ma, Matthew B Blaschko

    Abstract: Sequential maximization of expected improvement (EI) is one of the most widely used policies in Bayesian optimization because of its simplicity and ability to handle noisy observations. In particular, the improvement function often uses the best posterior mean as the best incumbent in noisy settings. However, the uncertainty associated with the incumbent solution is often neglected in many analyti…

    Submitted 13 November, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

  17. arXiv:2308.12896  [pdf, other]

    cs.CV cs.CL cs.LG

    Beyond Document Page Classification: Design, Datasets, and Challenges

    Authors: Jordy Van Landeghem, Sanket Biswas, Matthew B. Blaschko, Marie-Francine Moens

    Abstract: This paper highlights the need to bring document classification benchmarking closer to real-world applications, both in the nature of data tested ($X$: multi-channel, multi-paged, multi-industry; $Y$: class distributions and label set variety) and in classification tasks considered ($f$: multi-page document, page stream, and document bundle classification, ...). We identify the lack of public mult…

    Submitted 31 October, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: 12 pages, accepted at WACV 2024; camera-ready (paper id 1123)

  18. arXiv:2307.12717  [pdf, ps, other]

    cs.CV eess.IV

    Dense Transformer based Enhanced Coding Network for Unsupervised Metal Artifact Reduction

    Authors: Wangduo Xie, Matthew B. Blaschko

    Abstract: CT images corrupted by metal artifacts have serious negative effects on clinical diagnosis. Considering the difficulty of collecting paired data with ground truth in clinical settings, unsupervised methods for metal artifact reduction are of high interest. However, it is difficult for previous unsupervised methods to retain structural information from CT images while handling the non-local charact…

    Submitted 28 July, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

  19. arXiv:2307.07483  [pdf, other]

    cs.CV

    Multimodal Distillation for Egocentric Action Recognition

    Authors: Gorjan Radevski, Dusan Grujicic, Marie-Francine Moens, Matthew Blaschko, Tinne Tuytelaars

    Abstract: The focal point of egocentric video understanding is modelling hand-object interactions. Standard models, e.g. CNNs or Vision Transformers, which receive RGB frames as input perform well. However, their performance improves further by employing additional input modalities that provide complementary cues, such as object detections, optical flow, audio, etc. The added complexity of the modality-spec…

    Submitted 18 July, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: Accepted at ICCV 2023; Codebase released at https://github.com/gorjanradevski/multimodal-distillation

  20. arXiv:2306.00127  [pdf, other]

    cs.LG cs.CR

    Surrogate Model Extension (SME): A Fast and Accurate Weight Update Attack on Federated Learning

    Authors: Junyi Zhu, Ruicong Yao, Matthew B. Blaschko

    Abstract: In Federated Learning (FL) and many other distributed training frameworks, collaborators can hold their private data locally and only share the network weights trained with the local data after multiple iterations. Gradient inversion is a family of privacy attacks that recovers data from its generated gradients. Seemingly, FL can provide a degree of protection against gradient inversion attacks on…

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: Accepted at ICML 2023

  21. arXiv:2305.12557  [pdf, other]

    cs.LG cs.AI

    Confidence-aware Personalized Federated Learning via Variational Expectation Maximization

    Authors: Junyi Zhu, Xingchen Ma, Matthew B. Blaschko

    Abstract: Federated Learning (FL) is a distributed learning scheme to train a shared model across clients. One common and fundamental challenge in FL is that the sets of data across clients could be non-identically distributed and have different sizes. Personalized Federated Learning (PFL) attempts to solve this challenge via locally adapted models. In this work, we present a novel framework for PFL based o…

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: Accepted at CVPR 2023

  22. arXiv:2305.08455  [pdf, other]

    cs.CV cs.CL cs.LG

    Document Understanding Dataset and Evaluation (DUDE)

    Authors: Jordy Van Landeghem, Rubén Tito, Łukasz Borchmann, Michał Pietruszka, Paweł Józiak, Rafał Powalski, Dawid Jurkiewicz, Mickaël Coustaty, Bertrand Ackaert, Ernest Valveny, Matthew Blaschko, Sien Moens, Tomasz Stanisławek

    Abstract: We call on the Document AI (DocAI) community to reevaluate current methodologies and embrace the challenge of creating more practically-oriented benchmarks. Document Understanding Dataset and Evaluation (DUDE) seeks to remediate the halted research progress in understanding visually-rich documents (VRDs). We present a new dataset with novelties related to types of questions, answers, and document…

    Submitted 11 September, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: Accepted at ICCV 2023

  23. arXiv:2303.16296  [pdf, other]

    cs.CV cs.AI cs.LG

    Dice Semimetric Losses: Optimizing the Dice Score with Soft Labels

    Authors: Zifu Wang, Teodora Popordanoska, Jeroen Bertels, Robin Lemmens, Matthew B. Blaschko

    Abstract: The soft Dice loss (SDL) has taken a pivotal role in numerous automated segmentation pipelines in the medical imaging community. Over the last years, some reasons behind its superior functioning have been uncovered and further optimizations have been explored. However, there is currently no implementation that supports its direct utilization in scenarios involving soft labels. Hence, a synergy bet…

    Submitted 20 March, 2024; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: MICCAI 2023

  24. arXiv:2302.05666  [pdf, other]

    cs.CV cs.AI cs.LG

    Jaccard Metric Losses: Optimizing the Jaccard Index with Soft Labels

    Authors: Zifu Wang, Xuefei Ning, Matthew B. Blaschko

    Abstract: Intersection over Union (IoU) losses are surrogates that directly optimize the Jaccard index. Leveraging IoU losses as part of the loss function have demonstrated superior performance in semantic segmentation tasks compared to optimizing pixel-wise losses such as the cross-entropy loss alone. However, we identify a lack of flexibility in these losses to support vital training techniques like label…

    Submitted 20 March, 2024; v1 submitted 11 February, 2023; originally announced February 2023.

    Comments: NeurIPS 2023

  25. Understanding metric-related pitfalls in image analysis validation

    Authors: Annika Reinke, Minu D. Tizabi, Michael Baumgartner, Matthias Eisenmann, Doreen Heckmann-Nötzel, A. Emre Kavur, Tim Rädsch, Carole H. Sudre, Laura Acion, Michela Antonelli, Tal Arbel, Spyridon Bakas, Arriel Benis, Matthew Blaschko, Florian Buettner, M. Jorge Cardoso, Veronika Cheplygina, Jianxu Chen, Evangelia Christodoulou, Beth A. Cimini, Gary S. Collins, Keyvan Farahani, Luciana Ferrer, Adrian Galdran, Bram van Ginneken, et al. (53 additional authors not shown)

    Abstract: Validation metrics are key for the reliable tracking of scientific progress and for bridging the current chasm between artificial intelligence (AI) research and its translation into practice. However, increasing evidence shows that particularly in image analysis, metrics are often chosen inadequately in relation to the underlying research problem. This could be attributed to a lack of accessibilit…

    Submitted 23 February, 2024; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: Shared first authors: Annika Reinke and Minu D. Tizabi; shared senior authors: Lena Maier-Hein and Paul F. Jäger. Published in Nature Methods. arXiv admin note: text overlap with arXiv:2206.01653

    Journal ref: Nature methods, 1-13 (2024)

  26. Clinically-Inspired Multi-Agent Transformers for Disease Trajectory Forecasting from Multimodal Data

    Authors: Huy Hoang Nguyen, Matthew B. Blaschko, Simo Saarakkala, Aleksei Tiulpin

    Abstract: Deep neural networks are often applied to medical images to automate the problem of medical diagnosis. However, a more clinically relevant question that practitioners usually face is how to predict the future trajectory of a disease. Current methods for prognosis or disease trajectory forecasting often require domain knowledge and are complicated to apply. In this paper, we formulate the prognosis…

    Submitted 19 September, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

    Comments: Accepted in IEEE Transactions on Medical Imaging 2023

  27. arXiv:2210.07810  [pdf, other]

    stat.ML cs.CV

    A Consistent and Differentiable Lp Canonical Calibration Error Estimator

    Authors: Teodora Popordanoska, Raphael Sayer, Matthew B. Blaschko

    Abstract: Calibrated probabilistic classifiers are models whose predicted probabilities can directly be interpreted as uncertainty estimates. It has been shown recently that deep neural networks are poorly calibrated and tend to output overconfident predictions. As a remedy, we propose a low-bias, trainable calibration error estimator based on Dirichlet kernel density estimates, which asymptotically converg…

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: To appear at NeurIPS 2022

  28. arXiv:2210.04331  [pdf, other]

    cs.CV

    Students taught by multimodal teachers are superior action recognizers

    Authors: Gorjan Radevski, Dusan Grujicic, Matthew Blaschko, Marie-Francine Moens, Tinne Tuytelaars

    Abstract: The focal point of egocentric video understanding is modelling hand-object interactions. Standard models -- CNNs, Vision Transformers, etc. -- which receive RGB frames as input perform well, however, their performance improves further by employing additional modalities such as object detections, optical flow, audio, etc. as input. The added complexity of the required modality-specific modules, on…

    Submitted 9 October, 2022; originally announced October 2022.

    Comments: Extended abstract accepted at the 2nd Ego4D Workshop @ ECCV 2022

  29. arXiv:2208.11977  [pdf, other]

    math.ST cs.LG

    On confidence intervals for precision matrices and the eigendecomposition of covariance matrices

    Authors: Teodora Popordanoska, Aleksei Tiulpin, Wacha Bounliphone, Matthew B. Blaschko

    Abstract: The eigendecomposition of a matrix is the central procedure in probabilistic models based on matrix factorization, for instance principal component analysis and topic models. Quantifying the uncertainty of such a decomposition based on a finite sample estimate is essential to reasoning under uncertainty when employing such models. This paper tackles the challenge of computing confidence bounds on…

    Submitted 25 August, 2022; originally announced August 2022.

    Comments: arXiv admin note: text overlap with arXiv:1604.01733

  30. arXiv:2207.06168  [pdf, other]

    cs.LG cs.AI cs.CV

    MRF-UNets: Searching UNet with Markov Random Fields

    Authors: Zifu Wang, Matthew B. Blaschko

    Abstract: UNet [27] is widely used in semantic segmentation due to its simplicity and effectiveness. However, its manually-designed architecture is applied to a large number of problem settings, either with no architecture optimizations, or with manual tuning, which is time consuming and can be sub-optimal. In this work, firstly, we propose Markov Random Field Neural Architecture Search (MRF-NAS) that exten…

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: ECML-PKDD 2022

  31. arXiv:2206.09022  [pdf, other]

    cs.LG cs.AI

    Designing MacPherson Suspension Architectures using Bayesian Optimization

    Authors: Sinnu Susan Thomas, Jacopo Palandri, Mohsen Lakehal-ayat, Punarjay Chakravarty, Friedrich Wolf-Monheim, Matthew B. Blaschko

    Abstract: Engineering design is traditionally performed by hand: an expert makes design proposals based on past experience, and these proposals are then tested for compliance with certain target specifications. Testing for compliance is performed first by computer simulation using what is called a discipline model. Such a model can be implemented by a finite element analysis, multibody systems approach, etc…

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: 15 pages, 16 figures

  32. arXiv:2206.02006  [pdf, other]

    cs.LG

    Combinatorial optimization for low bit-width neural networks

    Authors: Han Zhou, Aida Ashrafi, Matthew B. Blaschko

    Abstract: Low-bit width neural networks have been extensively explored for deployment on edge devices to reduce computational resources. Existing approaches have focused on gradient-based optimization in a two-stage train-and-compress setting or as a combined optimization where gradients are quantized during training. Such schemes require high-performance hardware during the training phase and usually store…

    Submitted 4 June, 2022; originally announced June 2022.

  33. Metrics reloaded: Recommendations for image analysis validation

    Authors: Lena Maier-Hein, Annika Reinke, Patrick Godau, Minu D. Tizabi, Florian Buettner, Evangelia Christodoulou, Ben Glocker, Fabian Isensee, Jens Kleesiek, Michal Kozubek, Mauricio Reyes, Michael A. Riegler, Manuel Wiesenfarth, A. Emre Kavur, Carole H. Sudre, Michael Baumgartner, Matthias Eisenmann, Doreen Heckmann-Nötzel, Tim Rädsch, Laura Acion, Michela Antonelli, Tal Arbel, Spyridon Bakas, Arriel Benis, Matthew Blaschko, et al. (49 additional authors not shown)

    Abstract: Increasing evidence shows that flaws in machine learning (ML) algorithm validation are an underestimated global problem. Particularly in automatic biomedical image analysis, chosen performance metrics often do not reflect the domain interest, thus failing to adequately measure scientific progress and hindering translation of ML techniques into practice. To overcome this, our large international ex…

    Submitted 23 February, 2024; v1 submitted 3 June, 2022; originally announced June 2022.

    Comments: Shared first authors: Lena Maier-Hein, Annika Reinke. arXiv admin note: substantial text overlap with arXiv:2104.05642. Published in Nature Methods

    Journal ref: Nature methods, 1-18 (2024)

  34. arXiv:2112.12560  [pdf, other]

    eess.IV cs.CV

    On the relationship between calibrated predictors and unbiased volume estimation

    Authors: Teodora Popordanoska, Jeroen Bertels, Dirk Vandermeulen, Frederik Maes, Matthew B. Blaschko

    Abstract: Machine learning driven medical image segmentation has become standard in medical image analysis. However, deep learning models are prone to overconfident predictions. This has led to a renewed focus on calibrated predictions in the medical imaging and broader machine learning communities. Calibrated predictions are estimates of the probability of a label that correspond to the true expected value…

    Submitted 23 December, 2021; originally announced December 2021.

    Comments: Published at MICCAI 2021

  35. arXiv:2112.05419  [pdf, other]

    cs.AI cs.CL cs.CV cs.HC cs.LG

    Predicting Physical World Destinations for Commands Given to Self-Driving Cars

    Authors: Dusan Grujicic, Thierry Deruyttere, Marie-Francine Moens, Matthew Blaschko

    Abstract: In recent years, we have seen significant steps taken in the development of self-driving cars. Multiple companies are starting to roll out impressive systems that work in a variety of settings. These systems can sometimes give the impression that full self-driving is just around the corner and that we would soon build cars without even a steering wheel. The increase in the level of autonomy and co…

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: Accepted at AAAI 2022. First two authors have contributed equally. Extended camera-ready version including the appendix and references to it in the main text

  36. arXiv:2112.00845  [pdf, other]

    cs.LG cs.CR

    Improving Differentially Private SGD via Randomly Sparsified Gradients

    Authors: Junyi Zhu, Matthew B. Blaschko

    Abstract: Differentially private stochastic gradient descent (DP-SGD) has been widely adopted in deep learning to provide rigorously defined privacy, which requires gradient clipping to bound the maximum norm of individual gradients and additive isotropic Gaussian noise. With analysis of the convergence rate of DP-SGD in a non-convex setting, we identify that randomly sparsifying gradients before clipping a…

    Submitted 28 June, 2023; v1 submitted 1 December, 2021; originally announced December 2021.

    Journal ref: Transactions on Machine Learning Research (06/2023)

  37. arXiv:2106.03793  [pdf]

    eess.IV cs.CV

    Pointwise visual field estimation from optical coherence tomography in glaucoma: a structure-function analysis using deep learning

    Authors: Ruben Hemelings, Bart Elen, João Barbosa Breda, Erwin Bellon, Matthew B Blaschko, Patrick De Boever, Ingeborg Stalmans

    Abstract: Background/Aims: Standard Automated Perimetry (SAP) is the gold standard to monitor visual field (VF) loss in glaucoma management, but is prone to intra-subject variability. We developed and validated a deep learning (DL) regression model that estimates pointwise and overall VF loss from unsegmented optical coherence tomography (OCT) scans. Methods: Eight DL regression models were trained with var…

    Submitted 7 June, 2021; originally announced June 2021.

  38. arXiv:2105.14275  [pdf, other]

    cs.LG cs.CV

    Greedy Bayesian Posterior Approximation with Deep Ensembles

    Authors: Aleksei Tiulpin, Matthew B. Blaschko

    Abstract: Ensembles of independently trained neural networks are a state-of-the-art approach to estimate predictive uncertainty in Deep Learning, and can be interpreted as an approximation of the posterior distribution via a mixture of delta functions. The training of ensembles relies on non-convexity of the loss landscape and random initialization of their individual members, making the resulting posterior…

    Submitted 8 July, 2022; v1 submitted 29 May, 2021; originally announced May 2021.

    Comments: Published in the Transactions on Machine Learning Research: https://openreview.net/forum?id=P1DuPJzVTN

    Journal ref: Transactions on Machine Learning Research, 2022

  39. arXiv:2105.04290  [pdf, other]

    stat.ML cs.LG

    Meta-Cal: Well-controlled Post-hoc Calibration by Ranking

    Authors: Xingchen Ma, Matthew B. Blaschko

    Abstract: In many applications, it is desirable that a classifier not only makes accurate predictions, but also outputs calibrated posterior probabilities. However, many existing classifiers, especially deep neural network classifiers, tend to be uncalibrated. Post-hoc calibration is a technique to recalibrate a model by learning a calibration map. Existing approaches mostly focus on constructing calibratio…

    Submitted 23 June, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

  40. arXiv:2104.05642  [pdf, other]

    eess.IV cs.CV

    Common Limitations of Image Processing Metrics: A Picture Story

    Authors: Annika Reinke, Minu D. Tizabi, Carole H. Sudre, Matthias Eisenmann, Tim Rädsch, Michael Baumgartner, Laura Acion, Michela Antonelli, Tal Arbel, Spyridon Bakas, Peter Bankhead, Arriel Benis, Matthew Blaschko, Florian Buettner, M. Jorge Cardoso, Jianxu Chen, Veronika Cheplygina, Evangelia Christodoulou, Beth Cimini, Gary S. Collins, Sandy Engelhardt, Keyvan Farahani, Luciana Ferrer, Adrian Galdran, Bram van Ginneken, et al. (68 additional authors not shown)

    Abstract: While the importance of automatic image analysis is continuously increasing, recent meta-research revealed major flaws with respect to algorithm validation. Performance metrics are particularly key for meaningful, objective, and transparent performance assessment and validation of the used automatic algorithms, but relatively little attention has been given to the practical pitfalls when using spe…

    Submitted 6 December, 2023; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: Shared first authors: Annika Reinke and Minu D. Tizabi. This is a dynamic paper on limitations of commonly used metrics. It discusses metrics for image-level classification, semantic and instance segmentation, and object detection. For missing use cases, comments or questions, please contact [email protected]. Substantial contributions to this document will be acknowledged with a co-authorship

  41. arXiv:2104.03642  [pdf, other]

    cs.LG cs.CV

    CLIMAT: Clinically-Inspired Multi-Agent Transformers for Knee Osteoarthritis Trajectory Forecasting

    Authors: Huy Hoang Nguyen, Simo Saarakkala, Matthew B. Blaschko, Aleksei Tiulpin

    Abstract: In medical applications, deep learning methods are built to automate diagnostic tasks. However, a clinically relevant question that practitioners usually face, is how to predict the future trajectory of a disease (prognosis). Current methods for such a problem often require domain knowledge, and are complicated to apply. In this paper, we formulate the prognosis prediction problem as a one-to-many…

    Submitted 6 December, 2021; v1 submitted 8 April, 2021; originally announced April 2021.

    Comments: 10 pages

  42. Deep learning on fundus images detects glaucoma beyond the optic disc

    Authors: Ruben Hemelings, Bart Elen, João Barbosa-Breda, Matthew B. Blaschko, Patrick De Boever, Ingeborg Stalmans

    Abstract: Although unprecedented sensitivity and specificity values are reported, recent glaucoma detection deep learning models lack in decision transparency. Here, we propose a methodology that advances explainable deep learning in the field of glaucoma detection and vertical cup-disc ratio (VCDR), an important risk factor. We trained and evaluated deep learning models using fundus images that underwent a…

    Submitted 5 October, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

  43. arXiv:2010.13499  [pdf, other]

    eess.IV cs.CV cs.LG

    Optimization for Medical Image Segmentation: Theory and Practice when evaluating with Dice Score or Jaccard Index

    Authors: Tom Eelbode, Jeroen Bertels, Maxim Berman, Dirk Vandermeulen, Frederik Maes, Raf Bisschops, Matthew B. Blaschko

    Abstract: In many medical imaging and classical computer vision tasks, the Dice score and Jaccard index are used to evaluate the segmentation performance. Despite the existence and great empirical success of metric-sensitive losses, i.e. relaxations of these metrics such as soft Dice, soft Jaccard and Lovasz-Softmax, many researchers still use per-pixel losses, such as (weighted) cross-entropy to train CNNs…

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: 15 pages, 14 figures, accepted for publication in IEEE Transactions on Medical Imaging (2020)

  44. arXiv:2010.07733  [pdf, other]

    cs.LG cs.AI

    R-GAP: Recursive Gradient Attack on Privacy

    Authors: Junyi Zhu, Matthew Blaschko

    Abstract: Federated learning frameworks have been regarded as a promising approach to break the dilemma between demands on privacy and the promise of learning from large collections of distributed data. Many such frameworks only ask collaborators to share their local update of a common model, i.e. gradients with respect to locally stored data, instead of exposing their raw data to other collaborators. Howev…

    Submitted 16 March, 2021; v1 submitted 15 October, 2020; originally announced October 2020.

  45. Additive Tree-Structured Conditional Parameter Spaces in Bayesian Optimization: A Novel Covariance Function and a Fast Implementation

    Authors: Xingchen Ma, Matthew B. Blaschko

    Abstract: Bayesian optimization (BO) is a sample-efficient global optimization algorithm for black-box functions which are expensive to evaluate. Existing literature on model based optimization in conditional parameter spaces are usually built on trees. In this work, we generalize the additive assumption to tree-structured functions and propose an additive tree-structured covariance function, showing improv…

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: Code available at https://github.com/maxc01/addtree. arXiv admin note: substantial text overlap with arXiv:2006.11771

  46. arXiv:2009.08792  [pdf, other]

    cs.CV cs.AI

    Commands 4 Autonomous Vehicles (C4AV) Workshop Summary

    Authors: Thierry Deruyttere, Simon Vandenhende, Dusan Grujicic, Yu Liu, Luc Van Gool, Matthew Blaschko, Tinne Tuytelaars, Marie-Francine Moens

    Abstract: The task of visual grounding requires locating the most relevant region or object in an image, given a natural language query. So far, progress on this task was mostly measured on curated datasets, which are not always representative of human spoken language. In this work, we deviate from recent, popular task settings and consider the problem under an autonomous vehicle scenario. In particular, we…

    Submitted 18 September, 2020; originally announced September 2020.

  47. arXiv:2006.11771  [pdf, other]

    stat.ML cs.LG

    Additive Tree-Structured Covariance Function for Conditional Parameter Spaces in Bayesian Optimization

    Authors: Xingchen Ma, Matthew B. Blaschko

    Abstract: Bayesian optimization (BO) is a sample-efficient global optimization algorithm for black-box functions which are expensive to evaluate. Existing literature on model based optimization in conditional parameter spaces are usually built on trees. In this work, we generalize the additive assumption to tree-structured functions and propose an additive tree-structured covariance function, showing improv…

    Submitted 21 June, 2020; originally announced June 2020.

    Comments: AISTATS2020. Code link: https://github.com/maxc01/addtree

  48. arXiv:2006.02813  [pdf]

    eess.IV cs.CV

    Pathological myopia classification with simultaneous lesion segmentation using deep learning

    Authors: Ruben Hemelings, Bart Elen, Matthew B. Blaschko, Julie Jacob, Ingeborg Stalmans, Patrick De Boever

    Abstract: This investigation reports on the results of convolutional neural networks developed for the recently introduced PathologicAL Myopia (PALM) dataset, which consists of 1200 fundus images. We propose a new Optic Nerve Head (ONH)-based prediction enhancement for the segmentation of atrophy and fovea. Models trained with 400 available training images achieved an AUC of 0.9867 for pathological myopia c…

    Submitted 4 June, 2020; originally announced June 2020.

    Comments: 18 pages, 2 figures, preprint to journal

  49. arXiv:2005.10481  [pdf, other]

    cs.CV cs.LG

    AOWS: Adaptive and optimal network width search with latency constraints

    Authors: Maxim Berman, Leonid Pishchulin, Ning Xu, Matthew B. Blaschko, Gerard Medioni

    Abstract: Neural architecture search (NAS) approaches aim at automatically finding novel CNN architectures that fit computational constraints while maintaining a good performance on the target platform. We introduce a novel efficient one-shot NAS approach to optimally search for channel numbers, given latency constraints on a specific hardware. We first show that we can use a black-box approach to estimate…

    Submitted 21 May, 2020; originally announced May 2020.

    Comments: Accepted to CVPR 2020 (oral)

  50. arXiv:2003.01944  [pdf, other]

    eess.IV cs.CV cs.LG

    Semixup: In- and Out-of-Manifold Regularization for Deep Semi-Supervised Knee Osteoarthritis Severity Grading from Plain Radiographs

    Authors: Huy Hoang Nguyen, Simo Saarakkala, Matthew Blaschko, Aleksei Tiulpin

    Abstract: Knee osteoarthritis (OA) is one of the highest disability factors in the world. This musculoskeletal disorder is assessed from clinical symptoms, and typically confirmed via radiographic assessment. This visual assessment done by a radiologist requires experience, and suffers from moderate to high inter-observer variability. The recent literature has shown that deep learning methods can reliably p…

    Submitted 12 August, 2020; v1 submitted 4 March, 2020; originally announced March 2020.

    Comments: 11 main, 03 supplementary pages. The manuscript was accepted to IEEE Transactions on Medical Imaging in August 2020