Skip to main content

Showing 1–12 of 12 results for author: Kläser, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.14986  [pdf, other

    cs.LG cs.AI

    $\texttt{MiniMol}$: A Parameter-Efficient Foundation Model for Molecular Learning

    Authors: Kerstin Kläser, Błażej Banaszewski, Samuel Maddrell-Mander, Callum McLean, Luis Müller, Ali Parviz, Shenyang Huang, Andrew Fitzgibbon

    Abstract: In biological tasks, data is rarely plentiful as it is generated from hard-to-gather measurements. Therefore, pre-training foundation models on large quantities of available data and then transfer to low-data downstream tasks is a promising direction. However, how to design effective foundation models for molecular learning remains an open question, with existing approaches typically focusing on m… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  2. arXiv:2311.01135  [pdf, other

    cs.LG physics.chem-ph

    Generating QM1B with PySCF$_{\text{IPU}}$

    Authors: Alexander Mathiasen, Hatem Helal, Kerstin Klaser, Paul Balanca, Josef Dean, Carlo Luschi, Dominique Beaini, Andrew Fitzgibbon, Dominic Masters

    Abstract: The emergence of foundation models in Computer Vision and Natural Language Processing have resulted in immense progress on downstream tasks. This progress was enabled by datasets with billions of training examples. Similar benefits are yet to be unlocked for quantum chemistry, where the potential of deep learning is constrained by comparatively small datasets with 100k to 20M training examples. Th… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: 15 pages, 7 figures. NeurIPS 2023 Track Datasets and Benchmarks

    ACM Class: I.2.6; J.2

  3. arXiv:2310.04292  [pdf, other

    cs.LG

    Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets

    Authors: Dominique Beaini, Shenyang Huang, Joao Alex Cunha, Zhiyi Li, Gabriela Moisescu-Pareja, Oleksandr Dymov, Samuel Maddrell-Mander, Callum McLean, Frederik Wenkel, Luis Müller, Jama Hussein Mohamud, Ali Parviz, Michael Craig, Michał Koziarski, Jiarui Lu, Zhaocheng Zhu, Cristian Gabellini, Kerstin Klaser, Josef Dean, Cas Wognum, Maciej Sypetkowski, Guillaume Rabusseau, Reihaneh Rabbany, Jian Tang, Christopher Morris , et al. (10 additional authors not shown)

    Abstract: Recently, pre-trained foundation models have enabled significant advancements in multiple fields. In molecular machine learning, however, where datasets are often hand-curated, and hence typically small, the lack of datasets with labeled features, and codebases to manage those datasets, has hindered the development of foundation models. In this work, we present seven novel datasets categorized by… ▽ More

    Submitted 18 October, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

  4. Digital analysis of early color photographs taken using regular color screen processes

    Authors: Jan Hubička, Linda Kimrová, Kenzie Klaeser, Sara Manco, Doug Peterson

    Abstract: Some early color photographic processes based on special color screen filters pose specific challenges in their digitization and digital presentation. Those challenges include dynamic range, resolution, and the difficulty of stitching geometrically-repeating patterns. We describe a novel method used to digitize the collection of early color photographs at the National Geographic Society which make… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: 8 pages, 4 figures, submitted to the proceedings of XVIII Color Conference

    ACM Class: I.4.1; I.4.5; I.4.8

    Journal ref: Color and Colorimetry. Multidisciplinary Contributions. Vol. XVIII A, 2023, 241-248

  5. arXiv:2302.02947  [pdf, other

    cs.LG

    GPS++: Reviving the Art of Message Passing for Molecular Property Prediction

    Authors: Dominic Masters, Josef Dean, Kerstin Klaser, Zhiyi Li, Sam Maddrell-Mander, Adam Sanders, Hatem Helal, Deniz Beker, Andrew Fitzgibbon, Shenyang Huang, Ladislav Rampášek, Dominique Beaini

    Abstract: We present GPS++, a hybrid Message Passing Neural Network / Graph Transformer model for molecular property prediction. Our model integrates a well-tuned local message passing component and biased global attention with other key ideas from prior literature to achieve state-of-the-art results on large-scale molecular dataset PCQM4Mv2. Through a thorough ablation study we highlight the impact of indi… ▽ More

    Submitted 12 May, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: text overlap with arXiv:2212.02229

  6. arXiv:2212.02229  [pdf, other

    q-bio.QM cs.LG

    GPS++: An Optimised Hybrid MPNN/Transformer for Molecular Property Prediction

    Authors: Dominic Masters, Josef Dean, Kerstin Klaser, Zhiyi Li, Sam Maddrell-Mander, Adam Sanders, Hatem Helal, Deniz Beker, Ladislav Rampášek, Dominique Beaini

    Abstract: This technical report presents GPS++, the first-place solution to the Open Graph Benchmark Large-Scale Challenge (OGB-LSC 2022) for the PCQM4Mv2 molecular property prediction task. Our approach implements several key principles from the prior literature. At its core our GPS++ method is a hybrid MPNN/Transformer model that incorporates 3D atom positions and an auxiliary denoising task. The effectiv… ▽ More

    Submitted 6 December, 2022; v1 submitted 18 November, 2022; originally announced December 2022.

  7. arXiv:2111.04094  [pdf, other

    eess.IV cs.CV cs.LG physics.med-ph

    Acquisition-invariant brain MRI segmentation with informative uncertainties

    Authors: Pedro Borges, Richard Shaw, Thomas Varsavsky, Kerstin Klaser, David Thomas, Ivana Drobnjak, Sebastien Ourselin, M Jorge Cardoso

    Abstract: Combining multi-site data can strengthen and uncover trends, but is a task that is marred by the influence of site-specific covariates that can bias the data and therefore any downstream analyses. Post-hoc multi-site correction methods exist but have strong assumptions that often do not hold in real-world scenarios. Algorithms should be designed in a way that can account for site-specific effects,… ▽ More

    Submitted 7 November, 2021; originally announced November 2021.

    Comments: 25 pages, 8 figures

  8. arXiv:2111.02771  [pdf, other

    eess.IV cs.CV cs.LG physics.med-ph

    The role of MRI physics in brain segmentation CNNs: achieving acquisition invariance and instructive uncertainties

    Authors: Pedro Borges, Richard Shaw, Thomas Varsavsky, Kerstin Klaser, David Thomas, Ivana Drobnjak, Sebastien Ourselin, M Jorge Cardoso

    Abstract: Being able to adequately process and combine data arising from different sites is crucial in neuroimaging, but is difficult, owing to site, sequence and acquisition-parameter dependent biases. It is important therefore to design algorithms that are not only robust to images of differing contrasts, but also be able to generalise well to unseen ones, with a quantifiable measure of uncertainty. In th… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

    Comments: 10 pages, 3 figures, published in: Simulation and Synthesis in Medical Imaging 6th International Workshop, SASHIMI 2021, Held in Conjunction with MICCAI 2021, Strasbourg, France, September 27, 2021, Proceedings

  9. arXiv:2011.00867  [pdf, other

    cs.DB cs.IR

    Accessible Data Curation and Analytics for International-Scale Citizen Science Datasets

    Authors: Benjamin Murray, Eric Kerfoot, Mark S. Graham, Carole H. Sudre, Erika Molteni, Liane S. Canas, Michela Antonelli, Kerstin Klaser, Alessia Visconti, Andrew T. Chan, Paul W. Franks, Richard Davies, Jonathan Wolf, Tim Spector, Claire J. Steves, Marc Modat, Sebastien Ourselin

    Abstract: The Covid Symptom Study, a smartphone-based surveillance study on COVID-19 symptoms in the population, is an exemplar of big data citizen science. Over 4.7 million participants and 189 million unique assessments have been logged since its introduction in March 2020. The success of the Covid Symptom Study creates technical challenges around effective data curation for two reasons. Firstly, the scal… ▽ More

    Submitted 17 February, 2021; v1 submitted 2 November, 2020; originally announced November 2020.

    ACM Class: D.m; E.2; H.3.3; I.7

  10. arXiv:2004.09321  [pdf, other

    cs.CV eess.IV

    Combining multimodal information for Metal Artefact Reduction: An unsupervised deep learning framework

    Authors: Marta B. M. Ranzini, Irme Groothuis, Kerstin Kläser, M. Jorge Cardoso, Johann Henckel, Sébastien Ourselin, Alister Hart, Marc Modat

    Abstract: Metal artefact reduction (MAR) techniques aim at removing metal-induced noise from clinical images. In Computed Tomography (CT), supervised deep learning approaches have been shown effective but limited in generalisability, as they mostly rely on synthetic data. In Magnetic Resonance Imaging (MRI) instead, no method has yet been introduced to correct the susceptibility artefact, still present even… ▽ More

    Submitted 20 April, 2020; originally announced April 2020.

    Comments: Accepted at IEEE International Symposium on Biomedical Imaging (ISBI) 2020

  11. arXiv:1908.08431  [pdf, other

    eess.IV cs.CV cs.LG physics.med-ph

    Improved MR to CT synthesis for PET/MR attenuation correction using Imitation Learning

    Authors: Kerstin Kläser, Thomas Varsavsky, Pawel Markiewicz, Tom Vercauteren, David Atkinson, Kris Thielemans, Brian Hutton, M Jorge Cardoso, Sebastien Ourselin

    Abstract: The ability to synthesise Computed Tomography images - commonly known as pseudo CT, or pCT - from MRI input data is commonly assessed using an intensity-wise similarity, such as an L2-norm between the ground truth CT and the pCT. However, given that the ultimate purpose is often to use the pCT as an attenuation map ($μ$-map) in Positron Emission Tomography Magnetic Resonance Imaging (PET/MRI), min… ▽ More

    Submitted 27 August, 2019; v1 submitted 21 August, 2019; originally announced August 2019.

    Comments: Aceppted at SASHIMI2019

  12. arXiv:1808.07431  [pdf, other

    physics.med-ph cs.AI stat.ML

    Deep Boosted Regression for MR to CT Synthesis

    Authors: Kerstin Kläser, Pawel Markiewicz, Marta Ranzini, Wenqi Li, Marc Modat, Brian F Hutton, David Atkinson, Kris Thielemans, M Jorge Cardoso, Sebastien Ourselin

    Abstract: Attenuation correction is an essential requirement of positron emission tomography (PET) image reconstruction to allow for accurate quantification. However, attenuation correction is particularly challenging for PET-MRI as neither PET nor magnetic resonance imaging (MRI) can directly image tissue attenuation properties. MRI-based computed tomography (CT) synthesis has been proposed as an alternati… ▽ More

    Submitted 22 August, 2018; originally announced August 2018.

    Comments: Accepted at SASHIMI2018