Skip to main content

Showing 1–12 of 12 results for author: Blaas, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10459  [pdf, other

    cs.CL

    CancerLLM: A Large Language Model in Cancer Domain

    Authors: Mingchen Li, Anne Blaes, Steven Johnson, Hongfang Liu, Hua Xu, Rui Zhang

    Abstract: Medical Large Language Models (LLMs) such as ClinicalCamel 70B, Llama3-OpenBioLLM 70B have demonstrated impressive performance on a wide variety of medical NLP task.However, there still lacks a large language model (LLM) specifically designed for cancer domain. Moreover, these LLMs typically have billions of parameters, making them computationally expensive for healthcare systems.Thus, in this stu… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  2. arXiv:2310.13040  [pdf, other

    cs.LG cs.AI cs.CV

    Robust multimodal models have outlier features and encode more concepts

    Authors: Jonathan Crabbé, Pau Rodríguez, Vaishaal Shankar, Luca Zappella, Arno Blaas

    Abstract: What distinguishes robust models from non-robust ones? This question has gained traction with the appearance of large-scale multimodal models, such as CLIP. These models have demonstrated unprecedented robustness with respect to natural distribution shifts. While it has been shown that such differences in robustness can be traced back to differences in training data, so far it is not known what th… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 29 pages, 18 figures

  3. arXiv:2307.10907  [pdf, other

    cs.LG

    The Role of Entropy and Reconstruction in Multi-View Self-Supervised Learning

    Authors: Borja Rodríguez-Gálvez, Arno Blaas, Pau Rodríguez, Adam Goliński, Xavier Suau, Jason Ramapuram, Dan Busbridge, Luca Zappella

    Abstract: The mechanisms behind the success of multi-view self-supervised learning (MVSSL) are not yet fully understood. Contrastive MVSSL methods have been studied through the lens of InfoNCE, a lower bound of the Mutual Information (MI). However, the relation between other MVSSL methods and MI remains unclear. We consider a different lower bound on the MI consisting of an entropy and a reconstruction term… ▽ More

    Submitted 9 December, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: 18 pages: 9 of main text, 2 of references, and 7 of supplementary material [Updated typo in page 6 (Section 3.2)]. Appears in the proceedings of ICML 2023

  4. arXiv:2306.16058  [pdf, other

    cs.LG cs.AI

    DUET: 2D Structured and Approximately Equivariant Representations

    Authors: Xavier Suau, Federico Danieli, T. Anderson Keller, Arno Blaas, Chen Huang, Jason Ramapuram, Dan Busbridge, Luca Zappella

    Abstract: Multiview Self-Supervised Learning (MSSL) is based on learning invariances with respect to a set of input transformations. However, invariance partially or totally removes transformation-related information from the representations, which might harm performance for specific downstream tasks that require such information. We propose 2D strUctured and EquivarianT representations (coined DUET), which… ▽ More

    Submitted 17 November, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: Accepted at ICML 2023

  5. arXiv:2303.08448  [pdf

    cs.CL cs.IR cs.LG

    A Cross-institutional Evaluation on Breast Cancer Phenoty** NLP Algorithms on Electronic Health Records

    Authors: Sicheng Zhou, Nan Wang, Liwei Wang, Ju Sun, Anne Blaes, Hongfang Liu, Rui Zhang

    Abstract: Objective: The generalizability of clinical large language models is usually ignored during the model development process. This study evaluated the generalizability of BERT-based clinical NLP models across different clinical settings through a breast cancer phenotype extraction task. Materials and Methods: Two clinical corpora of breast cancer patients were collected from the electronic health r… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: 16 pages, 3 figures

  6. arXiv:2201.13036  [pdf

    stat.AP cs.LG

    Predicting Cancer Treatments Induced Cardiotoxicity of Breast Cancer Patients

    Authors: Sicheng Zhou, Rui Zhang, Anne Blaes, Chetan Shenoy, Gyorgy Simon

    Abstract: Cardiotoxicity induced by the breast cancer treatments (i.e., chemotherapy, targeted therapy and radiation therapy) is a significant problem for breast cancer patients. The cardiotoxicity risk for breast cancer patients receiving different treatments remains unclear. We developed and evaluated risk predictive models for cardiotoxicity in breast cancer patients using EHR data. The AUC scores to pre… ▽ More

    Submitted 31 January, 2022; originally announced January 2022.

    Comments: 6 pages

  7. arXiv:2111.12427  [pdf, other

    cs.LG cs.CV

    Challenges of Adversarial Image Augmentations

    Authors: Arno Blaas, Xavier Suau, Jason Ramapuram, Nicholas Apostoloff, Luca Zappella

    Abstract: Image augmentations applied during training are crucial for the generalization performance of image classifiers. Therefore, a large body of research has focused on finding the optimal augmentation policy for a given task. Yet, RandAugment [2], a simple random augmentation policy, has recently been shown to outperform existing sophisticated policies. Only Adversarial AutoAugment (AdvAA) [11], an ap… ▽ More

    Submitted 3 December, 2021; v1 submitted 24 November, 2021; originally announced November 2021.

    Comments: To appear at the ICBINB 2021 Neurips Workshop

  8. arXiv:2111.02842  [pdf, other

    stat.ML cs.AI cs.CR cs.LG

    Adversarial Attacks on Graph Classification via Bayesian Optimisation

    Authors: Xingchen Wan, Henry Kenlay, Binxin Ru, Arno Blaas, Michael A. Osborne, Xiaowen Dong

    Abstract: Graph neural networks, a popular class of models effective in a wide range of graph-based learning tasks, have been shown to be vulnerable to adversarial attacks. While the majority of the literature focuses on such vulnerability in node-level classification tasks, little effort has been dedicated to analysing adversarial attacks on graph-level classification, an important problem with numerous re… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021. 11 pages, 8 figures, 2 tables (24 pages, 17 figures, 8 tables including references and appendices)

  9. arXiv:2106.09777  [pdf, other

    cs.LG stat.ML

    On Invariance Penalties for Risk Minimization

    Authors: Kia Khezeli, Arno Blaas, Frank Soboczenski, Nicholas Chia, John Kalantari

    Abstract: The Invariant Risk Minimization (IRM) principle was first proposed by Arjovsky et al. [2019] to address the domain generalization problem by leveraging data heterogeneity from differing experimental conditions. Specifically, IRM seeks to find a data representation under which an optimal classifier remains invariant across all domains. Despite the conceptual appeal of IRM, the effectiveness of the… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

  10. arXiv:2104.03180  [pdf, other

    cs.LG stat.ML

    Adversarial Robustness Guarantees for Gaussian Processes

    Authors: Andrea Patane, Arno Blaas, Luca Laurenti, Luca Cardelli, Stephen Roberts, Marta Kwiatkowska

    Abstract: Gaussian processes (GPs) enable principled computation of model uncertainty, making them attractive for safety-critical applications. Such scenarios demand that GP decisions are not only accurate, but also robust to perturbations. In this paper we present a framework to analyse adversarial robustness of GPs, defined as invariance of the model's decision to bounded perturbations. Given a compact su… ▽ More

    Submitted 7 April, 2021; originally announced April 2021.

    Comments: Submitted for publication

  11. arXiv:2101.02689  [pdf, ps, other

    stat.ML cs.LG

    The Effect of Prior Lipschitz Continuity on the Adversarial Robustness of Bayesian Neural Networks

    Authors: Arno Blaas, Stephen J. Roberts

    Abstract: It is desirable, and often a necessity, for machine learning models to be robust against adversarial attacks. This is particularly true for Bayesian models, as they are well-suited for safety-critical applications, in which adversarial attacks can have catastrophic outcomes. In this work, we take a deeper look at the adversarial robustness of Bayesian Neural Networks (BNNs). In particular, we cons… ▽ More

    Submitted 7 January, 2021; originally announced January 2021.

    Comments: 4 pages, 2 tables, AAAI 2021 Workshop Towards Robust, Secure and Efficient Machine Learning

  12. arXiv:1905.11876  [pdf, other

    stat.ML cs.LG

    Adversarial Robustness Guarantees for Classification with Gaussian Processes

    Authors: Arno Blaas, Andrea Patane, Luca Laurenti, Luca Cardelli, Marta Kwiatkowska, Stephen Roberts

    Abstract: We investigate adversarial robustness of Gaussian Process Classification (GPC) models. Given a compact subset of the input space $T\subseteq \mathbb{R}^d$ enclosing a test point $x^*$ and a GPC trained on a dataset $\mathcal{D}$, we aim to compute the minimum and the maximum classification probability for the GPC over all the points in $T$. In order to do so, we show how functions lower- and upper… ▽ More

    Submitted 11 March, 2020; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: 10 pages, 6 figures + Supplementary Material