Showing 1–14 of 14 results for author: Janik, A

Searching in archive cs.
  1. arXiv:2406.13433

    cs.LG cs.AI

    Certificates of Differential Privacy and Unlearning for Gradient-Based Training

    Authors: Matthew Wicker, Philip Sosnin, Adrianna Janik, Mark N. Müller, Adrian Weller, Calvin Tsay

    Abstract: Proper data stewardship requires that model owners protect the privacy of individuals' data used during training. Whether through anonymization with differential privacy or the use of unlearning in non-anonymized settings, the gold-standard techniques for providing privacy guarantees can come with significant performance penalties or be too weak to provide practical assurances. In part, this is du…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 15 pages, 14 figures

  2. arXiv:2311.06112

    physics.flu-dyn cs.LG nlin.CD

    Turbulence Scaling from Deep Learning Diffusion Generative Models

    Authors: Tim Whittaker, Romuald A. Janik, Yaron Oz

    Abstract: Complex spatial and temporal structures are inherent characteristics of turbulent fluid flows and comprehending them poses a major challenge. This comprehension necessitates an understanding of the space of turbulent fluid flow configurations. We employ a diffusion-based generative model to learn the distribution of turbulent vorticity profiles and generate snapshots of turbulent solutions to the i…

    Submitted 10 November, 2023; originally announced November 2023.

  3. arXiv:2311.03839

    cs.CL cs.AI cs.LG q-bio.NC

    Aspects of human memory and Large Language Models

    Authors: Romuald A. Janik

    Abstract: Large Language Models (LLMs) are huge artificial neural networks which primarily serve to generate text, but also provide a very sophisticated probabilistic model of language use. Since generating a semantically consistent text requires a form of effective memory, we investigate the memory properties of LLMs and find surprising similarities with key characteristics of human memory. We argue that t…

    Submitted 8 April, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: 13+3 pages; v2: abstract expanded and future research directions added; v3: minor clarifications added

  4. arXiv:2212.02651

    cs.LG

    Explaining Link Predictions in Knowledge Graph Embedding Models with Influential Examples

    Authors: Adrianna Janik, Luca Costabello

    Abstract: We study the problem of explaining link predictions in Knowledge Graph Embedding (KGE) models. We propose an example-based approach that exploits the latent space representation of nodes and edges in a knowledge graph to explain predictions. We evaluated the importance of identified triples by observing the progressive degradation of model performance upon removal of influential triples. Our experime…

    Submitted 5 December, 2022; originally announced December 2022.

  5. arXiv:2211.15382

    cs.LG hep-th nlin.CD physics.flu-dyn

    Neural Network Complexity of Chaos and Turbulence

    Authors: Tim Whittaker, Romuald A. Janik, Yaron Oz

    Abstract: Chaos and turbulence are complex physical phenomena, yet a precise definition of the complexity measure that quantifies them is still lacking. In this work we consider the relative complexity of chaos and turbulence from the perspective of deep neural networks. We analyze a set of classification problems, where the network has to distinguish images of fluid profiles in the turbulent regime from ot…

    Submitted 20 July, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

    Journal ref: Eur. Phys. J. E 46, 57 (2023)

  6. arXiv:2211.09856

    cs.LG q-bio.QM

    Machine Learning-Assisted Recurrence Prediction for Early-Stage Non-Small-Cell Lung Cancer Patients

    Authors: Adrianna Janik, Maria Torrente, Luca Costabello, Virginia Calvo, Brian Walsh, Carlos Camps, Sameh K. Mohamed, Ana L. Ortega, Vít Nováček, Bartomeu Massutí, Pasquale Minervini, M. Rosario Garcia Campelo, Edel del Barco, Joaquim Bosch-Barrera, Ernestina Menasalvas, Mohan Timilsina, Mariano Provencio

    Abstract: Background: Stratifying cancer patients according to risk of relapse can personalize their care. In this work, we provide an answer to the following research question: How to utilize machine learning to estimate probability of relapse in early-stage non-small-cell lung cancer patients? Methods: For predicting relapse in 1,387 early-stage (I-II), non-small-cell lung cancer (NSCLC) patients from t…

    Submitted 17 November, 2022; originally announced November 2022.

  7. arXiv:2202.04766

    cs.CV

    Sampling Strategy for Fine-Tuning Segmentation Models to Crisis Area under Scarcity of Data

    Authors: Adrianna Janik, Kris Sankaran

    Abstract: The use of remote sensing in humanitarian crisis response missions is well-established and has proven relevant repeatedly. One of the problems is obtaining gold annotations, as it is costly and time-consuming, which makes it almost impossible to fine-tune models to new regions affected by the crisis. Where time is critical, resources are limited and the environment is constantly changing, models have to…

    Submitted 9 February, 2022; originally announced February 2022.

  8. arXiv:2202.04753

    cs.LG cs.CV

    Discovering Concepts in Learned Representations using Statistical Inference and Interactive Visualization

    Authors: Adrianna Janik, Kris Sankaran

    Abstract: Concept discovery is one of the open problems in the interpretability literature that is important for bridging the gap between non-deep learning experts and model end-users. Among current formulations, one defines a concept as a direction in a learned representation space. This definition makes it possible to evaluate whether a particular concept significantly influences classification decisio…

    Submitted 9 February, 2022; originally announced February 2022.

    Comments: KDD'19, Workshop Explainable AI/ML (XAI) for Accountability, Fairness, and Transparency, August 04-08, 2019, Anchorage, AK, USA

  9. arXiv:2109.08103

    cs.CV eess.IV q-bio.NC

    Aesthetics and neural network image representations

    Authors: Romuald A. Janik

    Abstract: We analyze the spaces of images encoded by generative neural networks of the BigGAN architecture. We find that generic multiplicative perturbations of neural network parameters away from the photo-realistic point often lead to networks generating images which appear as "artistic renditions" of the corresponding objects. This demonstrates an emergence of aesthetic properties directly from the struc…

    Submitted 12 April, 2023; v1 submitted 16 September, 2021; originally announced September 2021.

    Comments: 11 pages, 6 figures; v2: expanded discussion, appendix with 2 figures added

  10. Interpretability of a Deep Learning Model in the Application of Cardiac MRI Segmentation with an ACDC Challenge Dataset

    Authors: Adrianna Janik, Jonathan Dodd, Georgiana Ifrim, Kris Sankaran, Kathleen Curran

    Abstract: Cardiac Magnetic Resonance (CMR) is the most effective tool for the assessment and diagnosis of heart conditions, the world's leading cause of death. Software tools leveraging Artificial Intelligence already assist radiologists and cardiologists in heart condition assessment, but their lack of transparency is a problem. This project investigates whether it is possible to discover…

    Submitted 15 March, 2021; originally announced March 2021.

  11. arXiv:2006.12195

    cs.LG cs.CV stat.ML

    Neural networks adapting to datasets: learning network size and topology

    Authors: Romuald A. Janik, Aleksandra Nowak

    Abstract: We introduce a flexible setup allowing for a neural network to learn both its size and topology during the course of a standard gradient-based training. The resulting network has the structure of a graph tailored to the particular learning task and dataset. The obtained networks can also be trained from scratch and achieve virtually identical performance. We explore the properties of the network a…

    Submitted 15 July, 2020; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: Fixed blank page

  12. arXiv:2006.04791

    cs.LG cond-mat.dis-nn cs.NE hep-th stat.ML

    Complexity for deep neural networks and other characteristics of deep feature representations

    Authors: Romuald A. Janik, Przemek Witaszczyk

    Abstract: We define a notion of complexity, which quantifies the nonlinearity of the computation of a neural network, as well as a complementary measure of the effective dimension of feature representations. We investigate these observables both for trained networks for various datasets as well as explore their dynamics during training, uncovering in particular power law scaling. These observables can be un…

    Submitted 17 March, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: Significant extension including developments in neuroscience context and more. 36 pages

  13. arXiv:2002.08104

    cs.LG cs.CV q-bio.NC stat.ML

    Analyzing Neural Networks Based on Random Graphs

    Authors: Romuald A. Janik, Aleksandra Nowak

    Abstract: We perform a massive evaluation of neural networks with architectures corresponding to random graphs of various types. We investigate various structural and numerical properties of the graphs in relation to neural network test accuracy. We find that none of the classical numerical graph invariants by itself allows one to single out the best networks. Consequently, we introduce a new numerical graph ch…

    Submitted 2 December, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: Added new results and discussion

  14. arXiv:1909.10831

    cond-mat.stat-mech cs.LG hep-lat q-bio.NC stat.ML

    Entropy from Machine Learning

    Authors: Romuald A. Janik

    Abstract: We translate the problem of calculating the entropy of a set of binary configurations/signals into a sequence of supervised classification tasks. Subsequently, one can use virtually any machine learning classification algorithm for computing entropy. This procedure can be used to compute entropy, and consequently the free energy directly from a set of Monte Carlo configurations at a given temperat…

    Submitted 24 October, 2019; v1 submitted 24 September, 2019; originally announced September 2019.

    Comments: 10 pages, 2 figures; v2: reference added, minor notational improvement; v3: reference added, general comments in section 3