Skip to main content

Showing 1–16 of 16 results for author: Adilova, L

.
  1. arXiv:2406.16300  [pdf, other

    cs.LG

    Landsca** Linear Mode Connectivity

    Authors: Sidak Pal Singh, Linara Adilova, Michael Kamp, Asja Fischer, Bernhard Schölkopf, Thomas Hofmann

    Abstract: The presence of linear paths in parameter space between two different network solutions in certain cases, i.e., linear mode connectivity (LMC), has garnered interest from both theoretical and practical fronts. There has been significant research that either practically designs algorithms catered for connecting networks by adjusting for the permutation symmetries as well as some others that more th… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: ICML 2024 HiLD workshop paper

  2. arXiv:2405.16918  [pdf, other

    cs.LG

    The Uncanny Valley: Exploring Adversarial Robustness from a Flatness Perspective

    Authors: Nils Philipp Walter, Linara Adilova, Jilles Vreeken, Michael Kamp

    Abstract: Flatness of the loss surface not only correlates positively with generalization but is also related to adversarial robustness, since perturbations of inputs relate non-linearly to perturbations of weights. In this paper, we empirically analyze the relation between adversarial examples and relative flatness with respect to the parameters of one layer. We observe a peculiar property of adversarial e… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  3. arXiv:2307.06966  [pdf, other

    cs.LG

    Layer-wise Linear Mode Connectivity

    Authors: Linara Adilova, Maksym Andriushchenko, Michael Kamp, Asja Fischer, Martin Jaggi

    Abstract: Averaging neural network parameters is an intuitive method for fusing the knowledge of two independent models. It is most prominently used in federated learning. If models are averaged at the end of training, this can only lead to a good performing model if the loss surface of interest is very particular, i.e., the loss in the midpoint between the two models needs to be sufficiently low. This is i… ▽ More

    Submitted 19 March, 2024; v1 submitted 13 July, 2023; originally announced July 2023.

    Comments: published at ICLR24

  4. arXiv:2307.03681  [pdf

    cs.CY cs.AI cs.LG

    Guideline for Trustworthy Artificial Intelligence -- AI Assessment Catalog

    Authors: Maximilian Poretschkin, Anna Schmitz, Maram Akila, Linara Adilova, Daniel Becker, Armin B. Cremers, Dirk Hecker, Sebastian Houben, Michael Mock, Julia Rosenzweig, Joachim Sicking, Elena Schulz, Angelika Voss, Stefan Wrobel

    Abstract: Artificial Intelligence (AI) has made impressive progress in recent years and represents a key technology that has a crucial impact on the economy and society. However, it is clear that AI and business models based on it can only reach their full potential if AI applications are developed according to high quality standards and are effectively protected against new AI risks. For instance, AI bears… ▽ More

    Submitted 20 June, 2023; originally announced July 2023.

  5. arXiv:2307.02337  [pdf, other

    cs.LG

    FAM: Relative Flatness Aware Minimization

    Authors: Linara Adilova, Amr Abourayya, Jianning Li, Amin Dada, Henning Petzka, Jan Egger, Jens Kleesiek, Michael Kamp

    Abstract: Flatness of the loss curve around a model at hand has been shown to empirically correlate with its generalization ability. Optimizing for flatness has been proposed as early as 1994 by Hochreiter and Schmidthuber, and was followed by more recent successful sharpness-aware optimization techniques. Their widespread adoption in practice, though, is dubious because of the lack of theoretically grounde… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: Proceedings of the 2nd Annual Workshop on Topology, Algebra, and Geometry in Machine Learning (TAG-ML) at the 40 th International Conference on Machine Learning, Honolulu, Hawaii, USA. 2023

  6. arXiv:2303.00596  [pdf, other

    cs.IT

    Information Plane Analysis for Dropout Neural Networks

    Authors: Linara Adilova, Bernhard C. Geiger, Asja Fischer

    Abstract: The information-theoretic framework promises to explain the predictive power of neural networks. In particular, the information plane analysis, which measures mutual information (MI) between input and representation as well as representation and output, should give rich insights into the training process. This approach, however, was shown to strongly depend on the choice of estimator of the MI. Th… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: Published as a conference paper at ICLR2023

  7. arXiv:2104.09254  [pdf, other

    cs.CV

    Plants Don't Walk on the Street: Common-Sense Reasoning for Reliable Semantic Segmentation

    Authors: Linara Adilova, Elena Schulz, Maram Akila, Sebastian Houben, Jan David Schneider, Fabian Hueger, Tim Wirtz

    Abstract: Data-driven sensor interpretation in autonomous driving can lead to highly implausible predictions as can most of the time be verified with common-sense knowledge. However, learning common knowledge only from data is hard and approaches for knowledge integration are an active research area. We propose to use a partly human-designed, partly learned set of rules to describe relations between objects… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: Published at SAIAD (Safe Artificial Intelligence for Automated Driving) workshop at CVPR2021

  8. arXiv:2103.03943  [pdf, other

    cs.LG

    Novelty Detection in Sequential Data by Informed Clustering and Modeling

    Authors: Linara Adilova, Siming Chen, Michael Kamp

    Abstract: Novelty detection in discrete sequences is a challenging task, since deviations from the process generating the normal data are often small or intentionally hidden. Novelties can be detected by modeling normal sequences and measuring the deviations of a new sequence from the model predictions. However, in many applications data is generated by several distinct processes so that models trained on a… ▽ More

    Submitted 10 July, 2023; v1 submitted 5 March, 2021; originally announced March 2021.

    Comments: AI&HCI Workshop at the 40th International Conference on Machine Learning (ICML), Honolulu, Hawaii, USA. 2023

  9. arXiv:2009.12098  [pdf, other

    cs.LG stat.ML

    Resource-Constrained On-Device Learning by Dynamic Averaging

    Authors: Lukas Heppe, Michael Kamp, Linara Adilova, Danny Heinrich, Nico Piatkowski, Katharina Morik

    Abstract: The communication between data-generating devices is partially responsible for a growing portion of the world's power consumption. Thus reducing communication is vital, both, from an economical and an ecological perspective. For machine learning, on-device learning avoids sending raw data, which can reduce communication substantially. Furthermore, not centralizing the data protects privacy-sensiti… ▽ More

    Submitted 25 September, 2020; originally announced September 2020.

  10. arXiv:2001.00939  [pdf, other

    cs.LG stat.ML

    Relative Flatness and Generalization

    Authors: Henning Petzka, Michael Kamp, Linara Adilova, Cristian Sminchisescu, Mario Boley

    Abstract: Flatness of the loss curve is conjectured to be connected to the generalization ability of machine learning models, in particular neural networks. While it has been empirically observed that flatness measures consistently correlate strongly with generalization, it is still an open theoretical problem why and under which circumstances flatness is connected to generalization, in particular in light… ▽ More

    Submitted 4 November, 2021; v1 submitted 3 January, 2020; originally announced January 2020.

    Comments: The first two authors made equal contribution; Accepted for publication at NeurIPS 2021; arXiv admin note: substantial text overlap with arXiv:1912.00058

  11. arXiv:1912.00058  [pdf, other

    cs.LG stat.ML

    A Reparameterization-Invariant Flatness Measure for Deep Neural Networks

    Authors: Henning Petzka, Linara Adilova, Michael Kamp, Cristian Sminchisescu

    Abstract: The performance of deep neural networks is often attributed to their automated, task-related feature construction. It remains an open question, though, why this leads to solutions with good generalization, even in cases where the number of parameters is larger than the number of samples. Back in the 90s, Hochreiter and Schmidhuber observed that flatness of the loss surface around a local minimum c… ▽ More

    Submitted 29 November, 2019; originally announced December 2019.

    Comments: 14 pages; accepted at Workshop "Science meets Engineering of Deep Learning", 33rd Conference on Neural Information Processing Systems (NeurIPS 2019)

  12. arXiv:1911.07652  [pdf, other

    cs.LG cs.DC cs.IT stat.ML

    Information-Theoretic Perspective of Federated Learning

    Authors: Linara Adilova, Julia Rosenzweig, Michael Kamp

    Abstract: An approach to distributed machine learning is to train models on local datasets and aggregate these models into a single, stronger model. A popular instance of this form of parallelization is federated learning, where the nodes periodically send their local models to a coordinator that aggregates them and redistributes the aggregation back to continue training with it. The most frequently used fo… ▽ More

    Submitted 15 November, 2019; originally announced November 2019.

    Comments: 5 pages, 8 figures Workshop on Information Theory and Machine Learning, 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  13. arXiv:1907.00874  [pdf, other

    cs.CR cs.LG

    System Misuse Detection via Informed Behavior Clustering and Modeling

    Authors: Linara Adilova, Livin Natious, Siming Chen, Olivier Thonnard, Michael Kamp

    Abstract: One of the main tasks of cybersecurity is recognizing malicious interactions with an arbitrary system. Currently, the logging information from each interaction can be collected in almost unrestricted amounts, but identification of attacks requires a lot of effort and time of security experts. We propose an approach for identifying fraud activity through modeling normal behavior in interactions wit… ▽ More

    Submitted 1 July, 2019; originally announced July 2019.

    Comments: 9 pages including appendix, DSN Workshop on Data-Centric Dependability and Security (http://dcds.lasige.di.fc.ul.pt/)

  14. arXiv:1809.10678  [pdf, other

    cs.LG stat.ML

    Introducing Noise in Decentralized Training of Neural Networks

    Authors: Linara Adilova, Nathalie Paul, Peter Schlicht

    Abstract: It has been shown that injecting noise into the neural network weights during the training process leads to a better generalization of the resulting model. Noise injection in the distributed setup is a straightforward technique and it represents a promising approach to improve the locally trained models. We investigate the effects of noise injection into the neural networks during a decentralized… ▽ More

    Submitted 27 September, 2018; originally announced September 2018.

    Comments: 13 pages

    Journal ref: ECML PKDD 2018, Workshop DMLE

  15. arXiv:1807.04687  [pdf, other

    cs.LG cs.CL stat.ML

    Making Efficient Use of a Domain Expert's Time in Relation Extraction

    Authors: Linara Adilova, Sven Giesselbach, Stefan Rü**

    Abstract: Scarcity of labeled data is one of the most frequent problems faced in machine learning. This is particularly true in relation extraction in text mining, where large corpora of texts exists in many application domains, while labeling of text data requires an expert to invest much time to read the documents. Overall, state-of-the art models, like the convolutional neural network used in this paper,… ▽ More

    Submitted 12 July, 2018; originally announced July 2018.

    Comments: DMNLP Workshop paper, ECML-PKDD 2017

  16. arXiv:1807.03210  [pdf, other

    cs.LG cs.AI cs.DC stat.ML

    Efficient Decentralized Deep Learning by Dynamic Model Averaging

    Authors: Michael Kamp, Linara Adilova, Joachim Sicking, Fabian Hüger, Peter Schlicht, Tim Wirtz, Stefan Wrobel

    Abstract: We propose an efficient protocol for decentralized training of deep neural networks from distributed data sources. The proposed protocol allows to handle different phases of model training equally well and to quickly adapt to concept drifts. This leads to a reduction of communication by an order of magnitude compared to periodically communicating state-of-the-art approaches. Moreover, we derive a… ▽ More

    Submitted 13 November, 2018; v1 submitted 9 July, 2018; originally announced July 2018.