
Showing 1–14 of 14 results for author: Wojcik, B

  1. arXiv:2312.10193  [pdf, other]

    cs.LG

    Adaptive Computation Modules: Granular Conditional Computation For Efficient Inference

    Authors: Bartosz Wójcik, Alessio Devoto, Karol Pustelnik, Pasquale Minervini, Simone Scardapane

    Abstract: The computational cost of transformer models makes them inefficient in low-latency or low-power applications. While techniques such as quantization or linear attention can reduce the computational load, they may incur a reduction in accuracy. In addition, globally reducing the cost for all inputs may be sub-optimal. We observe that for each layer, the full width of the layer may be needed only for…

    Submitted 15 December, 2023; originally announced December 2023.

  2. arXiv:2310.04361  [pdf, other]

    cs.LG

    Exploiting Activation Sparsity with Dense to Dynamic-k Mixture-of-Experts Conversion

    Authors: Filip Szatkowski, Bartosz Wójcik, Mikołaj Piórczyński, Simone Scardapane

    Abstract: Transformer models can face practical limitations due to their high computational requirements. At the same time, such models exhibit significant activation sparsity, which can be leveraged to reduce the inference cost by converting parts of the network into equivalent Mixture-of-Experts (MoE) layers. Despite the crucial role played by activation sparsity, its impact on this process remains unexpl…

    Submitted 7 June, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

  3. arXiv:2309.12033  [pdf, other]

    cs.CV cs.LG

    Face Identity-Aware Disentanglement in StyleGAN

    Authors: Adrian Suwała, Bartosz Wójcik, Magdalena Proszewska, Jacek Tabor, Przemysław Spurek, Marek Śmieja

    Abstract: Conditional GANs are frequently used for manipulating the attributes of face images, such as expression, hairstyle, pose, or age. Even though the state-of-the-art models successfully modify the requested attributes, they simultaneously modify other important characteristics of the image, such as a person's identity. In this paper, we focus on solving this problem by introducing PluGeN4Faces, a plu…

    Submitted 21 September, 2023; originally announced September 2023.

  4. Computer Vision based inspection on post-earthquake with UAV synthetic dataset

    Authors: Mateusz Żarski, Bartosz Wójcik, Jarosław A. Miszczak, Bartłomiej Blachowski, Mariusz Ostrowski

    Abstract: The area affected by the earthquake is vast and often difficult to entirely cover, and the earthquake itself is a sudden event that causes multiple defects simultaneously, that cannot be effectively traced using traditional, manual methods. This article presents an innovative approach to the problem of detecting damage after sudden events by using an interconnected set of deep machine learning mod…

    Submitted 11 October, 2022; originally announced October 2022.

    Comments: 15 pages, 8 figures, published version, software available from https://github.com/MatZar01/IC_SHM_P2

    Journal ref: IEEE Access, Vol. 10 (2022), pp. 108134-108144

  5. arXiv:2206.13923  [pdf, other]

    cs.LG

    SLOVA: Uncertainty Estimation Using Single Label One-Vs-All Classifier

    Authors: Bartosz Wójcik, Jacek Grela, Marek Śmieja, Krzysztof Misztal, Jacek Tabor

    Abstract: Deep neural networks present impressive performance, yet they cannot reliably estimate their predictive confidence, limiting their applicability in high-risk domains. We show that applying a multi-label one-vs-all loss reveals classification ambiguity and reduces model overconfidence. The introduced SLOVA (Single Label One-Vs-All) model redefines typical one-vs-all predictive probabilities to a si…

    Submitted 28 June, 2022; originally announced June 2022.

  6. arXiv:2206.07996  [pdf, other]

    cs.LG

    Continual Learning with Guarantees via Weight Interval Constraints

    Authors: Maciej Wołczyk, Karol J. Piczak, Bartosz Wójcik, Łukasz Pustelnik, Paweł Morawiecki, Jacek Tabor, Tomasz Trzciński, Przemysław Spurek

    Abstract: We introduce a new training paradigm that enforces interval constraints on neural network parameter space to control forgetting. Contemporary Continual Learning (CL) methods focus on training neural networks efficiently from a stream of data, while reducing the negative impact of catastrophic forgetting, yet they do not provide any firm guarantees that network performance will not deteriorate unco…

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: Short presentation at ICML 2022

  7. arXiv:2106.10944  [pdf, other]

    cs.CV cs.AI cs.LG cs.NE

    Hard hat wearing detection based on head keypoint localization

    Authors: Bartosz Wójcik, Mateusz Żarski, Kamil Książek, Jarosław Adam Miszczak, Mirosław Jan Skibniewski

    Abstract: In recent years, a lot of attention is paid to deep learning methods in the context of vision-based construction site safety systems, especially regarding personal protective equipment. However, despite all this attention, there is still no reliable way to establish the relationship between workers and their hard hats. To answer this problem a combination of deep learning, object detection and hea…

    Submitted 24 June, 2022; v1 submitted 21 June, 2021; originally announced June 2021.

    Comments: 17 pages, 9 figures and 9 tables

    Journal ref: Bull. Pol. Acad. Sci. Tech. Sci. Vol. 71, No. 6, pp. e147340 (2023)

  8. arXiv:2106.05409  [pdf, other]

    cs.LG

    Zero Time Waste: Recycling Predictions in Early Exit Neural Networks

    Authors: Maciej Wołczyk, Bartosz Wójcik, Klaudia Bałazy, Igor Podolak, Jacek Tabor, Marek Śmieja, Tomasz Trzciński

    Abstract: The problem of reducing processing time of large deep learning models is a fundamental challenge in many real-world applications. Early exit methods strive towards this goal by attaching additional Internal Classifiers (ICs) to intermediate layers of a neural network. ICs can quickly return predictions for easy examples and, as a result, reduce the average inference time of the whole model. Howeve…

    Submitted 5 December, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: Accepted at NeurIPS 2021

  9. arXiv:2006.10013  [pdf, other]

    cs.LG cs.CR stat.ML

    Adversarial Examples Detection and Analysis with Layer-wise Autoencoders

    Authors: Bartosz Wójcik, Paweł Morawiecki, Marek Śmieja, Tomasz Krzyżek, Przemysław Spurek, Jacek Tabor

    Abstract: We present a mechanism for detecting adversarial examples based on data representations taken from the hidden layers of the target network. For this purpose, we train individual autoencoders at intermediate layers of the target network. This allows us to describe the manifold of true data and, in consequence, decide whether a given example has the same characteristics as true data. It also gives u…

    Submitted 17 June, 2020; originally announced June 2020.

  10. KrakN: Transfer Learning framework for thin crack detection in infrastructure maintenance

    Authors: Mateusz Żarski, Bartosz Wójcik, Jarosław Adam Miszczak

    Abstract: Monitoring the technical condition of infrastructure is a crucial element to its maintenance. Currently applied methods are outdated, labour-intensive and inaccurate. At the same time, the latest methods using Artificial Intelligence techniques are severely limited in their application due to two main factors -- labour-intensive gathering of new datasets and high demand for computing power. We pro…

    Submitted 11 October, 2020; v1 submitted 26 April, 2020; originally announced April 2020.

    Comments: 23 pages, 15 figures and flowcharts, software available at https://github.com/MatZar01/KrakN, https://doi.org/10.5281/zenodo.3764697, and https://doi.org/10.5281/zenodo.3755452, dataset available from https://doi.org/10.5281/zenodo.3759845

    Journal ref: SoftwareX, Vol 16, 100893 (2021)

  11. arXiv:2004.08172  [pdf, other]

    cs.LG stat.ML

    Finding the Optimal Network Depth in Classification Tasks

    Authors: Bartosz Wójcik, Maciej Wołczyk, Klaudia Bałazy, Jacek Tabor

    Abstract: We develop a fast end-to-end method for training lightweight neural networks using multiple classifier heads. By allowing the model to determine the importance of each head and rewarding the choice of a single shallow classifier, we are able to detect and remove unneeded components of the network. This operation, which can be seen as finding the optimal depth of the model, significantly reduces th…

    Submitted 17 April, 2020; originally announced April 2020.

  12. arXiv:1905.12947  [pdf, other]

    cs.LG stat.ML

    One-element Batch Training by Moving Window

    Authors: Przemysław Spurek, Szymon Knop, Jacek Tabor, Igor Podolak, Bartosz Wójcik

    Abstract: Several deep models, esp. the generative, compare the samples from two distributions (e.g. WAE like AutoEncoder models, set-processing deep networks, etc) in their cost functions. Using all these methods one cannot train the model directly taking small size (in extreme -- one element) batches, due to the fact that samples are to be compared. We propose a generic approach to training such models…

    Submitted 31 May, 2019; v1 submitted 30 May, 2019; originally announced May 2019.

  13. arXiv:1902.07656  [pdf, other]

    cs.LG cs.AI math.OC stat.ML

    LOSSGRAD: automatic learning rate in gradient descent

    Authors: Bartosz Wójcik, Łukasz Maziarka, Jacek Tabor

    Abstract: In this paper, we propose a simple, fast and easy to implement algorithm LOSSGRAD (locally optimal step-size in gradient descent), which automatically modifies the step-size in gradient descent during neural networks training. Given a function $f$, a point $x$, and the gradient $\nabla_x f$ of $f$, we aim to find the step-size $h$ which is (locally) optimal, i.e. satisfies:…

    Submitted 20 February, 2019; originally announced February 2019.

    Comments: TFML 2019

    Journal ref: Schedae Informaticae, 2018, Volume 27

  14. arXiv:1306.6294  [pdf, other]

    cs.RO cs.AI cs.HC

    Learning Trajectory Preferences for Manipulators via Iterative Improvement

    Authors: Ashesh Jain, Brian Wojcik, Thorsten Joachims, Ashutosh Saxena

    Abstract: We consider the problem of learning good trajectories for manipulation tasks. This is challenging because the criterion defining a good trajectory varies with users, tasks and environments. In this paper, we propose a co-active online learning framework for teaching robots the preferences of its users for object manipulation tasks. The key novelty of our approach lies in the type of feedback expec…

    Submitted 5 November, 2013; v1 submitted 26 June, 2013; originally announced June 2013.

    Comments: 9 pages. To appear in NIPS 2013