Skip to main content

Showing 1–9 of 9 results for author: Stypułkowski, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.09633  [pdf, other

    cs.CV eess.IV

    Dimma: Semi-supervised Low Light Image Enhancement with Adaptive Dimming

    Authors: Wojciech Kozłowski, Michał Szachniewicz, Michał Stypułkowski, Maciej Zięba

    Abstract: Enhancing low-light images while maintaining natural colors is a challenging problem due to camera processing variations and limited access to photos with ground-truth lighting conditions. The latter is a crucial factor for supervised methods that achieve good results on paired datasets but do not handle out-of-domain data well. On the other hand, unsupervised methods, while able to generalize, of… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

  2. arXiv:2307.05325  [pdf, other

    cs.CV

    Self-supervised adversarial masking for 3D point cloud representation learning

    Authors: Michał Szachniewicz, Wojciech Kozłowski, Michał Stypułkowski, Maciej Zięba

    Abstract: Self-supervised methods have been proven effective for learning deep representations of 3D point cloud data. Although recent methods in this domain often rely on random masking of inputs, the results of this approach can be improved. We introduce PointCAM, a novel adversarial method for learning a masking function for point clouds. Our model utilizes a self-distillation framework with an online to… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

  3. arXiv:2301.04474  [pdf, other

    cs.CV cs.LG cs.SD eess.AS

    Speech Driven Video Editing via an Audio-Conditioned Diffusion Model

    Authors: Dan Bigioi, Shubhajit Basak, Michał Stypułkowski, Maciej Zięba, Hugh Jordan, Rachel McDonnell, Peter Corcoran

    Abstract: Taking inspiration from recent developments in visual generative tasks using diffusion models, we propose a method for end-to-end speech-driven video editing using a denoising diffusion model. Given a video of a talking person, and a separate auditory speech recording, the lip and jaw motions are re-synchronized without relying on intermediate structural representations such as facial landmarks or… ▽ More

    Submitted 11 May, 2023; v1 submitted 10 January, 2023; originally announced January 2023.

    Comments: 8 Pages, code and project page available here: https://danbigioi.github.io/DiffusionVideoEditing/

  4. arXiv:2301.03396  [pdf, other

    cs.CV

    Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation

    Authors: Michał Stypułkowski, Konstantinos Vougioukas, Sen He, Maciej Zięba, Stavros Petridis, Maja Pantic

    Abstract: Talking face generation has historically struggled to produce head movements and natural facial expressions without guidance from additional reference videos. Recent developments in diffusion-based generative models allow for more realistic and stable data synthesis and their performance on image and video generation has surpassed that of other generative models. In this work, we present an autore… ▽ More

    Submitted 29 July, 2023; v1 submitted 6 January, 2023; originally announced January 2023.

  5. arXiv:2205.08013  [pdf, other

    cs.LG cs.CV

    Continual learning on 3D point clouds with random compressed rehearsal

    Authors: Maciej Zamorski, Michał Stypułkowski, Konrad Karanowski, Tomasz Trzciński, Maciej Zięba

    Abstract: Contemporary deep neural networks offer state-of-the-art results when applied to visual reasoning, e.g., in the context of 3D point cloud data. Point clouds are important datatype for precise modeling of three-dimensional environments, but effective processing of this type of data proves to be challenging. In the world of large, heavily-parameterized network architectures and continuously-streamed… ▽ More

    Submitted 20 May, 2022; v1 submitted 16 May, 2022; originally announced May 2022.

    Comments: 10 pages, 3 figures

  6. arXiv:2106.11603  [pdf, ps, other

    cs.LG cs.SD eess.AS

    Information Retrieval for ZeroSpeech 2021: The Submission by University of Wroclaw

    Authors: Jan Chorowski, Grzegorz Ciesielski, Jarosław Dzikowski, Adrian Łańcucki, Ricard Marxer, Mateusz Opala, Piotr Pusz, Paweł Rychlikowski, Michał Stypułkowski

    Abstract: We present a number of low-resource approaches to the tasks of the Zero Resource Speech Challenge 2021. We build on the unsupervised representations of speech proposed by the organizers as a baseline, derived from CPC and clustered with the k-means algorithm. We demonstrate that simple methods of refining those representations can narrow the gap, or even improve upon the solutions which use a high… ▽ More

    Submitted 22 June, 2021; originally announced June 2021.

    Comments: Published in Interspeech 2021

  7. arXiv:2104.11946  [pdf, other

    cs.LG cs.SD eess.AS

    Aligned Contrastive Predictive Coding

    Authors: Jan Chorowski, Grzegorz Ciesielski, Jarosław Dzikowski, Adrian Łańcucki, Ricard Marxer, Mateusz Opala, Piotr Pusz, Paweł Rychlikowski, Michał Stypułkowski

    Abstract: We investigate the possibility of forcing a self-supervised model trained using a contrastive predictive loss to extract slowly varying latent representations. Rather than producing individual predictions for each of the future representations, the model emits a sequence of predictions shorter than that of the upcoming representations to which they will be aligned. In this way, the prediction netw… ▽ More

    Submitted 22 June, 2021; v1 submitted 24 April, 2021; originally announced April 2021.

    Comments: Published in Interspeech 2021

  8. arXiv:2010.11087  [pdf, other

    cs.CV cs.LG

    Representing Point Clouds with Generative Conditional Invertible Flow Networks

    Authors: Michał Stypułkowski, Kacper Kania, Maciej Zamorski, Maciej Zięba, Tomasz Trzciński, Jan Chorowski

    Abstract: In this paper, we propose a simple yet effective method to represent point clouds as sets of samples drawn from a cloud-specific probability distribution. This interpretation matches intrinsic characteristics of point clouds: the number of points and their ordering within a cloud is not important as all points are drawn from the proximity of the object boundary. We postulate to represent each clou… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

  9. arXiv:1910.07344  [pdf, other

    cs.LG cs.CV stat.ML

    Conditional Invertible Flow for Point Cloud Generation

    Authors: Michał Stypułkowski, Maciej Zamorski, Maciej Zięba, Jan Chorowski

    Abstract: This paper focuses on a novel generative approach for 3D point clouds that makes use of invertible flow-based models. The main idea of the method is to treat a point cloud as a probability density in 3D space that is modeled using a cloud-specific neural network. To capture the similarity between point clouds we rely on parameter sharing among networks, with each cloud having only a small embeddin… ▽ More

    Submitted 16 October, 2019; originally announced October 2019.

    Comments: Published in Sets & Partitions Workshop at NeurIPS 2019 (https://www.sets.parts/)