Skip to main content

Showing 1–23 of 23 results for author: Wozniak, S

.
  1. arXiv:2404.05892  [pdf, other

    cs.CL cs.AI

    Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

    Authors: Bo Peng, Daniel Goldstein, Quentin Anthony, Alon Albalak, Eric Alcaide, Stella Biderman, Eugene Cheah, Xingjian Du, Teddy Ferdinan, Haowen Hou, Przemysław Kazienko, Kranthi Kiran GV, Jan Kocoń, Bartłomiej Koptyra, Satyapriya Krishna, Ronald McClelland Jr., Niklas Muennighoff, Fares Obeid, Atsushi Saito, Guangyu Song, Haoqin Tu, Stanisław Woźniak, Ruichong Zhang, Bingchen Zhao, Qihang Zhao , et al. (3 additional authors not shown)

    Abstract: We present Eagle (RWKV-5) and Finch (RWKV-6), sequence models improving upon the RWKV (RWKV-4) architecture. Our architectural design advancements include multi-headed matrix-valued states and a dynamic recurrence mechanism that improve expressivity while maintaining the inference efficiency characteristics of RNNs. We introduce a new multilingual corpus with 1.12 trillion tokens and a fast tokeni… ▽ More

    Submitted 10 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

  2. arXiv:2402.09269  [pdf, other

    cs.CL cs.AI

    Personalized Large Language Models

    Authors: Stanisław Woźniak, Bartłomiej Koptyra, Arkadiusz Janz, Przemysław Kazienko, Jan Kocoń

    Abstract: Large language models (LLMs) have significantly advanced Natural Language Processing (NLP) tasks in recent years. However, their universal nature poses limitations in scenarios requiring personalized responses, such as recommendation systems and chatbots. This paper investigates methods to personalize LLMs, comparing fine-tuning and zero-shot reasoning approaches on subjective tasks. Results demon… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  3. arXiv:2312.08198  [pdf, other

    cs.CL cs.AI

    Towards Model-Based Data Acquisition for Subjective Multi-Task NLP Problems

    Authors: Kamil Kanclerz, Julita Bielaniewicz, Marcin Gruza, Jan Kocon, Stanisław Woźniak, Przemysław Kazienko

    Abstract: Data annotated by humans is a source of knowledge by describing the peculiarities of the problem and therefore fueling the decision process of the trained model. Unfortunately, the annotation process for subjective natural language processing (NLP) problems like offensiveness or emotion detection is often very expensive and time-consuming. One of the inevitable risks is to spend some of the funds… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  4. arXiv:2312.04720  [pdf, other

    cs.CL cs.AI cs.LG

    From Big to Small Without Losing It All: Text Augmentation with ChatGPT for Efficient Sentiment Analysis

    Authors: Stanisław Woźniak, Jan Kocoń

    Abstract: In the era of artificial intelligence, data is gold but costly to annotate. The paper demonstrates a groundbreaking solution to this dilemma using ChatGPT for text augmentation in sentiment analysis. We leverage ChatGPT's generative capabilities to create synthetic training data that significantly improves the performance of smaller models, making them competitive with, or even outperforming, thei… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: 10 pages, 9 figures, presented at ICDM Workshop: SENTIRE 2023

  5. SRAI: Towards Standardization of Geospatial AI

    Authors: Piotr Gramacki, Kacper Leśniara, Kamil Raczycki, Szymon Woźniak, Marcin Przymus, Piotr Szymański

    Abstract: Spatial Representations for Artificial Intelligence (srai) is a Python library for working with geospatial data. The library can download geospatial data, split a given area into micro-regions using multiple algorithms and train an embedding model using various architectures. It includes baseline models as well as more complex methods from published works. Those capabilities make it possible to us… ▽ More

    Submitted 23 October, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted for the 6th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery (GeoAI 2023)

  6. arXiv:2306.08744  [pdf, other

    cs.NE cs.LG

    High-performance deep spiking neural networks with 0.3 spikes per neuron

    Authors: Ana Stanojevic, Stanisław Woźniak, Guillaume Bellec, Giovanni Cherubini, Angeliki Pantazi, Wulfram Gerstner

    Abstract: Communication by rare, binary spikes is a key factor for the energy efficiency of biological brains. However, it is harder to train biologically-inspired spiking neural networks (SNNs) than artificial neural networks (ANNs). This is puzzling given that theoretical results provide exact map** algorithms from ANNs to SNNs with time-to-first-spike (TTFS) coding. In this paper we analyze in theory a… ▽ More

    Submitted 20 November, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

  7. arXiv:2306.07902  [pdf, other

    cs.CL cs.AI

    Massively Multilingual Corpus of Sentiment Datasets and Multi-faceted Sentiment Classification Benchmark

    Authors: Łukasz Augustyniak, Szymon Woźniak, Marcin Gruza, Piotr Gramacki, Krzysztof Rajda, Mikołaj Morzy, Tomasz Kajdanowicz

    Abstract: Despite impressive advancements in multilingual corpora collection and model training, develo** large-scale deployments of multilingual models still presents a significant challenge. This is particularly true for language tasks that are culture-dependent. One such example is the area of multilingual sentiment analysis, where affective markers can be subtle and deeply ensconced in culture. This w… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: submitted to NeurIPS 2023 Datasets and Benchmarks track. Dataset: https://huggingface.co/datasets/Brand24/mms Code: https://github.com/Brand24-AI/mms_benchmark

  8. arXiv:2305.13048  [pdf, other

    cs.CL cs.AI

    RWKV: Reinventing RNNs for the Transformer Era

    Authors: Bo Peng, Eric Alcaide, Quentin Anthony, Alon Albalak, Samuel Arcadinho, Stella Biderman, Huanqi Cao, Xin Cheng, Michael Chung, Matteo Grella, Kranthi Kiran GV, Xuzheng He, Haowen Hou, Jiaju Lin, Przemyslaw Kazienko, Jan Kocon, Jiaming Kong, Bartlomiej Koptyra, Hayden Lau, Krishna Sri Ipsit Mantri, Ferdinand Mom, Atsushi Saito, Guangyu Song, Xiangru Tang, Bolun Wang , et al. (9 additional authors not shown)

    Abstract: Transformers have revolutionized almost all natural language processing (NLP) tasks but suffer from memory and computational complexity that scales quadratically with sequence length. In contrast, recurrent neural networks (RNNs) exhibit linear scaling in memory and computational requirements but struggle to match the same performance as Transformers due to limitations in parallelization and scala… ▽ More

    Submitted 10 December, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

  9. arXiv:2304.07139  [pdf, other

    cs.CV cs.NE

    Neuromorphic Optical Flow and Real-time Implementation with Event Cameras

    Authors: Yannick Schnider, Stanislaw Wozniak, Mathias Gehrig, Jules Lecomte, Axel von Arnim, Luca Benini, Davide Scaramuzza, Angeliki Pantazi

    Abstract: Optical flow provides information on relative motion that is an important component in many computer vision pipelines. Neural networks provide high accuracy optical flow, yet their complexity is often prohibitive for application at the edge or in robots, where efficiency and latency play crucial role. To address this challenge, we build on the latest developments in event-based vision and spiking… ▽ More

    Submitted 12 July, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: Accepted for IEEE CVPRW, Vancouver 2023. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media. Copyright 2023 IEEE

  10. Dynamic Event-based Optical Identification and Communication

    Authors: Axel von Arnim, Jules Lecomte, Naima Elosegui Borras, Stanislaw Wozniak, Angeliki Pantazi

    Abstract: Optical identification is often done with spatial or temporal visual pattern recognition and localization. Temporal pattern recognition, depending on the technology, involves a trade-off between communication frequency, range and accurate tracking. We propose a solution with light-emitting beacons that improves this trade-off by exploiting fast event-based cameras and, for tracking, sparse neuromo… ▽ More

    Submitted 7 May, 2024; v1 submitted 13 March, 2023; originally announced March 2023.

    Journal ref: Front. Neurorobot. 18:1290965

  11. arXiv:2302.10724  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    ChatGPT: Jack of all trades, master of none

    Authors: Jan Kocoń, Igor Cichecki, Oliwier Kaszyca, Mateusz Kochanek, Dominika Szydło, Joanna Baran, Julita Bielaniewicz, Marcin Gruza, Arkadiusz Janz, Kamil Kanclerz, Anna Kocoń, Bartłomiej Koptyra, Wiktoria Mieleszczenko-Kowszewicz, Piotr Miłkowski, Marcin Oleksy, Maciej Piasecki, Łukasz Radliński, Konrad Wojtasik, Stanisław Woźniak, Przemysław Kazienko

    Abstract: OpenAI has released the Chat Generative Pre-trained Transformer (ChatGPT) and revolutionized the approach in artificial intelligence to human-model interaction. Several publications on ChatGPT evaluation test its effectiveness on well-known natural language processing (NLP) tasks. However, the existing studies are mostly non-automated and tested on a very limited scale. In this work, we examined C… ▽ More

    Submitted 9 June, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: preprint

    Journal ref: Information Fusion 101861 (2023)

  12. arXiv:2212.12522  [pdf, other

    cs.NE cs.LG

    An Exact Map** From ReLU Networks to Spiking Neural Networks

    Authors: Ana Stanojevic, Stanisław Woźniak, Guillaume Bellec, Giovanni Cherubini, Angeliki Pantazi, Wulfram Gerstner

    Abstract: Deep spiking neural networks (SNNs) offer the promise of low-power artificial intelligence. However, training deep SNNs from scratch or converting deep artificial neural networks to SNNs without loss of performance has been a challenge. Here we propose an exact map** from a network with Rectified Linear Units (ReLUs) to an SNN that fires exactly one spike per neuron. For our constructive proof,… ▽ More

    Submitted 23 December, 2022; originally announced December 2022.

  13. arXiv:2209.14017  [pdf

    cs.NE

    On the visual analytic intelligence of neural networks

    Authors: Stanisław Woźniak, Hlynur Jónsson, Giovanni Cherubini, Angeliki Pantazi, Evangelos Eleftheriou

    Abstract: Visual oddity task was conceived as a universal ethnic-independent analytic intelligence test for humans. Advancements in artificial intelligence led to important breakthroughs, yet competing with humans on such analytic intelligence tasks remains challenging and typically resorts to non-biologically-plausible architectures. We present a biologically realistic system that receives inputs from synt… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

  14. arXiv:2204.04937  [pdf, other

    cs.CL cs.LG

    Assessment of Massively Multilingual Sentiment Classifiers

    Authors: Krzysztof Rajda, Łukasz Augustyniak, Piotr Gramacki, Marcin Gruza, Szymon Woźniak, Tomasz Kajdanowicz

    Abstract: Models are increasing in size and complexity in the hunt for SOTA. But what if those 2\% increase in performance does not make a difference in a production use case? Maybe benefits from a smaller, faster model outweigh those slight performance gains. Also, equally good performance across languages in multilingual tasks is more important than SOTA results on a single one. We present the biggest, un… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: Accepted for WASSA at ACL 2022

  15. Hex2vec -- Context-Aware Embedding H3 Hexagons with OpenStreetMap Tags

    Authors: Szymon Woźniak, Piotr Szymański

    Abstract: Representation learning of spatial and geographic data is a rapidly develo** field which allows for similarity detection between areas and high-quality inference using deep neural networks. Past approaches however concentrated on embedding raster imagery (maps, street or satellite photos), mobility data or road networks. In this paper we propose the first approach to learning vector representati… ▽ More

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: Accepted at 4th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery (GEOAI '21)

  16. gtfs2vec -- Learning GTFS Embeddings for comparing Public Transport Offer in Microregions

    Authors: Piotr Gramacki, Szymon Woźniak, Piotr Szymański

    Abstract: We selected 48 European cities and gathered their public transport timetables in the GTFS format. We utilized Uber's H3 spatial index to divide each city into hexagonal micro-regions. Based on the timetables data we created certain features describing the quantity and variety of public transport availability in each region. Next, we trained an auto-associative deep neural network to embed each of… ▽ More

    Submitted 2 November, 2021; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: Accepted at 1st ACM SIGSPATIAL International Workshop on Searching and Mining Large Collections of Geospatial Data (GeoSearch 2021)

  17. arXiv:2110.02743  [pdf, other

    eess.AS cs.LG cs.NE q-bio.QM

    Towards efficient end-to-end speech recognition with biologically-inspired neural networks

    Authors: Thomas Bohnstingl, Ayush Garg, Stanisław Woźniak, George Saon, Evangelos Eleftheriou, Angeliki Pantazi

    Abstract: Automatic speech recognition (ASR) is a capability which enables a program to process human speech into a written form. Recent developments in artificial intelligence (AI) have led to high-accuracy ASR systems based on deep neural networks, such as the recurrent neural network transducer (RNN-T). However, the core components and the performed operations of these approaches depart from the powerful… ▽ More

    Submitted 4 November, 2021; v1 submitted 4 October, 2021; originally announced October 2021.

    Comments: Accepted at the Efficient Natural Language and Speech Processing workshop at NeurIPS 2021

  18. arXiv:2104.11604  [pdf, other

    cs.NE cs.AI

    Learning in Deep Neural Networks Using a Biologically Inspired Optimizer

    Authors: Giorgia Dellaferrera, Stanislaw Wozniak, Giacomo Indiveri, Angeliki Pantazi, Evangelos Eleftheriou

    Abstract: Plasticity circuits in the brain are known to be influenced by the distribution of the synaptic weights through the mechanisms of synaptic integration and local regulation of synaptic strength. However, the complex interplay of stimulation-dependent plasticity with local learning signals is disregarded by most of the artificial neural network training algorithms devised so far. Here, we propose a… ▽ More

    Submitted 23 April, 2021; originally announced April 2021.

  19. arXiv:2007.12723  [pdf, other

    cs.LG cs.NE stat.ML

    Online Spatio-Temporal Learning in Deep Neural Networks

    Authors: Thomas Bohnstingl, Stanisław Woźniak, Wolfgang Maass, Angeliki Pantazi, Evangelos Eleftheriou

    Abstract: Biological neural networks are equipped with an inherent capability to continuously adapt through online learning. This aspect remains in stark contrast to learning with error backpropagation through time (BPTT) applied to recurrent neural networks (RNNs), or recently to biologically-inspired spiking neural networks (SNNs). BPTT involves offline computation of the gradients due to the requirement… ▽ More

    Submitted 8 October, 2020; v1 submitted 24 July, 2020; originally announced July 2020.

    Comments: Main manuscript: 9 pages, 3 figures, 1 table, Supplementary notes: 13 pages

  20. Exploiting Rays in Blind Localization of Distributed Sensor Arrays

    Authors: Szymon Woźniak, Konrad Kowalczyk

    Abstract: Many signal processing algorithms for distributed sensors are capable of improving their performance if the positions of sensors are known. In this paper, we focus on estimators for inferring the relative geometry of distributed arrays and sources, i.e. the setup geometry up to a scaling factor. Firstly, we present the Maximum Likelihood estimator derived under the assumption that the Direction of… ▽ More

    Submitted 1 February, 2020; originally announced February 2020.

    Comments: 5 pages, 2 figures, Accepted to ICASSP 2020

    Journal ref: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  21. Deep learning incorporating biologically-inspired neural dynamics

    Authors: Stanisław Woźniak, Angeliki Pantazi, Thomas Bohnstingl, Evangelos Eleftheriou

    Abstract: Neural networks have become the key technology of artificial intelligence and have contributed to breakthroughs in several machine learning tasks, primarily owing to advances in deep learning applied to Artificial Neural Networks (ANNs). Simultaneously, Spiking Neural Networks (SNNs) incorporating biologically-feasible spiking neurons have held great promise because of their rich temporal dynamics… ▽ More

    Submitted 19 May, 2019; v1 submitted 17 December, 2018; originally announced December 2018.

    Journal ref: Nat Mach Intell 2, 325-336 (2020)

  22. arXiv:1212.4989  [pdf, other

    cs.SI cs.CR

    Towards Trustworthy Mobile Social Networking Services for Disaster Response

    Authors: Sander Wozniak, Michael Rossberg, Guenter Schaefer

    Abstract: Situational awareness is crucial for effective disaster management. However, obtaining information about the actual situation is usually difficult and time-consuming. While there has been some effort in terms of incorporating the affected population as a source of information, the issue of obtaining trustworthy information has not yet received much attention. Therefore, we introduce the concept of… ▽ More

    Submitted 10 January, 2013; v1 submitted 20 December, 2012; originally announced December 2012.

  23. arXiv:1210.0061  [pdf, other

    cs.CR cs.NI

    Geocast into the Past: Towards a Privacy-Preserving Spatiotemporal Multicast for Cellular Networks

    Authors: Sander Wozniak, Michael Rossberg, Franz Girlich, Guenter Schaefer

    Abstract: This article introduces the novel concept of Spatiotemporal Multicast (STM), which is the issue of sending a message to mobile devices that have been residing at a specific area during a certain time span in the past. A wide variety of applications can be envisioned for this concept, including crime investigation, disease control, and social applications. An important aspect of these applications… ▽ More

    Submitted 28 January, 2013; v1 submitted 28 September, 2012; originally announced October 2012.

    Comments: - added IEEE copyright notice