Showing 1–11 of 11 results for author: Pioro, M

  1. arXiv:2406.08423  [pdf, other]

    cs.LG cs.AI

    State Soup: In-Context Skill Learning, Retrieval and Mixing

    Authors: Maciej Pióro, Maciej Wołczyk, Razvan Pascanu, Johannes von Oswald, João Sacramento

    Abstract: A new breed of gated-linear recurrent neural networks has reached state-of-the-art performance on a range of sequence modeling problems. Such models naturally handle long sequences efficiently, as the cost of processing a new input is independent of sequence length. Here, we explore another advantage of these stateful sequence models, inspired by the success of model merging through parameter inte…

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2402.07871  [pdf, other]

    cs.LG cs.AI cs.CL

    Scaling Laws for Fine-Grained Mixture of Experts

    Authors: Jakub Krajewski, Jan Ludziejewski, Kamil Adamczewski, Maciej Pióro, Michał Krutul, Szymon Antoniak, Kamil Ciebiera, Krystian Król, Tomasz Odrzygóźdź, Piotr Sankowski, Marek Cygan, Sebastian Jaszczur

    Abstract: Mixture of Experts (MoE) models have emerged as a primary solution for reducing the computational cost of Large Language Models. In this work, we analyze their scaling properties, incorporating an expanded range of variables. Specifically, we introduce a new hyperparameter, granularity, whose adjustment enables precise control over the size of the experts. Building on this, we establish scaling la…

    Submitted 12 February, 2024; originally announced February 2024.

  3. arXiv:2401.04081  [pdf, other]

    cs.LG cs.AI cs.CL

    MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts

    Authors: Maciej Pióro, Kamil Ciebiera, Krystian Król, Jan Ludziejewski, Michał Krutul, Jakub Krajewski, Szymon Antoniak, Piotr Miłoś, Marek Cygan, Sebastian Jaszczur

    Abstract: State Space Models (SSMs) have become serious contenders in the field of sequential modeling, challenging the dominance of Transformers. At the same time, Mixture of Experts (MoE) has significantly improved Transformer-based Large Language Models, including recent state-of-the-art open models. We propose that to unlock the potential of SSMs for scaling, they should be combined with MoE. We showcas…

    Submitted 26 February, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

  4. arXiv:2310.15961  [pdf, other]

    cs.CL cs.LG

    Mixture of Tokens: Efficient LLMs through Cross-Example Aggregation

    Authors: Szymon Antoniak, Sebastian Jaszczur, Michał Krutul, Maciej Pióro, Jakub Krajewski, Jan Ludziejewski, Tomasz Odrzygóźdź, Marek Cygan

    Abstract: Despite the promise of Mixture of Experts (MoE) models in increasing parameter counts of Transformer models while maintaining training and inference costs, their application carries notable drawbacks. The key strategy of these models is to, for each processed token, activate at most a few experts - subsets of an extensive feed-forward layer. But this approach is not without its challenges. The ope…

    Submitted 24 October, 2023; originally announced October 2023.

  5. arXiv:2211.04470  [pdf, other]

    cs.CV eess.IV

    Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI & AIM 2022 Challenge: Report

    Authors: Andrey Ignatov, Grigory Malivenko, Radu Timofte, Lukasz Treszczotko, Xin Chang, Piotr Ksiazek, Michal Lopuszynski, Maciej Pioro, Rafal Rudnicki, Maciej Smyl, Yujie Ma, Zhenyu Li, Zehui Chen, Jialei Xu, Xianming Liu, Junjun Jiang, XueChao Shi, Difan Xu, Yanan Li, Xiaotao Wang, Lei Lei, Ziyu Zhang, Yicheng Wang, Zilong Huang, Guozhong Luo, et al. (14 additional authors not shown)

    Abstract: Various depth estimation models are now widely used on many mobile and IoT devices for image segmentation, bokeh effect rendering, object tracking and many other mobile tasks. Thus, it is very crucial to have efficient and accurate depth estimation models that can run fast on low-power mobile chipsets. In this Mobile AI challenge, the target was to develop deep learning-based single image depth es…

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2105.08630, arXiv:2211.03885; text overlap with arXiv:2105.08819, arXiv:2105.08826, arXiv:2105.08629, arXiv:2105.07809, arXiv:2105.07825

  6. arXiv:2005.05048  [pdf, other]

    cs.NI eess.SP

    A Light Signalling Approach to Node Grouping for Massive MIMO IoT Networks

    Authors: Emma Fitzgerald, Michał Pióro, Harsh Tataria, Gilles Callebaut, Sara Gunnarsson, Liesbet Van der Perre

    Abstract: Massive MIMO is a leading technology to connect very large numbers of energy constrained nodes, as it offers both extensive spatial multiplexing and large array gain. A challenge resides in partitioning the many nodes into groups that can communicate simultaneously such that the mutual interference is minimized. We here propose node partitioning strategies that do not require full channel state in…

    Submitted 16 June, 2022; v1 submitted 11 May, 2020; originally announced May 2020.

    Journal ref: Computers, 2022, 11(6), 98

  7. arXiv:1909.11482  [pdf, other]

    cs.NI

    Efficient Pilot Allocation for URLLC Traffic in 5G Industrial IoT Networks

    Authors: Emma Fitzgerald, Michał Pióro

    Abstract: In this paper we address the problem of resource allocation for alarm traffic in industrial Internet of Things networks using massive MIMO. We formulate the general problem of how to allocate pilot signals to alarm traffic such that delivery is guaranteed, while also minimising the number of pilots reserved for alarms, thus maximising the channel resources available for other traffic, such as indu…

    Submitted 25 September, 2019; originally announced September 2019.

    Comments: 11th International Workshop on Resilient Networks Design and Modeling (RNDM) 2019

  8. arXiv:1908.05055  [pdf, other]

    cs.NI

    Network Lifetime Maximization in Wireless Mesh Networks for Machine-to-Machine Communication

    Authors: Emma Fitzgerald, Michał Pióro, Artur Tomaszewski

    Abstract: In this paper we present new optimization formulations for maximizing the network lifetime in wireless mesh networks performing data aggregation and dissemination for machine-to-machine communication in the Internet of Things. We focus on heterogeneous networks in which multiple applications co-exist and nodes may take on different roles for different applications. Moreover, we address network rec…

    Submitted 14 August, 2019; originally announced August 2019.

    Comments: Ad Hoc Networks, in press

  9. Massive MIMO Optimization with Compatible Sets

    Authors: Emma Fitzgerald, Michał Pióro, Fredrik Tufvesson

    Abstract: Massive multiple-input multiple-output (MIMO) is expected to be a vital component in future 5G systems. As such, there is a need for new modeling in order to investigate the performance of massive MIMO not only at the physical layer, but also higher up the networking stack. In this paper, we present general optimization models for massive MIMO, based on mixed-integer programming and compatible set…

    Submitted 26 March, 2019; v1 submitted 19 March, 2019; originally announced March 2019.

    Journal ref: IEEE Transactions on Wireless Communications, volume 19, issue 5, pages 2794 - 2812, 2019

  10. Semi-Distributed Demand Response Solutions for Smart Homes

    Authors: Rim Kaddah, Daniel Kofman, Fabien Mathieu, Michal Pioro

    Abstract: The Internet of Things (IoT) paradigm brings an opportunity for advanced Demand Response (DR) solutions. It enables visibility and control on the various appliances that may consume, store or generate energy within a home. It has been shown that a centralized control on the appliances of a set of households leads to efficient DR mechanisms; unfortunately, such solutions raise privacy and scalabili…

    Submitted 30 November, 2017; originally announced November 2017.

    Journal ref: Information Innovation Technology in Smart Cities, pp.17 (163-179), 2017

  11. arXiv:1406.2480  [pdf, ps, other]

    cs.NI

    Optimization of Free Space Optical Wireless Network for Cellular Backhauling

    Authors: Yuan Li, Nikolaos Pappas, Vangelis Angelakis, Michal Pióro, Di Yuan

    Abstract: With densification of nodes in cellular networks, free space optic (FSO) connections are becoming an appealing low cost and high rate alternative to copper and fiber as the backhaul solution for wireless communication systems. To ensure a reliable cellular backhaul, provisions for redundant, disjoint paths between the nodes must be made in the design phase. This paper aims at finding a cost-effect…

    Submitted 10 June, 2014; originally announced June 2014.