
Showing 1–5 of 5 results for author: Ciebiera, K

  1. arXiv:2407.08626  [pdf, other]

    cs.LG cs.RO

    RoboMorph: Evolving Robot Morphology using Large Language Models

    Authors: Kevin Qiu, Krzysztof Ciebiera, Paweł Fijałkowski, Marek Cygan, Łukasz Kuciński

    Abstract: We introduce RoboMorph, an automated approach for generating and optimizing modular robot designs using large language models (LLMs) and evolutionary algorithms. In this framework, we represent each robot design as a grammar and leverage the capabilities of LLMs to navigate the extensive robot design space, which is traditionally time-consuming and computationally demanding. By integrating automat…

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2402.07871  [pdf, other]

    cs.LG cs.AI cs.CL

    Scaling Laws for Fine-Grained Mixture of Experts

    Authors: Jakub Krajewski, Jan Ludziejewski, Kamil Adamczewski, Maciej Pióro, Michał Krutul, Szymon Antoniak, Kamil Ciebiera, Krystian Król, Tomasz Odrzygóźdź, Piotr Sankowski, Marek Cygan, Sebastian Jaszczur

    Abstract: Mixture of Experts (MoE) models have emerged as a primary solution for reducing the computational cost of Large Language Models. In this work, we analyze their scaling properties, incorporating an expanded range of variables. Specifically, we introduce a new hyperparameter, granularity, whose adjustment enables precise control over the size of the experts. Building on this, we establish scaling la…

    Submitted 12 February, 2024; originally announced February 2024.

  3. arXiv:2401.04081  [pdf, other]

    cs.LG cs.AI cs.CL

    MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts

    Authors: Maciej Pióro, Kamil Ciebiera, Krystian Król, Jan Ludziejewski, Michał Krutul, Jakub Krajewski, Szymon Antoniak, Piotr Miłoś, Marek Cygan, Sebastian Jaszczur

    Abstract: State Space Models (SSMs) have become serious contenders in the field of sequential modeling, challenging the dominance of Transformers. At the same time, Mixture of Experts (MoE) has significantly improved Transformer-based Large Language Models, including recent state-of-the-art open models. We propose that to unlock the potential of SSMs for scaling, they should be combined with MoE. We showcas…

    Submitted 26 February, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

  4. arXiv:2303.04452  [pdf, other]

    cs.RO cs.CV cs.LG

    Grasping Student: semi-supervised learning for robotic manipulation

    Authors: Piotr Krzywicki, Krzysztof Ciebiera, Rafał Michaluk, Inga Maziarz, Marek Cygan

    Abstract: Gathering real-world data from the robot quickly becomes a bottleneck when constructing a robot learning system for grasping. In this work, we design a semi-supervised grasping system that, on top of a small sample of robot experience, takes advantage of images of products to be picked, which are collected without any interactions with the robot. We validate our findings both in the simulation and…

    Submitted 8 March, 2023; originally announced March 2023.

    ACM Class: I.2; I.4; I.2.9

  5. arXiv:1410.7534  [pdf, other]

    cs.DS cs.SE

    Approximation Algorithms for Steiner Tree Problems Based on Universal Solution Frameworks

    Authors: Krzysztof Ciebiera, Piotr Godlewski, Piotr Sankowski, Piotr Wygocki

    Abstract: This paper summarizes the work on implementing few solutions for the Steiner Tree problem which we undertook in the PAAL project. The main focus of the project is the development of generic implementations of approximation algorithms together with universal solution frameworks. In particular, we have implemented Zelikovsky 11/6-approximation using local search framework, and 1.39-approximation by…

    Submitted 28 October, 2014; originally announced October 2014.