Skip to main content

Showing 1–13 of 13 results for author: Maziarz, K

.
  1. arXiv:2406.18739  [pdf, other

    cs.LG

    RetroGFN: Diverse and Feasible Retrosynthesis using GFlowNets

    Authors: Piotr Gaiński, Michał Koziarski, Krzysztof Maziarz, Marwin Segler, Jacek Tabor, Marek Śmieja

    Abstract: Single-step retrosynthesis aims to predict a set of reactions that lead to the creation of a target molecule, which is a crucial task in molecular discovery. Although a target molecule can often be synthesized with multiple different reactions, it is not clear how to verify the feasibility of a reaction, because the available datasets cover only a tiny fraction of the possible solutions. Consequen… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2310.19796  [pdf, other

    cs.LG cs.AI q-bio.QM

    Re-evaluating Retrosynthesis Algorithms with Syntheseus

    Authors: Krzysztof Maziarz, Austin Tripp, Guoqing Liu, Megan Stanley, Shufang Xie, Piotr Gaiński, Philipp Seidl, Marwin Segler

    Abstract: The planning of how to synthesize molecules, also known as retrosynthesis, has been a growing focus of the machine learning and chemistry communities in recent years. Despite the appearance of steady progress, we argue that imperfect benchmarks and inconsistent comparisons mask systematic shortcomings of existing techniques. To remedy this, we present a benchmarking library called syntheseus which… ▽ More

    Submitted 19 February, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

  3. arXiv:2310.09270  [pdf, other

    cs.AI cs.LG

    Retro-fallback: retrosynthetic planning in an uncertain world

    Authors: Austin Tripp, Krzysztof Maziarz, Sarah Lewis, Marwin Segler, José Miguel Hernández-Lobato

    Abstract: Retrosynthesis is the task of planning a series of chemical reactions to create a desired molecule from simpler, buyable molecules. While previous works have proposed algorithms to find optimal solutions for a range of metrics (e.g. shortest, lowest-cost), these works generally overlook the fact that we have imperfect knowledge of the space of possible reactions, meaning plans created by algorithm… ▽ More

    Submitted 13 April, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 camera ready version (https://openreview.net/forum?id=dl0u4ODCuW). 58 pages total. Code available at: https://github.com/AustinT/retro-fallback-iclr24. This version has 1) updated writing 2) updated figures 3) additional experimental results 4) more complete explanation of AND/OR graphs in the appendices 5) correct typos + error in fig G.5 caption

  4. arXiv:2305.03041  [pdf, other

    cs.LG q-bio.QM

    Are VAEs Bad at Reconstructing Molecular Graphs?

    Authors: Hagen Muenkler, Hubert Misztela, Michal Pikusa, Marwin Segler, Nadine Schneider, Krzysztof Maziarz

    Abstract: Many contemporary generative models of molecules are variational auto-encoders of molecular graphs. One term in their training loss pertains to reconstructing the input, yet reconstruction capabilities of state-of-the-art models have not yet been thoroughly compared on a large and chemically diverse dataset. In this work, we show that when several state-of-the-art generative models are evaluated u… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: Published at the ELLIS Workshop on Machine Learning for Molecules (ML4Molecules 2022)

  5. arXiv:2301.13755  [pdf, other

    cs.AI cs.LG

    Retrosynthetic Planning with Dual Value Networks

    Authors: Guoqing Liu, Di Xue, Shufang Xie, Yingce Xia, Austin Tripp, Krzysztof Maziarz, Marwin Segler, Tao Qin, Zongzhang Zhang, Tie-Yan Liu

    Abstract: Retrosynthesis, which aims to find a route to synthesize a target molecule from commercially available starting materials, is a critical task in drug discovery and materials design. Recently, the combination of ML-based single-step reaction predictors with multi-step planners has led to promising results. However, the single-step predictors are mostly trained offline to optimize the single-step ac… ▽ More

    Submitted 3 March, 2024; v1 submitted 31 January, 2023; originally announced January 2023.

    Comments: Accepted to ICML 2023

  6. arXiv:2105.02856  [pdf, other

    cs.PL cs.DS

    Hashing Modulo Alpha-Equivalence

    Authors: Krzysztof Maziarz, Tom Ellis, Alan Lawrence, Andrew Fitzgibbon, Simon Peyton Jones

    Abstract: In many applications one wants to identify identical subtrees of a program syntax tree. This identification should ideally be robust to alpha-renaming of the program, but no existing technique has been shown to achieve this with good efficiency (better than $\mathcal{O}(n^2)$ in expression size). We present a new, asymptotically efficient way to hash modulo alpha-equivalence. A key insight of our… ▽ More

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: Accepted for publication at the 42nd ACM SIGPLAN International Conference on Programming Language Design and Implementation (PLDI 2021)

  7. arXiv:2104.13915  [pdf, other

    cs.CV cs.LG

    Deep Learning for Rheumatoid Arthritis: Joint Detection and Damage Scoring in X-rays

    Authors: Krzysztof Maziarz, Anna Krason, Zbigniew Wojna

    Abstract: Recent advancements in computer vision promise to automate medical image analysis. Rheumatoid arthritis is an autoimmune disease that would profit from computer-based diagnosis, as there are no direct markers known, and doctors have to rely on manual inspection of X-ray images. In this work, we present a multi-task deep learning model that simultaneously learns to localize joints on X-ray images a… ▽ More

    Submitted 4 November, 2022; v1 submitted 28 April, 2021; originally announced April 2021.

    Comments: Presented at the Workshop on AI for Public Health at ICLR 2021

  8. arXiv:2103.03864  [pdf, other

    cs.LG q-bio.QM

    Learning to Extend Molecular Scaffolds with Structural Motifs

    Authors: Krzysztof Maziarz, Henry Jackson-Flux, Pashmina Cameron, Finton Sirockin, Nadine Schneider, Nikolaus Stiefl, Marwin Segler, Marc Brockschmidt

    Abstract: Recent advancements in deep learning-based modeling of molecules promise to accelerate in silico drug discovery. A plethora of generative models is available, building molecules either atom-by-atom and bond-by-bond or fragment-by-fragment. However, many drug discovery projects require a fixed scaffold to be present in the generated molecule, and incorporating that constraint has only recently been… ▽ More

    Submitted 12 May, 2024; v1 submitted 5 March, 2021; originally announced March 2021.

    Comments: Published at the 10th International Conference on Learning Representations (ICLR 2022)

  9. arXiv:2008.10041  [pdf, other

    cs.CV cs.LG

    Holistic Multi-View Building Analysis in the Wild with Projection Pooling

    Authors: Zbigniew Wojna, Krzysztof Maziarz, Łukasz Jocz, Robert Pałuba, Robert Kozikowski, Iasonas Kokkinos

    Abstract: We address six different classification tasks related to fine-grained building attributes: construction type, number of floors, pitch and geometry of the roof, facade material, and occupancy class. Tackling such a remote building analysis problem became possible only recently due to growing large-scale datasets of urban scenes. To this end, we introduce a new benchmarking dataset, consisting of 49… ▽ More

    Submitted 19 December, 2020; v1 submitted 23 August, 2020; originally announced August 2020.

    Comments: Accepted for publication at the 35th AAAI Conference on Artificial Intelligence (AAAI 2021)

  10. arXiv:1910.04915  [pdf, other

    cs.LG stat.ML

    Flexible Multi-task Networks by Learning Parameter Allocation

    Authors: Krzysztof Maziarz, Efi Kokiopoulou, Andrea Gesmundo, Luciano Sbaiz, Gabor Bartok, Jesse Berent

    Abstract: This paper proposes a novel learning method for multi-task applications. Multi-task neural networks can learn to transfer knowledge across different tasks by using parameter sharing. However, sharing parameters between unrelated tasks can hurt performance. To address this issue, we propose a framework to learn fine-grained patterns of parameter sharing. Assuming that the network is composed of sev… ▽ More

    Submitted 18 July, 2020; v1 submitted 10 October, 2019; originally announced October 2019.

  11. arXiv:1811.09828  [pdf, other

    cs.LG cs.NE stat.ML

    Evolutionary-Neural Hybrid Agents for Architecture Search

    Authors: Krzysztof Maziarz, Mingxing Tan, Andrey Khorlin, Marin Georgiev, Andrea Gesmundo

    Abstract: Neural Architecture Search has shown potential to automate the design of neural networks. Deep Reinforcement Learning based agents can learn complex architectural patterns, as well as explore a vast and compositional search space. On the other hand, evolutionary algorithms offer higher sample efficiency, which is critical for such a resource intensive application. In order to capture the best of b… ▽ More

    Submitted 15 February, 2020; v1 submitted 24 November, 2018; originally announced November 2018.

  12. The Slow-coloring Game on Sparse Graphs: $k$-Degenerate, Planar, and Outerplanar

    Authors: Grzegorz Gutowski, Tomasz Krawczyk, Krzysztof Maziarz, Douglas B. West, Michał Zając, Xuding Zhu

    Abstract: The \emph{slow-coloring game} is played by Lister and Painter on a graph $G$. Initially, all vertices of $G$ are uncolored. In each round, Lister marks a nonempty set $M$ of uncolored vertices, and Painter colors a subset of $M$ that is independent in $G$. The game ends when all vertices are colored. The score of the game is the sum of the sizes of all sets marked by Lister. The goal of Painter is… ▽ More

    Submitted 15 September, 2018; v1 submitted 20 January, 2018; originally announced January 2018.

    Comments: 15 pages, 3 figures

  13. arXiv:1701.06538  [pdf, other

    cs.LG cs.CL cs.NE stat.ML

    Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer

    Authors: Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, Jeff Dean

    Abstract: The capacity of a neural network to absorb information is limited by its number of parameters. Conditional computation, where parts of the network are active on a per-example basis, has been proposed in theory as a way of dramatically increasing model capacity without a proportional increase in computation. In practice, however, there are significant algorithmic and performance challenges. In this… ▽ More

    Submitted 23 January, 2017; originally announced January 2017.