Showing 1–24 of 24 results for author: Püschel, M

Searching in archive cs.

  1. arXiv:2308.15395  [pdf, other]

    cs.LG q-bio.MN q-bio.QM

    The CausalBench challenge: A machine learning contest for gene network inference from single-cell perturbation data

    Authors: Mathieu Chevalley, Jacob Sackett-Sanders, Yusuf Roohani, Pascal Notin, Artemy Bakulin, Dariusz Brzezinski, Kaiwen Deng, Yuanfang Guan, Justin Hong, Michael Ibrahim, Wojciech Kotlowski, Marcin Kowiel, Panagiotis Misiakos, Achille Nazaret, Markus Püschel, Chris Wendler, Arash Mehrjou, Patrick Schwab

    Abstract: In drug discovery, mapping interactions between genes within cellular systems is a crucial early step. This helps formulate hypotheses regarding molecular mechanisms that could potentially be targeted by future medicines. The CausalBench Challenge was an initiative to invite the machine learning community to advance the state of the art in constructing gene-gene interaction networks. These network…

    Submitted 29 August, 2023; originally announced August 2023.

  2. arXiv:2307.03738  [pdf, other]

    cs.LG cs.CL cs.PF

    QIGen: Generating Efficient Kernels for Quantized Inference on Large Language Models

    Authors: Tommaso Pegolotti, Elias Frantar, Dan Alistarh, Markus Püschel

    Abstract: We present ongoing work on a new automatic code generation approach for supporting quantized generative inference on LLMs such as LLaMA or OPT on off-the-shelf CPUs. Our approach is informed by the target architecture and a performance model, including both hardware characteristics and method-specific accuracy constraints. Results on CPU-based inference for LLaMA models show that our approach can…

    Submitted 7 July, 2023; originally announced July 2023.

  3. arXiv:2305.15936  [pdf, other]

    cs.LG cs.AI stat.ME

    Learning DAGs from Data with Few Root Causes

    Authors: Panagiotis Misiakos, Chris Wendler, Markus Püschel

    Abstract: We present a novel perspective and algorithm for learning directed acyclic graphs (DAGs) from data generated by a linear structural equation model (SEM). First, we show that a linear SEM can be viewed as a linear transform that, in prior work, computes the data from a dense input vector of random valued root causes (as we will call them) associated with the nodes. Instead, we consider the case of…

    Submitted 23 January, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: to be published in 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

    Journal ref: NeurIPS 2023

  4. arXiv:2211.13706  [pdf, other]

    cs.DM

    Fast Möbius and Zeta Transforms

    Authors: Tommaso Pegolotti, Bastian Seifert, Markus Püschel

    Abstract: Möbius inversion of functions on partially ordered sets (posets) $\mathcal{P}$ is a classical tool in combinatorics. For finite posets it consists of two, mutually inverse, linear transformations called zeta and Möbius transform, respectively. In this paper we provide novel fast algorithms for both that require $O(nk)$ time and space, where $n = |\mathcal{P}|$ and $k$ is the width (length of longe…

    Submitted 24 November, 2022; originally announced November 2022.

    Comments: 16 pages, 7 figures, submitted for review

    MSC Class: 06A06; 05C50; 15A04; 15B36

  5. arXiv:2209.07970  [pdf, other]

    eess.SP cs.DM cs.LG math.CO stat.ME

    Causal Fourier Analysis on Directed Acyclic Graphs and Posets

    Authors: Bastian Seifert, Chris Wendler, Markus Püschel

    Abstract: We present a novel form of Fourier analysis, and associated signal processing concepts, for signals (or data) indexed by edge-weighted directed acyclic graphs (DAGs). This means that our Fourier basis yields an eigendecomposition of a suitable notion of shift and convolution operators that we define. DAGs are the common model to capture causal relationships between data values and in this case our…

    Submitted 9 August, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: 13 pages, 11 figures

  6. arXiv:2103.03638  [pdf, other]

    cs.AI cs.LG

    PRIMA: General and Precise Neural Network Certification via Scalable Convex Hull Approximations

    Authors: Mark Niklas Müller, Gleb Makarchuk, Gagandeep Singh, Markus Püschel, Martin Vechev

    Abstract: Formal verification of neural networks is critical for their safe adoption in real-world applications. However, designing a precise and scalable verifier which can handle different activation functions, realistic network architectures and relevant specifications remains an open and difficult challenge. In this paper, we take a major step forward in addressing this challenge and present a new verif…

    Submitted 28 February, 2022; v1 submitted 5 March, 2021; originally announced March 2021.

    Comments: 29 pages, 18 figures, 6 tables

    Journal ref: Proceedings of the ACM on Programming Languages, Volume 6, Issue POPL, January 2022, Article No.: 43, pp 1-33

  7. arXiv:2012.04358  [pdf, other]

    cs.IT cs.SI eess.SP

    Discrete Signal Processing on Meet/Join Lattices

    Authors: Markus Püschel, Bastian Seifert, Chris Wendler

    Abstract: A lattice is a partially ordered set supporting a meet (or join) operation that returns the largest lower bound (smallest upper bound) of two elements. Just like graphs, lattices are a fundamental structure that occurs across domains including social data analysis, natural language processing, computational chemistry and biology, and database theory. In this paper we introduce discrete-lattice sig…

    Submitted 6 July, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: 13 pages

    Journal ref: IEEE Transactions on Signal Processing, Vol. 69, pp. 3571-3584, 2021

  8. arXiv:2010.00439  [pdf, other]

    cs.LG cs.AI cs.DM eess.SP stat.ML

    Learning Set Functions that are Sparse in Non-Orthogonal Fourier Bases

    Authors: Chris Wendler, Andisheh Amrollahi, Bastian Seifert, Andreas Krause, Markus Püschel

    Abstract: Many applications of machine learning on discrete domains, such as learning preference functions in recommender systems or auctions, can be reduced to estimating a set function that is sparse in the Fourier domain. In this work, we present a new family of algorithms for learning Fourier-sparse set functions. They require at most $nk - k \log_2 k + k$ queries (set function evaluations), under mild…

    Submitted 29 March, 2021; v1 submitted 1 October, 2020; originally announced October 2020.

    Journal ref: Proc. AAAI, 2021

  9. Fourier Analysis-based Iterative Combinatorial Auctions

    Authors: Jakob Weissteiner, Chris Wendler, Sven Seuken, Ben Lubin, Markus Püschel

    Abstract: Recent advances in Fourier analysis have brought new tools to efficiently represent and learn set functions. In this paper, we bring the power of Fourier analysis to the design of combinatorial auctions (CAs). The key idea is to approximate bidders' value functions using Fourier-sparse set functions, which can be computed using a relatively small number of queries. Since this number is still too l…

    Submitted 11 March, 2023; v1 submitted 22 September, 2020; originally announced September 2020.

    Journal ref: Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence Main Track (2022). Pages 549-556

  10. arXiv:2007.10868  [pdf, other]

    cs.LG stat.ML

    Scaling Polyhedral Neural Network Verification on GPUs

    Authors: Christoph Müller, François Serre, Gagandeep Singh, Markus Püschel, Martin Vechev

    Abstract: Certifying the robustness of neural networks against adversarial attacks is essential to their reliable adoption in safety-critical systems such as autonomous driving and medical diagnosis. Unfortunately, state-of-the-art verifiers either do not scale to bigger networks or are too imprecise to prove robustness, limiting their practical adoption. In this work, we introduce GPUPoly, a scalable verif…

    Submitted 18 May, 2021; v1 submitted 20 July, 2020; originally announced July 2020.

    Comments: Müller and Serre contributed equally to this work

    Journal ref: Proc. MLSys, 2021

  11. Digraph Signal Processing with Generalized Boundary Conditions

    Authors: Bastian Seifert, Markus Püschel

    Abstract: Signal processing on directed graphs (digraphs) is problematic, since the graph shift, and thus associated filters, are in general not diagonalizable. Furthermore, the Fourier transform in this case is now obtained from the Jordan decomposition, which may not be computable at all for large graphs. We propose a novel and general solution for this problem based on matrix perturbation theory: We desi…

    Submitted 8 February, 2021; v1 submitted 19 May, 2020; originally announced May 2020.

    Comments: 13 pages, 22 figures; final version accepted for publication in IEEE Trans. Signal Proc

    Journal ref: IEEE Transactions on Signal Processing, Vol. 69, pp. 1422-1437, 2021

  12. arXiv:2001.10290  [pdf, other]

    cs.IT cs.LG eess.SP

    Discrete Signal Processing with Set Functions

    Authors: Markus Püschel, Chris Wendler

    Abstract: Set functions are functions (or signals) indexed by the powerset (set of all subsets) of a finite set N. They are fundamental and ubiquitous in many application domains and have been used, for example, to formally describe or quantify loss functions for semantic image segmentation, the informativeness of sensors in sensor networks, the utility of sets of items in recommender systems, cooperative ga…

    Submitted 22 October, 2020; v1 submitted 28 January, 2020; originally announced January 2020.

    Comments: 16 pages, submitted for publication

    Journal ref: IEEE Transactions on Signal Processing, Vol. 69, pp. 1039-1053, 2021

  13. arXiv:1909.02253  [pdf, other]

    cs.LG stat.ML

    Powerset Convolutional Neural Networks

    Authors: Chris Wendler, Dan Alistarh, Markus Püschel

    Abstract: We present a novel class of convolutional neural networks (CNNs) for set functions, i.e., data indexed with the powerset of a finite set. The convolutions are derived as linear, shift-equivariant functions for various notions of shifts on set functions. The framework is fundamentally different from graph convolutions based on the Laplacian, as it provides not one but several basic shifts, one for…

    Submitted 15 January, 2020; v1 submitted 5 September, 2019; originally announced September 2019.

    Comments: Advances in Neural Information Processing Systems 32

    Journal ref: Advances in Neural Information Processing Systems, Vol. 32, pp. 927-938, 2019

  14. arXiv:1906.08613  [pdf, other]

    cs.MS

    Program Generation for Linear Algebra Using Multiple Layers of DSLs

    Authors: Daniele G. Spampinato, Diego Fabregat-Traver, Markus Püschel, Paolo Bientinesi

    Abstract: Numerical software in computational science and engineering often relies on highly-optimized building blocks from libraries such as BLAS and LAPACK, and while such libraries provide portable performance for a wide range of computing architectures, they still present limitations in terms of flexibility. We advocate a domain-specific program generator capable of producing library routines tailored t…

    Submitted 20 June, 2019; originally announced June 2019.

  15. arXiv:1905.00626  [pdf, other]

    cs.PF cs.LG stat.ML

    On Linear Learning with Manycore Processors

    Authors: Eliza Wszola, Celestine Mendler-Dünner, Martin Jaggi, Markus Püschel

    Abstract: A new generation of manycore processors is on the rise that offers dozens and more cores on a chip and, in a sense, fuses host processor and accelerator. In this paper we target the efficient training of generalized linear models on these machines. We propose a novel approach for achieving parallelism which we call Heterogeneous Tasks on Homogeneous Cores (HTHC). It divides the problem into multip…

    Submitted 5 February, 2020; v1 submitted 2 May, 2019; originally announced May 2019.

    Comments: Accepted to 2019 IEEE 26th International Conference on High Performance Computing, Data, and Analytics (HiPC)

    Journal ref: Proc. 2019 IEEE 26th International Conference on High Performance Computing, Data, and Analytics (HiPC) (pp. 184-194). IEEE

  16. arXiv:1805.04775  [pdf, other]

    cs.PL cs.MS

    Program Generation for Small-Scale Linear Algebra Applications

    Authors: Daniele G. Spampinato, Diego Fabregat-Traver, Paolo Bientinesi, Markus Pueschel

    Abstract: We present SLinGen, a program generation system for linear algebra. The input to SLinGen is an application expressed mathematically in a linear-algebra-inspired language (LA) that we define. LA provides basic scalar/vector/matrix additions/multiplications and higher level operations including linear systems solvers, Cholesky and LU factorizations. The output of SLinGen is performance-optimized sin…

    Submitted 12 May, 2018; originally announced May 2018.

    Comments: CGO 2018

  17. Compressive Sensing Using Iterative Hard Thresholding with Low Precision Data Representation: Theory and Applications

    Authors: Nezihe Merve Gürel, Kaan Kara, Alen Stojanov, Tyler Smith, Thomas Lemmin, Dan Alistarh, Markus Püschel, Ce Zhang

    Abstract: Modern scientific instruments produce vast amounts of data, which can overwhelm the processing ability of computer systems. Lossy compression of data is an intriguing solution, but comes with its own drawbacks, such as potential signal loss, and the need for careful optimization of the compression ratio. In this work, we focus on a setting where this problem is especially acute: compressive sensin…

    Submitted 22 December, 2020; v1 submitted 13 February, 2018; originally announced February 2018.

    Comments: 19 pages, 5 figures, 1 table, in IEEE Transactions on Signal Processing Vol. 68, No. 7, pp. 4268-4282, 2020

  18. arXiv:1710.08029  [pdf, other]

    cs.DS cs.DM

    Characterizing and Enumerating Walsh-Hadamard Transform Algorithms

    Authors: François Serre, Markus Püschel

    Abstract: We propose a way of characterizing the algorithms computing a Walsh-Hadamard transform that consist of a sequence of arrays of butterflies ($I_{2^{n-1}}\otimes \text{DFT}_2$) interleaved by linear permutations. Linear permutations are those that map linearly the binary representation of its element indices. We also propose a method to enumerate these algorithms.

    Submitted 22 October, 2017; originally announced October 2017.

    MSC Class: 65T50

  19. Distributed Optimization With Local Domains: Applications in MPC and Network Flows

    Authors: João F. C. Mota, João M. F. Xavier, Pedro M. Q. Aguiar, Markus Püschel

    Abstract: In this paper we consider a network with $P$ nodes, where each node has exclusive access to a local cost function. Our contribution is a communication-efficient distributed algorithm that finds a vector $x^\star$ minimizing the sum of all the functions. We make the additional assumption that the functions have intersecting local domains, i.e., each function depends only on some components of the v…

    Submitted 8 May, 2013; originally announced May 2013.

    Comments: Submitted to IEEE Trans. Aut. Control

  20. arXiv:1009.1128  [pdf, ps, other]

    math.OC cs.IT eess.SY

    Distributed Basis Pursuit

    Authors: João F. C. Mota, João M. F. Xavier, Pedro M. Q. Aguiar, Markus Püschel

    Abstract: We propose a distributed algorithm for solving the optimization problem Basis Pursuit (BP). BP finds the least L1-norm solution of the underdetermined linear system Ax = b and is used, for example, in compressed sensing for reconstruction. Our algorithm solves BP on a distributed platform such as a sensor network, and is designed to minimize the communication between nodes. The algorithm only requ…

    Submitted 14 March, 2012; v1 submitted 6 September, 2010; originally announced September 2010.

    Comments: Preprint of the journal version of the paper; IEEE Transactions on Signal Processing, Vol. 60, Issue 4, April, 2012

  21. arXiv:1008.2972  [pdf, ps, other]

    cs.IT math.RA

    Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for Polynomial Transforms Based on Induction

    Authors: Aliaksei Sandryhaila, Jelena Kovacevic, Markus Pueschel

    Abstract: A polynomial transform is the multiplication of an input vector $x\in\mathbb{C}^n$ by a matrix $\mathrm{PT}_{b,\alpha}\in\mathbb{C}^{n\times n},$ whose $(k,\ell)$-th element is defined as $p_\ell(\alpha_k)$ for polynomials $p_\ell(x)\in\mathbb{C}[x]$ from a list $b=\{p_0(x),\dots,p_{n-1}(x)\}$ and sample points $\alpha_k\in\mathbb{C}$ from a list $\alpha=\{\alpha_0,\dots,\alpha_{n-1}\}$. Such transforms find applications in the areas of signal processing, data compr…

    Submitted 17 August, 2010; originally announced August 2010.

    Comments: 19 pages. Submitted to SIAM Journal on Matrix Analysis and Applications

    Journal ref: SIAM J. Matrix Analysis and Appl. 32 (2) pp. 364-384, 2011

  22. Algebraic Signal Processing Theory: Cooley-Tukey Type Algorithms for DCTs and DSTs

    Authors: Markus Pueschel, Jose M. F. Moura

    Abstract: This paper presents a systematic methodology based on the algebraic theory of signal processing to classify and derive fast algorithms for linear transforms. Instead of manipulating the entries of transform matrices, our approach derives the algorithms by stepwise decomposition of the associated signal models, or polynomial algebras. This decomposition is based on two generic methods or algebrai…

    Submitted 4 February, 2007; originally announced February 2007.

    Comments: 31 pages, more information at http://www.ece.cmu.edu/~smart

    ACM Class: F.2.1

    Journal ref: IEEE Transactions on Signal Processing, Vol. 56, No. 4, pp. 1502-1521, 2008

  23. Algebraic Signal Processing Theory

    Authors: Markus Püschel, José M. F. Moura

    Abstract: This paper presents an algebraic theory of linear signal processing. At the core of algebraic signal processing is the concept of a linear signal model defined as a triple (A, M, phi), where familiar concepts like the filter space and the signal space are cast as an algebra A and a module M, respectively, and phi generalizes the concept of the z-transform to bijective linear mappings from a vector…

    Submitted 14 November, 2019; v1 submitted 15 December, 2006; originally announced December 2006.

    Comments: 67 pages. Parts have been published in updated form as shown below. Section XV contains initial ideas for discrete signal processing on graphs. See also https://www.archive.ece.cmu.edu/~smart/ and newer work in https://acl.inf.ethz.ch/research/ASP/

    ACM Class: E.4

    Journal ref: Algebraic Signal Processing Theory: Foundation and 1-D Time & Algebraic Signal Processing Theory: 1-D Space, both IEEE Trans. SP, 56(8), 2008; Algebraic Signal Processing Theory: 1-D Nearest-Neighbor Models in IEEE Trans. SP, 60(5), 2012

  24. arXiv:quant-ph/9807064  [pdf, ps, other]

    quant-ph cs.ET

    Fast Quantum Fourier Transforms for a Class of Non-abelian Groups

    Authors: Markus Pueschel, Martin Roetteler, Thomas Beth

    Abstract: An algorithm is presented allowing the construction of fast Fourier transforms for any solvable group on a classical computer. The special structure of the recursion formula being the core of this algorithm makes it a good starting point to obtain systematically fast Fourier transforms for solvable groups on a quantum computer. The inherent structure of the Hilbert space imposed by the qubit arc…

    Submitted 22 July, 1998; originally announced July 1998.

    Comments: 16 pages, LaTeX2e

    Journal ref: Proceedings 13th International Symposium on Applied Algebra, Algebraic Algorithms and Error-Correcting Codes (AAECC'99), Honolulu, Hawaii, Springer LNCS, pp. 148-159, 1999