Skip to main content

Showing 1–11 of 11 results for author: Wendler, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.10588  [pdf, other

    cs.CL cs.CY

    Do Llamas Work in English? On the Latent Language of Multilingual Transformers

    Authors: Chris Wendler, Veniamin Veselovsky, Giovanni Monea, Robert West

    Abstract: We ask whether multilingual language models trained on unbalanced, English-dominated corpora use English as an internal pivot language -- a question of key importance for understanding how language models function and the origins of linguistic bias. Focusing on the Llama-2 family of transformer models, our study uses carefully constructed non-English prompts with a unique correct single-token cont… ▽ More

    Submitted 8 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: 12 pages. 28 with appendix

  2. arXiv:2401.09967  [pdf, other

    cs.CL

    Sketch-Guided Constrained Decoding for Boosting Blackbox Large Language Models without Logit Access

    Authors: Saibo Geng, Berkay Döner, Chris Wendler, Martin Josifoski, Robert West

    Abstract: Constrained decoding, a technique for enforcing constraints on language model outputs, offers a way to control text generation without retraining or architectural modifications. Its application is, however, typically restricted to models that give users access to next-token distributions (usually via softmax logits), which poses a limitation with blackbox large language models (LLMs). This paper i… ▽ More

    Submitted 2 July, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: Accepted to ACL 2024 Main Conference

  3. arXiv:2308.15395  [pdf, other

    cs.LG q-bio.MN q-bio.QM

    The CausalBench challenge: A machine learning contest for gene network inference from single-cell perturbation data

    Authors: Mathieu Chevalley, Jacob Sackett-Sanders, Yusuf Roohani, Pascal Notin, Artemy Bakulin, Dariusz Brzezinski, Kaiwen Deng, Yuanfang Guan, Justin Hong, Michael Ibrahim, Wojciech Kotlowski, Marcin Kowiel, Panagiotis Misiakos, Achille Nazaret, Markus Püschel, Chris Wendler, Arash Mehrjou, Patrick Schwab

    Abstract: In drug discovery, map** interactions between genes within cellular systems is a crucial early step. This helps formulate hypotheses regarding molecular mechanisms that could potentially be targeted by future medicines. The CausalBench Challenge was an initiative to invite the machine learning community to advance the state of the art in constructing gene-gene interaction networks. These network… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  4. arXiv:2305.15936  [pdf, other

    cs.LG cs.AI stat.ME

    Learning DAGs from Data with Few Root Causes

    Authors: Panagiotis Misiakos, Chris Wendler, Markus Püschel

    Abstract: We present a novel perspective and algorithm for learning directed acyclic graphs (DAGs) from data generated by a linear structural equation model (SEM). First, we show that a linear SEM can be viewed as a linear transform that, in prior work, computes the data from a dense input vector of random valued root causes (as we will call them) associated with the nodes. Instead, we consider the case of… ▽ More

    Submitted 23 January, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: to be published in 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

    Journal ref: NeurIPS 2023

  5. arXiv:2209.07970  [pdf, other

    eess.SP cs.DM cs.LG math.CO stat.ME

    Causal Fourier Analysis on Directed Acyclic Graphs and Posets

    Authors: Bastian Seifert, Chris Wendler, Markus Püschel

    Abstract: We present a novel form of Fourier analysis, and associated signal processing concepts, for signals (or data) indexed by edge-weighted directed acyclic graphs (DAGs). This means that our Fourier basis yields an eigendecomposition of a suitable notion of shift and convolution operators that we define. DAGs are the common model to capture causal relationships between data values and in this case our… ▽ More

    Submitted 9 August, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: 13 pages, 11 figures

  6. arXiv:2202.04910  [pdf, other

    cs.LG math.OC

    Instance-wise algorithm configuration with graph neural networks

    Authors: Romeo Valentin, Claudio Ferrari, Jérémy Scheurer, Andisheh Amrollahi, Chris Wendler, Max B. Paulus

    Abstract: We present our submission for the configuration task of the Machine Learning for Combinatorial Optimization (ML4CO) NeurIPS 2021 competition. The configuration task is to predict a good configuration of the open-source solver SCIP to solve a mixed integer linear program (MILP) efficiently. We pose this task as a supervised learning problem: First, we compile a large dataset of the solver performan… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: 5 pages, 3 figures

  7. arXiv:2012.04358  [pdf, other

    cs.IT cs.SI eess.SP

    Discrete Signal Processing on Meet/Join Lattices

    Authors: Markus Püschel, Bastian Seifert, Chris Wendler

    Abstract: A lattice is a partially ordered set supporting a meet (or join) operation that returns the largest lower bound (smallest upper bound) of two elements. Just like graphs, lattices are a fundamental structure that occurs across domains including social data analysis, natural language processing, computational chemistry and biology, and database theory. In this paper we introduce discrete-lattice sig… ▽ More

    Submitted 6 July, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: 13 pages

    Journal ref: IEEE Transactions on Signal Processing, Vol. 69, pp. 3571-3584, 2021

  8. arXiv:2010.00439  [pdf, other

    cs.LG cs.AI cs.DM eess.SP stat.ML

    Learning Set Functions that are Sparse in Non-Orthogonal Fourier Bases

    Authors: Chris Wendler, Andisheh Amrollahi, Bastian Seifert, Andreas Krause, Markus Püschel

    Abstract: Many applications of machine learning on discrete domains, such as learning preference functions in recommender systems or auctions, can be reduced to estimating a set function that is sparse in the Fourier domain. In this work, we present a new family of algorithms for learning Fourier-sparse set functions. They require at most $nk - k \log_2 k + k$ queries (set function evaluations), under mild… ▽ More

    Submitted 29 March, 2021; v1 submitted 1 October, 2020; originally announced October 2020.

    Journal ref: Proc. AAAI, 2021

  9. Fourier Analysis-based Iterative Combinatorial Auctions

    Authors: Jakob Weissteiner, Chris Wendler, Sven Seuken, Ben Lubin, Markus Püschel

    Abstract: Recent advances in Fourier analysis have brought new tools to efficiently represent and learn set functions. In this paper, we bring the power of Fourier analysis to the design of combinatorial auctions (CAs). The key idea is to approximate bidders' value functions using Fourier-sparse set functions, which can be computed using a relatively small number of queries. Since this number is still too l… ▽ More

    Submitted 11 March, 2023; v1 submitted 22 September, 2020; originally announced September 2020.

    Journal ref: Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence Main Track (2022). Pages 549-556

  10. arXiv:2001.10290  [pdf, other

    cs.IT cs.LG eess.SP

    Discrete Signal Processing with Set Functions

    Authors: Markus Püschel, Chris Wendler

    Abstract: Set functions are functions (or signals) indexed by the powerset (set of all subsets) of a finite set N. They are fundamental and ubiquitous in many application domains and have been used, for example, to formally describe or quantify loss functions for semantic image segmentation, the informativeness of sensors in sensor networks the utility of sets of items in recommender systems, cooperative ga… ▽ More

    Submitted 22 October, 2020; v1 submitted 28 January, 2020; originally announced January 2020.

    Comments: 16 pages, submitted for publication

    Journal ref: IEEE Transactions on Signal Processing, Vol. 69, pp. 1039-1053, 2021

  11. arXiv:1909.02253  [pdf, other

    cs.LG stat.ML

    Powerset Convolutional Neural Networks

    Authors: Chris Wendler, Dan Alistarh, Markus Püschel

    Abstract: We present a novel class of convolutional neural networks (CNNs) for set functions, i.e., data indexed with the powerset of a finite set. The convolutions are derived as linear, shift-equivariant functions for various notions of shifts on set functions. The framework is fundamentally different from graph convolutions based on the Laplacian, as it provides not one but several basic shifts, one for… ▽ More

    Submitted 15 January, 2020; v1 submitted 5 September, 2019; originally announced September 2019.

    Comments: Advances in Neural Information Processing Systems 32

    Journal ref: Advances in Neural Information Processing Systems, Vol. 32, pp. 927-938, 2019