Showing 1–21 of 21 results for author: Spiegel, C

  1. arXiv:2404.05509  [pdf, other]

    math.CO

    Extending the Continuum of Six-Colorings

    Authors: Konrad Mundinger, Sebastian Pokutta, Christoph Spiegel, Max Zimmer

    Abstract: We present two novel six-colorings of the Euclidean plane that avoid monochromatic pairs of points at unit distance in five colors and monochromatic pairs at another specified distance $d$ in the sixth color. Such colorings have previously been known to exist for $0.41 < \sqrt{2} - 1 \le d \le 1 / \sqrt{5} < 0.45$. Our results significantly expand that range to $0.354 \le d \le 0.657$, the first i…

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: An animated version of Figure 2 is available at https://christophspiegel.berlin/hn/fig2.pdf and can be viewed with Acrobat Reader

  2. arXiv:2312.15230  [pdf, other]

    cs.LG cs.AI

    PERP: Rethinking the Prune-Retrain Paradigm in the Era of LLMs

    Authors: Max Zimmer, Megi Andoni, Christoph Spiegel, Sebastian Pokutta

    Abstract: Neural Networks can be efficiently compressed through pruning, significantly reducing storage and computational demands while maintaining predictive performance. Simple yet effective methods like Iterative Magnitude Pruning (IMP, Han et al., 2015) remove less important parameters and require a costly retraining procedure to recover performance after pruning. However, with the rise of Large Languag…

    Submitted 13 February, 2024; v1 submitted 23 December, 2023; originally announced December 2023.

    Comments: 27 pages, 4 figures, 15 tables

  3. arXiv:2312.08049  [pdf, other]

    math.CO math.OC

    The Four-Color Ramsey Multiplicity of Triangles

    Authors: Aldo Kiem, Sebastian Pokutta, Christoph Spiegel

    Abstract: We study a generalization of a famous result of Goodman and establish that asymptotically at least a $1/256$ fraction of all triangles needs to be monochromatic in any four-coloring of the edges of a complete graph. We also show that any large enough extremal construction must be based on a blow-up of one of the two $R(3,3,3)$ Ramsey-colorings of $K_{16}$. This result is obtained through an effici…

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: 45 pages

    MSC Class: 05D10; 05C55; 05C35; 90C22

  4. arXiv:2306.16788  [pdf, other]

    cs.LG cs.AI

    Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging

    Authors: Max Zimmer, Christoph Spiegel, Sebastian Pokutta

    Abstract: Neural networks can be significantly compressed by pruning, yielding sparse models with reduced storage and computational demands while preserving predictive performance. Model soups (Wortsman et al., 2022) enhance generalization and out-of-distribution (OOD) performance by averaging the parameters of multiple models into a single one, without increasing inference time. However, achieving both spa…

    Submitted 23 March, 2024; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: ICLR24 Camera Ready, 9 pages, 5 pages references, 16 pages appendix

  5. arXiv:2304.00400  [pdf, ps, other]

    math.CO math.NT math.OC

    The Rado Multiplicity Problem in Vector Spaces over Finite Fields

    Authors: Juanjo Rué, Christoph Spiegel

    Abstract: We study an analogue of the Ramsey multiplicity problem for additive structures, in particular establishing the minimum number of monochromatic 3-APs in 3-colorings of $\mathbb{F}_3^n$ as well as obtaining the first non-trivial lower bound for the minimum number of monochromatic 4-APs in 2-colorings of $\mathbb{F}_5^n$. The former parallels results by Cummings et al. (2013) in extremal graph theory…

    Submitted 1 April, 2023; originally announced April 2023.

    Comments: 20 pages, 6 figures

    MSC Class: 05D10; 05D10; 11B25; 11B75

  6. arXiv:2206.04036  [pdf, ps, other]

    math.CO

    New Ramsey Multiplicity Bounds and Search Heuristics

    Authors: Olaf Parczyk, Sebastian Pokutta, Christoph Spiegel, Tibor Szabó

    Abstract: We study two related problems concerning the number of homogeneous subsets of given size in graphs that go back to questions of Erdős. Most notably, we improve the upper bounds on the Ramsey multiplicity of $K_4$ and $K_5$ and settle the minimum number of independent sets of size $4$ in graphs with clique number at most $4$. Motivated by the elusiveness of the symmetric Ramsey multiplicity problem…

    Submitted 15 September, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: 38 pages, 3 figures

    MSC Class: 05D10; 90C27

  7. arXiv:2205.11921  [pdf, other]

    cs.LG math.OC

    Compression-aware Training of Neural Networks using Frank-Wolfe

    Authors: Max Zimmer, Christoph Spiegel, Sebastian Pokutta

    Abstract: Many existing Neural Network pruning approaches rely on either retraining or inducing a strong bias in order to converge to a sparse solution throughout training. A third paradigm, 'compression-aware' training, aims to obtain state-of-the-art dense models that are robust to a wide range of compression ratios using a single dense training run while also avoiding retraining. We propose a framework c…

    Submitted 14 February, 2024; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: 8 pages, 5 pages references, 14 pages appendix, 8 figures, and 11 tables

  8. arXiv:2111.00843  [pdf, other]

    cs.LG

    How I Learned to Stop Worrying and Love Retraining

    Authors: Max Zimmer, Christoph Spiegel, Sebastian Pokutta

    Abstract: Many Neural Network Pruning approaches consist of several iterative training and pruning steps, seemingly losing a significant amount of their performance after pruning and then recovering it in the subsequent retraining phase. Recent works of Renda et al. (2020) and Le & Hua (2021) demonstrate the significance of the learning rate schedule during the retraining phase and propose specific heuristi…

    Submitted 12 March, 2023; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: ICLR2023 camera-ready version, 9 pages main text, 34 pages appendix, 2 tables, 3 figures in main text

  9. arXiv:2010.07243  [pdf, other]

    cs.LG math.OC

    Deep Neural Network Training with Frank-Wolfe

    Authors: Sebastian Pokutta, Christoph Spiegel, Max Zimmer

    Abstract: This paper studies the empirical efficacy and benefits of using projection-free first-order methods in the form of Conditional Gradients, a.k.a. Frank-Wolfe methods, for training Neural Networks with constrained parameters. We draw comparisons both to current state-of-the-art stochastic Gradient Descent methods as well as across different variants of stochastic Conditional Gradients. In particular…

    Submitted 21 October, 2020; v1 submitted 14 October, 2020; originally announced October 2020.

    Comments: fixed coding error in figure 1 and extended abstract; 13 pages, Abstract 11 pages, 9 figures, 6 tables

  10. arXiv:2009.14114  [pdf, other]

    math.OC cs.LG

    Projection-Free Adaptive Gradients for Large-Scale Optimization

    Authors: Cyrille W. Combettes, Christoph Spiegel, Sebastian Pokutta

    Abstract: The complexity in large-scale optimization can lie in both handling the objective function and handling the constraint set. In this respect, stochastic Frank-Wolfe algorithms occupy a unique position as they alleviate both computational burdens, by querying only approximate first-order information from the objective and by maintaining feasibility of the iterates without using projections. In this…

    Submitted 15 February, 2021; v1 submitted 29 September, 2020; originally announced September 2020.

    Comments: 28 pages, 10 figures

  11. arXiv:1912.03288  [pdf, ps, other]

    math.CO

    Independent Chains in Acyclic Posets

    Authors: Nika Salia, Christoph Spiegel, Casey Tompkins, Oscar Zamora

    Abstract: We consider the problem of determining the maximum order of an induced vertex-disjoint union of cliques in a graph. More specifically, given some family of graphs $\mathcal{G}$ of equal order, we are interested in the parameter $a(\mathcal{G}) = \min_{G \in \mathcal{G}} \max \{ |U| : U \subseteq V, G[U] \text{ is a vertex-disjoint union of cliques} \}$. We determine the value of this parameter pre…

    Submitted 10 December, 2019; v1 submitted 6 December, 2019; originally announced December 2019.

    Comments: 15 pages, 6 figures

  12. arXiv:1911.13275  [pdf, ps, other]

    math.CO math.NT

    On strong infinite Sidon and $B_h$ sets and random sets of integers

    Authors: David Fabian, Juanjo Rué, Christoph Spiegel

    Abstract: A set of integers $S \subset \mathbb{N}$ is an $α$-strong Sidon set if the pairwise sums of its elements are far apart by a certain measure depending on $α$, more specifically if $| (x+w) - (y+z) | \geq \max \{ x^α,y^α,z^α,w^α\}$ for every $x,y,z,w \in S$ satisfying $\max \{x,w\} \neq \max \{y,z\}$. We obtain a new lower bound for the growth of $α$-strong infinite Sidon sets when $0 \leq α< 1$. We…

    Submitted 6 December, 2019; v1 submitted 29 November, 2019; originally announced November 2019.

    Comments: 15 pages, fixed a calculation in the exponent on page 9

  13. An Erdős--Fuchs Theorem for Ordered Representation Functions

    Authors: Gonzalo Cao-Labora, Juanjo Rué, Christoph Spiegel

    Abstract: Let $k\geq 2$ be a positive integer. We study concentration results for the ordered representation functions $r^{\leq}_k(A,n) = \# \big\{ (a_1 \leq \dots \leq a_k) \in A^k : a_1+\dots+a_k = n \big\}$ and $r^{<}_k(A,n) = \# \big\{ (a_1 < \dots < a_k) \in A^k : a_1+\dots+a_k = n \big\}$ for any infinite set of non-negative integers $A$. Our main theorem is an Erdős--Fuchs-type result for both functi…

    Submitted 27 November, 2019; originally announced November 2019.

    Comments: 15 pages

  14. arXiv:1906.04024  [pdf, ps, other]

    math.CO

    On the Odd Cycle Game and Connected Rules

    Authors: Jan Corsten, Adva Mond, Alexey Pokrovskiy, Christoph Spiegel, Tibor Szabó

    Abstract: We study the positional game where two players, Maker and Breaker, alternately select respectively $1$ and $b$ previously unclaimed edges of $K_n$. Maker wins if she succeeds in claiming all edges of some odd cycle in $K_n$ and Breaker wins otherwise. Improving on a result of Bednarska and Pikhurko, we show that Maker wins the odd cycle game if $b \leq ((4 - \sqrt{6})/5 + o(1)) n$. We furthermore…

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: 22 pages, 3 figures

    MSC Class: 91A24; 05C57; 05C38

  15. arXiv:1811.04628  [pdf, ps, other]

    math.CO

    Another Note on Intervals in the Hales-Jewett Theorem

    Authors: Nina Kamčev, Christoph Spiegel

    Abstract: The Hales-Jewett Theorem states that any $r$-colouring of $[m]^n$ contains a monochromatic combinatorial line if $n$ is large enough. Shelah's proof of the theorem implies that for $m = 3$ there always exists a monochromatic combinatorial line whose set of active coordinates is the union of at most $r$ intervals. Conlon and Kamčev proved the existence of colourings for which it cannot be fewer th…

    Submitted 12 November, 2018; originally announced November 2018.

    Comments: 16 pages, 5 figures

    MSC Class: 05D10

  16. arXiv:1808.08455  [pdf, ps, other]

    math.NT math.CO

    Additive Volume of Sets Contained in Few Arithmetic Progressions

    Authors: Gregory A. Freiman, Oriol Serra, Christoph Spiegel

    Abstract: A conjecture of Freiman gives an exact formula for the largest volume of a finite set $A$ of integers with given cardinality $k = |A|$ and doubling $T = |2A|$. The formula is known to hold when $T \le 3k-4$, for some small range over $3k-4$ and for families of structured sets called chains. In this paper we extend the formula to sets of every dimension and prove it for sets composed of three segme…

    Submitted 25 August, 2018; originally announced August 2018.

    Comments: 16 pages

    MSC Class: 11P70; 11B75

  17. arXiv:1805.12374  [pdf, ps, other]

    math.CO math.NT

    A step beyond Freiman's theorem for set addition modulo a prime

    Authors: Pablo Candela, Oriol Serra, Christoph Spiegel

    Abstract: Freiman's 2.4-Theorem states that any set $A \subset \mathbb{Z}_p$ satisfying $|2A| \leq 2.4|A| - 3 $ and $|A| < p/35$ can be covered by an arithmetic progression of length at most $|2A| - |A| + 1$. A more general result of Green and Ruzsa implies that this covering property holds for any set satisfying $|2A| \leq 3|A| - 4$ as long as the rather strong density requirement $|A| < p/10^{215}$ is sat…

    Submitted 31 May, 2018; originally announced May 2018.

    Comments: 13 pages, 1 figure

    MSC Class: 11P70; 11B13; 05B10

  18. arXiv:1802.07597  [pdf, ps, other]

    math.CO

    On a problem of Sárközy and Sós for multivariate linear forms

    Authors: Juanjo Rué, Christoph Spiegel

    Abstract: We prove that for pairwise co-prime numbers $k_1,\dots,k_d \geq 2$ there does not exist any infinite set of positive integers $A$ such that the representation function $r_A (n) = \{ (a_1, \dots, a_d) \in A^d : k_1 a_1 + \dots + k_d a_d = n \}$ becomes constant for $n$ large enough. This result is a particular case of our main theorem, which poses a further step towards answering a question of Sárk…

    Submitted 8 October, 2018; v1 submitted 21 February, 2018; originally announced February 2018.

    Comments: Added clarifications regarding the particular notion of limit used in the first part of the paper. 11 pages

    MSC Class: 11B75; 11B13; 11B34

  19. arXiv:1711.07251  [pdf, ps, other]

    math.CO

    On the optimality of the uniform random strategy

    Authors: Christopher Kusch, Juanjo Rué, Christoph Spiegel, Tibor Szabó

    Abstract: The concept of biased Maker-Breaker games, introduced by Chvátal and Erdős, is a central topic in the field of positional games, with deep connections to the theory of random structures. For any given hypergraph ${\cal H}$ the main question is to determine the smallest bias $q({\cal H})$ that allows Breaker to force that Maker ends up with an independent set of ${\cal H}$. Here we prove matc…

    Submitted 3 August, 2018; v1 submitted 20 November, 2017; originally announced November 2017.

    Comments: 26 pages

  20. arXiv:1701.01631  [pdf, ps, other]

    math.CO

    A Note on Sparse Supersaturation and Extremal Results for Linear Homogeneous Systems

    Authors: Christoph Spiegel

    Abstract: We study the thresholds for the property of containing a solution to a linear homogeneous system in random sets. We expand a previous sparse Szemerédi-type result of Schacht to the broadest class of matrices possible. We also provide a shorter proof of a sparse Rado result of Friedgut, Rödl, Ruciński and Schacht based on a hypergraph container approach due to Nenadov and Steger. Lastly we further…

    Submitted 6 January, 2017; originally announced January 2017.

    Comments: 14 pages

  21. Threshold functions and Poisson convergence for systems of equations in random sets

    Authors: Juanjo Rué, Christoph Spiegel, Ana Zumalacárregui

    Abstract: We present a unified framework to study threshold functions for the existence of solutions to linear systems of equations in random sets which includes arithmetic progressions, sum-free sets, $B_{h}[g]$-sets and Hilbert cubes. In particular, we show that there exists a threshold function for the property "$\mathcal{A}$ contains a non-trivial solution of $M\cdot\textbf{x}=\textbf{0}$", where…

    Submitted 29 May, 2017; v1 submitted 21 December, 2012; originally announced December 2012.

    Comments: New version with minor corrections and changes in notation. 24 Pages

    Journal ref: Mathematische Zeitschrift, February 2018, Volume 288, Issue 1-2, pp. 333-360