Search | arXiv e-print repository

Apportionment with Weighted Seats

Authors: Julian Chingoma, Ulle Endriss, Ronald de Haan, Adrian Haret, Jan Maly

Abstract: Apportionment is the task of assigning resources to entities with different entitlements in a fair manner, and specifically a manner that is as proportional as possible. The best-known application concerns the assignment of parliamentary seats to political parties based on their share in the popular vote. Here we enrich the standard model of apportionment by associating each seat with a weight tha… ▽ More Apportionment is the task of assigning resources to entities with different entitlements in a fair manner, and specifically a manner that is as proportional as possible. The best-known application concerns the assignment of parliamentary seats to political parties based on their share in the popular vote. Here we enrich the standard model of apportionment by associating each seat with a weight that reflects the value of that seat, for example because seats come with different roles, such as chair or treasurer, that have different (objective) values. We define several apportionment methods and natural fairness requirements for this new setting, and study the extent to which our methods satisfy our requirements. Our findings show that full fairness is harder to achieve than in the standard apportionment setting. At the same time, for relaxations of those requirements we can achieve stronger results than in the more general model of weighted fair division, where the values of objects are subjective. △ Less

Submitted 28 May, 2024; originally announced May 2024.

arXiv:2404.04694 [pdf, ps, other]

Maximal noncompactness of embeddings into Marcinkiewicz spaces

Authors: Jan Malý, Zdeněk Mihula, Vít Musil, Luboš Pick

Abstract: We develop a new functional-analytic technique for investigating the degree of noncompactness of an operator defined on a quasinormed space and taking values in a Marcinkiewicz space. The main result is a general principle from which it can be derived that such operators are almost always maximally noncompact in the sense that their ball measure of noncompactness coincides with their operator norm… ▽ More We develop a new functional-analytic technique for investigating the degree of noncompactness of an operator defined on a quasinormed space and taking values in a Marcinkiewicz space. The main result is a general principle from which it can be derived that such operators are almost always maximally noncompact in the sense that their ball measure of noncompactness coincides with their operator norm. We point out specifications of the universal principle to the case of the identity operator. △ Less

Submitted 6 April, 2024; originally announced April 2024.

Comments: 22 pages

MSC Class: 41A46; 46E35; 46E30

arXiv:2402.05895 [pdf, other]

Combining Voting and Abstract Argumentation to Understand Online Discussions

Authors: Michael Bernreiter, Jan Maly, Oliviero Nardi, Stefan Woltran

Abstract: Online discussion platforms are a vital part of the public discourse in a deliberative democracy. However, how to interpret the outcomes of the discussions on these platforms is often unclear. In this paper, we propose a novel and explainable method for selecting a set of most representative, consistent points of view by combining methods from computational social choice and abstract argumentation… ▽ More Online discussion platforms are a vital part of the public discourse in a deliberative democracy. However, how to interpret the outcomes of the discussions on these platforms is often unclear. In this paper, we propose a novel and explainable method for selecting a set of most representative, consistent points of view by combining methods from computational social choice and abstract argumentation. Specifically, we model online discussions as abstract argumentation frameworks combined with information regarding which arguments voters approve of. Based on ideas from approval-based multiwinner voting, we introduce several voting rules for selecting a set of preferred extensions that represents voters' points of view. We compare the proposed methods across several dimensions, theoretically and in numerical simulations, and give clear suggestions on which methods to use depending on the specific situation. △ Less

Submitted 8 February, 2024; originally announced February 2024.

Comments: 33 pages. Extended version of an accepted AAMAS-24 paper

arXiv:2311.06132 [pdf, ps, other]

The core of an approval-based PB instance can be empty for nearly all cost-based satisfaction functions and for the share

Authors: Jan Maly

Abstract: The core is a strong fairness notion in multiwinner voting and participatory budgeting (PB). It is known that the core can be empty if we consider cardinal utilities, but it is not known whether it is always satisfiable with approval-ballots. In this short note, I show that in approval-based PB the core can be empty for nearly all satisfaction functions that are based on the cost of a project. In… ▽ More The core is a strong fairness notion in multiwinner voting and participatory budgeting (PB). It is known that the core can be empty if we consider cardinal utilities, but it is not known whether it is always satisfiable with approval-ballots. In this short note, I show that in approval-based PB the core can be empty for nearly all satisfaction functions that are based on the cost of a project. In particular, I show that the core can be empty for the cost satisfaction function, satisfaction functions based on diminishing marginal returns and the share. However, it remains open whether the core can be empty for the cardinality satisfaction function. △ Less

Submitted 10 November, 2023; originally announced November 2023.

arXiv:2310.08194 [pdf, other]

Free-Riding in Multi-Issue Decisions

Authors: Martin Lackner, Jan Maly, Oliviero Nardi

Abstract: Voting in multi-issue domains allows for compromise outcomes that satisfy all voters to some extent. Such fairness considerations, however, open the possibility of a special form of manipulation: free-riding. By untruthfully opposing a popular opinion in one issue, voters can receive increased consideration in other issues. We study under which conditions this is possible. Additionally, we study f… ▽ More Voting in multi-issue domains allows for compromise outcomes that satisfy all voters to some extent. Such fairness considerations, however, open the possibility of a special form of manipulation: free-riding. By untruthfully opposing a popular opinion in one issue, voters can receive increased consideration in other issues. We study under which conditions this is possible. Additionally, we study free-riding from a computational and experimental point of view. Our results show that free-riding in multi-issue domains is largely unavoidable, but comes at a non-negligible individual risk for voters. Thus, the allure of free-riding is smaller than one could intuitively assume. △ Less

Submitted 12 October, 2023; originally announced October 2023.

Comments: This is an extended version of the publication with the same name in the proceedings of AAMAS 2023

arXiv:2308.04921 [pdf, other]

How to induce regularization in linear models: A guide to reparametrizing gradient flow

Authors: Hung-Hsu Chou, Johannes Maly, Dominik Stöger

Abstract: In this work, we analyze the relation between reparametrizations of gradient flow and the induced implicit bias in linear models, which encompass various basic regression tasks. In particular, we aim at understanding the influence of the model parameters - reparametrization, loss, and link function - on the convergence behavior of gradient flow. Our results provide conditions under which the impli… ▽ More In this work, we analyze the relation between reparametrizations of gradient flow and the induced implicit bias in linear models, which encompass various basic regression tasks. In particular, we aim at understanding the influence of the model parameters - reparametrization, loss, and link function - on the convergence behavior of gradient flow. Our results provide conditions under which the implicit bias can be well-described and convergence of the flow is guaranteed. We furthermore show how to use these insights for designing reparametrization functions that lead to specific implicit biases which are closely connected to $\ell_p$- or trigonometric regularizers. △ Less

Submitted 6 March, 2024; v1 submitted 9 August, 2023; originally announced August 2023.

arXiv:2307.12613 [pdf, other]

Tuning-free one-bit covariance estimation using data-driven dithering

Authors: Sjoerd Dirksen, Johannes Maly

Abstract: We consider covariance estimation of any subgaussian distribution from finitely many i.i.d. samples that are quantized to one bit of information per entry. Recent work has shown that a reliable estimator can be constructed if uniformly distributed dithers on $[-λ,λ]$ are used in the one-bit quantizer. This estimator enjoys near-minimax optimal, non-asymptotic error estimates in the operator and Fr… ▽ More We consider covariance estimation of any subgaussian distribution from finitely many i.i.d. samples that are quantized to one bit of information per entry. Recent work has shown that a reliable estimator can be constructed if uniformly distributed dithers on $[-λ,λ]$ are used in the one-bit quantizer. This estimator enjoys near-minimax optimal, non-asymptotic error estimates in the operator and Frobenius norms if $λ$ is chosen proportional to the largest variance of the distribution. However, this quantity is not known a-priori, and in practice $λ$ needs to be carefully tuned to achieve good performance. In this work we resolve this problem by introducing a tuning-free variant of this estimator, which replaces $λ$ by a data-driven quantity. We prove that this estimator satisfies the same non-asymptotic error estimates - up to small (logarithmic) losses and a slightly worse probability estimate. We also show that by using refined data-driven dithers that vary per entry of each sample, one can construct an estimator satisfying the same estimation error bound as the sample covariance of the samples before quantization -- again up logarithmic losses. Our proofs rely on a new version of the Burkholder-Rosenthal inequalities for matrix martingales, which is expected to be of independent interest. △ Less

Submitted 12 January, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

arXiv:2306.04961 [pdf, other]

Recovering Simultaneously Structured Data via Non-Convex Iteratively Reweighted Least Squares

Authors: Christian Kümmerle, Johannes Maly

Abstract: We propose a new algorithm for the problem of recovering data that adheres to multiple, heterogeneous low-dimensional structures from linear observations. Focusing on data matrices that are simultaneously row-sparse and low-rank, we propose and analyze an iteratively reweighted least squares (IRLS) algorithm that is able to leverage both structures. In particular, it optimizes a combination of non… ▽ More We propose a new algorithm for the problem of recovering data that adheres to multiple, heterogeneous low-dimensional structures from linear observations. Focusing on data matrices that are simultaneously row-sparse and low-rank, we propose and analyze an iteratively reweighted least squares (IRLS) algorithm that is able to leverage both structures. In particular, it optimizes a combination of non-convex surrogates for row-sparsity and rank, a balancing of which is built into the algorithm. We prove locally quadratic convergence of the iterates to a simultaneously structured data matrix in a regime of minimal sample complexity (up to constants and a logarithmic factor), which is known to be impossible for a combination of convex surrogates. In experiments, we show that the IRLS method exhibits favorable empirical convergence, identifying simultaneously row-sparse and low-rank matrices from fewer measurements than state-of-the-art methods. Code is available at https://github.com/ckuemmerle/simirls. △ Less

Submitted 18 January, 2024; v1 submitted 8 June, 2023; originally announced June 2023.

Comments: 35 pages, 7 figures

arXiv:2303.00621 [pdf, ps, other]

The (Computational) Social Choice Take on Indivisible Participatory Budgeting

Authors: Simon Rey, Jan Maly

Abstract: In this survey, we review the literature investigating participatory budgeting as a social choice problem. Participatory Budgeting (PB) is a democratic process in which citizens are asked to vote on how to allocate a given amount of public money to a set of projects. From a social choice perspective, it corresponds then to the problem of aggregating opinions about which projects should be funded,… ▽ More In this survey, we review the literature investigating participatory budgeting as a social choice problem. Participatory Budgeting (PB) is a democratic process in which citizens are asked to vote on how to allocate a given amount of public money to a set of projects. From a social choice perspective, it corresponds then to the problem of aggregating opinions about which projects should be funded, into a budget allocation satisfying a budget constraint. This problem has received substantial attention in recent years and the literature is growing at a fast pace. In this survey, we present the most important research directions from the literature, each time presenting a large set of representative results. We only focus on the indivisible case, that is, PB problems in which projects can either be fully funded or not at all. The aim of the survey is to present a comprehensive overview of the state of the research on PB. We aim at providing both a general overview of the main research questions that are being investigated, and formal and unified definitions of the most important technical concepts from the literature. △ Less

Submitted 23 August, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

arXiv:2302.03672 [pdf, ps, other]

Proportionality in Approval-Based Participatory Budgeting

Authors: Markus Brill, Stefan Forster, Martin Lackner, Jan Maly, Jannik Peters

Abstract: The ability to measure the satisfaction of (groups of) voters is a crucial prerequisite for formulating proportionality axioms in approval-based participatory budgeting elections. Two common - but very different - ways to measure the satisfaction of a voter consider (i) the number of approved projects and (ii) the total cost of approved projects, respectively. In general, it is difficult to decide… ▽ More The ability to measure the satisfaction of (groups of) voters is a crucial prerequisite for formulating proportionality axioms in approval-based participatory budgeting elections. Two common - but very different - ways to measure the satisfaction of a voter consider (i) the number of approved projects and (ii) the total cost of approved projects, respectively. In general, it is difficult to decide which measure of satisfaction best reflects the voters' true utilities. In this paper, we study proportionality axioms with respect to large classes of approval-based satisfaction functions. We establish logical implications among our axioms and related notions from the literature, and we ask whether outcomes can be achieved that are proportional with respect to more than one satisfaction function. We show that this is impossible for the two commonly used satisfaction functions when considering proportionality notions based on extended justified representation, but achievable for a notion based on proportional justified representation. For the latter result, we introduce a strengthening of priceability and show that it is satisfied by several polynomial-time computable rules, including the Method of Equal Shares and Phragmèn's sequential rule. △ Less

Submitted 18 October, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

arXiv:2301.04641 [pdf, other]

Plug-in Channel Estimation with Dithered Quantized Signals in Spatially Non-Stationary Massive MIMO Systems

Authors: Tianyu Yang, Johannes Maly, Sjoerd Dirksen, Giuseppe Caire

Abstract: As the array dimension of massive MIMO systems increases to unprecedented levels, two problems occur. First, the spatial stationarity assumption along the antenna elements is no longer valid. Second, the large array size results in an unacceptably high power consumption if high-resolution analog-to-digital converters are used. To address these two challenges, we consider a Bussgang linear minimum… ▽ More As the array dimension of massive MIMO systems increases to unprecedented levels, two problems occur. First, the spatial stationarity assumption along the antenna elements is no longer valid. Second, the large array size results in an unacceptably high power consumption if high-resolution analog-to-digital converters are used. To address these two challenges, we consider a Bussgang linear minimum mean square error (BLMMSE)-based channel estimator for large scale massive MIMO systems with one-bit quantizers and a spatially non-stationary channel. Whereas other works usually assume that the channel covariance is known at the base station, we consider a plug-in BLMMSE estimator that uses an estimate of the channel covariance and rigorously analyze the distortion produced by using an estimated, rather than the true, covariance. To cope with the spatial non-stationarity, we introduce dithering into the quantized signals and provide a theoretical error analysis. In addition, we propose an angular domain fitting procedure which is based on solving an instance of non-negative least squares. For the multi-user data transmission phase, we further propose a BLMMSE-based receiver to handle one-bit quantized data signals. Our numerical results show that the performance of the proposed BLMMSE channel estimator is very close to the oracle-aided scheme with ideal knowledge of the channel covariance matrix. The BLMMSE receiver outperforms the conventional maximum-ratio-combining and zero-forcing receivers in terms of the resulting ergodic sum rate. △ Less

Submitted 24 January, 2024; v1 submitted 11 January, 2023; originally announced January 2023.

Comments: submitted to IEEE Transactions on Communications

arXiv:2209.03487 [pdf, ps, other]

A simple approach for quantizing neural networks

Authors: Johannes Maly, Rayan Saab

Abstract: In this short note, we propose a new method for quantizing the weights of a fully trained neural network. A simple deterministic pre-processing step allows us to quantize network layers via memoryless scalar quantization while preserving the network performance on given training data. On one hand, the computational complexity of this pre-processing slightly exceeds that of state-of-the-art algorit… ▽ More In this short note, we propose a new method for quantizing the weights of a fully trained neural network. A simple deterministic pre-processing step allows us to quantize network layers via memoryless scalar quantization while preserving the network performance on given training data. On one hand, the computational complexity of this pre-processing slightly exceeds that of state-of-the-art algorithms in the literature. On the other hand, our approach does not require any hyper-parameter tuning and, in contrast to previous methods, allows a plain analysis. We provide rigorous theoretical guarantees in the case of quantizing single network layers and show that the relative error decays with the number of parameters in the network if the training data behaves well, e.g., if it is sampled from suitable random distributions. The developed method also readily allows the quantization of deep networks by consecutive application to single layers. △ Less

Submitted 4 April, 2023; v1 submitted 7 September, 2022; originally announced September 2022.

arXiv:2207.08437 [pdf, other]

Non-negative Least Squares via Overparametrization

Authors: Hung-Hsu Chou, Johannes Maly, Claudio Mayrink Verdun

Abstract: In many applications, solutions of numerical problems are required to be non-negative, e.g., when retrieving pixel intensity values or physical densities of a substance. In this context, non-negative least squares (NNLS) is a ubiquitous tool, e.g., when seeking sparse solutions of high-dimensional statistical problems. Despite vast efforts since the seminal work of Lawson and Hanson in the '70s, t… ▽ More In many applications, solutions of numerical problems are required to be non-negative, e.g., when retrieving pixel intensity values or physical densities of a substance. In this context, non-negative least squares (NNLS) is a ubiquitous tool, e.g., when seeking sparse solutions of high-dimensional statistical problems. Despite vast efforts since the seminal work of Lawson and Hanson in the '70s, the non-negativity assumption is still an obstacle for the theoretical analysis and scalability of many off-the-shelf solvers. In the different context of deep neural networks, we recently started to see that the training of overparametrized models via gradient descent leads to surprising generalization properties and the retrieval of regularized solutions. In this paper, we prove that, by using an overparametrized formulation, NNLS solutions can reliably be approximated via vanilla gradient flow. We furthermore establish stability of the method against negative perturbations of the ground-truth. Our simulations confirm that this allows the use of vanilla gradient descent as a novel and scalable numerical solver for NNLS. From a conceptual point of view, our work proposes a novel approach to trading side-constraints in optimization problems against complexity of the optimization landscape, which does not build upon the concept of Lagrangian multipliers. △ Less

Submitted 27 October, 2022; v1 submitted 18 July, 2022; originally announced July 2022.

arXiv:2205.07517 [pdf, other]

Fairness in Participatory Budgeting via Equality of Resources

Authors: Jan Maly, Simon Rey, Ulle Endriss, Martin Lackner

Abstract: We introduce a family of normative principles to assess fairness in the context of participatory budgeting. These principles are based on the fundamental idea that budget allocations should be fair in terms of the resources invested into meeting the wishes of individual voters. This is in contrast to earlier proposals that are based on specific assumptions regarding the satisfaction of voters with… ▽ More We introduce a family of normative principles to assess fairness in the context of participatory budgeting. These principles are based on the fundamental idea that budget allocations should be fair in terms of the resources invested into meeting the wishes of individual voters. This is in contrast to earlier proposals that are based on specific assumptions regarding the satisfaction of voters with a given budget allocation. We analyse these new principles in axiomatic, algorithmic, and experimental terms. △ Less

Submitted 20 February, 2023; v1 submitted 16 May, 2022; originally announced May 2022.

arXiv:2112.11027 [pdf, other]

More is Less: Inducing Sparsity via Overparameterization

Authors: Hung-Hsu Chou, Johannes Maly, Holger Rauhut

Abstract: In deep learning it is common to overparameterize neural networks, that is, to use more parameters than training samples. Quite surprisingly training the neural network via (stochastic) gradient descent leads to models that generalize very well, while classical statistics would suggest overfitting. In order to gain understanding of this implicit bias phenomenon we study the special case of sparse… ▽ More In deep learning it is common to overparameterize neural networks, that is, to use more parameters than training samples. Quite surprisingly training the neural network via (stochastic) gradient descent leads to models that generalize very well, while classical statistics would suggest overfitting. In order to gain understanding of this implicit bias phenomenon we study the special case of sparse recovery (compressed sensing) which is of interest on its own. More precisely, in order to reconstruct a vector from underdetermined linear measurements, we introduce a corresponding overparameterized square loss functional, where the vector to be reconstructed is deeply factorized into several vectors. We show that, if there exists an exact solution, vanilla gradient flow for the overparameterized loss functional converges to a good approximation of the solution of minimal $\ell_1$-norm. The latter is well-known to promote sparse solutions. As a by-product, our results significantly improve the sample complexity for compressed sensing via gradient flow/descent on overparameterized models derived in previous works. The theory accurately predicts the recovery rate in numerical experiments. Our proof relies on analyzing a certain Bregman divergence of the flow. This bypasses the obstacles caused by non-convexity and should be of independent interest. △ Less

Submitted 10 May, 2023; v1 submitted 21 December, 2021; originally announced December 2021.

Journal ref: Information and Inference: A Journal of the IMA, 12(3), 04 2023. iaad012

arXiv:2112.08041 [pdf, other]

Weak limit of homeomorphisms in $W^{1,n-1}$ and (INV) condition

Authors: Anna Doležalová, Stanislav Hencl, Jan Malý

Abstract: Let $Ω,Ω'\subset\mathbb{R}^3$ be Lipschitz domains, let $f_m:Ω\toΩ'$ be a sequence of homeomorphisms with prescribed Dirichlet boundary condition and $\sup_m \int_Ω(|Df_m|^2+1/J^2_{f_m})<\infty$. Let $f$ be a weak limit of $f_m$ in $W^{1,2}$. We show that $f$ is invertible a.e., more precisely it satisfies the (INV) condition of Conti and De Lellis and thus it has all the nice properties of mappin… ▽ More Let $Ω,Ω'\subset\mathbb{R}^3$ be Lipschitz domains, let $f_m:Ω\toΩ'$ be a sequence of homeomorphisms with prescribed Dirichlet boundary condition and $\sup_m \int_Ω(|Df_m|^2+1/J^2_{f_m})<\infty$. Let $f$ be a weak limit of $f_m$ in $W^{1,2}$. We show that $f$ is invertible a.e., more precisely it satisfies the (INV) condition of Conti and De Lellis and thus it has all the nice properties of map**s in this class. Generalization to higher dimensions and an example showing sharpness of the condition $1/J^2_f\in L^1$ are also given. Using this example we also show that unlike the planar case the class of weak limits and the class of strong limits of $W^{1,2}$ Sobolev homeomorphisms in $\mathbb{R}^3$ are not the same. △ Less

Submitted 25 April, 2023; v1 submitted 15 December, 2021; originally announced December 2021.

arXiv:2106.06190 [pdf, other]

New challenges in covariance estimation: multiple structures and coarse quantization

Authors: Johannes Maly, Tianyu Yang, Sjoerd Dirksen, Holger Rauhut, Giuseppe Caire

Abstract: In this self-contained chapter, we revisit a fundamental problem of multivariate statistics: estimating covariance matrices from finitely many independent samples. Based on massive Multiple-Input Multiple-Output (MIMO) systems we illustrate the necessity of leveraging structure and considering quantization of samples when estimating covariance matrices in practice. We then provide a selective surv… ▽ More In this self-contained chapter, we revisit a fundamental problem of multivariate statistics: estimating covariance matrices from finitely many independent samples. Based on massive Multiple-Input Multiple-Output (MIMO) systems we illustrate the necessity of leveraging structure and considering quantization of samples when estimating covariance matrices in practice. We then provide a selective survey of theoretical advances of the last decade focusing on the estimation of structured covariance matrices. This review is spiced up by some yet unpublished insights on how to benefit from combined structural constraints. Finally, we summarize the findings of our recently published preprint "Covariance estimation under one-bit quantization" to show how guaranteed covariance estimation is possible even under coarse quantization of the samples. △ Less

Submitted 11 June, 2021; originally announced June 2021.

arXiv:2106.05052 [pdf, other]

Choice Logics and Their Computational Properties

Authors: Michael Bernreiter, Jan Maly, Stefan Woltran

Abstract: Qualitative Choice Logic (QCL) and Conjunctive Choice Logic (CCL) are formalisms for preference handling, with especially QCL being well established in the field of AI. So far, analyses of these logics need to be done on a case-by-case basis, albeit they share several common features. This calls for a more general choice logic framework, with QCL and CCL as well as some of their derivatives being… ▽ More Qualitative Choice Logic (QCL) and Conjunctive Choice Logic (CCL) are formalisms for preference handling, with especially QCL being well established in the field of AI. So far, analyses of these logics need to be done on a case-by-case basis, albeit they share several common features. This calls for a more general choice logic framework, with QCL and CCL as well as some of their derivatives being particular instantiations. We provide such a framework, which allows us, on the one hand, to easily define new choice logics and, on the other hand, to examine properties of different choice logics in a uniform setting. In particular, we investigate strong equivalence, a core concept in non-classical logics for understanding formula simplification, and computational complexity. Our analysis also yields new results for QCL and CCL. For example, we show that the main reasoning task regarding preferred models is $Θ^p_2$-complete for QCL and CCL, while being $Δ^p_2$-complete for a newly introduced choice logic. △ Less

Submitted 9 June, 2021; originally announced June 2021.

Comments: This is an extended version of a paper of the same name to be published at IJCAI 2021

arXiv:2104.15075 [pdf, ps, other]

Participatory Budgeting with Donations and Diversity Constraints

Authors: Jiehua Chen, Martin Lackner, Jan Maly

Abstract: Participatory budgeting (PB) is a democratic process where citizens jointly decide on how to allocate public funds to indivisible projects. This paper focuses on PB processes where citizens may give additional money to projects they want to see funded. We introduce a formal framework for this kind of PB with donations. Our framework also allows for diversity constraints, meaning that each project… ▽ More Participatory budgeting (PB) is a democratic process where citizens jointly decide on how to allocate public funds to indivisible projects. This paper focuses on PB processes where citizens may give additional money to projects they want to see funded. We introduce a formal framework for this kind of PB with donations. Our framework also allows for diversity constraints, meaning that each project belongs to one or more types, and there are lower and upper bounds on the number of projects of the same type that can be funded. We propose three general classes of methods for aggregating the citizens' preferences in the presence of donations and analyze their axiomatic properties. Furthermore, we investigate the computational complexity of determining the outcome of a PB process with donations and of finding a citizen's optimal donation strategy. △ Less

Submitted 30 April, 2021; originally announced April 2021.

arXiv:2104.15058 [pdf, other]

Perpetual Voting: The Axiomatic Lens

Authors: Martin Lackner, Jan Maly

Abstract: Perpetual voting was recently introduced as a framework for long-term collective decision making. In this framework, we consider a sequence of subsequent approval-based elections and try to achieve a fair overall outcome. To achieve fairness over time, perpetual voting rules take the history of previous decisions into account and identify voters that were dissatisfied with previous decisions. In t… ▽ More Perpetual voting was recently introduced as a framework for long-term collective decision making. In this framework, we consider a sequence of subsequent approval-based elections and try to achieve a fair overall outcome. To achieve fairness over time, perpetual voting rules take the history of previous decisions into account and identify voters that were dissatisfied with previous decisions. In this paper, we look at perpetual voting rules from an axiomatic perspective and study two main questions. First, we ask how simple such rules can be while still meeting basic desiderata. For two simple but natural classes, we fully characterize the axiomatic possibilities. Second, we ask how proportionality can be formalized in perpetual voting. We study proportionality on simple profiles that are equivalent to the apportionment setting and show that lower and upper quota axioms can be used to distinguish (and sometimes characterize) perpetual voting rules. Furthermore, we show a surprising connection between a perpetual rule called Perpetual Consensus and Frege's apportionment method. △ Less

Submitted 30 April, 2021; originally announced April 2021.

Comments: Scheduled for oral presentation at COMSOC 2021

arXiv:2104.05425 [pdf, other]

Ranking Sets of Objects: The Complexity of Avoiding Impossibility Results

Authors: Jan Maly

Abstract: The problem of lifting a preference order on a set of objects to a preference order on a family of subsets of this set is a fundamental problem with a wide variety of applications in AI. The process is often guided by axioms postulating properties the lifted order should have. Well-known impossibility results by Kannai and Peleg and by Barberà and Pattanaik tell us that some desirable axioms - nam… ▽ More The problem of lifting a preference order on a set of objects to a preference order on a family of subsets of this set is a fundamental problem with a wide variety of applications in AI. The process is often guided by axioms postulating properties the lifted order should have. Well-known impossibility results by Kannai and Peleg and by Barberà and Pattanaik tell us that some desirable axioms - namely dominance and (strict) independence - are not jointly satisfiable for any linear order on the objects if all non-empty sets of objects are to be ordered. On the other hand, if not all non-empty sets of objects are to be ordered, the axioms are jointly satisfiable for all linear orders on the objects for some families of sets. Such families are very important for applications as they allow for the use of lifted orders, for example, in combinatorial voting. In this paper, we determine the computational complexity of recognizing such families. We show that it is $Π_2^p$-complete to decide for a given family of subsets whether dominance and independence or dominance and strict independence are jointly satisfiable for all linear orders on the objects if the lifted order needs to be total. Furthermore, we show that the problem remains coNP-complete if the lifted order can be incomplete. Additionally, we show that the complexity of these problem can increase exponentially if the family of sets is not given explicitly but via a succinct domain restriction. Finally, we show that it is NP-complete to decide for family of subsets whether dominance and independence or dominance and strict independence are jointly satisfiable for at least one linear orders on the objects. △ Less

Submitted 3 January, 2022; v1 submitted 12 April, 2021; originally announced April 2021.

Journal ref: Journal of Artificial Intelligence Research (JAIR), 73: 1-65 (2022)

arXiv:2104.01280 [pdf, other]

Covariance estimation under one-bit quantization

Authors: Sjoerd Dirksen, Johannes Maly, Holger Rauhut

Abstract: We consider the classical problem of estimating the covariance matrix of a subgaussian distribution from i.i.d. samples in the novel context of coarse quantization, i.e., instead of having full knowledge of the samples, they are quantized to one or two bits per entry. This problem occurs naturally in signal processing applications. We introduce new estimators in two different quantization scenario… ▽ More We consider the classical problem of estimating the covariance matrix of a subgaussian distribution from i.i.d. samples in the novel context of coarse quantization, i.e., instead of having full knowledge of the samples, they are quantized to one or two bits per entry. This problem occurs naturally in signal processing applications. We introduce new estimators in two different quantization scenarios and derive non-asymptotic estimation error bounds in terms of the operator norm. In the first scenario we consider a simple, scale-invariant one-bit quantizer and derive an estimation result for the correlation matrix of a centered Gaussian distribution. In the second scenario, we add random dithering to the quantizer. In this case we can accurately estimate the full covariance matrix of a general subgaussian distribution by collecting two bits per entry of each sample. In both scenarios, our bounds apply to masked covariance estimation. We demonstrate the near-optimality of our error bounds by deriving corresponding (minimax) lower bounds and using numerical simulations. △ Less

Submitted 22 April, 2022; v1 submitted 2 April, 2021; originally announced April 2021.

arXiv:2103.05523 [pdf, other]

Robust Sensing of Low-Rank Matrices with Non-Orthogonal Sparse Decomposition

Authors: Johannes Maly

Abstract: We consider the problem of recovering an unknown low-rank matrix X with (possibly) non-orthogonal, effectively sparse rank-1 decomposition from measurements y gathered in a linear measurement process A. We propose a variational formulation that lends itself to alternating minimization and whose global minimizers provably approximate X up to noise level. Working with a variant of robust injectivity… ▽ More We consider the problem of recovering an unknown low-rank matrix X with (possibly) non-orthogonal, effectively sparse rank-1 decomposition from measurements y gathered in a linear measurement process A. We propose a variational formulation that lends itself to alternating minimization and whose global minimizers provably approximate X up to noise level. Working with a variant of robust injectivity, we derive reconstruction guarantees for various choices of A including sub-gaussian, Gaussian rank-1, and heavy-tailed measurements. Numerical experiments support the validity of our theoretical considerations. △ Less

Submitted 12 June, 2023; v1 submitted 9 March, 2021; originally announced March 2021.

arXiv:2103.01908 [pdf, other]

doi 10.1109/TSP.2021.3137599

Structural Sparsity in Multiple Measurements

Authors: Florian Boßmann, Sara Krause-Solberg, Johannes Maly, Nada Sissouno

Abstract: We propose a novel sparsity model for distributed compressed sensing in the multiple measurement vectors (MMV) setting. Our model extends the concept of row-sparsity to allow more general types of structured sparsity arising in a variety of applications like, e.g., seismic exploration and non-destructive testing. To reconstruct structured data from observed measurements, we derive a non-convex but… ▽ More We propose a novel sparsity model for distributed compressed sensing in the multiple measurement vectors (MMV) setting. Our model extends the concept of row-sparsity to allow more general types of structured sparsity arising in a variety of applications like, e.g., seismic exploration and non-destructive testing. To reconstruct structured data from observed measurements, we derive a non-convex but well-conditioned LASSO-type functional. By exploiting the convex-concave geometry of the functional, we design a projected gradient descent algorithm and show its effectiveness in extensive numerical simulations, both on toy and real data. △ Less

Submitted 31 December, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

Comments: Copyright 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

arXiv:2102.13092 [pdf, other]

Quantitative approximation results for complex-valued neural networks

Authors: A. Caragea, D. G. Lee, J. Maly, G. Pfander, F. Voigtlaender

Abstract: Until recently, applications of neural networks in machine learning have almost exclusively relied on real-valued networks. It was recently observed, however, that complex-valued neural networks (CVNNs) exhibit superior performance in applications in which the input is naturally complex-valued, such as MRI fingerprinting. While the mathematical theory of real-valued networks has, by now, reached s… ▽ More Until recently, applications of neural networks in machine learning have almost exclusively relied on real-valued networks. It was recently observed, however, that complex-valued neural networks (CVNNs) exhibit superior performance in applications in which the input is naturally complex-valued, such as MRI fingerprinting. While the mathematical theory of real-valued networks has, by now, reached some level of maturity, this is far from true for complex-valued networks. In this paper, we analyze the expressivity of complex-valued networks by providing explicit quantitative error bounds for approximating $C^n$ functions on compact subsets of $\mathbb{C}^d$ by complex-valued neural networks that employ the modReLU activation function, given by $σ(z) = \mathrm{ReLU}(|z| - 1) \, \mathrm{sgn} (z)$, which is one of the most popular complex activation functions used in practice. We show that the derived approximation rates are optimal (up to log factors) in the class of modReLU networks with weights of moderate growth. △ Less

Submitted 3 December, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

MSC Class: 68T07; 41A25; 41A46

arXiv:2011.13772 [pdf, other]

Gradient Descent for Deep Matrix Factorization: Dynamics and Implicit Bias towards Low Rank

Authors: Hung-Hsu Chou, Carsten Gieshoff, Johannes Maly, Holger Rauhut

Abstract: In deep learning, it is common to use more network parameters than training points. In such scenarioof over-parameterization, there are usually multiple networks that achieve zero training error so that thetraining algorithm induces an implicit bias on the computed solution. In practice, (stochastic) gradientdescent tends to prefer solutions which generalize well, which provides a possible explana… ▽ More In deep learning, it is common to use more network parameters than training points. In such scenarioof over-parameterization, there are usually multiple networks that achieve zero training error so that thetraining algorithm induces an implicit bias on the computed solution. In practice, (stochastic) gradientdescent tends to prefer solutions which generalize well, which provides a possible explanation of thesuccess of deep learning. In this paper we analyze the dynamics of gradient descent in the simplifiedsetting of linear networks and of an estimation problem. Although we are not in an overparameterizedscenario, our analysis nevertheless provides insights into the phenomenon of implicit bias. In fact, wederive a rigorous analysis of the dynamics of vanilla gradient descent, and characterize the dynamicalconvergence of the spectrum. We are able to accurately locate time intervals where the effective rankof the iterates is close to the effective rank of a low-rank projection of the ground-truth matrix. Inpractice, those intervals can be used as criteria for early stop** if a certain regularity is desired. Wealso provide empirical evidence for implicit bias in more general scenarios, such as matrix sensing andrandom initialization. This suggests that deep learning prefers trajectories whose complexity (measuredin terms of effective rank) is monotonically increasing, which we believe is a fundamental concept for thetheoretical understanding of deep learning. △ Less

Submitted 20 August, 2023; v1 submitted 27 November, 2020; originally announced November 2020.

arXiv:2005.07094 [pdf, other]

Approval-Based Shortlisting

Authors: Martin Lackner, Jan Maly

Abstract: Shortlisting is the task of reducing a long list of alternatives to a (smaller) set of best or most suitable alternatives. Shortlisting is often used in the nomination process of awards or in recommender systems to display featured objects. In this paper, we analyze shortlisting methods that are based on approval data, a common type of preferences. Furthermore, we assume that the size of the short… ▽ More Shortlisting is the task of reducing a long list of alternatives to a (smaller) set of best or most suitable alternatives. Shortlisting is often used in the nomination process of awards or in recommender systems to display featured objects. In this paper, we analyze shortlisting methods that are based on approval data, a common type of preferences. Furthermore, we assume that the size of the shortlist, i.e., the number of best or most suitable alternatives, is not fixed but determined by the shortlisting method. We axiomatically analyze established and new shortlisting methods and complement this analysis with an experimental evaluation based on synthetic and real-world data. Our results lead to recommendations which shortlisting methods to use, depending on the desired properties. △ Less

Submitted 2 May, 2022; v1 submitted 14 May, 2020; originally announced May 2020.

arXiv:2003.13664 [pdf, ps, other]

On BV homeomorphisms

Authors: Luigi D'Onofrio, Jan Malý, Carlo Sbordone, Roberta Schiattarella

Abstract: We obtain the rectifiability of the graph of a bounded variation homeomorphism $f$ in the plane and relations between gradients of $f$ and its inverse. Further, we show an example of a bounded variation homeomorphism $f$ in the plane which satisfies the $(N)$ and $(N^{-1})$ properties and strict positivity of Jacobian of both itself and its inverse, but neither $f$ nor $f^{-1}$ is Sobolev. We obtain the rectifiability of the graph of a bounded variation homeomorphism $f$ in the plane and relations between gradients of $f$ and its inverse. Further, we show an example of a bounded variation homeomorphism $f$ in the plane which satisfies the $(N)$ and $(N^{-1})$ properties and strict positivity of Jacobian of both itself and its inverse, but neither $f$ nor $f^{-1}$ is Sobolev. △ Less

Submitted 30 March, 2020; originally announced March 2020.

MSC Class: 26B30; 49Q15; 46E35

Journal ref: Journal of Convex Analysis 28,2 (2021)

arXiv:1912.04555 [pdf, other]

doi 10.1093/imrn/rnaa279

Pointwise inequalities for Sobolev functions on outward cuspidal domains

Authors: Sylvester Eriksson-Bique, Pekka Koskela, Jan Maly, Zheng Zhu

Abstract: We show that the first order Sobolev spaces on cuspidal symmetric domains can be characterized via pointwise inequalities. In particular, they coincide with the Hajlasz-Sobolev spaces. We show that the first order Sobolev spaces on cuspidal symmetric domains can be characterized via pointwise inequalities. In particular, they coincide with the Hajlasz-Sobolev spaces. △ Less

Submitted 10 December, 2019; originally announced December 2019.

arXiv:1911.07816 [pdf, other]

Quantized Compressed Sensing by Rectified Linear Units

Authors: Hans Christian Jung, Johannes Maly, Lars Palzer, Alexander Stollenwerk

Abstract: This work is concerned with the problem of recovering high-dimensional signals $\mathbf{x} \in \mathbb{R}^n$ which belong to a convex set of low-complexity from a small number of quantized measurements. We propose to estimate the signals via a convex program based on rectified linear units (ReLUs) for two different quantization schemes, namely one-bit and uniform multi-bit quantization. Assuming t… ▽ More This work is concerned with the problem of recovering high-dimensional signals $\mathbf{x} \in \mathbb{R}^n$ which belong to a convex set of low-complexity from a small number of quantized measurements. We propose to estimate the signals via a convex program based on rectified linear units (ReLUs) for two different quantization schemes, namely one-bit and uniform multi-bit quantization. Assuming that the linear measurement process can be modelled by a sensing matrix with i.i.d. subgaussian rows, we obtain for both schemes near-optimal uniform reconstruction guarantees by adding well-designed noise to the linear measurements prior to the quantization step. In the one-bit case, we show that the program is robust against adversarial bit corruptions as well as additive noise on the linear measurements. Further, our analysis quantifies precisely how the rate-distortion relationship of the program changes depending on whether we seek reconstruction accuracies above or below the noise floor. The proofs rely on recent results by Dirksen and Mendelson on non-Gaussian hyperplane tessellations. Finally, we complement our theoretical analysis with numerical experiments which compare our method to other state-of-the-art methodologies. △ Less

Submitted 26 March, 2021; v1 submitted 18 November, 2019; originally announced November 2019.

Comments: 40 pages, 5 figures

MSC Class: 62B10 ACM Class: G.3

arXiv:1911.02433 [pdf, ps, other]

AM-modulus and Hausdorff measure of codimension one in metric measure spaces

Authors: Vendula Honzlová Exnerová, Jan Malý, Olli Martio

Abstract: Let $Γ(E)$ be the family of all paths which meet a set $E$ in the metric measure space $X$. The set function $E \mapsto AM(Γ(E))$ defines the $AM$--modulus measure in $X$ where $AM$ refers to the approximation modulus. We compare $AM(Γ(E))$ to the Hausdorff measure $co\mathcal H^1(E)$ of codimension one in $X$ and show that $$co\mathcal H^1(E) \approx AM(Γ(E))$$ for Suslin sets $E$ in $X$. This le… ▽ More Let $Γ(E)$ be the family of all paths which meet a set $E$ in the metric measure space $X$. The set function $E \mapsto AM(Γ(E))$ defines the $AM$--modulus measure in $X$ where $AM$ refers to the approximation modulus. We compare $AM(Γ(E))$ to the Hausdorff measure $co\mathcal H^1(E)$ of codimension one in $X$ and show that $$co\mathcal H^1(E) \approx AM(Γ(E))$$ for Suslin sets $E$ in $X$. This leads to a new characterization of sets of finite perimeter in $X$ in terms of the $AM$--modulus. We also study the level sets of $BV$ functions and show that for a.e. $t$ these sets have finite $co\mathcal H^1$--measure. Most of the results are new also in $\mathbb R^n$. △ Less

Submitted 6 November, 2019; originally announced November 2019.

MSC Class: 31B15; 28A78; 30L99

arXiv:1908.02503 [pdf, other]

Computational approaches to non-convex, sparsity-inducing multi-penalty regularization

Authors: Zeljko Kereta, Johannes Maly, Valeriya Naumova

Abstract: In this work we consider numerical efficiency and convergence rates for solvers of non-convex multi-penalty formulations when reconstructing sparse signals from noisy linear measurements. We extend an existing approach, based on reduction to an augmented single-penalty formulation, to the non-convex setting and discuss its computational intractability in large-scale applications. To circumvent thi… ▽ More In this work we consider numerical efficiency and convergence rates for solvers of non-convex multi-penalty formulations when reconstructing sparse signals from noisy linear measurements. We extend an existing approach, based on reduction to an augmented single-penalty formulation, to the non-convex setting and discuss its computational intractability in large-scale applications. To circumvent this limitation, we propose an alternative single-penalty reduction based on infimal convolution that shares the benefits of the augmented approach but is computationally less dependent on the problem size. We provide linear convergence rates for both approaches, and their dependence on design parameters. Numerical experiments substantiate our theoretical findings. △ Less

Submitted 14 January, 2021; v1 submitted 7 August, 2019; originally announced August 2019.

Comments: 20 pages, 2 figures

arXiv:1904.04574 [pdf, ps, other]

On distributional adjugate and derivative of the inverse

Authors: Stanislav Hencl, Aapo Kauranen, Jan Malý

Abstract: Let $Ω\subset\er^3$ be a domain and let $f\colonΩ\to\er^3$ be a bi-$BV$ homeomorphism. Very recently in \cite{HKL} it was shown that the distributional adjugate of $Df$ (and thus also of $Df^{-1}$) is a matrix-valued measure. In the present paper we show that the components of $\Adj Df$ are equal to components of $Df^{-1}(f(U))$ as measures and that the absolutely continuous part of the distributi… ▽ More Let $Ω\subset\er^3$ be a domain and let $f\colonΩ\to\er^3$ be a bi-$BV$ homeomorphism. Very recently in \cite{HKL} it was shown that the distributional adjugate of $Df$ (and thus also of $Df^{-1}$) is a matrix-valued measure. In the present paper we show that the components of $\Adj Df$ are equal to components of $Df^{-1}(f(U))$ as measures and that the absolutely continuous part of the distributional adjugate $\Adj Df$ equals to the pointwise adjugate $\adj Df(x)$ a.e. We also show the equivalence of several approaches to the definition of the distributional adjugate. △ Less

Submitted 9 August, 2019; v1 submitted 9 April, 2019; originally announced April 2019.

MSC Class: 26B10; 30C65; 46E35

arXiv:1904.04527 [pdf, ps, other]

doi 10.1016/j.jfa.2021.109205

Plans on measures and AM-modulus

Authors: Vendula Honzlová Exnerová, Ondřej F. K. Kalenda, Jan Malý, Olli Martio

Abstract: For measuring families of curves, or, more generally, of measures, $M_p$-modulus is traditionally used. More recent studies use so-called plans on measures. In their fundamental paper \cite{ADS}, Ambrosio, Di Marino and Savaré proved that these two approaches are in some sense equivalent within $1<p<\infty$. We consider the limiting case $p=1$ and show that the $AM$-modulus can be obtained alterna… ▽ More For measuring families of curves, or, more generally, of measures, $M_p$-modulus is traditionally used. More recent studies use so-called plans on measures. In their fundamental paper \cite{ADS}, Ambrosio, Di Marino and Savaré proved that these two approaches are in some sense equivalent within $1<p<\infty$. We consider the limiting case $p=1$ and show that the $AM$-modulus can be obtained alternatively by the plan approach. On the way, we demonstrate unexpected behavior of the $AM$-modulus in comparison with usual capacities and consider the relations between the $M_1$--modulus and the $AM$--modulus. △ Less

Submitted 25 August, 2020; v1 submitted 9 April, 2019; originally announced April 2019.

MSC Class: 28A12; 31B15; 46E27

Journal ref: J. Funct. Anal. 281 (2021), no. 10, article no. 109205, 35pp

arXiv:1807.06490 [pdf, other]

On Recovery Guarantees for One-Bit Compressed Sensing on Manifolds

Authors: Mark A. Iwen, Felix Krahmer, Sara Krause-Solberg, Johannes Maly

Abstract: This paper studies the problem of recovering a signal from one-bit compressed sensing measurements under a manifold model; that is, assuming that the signal lies on or near a manifold of low intrinsic dimension. We provide a convex recovery method based on the Geometric Multi-Resolution Analysis and prove recovery guarantees with a near-optimal scaling in the intrinsic manifold dimension. Our meth… ▽ More This paper studies the problem of recovering a signal from one-bit compressed sensing measurements under a manifold model; that is, assuming that the signal lies on or near a manifold of low intrinsic dimension. We provide a convex recovery method based on the Geometric Multi-Resolution Analysis and prove recovery guarantees with a near-optimal scaling in the intrinsic manifold dimension. Our method is the first tractable algorithm with such guarantees for this setting. The results are complemented by numerical experiments confirming the validity of our approach. △ Less

Submitted 23 July, 2020; v1 submitted 17 July, 2018; originally announced July 2018.

arXiv:1805.03486 [pdf, other]

Analysis of Hard-Thresholding for Distributed Compressed Sensing with One-Bit Measurements

Authors: Johannes Maly, Lars Palzer

Abstract: A simple hard-thresholding operation is shown to be able to recover $L$ signals $\mathbf{x}_1,...,\mathbf{x}_L \in \mathbb{R}^n$ that share a common support of size $s$ from $m = \mathcal{O}(s)$ one-bit measurements per signal if $L \ge \log(en/s)$. This result improves the single signal recovery bounds with $m = \mathcal{O}(s\log(en/s))$ measurements in the sense that asymptotically fewer measure… ▽ More A simple hard-thresholding operation is shown to be able to recover $L$ signals $\mathbf{x}_1,...,\mathbf{x}_L \in \mathbb{R}^n$ that share a common support of size $s$ from $m = \mathcal{O}(s)$ one-bit measurements per signal if $L \ge \log(en/s)$. This result improves the single signal recovery bounds with $m = \mathcal{O}(s\log(en/s))$ measurements in the sense that asymptotically fewer measurements per non-zero entry are needed. Numerical evidence supports the theoretical considerations. △ Less

Submitted 17 September, 2018; v1 submitted 9 May, 2018; originally announced May 2018.

Comments: 17 pages, 2 figures

arXiv:1802.08127 [pdf, ps, other]

Map** Analytic sets onto cubes by little Lipschitz functions

Authors: Jan Malý, Ondřej Zindulka

Abstract: A map** $f:X\to Y$ between metric spaces is called \emph{little Lipschitz} if the quantity $$ \operatorname{lip}(f(x)=\liminf_{r\to0}\frac{\operatorname{diam} f(B(x,r))}{r} $$ is finite for every $x\in X$. We prove that if a compact (or, more generally, analytic) metric space has packing dimension greater than $n$, then $X$ can be mapped onto an $n$-dimensional cube by a little Lipschitz fun… ▽ More A map** $f:X\to Y$ between metric spaces is called \emph{little Lipschitz} if the quantity $$ \operatorname{lip}(f(x)=\liminf_{r\to0}\frac{\operatorname{diam} f(B(x,r))}{r} $$ is finite for every $x\in X$. We prove that if a compact (or, more generally, analytic) metric space has packing dimension greater than $n$, then $X$ can be mapped onto an $n$-dimensional cube by a little Lipschitz function. The result requires two facts that are interesing in their own right. First, an analytic metric space $X$ contains, for any $\varepsilon>0$, a compact subset $S$ that embeds into an ultrametric space by a Lipschitz map, and $\dim_P S\geq\dim_P X-\varepsilon$. Second, a little Lipschitz function on a closed subset admits a little Lipschitz extension. △ Less

Submitted 22 February, 2018; originally announced February 2018.

arXiv:1801.06240 [pdf, other]

Robust Recovery of Low-Rank Matrices with Non-Orthogonal Sparse Decomposition from Incomplete Measurements

Authors: Massimo Fornasier, Johannes Maly, Valeriya Naumova

Abstract: We consider the problem of recovering an unknown effectively $(s_1,s_2)$-sparse low-rank-$R$ matrix $X$ with possibly non-orthogonal rank-$1$ decomposition from incomplete and inaccurate linear measurements of the form $y = \mathcal A (X) + η$, where $η$ is an ineliminable noise. We first derive an optimization formulation for matrix recovery under the considered model and propose a novel algorith… ▽ More We consider the problem of recovering an unknown effectively $(s_1,s_2)$-sparse low-rank-$R$ matrix $X$ with possibly non-orthogonal rank-$1$ decomposition from incomplete and inaccurate linear measurements of the form $y = \mathcal A (X) + η$, where $η$ is an ineliminable noise. We first derive an optimization formulation for matrix recovery under the considered model and propose a novel algorithm, called Alternating Tikhonov regularization and Lasso (A-T-LA$\text{S}_{2,1}$), to solve it. The algorithm is based on a multi-penalty regularization, which is able to leverage both structures (low-rankness and sparsity) simultaneously. The algorithm is a fast first order method, and straightforward to implement. We prove global convergence for any linear measurement model to stationary points and local convergence to global minimizers. By adapting the concept of restricted isometry property from compressed sensing to our novel model class, we prove error bounds between global minimizers and ground truth, up to noise level, from a number of subgaussian measurements scaling as $R(s_1+s_2)$, up to log-factors in the dimension, and relative-to-diameter distortion. Simulation results demonstrate both the accuracy and efficacy of the algorithm, as well as its superiority to the state-of-the-art algorithms in strong noise regimes and for matrices, whose singular vectors do not possess exact (joint-) sparse support. △ Less

Submitted 28 July, 2020; v1 submitted 18 January, 2018; originally announced January 2018.

arXiv:1801.02193 [pdf, other]

Multi-platform Version of StarCraft: Brood War in a Docker Container: Technical Report

Authors: Michal Šustr, Jan Malý, Michal Čertický

Abstract: We present a dockerized version of a real-time strategy game StarCraft: Brood War, commonly used as a domain for AI research, with a pre-installed collection of AI developement tools supporting all the major types of StarCraft bots. This provides a convenient way to deploy StarCraft AIs on numerous hosts at once and across multiple platforms despite limited OS support of StarCraft. In this technic… ▽ More We present a dockerized version of a real-time strategy game StarCraft: Brood War, commonly used as a domain for AI research, with a pre-installed collection of AI developement tools supporting all the major types of StarCraft bots. This provides a convenient way to deploy StarCraft AIs on numerous hosts at once and across multiple platforms despite limited OS support of StarCraft. In this technical report, we describe the design of our Docker images and present a few use cases. △ Less

Submitted 7 January, 2018; originally announced January 2018.

arXiv:1710.09492 [pdf, ps, other]

doi 10.1016/j.na.2018.06.015

Approximation by map**s with singular Hessian minors

Authors: Zhuomin Liu, Jan Malý, Mohammad Reza Pakzad

Abstract: Let $Ω\subset\mathbb R^n$ be a Lipschitz domain. Given $1\leq p<k\leq n$ and any $u\in W^{2,p}(Ω)$ belonging to the little Hölder class $c^{1,α}$, we construct a sequence $u_j$ in the same space with $\operatorname{rank}D^2u_j<k$ almost everywhere such that $u_j\to u$ in $C^{1,α}$ and weakly in $W^{2,p}$. This result is in strong contrast with known regularity behavior of functions in $W^{2,p}$,… ▽ More Let $Ω\subset\mathbb R^n$ be a Lipschitz domain. Given $1\leq p<k\leq n$ and any $u\in W^{2,p}(Ω)$ belonging to the little Hölder class $c^{1,α}$, we construct a sequence $u_j$ in the same space with $\operatorname{rank}D^2u_j<k$ almost everywhere such that $u_j\to u$ in $C^{1,α}$ and weakly in $W^{2,p}$. This result is in strong contrast with known regularity behavior of functions in $W^{2,p}$, $p\geq k$, satisfying the same rank inequality. △ Less

Submitted 25 October, 2017; originally announced October 2017.

Comments: 18 pages

MSC Class: 35B99; 46T10

Journal ref: Nonlinear Anal. 176 (2018), 209-225

arXiv:1509.02326 [pdf, ps, other]

doi 10.1007/s11118-016-9580-z

Quasiopen and p-path open sets, and characterizations of quasicontinuity

Authors: Anders Björn, Jana Björn, Jan Malý

Abstract: In this paper we give various characterizations of quasiopen sets and quasicontinuous functions on metric spaces. For complete metric spaces equipped with a doubling measure supporting a p-Poincaré inequality we show that quasiopen and p-path open sets coincide. Under the same assumptions we show that all Newton-Sobolev functions on quasiopen sets are quasicontinuous. In this paper we give various characterizations of quasiopen sets and quasicontinuous functions on metric spaces. For complete metric spaces equipped with a doubling measure supporting a p-Poincaré inequality we show that quasiopen and p-path open sets coincide. Under the same assumptions we show that all Newton-Sobolev functions on quasiopen sets are quasicontinuous. △ Less

Submitted 8 September, 2015; originally announced September 2015.

Comments: 17 pages

MSC Class: Primary: 31E05; Secondary: 28A05; 30L99; 31C15; 31C40; 31C45; 46E35

Journal ref: Potential Anal. 46 (2017), 181-199

arXiv:1309.3094 [pdf, ps, other]

Luzin's Condition (N) and Modulus of Continuity

Authors: Pekka Koskela, Jan Malý, Thomas Zürcher

Abstract: In this paper, we establish Luzin's condition (N) for map**s in certain Sobolev-Orlicz spaces with certain moduli of continuity. Further, given a map** in these Sobolev-Orlicz spaces, we give bounds on the size of the exceptional set where Luzin's condition (N) may fail. If a map** violates Luzin's condition (N), we show that there is a Cantor set of measure zero that is mapped to a set of p… ▽ More In this paper, we establish Luzin's condition (N) for map**s in certain Sobolev-Orlicz spaces with certain moduli of continuity. Further, given a map** in these Sobolev-Orlicz spaces, we give bounds on the size of the exceptional set where Luzin's condition (N) may fail. If a map** violates Luzin's condition (N), we show that there is a Cantor set of measure zero that is mapped to a set of positive measure. △ Less

Submitted 12 September, 2013; originally announced September 2013.

MSC Class: 26B15; 26B35; 46E35

arXiv:1212.1563 [pdf, ps, other]

A low rank property and nonexistence of higher dimensional horizontal Sobolev sets

Authors: Valentino Magnani, Jan Malý, Samuele Mongodi

Abstract: We establish a "low rank property" for Sobolev map**s that pointwise solve a first order nonlinear system of PDEs, whose smooth solutions have the so-called "contact property". As a consequence, Sobolev map**s from an open set of the plane, taking values in the first Heisenberg group and that have almost everywhere maximal rank must have images with positive 3-dimensional Hausdorff measure wit… ▽ More We establish a "low rank property" for Sobolev map**s that pointwise solve a first order nonlinear system of PDEs, whose smooth solutions have the so-called "contact property". As a consequence, Sobolev map**s from an open set of the plane, taking values in the first Heisenberg group and that have almost everywhere maximal rank must have images with positive 3-dimensional Hausdorff measure with respect to the sub-Riemannian distance of the Heisenberg group. This provides a complete solution to a question raised in a paper by Z. M. Balogh, R. Hoefer-Isenegger and J. T. Tyson. Our approach differs from the previous ones. Its technical aspect consists in performing an "exterior differentiation by blow-up", where the standard distributional exterior differentiation is not possible. This method extends to higher dimensional Sobolev map**s taking values in higher dimensional Heisenberg groups. △ Less

Submitted 13 February, 2013; v1 submitted 7 December, 2012; originally announced December 2012.

Comments: 12 pages

MSC Class: 46E35; 53C17 (Primary) 26B20; 22E25 (Secondary)

arXiv:1002.2852 [pdf, ps, other]

doi 10.5186/aasfm.2011.3609

An elementary way to introduce a Perron-like integral

Authors: Hana Bendová, Jan Malý

Abstract: We give an alternative definition of integral at the generality of the Perron integral and propose an exposition of the foundations of integral theory starting from this new definition. Both definition and proofs needed for the development are unexpectedly simple. We show how to adapt the definition to cover the multidimensional and Stieltjes case and prove that our integral is equivalent to the… ▽ More We give an alternative definition of integral at the generality of the Perron integral and propose an exposition of the foundations of integral theory starting from this new definition. Both definition and proofs needed for the development are unexpectedly simple. We show how to adapt the definition to cover the multidimensional and Stieltjes case and prove that our integral is equivalent to the Henstock-Kurzweil(-Stieltjes) integral. △ Less

Submitted 15 February, 2010; originally announced February 2010.

MSC Class: 26A39

Journal ref: Ann. Acad. Sci. Fenn. Math. 36 (2011), no. 1, 153-164

arXiv:math/0112008 [pdf, ps, other]

The coarea formula for Sobolev map**s

Authors: Jan Maly, David Swanson, William P. Ziemer

Abstract: We extend Federer's coarea formula to map**s $f$ belonging to the Sobolev class $W^{1,p}(R^n;R^m)$, $1 \le m < n$, $p>m$, and more generally, to map**s with gradient in the Lorentz space $L^{m,1}(R^n)$. This is accomplished by showing that the graph of $f$ in $R^{n+m}$ is a Hausdorff $n$-rectifiable set. We extend Federer's coarea formula to map**s $f$ belonging to the Sobolev class $W^{1,p}(R^n;R^m)$, $1 \le m < n$, $p>m$, and more generally, to map**s with gradient in the Lorentz space $L^{m,1}(R^n)$. This is accomplished by showing that the graph of $f$ in $R^{n+m}$ is a Hausdorff $n$-rectifiable set. △ Less

Submitted 1 December, 2001; originally announced December 2001.

Comments: Submitted for publication, 16 pages

MSC Class: 46E35; 46E30

arXiv:cond-mat/9805018 [pdf, ps, other]

Pairing Correlations and the Pseudo-Gap State: Application of the Pairing Approximation Theory

Authors: Jiri Maly, Boldizsar Janko, K. Levin

Abstract: We investigate the pseudogap onset temperature $T^*$, the superconducting transition temperature $T_c$ and the general nature of the pseudogap phase using a diagrammatic BCS-Bose Einstein crossover theory. This decoupling scheme is based on the pairing approximation of Kadanoff and Martin, further extended by Patton (KMP). Our consideration of the KMP pairing approximation is driven by the objec… ▽ More We investigate the pseudogap onset temperature $T^*$, the superconducting transition temperature $T_c$ and the general nature of the pseudogap phase using a diagrammatic BCS-Bose Einstein crossover theory. This decoupling scheme is based on the pairing approximation of Kadanoff and Martin, further extended by Patton (KMP). Our consideration of the KMP pairing approximation is driven by the objective to obtain BCS like behavior at weak coupling, (which does not necessarily follow for other diagrammatic schemes). The breakdown of the Fermi liquid state at $T^*$ is investigated within the lowest order theory and is associated with intermediate values of the coupling. The superconducting instability $T_c$ is evaluated by introducing mode coupling effects, in which the long lived pairs are affected by the single particle pseudogap states and vice versa. Our $T_c$ equations, which turn out to be rather simple as a result of the KMP scheme, reveal a rich structure as a function of $g$ in which the pseudogap is found to compete with superconductivity. Our results are compared with alternate theories in the literature. △ Less

Submitted 4 May, 1998; originally announced May 1998.

Comments: REVTeX3.1, 15 pages,15 EPS figures (included)

arXiv:cond-mat/9710187 [pdf, ps, other]

Superconductivity from a pseudogapped normal state: a mode coupling approach to precursor superconductivity

Authors: Jiri Maly, Boldizsar Janko, K. Levin

Abstract: We derive a phase diagram for the pseudogap onset temperature $T^*$ (associated with the breakdown of the Fermi liquid state, due to strong pairing correlations) and the superconducting instability, $T_c$, as a function of variable pairing strength. Our diagrammatic approach to the BCS - Bose-Einstein cross-over problem self consistently treats the coupling between the single particle and pair p… ▽ More We derive a phase diagram for the pseudogap onset temperature $T^*$ (associated with the breakdown of the Fermi liquid state, due to strong pairing correlations) and the superconducting instability, $T_c$, as a function of variable pairing strength. Our diagrammatic approach to the BCS - Bose-Einstein cross-over problem self consistently treats the coupling between the single particle and pair propagators, and leads to a continuous evolution of these propagators into the standard $T<T_c$ counterparts. A rich structure is found in $T_c$ which reflects the way in which the superconducting instability at $T_c$ is affected by the pseudogap $Δ_{pg}$. An important consequence of Cooper-pair- induced pseudogaps is that the magnitude of $T_c$ is sustained, even when $Δ_{pg}>T_c$. △ Less

Submitted 17 October, 1997; originally announced October 1997.

Comments: REVTeX3.0; 5 pages, 3 EPS figures (included)

arXiv:cond-mat/9705144 [pdf, ps, other]

doi 10.1103/PhysRevB.56.R11407

Pseudogap effects induced by resonant pair scattering

Authors: Boldizsar Janko, Jiri Maly, K. Levin

Abstract: We demonstrate how resonant pair scattering of correlated electrons above T_c can give rise to pseudogap behavior. This resonance in the scattering T-matrix appears for superconducting interactions of intermediate strength, within the framework of a simple fermionic model. It is associated with a splitting of the single peak in the spectral function into a pair of peaks separated by an energy ga… ▽ More We demonstrate how resonant pair scattering of correlated electrons above T_c can give rise to pseudogap behavior. This resonance in the scattering T-matrix appears for superconducting interactions of intermediate strength, within the framework of a simple fermionic model. It is associated with a splitting of the single peak in the spectral function into a pair of peaks separated by an energy gap. Our physical picture is contrasted with that derived from other T-matrix schemes, with superconducting fluctuation effects, and with preformed pair (boson-fermion) models. Implications for photoemission and tunneling experiments in the cuprates are discussed. △ Less

Submitted 15 May, 1997; originally announced May 1997.

Comments: REVTeX3.0; 4 pages, 4 EPS figures (included)

arXiv:cond-mat/9609083 [pdf, ps, other]

doi 10.1103/PhysRevB.54.R15657

Coulomb Correlations and Pseudo-gap Effects in a Pre-formed Pair Model for the Cuprates

Authors: Jiri Maly, K. Levin, D. Z. Liu

Abstract: We extend previous work on pre-formed pair models of superconductivity to incorporate Coulomb correlation effects. For neutral systems, these models have provided a useful scheme which interpolates between BCS and Bose Einstein condensation with increasing coupling and thereby describes some aspects of pseudo-gap phenomena. However, charge fluctuations (via the plasmon, $ω_p$) significantly modi… ▽ More We extend previous work on pre-formed pair models of superconductivity to incorporate Coulomb correlation effects. For neutral systems, these models have provided a useful scheme which interpolates between BCS and Bose Einstein condensation with increasing coupling and thereby describes some aspects of pseudo-gap phenomena. However, charge fluctuations (via the plasmon, $ω_p$) significantly modify the collective modes and therefore the interpolation behavior. We discuss the resulting behavior of the pseudo-gap and thermodynamic quantities such as $T_c$, $χ$ and $C_v$ as a function of $ω_p$. △ Less

Submitted 9 September, 1996; originally announced September 1996.

Comments: 4 pages RevTeX, 3 ps figures included (Submitted to Physical Review B August 27, 1996)

arXiv:cond-mat/9605009 [pdf, ps, other]

What does d-wave symmetry tell us about the pairing mechanism?

Authors: K. Levin, D. Z. Liu, Jiri Maly

Abstract: In this paper we argue that d-wave symmetry is a general consequence of superconductivity driven by repulsive interactions. Van Hove (or flat band) effects, deriving from the two dimensionality of the $CuO_2$ plane are important in stabilizing this state. By extending the original Kohn-Luttinger picture to a 2 D lattice, we find that the screened Coulomb term has important wave vector structure… ▽ More In this paper we argue that d-wave symmetry is a general consequence of superconductivity driven by repulsive interactions. Van Hove (or flat band) effects, deriving from the two dimensionality of the $CuO_2$ plane are important in stabilizing this state. By extending the original Kohn-Luttinger picture to a 2 D lattice, we find that the screened Coulomb term has important wave vector structure which leads to $d_{x^2-y^2}$ superconductivity △ Less

Submitted 1 May, 1996; originally announced May 1996.

Comments: 4 pages, also available at http://rainbow.uchicago.edu/~ldz/paper/paper.html To be published in the proceedings of the 10th Anniversary of HTSC Workshop (Houston, March 1996)

Report number: ldz-psd14

Showing 1–50 of 52 results for author: Maly, J