Skip to main content

Showing 1–49 of 49 results for author: Kolar, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13936  [pdf, other

    stat.ML cs.LG math.OC

    Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods

    Authors: Tim Tsz-Kit Lau, Weijian Li, Chenwei Xu, Han Liu, Mladen Kolar

    Abstract: Modern deep neural networks often require distributed training with many workers due to their large size. As worker numbers increase, communication overheads become the main bottleneck in data-parallel minibatch stochastic gradient methods with per-iteration gradient synchronization. Local gradient methods like Local SGD reduce communication by only syncing after several local steps. Despite under… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  2. arXiv:2406.06829  [pdf, other

    cs.LG stat.ML

    Personalized Binomial DAGs Learning with Network Structured Covariates

    Authors: Boxin Zhao, Weishi Wang, Dingyuan Zhu, Ziqi Liu, Dong Wang, Zhiqiang Zhang, Jun Zhou, Mladen Kolar

    Abstract: The causal dependence in data is often characterized by Directed Acyclic Graphical (DAG) models, widely used in many areas. Causal discovery aims to recover the DAG structure using observational data. This paper focuses on causal discovery with multi-variate count data. We are motivated by real-world web visit data, recording individual user visits to multiple websites. Building a causal diagram c… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  3. arXiv:2402.11215  [pdf, other

    cs.LG math.OC stat.ML

    AdAdaGrad: Adaptive Batch Size Schemes for Adaptive Gradient Methods

    Authors: Tim Tsz-Kit Lau, Han Liu, Mladen Kolar

    Abstract: The choice of batch sizes in minibatch stochastic gradient optimizers is critical in large-scale model training for both optimization and generalization performance. Although large-batch training is arguably the dominant training paradigm for large-scale deep learning due to hardware advances, the generalization performance of the model deteriorates compared to small-batch training, leading to the… ▽ More

    Submitted 28 May, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

  4. arXiv:2312.17047  [pdf, other

    math.ST cs.LG stat.ME stat.ML

    Inconsistency of cross-validation for structure learning in Gaussian graphical models

    Authors: Zhao Lyu, Wai Ming Tai, Mladen Kolar, Bryon Aragam

    Abstract: Despite numerous years of research into the merits and trade-offs of various model selection criteria, obtaining robust results that elucidate the behavior of cross-validation remains a challenging endeavor. In this paper, we highlight the inherent limitations of cross-validation when employed to discern the structure of a Gaussian graphical model. We provide finite-sample bounds on the probabilit… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: Preliminary version; 47 pages, 15 figures

  5. arXiv:2306.02543  [pdf, other

    cs.LG

    Addressing Budget Allocation and Revenue Allocation in Data Market Environments Using an Adaptive Sampling Algorithm

    Authors: Boxin Zhao, Boxiang Lyu, Raul Castro Fernandez, Mladen Kolar

    Abstract: High-quality machine learning models are dependent on access to high-quality training data. When the data are not already available, it is tedious and costly to obtain them. Data markets help with identifying valuable training data: model consumers pay to train a model, the market uses that budget to identify data and train the model (the budget allocation problem), and finally the market compensa… ▽ More

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: Published on International Conference on Machine Learning (ICML) 2023

  6. arXiv:2305.18379  [pdf, other

    math.OC cs.LG math.NA stat.ML

    Constrained Optimization via Exact Augmented Lagrangian and Randomized Iterative Sketching

    Authors: Ilgee Hong, Sen Na, Michael W. Mahoney, Mladen Kolar

    Abstract: We consider solving equality-constrained nonlinear, nonconvex optimization problems. This class of problems appears widely in a variety of applications in machine learning and engineering, ranging from constrained deep neural networks, to optimal control, to PDE-constrained optimization. We develop an adaptive inexact Newton method for this problem class. In each iteration, we solve the Lagrangian… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: 25 pages, 4 figures

    Journal ref: ICML 2023

  7. arXiv:2210.17237  [pdf, other

    stat.ME cs.LG stat.ML

    Latent Multimodal Functional Graphical Model Estimation

    Authors: Katherine Tsai, Boxin Zhao, Sanmi Koyejo, Mladen Kolar

    Abstract: Joint multimodal functional data acquisition, where functional data from multiple modes are measured simultaneously from the same subject, has emerged as an exciting modern approach enabled by recent engineering breakthroughs in the neurological and biological sciences. One prominent motivation to acquire such data is to enable new discoveries of the underlying connectivity by combining multimodal… ▽ More

    Submitted 1 October, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

  8. arXiv:2208.12628  [pdf, other

    cs.DC

    PNPCoin: Distributed Computing on Bitcoin infrastructure

    Authors: Martin Kolář

    Abstract: Research and applications in Machine Learning are limited by computational resources, while 1% of the world's electricity goes into calculating 34 billion billion SHA-256 hashes per second, four orders of magnitude more than the 200 petaflop power of the world's most powerful supercomputer. The work presented here describes how a simple soft fork on Bitcoin can adapt these incomparable resources t… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

    Comments: 4 page version, 1 figure, AGI conference submission format

  9. arXiv:2205.15891  [pdf, ps, other

    cs.LG stat.ML

    One Policy is Enough: Parallel Exploration with a Single Policy is Near-Optimal for Reward-Free Reinforcement Learning

    Authors: Pedro Cisneros-Velarde, Boxiang Lyu, Sanmi Koyejo, Mladen Kolar

    Abstract: Although parallelism has been extensively used in reinforcement learning (RL), the quantitative effects of parallel exploration are not well understood theoretically. We study the benefits of simple parallel exploration for reward-free RL in linear Markov decision processes (MDPs) and two-player zero-sum Markov games (MGs). In contrast to the existing literature, which focuses on approaches that e… ▽ More

    Submitted 1 March, 2023; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: 50 pages

  10. arXiv:2205.02450  [pdf, other

    cs.LG cs.GT stat.ML

    Pessimism meets VCG: Learning Dynamic Mechanism Design via Offline Reinforcement Learning

    Authors: Boxiang Lyu, Zhaoran Wang, Mladen Kolar, Zhuoran Yang

    Abstract: Dynamic mechanism design has garnered significant attention from both computer scientists and economists in recent years. By allowing agents to interact with the seller over multiple rounds, where agents' reward functions may change with time and are state-dependent, the framework is able to model a rich class of real-world problems. In these works, the interaction between agents and sellers is of… ▽ More

    Submitted 21 June, 2022; v1 submitted 5 May, 2022; originally announced May 2022.

    Comments: 52 pages

  11. arXiv:2204.13619  [pdf, other

    cs.LG

    Personalized Federated Learning with Multiple Known Clusters

    Authors: Boxiang Lyu, Filip Hanzely, Mladen Kolar

    Abstract: We consider the problem of personalized federated learning when there are known cluster structures within users. An intuitive approach would be to regularize the parameters so that users in the same cluster share similar model weights. The distances between the clusters can then be regularized to reflect the similarity between different clusters of users. We develop an algorithm that allows each c… ▽ More

    Submitted 28 April, 2022; originally announced April 2022.

  12. arXiv:2201.13387  [pdf, other

    cs.LG math.OC

    L-SVRG and L-Katyusha with Adaptive Sampling

    Authors: Boxin Zhao, Boxiang Lyu, Mladen Kolar

    Abstract: Stochastic gradient-based optimization methods, such as L-SVRG and its accelerated variant L-Katyusha (Kovalev et al., 2020), are widely used to train machine learning models.The theoretical and empirical performance of L-SVRG and L-Katyusha can be improved by sampling observations from a non-uniform distribution (Qian et al., 2021). However,designing a desired sampling distribution requires prior… ▽ More

    Submitted 5 June, 2023; v1 submitted 31 January, 2022; originally announced January 2022.

    Comments: Published in Transactions on Machine Learning Research (03/2023)

  13. arXiv:2112.14332  [pdf, other

    cs.LG

    Adaptive Client Sampling in Federated Learning via Online Learning with Bandit Feedback

    Authors: Boxin Zhao, Lingxiao Wang, Mladen Kolar, Ziqi Liu, Zhiqiang Zhang, Jun Zhou, Chaochao Chen

    Abstract: Due to the high cost of communication, federated learning (FL) systems need to sample a subset of clients that are involved in each round of training. As a result, client sampling plays an important role in FL systems as it affects the convergence rate of optimization algorithms used to train machine learning models. Despite its importance, there is limited work on how to sample clients effectivel… ▽ More

    Submitted 31 May, 2023; v1 submitted 28 December, 2021; originally announced December 2021.

  14. arXiv:2111.14807  [pdf, ps, other

    cs.FL cs.CC

    On formally undecidable propositions in nondeterministic languages

    Authors: Martin Kolář

    Abstract: Any class of languages $\mathbf{L}$ accepted in time $\mathbf{T}$ has a counterpart $\mathbf{NL}$ accepted in nondeterministic time $\mathbf{NT}$. It follows from the definition of nondeterministic languages that $\mathbf{L} \subseteq \mathbf{NL}$. This work shows that every sufficiently powerful language in $\mathbf{L}$ contains a string corresponding to Gödel's undecidable proposition, but this… ▽ More

    Submitted 25 November, 2021; originally announced November 2021.

    Comments: 4 pages

    MSC Class: 68Q15; 03D35

  15. arXiv:2111.03772  [pdf, other

    cs.LG math.OC stat.ML

    Dynamic Regret Minimization for Control of Non-stationary Linear Dynamical Systems

    Authors: Yuwei Luo, Varun Gupta, Mladen Kolar

    Abstract: We consider the problem of controlling a Linear Quadratic Regulator (LQR) system over a finite horizon $T$ with fixed and known cost matrices $Q,R$, but unknown and non-stationary dynamics $\{A_t, B_t\}$. The sequence of dynamics matrices can be arbitrary, but with a total variation, $V_T$, assumed to be $o(T)$ and unknown to the controller. Under the assumption that a sequence of stabilizing, but… ▽ More

    Submitted 18 March, 2022; v1 submitted 5 November, 2021; originally announced November 2021.

    Journal ref: Proceedings of the ACM on Measurement and Analysis of Computing Systems, Volume 6, Issue 1, March 2022, Article No 9, pp 1--72

  16. arXiv:2110.10281  [pdf, other

    stat.ME cs.LG stat.ML

    Joint Gaussian Graphical Model Estimation: A Survey

    Authors: Katherine Tsai, Oluwasanmi Koyejo, Mladen Kolar

    Abstract: Graphs from complex systems often share a partial underlying structure across domains while retaining individual features. Thus, identifying common structures can shed light on the underlying signal, for instance, when applied to scientific discoveries or clinical diagnoses. Furthermore, growing evidence shows that the shared structure across domains boosts the estimation power of graphs, particul… ▽ More

    Submitted 3 April, 2022; v1 submitted 19 October, 2021; originally announced October 2021.

  17. arXiv:2109.11502  [pdf, other

    math.OC cs.LG math.NA stat.ML

    Inequality Constrained Stochastic Nonlinear Optimization via Active-Set Sequential Quadratic Programming

    Authors: Sen Na, Mihai Anitescu, Mladen Kolar

    Abstract: We study nonlinear optimization problems with a stochastic objective and deterministic equality and inequality constraints, which emerge in numerous applications including finance, manufacturing, power systems and, recently, deep neural networks. We propose an active-set stochastic sequential quadratic programming (StoSQP) algorithm that utilizes a differentiable exact augmented Lagrangian as the… ▽ More

    Submitted 30 January, 2023; v1 submitted 23 September, 2021; originally announced September 2021.

    Comments: 65 pages, 9 figures

  18. arXiv:2106.10022  [pdf, other

    cs.LG cs.DC math.OC

    Local AdaGrad-Type Algorithm for Stochastic Convex-Concave Optimization

    Authors: Luofeng Liao, Li Shen, Jia Duan, Mladen Kolar, Dacheng Tao

    Abstract: Large scale convex-concave minimax problems arise in numerous applications, including game theory, robust training, and training of generative adversarial networks. Despite their wide applicability, solving such problems efficiently and effectively is challenging in the presence of large amounts of data using existing stochastic minimax methods. We study a class of stochastic minimax methods and d… ▽ More

    Submitted 23 September, 2022; v1 submitted 18 June, 2021; originally announced June 2021.

    Comments: 42 pages; Accepted to Machine Learning, 2022

  19. arXiv:2105.02487  [pdf, other

    stat.ML cs.LG stat.ME

    High-dimensional Functional Graphical Model Structure Learning via Neighborhood Selection Approach

    Authors: Boxin Zhao, Percy S. Zhai, Y. Samuel Wang, Mladen Kolar

    Abstract: Undirected graphical models are widely used to model the conditional independence structure of vector-valued data. However, in many modern applications, for example those involving EEG and fMRI data, observations are more appropriately modeled as multivariate random functions rather than vectors. Functional graphical models have been proposed to model the conditional independence structure of such… ▽ More

    Submitted 25 January, 2024; v1 submitted 6 May, 2021; originally announced May 2021.

  20. arXiv:2102.09907  [pdf, other

    stat.ML cs.LG

    Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning

    Authors: Luofeng Liao, Zuyue Fu, Zhuoran Yang, Yixin Wang, Mladen Kolar, Zhaoran Wang

    Abstract: In offline reinforcement learning (RL) an optimal policy is learnt solely from a priori collected observational data. However, in observational data, actions are often confounded by unobserved variables. Instrumental variables (IVs), in the context of RL, are the variables whose influence on the state variables are all mediated through the action. When a valid instrument is present, we can recover… ▽ More

    Submitted 12 July, 2021; v1 submitted 19 February, 2021; originally announced February 2021.

    Comments: under review

  21. arXiv:2102.09743  [pdf, other

    cs.LG

    Personalized Federated Learning: A Unified Framework and Universal Optimization Techniques

    Authors: Filip Hanzely, Boxin Zhao, Mladen Kolar

    Abstract: We investigate the optimization aspects of personalized Federated Learning (FL). We propose general optimizers that can be applied to numerous existing personalized FL objectives, specifically a tailored variant of Local SGD and variants of accelerated coordinate descent/accelerated SVRCD. By examining a general personalized objective capable of recovering many existing personalized FL objectives… ▽ More

    Submitted 26 May, 2023; v1 submitted 18 February, 2021; originally announced February 2021.

    Journal ref: Published in Transactions on Machine Learning Research (May/2023)

  22. arXiv:2012.15274  [pdf, other

    stat.ML cs.LG math.OC

    Provably Training Overparameterized Neural Network Classifiers with Non-convex Constraints

    Authors: You-Lin Chen, Zhaoran Wang, Mladen Kolar

    Abstract: Training a classifier under non-convex constraints has gotten increasing attention in the machine learning community thanks to its wide range of applications such as algorithmic fairness and class-imbalanced classification. However, several recent works addressing non-convex constraints have only focused on simple models such as logistic regression or support vector machines. Neural networks, one… ▽ More

    Submitted 27 October, 2022; v1 submitted 30 December, 2020; originally announced December 2020.

  23. arXiv:2011.05601  [pdf, other

    stat.ML cs.LG stat.AP stat.ME

    A Nonconvex Framework for Structured Dynamic Covariance Recovery

    Authors: Katherine Tsai, Mladen Kolar, Oluwasanmi Koyejo

    Abstract: We propose a flexible yet interpretable model for high-dimensional data with time-varying second order statistics, motivated and applied to functional neuroimaging data. Motivated by the neuroscience literature, we factorize the covariances into sparse spatial and smooth temporal components. While this factorization results in both parsimony and domain interpretability, the resulting estimation pr… ▽ More

    Submitted 17 July, 2021; v1 submitted 11 November, 2020; originally announced November 2020.

  24. arXiv:2007.07448  [pdf, other

    stat.ML cs.LG

    Statistical Inference for Networks of High-Dimensional Point Processes

    Authors: Xu Wang, Mladen Kolar, Ali Shojaie

    Abstract: Fueled in part by recent applications in neuroscience, the multivariate Hawkes process has become a popular tool for modeling the network of interactions among high-dimensional point process data. While evaluating the uncertainty of the network estimates is critical in scientific applications, existing methodological and theoretical work has primarily addressed estimation. To bridge this gap, this… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

  25. arXiv:2007.01290  [pdf, other

    stat.ML cs.LG

    Provably Efficient Neural Estimation of Structural Equation Model: An Adversarial Approach

    Authors: Luofeng Liao, You-Lin Chen, Zhuoran Yang, Bo Dai, Zhaoran Wang, Mladen Kolar

    Abstract: Structural equation models (SEMs) are widely used in sciences, ranging from economics to psychology, to uncover causal relationships underlying a complex system under consideration and estimate structural parameters of interest. We study estimation in a class of generalized SEMs where the object of interest is defined as the solution to a linear operator equation. We formulate the linear operator… ▽ More

    Submitted 20 October, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: - v1: Submitted to NeurIPS 2020. Under review - v2: Revised after NeurIPS reviews. Major updates: (i) clean presentation of consistency results; (ii) more references for conditional moment problems - v3: Add references

  26. arXiv:2006.12455  [pdf, ps, other

    math.OC cs.LG stat.ML

    Gradient-Variation Bound for Online Convex Optimization with Constraints

    Authors: Shuang Qiu, Xiaohan Wei, Mladen Kolar

    Abstract: We study online convex optimization with constraints consisting of multiple functional constraints and a relatively simple constraint set, such as a Euclidean ball. As enforcing the constraints at each time step through projections is computationally challenging in general, we allow decisions to violate the functional constraints but aim to achieve a low regret and cumulative violation of the cons… ▽ More

    Submitted 5 December, 2022; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: Accepted in AAAI 2023

  27. arXiv:2003.05402  [pdf, other

    stat.ML cs.LG

    FuDGE: A Method to Estimate a Functional Differential Graph in a High-Dimensional Setting

    Authors: Boxin Zhao, Y. Samuel Wang, Mladen Kolar

    Abstract: We consider the problem of estimating the difference between two undirected functional graphical models with shared structures. In many applications, data are naturally regarded as a vector of random functions rather than as a vector of scalars. For example, electroencephalography (EEG) data are treated more appropriately as functions of time. In such a problem, not only can the number of function… ▽ More

    Submitted 1 April, 2022; v1 submitted 11 March, 2020; originally announced March 2020.

  28. arXiv:2003.01013  [pdf, other

    stat.ML cs.LG

    Semiparametric Nonlinear Bipartite Graph Representation Learning with Provable Guarantees

    Authors: Sen Na, Yuwei Luo, Zhuoran Yang, Zhaoran Wang, Mladen Kolar

    Abstract: Graph representation learning is a ubiquitous task in machine learning where the goal is to embed each vertex into a low-dimensional vector space. We consider the bipartite graph and formalize its representation learning problem as a statistical estimation problem of parameters in a semiparametric exponential family distribution. The bipartite graph is assumed to be generated by a semiparametric e… ▽ More

    Submitted 2 March, 2020; originally announced March 2020.

  29. arXiv:2002.06410  [pdf, other

    stat.ML cs.LG

    Posterior Ratio Estimation of Latent Variables

    Authors: Song Liu, Yulong Zhang, Mingxuan Yi, Mladen Kolar

    Abstract: Density Ratio Estimation has attracted attention from the machine learning community due to its ability to compare the underlying distributions of two datasets. However, in some applications, we want to compare distributions of random variables that are \emph{inferred} from observations. In this paper, we study the problem of estimating the ratio between two posterior probability density functions… ▽ More

    Submitted 25 June, 2020; v1 submitted 15 February, 2020; originally announced February 2020.

  30. arXiv:1912.06875  [pdf, other

    cs.LG math.OC stat.ML

    Natural Actor-Critic Converges Globally for Hierarchical Linear Quadratic Regulator

    Authors: Yuwei Luo, Zhuoran Yang, Zhaoran Wang, Mladen Kolar

    Abstract: Multi-agent reinforcement learning has been successfully applied to a number of challenging problems. Despite these empirical successes, theoretical understanding of different algorithms is lacking, primarily due to the curse of dimensionality caused by the exponential growth of the state-action space with the number of agents. We study a fundamental problem of multi-agent linear quadratic regulat… ▽ More

    Submitted 24 December, 2021; v1 submitted 14 December, 2019; originally announced December 2019.

  31. arXiv:1910.12156  [pdf, other

    cs.LG stat.ML

    Convergent Policy Optimization for Safe Reinforcement Learning

    Authors: Ming Yu, Zhuoran Yang, Mladen Kolar, Zhaoran Wang

    Abstract: We study the safe reinforcement learning problem with nonlinear function approximation, where policy optimization is formulated as a constrained optimization problem with both the objective and the constraint being nonconvex functions. For such a problem, we construct a sequence of surrogate convex constrained optimization problems by replacing the nonconvex functions locally with convex quadratic… ▽ More

    Submitted 26 October, 2019; originally announced October 2019.

  32. arXiv:1910.09701  [pdf, other

    stat.ML cs.LG stat.ME

    Direct Estimation of Differential Functional Graphical Models

    Authors: Boxin Zhao, Y. Samuel Wang, Mladen Kolar

    Abstract: We consider the problem of estimating the difference between two functional undirected graphical models with shared structures. In many applications, data are naturally regarded as high-dimensional random function vectors rather than multivariate scalars. For example, electroencephalography (EEG) data are more appropriately treated as functions of time. In these problems, not only can the number o… ▽ More

    Submitted 16 November, 2019; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: 21 pages, 3 figures, to be published in NeurIPS 2019; added link to code

  33. arXiv:1906.05358  [pdf, other

    stat.ML cs.LG stat.ME

    Tensor Canonical Correlation Analysis with Convergence and Statistical Guarantees

    Authors: You-Lin Chen, Mladen Kolar, Ruey S. Tsay

    Abstract: In many applications, such as classification of images or videos, it is of interest to develop a framework for tensor data instead of an ad-hoc way of transforming data to vectors due to the computational and under-sampling issues. In this paper, we study convergence and statistical properties of two-dimensional canonical correlation analysis \citep{Lee2007Two} under an assumption that data come f… ▽ More

    Submitted 11 November, 2020; v1 submitted 12 June, 2019; originally announced June 2019.

  34. arXiv:1906.03362  [pdf, other

    cs.LG stat.ML

    Partially Linear Additive Gaussian Graphical Models

    Authors: Sinong Geng, Minhao Yan, Mladen Kolar, Oluwasanmi Koyejo

    Abstract: We propose a partially linear additive Gaussian graphical model (PLA-GGM) for the estimation of associations between random variables distorted by observed confounders. Model parameters are estimated using an $L_1$-regularized maximal pseudo-profile likelihood estimator (MaPPLE) for which we prove $\sqrt{n}$-sparsistency. Importantly, our approach avoids parametric constraints on the effects of co… ▽ More

    Submitted 7 June, 2019; originally announced June 2019.

  35. arXiv:1811.10790  [pdf, other

    math.ST cs.LG stat.ML

    High-dimensional Index Volatility Models via Stein's Identity

    Authors: Sen Na, Mladen Kolar

    Abstract: We study the estimation of the parametric components of single and multiple index volatility models. Using the first- and second-order Stein's identities, we develop methods that are applicable for the estimation of the variance index in the high-dimensional setting requiring finite moment condition, which allows for heavy-tailed data. Our approach complements the existing literature in the low-di… ▽ More

    Submitted 25 May, 2020; v1 submitted 26 November, 2018; originally announced November 2018.

    Comments: 44 pages

  36. arXiv:1810.11098  [pdf, other

    stat.ML cs.LG

    Provable Gaussian Embedding with One Observation

    Authors: Ming Yu, Zhuoran Yang, Tuo Zhao, Mladen Kolar, Zhaoran Wang

    Abstract: The success of machine learning methods heavily relies on having an appropriate representation for data at hand. Traditionally, machine learning approaches relied on user-defined heuristics to extract features encoding structural information about data. However, recently there has been a surge in approaches that learn how to encode the data automatically in a low dimensional space. Exponential fam… ▽ More

    Submitted 25 October, 2018; originally announced October 2018.

  37. arXiv:1810.07147  [pdf, other

    stat.ML cs.LG

    Joint Nonparametric Precision Matrix Estimation with Confounding

    Authors: Sinong Geng, Mladen Kolar, Oluwasanmi Koyejo

    Abstract: We consider the problem of precision matrix estimation where, due to extraneous confounding of the underlying precision matrix, the data are independent but not identically distributed. While such confounding occurs in many scientific problems, our approach is inspired by recent neuroscientific research suggesting that brain function, as measured using functional magnetic resonance imagine (fMRI),… ▽ More

    Submitted 27 June, 2019; v1 submitted 16 October, 2018; originally announced October 2018.

  38. arXiv:1810.07128  [pdf, other

    stat.ML cs.LG stat.CO

    High-dimensional Varying Index Coefficient Models via Stein's Identity

    Authors: Sen Na, Zhuoran Yang, Zhaoran Wang, Mladen Kolar

    Abstract: We study the parameter estimation problem for a varying index coefficient model in high dimensions. Unlike the most existing works that iteratively estimate the parameters and link functions, based on the generalized Stein's identity, we propose computationally efficient estimators for the high-dimensional parameters without estimating the link functions. We consider two different setups where we… ▽ More

    Submitted 25 October, 2019; v1 submitted 16 October, 2018; originally announced October 2018.

    Comments: 44 pages

  39. arXiv:1806.05730  [pdf, other

    stat.ML cs.LG

    Learning Influence-Receptivity Network Structure with Guarantee

    Authors: Ming Yu, Varun Gupta, Mladen Kolar

    Abstract: Traditional works on community detection from observations of information cascade assume that a single adjacency matrix parametrizes all the observed cascades. However, in reality the connection structure usually does not stay the same across cascades. For example, different people have different topics of interest, therefore the connection structure depends on the information/topic content of the… ▽ More

    Submitted 10 April, 2019; v1 submitted 14 June, 2018; originally announced June 2018.

  40. arXiv:1802.03830  [pdf, other

    stat.ML cs.LG

    Distributed Stochastic Multi-Task Learning with Graph Regularization

    Authors: Weiran Wang, Jialei Wang, Mladen Kolar, Nathan Srebro

    Abstract: We propose methods for distributed graph-based multi-task learning that are based on weighted averaging of messages from other machines. Uniform averaging or diminishing stepsize in these methods would yield consensus (single task) learning. We show how simply skewing the averaging weights or controlling the stepsize allows learning different, but related, tasks on the different machines.

    Submitted 11 February, 2018; originally announced February 2018.

  41. arXiv:1711.04955  [pdf, other

    stat.ML cs.LG

    Scalable Peaceman-Rachford Splitting Method with Proximal Terms

    Authors: Sen Na, Mingyuan Ma, Mladen Kolar

    Abstract: Along with develo** of Peaceman-Rachford Splittling Method (PRSM), many batch algorithms based on it have been studied very deeply. But almost no algorithm focused on the performance of stochastic version of PRSM. In this paper, we propose a new stochastic algorithm based on PRSM, prove its convergence rate in ergodic sense, and test its performance on both artificial and real data. We show that… ▽ More

    Submitted 9 February, 2018; v1 submitted 14 November, 2017; originally announced November 2017.

  42. arXiv:1709.01919  [pdf, other

    stat.ML cs.LG cs.SI

    Estimation of a Low-rank Topic-Based Model for Information Cascades

    Authors: Ming Yu, Varun Gupta, Mladen Kolar

    Abstract: We consider the problem of estimating the latent structure of a social network based on the observed information diffusion events, or cascades, where the observations for a given cascade consist of only the timestamps of infection for infected nodes but not the source of the infection. Most of the existing work on this problem has focused on estimating a diffusion matrix without any structural ass… ▽ More

    Submitted 25 March, 2020; v1 submitted 6 September, 2017; originally announced September 2017.

  43. arXiv:1610.03045  [pdf, other

    cs.LG math.OC stat.ML

    Sketching Meets Random Projection in the Dual: A Provable Recovery Algorithm for Big and High-dimensional Data

    Authors: Jialei Wang, Jason D. Lee, Mehrdad Mahdavi, Mladen Kolar, Nathan Srebro

    Abstract: Sketching techniques have become popular for scaling up machine learning algorithms by reducing the sample size or dimensionality of massive data sets, while still maintaining the statistical power of big data. In this paper, we study sketching from an optimization point of view: we first show that the iterative Hessian sketch is an optimization process with preconditioning, and develop accelerate… ▽ More

    Submitted 10 October, 2016; originally announced October 2016.

  44. arXiv:1605.07991  [pdf, other

    stat.ML cs.LG

    Efficient Distributed Learning with Sparsity

    Authors: Jialei Wang, Mladen Kolar, Nathan Srebro, Tong Zhang

    Abstract: We propose a novel, efficient approach for distributed sparse learning in high-dimensions, where observations are randomly partitioned across machines. Computationally, at each round our method only requires the master machine to solve a shifted ell_1 regularized M-estimation problem, and other workers to compute the gradient. In respect of communication, the proposed approach provably matches the… ▽ More

    Submitted 25 May, 2016; originally announced May 2016.

  45. arXiv:1603.02185  [pdf, ps, other

    cs.LG stat.ML

    Distributed Multi-Task Learning with Shared Representation

    Authors: Jialei Wang, Mladen Kolar, Nathan Srebro

    Abstract: We study the problem of distributed multi-task learning with shared representation, where each machine aims to learn a separate, but related, task in an unknown shared low-dimensional subspaces, i.e. when the predictor matrix has low rank. We consider a setting where each task is handled by a different machine, with samples for the task available locally on the machine, and study communication-eff… ▽ More

    Submitted 7 March, 2016; originally announced March 2016.

  46. arXiv:1510.00633  [pdf, other

    stat.ML cs.LG

    Distributed Multitask Learning

    Authors: Jialei Wang, Mladen Kolar, Nathan Srebro

    Abstract: We consider the problem of distributed multi-task learning, where each machine learns a separate, but related, task. Specifically, each machine learns a linear predictor in high-dimensional space,where all tasks share the same small support. We present a communication-efficient estimator based on the debiased lasso and show that it is comparable with the optimal centralized method.

    Submitted 2 October, 2015; originally announced October 2015.

  47. arXiv:1506.03995  [pdf, other

    cs.CV

    Technical Report: Image Captioning with Semantically Similar Images

    Authors: Martin Kolář, Michal Hradiš, Pavel Zemčík

    Abstract: This report presents our submission to the MS COCO Captioning Challenge 2015. The method uses Convolutional Neural Network activations as an embedding to find semantically similar images. From these images, the most typical caption is selected based on unigram frequencies. Although the method received low scores with automated evaluation metrics and in human assessed average correctness, it is com… ▽ More

    Submitted 12 June, 2015; originally announced June 2015.

    Comments: 3 pages

  48. arXiv:1502.07641  [pdf, other

    math.ST cs.LG

    ROCKET: Robust Confidence Intervals via Kendall's Tau for Transelliptical Graphical Models

    Authors: Rina Foygel Barber, Mladen Kolar

    Abstract: Undirected graphical models are used extensively in the biological and social sciences to encode a pattern of conditional independences between variables, where the absence of an edge between two nodes $a$ and $b$ indicates that the corresponding two variables $X_a$ and $X_b$ are believed to be conditionally independent, after controlling for all other measured variables. In the Gaussian case, con… ▽ More

    Submitted 1 September, 2017; v1 submitted 26 February, 2015; originally announced February 2015.

  49. arXiv:1309.6933  [pdf, other

    math.ST cs.LG stat.ML

    Estimating Undirected Graphs Under Weak Assumptions

    Authors: Larry Wasserman, Mladen Kolar, Alessandro Rinaldo

    Abstract: We consider the problem of providing nonparametric confidence guarantees for undirected graphs under weak assumptions. In particular, we do not assume sparsity, incoherence or Normality. We allow the dimension $D$ to increase with the sample size $n$. First, we prove lower bounds that show that if we want accurate inferences with low assumptions then there are limitations on the dimension as a fun… ▽ More

    Submitted 26 September, 2013; originally announced September 2013.

    MSC Class: 62H12