Skip to main content

Showing 1–39 of 39 results for author: Kanade, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.09347  [pdf, other

    cs.LG stat.ML

    Separations in the Representational Capabilities of Transformers and Recurrent Architectures

    Authors: Satwik Bhattamishra, Michael Hahn, Phil Blunsom, Varun Kanade

    Abstract: Transformer architectures have been widely adopted in foundation models. Due to their high inference costs, there is renewed interest in exploring the potential of efficient recurrent architectures (RNNs). In this paper, we analyze the differences in the representational capabilities of Transformers and RNNs across several tasks of practical relevance, including index lookup, nearest neighbor, rec… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Preprint

  2. arXiv:2310.03016  [pdf, other

    cs.LG cs.CL

    Understanding In-Context Learning in Transformers and LLMs by Learning to Learn Discrete Functions

    Authors: Satwik Bhattamishra, Arkil Patel, Phil Blunsom, Varun Kanade

    Abstract: In order to understand the in-context learning phenomenon, recent works have adopted a stylized experimental framework and demonstrated that Transformers can learn gradient-based learning algorithms for various classes of real-valued functions. However, the limitations of Transformers in implementing learning algorithms, and their ability to learn other forms of algorithms are not well understood.… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: Preprint

  3. arXiv:2211.12316  [pdf, other

    cs.LG cs.CL

    Simplicity Bias in Transformers and their Ability to Learn Sparse Boolean Functions

    Authors: Satwik Bhattamishra, Arkil Patel, Varun Kanade, Phil Blunsom

    Abstract: Despite the widespread success of Transformers on NLP tasks, recent works have found that they struggle to model several formal languages when compared to recurrent models. This raises the question of why Transformers perform well in practice and whether they have any properties that enable them to generalize better than recurrent models. In this work, we conduct an extensive empirical study on Bo… ▽ More

    Submitted 10 July, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: ACL 2023

  4. arXiv:2210.06089  [pdf, other

    cs.LG

    When are Local Queries Useful for Robust Learning?

    Authors: Pascale Gourdeau, Varun Kanade, Marta Kwiatkowska, James Worrell

    Abstract: Distributional assumptions have been shown to be necessary for the robust learnability of concept classes when considering the exact-in-the-ball robust risk and access to random examples by Gourdeau et al. (2019). In this paper, we study learning models where the learner is given more power through the use of local queries, and give the first distribution-free algorithms that perform robust empiri… ▽ More

    Submitted 20 July, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted to NeurIPS 2022; V2 contains new results (Section 3.6) and an erratum from the previous version (Appendix C)

  5. arXiv:2208.12063  [pdf, other

    cs.LG cs.DS cs.IR

    Partial Matrix Completion

    Authors: Elad Hazan, Adam Tauman Kalai, Varun Kanade, Clara Mohri, Y. Jennifer Sun

    Abstract: The matrix completion problem aims to reconstruct a low-rank matrix based on a revealed set of possibly noisy entries. Prior works consider completing the entire matrix with generalization error guarantees. However, the completion accuracy can be drastically different over different entries. This work establishes a new framework of partial matrix completion, where the goal is to identify a large s… ▽ More

    Submitted 17 December, 2023; v1 submitted 25 August, 2022; originally announced August 2022.

    Comments: NeurIPS 2023

  6. arXiv:2205.12327  [pdf, other

    cs.LG cs.CY

    Beyond Impossibility: Balancing Sufficiency, Separation and Accuracy

    Authors: Limor Gultchin, Vincent Cohen-Addad, Sophie Giffard-Roisin, Varun Kanade, Frederik Mallmann-Trenn

    Abstract: Among the various aspects of algorithmic fairness studied in recent years, the tension between satisfying both \textit{sufficiency} and \textit{separation} -- e.g. the ratios of positive or negative predictive values, and false positive or false negative rates across groups -- has received much attention. Following a debate sparked by COMPAS, a criminal justice predictive system, the academic comm… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

  7. arXiv:2205.06127  [pdf, ps, other

    cs.LG stat.ML

    Sample Complexity Bounds for Robustly Learning Decision Lists against Evasion Attacks

    Authors: Pascale Gourdeau, Varun Kanade, Marta Kwiatkowska, James Worrell

    Abstract: A fundamental problem in adversarial machine learning is to quantify how much training data is needed in the presence of evasion attacks. In this paper we address this issue within the framework of PAC learning, focusing on the class of decision lists. Given that distributional assumptions are essential in the adversarial setting, we work with probability distributions on the input data that satis… ▽ More

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: To appear in the proceedings of International Joint Conference on Artificial Intelligence (2022)

  8. arXiv:2202.11461  [pdf, other

    math.ST cs.LG stat.ML

    Exponential Tail Local Rademacher Complexity Risk Bounds Without the Bernstein Condition

    Authors: Varun Kanade, Patrick Rebeschini, Tomas Vaskevicius

    Abstract: The local Rademacher complexity framework is one of the most successful general-purpose toolboxes for establishing sharp excess risk bounds for statistical estimators based on the framework of empirical risk minimization. Applying this toolbox typically requires using the Bernstein condition, which often restricts applicability to convex and proper settings. Recent years have witnessed several exa… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.

  9. arXiv:2105.14119  [pdf, other

    cs.LG cs.AI cs.DS stat.ML

    Towards optimally abstaining from prediction with OOD test examples

    Authors: Adam Tauman Kalai, Varun Kanade

    Abstract: A common challenge across all areas of machine learning is that training data is not distributed like test data, due to natural shifts, "blind spots," or adversarial examples; such test examples are referred to as out-of-distribution (OOD) test examples. We consider a model where one may abstain from predicting, at a fixed cost. In particular, our transductive abstention algorithm takes labeled tr… ▽ More

    Submitted 27 October, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: In NeurIPS 2021 (+spotlight), 24 pages

  10. arXiv:2102.07802  [pdf, ps, other

    cs.LG stat.ML

    Efficient Learning with Arbitrary Covariate Shift

    Authors: Adam Kalai, Varun Kanade

    Abstract: We give an efficient algorithm for learning a binary function in a given class C of bounded VC dimension, with training data distributed according to P and test data according to Q, where P and Q may be arbitrary distributions over X. This is the generic form of what is called covariate shift, which is impossible in general as arbitrary P and Q may not even overlap. However, recently guarantees we… ▽ More

    Submitted 15 February, 2021; originally announced February 2021.

  11. arXiv:2007.08243  [pdf, ps, other

    cs.LG stat.ML

    Lottery Tickets in Linear Models: An Analysis of Iterative Magnitude Pruning

    Authors: Bryn Elesedy, Varun Kanade, Yee Whye Teh

    Abstract: We analyse the pruning procedure behind the lottery ticket hypothesis arXiv:1803.03635v5, iterative magnitude pruning (IMP), when applied to linear models trained by gradient flow. We begin by presenting sufficient conditions on the statistical structure of the features under which IMP prunes those features that have smallest projection onto the data. Following this, we explore IMP as a method for… ▽ More

    Submitted 5 July, 2021; v1 submitted 16 July, 2020; originally announced July 2020.

    Comments: Updated for Sparsity in Neural Networks Workshop

    ACM Class: I.5.1

  12. arXiv:2007.04028  [pdf, other

    cs.LG stat.ML

    How benign is benign overfitting?

    Authors: Amartya Sanyal, Puneet K Dokania, Varun Kanade, Philip H. S. Torr

    Abstract: We investigate two causes for adversarial vulnerability in deep neural networks: bad data and (poorly) trained models. When trained with SGD, deep neural networks essentially achieve zero training error, even in the presence of label noise, while also exhibiting good generalization on natural test data, something referred to as benign overfitting [2, 10]. However, these models are vulnerable to ad… ▽ More

    Submitted 8 July, 2020; originally announced July 2020.

  13. arXiv:2003.01461  [pdf, other

    cs.LG stat.ML

    Differentiable Causal Backdoor Discovery

    Authors: Limor Gultchin, Matt J. Kusner, Varun Kanade, Ricardo Silva

    Abstract: Discovering the causal effect of a decision is critical to nearly all forms of decision-making. In particular, it is a key quantity in drug development, in crafting government policy, and when implementing a real-world machine learning system. Given only observational data, confounders often obscure the true causal effect. Luckily, in some cases, it is possible to recover the causal effect by usin… ▽ More

    Submitted 3 March, 2020; originally announced March 2020.

    Comments: Published in the Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS) 2020, Palermo, Italy

  14. arXiv:2002.00189  [pdf, other

    stat.ML cs.LG

    The Statistical Complexity of Early-Stopped Mirror Descent

    Authors: Tomas Vaškevičius, Varun Kanade, Patrick Rebeschini

    Abstract: Recently there has been a surge of interest in understanding implicit regularization properties of iterative gradient-based optimization algorithms. In this paper, we study the statistical guarantees on the excess risk achieved by early-stopped unconstrained mirror descent algorithms applied to the unregularized empirical risk with the squared loss for linear models and kernel methods. By completi… ▽ More

    Submitted 27 August, 2020; v1 submitted 1 February, 2020; originally announced February 2020.

  15. arXiv:1909.06861  [pdf, other

    cs.LG stat.ML

    Online k-means Clustering

    Authors: Vincent Cohen-Addad, Benjamin Guedj, Varun Kanade, Guy Rom

    Abstract: We study the problem of online clustering where a clustering algorithm has to assign a new point that arrives to one of $k$ clusters. The specific formulation we use is the $k$-means objective: At each time step the algorithm has to maintain a set of k candidate centers and the loss incurred is the squared distance between the new point and the closest center. The goal is to minimize regret with r… ▽ More

    Submitted 15 September, 2019; originally announced September 2019.

    Comments: 11 pages, 1 figure

    Journal ref: Proceedings of The 24th International Conference on Artificial Intelligence and Statistics (AISTATS), PMLR 130:1126-1134, 2021

  16. arXiv:1909.05822  [pdf, other

    cs.LG cs.CC stat.ML

    On the Hardness of Robust Classification

    Authors: Pascale Gourdeau, Varun Kanade, Marta Kwiatkowska, James Worrell

    Abstract: It is becoming increasingly important to understand the vulnerability of machine learning models to adversarial attacks. In this paper we study the feasibility of robust learning from the perspective of computational learning theory, considering both sample and computational complexity. In particular, our definition of robust learnability requires polynomial sample complexity. We start with two ne… ▽ More

    Submitted 12 September, 2019; originally announced September 2019.

    Comments: To appear in the proceedings of Neural Information Processing Systems Conference (2019)

  17. arXiv:1909.05122  [pdf, other

    stat.ML cs.LG eess.SP

    Implicit Regularization for Optimal Sparse Recovery

    Authors: Tomas Vaškevičius, Varun Kanade, Patrick Rebeschini

    Abstract: We investigate implicit regularization schemes for gradient descent methods applied to unpenalized least squares regression to solve the problem of reconstructing a sparse signal from an underdetermined system of linear measurements under the restricted isometry assumption. For a given parametrization yielding a non-convex optimization problem, we show that prescribed choices of initialization, st… ▽ More

    Submitted 11 September, 2019; originally announced September 2019.

    Comments: To appear in NeurIPS 2019

  18. arXiv:1905.11566  [pdf, other

    cs.DS cs.LG

    Adaptive Reduced Rank Regression

    Authors: Qiong Wu, Felix Ming Fai Wong, Zhenming Liu, Yanhua Li, Varun Kanade

    Abstract: We study the low rank regression problem $\my = M\mx + ε$, where $\mx$ and $\my$ are $d_1$ and $d_2$ dimensional vectors respectively. We consider the extreme high-dimensional setting where the number of observations $n$ is less than $d_1 + d_2$. Existing algorithms are designed for settings where $n$ is typically as large as $\Rank(M)(d_1+d_2)$. This work provides an efficient algorithm which onl… ▽ More

    Submitted 23 October, 2020; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: 36 pages

  19. arXiv:1810.04468  [pdf, other

    cs.LG stat.ML

    Decentralized Cooperative Stochastic Bandits

    Authors: David Martínez-Rubio, Varun Kanade, Patrick Rebeschini

    Abstract: We study a decentralized cooperative stochastic multi-armed bandit problem with $K$ arms on a network of $N$ agents. In our model, the reward distribution of each arm is the same for each agent and rewards are drawn independently across agents and time steps. In each round, each agent chooses an arm to play and subsequently sends a message to her neighbors. The goal is to minimize the overall regr… ▽ More

    Submitted 24 October, 2019; v1 submitted 10 October, 2018; originally announced October 2018.

  20. arXiv:1808.01857  [pdf, other

    math.ST cs.LG stat.ML

    Statistical Windows in Testing for the Initial Distribution of a Reversible Markov Chain

    Authors: Quentin Berthet, Varun Kanade

    Abstract: We study the problem of hypothesis testing between two discrete distributions, where we only have access to samples after the action of a known reversible Markov chain, playing the role of noise. We derive instance-dependent minimax rates for the sample complexity of this problem, and show how its dependence in time is related to the spectral properties of the Markov chain. We show that there exis… ▽ More

    Submitted 6 August, 2018; originally announced August 2018.

    MSC Class: 62C20

  21. arXiv:1806.03461  [pdf, other

    cs.CR cs.LG stat.ML

    TAPAS: Tricks to Accelerate (encrypted) Prediction As a Service

    Authors: Amartya Sanyal, Matt J. Kusner, Adrià Gascón, Varun Kanade

    Abstract: Machine learning methods are widely used for a variety of prediction problems. \emph{Prediction as a service} is a paradigm in which service providers with technological expertise and computational resources may perform predictions for clients. However, data privacy severely restricts the applicability of such services, unless measures to keep client data private (even from the service provider) a… ▽ More

    Submitted 9 June, 2018; originally announced June 2018.

    Comments: Accepted at International Conference in Machine Learning (ICML), 2018

  22. arXiv:1804.07090  [pdf, other

    cs.LG cs.AI stat.ML

    Robustness via Deep Low-Rank Representations

    Authors: Amartya Sanyal, Varun Kanade, Philip H. S. Torr, Puneet K. Dokania

    Abstract: We investigate the effect of the dimensionality of the representations learned in Deep Neural Networks (DNNs) on their robustness to input perturbations, both adversarial and random. To achieve low dimensionality of learned representations, we propose an easy-to-use, end-to-end trainable, low-rank regularizer (LR) that can be applied to any intermediate layer representation of a DNN. This regulari… ▽ More

    Submitted 19 February, 2020; v1 submitted 19 April, 2018; originally announced April 2018.

  23. arXiv:1802.05690  [pdf, ps, other

    quant-ph cs.DM cs.LG

    Learning DNFs under product distributions via μ-biased quantum Fourier sampling

    Authors: Varun Kanade, Andrea Rocchetto, Simone Severini

    Abstract: We show that DNF formulae can be quantum PAC-learned in polynomial time under product distributions using a quantum example oracle. The best classical algorithm (without access to membership queries) runs in superpolynomial time. Our result extends the work by Bshouty and Jackson (1998) that proved that DNF formulae are efficiently learnable under the uniform distribution using a quantum example o… ▽ More

    Submitted 25 November, 2019; v1 submitted 15 February, 2018; originally announced February 2018.

    Comments: 17 pages; v3 based on journal version; minor corrections and clarifications

    Journal ref: Quantum Information and Computation, Vol. 19, No. 15&16 (2019) 1261-1278

  24. arXiv:1711.00982  [pdf, other

    cs.LG cs.SI stat.ML

    From which world is your graph?

    Authors: Cheng Li, Felix Wong, Zhenming Liu, Varun Kanade

    Abstract: Discovering statistical structure from links is a fundamental problem in the analysis of social networks. Choosing a misspecified model, or equivalently, an incorrect inference algorithm will result in an invalid analysis or even falsely uncover patterns that are in fact artifacts of the model. This work focuses on unifying two of the most widely used link-formation models: the stochastic blockmod… ▽ More

    Submitted 2 November, 2017; originally announced November 2017.

    Comments: To appear in NIPS 2017

  25. arXiv:1704.02147  [pdf, ps, other

    cs.DS cs.LG

    Hierarchical Clustering: Objective Functions and Algorithms

    Authors: Vincent Cohen-Addad, Varun Kanade, Frederik Mallmann-Trenn, Claire Mathieu

    Abstract: Hierarchical clustering is a recursive partitioning of a dataset into clusters at an increasingly finer granularity. Motivated by the fact that most work on hierarchical clustering was based on providing algorithms, rather than optimizing a specific objective, Dasgupta framed similarity-based hierarchical clustering as a combinatorial optimization problem, where a `good' hierarchical clustering is… ▽ More

    Submitted 7 April, 2017; originally announced April 2017.

  26. arXiv:1702.03959  [pdf, other

    cs.DS

    How large is your graph?

    Authors: Varun Kanade, Frederik Mallmann-Trenn, Victor Verdugo

    Abstract: We consider the problem of estimating the graph size, where one is given only local access to the graph. We formally define a query model in which one starts with a \emph{seed} node and is allowed to make queries about neighbours of nodes that have already been seen. In the case of undirected graphs, an estimator of Katzir et al. (2014) based on a sample from the stationary distribution $π$ uses… ▽ More

    Submitted 13 February, 2017; originally announced February 2017.

  27. arXiv:1611.10258  [pdf, ps, other

    cs.LG cs.CC stat.ML

    Reliably Learning the ReLU in Polynomial Time

    Authors: Surbhi Goel, Varun Kanade, Adam Klivans, Justin Thaler

    Abstract: We give the first dimension-efficient algorithms for learning Rectified Linear Units (ReLUs), which are functions of the form $\mathbf{x} \mapsto \max(0, \mathbf{w} \cdot \mathbf{x})$ with $\mathbf{w} \in \mathbb{S}^{n-1}$. Our algorithm works in the challenging Reliable Agnostic learning model of Kalai, Kanade, and Mansour (2009) where the learner is given access to a distribution $\cal{D}$ on la… ▽ More

    Submitted 30 November, 2016; originally announced November 2016.

  28. arXiv:1611.02460  [pdf, other

    cs.DM math.CO

    On coalescence time in graphs--When is coalescing as fast as meeting?

    Authors: Varun Kanade, Frederik Mallmann-Trenn, Thomas Sauerwald

    Abstract: Coalescing random walks is a fundamental stochastic process, where a set of particles perform independent discrete-time random walks on an undirected graph. Whenever two or more particles meet at a given node, they merge and continue as a single random walk. The coalescence time is defined as the expected time until only one particle remains, starting from one particle at every node. Despite recen… ▽ More

    Submitted 1 November, 2018; v1 submitted 8 November, 2016; originally announced November 2016.

  29. arXiv:1604.01999  [pdf, other

    cs.LG stat.ML

    Online Optimization of Smoothed Piecewise Constant Functions

    Authors: Vincent Cohen-Addad, Varun Kanade

    Abstract: We study online optimization of smoothed piecewise constant functions over the domain [0, 1). This is motivated by the problem of adaptively picking parameters of learning algorithms as in the recently introduced framework by Gupta and Roughgarden (2016). Majority of the machine learning literature has focused on Lipschitz-continuous functions or functions with bounded gradients. 1 This is with go… ▽ More

    Submitted 20 May, 2016; v1 submitted 7 April, 2016; originally announced April 2016.

  30. arXiv:1509.01988  [pdf, ps, other

    cs.GT cs.DS

    Stable Matching with Evolving Preferences

    Authors: Varun Kanade, Nikos Leonardos, Frédéric Magniez

    Abstract: We consider the problem of stable matching with dynamic preference lists. At each time step, the preference list of some player may change by swap** random adjacent members. The goal of a central agency (algorithm) is to maintain an approximately stable matching (in terms of number of blocking pairs) at all times. The changes in the preference lists are not reported to the algorithm, but must in… ▽ More

    Submitted 28 June, 2016; v1 submitted 7 September, 2015; originally announced September 2015.

    Comments: 13 pages

  31. arXiv:1505.05215  [pdf, ps, other

    cs.LG

    Learning with a Drifting Target Concept

    Authors: Steve Hanneke, Varun Kanade, Liu Yang

    Abstract: We study the problem of learning in the presence of a drifting target concept. Specifically, we provide bounds on the error rate at a given time, given a learner with access to a history of independent samples labeled according to a target concept that can change on each round. One of our main contributions is a refinement of the best previous results for polynomial-time algorithms for the space o… ▽ More

    Submitted 19 May, 2015; originally announced May 2015.

  32. arXiv:1404.6325  [pdf, other

    math.PR cs.SI

    Global and Local Information in Clustering Labeled Block Models

    Authors: Varun Kanade, Elchanan Mossel, Tselil Schramm

    Abstract: The stochastic block model is a classical cluster-exhibiting random graph model that has been widely studied in statistics, physics and computer science. In its simplest form, the model is a random graph with two equal-sized clusters, with intra-cluster edge probability p, and inter-cluster edge probability q. We focus on the sparse case, i.e., p, q = O(1/n), which is practically more relevant and… ▽ More

    Submitted 3 July, 2014; v1 submitted 25 April, 2014; originally announced April 2014.

    Comments: 24 pages, 2 figures. A short abstract describing these results will appear in proceedings of RANDOM 2014

  33. arXiv:1402.5164  [pdf, ps, other

    cs.LG cs.CC cs.DS

    Distribution-Independent Reliable Learning

    Authors: Varun Kanade, Justin Thaler

    Abstract: We study several questions in the reliable agnostic learning framework of Kalai et al. (2009), which captures learning tasks in which one type of error is costlier than others. A positive reliable classifier is one that makes no false positive errors. The goal in the positive reliable agnostic framework is to output a hypothesis with the following properties: (i) its false positive error rate is a… ▽ More

    Submitted 20 February, 2014; originally announced February 2014.

    Comments: 20 pages

  34. arXiv:1309.4132  [pdf, other

    cs.LG q-bio.PE

    Attribute-Efficient Evolvability of Linear Functions

    Authors: Elaine Angelino, Varun Kanade

    Abstract: In a seminal paper, Valiant (2006) introduced a computational model for evolution to address the question of complexity that can arise through Darwinian mechanisms. Valiant views evolution as a restricted form of computational learning, where the goal is to evolve a hypothesis that is close to the ideal function. Feldman (2008) showed that (correlational) statistical query learning algorithms coul… ▽ More

    Submitted 3 April, 2014; v1 submitted 16 September, 2013; originally announced September 2013.

  35. arXiv:1307.3617  [pdf, other

    cs.LG stat.ML

    MCMC Learning

    Authors: Varun Kanade, Elchanan Mossel

    Abstract: The theory of learning under the uniform distribution is rich and deep, with connections to cryptography, computational complexity, and the analysis of boolean functions to name a few areas. This theory however is very limited due to the fact that the uniform distribution and the corresponding Fourier basis are rarely encountered as a statistical model. A family of distributions that vastly gene… ▽ More

    Submitted 12 June, 2015; v1 submitted 13 July, 2013; originally announced July 2013.

    Comments: 28 pages, 1 figure

  36. arXiv:1211.3212  [pdf, other

    cs.LG cs.AI

    Distributed Non-Stochastic Experts

    Authors: Varun Kanade, Zhenming Liu, Bozidar Radunovic

    Abstract: We consider the online distributed non-stochastic experts problem, where the distributed system consists of one coordinator node that is connected to $k$ sites, and the sites are required to communicate with each other via the coordinator. At each time-step $t$, one of the $k$ site nodes has to pick an expert from the set ${1, ..., n}$, and the same site receives information about payoffs of all e… ▽ More

    Submitted 14 November, 2012; originally announced November 2012.

  37. arXiv:1211.0996  [pdf, ps, other

    cs.LG cs.AI

    Learning using Local Membership Queries

    Authors: Pranjal Awasthi, Vitaly Feldman, Varun Kanade

    Abstract: We introduce a new model of membership query (MQ) learning, where the learning algorithm is restricted to query points that are \emph{close} to random examples drawn from the underlying distribution. The learning model is intermediate between the PAC model (Valiant, 1984) and the PAC+MQ model (where the queries are allowed to be arbitrary points). Membership query algorithms are not popular amon… ▽ More

    Submitted 17 April, 2013; v1 submitted 5 November, 2012; originally announced November 2012.

  38. arXiv:1104.2018  [pdf, other

    cs.AI cs.LG stat.ML

    Efficient Learning of Generalized Linear and Single Index Models with Isotonic Regression

    Authors: Sham Kakade, Adam Tauman Kalai, Varun Kanade, Ohad Shamir

    Abstract: Generalized Linear Models (GLMs) and Single Index Models (SIMs) provide powerful generalizations of linear regression, where the target variable is assumed to be a (possibly unknown) 1-dimensional function of a linear predictor. In general, these problems entail non-convex estimation procedures, and, in practice, iterative local search heuristics are often used. Kalai and Sastry (2009) recently pr… ▽ More

    Submitted 11 April, 2011; originally announced April 2011.

  39. arXiv:1005.3566  [pdf, ps, other

    cs.LG

    Evolution with Drifting Targets

    Authors: Varun Kanade, Leslie G. Valiant, Jennifer Wortman Vaughan

    Abstract: We consider the question of the stability of evolutionary algorithms to gradual changes, or drift, in the target concept. We define an algorithm to be resistant to drift if, for some inverse polynomial drift rate in the target function, it converges to accuracy 1 -- ε, with polynomial resources, and then stays within that accuracy indefinitely, except with probability ε, at any one time. We show t… ▽ More

    Submitted 19 May, 2010; originally announced May 2010.