Skip to main content

Showing 1–37 of 37 results for author: Jawanpuria, P

.
  1. arXiv:2406.04914  [pdf, other

    cs.LG

    Submodular Framework for Structured-Sparse Optimal Transport

    Authors: Piyushi Manupriya, Pratik Jawanpuria, Karthik S. Gurumoorthy, SakethaNath Jagarlapudi, Bamdev Mishra

    Abstract: Unbalanced optimal transport (UOT) has recently gained much attention due to its flexible framework for handling un-normalized measures and its robustness properties. In this work, we explore learning (structured) sparse transport plans in the UOT setting, i.e., transport plans have an upper bound on the number of non-sparse entries in each column (structured sparse pattern) or in the whole plan (… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  2. arXiv:2406.02225  [pdf, other

    math.OC cs.LG stat.ML

    Riemannian coordinate descent algorithms on matrix manifolds

    Authors: Andi Han, Pratik Jawanpuria, Bamdev Mishra

    Abstract: Many machine learning applications are naturally formulated as optimization problems on Riemannian manifolds. The main idea behind Riemannian optimization is to maintain the feasibility of the variables while moving along a descent direction on the manifold. This results in updating all the variables at every iteration. In this work, we provide a general framework for develo** computationally ef… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  3. arXiv:2406.02214  [pdf, other

    cs.LG

    SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining

    Authors: Andi Han, Jiaxiang Li, Wei Huang, Mingyi Hong, Akiko Takeda, Pratik Jawanpuria, Bamdev Mishra

    Abstract: Large language models (LLMs) have shown impressive capabilities across various tasks. However, training LLMs from scratch requires significant computational power and extensive memory capacity. Recent studies have explored low-rank structures on weights for efficient fine-tuning in terms of parameters and memory, either through low-rank adaptation or factorization. While effective for fine-tuning,… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  4. arXiv:2404.10029  [pdf, ps, other

    math.OC cs.CR cs.LG

    Federated Learning on Riemannian Manifolds with Differential Privacy

    Authors: Zhenwei Huang, Wen Huang, Pratik Jawanpuria, Bamdev Mishra

    Abstract: In recent years, federated learning (FL) has emerged as a prominent paradigm in distributed machine learning. Despite the partial safeguarding of agents' information within FL systems, a malicious adversary can potentially infer sensitive information through various means. In this paper, we propose a generic private FL framework defined on Riemannian manifolds (PriRFed) based on the differential p… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    MSC Class: 68W15; 68P27; 90C30; 90C48

  5. arXiv:2404.07172  [pdf, other

    cs.LG math.NA math.OC

    A Gauss-Newton Approach for Min-Max Optimization in Generative Adversarial Networks

    Authors: Neel Mishra, Bamdev Mishra, Pratik Jawanpuria, Pawan Kumar

    Abstract: A novel first-order method is proposed for training generative adversarial networks (GANs). It modifies the Gauss-Newton method to approximate the min-max Hessian and uses the Sherman-Morrison inversion formula to calculate the inverse. The method corresponds to a fixed-point method that ensures necessary contraction. To evaluate its effectiveness, numerical experiments are conducted on various da… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: accepted in IJCNN 2023, 9 pages

  6. arXiv:2402.03883  [pdf, other

    math.OC cs.LG stat.ML

    A Framework for Bilevel Optimization on Riemannian Manifolds

    Authors: Andi Han, Bamdev Mishra, Pratik Jawanpuria, Akiko Takeda

    Abstract: Bilevel optimization has seen an increasing presence in various domains of applications. In this work, we propose a framework for solving bilevel optimization problems where variables of both lower and upper level problems are constrained on Riemannian manifolds. We provide several hypergradient estimation strategies on manifolds and study their estimation error. We provide convergence and complex… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  7. arXiv:2304.11045  [pdf, other

    cs.LG cs.AI cs.IR

    Light-weight Deep Extreme Multilabel Classification

    Authors: Istasis Mishra, Arpan Dasgupta, Pratik Jawanpuria, Bamdev Mishra, Pawan Kumar

    Abstract: Extreme multi-label (XML) classification refers to the task of supervised multi-label learning that involves a large number of labels. Hence, scalability of the classifier with increasing label dimension is an important consideration. In this paper, we develop a method called LightDXML which modifies the recently developed deep learning based XML framework by using label embeddings instead of feat… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 9 pages, 2 figures, 5 tables

  8. arXiv:2211.16801  [pdf, other

    cs.CL

    Generalised Spherical Text Embedding

    Authors: Souvik Banerjee, Bamdev Mishra, Pratik Jawanpuria, Manish Shrivastava

    Abstract: This paper aims to provide an unsupervised modelling approach that allows for a more flexible representation of text embeddings. It jointly encodes the words and the paragraphs as individual matrices of arbitrary column dimension with unit Frobenius norm. The representation is also linguistically motivated with the introduction of a novel similarity metric. The proposed modelling and the novel sim… ▽ More

    Submitted 30 November, 2022; originally announced November 2022.

    Comments: 6 pages

  9. arXiv:2210.04840  [pdf, other

    math.OC cs.LG cs.MS

    Rieoptax: Riemannian Optimization in JAX

    Authors: Saiteja Utpala, Andi Han, Pratik Jawanpuria, Bamdev Mishra

    Abstract: We present Rieoptax, an open source Python library for Riemannian optimization in JAX. We show that many differential geometric primitives, such as Riemannian exponential and logarithm maps, are usually faster in Rieoptax than existing frameworks in Python, both on CPU and GPU. We support various range of basic and advanced stochastic optimization solvers like Riemannian stochastic gradient, stoch… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

  10. arXiv:2210.01860  [pdf, other

    cs.LG cs.AI stat.ML

    ProtoBandit: Efficient Prototype Selection via Multi-Armed Bandits

    Authors: Arghya Roy Chaudhuri, Pratik Jawanpuria, Bamdev Mishra

    Abstract: In this work, we propose a multi-armed bandit-based framework for identifying a compact set of informative data instances (i.e., the prototypes) from a source dataset $S$ that best represents a given target set $T$. Prototypical examples of a given dataset offer interpretable insights into the underlying data distribution and assist in example-based reasoning, thereby influencing every sphere of h… ▽ More

    Submitted 23 August, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: Erratum corrected

  11. arXiv:2208.06619  [pdf, other

    math.OC cs.LG stat.ML

    Riemannian accelerated gradient methods via extrapolation

    Authors: Andi Han, Bamdev Mishra, Pratik Jawanpuria, Junbin Gao

    Abstract: In this paper, we propose a simple acceleration scheme for Riemannian gradient methods by extrapolating iterates on manifolds. We show when the iterates are generated from Riemannian gradient descent method, the accelerated scheme achieves the optimal convergence rate asymptotically and is computationally more favorable than the recently proposed Riemannian Nesterov accelerated gradient methods. O… ▽ More

    Submitted 13 August, 2022; originally announced August 2022.

  12. arXiv:2205.09494  [pdf, other

    math.OC cs.CR cs.LG stat.ML

    Differentially private Riemannian optimization

    Authors: Andi Han, Bamdev Mishra, Pratik Jawanpuria, Junbin Gao

    Abstract: In this paper, we study the differentially private empirical risk minimization problem where the parameter is constrained to a Riemannian manifold. We introduce a framework of differentially private Riemannian optimization by adding noise to the Riemannian gradient on the tangent space. The noise follows a Gaussian distribution intrinsically defined with respect to the Riemannian metric. We adapt… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

  13. arXiv:2204.11418  [pdf, other

    math.OC cs.LG stat.ML

    Riemannian Hamiltonian methods for min-max optimization on manifolds

    Authors: Andi Han, Bamdev Mishra, Pratik Jawanpuria, Pawan Kumar, Junbin Gao

    Abstract: In this paper, we study min-max optimization problems on Riemannian manifolds. We introduce a Riemannian Hamiltonian function, minimization of which serves as a proxy for solving the original min-max problems. Under the Riemannian Polyak--Łojasiewicz condition on the Hamiltonian function, its minimizer corresponds to the desired min-max saddle point. We also provide cases where this condition is s… ▽ More

    Submitted 24 August, 2023; v1 submitted 24 April, 2022; originally announced April 2022.

    Comments: Extended version with proofs

    Journal ref: SIAM Journal on Optimization, 33(3), pp.1797-1827, 2023

  14. arXiv:2204.04044  [pdf, other

    cs.CV

    Confidence Score for Unsupervised Foreground Background Separation of Document Images

    Authors: Soumyadeep Dey, Pratik Jawanpuria

    Abstract: Foreground-background separation is an important problem in document image analysis. Popular unsupervised binarization methods (such as the Sauvola's algorithm) employ adaptive thresholding to classify pixels as foreground or background. In this work, we propose a novel approach for computing confidence scores of the classification in such algorithms. This score provides an insight of the confiden… ▽ More

    Submitted 3 April, 2022; originally announced April 2022.

    Comments: Accepted in Document Analysis Systems (DAS 2022)

  15. arXiv:2201.12933  [pdf, other

    math.FA math.OC stat.ML

    Riemannian block SPD coupling manifold and its application to optimal transport

    Authors: Andi Han, Bamdev Mishra, Pratik Jawanpuria, Junbin Gao

    Abstract: In this work, we study the optimal transport (OT) problem between symmetric positive definite (SPD) matrix-valued measures. We formulate the above as a generalized optimal transport problem where the cost, the marginals, and the coupling are represented as block matrices and each component block is a SPD matrix. The summation of row blocks and column blocks in the coupling matrix are constrained b… ▽ More

    Submitted 8 February, 2023; v1 submitted 30 January, 2022; originally announced January 2022.

  16. arXiv:2110.10464  [pdf, other

    math.FA math.DG math.OC math.ST stat.ML

    Learning with symmetric positive definite matrices via generalized Bures-Wasserstein geometry

    Authors: Andi Han, Bamdev Mishra, Pratik Jawanpuria, Junbin Gao

    Abstract: Learning with symmetric positive definite (SPD) matrices has many applications in machine learning. Consequently, understanding the Riemannian geometry of SPD matrices has attracted much attention lately. A particular Riemannian geometry of interest is the recently proposed Bures-Wasserstein (BW) geometry which builds on the Wasserstein distance between the Gaussian densities. In this paper, we pr… ▽ More

    Submitted 8 June, 2023; v1 submitted 20 October, 2021; originally announced October 2021.

  17. arXiv:2106.00286  [pdf, other

    math.OC cs.LG

    On Riemannian Optimization over Positive Definite Matrices with the Bures-Wasserstein Geometry

    Authors: Andi Han, Bamdev Mishra, Pratik Jawanpuria, Junbin Gao

    Abstract: In this paper, we comparatively analyze the Bures-Wasserstein (BW) geometry with the popular Affine-Invariant (AI) geometry for Riemannian optimization on the symmetric positive definite (SPD) matrix manifold. Our study begins with an observation that the BW metric has a linear dependence on SPD matrices in contrast to the quadratic dependence of the AI metric. We build on this to show that the BW… ▽ More

    Submitted 1 June, 2021; originally announced June 2021.

  18. Light-weight Document Image Cleanup using Perceptual Loss

    Authors: Soumyadeep Dey, Pratik Jawanpuria

    Abstract: Smartphones have enabled effortless capturing and sharing of documents in digital form. The documents, however, often undergo various types of degradation due to aging, stains, or shortcoming of capturing environment such as shadow, non-uniform lighting, etc., which reduces the comprehensibility of the document images. In this work, we consider the problem of document image cleanup on embedded app… ▽ More

    Submitted 19 May, 2021; originally announced May 2021.

    Comments: Accepted in 16th International Conference on Document Analysis and Recognition 2021 (ICDAR 21)

  19. arXiv:2103.10159  [pdf, other

    cs.LG cs.AI stat.ML

    SPOT: A framework for selection of prototypes using optimal transport

    Authors: Karthik S. Gurumoorthy, Pratik Jawanpuria, Bamdev Mishra

    Abstract: In this work, we develop an optimal transport (OT) based framework to select informative prototypical examples that best represent a given target dataset. Summarizing a given target dataset via representative examples is an important problem in several machine learning applications where human understanding of the learning models and underlying data distribution is essential for decision making. W… ▽ More

    Submitted 5 April, 2021; v1 submitted 18 March, 2021; originally announced March 2021.

  20. arXiv:2103.00902  [pdf, other

    cs.LG math.OC

    Manifold optimization for non-linear optimal transport problems

    Authors: Bamdev Mishra, N T V Satyadev, Hiroyuki Kasai, Pratik Jawanpuria

    Abstract: Optimal transport (OT) has recently found widespread interest in machine learning. It allows to define novel distances between probability measures, which have shown promise in several applications. In this work, we discuss how to computationally approach general non-linear OT problems within the framework of Riemannian manifold optimization. The basis of this is the manifold of doubly stochastic… ▽ More

    Submitted 8 October, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: technical report, change is title, addition of experiments

  21. arXiv:2011.05001  [pdf, other

    cs.LG math.OC

    MMD-Regularized Unbalanced Optimal Transport

    Authors: Piyushi Manupriya, J. Saketha Nath, Pratik Jawanpuria

    Abstract: We study the unbalanced optimal transport (UOT) problem, where the marginal constraints are enforced using Maximum Mean Discrepancy (MMD) regularization. Our work is motivated by the observation that the literature on UOT is focused on regularization based on $φ$-divergence (e.g., KL divergence). Despite the popularity of MMD, its role as a regularizer in the context of UOT seems less understood.… ▽ More

    Submitted 27 January, 2024; v1 submitted 10 November, 2020; originally announced November 2020.

  22. arXiv:2010.11852  [pdf, other

    cs.LG math.OC

    Efficient Robust Optimal Transport with Application to Multi-Label Classification

    Authors: Pratik Jawanpuria, N T V Satyadev, Bamdev Mishra

    Abstract: Optimal transport (OT) is a powerful geometric tool for comparing two distributions and has been employed in various machine learning applications. In this work, we propose a novel OT formulation that takes feature correlations into account while learning the transport plan between two distributions. We model the feature-feature relationship via a symmetric positive semi-definite Mahalanobis metri… ▽ More

    Submitted 7 October, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: Accepted to IEEE CDC 2021

  23. arXiv:2004.09219  [pdf, ps, other

    cs.CL cs.LG stat.ML

    Learning Geometric Word Meta-Embeddings

    Authors: Pratik Jawanpuria, N T V Satya Dev, Anoop Kunchukuttan, Bamdev Mishra

    Abstract: We propose a geometric framework for learning meta-embeddings of words from different embedding sources. Our framework transforms the embeddings into a common latent space, where, for example, simple averaging of different embeddings (of a given word) is more amenable. The proposed latent space arises from two particular geometric transformations - the orthogonal rotations and the Mahalanobis metr… ▽ More

    Submitted 20 April, 2020; originally announced April 2020.

  24. arXiv:2004.08243  [pdf, ps, other

    cs.LG cs.CL stat.ML

    Geometry-aware Domain Adaptation for Unsupervised Alignment of Word Embeddings

    Authors: Pratik Jawanpuria, Mayank Meghwanshi, Bamdev Mishra

    Abstract: We propose a novel manifold based geometric approach for learning unsupervised alignment of word embeddings between the source and the target languages. Our approach formulates the alignment learning problem as a domain adaptation problem over the manifold of doubly stochastic matrices. This viewpoint arises from the aim to align the second order information of the two language spaces. The rich ge… ▽ More

    Submitted 20 April, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: Accepted as a short paper in ACL 2020

  25. arXiv:2004.05991  [pdf, ps, other

    cs.CL cs.LG stat.ML

    A Simple Approach to Learning Unsupervised Multilingual Embeddings

    Authors: Pratik Jawanpuria, Mayank Meghwanshi, Bamdev Mishra

    Abstract: Recent progress on unsupervised learning of cross-lingual embeddings in bilingual setting has given impetus to learning a shared embedding space for several languages without any supervision. A popular framework to solve the latter problem is to jointly solve the following two sub-problems: 1) learning unsupervised word alignment between several pairs of languages, and 2) learning how to map the m… ▽ More

    Submitted 20 April, 2020; v1 submitted 10 April, 2020; originally announced April 2020.

  26. arXiv:2002.03179  [pdf, other

    cs.LG stat.ML

    Statistical Optimal Transport posed as Learning Kernel Embedding

    Authors: J. Saketha Nath, Pratik Jawanpuria

    Abstract: The objective in statistical Optimal Transport (OT) is to consistently estimate the optimal transport plan/map solely using samples from the given source and target marginal distributions. This work takes the novel approach of posing statistical OT as that of learning the transport plan's kernel mean embedding from sample based estimates of marginal embeddings. The proposed estimator controls over… ▽ More

    Submitted 10 November, 2020; v1 submitted 8 February, 2020; originally announced February 2020.

  27. arXiv:1906.10436  [pdf, other

    math.OC cs.LG

    Riemannian optimization on the simplex of positive definite matrices

    Authors: Bamdev Mishra, Hiroyuki Kasai, Pratik Jawanpuria

    Abstract: In this work, we generalize the probability simplex constraint to matrices, i.e., $\mathbf{X}_1 + \mathbf{X}_2 + \ldots + \mathbf{X}_K = \mathbf{I}$, where $\mathbf{X}_i \succeq 0$ is a symmetric positive semidefinite matrix of size $n\times n$ for all $i = \{1,\ldots,K \}$. By assuming positive definiteness of the matrices, we show that the constraint set arising from the matrix simplex has the s… ▽ More

    Submitted 17 November, 2020; v1 submitted 25 June, 2019; originally announced June 2019.

    Comments: 12th OPT Workshop on Optimization for Machine Learning at NeurIPS 2020

  28. arXiv:1903.07307  [pdf, other

    cs.LG math.OC stat.ML

    Low-rank approximations of hyperbolic embeddings

    Authors: Pratik Jawanpuria, Mayank Meghwanshi, Bamdev Mishra

    Abstract: The hyperbolic manifold is a smooth manifold of negative constant curvature. While the hyperbolic manifold is well-studied in the literature, it has gained interest in the machine learning and natural language processing communities lately due to its usefulness in modeling continuous hierarchies. Tasks with hierarchical structures are ubiquitous in those fields and there is a general interest to l… ▽ More

    Submitted 18 March, 2019; originally announced March 2019.

    Comments: Technical report

  29. arXiv:1902.01144  [pdf, ps, other

    cs.LG math.OC stat.ML

    Riemannian adaptive stochastic gradient algorithms on matrix manifolds

    Authors: Hiroyuki Kasai, Pratik Jawanpuria, Bamdev Mishra

    Abstract: Adaptive stochastic gradient algorithms in the Euclidean space have attracted much attention lately. Such explorations on Riemannian manifolds, on the other hand, are relatively new, limited, and challenging. This is because of the intrinsic non-linear structure of the underlying manifold and the absence of a canonical coordinate system. In machine learning applications, however, most manifolds of… ▽ More

    Submitted 28 June, 2019; v1 submitted 4 February, 2019; originally announced February 2019.

    Comments: In International Conference on Machine Learning (ICML), PMLR 97:3262-3271, 2019

  30. arXiv:1810.01811  [pdf, ps, other

    stat.ML cs.AI cs.LG

    McTorch, a manifold optimization library for deep learning

    Authors: Mayank Meghwanshi, Pratik Jawanpuria, Anoop Kunchukuttan, Hiroyuki Kasai, Bamdev Mishra

    Abstract: In this paper, we introduce McTorch, a manifold optimization library for deep learning that extends PyTorch. It aims to lower the barrier for users wishing to use manifold constraints in deep learning applications, i.e., when the parameters are constrained to lie on a manifold. Such constraints include the popular orthogonality and rank constraints, and have been recently used in a number of appli… ▽ More

    Submitted 4 October, 2018; v1 submitted 3 October, 2018; originally announced October 2018.

  31. arXiv:1808.08773  [pdf, ps, other

    cs.LG cs.AI cs.CL stat.ML

    Learning Multilingual Word Embeddings in Latent Metric Space: A Geometric Approach

    Authors: Pratik Jawanpuria, Arjun Balgovind, Anoop Kunchukuttan, Bamdev Mishra

    Abstract: We propose a novel geometric approach for learning bilingual map**s given monolingual embeddings and a bilingual dictionary. Our approach decouples learning the transformation from the source language to the target language into (a) learning rotations for language-specific embeddings to align them to a common space, and (b) learning a similarity metric in the common space to model similarities b… ▽ More

    Submitted 18 December, 2018; v1 submitted 27 August, 2018; originally announced August 2018.

    Comments: Accepted in Transactions of the Association for Computational Linguistics

  32. arXiv:1806.05454  [pdf, other

    cs.LG stat.ML

    Low-rank geometric mean metric learning

    Authors: Mukul Bhutani, Pratik Jawanpuria, Hiroyuki Kasai, Bamdev Mishra

    Abstract: We propose a low-rank approach to learning a Mahalanobis metric from data. Inspired by the recent geometric mean metric learning (GMML) algorithm, we propose a low-rank variant of the algorithm. This allows to jointly learn a low-dimensional subspace where the data reside and the Mahalanobis metric that appropriately fits the data. Our results show that we compete effectively with GMML at lower ra… ▽ More

    Submitted 14 June, 2018; originally announced June 2018.

    Comments: Accepted to the geometry in machine learning (GiMLi) workshop at ICML 2018

  33. arXiv:1712.01193  [pdf, other

    cs.LG stat.ML

    A dual framework for low-rank tensor completion

    Authors: Madhav Nimishakavi, Pratik Jawanpuria, Bamdev Mishra

    Abstract: One of the popular approaches for low-rank tensor completion is to use the latent trace norm regularization. However, most existing works in this direction learn a sparse combination of tensors. In this work, we fill this gap by proposing a variant of the latent trace norm that helps in learning a non-sparse combination of tensors. We develop a dual framework for solving the low-rank tensor comple… ▽ More

    Submitted 10 November, 2018; v1 submitted 4 December, 2017; originally announced December 2017.

    Comments: Aceepted to appear in Advances of Nueral Information Processing Systems (NIPS), 2018. A shorter version appeared in the NIPS workshop on Synergies in Geometric Data Analysis 2017

  34. arXiv:1705.00467  [pdf, other

    cs.LG math.OC

    A Riemannian gossip approach to subspace learning on Grassmann manifold

    Authors: Bamdev Mishra, Hiroyuki Kasai, Pratik Jawanpuria, Atul Saroop

    Abstract: In this paper, we focus on subspace learning problems on the Grassmann manifold. Interesting applications in this setting include low-rank matrix completion and low-dimensional multivariate regression, among others. Motivated by privacy concerns, we aim to solve such problems in a decentralized setting where multiple agents have access to (and solve) only a part of the whole optimization problem.… ▽ More

    Submitted 28 February, 2018; v1 submitted 1 May, 2017; originally announced May 2017.

    Comments: Title change, extension of the technical report arXiv:1605.06968

  35. arXiv:1704.07352  [pdf, other

    stat.ML cs.LG

    Structured low-rank matrix learning: algorithms and applications

    Authors: Pratik Jawanpuria, Bamdev Mishra

    Abstract: We consider the problem of learning a low-rank matrix, constrained to lie in a linear subspace, and introduce a novel factorization for modeling such matrices. A salient feature of the proposed factorization scheme is it decouples the low-rank and the structural constraints onto separate factors. We formulate the optimization problem on the Riemannian spectrahedron manifold, where the Riemannian f… ▽ More

    Submitted 15 June, 2018; v1 submitted 24 April, 2017; originally announced April 2017.

    Comments: Accepted in ICML'18

  36. arXiv:1511.05706  [pdf, other

    stat.ML cs.LG

    Efficient Output Kernel Learning for Multiple Tasks

    Authors: Pratik Jawanpuria, Maksim Lapin, Matthias Hein, Bernt Schiele

    Abstract: The paradigm of multi-task learning is that one can achieve better generalization by learning tasks jointly and thus exploiting the similarity between the tasks rather than learning them independently of each other. While previously the relationship between tasks had to be user-defined in the form of an output kernel, recent approaches jointly learn the tasks and the output kernel. As the output k… ▽ More

    Submitted 18 November, 2015; originally announced November 2015.

  37. arXiv:1206.4611  [pdf

    cs.LG stat.ML

    A Convex Feature Learning Formulation for Latent Task Structure Discovery

    Authors: Pratik Jawanpuria, J. Saketha Nath

    Abstract: This paper considers the multi-task learning problem and in the setting where some relevant features could be shared across few related tasks. Most of the existing methods assume the extent to which the given tasks are related or share a common feature space to be known apriori. In real-world applications however, it is desirable to automatically discover the groups of related tasks that share a f… ▽ More

    Submitted 18 June, 2012; originally announced June 2012.

    Comments: ICML2012