Skip to main content

Showing 1–13 of 13 results for author: Vishwanathan, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2004.13940  [pdf, other

    cs.LG stat.ML

    DS-FACTO: Doubly Separable Factorization Machines

    Authors: Parameswaran Raman, S. V. N. Vishwanathan

    Abstract: Factorization Machines (FM) are powerful class of models that incorporate higher-order interaction among features to add more expressive power to linear models. They have been used successfully in several real-world tasks such as click-prediction, ranking and recommender systems. Despite using a low-rank representation for the pairwise features, the memory overheads of using factorization machines… ▽ More

    Submitted 28 April, 2020; originally announced April 2020.

  2. An Efficient Bandit Algorithm for Realtime Multivariate Optimization

    Authors: Daniel N Hill, Houssam Nassif, Yi Liu, Anand Iyer, S V N Vishwanathan

    Abstract: Optimization is commonly employed to determine the content of web pages, such as to maximize conversions on landing pages or click-through rates on search engine result pages. Often the layout of these pages can be decoupled into several separate decisions. For example, the composition of a landing page may involve deciding which image to show, which wording to use, what color background to displa… ▽ More

    Submitted 22 October, 2018; originally announced October 2018.

    Comments: KDD'17 Audience Appreciation Award

    Journal ref: Daniel N. Hill, Houssam Nassif, Yi Liu, Anand Iyer, and S. V. N. Vishwanathan. 2017. An Efficient Bandit Algorithm for Realtime Multivariate Optimization. In Proceedings of KDD'17, Halifax, NS, Canada, pp. 1813-1821, 2017

  3. arXiv:1810.01477  [pdf, other

    cs.IR cs.LG stat.ML

    Adaptive, Personalized Diversity for Visual Discovery

    Authors: Choon Hui Teo, Houssam Nassif, Daniel Hill, Sriram Srinavasan, Mitchell Goodman, Vijai Mohan, SVN Vishwanathan

    Abstract: Search queries are appropriate when users have explicit intent, but they perform poorly when the intent is difficult to express or if the user is simply looking to be inspired. Visual browsing systems allow e-commerce platforms to address these scenarios while offering the user an engaging shop** experience. Here we explore extensions in the direction of adaptive personalization and item diversi… ▽ More

    Submitted 2 October, 2018; originally announced October 2018.

    Comments: Best Paper Award

    Journal ref: Adaptive, Personalized Diversity for Visual Discovery. Teo CH, Nassif H, Hill D, Srinavasan S, Goodman M, Mohan V, and Vishwanathan SVN. ACM Conference on Recommender Systems (RecSys'16), Boston, pp. 35-38, 2016

  4. arXiv:1605.09499  [pdf, other

    stat.ML

    Extreme Stochastic Variational Inference: Distributed and Asynchronous

    Authors: Jiong Zhang, Parameswaran Raman, Shihao Ji, Hsiang-Fu Yu, S. V. N. Vishwanathan, Inderjit S. Dhillon

    Abstract: Stochastic variational inference (SVI), the state-of-the-art algorithm for scaling variational inference to large-datasets, is inherently serial. Moreover, it requires the parameters to fit in the memory of a single processor; this is problematic when the number of parameters is in billions. In this paper, we propose extreme stochastic variational inference (ESVI), an asynchronous and lock-free al… ▽ More

    Submitted 3 August, 2018; v1 submitted 31 May, 2016; originally announced May 2016.

  5. arXiv:1604.04706  [pdf, other

    cs.LG stat.ML

    DS-MLR: Exploiting Double Separability for Scaling up Distributed Multinomial Logistic Regression

    Authors: Parameswaran Raman, Sriram Srinivasan, Shin Matsushima, Xinhua Zhang, Hyokun Yun, S. V. N. Vishwanathan

    Abstract: Scaling multinomial logistic regression to datasets with very large number of data points and classes is challenging. This is primarily because one needs to compute the log-partition function on every data point. This makes distributing the computation hard. In this paper, we present a distributed stochastic gradient descent based optimization method (DS-MLR) for scaling up multinomial logistic re… ▽ More

    Submitted 3 August, 2018; v1 submitted 16 April, 2016; originally announced April 2016.

  6. arXiv:1511.06909  [pdf, other

    cs.LG cs.CL cs.NE stat.ML

    BlackOut: Speeding up Recurrent Neural Network Language Models With Very Large Vocabularies

    Authors: Shihao Ji, S. V. N. Vishwanathan, Nadathur Satish, Michael J. Anderson, Pradeep Dubey

    Abstract: We propose BlackOut, an approximation algorithm to efficiently train massive recurrent neural network language models (RNNLMs) with million word vocabularies. BlackOut is motivated by using a discriminative loss, and we describe a new sampling strategy which significantly reduces computation while improving stability, sample efficiency, and rate of convergence. One way to understand BlackOut is to… ▽ More

    Submitted 31 March, 2016; v1 submitted 21 November, 2015; originally announced November 2015.

    Comments: Published as a conference paper at ICLR 2016

  7. arXiv:1506.02761  [pdf, other

    cs.CL cs.LG stat.ML

    WordRank: Learning Word Embeddings via Robust Ranking

    Authors: Shihao Ji, Hyokun Yun, Pinar Yanardag, Shin Matsushima, S. V. N. Vishwanathan

    Abstract: Embedding words in a vector space has gained a lot of attention in recent years. While state-of-the-art methods provide efficient computation of word similarities via a low-dimensional matrix embedding, their motivation is often left unclear. In this paper, we argue that word embedding can be naturally viewed as a ranking problem due to the ranking nature of the evaluation metrics. Then, based on… ▽ More

    Submitted 27 September, 2016; v1 submitted 8 June, 2015; originally announced June 2015.

    Comments: Conference on Empirical Methods in Natural Language Processing (EMNLP), November 1-5, 2016, Austin, Texas, USA

  8. arXiv:1406.4519  [pdf, other

    stat.ML

    DFacTo: Distributed Factorization of Tensors

    Authors: Joon Hee Choi, S. V. N. Vishwanathan

    Abstract: We present a technique for significantly speeding up Alternating Least Squares (ALS) and Gradient Descent (GD), two widely used algorithms for tensor factorization. By exploiting properties of the Khatri-Rao product, we show how to efficiently address a computationally challenging sub-step of both algorithms. Our algorithm, DFacTo, only requires two sparse matrix-vector products and is easy to par… ▽ More

    Submitted 17 June, 2014; originally announced June 2014.

    Comments: Under review for NIPS 2014

  9. Distributed Stochastic Optimization of the Regularized Risk

    Authors: Shin Matsushima, Hyokun Yun, Xinhua Zhang, S. V. N. Vishwanathan

    Abstract: Many machine learning algorithms minimize a regularized risk, and stochastic optimization is widely used for this task. When working with massive data, it is desirable to perform stochastic optimization in parallel. Unfortunately, many existing stochastic optimization algorithms cannot be parallelized efficiently. In this paper we show that one can rewrite the regularized risk minimization problem… ▽ More

    Submitted 9 June, 2015; v1 submitted 17 June, 2014; originally announced June 2014.

    Journal ref: ECML PKDD 2017: Machine Learning and Knowledge Discovery in Databases pp 460-476

  10. arXiv:1402.2676  [pdf, other

    stat.ML cs.DC cs.LG stat.CO

    Ranking via Robust Binary Classification and Parallel Parameter Estimation in Large-Scale Data

    Authors: Hyokun Yun, Parameswaran Raman, S. V. N. Vishwanathan

    Abstract: We propose RoBiRank, a ranking algorithm that is motivated by observing a close connection between evaluation metrics for learning to rank and loss functions for robust classification. The algorithm shows a very competitive performance on standard benchmark datasets against other representative algorithms in the literature. On the other hand, in large scale problems where explicit feature vectors… ▽ More

    Submitted 21 August, 2014; v1 submitted 11 February, 2014; originally announced February 2014.

  11. arXiv:1202.6001  [pdf, ps, other

    stat.ML cs.LG

    Efficiently Sampling Multiplicative Attribute Graphs Using a Ball-Drop** Process

    Authors: Hyokun Yun, S. V. N. Vishwanathan

    Abstract: We introduce a novel and efficient sampling algorithm for the Multiplicative Attribute Graph Model (MAGM - Kim and Leskovec (2010)}). Our algorithm is \emph{strictly} more efficient than the algorithm proposed by Yun and Vishwanathan (2012), in the sense that our method extends the \emph{best} time complexity guarantee of their algorithm to a larger fraction of parameter space. Both in theory and… ▽ More

    Submitted 27 February, 2012; v1 submitted 27 February, 2012; originally announced February 2012.

  12. arXiv:1110.5383  [pdf, ps, other

    stat.ML cs.LG stat.CO

    Quilting Stochastic Kronecker Product Graphs to Generate Multiplicative Attribute Graphs

    Authors: Hyokun Yun, S. V. N. Vishwanathan

    Abstract: We describe the first sub-quadratic sampling algorithm for the Multiplicative Attribute Graph Model (MAGM) of Kim and Leskovec (2010). We exploit the close connection between MAGM and the Kronecker Product Graph Model (KPGM) of Leskovec et al. (2010), and show that to sample a graph from a MAGM it suffices to sample small number of KPGM graphs and \emph{quilt} them together. Under a restricted set… ▽ More

    Submitted 9 February, 2012; v1 submitted 24 October, 2011; originally announced October 2011.

  13. arXiv:0804.3835  [pdf, ps, other

    stat.ML math.OC

    A Quasi-Newton Approach to Nonsmooth Convex Optimization Problems in Machine Learning

    Authors: ** Yu, S. V. N. Vishwanathan, Simon Guenter, Nicol N. Schraudolph

    Abstract: We extend the well-known BFGS quasi-Newton method and its memory-limited variant LBFGS to the optimization of nonsmooth convex objectives. This is done in a rigorous fashion by generalizing three components of BFGS to subdifferentials: the local quadratic model, the identification of a descent direction, and the Wolfe line search conditions. We prove that under some technical conditions, the res… ▽ More

    Submitted 22 February, 2010; v1 submitted 24 April, 2008; originally announced April 2008.

    Journal ref: Journal of Machine Learning Research 11(Mar):1145-1200, 2010