Skip to main content

Showing 1–9 of 9 results for author: Yun, H

Searching in archive stat. Search in all archives.
.
  1. arXiv:2311.07465  [pdf, other

    math.FA math.ST stat.ML

    Computerized Tomography and Reproducing Kernels

    Authors: Ho Yun, Victor M. Panaretos

    Abstract: The X-ray transform is one of the most fundamental integral operators in image processing and reconstruction. In this article, we revisit the formalism of the X-ray transform by considering it as an operator between Reproducing Kernel Hilbert Spaces (RKHS). Within this framework, the X-ray transform can be viewed as a natural analogue of Euclidean projection. The RKHS framework considerably simpli… ▽ More

    Submitted 24 June, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: 41 pages, 8 figures

    MSC Class: 44A12 (Primary); 46E22 (Secondary)

  2. arXiv:2209.04378  [pdf, other

    cs.IR cs.CL cs.LG stat.ML

    MICO: Selective Search with Mutual Information Co-training

    Authors: Zhanyu Wang, Xiao Zhang, Hyokun Yun, Choon Hui Teo, Trishul Chilimbi

    Abstract: In contrast to traditional exhaustive search, selective search first clusters documents into several groups before all the documents are searched exhaustively by a query, to limit the search executed within one group or only a few groups. Selective search is designed to reduce the latency and computation in modern large-scale search systems. In this study, we propose MICO, a Mutual Information CO-… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

    Journal ref: Proceedings of the 29th International Conference on Computational Linguistics (COLING). 2022

  3. arXiv:1604.04706  [pdf, other

    cs.LG stat.ML

    DS-MLR: Exploiting Double Separability for Scaling up Distributed Multinomial Logistic Regression

    Authors: Parameswaran Raman, Sriram Srinivasan, Shin Matsushima, Xinhua Zhang, Hyokun Yun, S. V. N. Vishwanathan

    Abstract: Scaling multinomial logistic regression to datasets with very large number of data points and classes is challenging. This is primarily because one needs to compute the log-partition function on every data point. This makes distributing the computation hard. In this paper, we present a distributed stochastic gradient descent based optimization method (DS-MLR) for scaling up multinomial logistic re… ▽ More

    Submitted 3 August, 2018; v1 submitted 16 April, 2016; originally announced April 2016.

  4. arXiv:1506.02761  [pdf, other

    cs.CL cs.LG stat.ML

    WordRank: Learning Word Embeddings via Robust Ranking

    Authors: Shihao Ji, Hyokun Yun, Pinar Yanardag, Shin Matsushima, S. V. N. Vishwanathan

    Abstract: Embedding words in a vector space has gained a lot of attention in recent years. While state-of-the-art methods provide efficient computation of word similarities via a low-dimensional matrix embedding, their motivation is often left unclear. In this paper, we argue that word embedding can be naturally viewed as a ranking problem due to the ranking nature of the evaluation metrics. Then, based on… ▽ More

    Submitted 27 September, 2016; v1 submitted 8 June, 2015; originally announced June 2015.

    Comments: Conference on Empirical Methods in Natural Language Processing (EMNLP), November 1-5, 2016, Austin, Texas, USA

  5. Distributed Stochastic Optimization of the Regularized Risk

    Authors: Shin Matsushima, Hyokun Yun, Xinhua Zhang, S. V. N. Vishwanathan

    Abstract: Many machine learning algorithms minimize a regularized risk, and stochastic optimization is widely used for this task. When working with massive data, it is desirable to perform stochastic optimization in parallel. Unfortunately, many existing stochastic optimization algorithms cannot be parallelized efficiently. In this paper we show that one can rewrite the regularized risk minimization problem… ▽ More

    Submitted 9 June, 2015; v1 submitted 17 June, 2014; originally announced June 2014.

    Journal ref: ECML PKDD 2017: Machine Learning and Knowledge Discovery in Databases pp 460-476

  6. arXiv:1402.2676  [pdf, other

    stat.ML cs.DC cs.LG stat.CO

    Ranking via Robust Binary Classification and Parallel Parameter Estimation in Large-Scale Data

    Authors: Hyokun Yun, Parameswaran Raman, S. V. N. Vishwanathan

    Abstract: We propose RoBiRank, a ranking algorithm that is motivated by observing a close connection between evaluation metrics for learning to rank and loss functions for robust classification. The algorithm shows a very competitive performance on standard benchmark datasets against other representative algorithms in the literature. On the other hand, in large scale problems where explicit feature vectors… ▽ More

    Submitted 21 August, 2014; v1 submitted 11 February, 2014; originally announced February 2014.

  7. arXiv:1202.6001  [pdf, ps, other

    stat.ML cs.LG

    Efficiently Sampling Multiplicative Attribute Graphs Using a Ball-Drop** Process

    Authors: Hyokun Yun, S. V. N. Vishwanathan

    Abstract: We introduce a novel and efficient sampling algorithm for the Multiplicative Attribute Graph Model (MAGM - Kim and Leskovec (2010)}). Our algorithm is \emph{strictly} more efficient than the algorithm proposed by Yun and Vishwanathan (2012), in the sense that our method extends the \emph{best} time complexity guarantee of their algorithm to a larger fraction of parameter space. Both in theory and… ▽ More

    Submitted 27 February, 2012; v1 submitted 27 February, 2012; originally announced February 2012.

  8. arXiv:1110.5383  [pdf, ps, other

    stat.ML cs.LG stat.CO

    Quilting Stochastic Kronecker Product Graphs to Generate Multiplicative Attribute Graphs

    Authors: Hyokun Yun, S. V. N. Vishwanathan

    Abstract: We describe the first sub-quadratic sampling algorithm for the Multiplicative Attribute Graph Model (MAGM) of Kim and Leskovec (2010). We exploit the close connection between MAGM and the Kronecker Product Graph Model (KPGM) of Leskovec et al. (2010), and show that to sample a graph from a MAGM it suffices to sample small number of KPGM graphs and \emph{quilt} them together. Under a restricted set… ▽ More

    Submitted 9 February, 2012; v1 submitted 24 October, 2011; originally announced October 2011.

  9. arXiv:1105.0755  [pdf, other

    stat.AP cs.MM

    Using Logistic Regression to Analyze the Balance of a Game: The Case of StarCraft II

    Authors: Hyokun Yun

    Abstract: Recently, the market size of online game has been increasing astonishingly fast, and so does the importance of good game design. In online games, usually a human user competes with others, so the fairness of the game system to all users is of great importance not to lose interests of users on the game. Furthermore, the emergence and success of electronic sports (e-sports) and professional gaming w… ▽ More

    Submitted 4 May, 2011; originally announced May 2011.