Skip to main content

Showing 1–24 of 24 results for author: Saito, Y

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.14522  [pdf, other

    cs.LG cs.AI cs.CL cs.CV stat.ML

    Explaining Black-box Model Predictions via Two-level Nested Feature Attributions with Consistency Property

    Authors: Yuya Yoshikawa, Masanari Kimura, Ryotaro Shimizu, Yuki Saito

    Abstract: Techniques that explain the predictions of black-box machine learning models are crucial to make the models transparent, thereby increasing trust in AI systems. The input features to the models often have a nested structure that consists of high- and low-level features, and each high-level feature is decomposed into multiple low-level features. For such inputs, both high-level feature attributions… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  2. arXiv:2404.15691  [pdf, other

    cs.LG stat.ML

    Long-term Off-Policy Evaluation and Learning

    Authors: Yuta Saito, Himan Abdollahpouri, Jesse Anderton, Ben Carterette, Mounia Lalmas

    Abstract: Short- and long-term outcomes of an algorithm often differ, with damaging downstream effects. A known example is a click-bait algorithm, which may increase short-term clicks but damage long-term user engagement. A possible solution to estimate the long-term outcome is to run an online experiment or A/B test for the potential algorithms, but it takes months or even longer to observe the long-term o… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: TheWebConference 2024

  3. arXiv:2403.17410  [pdf, other

    cs.LG cs.AI stat.ML

    On permutation-invariant neural networks

    Authors: Masanari Kimura, Ryotaro Shimizu, Yuki Hirakawa, Ryosuke Goto, Yuki Saito

    Abstract: Conventional machine learning algorithms have traditionally been designed under the assumption that input data follows a vector-based format, with an emphasis on vector-centric paradigms. However, as the demand for tasks involving set-based inputs has grown, there has been a paradigm shift in the research community towards addressing these challenges. In recent years, the emergence of neural netwo… ▽ More

    Submitted 28 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  4. arXiv:2402.06151  [pdf, other

    stat.ML cs.LG

    POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition

    Authors: Yuta Saito, Jihan Yao, Thorsten Joachims

    Abstract: We study off-policy learning (OPL) of contextual bandit policies in large discrete action spaces where existing methods -- most of which rely crucially on reward-regression models or importance-weighted policy gradients -- fail due to excessive bias or variance. To overcome these issues in OPL, we propose a novel two-stage algorithm, called Policy Optimization via Two-Stage Policy Decomposition (P… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:2305.08062

  5. Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction

    Authors: Haruka Kiyohara, Masahiro Nomura, Yuta Saito

    Abstract: We study off-policy evaluation (OPE) in the problem of slate contextual bandits where a policy selects multi-dimensional actions known as slates. This problem is widespread in recommender systems, search engines, marketing, to medical applications, however, the typical Inverse Propensity Scoring (IPS) estimator suffers from substantial variance due to large action spaces, making effective OPE a si… ▽ More

    Submitted 17 February, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

    Comments: WWW2024

  6. arXiv:2310.14890  [pdf, other

    stat.ML cs.AI cs.LG

    Boosting for Bounding the Worst-class Error

    Authors: Yuya Saito, Shinnosuke Matsuo, Seiichi Uchida, Daiki Suehiro

    Abstract: This paper tackles the problem of the worst-class error rate, instead of the standard error rate averaged over all classes. For example, a three-class classification task with class-wise error rates of 10\%, 10\%, and 40\% has a worst-class error rate of 40\%, whereas the average is 20\% under the class-balanced condition. The worst-class error is important in many applications. For example, in a… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  7. arXiv:2306.15098  [pdf, other

    stat.ML cs.IR cs.LG

    Off-Policy Evaluation of Ranking Policies under Diverse User Behavior

    Authors: Haruka Kiyohara, Masatoshi Uehara, Yusuke Narita, Nobuyuki Shimizu, Yasuo Yamamoto, Yuta Saito

    Abstract: Ranking interfaces are everywhere in online platforms. There is thus an ever growing interest in their Off-Policy Evaluation (OPE), aiming towards an accurate performance evaluation of ranking policies using logged data. A de-facto approach for OPE is Inverse Propensity Scoring (IPS), which provides an unbiased and consistent value estimate. However, it becomes extremely inaccurate in the ranking… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: KDD2023 Research track

  8. arXiv:2306.10656  [pdf, other

    cs.LG cs.AI stat.ML

    Virtual Human Generative Model: Masked Modeling Approach for Learning Human Characteristics

    Authors: Kenta Oono, Nontawat Charoenphakdee, Kotatsu Bito, Zhengyan Gao, Yoshiaki Ota, Shoichiro Yamaguchi, Yohei Sugawara, Shin-ichi Maeda, Kunihiko Miyoshi, Yuki Saito, Koki Tsuda, Hiroshi Maruyama, Kohei Hayashi

    Abstract: Identifying the relationship between healthcare attributes, lifestyles, and personality is vital for understanding and improving physical and mental conditions. Machine learning approaches are promising for modeling their relationships and offering actionable suggestions. In this paper, we propose Virtual Human Generative Model (VHGM), a machine learning model for estimating attributes about healt… ▽ More

    Submitted 14 August, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

    Comments: 14 pages, 4 figures

  9. arXiv:2305.08062  [pdf, other

    stat.ML cs.AI cs.LG

    Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling

    Authors: Yuta Saito, Qingyang Ren, Thorsten Joachims

    Abstract: We study off-policy evaluation (OPE) of contextual bandit policies for large discrete action spaces where conventional importance-weighting approaches suffer from excessive variance. To circumvent this variance issue, we propose a new estimator, called OffCEM, that is based on the conjunct effect model (CEM), a novel decomposition of the causal effect into a cluster effect and a residual effect. O… ▽ More

    Submitted 2 June, 2023; v1 submitted 14 May, 2023; originally announced May 2023.

    Comments: accepted at ICML2023. arXiv admin note: text overlap with arXiv:2202.06317

  10. arXiv:2202.06317  [pdf, other

    cs.LG cs.AI stat.ML

    Off-Policy Evaluation for Large Action Spaces via Embeddings

    Authors: Yuta Saito, Thorsten Joachims

    Abstract: Off-policy evaluation (OPE) in contextual bandits has seen rapid adoption in real-world systems, since it enables offline evaluation of new policies using only historic log data. Unfortunately, when the number of actions is large, existing OPE estimators -- most of which are based on inverse propensity score weighting -- degrade severely and can suffer from extreme bias and variance. This foils th… ▽ More

    Submitted 15 June, 2022; v1 submitted 13 February, 2022; originally announced February 2022.

    Comments: accepted at ICML2022

  11. Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model

    Authors: Haruka Kiyohara, Yuta Saito, Tatsuya Matsuhiro, Yusuke Narita, Nobuyuki Shimizu, Yasuo Yamamoto

    Abstract: In real-world recommender systems and search engines, optimizing ranking decisions to present a ranked list of relevant items is critical. Off-policy evaluation (OPE) for ranking policies is thus gaining a growing interest because it enables performance estimation of new ranking policies using only logged data. Although OPE in contextual bandits has been studied extensively, its naive application… ▽ More

    Submitted 3 February, 2022; originally announced February 2022.

    Comments: WSDM2022

  12. arXiv:2108.13703  [pdf, other

    stat.ML cs.AI cs.LG

    Evaluating the Robustness of Off-Policy Evaluation

    Authors: Yuta Saito, Takuma Udagawa, Haruka Kiyohara, Kazuki Mogi, Yusuke Narita, Kei Tateno

    Abstract: Off-policy Evaluation (OPE), or offline evaluation in general, evaluates the performance of hypothetical policies leveraging only offline log data. It is particularly useful in applications where the online interaction involves high stakes and expensive setting such as precision medicine and recommender systems. Since many OPE estimators have been proposed and some of them have hyperparameters to… ▽ More

    Submitted 31 August, 2021; originally announced August 2021.

    Comments: Accepted at RecSys2021

  13. arXiv:2010.11002  [pdf, other

    cs.LG stat.ME stat.ML

    Optimal Off-Policy Evaluation from Multiple Logging Policies

    Authors: Nathan Kallus, Yuta Saito, Masatoshi Uehara

    Abstract: We study off-policy evaluation (OPE) from multiple logging policies, each generating a dataset of fixed size, i.e., stratified sampling. Previous work noted that in this setting the ordering of the variances of different importance sampling estimators is instance-dependent, which brings up a dilemma as to which importance sampling weights to use. In this paper, we resolve this dilemma by finding t… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

    Comments: Under Review

  14. arXiv:2008.07146  [pdf, other

    cs.LG stat.ML

    Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation

    Authors: Yuta Saito, Shunsuke Aihara, Megumi Matsutani, Yusuke Narita

    Abstract: Off-policy evaluation (OPE) aims to estimate the performance of hypothetical policies using data generated by a different policy. Because of its huge potential impact in practice, there has been growing research interest in this field. There is, however, no real-world public dataset that enables the evaluation of OPE, making its experimental studies unrealistic and irreproducible. With the goal of… ▽ More

    Submitted 26 October, 2021; v1 submitted 17 August, 2020; originally announced August 2020.

    Comments: Accepted at NeurIPS2021 Datasets and Benchmarks Track

  15. arXiv:2007.04002  [pdf, other

    cs.LG cs.IR stat.ML

    Unbiased Lift-based Bidding System

    Authors: Daisuke Moriwaki, Yuta Hayakawa, Isshu Munemasa, Yuta Saito, Akira Matsui

    Abstract: Conventional bidding strategies for online display ad auction heavily relies on observed performance indicators such as clicks or conversions. A bidding strategy naively pursuing these easily observable metrics, however, fails to optimize the profitability of the advertisers. Rather, the bidding strategy that leads to the maximum revenue is a strategy pursuing the performance lift of showing ads t… ▽ More

    Submitted 8 July, 2020; v1 submitted 8 July, 2020; originally announced July 2020.

  16. Efficient Hyperparameter Optimization under Multi-Source Covariate Shift

    Authors: Masahiro Nomura, Yuta Saito

    Abstract: A typical assumption in supervised machine learning is that the train (source) and test (target) datasets follow completely the same distribution. This assumption is, however, often violated in uncertain real-world applications, which motivates the study of learning under covariate shift. In this setting, the naive use of adaptive hyperparameter optimization methods such as Bayesian optimization d… ▽ More

    Submitted 16 August, 2021; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: equal contribution

  17. Exchangeable deep neural networks for set-to-set matching and learning

    Authors: Yuki Saito, Takuma Nakamura, Hirotaka Hachiya, Kenji Fukumizu

    Abstract: Matching two different sets of items, called heterogeneous set-to-set matching problem, has recently received attention as a promising problem. The difficulties are to extract features to match a correct pair of different sets and also preserve two types of exchangeability required for set-to-set matching: the pair of sets, as well as the items in each set, should be exchangeable. In this study, w… ▽ More

    Submitted 28 January, 2021; v1 submitted 22 October, 2019; originally announced October 2019.

  18. arXiv:1910.07295  [pdf, other

    stat.ML cs.IR cs.LG

    Towards Resolving Propensity Contradiction in Offline Recommender Learning

    Authors: Yuta Saito, Masahiro Nomura

    Abstract: We study offline recommender learning from explicit rating feedback in the presence of selection bias. A current promising solution for the bias is the inverse propensity score (IPS) estimation. However, the performance of existing propensity-based methods can suffer significantly from the propensity estimation bias. In fact, most of the previous IPS-based methods require some amount of missing-co… ▽ More

    Submitted 20 April, 2022; v1 submitted 16 October, 2019; originally announced October 2019.

    Comments: IJCAI2022

  19. arXiv:1910.01847  [pdf, other

    stat.ML cs.LG

    Dual Learning Algorithm for Delayed Conversions

    Authors: Yuta Saito, Gota Morishita, Shota Yasui

    Abstract: In display advertising, predicting the conversion rate (CVR), meaning the probability that a user takes a predefined action on an advertiser's website, is a fundamental task for estimating the value of displaying an advertisement to a user. There are two main challenges in CVR prediction due to delayed feedback. First, some positive labels are not correctly observed in training data because some c… ▽ More

    Submitted 18 May, 2020; v1 submitted 4 October, 2019; originally announced October 2019.

  20. arXiv:1910.01444  [pdf, other

    cs.SI cs.IR cs.LG stat.ML

    Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback

    Authors: Yuta Saito

    Abstract: In most real-world recommender systems, the observed rating data are subject to selection bias, and the data are thus missing-not-at-random. Develo** a method to facilitate the learning of a recommender with biased feedback is one of the most challenging problems, as it is widely known that naive approaches under selection bias often lead to suboptimal results. A well-established solution for th… ▽ More

    Submitted 2 June, 2020; v1 submitted 8 September, 2019; originally announced October 2019.

    Comments: 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '20)

  21. arXiv:1909.05299  [pdf, other

    stat.ML cs.LG

    Counterfactual Cross-Validation: Stable Model Selection Procedure for Causal Inference Models

    Authors: Yuta Saito, Shota Yasui

    Abstract: We study the model selection problem in conditional average treatment effect (CATE) prediction. Unlike previous works on this topic, we focus on preserving the rank order of the performance of candidate CATE predictors to enable accurate and stable model selection. To this end, we analyze the model performance ranking problem and formulate guidelines to obtain a better evaluation metric. We then p… ▽ More

    Submitted 16 July, 2020; v1 submitted 11 September, 2019; originally announced September 2019.

  22. arXiv:1909.03601  [pdf, other

    stat.ML cs.IR cs.LG

    Unbiased Recommender Learning from Missing-Not-At-Random Implicit Feedback

    Authors: Yuta Saito, Suguru Yaginuma, Yuta Nishino, Hayato Sakata, Kazuhide Nakata

    Abstract: Recommender systems widely use implicit feedback such as click data because of its general availability. Although the presence of clicks signals the users' preference to some extent, the lack of such clicks does not necessarily indicate a negative response from the users, as it is possible that the users were not exposed to the items (positive-unlabeled problem). This leads to a difficulty in pred… ▽ More

    Submitted 9 February, 2020; v1 submitted 8 September, 2019; originally announced September 2019.

    Comments: accepted at WSDM'20

  23. arXiv:1907.08294  [pdf, ps, other

    eess.AS cs.LG cs.SD stat.ML

    DNN-based Speaker Embedding Using Subjective Inter-speaker Similarity for Multi-speaker Modeling in Speech Synthesis

    Authors: Yuki Saito, Shinnosuke Takamichi, Hiroshi Saruwatari

    Abstract: This paper proposes novel algorithms for speaker embedding using subjective inter-speaker similarity based on deep neural networks (DNNs). Although conventional DNN-based speaker embedding such as a $d$-vector can be applied to multi-speaker modeling in speech synthesis, it does not correlate with the subjective inter-speaker similarity and is not necessarily appropriate speaker representation for… ▽ More

    Submitted 19 July, 2019; originally announced July 2019.

    Comments: 6 pages, 7 figures, accepted for The 10th ISCA Speech Synthesis Workshop (SSW10)

  24. arXiv:1807.02999  [pdf, ps, other

    cs.LG stat.ML

    Decreasing the size of the Restricted Boltzmann machine

    Authors: Yohei Saito, Takuya Kato

    Abstract: We propose a method to decrease the number of hidden units of the restricted Boltzmann machine while avoiding decrease of the performance measured by the Kullback-Leibler divergence. Then, we demonstrate our algorithm by using numerical simulations.

    Submitted 12 December, 2018; v1 submitted 9 July, 2018; originally announced July 2018.