Showing 1–12 of 12 results for author: Narita, Y

  1. arXiv:2306.15098  [pdf, other]

    stat.ML cs.IR cs.LG

    Off-Policy Evaluation of Ranking Policies under Diverse User Behavior

    Authors: Haruka Kiyohara, Masatoshi Uehara, Yusuke Narita, Nobuyuki Shimizu, Yasuo Yamamoto, Yuta Saito

    Abstract: Ranking interfaces are everywhere in online platforms. There is thus an ever growing interest in their Off-Policy Evaluation (OPE), aiming towards an accurate performance evaluation of ranking policies using logged data. A de-facto approach for OPE is Inverse Propensity Scoring (IPS), which provides an unbiased and consistent value estimate. However, it becomes extremely inaccurate in the ranking…

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: KDD2023 Research track

  2. arXiv:2212.01925  [pdf, other]

    cs.LG cs.AI econ.EM stat.AP stat.ML

    Counterfactual Learning with General Data-generating Policies

    Authors: Yusuke Narita, Kyohei Okumura, Akihiro Shimizu, Kohei Yata

    Abstract: Off-policy evaluation (OPE) attempts to predict the performance of counterfactual policies using log data from a different policy. We extend its applicability by developing an OPE method for a class of both full support and deficient support logging policies in contextual-bandit settings. This class includes deterministic bandit (such as Upper Confidence Bound) as well as deterministic decision-ma…

    Submitted 4 December, 2022; originally announced December 2022.

    Comments: arXiv admin note: text overlap with arXiv:2104.12909

  3. Incorporating Participants' Welfare into Sequential Multiple Assignment Randomized Trials

    Authors: Xinru Wang, Nina Deliu, Yusuke Narita, Bibhas Chakraborty

    Abstract: Dynamic treatment regimes (DTRs) are sequences of decision rules that recommend treatments based on patients' time-varying clinical conditions. The sequential multiple assignment randomized trial (SMART) is an experimental design that can provide high-quality evidence for constructing optimal DTRs. In a conventional SMART, participants are randomized to available treatments at multiple stages with…

    Submitted 19 September, 2023; v1 submitted 28 October, 2022; originally announced October 2022.

  4. Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model

    Authors: Haruka Kiyohara, Yuta Saito, Tatsuya Matsuhiro, Yusuke Narita, Nobuyuki Shimizu, Yasuo Yamamoto

    Abstract: In real-world recommender systems and search engines, optimizing ranking decisions to present a ranked list of relevant items is critical. Off-policy evaluation (OPE) for ranking policies is thus gaining a growing interest because it enables performance estimation of new ranking policies using only logged data. Although OPE in contextual bandits has been studied extensively, its naive application…

    Submitted 3 February, 2022; originally announced February 2022.

    Comments: WSDM2022

  5. arXiv:2108.13703  [pdf, other]

    stat.ML cs.AI cs.LG

    Evaluating the Robustness of Off-Policy Evaluation

    Authors: Yuta Saito, Takuma Udagawa, Haruka Kiyohara, Kazuki Mogi, Yusuke Narita, Kei Tateno

    Abstract: Off-policy Evaluation (OPE), or offline evaluation in general, evaluates the performance of hypothetical policies leveraging only offline log data. It is particularly useful in applications where the online interaction involves high stakes and expensive setting such as precision medicine and recommender systems. Since many OPE estimators have been proposed and some of them have hyperparameters to…

    Submitted 31 August, 2021; originally announced August 2021.

    Comments: Accepted at RecSys2021

  6. arXiv:2104.12909  [pdf, other]

    econ.EM cs.LG stat.ME stat.ML

    Algorithm as Experiment: Machine Learning, Market Design, and Policy Eligibility Rules

    Authors: Yusuke Narita, Kohei Yata

    Abstract: Algorithms make a growing portion of policy and business decisions. We develop a treatment-effect estimator using algorithmic decisions as instruments for a class of stochastic and deterministic algorithms. Our estimator is consistent and asymptotically normal for well-defined causal effects. A special case of our setup is multidimensional regression discontinuity designs with complex boundaries.…

    Submitted 5 December, 2023; v1 submitted 26 April, 2021; originally announced April 2021.

  7. arXiv:2104.07617  [pdf, other]

    econ.GN stat.AP

    Curse of Democracy: Evidence from the 21st Century

    Authors: Yusuke Narita, Ayumi Sudo

    Abstract: Democracy is widely believed to contribute to economic growth and public health in the 20th and earlier centuries. We find that this conventional wisdom is reversed in this century, i.e., democracy has persistent negative impacts on GDP growth during 2001-2020. This finding emerges from five different instrumental variable strategies. Our analysis suggests that democracies cause slower growth thro…

    Submitted 26 September, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

  8. arXiv:2101.01093  [pdf, other]

    econ.EM stat.AP stat.ME

    Breaking Ties: Regression Discontinuity Design Meets Market Design

    Authors: Atila Abdulkadiroglu, Joshua D. Angrist, Yusuke Narita, Parag Pathak

    Abstract: Many schools in large urban districts have more applicants than seats. Centralized school assignment algorithms ration seats at over-subscribed schools using randomly assigned lottery numbers, non-lottery tie-breakers like test scores, or both. The New York City public high school match illustrates the latter, using test scores and other criteria to rank applicants at "screened" schools, combine…

    Submitted 31 December, 2020; originally announced January 2021.

  9. arXiv:2008.07146  [pdf, other]

    cs.LG stat.ML

    Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation

    Authors: Yuta Saito, Shunsuke Aihara, Megumi Matsutani, Yusuke Narita

    Abstract: Off-policy evaluation (OPE) aims to estimate the performance of hypothetical policies using data generated by a different policy. Because of its huge potential impact in practice, there has been growing research interest in this field. There is, however, no real-world public dataset that enables the evaluation of OPE, making its experimental studies unrealistic and irreproducible. With the goal of…

    Submitted 26 October, 2021; v1 submitted 17 August, 2020; originally announced August 2020.

    Comments: Accepted at NeurIPS2021 Datasets and Benchmarks Track

  10. arXiv:2002.08536  [pdf, other]

    cs.LG cs.AI econ.EM stat.ME stat.ML

    Debiased Off-Policy Evaluation for Recommendation Systems

    Authors: Yusuke Narita, Shota Yasui, Kohei Yata

    Abstract: Efficient methods to evaluate new algorithms are critical for improving interactive bandit and reinforcement learning systems such as recommendation systems. A/B tests are reliable, but are time- and money-consuming, and entail a risk of failure. In this paper, we develop an alternative method, which predicts the performance of algorithms given historical data that may have been generated by a dif…

    Submitted 2 August, 2021; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: Accepted at RecSys '21

  11. arXiv:2002.05308  [pdf, ps, other]

    stat.ML cs.LG econ.EM

    Efficient Adaptive Experimental Design for Average Treatment Effect Estimation

    Authors: Masahiro Kato, Takuya Ishihara, Junya Honda, Yusuke Narita

    Abstract: The goal of many scientific experiments including A/B testing is to estimate the average treatment effect (ATE), which is defined as the difference between the expected outcomes of two or more treatments. In this paper, we consider a situation where an experimenter can assign a treatment to research subjects sequentially. In adaptive experimental design, the experimenter is allowed to change the p…

    Submitted 26 October, 2021; v1 submitted 12 February, 2020; originally announced February 2020.

  12. arXiv:1809.03084  [pdf, other]

    cs.LG cs.AI cs.IR stat.ME stat.ML

    Efficient Counterfactual Learning from Bandit Feedback

    Authors: Yusuke Narita, Shota Yasui, Kohei Yata

    Abstract: What is the most statistically efficient way to do off-policy evaluation and optimization with batch data from bandit feedback? For log data generated by contextual bandit algorithms, we consider offline estimators for the expected reward from a counterfactual policy. Our estimators are shown to have lowest variance in a wide class of estimators, achieving variance reduction relative to standard e…

    Submitted 5 December, 2018; v1 submitted 9 September, 2018; originally announced September 2018.

    Comments: Accepted at AAAI 2019