Skip to main content

Showing 1–18 of 18 results for author: Hüyük, A

.
  1. arXiv:2403.00694  [pdf, other

    stat.ML cs.AI cs.LG stat.ME

    Defining Expertise: Applications to Treatment Effect Estimation

    Authors: Alihan Hüyük, Qiyao Wei, Alicia Curth, Mihaela van der Schaar

    Abstract: Decision-makers are often experts of their domain and take actions based on their domain knowledge. Doctors, for instance, may prescribe treatments by predicting the likely outcome of each available treatment. Actions of an expert thus naturally encode part of their domain knowledge, and can help make inferences within the same domain: Knowing doctors try to prescribe the best treatment for their… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: The 12th International Conference on Learning Representations (ICLR 2024)

  2. arXiv:2401.17205  [pdf, other

    stat.ML cs.LG

    Adaptive Experiment Design with Synthetic Controls

    Authors: Alihan Hüyük, Zhaozhi Qian, Mihaela van der Schaar

    Abstract: Clinical trials are typically run in order to understand the effects of a new treatment on a given population of patients. However, patients in large populations rarely respond the same way to the same treatment. This heterogeneity in patient responses necessitates trials that investigate effects on multiple subpopulations - especially when a treatment has marginal or no benefit for the overall po… ▽ More

    Submitted 9 February, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: Proceedings of the 27th International Conference on Artificial Intelligence and Statistics

  3. arXiv:2311.14110  [pdf, other

    cs.LG cs.AI

    When is Off-Policy Evaluation Useful? A Data-Centric Perspective

    Authors: Hao Sun, Alex J. Chan, Nabeel Seedat, Alihan Hüyük, Mihaela van der Schaar

    Abstract: Evaluating the value of a hypothetical target policy with only a logged dataset is important but challenging. On the one hand, it brings opportunities for safe policy improvement under high-stakes scenarios like clinical guidelines. On the other hand, such opportunities raise a need for precise off-policy evaluation (OPE). While previous work on OPE focused on improving the algorithm in value esti… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: Off-Policy Evaluation, Data-Centric AI, Data-Centric Reinforcement Learning, Reinforcement Learning

  4. arXiv:2311.07426  [pdf, other

    cs.LG cs.CV cs.HC

    Optimising Human-AI Collaboration by Learning Convincing Explanations

    Authors: Alex J. Chan, Alihan Huyuk, Mihaela van der Schaar

    Abstract: Machine learning models are being increasingly deployed to take, or assist in taking, complicated and high-impact decisions, from quasi-autonomous vehicles to clinical decision support systems. This poses challenges, particularly when models have hard-to-detect failure modes and are able to take actions without oversight. In order to handle this challenge, we propose a method for a collaborative s… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  5. arXiv:2310.19831  [pdf, other

    stat.ML cs.LG

    Explaining by Imitating: Understanding Decisions by Interpretable Policy Learning

    Authors: Alihan Hüyük, Daniel Jarrett, Mihaela van der Schaar

    Abstract: Understanding human behavior from observed data is critical for transparency and accountability in decision-making. Consider real-world settings such as healthcare, in which modeling a decision-maker's policy is challenging -- with no access to underlying states, no knowledge of environment dynamics, and no allowance for live experimentation. We desire learning a data-driven representation of deci… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Journal ref: In Proc. 9th International Conference on Learning Representations (ICLR 2021)

  6. arXiv:2310.18601  [pdf, other

    stat.ML cs.LG

    Online Decision Mediation

    Authors: Daniel Jarrett, Alihan Hüyük, Mihaela van der Schaar

    Abstract: Consider learning a decision support assistant to serve as an intermediary between (oracle) expert behavior and (imperfect) human behavior: At each time, the algorithm observes an action chosen by a fallible agent, and decides whether to *accept* that agent's decision, *intervene* with an alternative, or *request* the expert's opinion. For instance, in clinical diagnosis, fully-autonomous machine… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Journal ref: In Proc. 36th International Conference on Neural Information Processing Systems (NeurIPS 2022)

  7. arXiv:2310.18591  [pdf, other

    stat.ML cs.LG

    Inverse Decision Modeling: Learning Interpretable Representations of Behavior

    Authors: Daniel Jarrett, Alihan Hüyük, Mihaela van der Schaar

    Abstract: Decision analysis deals with modeling and enhancing decision processes. A principal challenge in improving behavior is in obtaining a transparent description of existing behavior in the first place. In this paper, we develop an expressive, unifying perspective on inverse decision modeling: a framework for learning parameterized representations of sequential decision behavior. First, we formalize t… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Journal ref: In Proc. 38th International Conference on Machine Learning (ICML 2021)

  8. arXiv:2310.07747  [pdf, other

    cs.LG cs.AI cs.RO eess.SY

    Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples

    Authors: Hao Sun, Alihan Hüyük, Daniel Jarrett, Mihaela van der Schaar

    Abstract: Learning controllers with offline data in decision-making systems is an essential area of research due to its potential to reduce the risk of applications in real-world systems. However, in responsibility-sensitive settings such as healthcare, decision accountability is of paramount importance, yet has not been adequately addressed by the literature. This paper introduces the Accountable Offline C… ▽ More

    Submitted 27 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

  9. arXiv:2309.06553  [pdf, other

    cs.CL cs.AI cs.LG

    Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL

    Authors: Hao Sun, Alihan Hüyük, Mihaela van der Schaar

    Abstract: In this study, we aim to enhance the arithmetic reasoning ability of Large Language Models (LLMs) through zero-shot prompt optimization. We identify a previously overlooked objective of query dependency in such optimization and elucidate two ensuing challenges that impede the successful and economical design of prompt optimization techniques. One primary issue is the absence of an effective method… ▽ More

    Submitted 7 March, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

  10. arXiv:2302.12604  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Neural Laplace Control for Continuous-time Delayed Systems

    Authors: Samuel Holt, Alihan Hüyük, Zhaozhi Qian, Hao Sun, Mihaela van der Schaar

    Abstract: Many real-world offline reinforcement learning (RL) problems involve continuous-time environments with delays. Such environments are characterized by two distinctive features: firstly, the state x(t) is observed at irregular time intervals, and secondly, the current action a(t) only affects the future state x(t + g) with an unknown delay g > 0. A prime example of such an environment is satellite c… ▽ More

    Submitted 10 April, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: Proceedings of the 26th International Conference on Artificial Intelligence and Statistics (AISTATS) 2023, Valencia, Spain. PMLR: Volume 206. Copyright 2023 by the author(s)

    ACM Class: I.2.6; I.2.5; E.1

  11. arXiv:2208.05844  [pdf, other

    stat.ML cs.LG

    Adaptive Identification of Populations with Treatment Benefit in Clinical Trials: Machine Learning Challenges and Solutions

    Authors: Alicia Curth, Alihan Hüyük, Mihaela van der Schaar

    Abstract: We study the problem of adaptively identifying patient subpopulations that benefit from a given treatment during a confirmatory clinical trial. This type of adaptive clinical trial has been thoroughly studied in biostatistics, but has been allowed only limited adaptivity so far. Here, we aim to relax classical restrictions on such designs and investigate how to incorporate ideas from the recent ma… ▽ More

    Submitted 5 June, 2023; v1 submitted 11 August, 2022; originally announced August 2022.

    Comments: To appear in the Proceedings of the 40th International Conference on Machine Learning, Honolulu, Hawaii, USA. PMLR 202, 2023

  12. arXiv:2202.10153  [pdf, other

    cs.LG cs.AI stat.ML

    Inferring Lexicographically-Ordered Rewards from Preferences

    Authors: Alihan Hüyük, William R. Zame, Mihaela van der Schaar

    Abstract: Modeling the preferences of agents over a set of alternatives is a principal concern in many areas. The dominant approach has been to find a single reward/utility function with the property that alternatives yielding higher rewards are preferred over alternatives yielding lower rewards. However, in many settings, preferences are based on multiple, often competing, objectives; a single reward funct… ▽ More

    Submitted 7 June, 2022; v1 submitted 21 February, 2022; originally announced February 2022.

    Comments: In Proceedings of the 36th AAAI Conference on Artificial Intelligence

  13. arXiv:2107.06317  [pdf, other

    cs.LG stat.ML

    Inverse Contextual Bandits: Learning How Behavior Evolves over Time

    Authors: Alihan Hüyük, Daniel Jarrett, Mihaela van der Schaar

    Abstract: Understanding a decision-maker's priorities by observing their behavior is critical for transparency and accountability in decision processes, such as in healthcare. Though conventional approaches to policy learning almost invariably assume stationarity in behavior, this is hardly true in practice: Medical practice is constantly evolving as clinical professionals fine-tune their knowledge over tim… ▽ More

    Submitted 8 June, 2022; v1 submitted 13 July, 2021; originally announced July 2021.

    Comments: In Proceedings of the 39th International Conference on Machine Learning

  14. arXiv:2106.04240  [pdf, other

    cs.LG

    The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation

    Authors: Alex J. Chan, Ioana Bica, Alihan Huyuk, Daniel Jarrett, Mihaela van der Schaar

    Abstract: Understanding decision-making in clinical environments is of paramount importance if we are to bring the strengths of machine learning to ultimately improve patient outcomes. Several factors including the availability of public data, the intrinsically offline nature of the problem, and the complexity of human decision making, has meant that the mainstream development of algorithms is often geared… ▽ More

    Submitted 14 March, 2022; v1 submitted 8 June, 2021; originally announced June 2021.

  15. arXiv:2007.13531  [pdf, other

    cs.LG cs.AI stat.ML

    Learning "What-if" Explanations for Sequential Decision-Making

    Authors: Ioana Bica, Daniel Jarrett, Alihan Hüyük, Mihaela van der Schaar

    Abstract: Building interpretable parameterizations of real-world decision-making on the basis of demonstrated behavior -- i.e. trajectories of observations and actions made by an expert maximizing some unknown reward function -- is essential for introspecting and auditing policies in different institutions. In this paper, we propose learning explanations of expert decisions by modeling their reward function… ▽ More

    Submitted 30 March, 2021; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: In Proc. 9th International Conference on Learning Representations (ICLR 2021)

  16. arXiv:1907.11605  [pdf, other

    cs.LG stat.ML

    Lexicographic Multiarmed Bandit

    Authors: Alihan Hüyük, Cem Tekin

    Abstract: We consider a multiobjective multiarmed bandit problem with lexicographically ordered objectives. In this problem, the goal of the learner is to select arms that are lexicographic optimal as much as possible without knowing the arm reward distributions beforehand. We capture this goal by defining a multidimensional form of regret that measures the loss of the learner due to not selecting lexicogra… ▽ More

    Submitted 29 July, 2019; v1 submitted 26 July, 2019; originally announced July 2019.

    Comments: 22 pages

  17. Thompson Sampling for Combinatorial Network Optimization in Unknown Environments

    Authors: Alihan Hüyük, Cem Tekin

    Abstract: Influence maximization, adaptive routing, and dynamic spectrum allocation all require choosing the right action from a large set of alternatives. Thanks to the advances in combinatorial optimization, these and many similar problems can be efficiently solved given an environment with known stochasticity. In this paper, we take this one step further and focus on combinatorial optimization in unknown… ▽ More

    Submitted 21 September, 2020; v1 submitted 7 July, 2019; originally announced July 2019.

    Comments: 14 pages, 3 figures. Accepted for publication in the IEEE/ACM Transactions on Networking. arXiv admin note: text overlap with arXiv:1809.02707

    Journal ref: IEEE/ACM Transactions on Networking, vol. 28, no. 6, pp. 2836-2849, Dec. 2020

  18. arXiv:1809.02707  [pdf, other

    cs.LG stat.ML

    Analysis of Thompson Sampling for Combinatorial Multi-armed Bandit with Probabilistically Triggered Arms

    Authors: Alihan Hüyük, Cem Tekin

    Abstract: We analyze the regret of combinatorial Thompson sampling (CTS) for the combinatorial multi-armed bandit with probabilistically triggered arms under the semi-bandit feedback setting. We assume that the learner has access to an exact optimization oracle but does not know the expected base arm outcomes beforehand. When the expected reward function is Lipschitz continuous in the expected base arm outc… ▽ More

    Submitted 19 February, 2019; v1 submitted 7 September, 2018; originally announced September 2018.

    Comments: To appear in the Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS) 2019