Skip to main content

Showing 1–19 of 19 results for author: Fiez, T

.
  1. arXiv:2406.10738  [pdf, other

    cs.LG stat.ME

    Adaptive Experimentation When You Can't Experiment

    Authors: Yao Zhao, Kwang-Sung Jun, Tanner Fiez, Lalit Jain

    Abstract: This paper introduces the \emph{confounded pure exploration transductive linear bandit} (\texttt{CPET-LB}) problem. As a motivating example, often online services cannot directly assign users to specific control or treatment experiences either for business or practical reasons. In these settings, naively comparing treatment and control groups that may result from self-selection can lead to biased… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  2. arXiv:2402.10870  [pdf, other

    cs.LG stat.ME

    Best of Three Worlds: Adaptive Experimentation for Digital Marketing in Practice

    Authors: Tanner Fiez, Houssam Nassif, Yu-Cheng Chen, Sergio Gamez, Lalit Jain

    Abstract: Adaptive experimental design (AED) methods are increasingly being used in industry as a tool to boost testing throughput or reduce experimentation cost relative to traditional A/B/N testing methods. However, the behavior and guarantees of such methods are not well-understood beyond idealized stationary settings. This paper shares lessons learned regarding the challenges of naively using AED system… ▽ More

    Submitted 26 February, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Journal ref: The Web Conference (WWW), Singapore, 2024

  3. arXiv:2310.04390  [pdf, other

    math.ST

    Experimental Designs for Heteroskedastic Variance

    Authors: Justin Weltz, Tanner Fiez, Alexander Volfovsky, Eric Laber, Blake Mason, Houssam Nassif, Lalit Jain

    Abstract: Most linear experimental design problems assume homogeneous variance although heteroskedastic noise is present in many realistic settings. Let a learner have access to a finite set of measurement vectors $\mathcal{X}\subset \mathbb{R}^d$ that can be probed to receive noisy linear responses of the form $y=x^{\top}θ^{\ast}+η$. Here $θ^{\ast}\in \mathbb{R}^d$ is an unknown parameter vector, and $η$ i… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

    Journal ref: Conference on Neural Information Processing Systems (NeurIPS'23), New Orleans, 2023

  4. Neural Insights for Digital Marketing Content Design

    Authors: Fanjie Kong, Yuan Li, Houssam Nassif, Tanner Fiez, Ricardo Henao, Shreya Chakrabarti

    Abstract: In digital marketing, experimenting with new website content is one of the key levers to improve customer engagement. However, creating successful marketing content is a manual and time-consuming process that lacks clear guiding principles. This paper seeks to close the loop between content creation and online experimentation by offering marketers AI-driven actionable insights based on historical… ▽ More

    Submitted 7 June, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Journal ref: International Conference on Knowledge Discovery and Data Mining (KDD'23), Long Beach, CA, pp. 4320-4332, 2023

  5. arXiv:2210.14369  [pdf, other

    cs.LG stat.ME

    Adaptive Experimental Design and Counterfactual Inference

    Authors: Tanner Fiez, Sergio Gamez, Arick Chen, Houssam Nassif, Lalit Jain

    Abstract: Adaptive experimental design methods are increasingly being used in industry as a tool to boost testing throughput or reduce experimentation cost relative to traditional A/B/N testing methods. This paper shares lessons learned regarding the challenges and pitfalls of naively using adaptive experimentation systems in industrial settings where non-stationarity is prevalent, while also providing pers… ▽ More

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: In Workshops of the Conference on Recommender Systems (RecSys), 2022

  6. arXiv:2111.03377  [pdf, other

    cs.GT cs.LG cs.MA

    Online Learning in Periodic Zero-Sum Games

    Authors: Tanner Fiez, Ryann Sim, Stratis Skoulakis, Georgios Piliouras, Lillian Ratliff

    Abstract: A seminal result in game theory is von Neumann's minmax theorem, which states that zero-sum games admit an essentially unique equilibrium solution. Classical learning results build on this theorem to show that online no-regret dynamics converge to an equilibrium in a time-average sense in zero-sum games. In the past several years, a key research direction has focused on characterizing the day-to-d… ▽ More

    Submitted 5 November, 2021; originally announced November 2021.

    Comments: To appear at NeurIPS 2021

  7. arXiv:2109.12286  [pdf, other

    cs.LG

    Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms

    Authors: Liyuan Zheng, Tanner Fiez, Zane Alumbaugh, Benjamin Chasnov, Lillian J. Ratliff

    Abstract: The hierarchical interaction between the actor and critic in actor-critic based reinforcement learning algorithms naturally lends itself to a game-theoretic interpretation. We adopt this viewpoint and model the actor and critic interaction as a two-player general-sum game with a leader-follower structure known as a Stackelberg game. Given this abstraction, we propose a meta-framework for Stackelbe… ▽ More

    Submitted 25 September, 2021; originally announced September 2021.

  8. arXiv:2106.01488  [pdf, other

    cs.LG cs.GT

    Minimax Optimization with Smooth Algorithmic Adversaries

    Authors: Tanner Fiez, Chi **, Praneeth Netrapalli, Lillian J. Ratliff

    Abstract: This paper considers minimax optimization $\min_x \max_y f(x, y)$ in the challenging setting where $f$ can be both nonconvex in $x$ and nonconcave in $y$. Though such optimization problems arise in many machine learning paradigms including training generative adversarial networks (GANs) and adversarially robust models, many fundamental issues remain in theory, such as the absence of efficiently co… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

  9. arXiv:2012.08382  [pdf, other

    cs.GT cs.LG cs.MA

    Evolutionary Game Theory Squared: Evolving Agents in Endogenously Evolving Zero-Sum Games

    Authors: Stratis Skoulakis, Tanner Fiez, Ryann Sim, Georgios Piliouras, Lillian Ratliff

    Abstract: The predominant paradigm in evolutionary game theory and more generally online learning in games is based on a clear distinction between a population of dynamic agents that interact given a fixed, static game. In this paper, we move away from the artificial divide between dynamic agents and static games, to introduce and analyze a large class of competitive settings where both the agents and the g… ▽ More

    Submitted 15 December, 2020; originally announced December 2020.

    Comments: To appear in AAAI 2021

  10. arXiv:2009.14820  [pdf, other

    cs.LG cs.GT eess.SY stat.ML

    Gradient Descent-Ascent Provably Converges to Strict Local Minmax Equilibria with a Finite Timescale Separation

    Authors: Tanner Fiez, Lillian Ratliff

    Abstract: We study the role that a finite timescale separation parameter $τ$ has on gradient descent-ascent in two-player non-convex, non-concave zero-sum games where the learning rate of player 1 is denoted by $γ_1$ and the learning rate of player 2 is defined to be $γ_2=τγ_1$. Existing work analyzing the role of timescale separation in gradient descent-ascent has primarily focused on the edge cases of pla… ▽ More

    Submitted 30 September, 2020; originally announced September 2020.

  11. arXiv:2007.07079  [pdf, other

    cs.AI cs.IR cs.LG

    A SUPER* Algorithm to Optimize Paper Bidding in Peer Review

    Authors: Tanner Fiez, Nihar B. Shah, Lillian Ratliff

    Abstract: A number of applications involve sequential arrival of users, and require showing each user an ordering of items. A prime example (which forms the focus of this paper) is the bidding process in conference peer review where reviewers enter the system sequentially, each reviewer needs to be shown the list of submitted papers, and the reviewer then "bids" to review some papers. The order of the paper… ▽ More

    Submitted 31 July, 2020; v1 submitted 27 June, 2020; originally announced July 2020.

  12. arXiv:1906.08399  [pdf, other

    stat.ML cs.LG

    Sequential Experimental Design for Transductive Linear Bandits

    Authors: Tanner Fiez, Lalit Jain, Kevin Jamieson, Lillian Ratliff

    Abstract: In this paper we introduce the transductive linear bandit problem: given a set of measurement vectors $\mathcal{X}\subset \mathbb{R}^d$, a set of items $\mathcal{Z}\subset \mathbb{R}^d$, a fixed confidence $δ$, and an unknown vector $θ^{\ast}\in \mathbb{R}^d$, the goal is to infer $\text{argmax}_{z\in \mathcal{Z}} z^\topθ^\ast$ with probability $1-δ$ by making as few sequentially chosen noisy meas… ▽ More

    Submitted 19 June, 2019; originally announced June 2019.

  13. arXiv:1906.01217  [pdf, other

    cs.GT cs.LG eess.SY

    Convergence of Learning Dynamics in Stackelberg Games

    Authors: Tanner Fiez, Benjamin Chasnov, Lillian J. Ratliff

    Abstract: This paper investigates the convergence of learning dynamics in Stackelberg games. In the class of games we consider, there is a hierarchical game being played between a leader and a follower with continuous action spaces. We establish a number of connections between the Nash and Stackelberg equilibrium concepts and characterize conditions under which attracting critical points of simultaneous gra… ▽ More

    Submitted 6 November, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: This version includes numerical results training generative adversarial networks

    MSC Class: math.OC

  14. arXiv:1807.02297  [pdf, other

    cs.LG cs.AI eess.SY stat.ML

    Combinatorial Bandits for Incentivizing Agents with Dynamic Preferences

    Authors: Tanner Fiez, Shreyas Sekar, Liyuan Zheng, Lillian J. Ratliff

    Abstract: The design of personalized incentives or recommendations to improve user engagement is gaining prominence as digital platform providers continually emerge. We propose a multi-armed bandit framework for matching incentives to users, whose preferences are unknown a priori and evolving dynamically in time, in a resource constrained environment. We design an algorithm that combines ideas from three di… ▽ More

    Submitted 6 July, 2018; originally announced July 2018.

    Comments: Published as a conference paper in Conference on Uncertainty in Artificial Intelligence (UAI) 2018

  15. arXiv:1806.05749  [pdf, other

    cs.GT eess.SY

    Adaptive Incentive Design

    Authors: Lillian J. Ratliff, Tanner Fiez

    Abstract: We apply control theoretic and optimization techniques to adaptively design incentives. In particular, we consider the problem of a planner with an objective that depends on data from strategic decision makers. The planner does not know the process by which the strategic agents make decisions. Under the assumption that the agents are utility maximizers, we model their interactions as a non-coopera… ▽ More

    Submitted 14 June, 2018; originally announced June 2018.

  16. arXiv:1803.04008  [pdf, other

    cs.LG

    Multi-Armed Bandits for Correlated Markovian Environments with Smoothed Reward Feedback

    Authors: Tanner Fiez, Shreyas Sekar, Lillian J. Ratliff

    Abstract: We study a multi-armed bandit problem in a dynamic environment where arm rewards evolve in a correlated fashion according to a Markov chain. Different than much of the work on related problems, in our formulation a learning algorithm does not have access to either a priori information or observations of the state of the Markov chain and only observes smoothed reward feedback following time interva… ▽ More

    Submitted 1 March, 2019; v1 submitted 11 March, 2018; originally announced March 2018.

    Comments: Significant revision of prior version including deeper discussion of related work, gap-independent regret bounds, and regret bounds for discounted rewards

  17. arXiv:1712.01263  [pdf, other

    stat.AP

    Data-Driven Spatio-Temporal Analysis of Curbside Parking Demand: A Case-Study in Seattle

    Authors: Tanner Fiez, Lillian Ratliff

    Abstract: Due to rapid expansion of urban areas in recent years, management of curbside parking has become increasingly important. To mitigate congestion, while meeting a city's diverse needs, performance-based pricing schemes have received a significant amount of attention. However, several recent studies suggest location, time-of-day, and awareness of policies are the primary factors that drive parking de… ▽ More

    Submitted 2 December, 2017; originally announced December 2017.

    Comments: Submitted to IEEE Transactions on Intelligent Transportation Systems

  18. arXiv:1703.07802  [pdf, other

    math.OC

    Optimizing Curbside Parking Resources Subject to Congestion Constraints

    Authors: Chase Dowling, Tanner Fiez, Lillian Ratliff, Baosen Zhang

    Abstract: To gain theoretical insight into the relationship between parking scarcity and congestion, we describe block-faces of curbside parking as a network of queues. Due to the nature of this network, canonical queueing network results are not available to us. We present a new kind of queueing network subject to customer rejection due to the lack of available servers. We provide conditions for such netwo… ▽ More

    Submitted 22 March, 2017; originally announced March 2017.

    Comments: Submitted to IEEE CDC, 2017. 17 pages, 9 figures

  19. arXiv:1702.06156  [pdf, other

    cs.CY

    How Much Urban Traffic is Searching for Parking? Simulating Curbside Parking as a Network of Finite Capacity Queues

    Authors: Chase Dowling, Tanner Fiez, Lillian Ratliff, Baosen Zhang

    Abstract: With the increasing availability of transaction data collected by digital parking meters, paid curbside parking can be advantageously modeled as a network of interdependent queues. In this article we introduce methods for analyzing a special class of networks of finite capacity queues, where tasks arrive from an exogenous source, join the queue if there is an available server or are rejected and m… ▽ More

    Submitted 11 May, 2018; v1 submitted 20 February, 2017; originally announced February 2017.

    Comments: Updated May 11, 2018 (fixed formatting errors)