-
Randomized FIFO Mechanisms
Authors:
Francisco Castro,
Hongyao Ma,
Hamid Nazerzadeh,
Chiwei Yan
Abstract:
We study the matching of jobs to workers in a queue, e.g. a ridesharing platform dispatching drivers to pick up riders at an airport. Under FIFO dispatching, the heterogeneity in trip earnings incentivizes drivers to cherry-pick, increasing riders' waiting time for a match and resulting in a loss of efficiency and reliability. We first present the direct FIFO mechanism, which offers lower-earning…
▽ More
We study the matching of jobs to workers in a queue, e.g. a ridesharing platform dispatching drivers to pick up riders at an airport. Under FIFO dispatching, the heterogeneity in trip earnings incentivizes drivers to cherry-pick, increasing riders' waiting time for a match and resulting in a loss of efficiency and reliability. We first present the direct FIFO mechanism, which offers lower-earning trips to drivers further down the queue. The option to skip the rest of the line incentivizes drivers to accept all dispatches, but the mechanism would be considered unfair since drivers closer to the head of the queue may have lower priority for trips to certain destinations. To avoid the use of unfair dispatch rules, we introduce a family of randomized FIFO mechanisms, which send declined trips gradually down the queue in a randomized manner. We prove that a randomized FIFO mechanism achieves the first best throughput and the second best revenue in equilibrium. Extensive counterfactual simulations using data from the City of Chicago demonstrate substantial improvements of revenue and throughput, highlighting the effectiveness of using waiting times to align incentives and reduce the variability in driver earnings.
△ Less
Submitted 20 November, 2021;
originally announced November 2021.
-
Matching Queues, Flexibility and Incentives
Authors:
Francisco Castro,
Peter Frazier,
Hongyao Ma,
Hamid Nazerzadeh,
Chiwei Yan
Abstract:
Problem definition: Agents in online marketplaces (such as ridesharing and freelancing platforms) are often strategic, and heterogeneous in their compatibility with different types of jobs: fully flexible agents can fulfill any job, whereas specialized agents can only complete specific subsets of jobs. Convention wisdom suggests reserving agents that are more flexible whenever possible, however th…
▽ More
Problem definition: Agents in online marketplaces (such as ridesharing and freelancing platforms) are often strategic, and heterogeneous in their compatibility with different types of jobs: fully flexible agents can fulfill any job, whereas specialized agents can only complete specific subsets of jobs. Convention wisdom suggests reserving agents that are more flexible whenever possible, however this may incentivize agents to pretend to be more specialized, leading to loss in matches. We focus on designing a practical matching policy that performs well in a strategic environment.
Methodology/results: We model the allocation of jobs to agents as a matching queue, and analyze the equilibrium performance of various matching policies when agents are strategic and report their own types. We show that reserving flexibility naively can backfire, to the extent that the equilibrium throughput can be arbitrarily bad compared to a policy which simply dispatches jobs to agents at random. To balance matching efficiency with agents' strategic considerations, we propose a new policy dubbed flexibility reservation with fallback and show that it enjoys robust performance.
Managerial implications: Our work highlights the importance of considering agent strategic behavior when designing matching policies in online platforms and service systems. The robust performance guarantee, along with the parameter-free nature of our proposed policy makes it easy to implement in practice. We illustrate how this policy is implemented in the driver destination product of major ridesharing platforms.
△ Less
Submitted 14 January, 2024; v1 submitted 15 June, 2020;
originally announced June 2020.
-
Dynamic Reserve Prices for Repeated Auctions: Learning from Bids
Authors:
Yash Kanoria,
Hamid Nazerzadeh
Abstract:
A large fraction of online advertisement is sold via repeated second price auctions. In these auctions, the reserve price is the main tool for the auctioneer to boost revenues. In this work, we investigate the following question: Can changing the reserve prices based on the previous bids improve the revenue of the auction, taking into account the long-term incentives and strategic behavior of the…
▽ More
A large fraction of online advertisement is sold via repeated second price auctions. In these auctions, the reserve price is the main tool for the auctioneer to boost revenues. In this work, we investigate the following question: Can changing the reserve prices based on the previous bids improve the revenue of the auction, taking into account the long-term incentives and strategic behavior of the bidders? We show that if the distribution of the valuations is known and satisfies the standard regularity assumptions, then the optimal mechanism has a constant reserve. However, when there is uncertainty in the distribution of the valuations, previous bids can be used to learn the distribution of the valuations and to update the reserve price. We present a simple, approximately incentive-compatible, and asymptotically optimal dynamic reserve mechanism that can significantly improve the revenue over the best static reserve.
The paper is from July 2014 (our submission to WINE 2014), posted later here on the arxiv to complement the 1-page abstract in the WINE 2014 proceedings.
△ Less
Submitted 17 February, 2020;
originally announced February 2020.
-
Driver Surge Pricing
Authors:
Nikhil Garg,
Hamid Nazerzadeh
Abstract:
Ride-hailing marketplaces like Uber and Lyft use dynamic pricing, often called surge, to balance the supply of available drivers with the demand for rides. We study driver-side payment mechanisms for such marketplaces, presenting the theoretical foundation that has informed the design of Uber's new additive driver surge mechanism. We present a dynamic stochastic model to capture the impact of surg…
▽ More
Ride-hailing marketplaces like Uber and Lyft use dynamic pricing, often called surge, to balance the supply of available drivers with the demand for rides. We study driver-side payment mechanisms for such marketplaces, presenting the theoretical foundation that has informed the design of Uber's new additive driver surge mechanism. We present a dynamic stochastic model to capture the impact of surge pricing on driver earnings and their strategies to maximize such earnings. In this setting, some time periods (surge) are more valuable than others (non-surge), and so trips of different time lengths vary in the induced driver opportunity cost.
First, we show that multiplicative surge, historically the standard on ride-hailing platforms, is not incentive compatible in a dynamic setting. We then propose a structured, incentive-compatible pricing mechanism. This closed-form mechanism has a simple form and is well-approximated by Uber's new additive surge mechanism. Finally, through both numerical analysis and real data from a ride-hailing marketplace, we show that additive surge is more incentive compatible in practice than is multiplicative surge.
△ Less
Submitted 6 March, 2021; v1 submitted 18 May, 2019;
originally announced May 2019.
-
Multi-Product Dynamic Pricing in High-Dimensions with Heterogeneous Price Sensitivity
Authors:
Adel Javanmard,
Hamid Nazerzadeh,
Simeng Shao
Abstract:
We consider the problem of multi-product dynamic pricing, in a contextual setting, for a seller of differentiated products. In this environment, the customers arrive over time and products are described by high-dimensional feature vectors. Each customer chooses a product according to the widely used Multinomial Logit (MNL) choice model and her utility depends on the product features as well as the…
▽ More
We consider the problem of multi-product dynamic pricing, in a contextual setting, for a seller of differentiated products. In this environment, the customers arrive over time and products are described by high-dimensional feature vectors. Each customer chooses a product according to the widely used Multinomial Logit (MNL) choice model and her utility depends on the product features as well as the prices offered. The seller a-priori does not know the parameters of the choice model but can learn them through interactions with customers. The seller's goal is to design a pricing policy that maximizes her cumulative revenue. This model is motivated by online marketplaces such as Airbnb platform and online advertising. We measure the performance of a pricing policy in terms of regret, which is the expected revenue loss with respect to a clairvoyant policy that knows the parameters of the choice model in advance and always sets the revenue-maximizing prices. We propose a pricing policy, named M3P, that achieves a $T$-period regret of $O(\log(Td) ( \sqrt{T}+ d\log(T)))$ under heterogeneous price sensitivity for products with features of dimension $d$. We also use tools from information theory to prove that no policy can achieve worst-case $T$-regret better than $Ω(\sqrt{T})$.
△ Less
Submitted 15 May, 2020; v1 submitted 4 January, 2019;
originally announced January 2019.
-
Dynamic Pricing in High-dimensions
Authors:
Adel Javanmard,
Hamid Nazerzadeh
Abstract:
We study the pricing problem faced by a firm that sells a large number of products, described via a wide range of features, to customers that arrive over time. Customers independently make purchasing decisions according to a general choice model that includes products features and customers' characteristics, encoded as $d$-dimensional numerical vectors, as well as the price offered. The parameters…
▽ More
We study the pricing problem faced by a firm that sells a large number of products, described via a wide range of features, to customers that arrive over time. Customers independently make purchasing decisions according to a general choice model that includes products features and customers' characteristics, encoded as $d$-dimensional numerical vectors, as well as the price offered. The parameters of the choice model are a priori unknown to the firm, but can be learned as the (binary-valued) sales data accrues over time. The firm's objective is to minimize the regret, i.e., the expected revenue loss against a clairvoyant policy that knows the parameters of the choice model in advance, and always offers the revenue-maximizing price. This setting is motivated in part by the prevalence of online marketplaces that allow for real-time pricing. We assume a structured choice model, parameters of which depend on $s_0$ out of the $d$ product features. We propose a dynamic policy, called Regularized Maximum Likelihood Pricing (RMLP) that leverages the (sparsity) structure of the high-dimensional model and obtains a logarithmic regret in $T$. More specifically, the regret of our algorithm is of $O(s_0 \log d \cdot \log T)$. Furthermore, we show that no policy can obtain regret better than $O(s_0 (\log d + \log T))$.
△ Less
Submitted 31 December, 2017; v1 submitted 24 September, 2016;
originally announced September 2016.
-
An Optimal Dynamic Mechanism for Multi-Armed Bandit Processes
Authors:
Sham M. Kakade,
Ilan Lobel,
Hamid Nazerzadeh
Abstract:
We consider the problem of revenue-optimal dynamic mechanism design in settings where agents' types evolve over time as a function of their (both public and private) experience with items that are auctioned repeatedly over an infinite horizon. A central question here is understanding what natural restrictions on the environment permit the design of optimal mechanisms (note that even in the simpler…
▽ More
We consider the problem of revenue-optimal dynamic mechanism design in settings where agents' types evolve over time as a function of their (both public and private) experience with items that are auctioned repeatedly over an infinite horizon. A central question here is understanding what natural restrictions on the environment permit the design of optimal mechanisms (note that even in the simpler static setting, optimal mechanisms are characterized only under certain restrictions). We provide a {\em structural characterization} of a natural "separable: multi-armed bandit environment (where the evolution and incentive structure of the a-priori type is decoupled from the subsequent experience in a precise sense) where dynamic optimal mechanism design is possible. Here, we present the Virtual Index Mechanism, an optimal dynamic mechanism, which maximizes the (long term) {\em virtual surplus} using the classical Gittins algorithm. The mechanism optimally balances exploration and exploitation, taking incentives into account.
△ Less
Submitted 15 October, 2010; v1 submitted 26 January, 2010;
originally announced January 2010.