Search | arXiv e-print repository

A Network Simulation of OTC Markets with Multiple Agents

Authors: James T. Wilkinson, Jacob Kelter, John Chen, Uri Wilensky

Abstract: We present a novel agent-based approach to simulating an over-the-counter (OTC) financial market in which trades are intermediated solely by market makers and agent visibility is constrained to a network topology. Dynamics, such as changes in price, result from agent-level interactions that ubiquitously occur via market maker agents acting as liquidity providers. Two additional agents are consider… ▽ More We present a novel agent-based approach to simulating an over-the-counter (OTC) financial market in which trades are intermediated solely by market makers and agent visibility is constrained to a network topology. Dynamics, such as changes in price, result from agent-level interactions that ubiquitously occur via market maker agents acting as liquidity providers. Two additional agents are considered: trend investors use a deep convolutional neural network paired with a deep Q-learning framework to inform trading decisions by analysing price history; and value investors use a static price-target to determine their trade directions and sizes. We demonstrate that our novel inclusion of a network topology with market makers facilitates explorations into various market structures. First, we present the model and an overview of its mechanics. Second, we validate our findings via comparison to the real-world: we demonstrate a fat-tailed distribution of price changes, auto-correlated volatility, a skew negatively correlated to market maker positioning, predictable price-history patterns and more. Finally, we demonstrate that our network-based model can lend insights into the effect of market-structure on price-action. For example, we show that markets with sparsely connected intermediaries can have a critical point of fragmentation, beyond which the market forms distinct clusters and arbitrage becomes rapidly possible between the prices of different market makers. A discussion is provided on future work that would be beneficial. △ Less

Submitted 3 May, 2024; originally announced May 2024.

Comments: 20 pages, 17 figures

arXiv:2309.12162 [pdf, other]

Optimal Conditional Inference in Adaptive Experiments

Authors: Jiafeng Chen, Isaiah Andrews

Abstract: We study batched bandit experiments and consider the problem of inference conditional on the realized stop** time, assignment probabilities, and target parameter, where all of these may be chosen adaptively using information up to the last batch of the experiment. Absent further restrictions on the experiment, we show that inference using only the results of the last batch is optimal. When the a… ▽ More We study batched bandit experiments and consider the problem of inference conditional on the realized stop** time, assignment probabilities, and target parameter, where all of these may be chosen adaptively using information up to the last batch of the experiment. Absent further restrictions on the experiment, we show that inference using only the results of the last batch is optimal. When the adaptive aspects of the experiment are known to be location-invariant, in the sense that they are unchanged when we shift all batch-arm means by a constant, we show that there is additional information in the data, captured by one additional linear function of the batch-arm means. In the more restrictive case where the stop** time, assignment probabilities, and target parameter are known to depend on the data only through a collection of polyhedral events, we derive computationally tractable and optimal conditional inference procedures. △ Less

Submitted 21 September, 2023; originally announced September 2023.

Comments: An extended abstract of this paper was presented at CODE@MIT 2021

arXiv:2309.09103 [pdf, other]

Optimal Estimation under a Semiparametric Density Ratio Model

Authors: Archer Gong Zhang, Jiahua Chen

Abstract: In many statistical and econometric applications, we gather individual samples from various interconnected populations that undeniably exhibit common latent structures. Utilizing a model that incorporates these latent structures for such data enhances the efficiency of inferences. Recently, many researchers have been adopting the semiparametric density ratio model (DRM) to address the presence of… ▽ More In many statistical and econometric applications, we gather individual samples from various interconnected populations that undeniably exhibit common latent structures. Utilizing a model that incorporates these latent structures for such data enhances the efficiency of inferences. Recently, many researchers have been adopting the semiparametric density ratio model (DRM) to address the presence of latent structures. The DRM enables estimation of each population distribution using pooled data, resulting in statistically more efficient estimations in contrast to nonparametric methods that analyze each sample in isolation. In this article, we investigate the limit of the efficiency improvement attainable through the DRM. We focus on situations where one population's sample size significantly exceeds those of the other populations. In such scenarios, we demonstrate that the DRM-based inferences for populations with smaller sample sizes achieve the highest attainable asymptotic efficiency as if a parametric model is assumed. The estimands we consider include the model parameters, distribution functions, and quantiles. We use simulation experiments to support the theoretical findings with a specific focus on quantile estimation. Additionally, we provide an analysis of real revenue data from U.S. collegiate sports to illustrate the efficacy of our contribution. △ Less

Submitted 16 September, 2023; originally announced September 2023.

arXiv:2309.07427 [pdf, other]

Measuring Higher-Order Rationality with Belief Control

Authors: Wei James Chen, Meng-Jhang Fong, Po-Hsuan Lin

Abstract: Determining an individual's strategic reasoning capability based solely on choice data is a complex task. This complexity arises because sophisticated players might have non-equilibrium beliefs about others, leading to non-equilibrium actions. In our study, we pair human participants with computer players known to be fully rational. This use of robot players allows us to disentangle limited reason… ▽ More Determining an individual's strategic reasoning capability based solely on choice data is a complex task. This complexity arises because sophisticated players might have non-equilibrium beliefs about others, leading to non-equilibrium actions. In our study, we pair human participants with computer players known to be fully rational. This use of robot players allows us to disentangle limited reasoning capacity from belief formation and social biases. Our results show that, when paired with robots, subjects consistently demonstrate higher levels of rationality and maintain stable rationality levels across different games compared to when paired with humans. This suggests that strategic reasoning might indeed be a consistent trait in individuals. Furthermore, the identified rationality limits could serve as a measure for evaluating an individual's strategic capacity when their beliefs about others are adequately controlled. △ Less

Submitted 14 September, 2023; originally announced September 2023.

Comments: The experimental design and the analysis plan are pre-registered on Open Science Framework (https://osf.io/gye4u/). The experimental instructions can be found at https://mjfong.github.io/SI_MHOR_final.pdf

arXiv:2303.13218 [pdf, other]

Functional-Coefficient Quantile Regression for Panel Data with Latent Group Structure

Authors: Xiaorong Yang, Jia Chen, Degui Li, Runze Li

Abstract: This paper considers estimating functional-coefficient models in panel quantile regression with individual effects, allowing the cross-sectional and temporal dependence for large panel observations. A latent group structure is imposed on the heterogenous quantile regression models so that the number of nonparametric functional coefficients to be estimated can be reduced considerably. With the prel… ▽ More This paper considers estimating functional-coefficient models in panel quantile regression with individual effects, allowing the cross-sectional and temporal dependence for large panel observations. A latent group structure is imposed on the heterogenous quantile regression models so that the number of nonparametric functional coefficients to be estimated can be reduced considerably. With the preliminary local linear quantile estimates of the subject-specific functional coefficients, a classic agglomerative clustering algorithm is used to estimate the unknown group structure and an easy-to-implement ratio criterion is proposed to determine the group number. The estimated group number and structure are shown to be consistent. Furthermore, a post-grou** local linear smoothing method is introduced to estimate the group-specific functional coefficients, and the relevant asymptotic normal distribution theory is derived with a normalisation rate comparable to that in the literature. The developed methodologies and theory are verified through a simulation study and showcased with an application to house price data from UK local authority districts, which reveals different homogeneity structures at different quantile levels. △ Less

Submitted 23 March, 2023; originally announced March 2023.

arXiv:2303.08653 [pdf, ps, other]

Mean-variance constrained priors have finite maximum Bayes risk in the normal location model

Authors: Jiafeng Chen

Abstract: Consider a normal location model $X \mid θ\sim N(θ, σ^2)$ with known $σ^2$. Suppose $θ\sim G_0$, where the prior $G_0$ has zero mean and unit variance. Let $G_1$ be a possibly misspecified prior with zero mean and unit variance. We show that the squared error Bayes risk of the posterior mean under $G_1$ is bounded, uniformly over $G_0, G_1, σ^2 > 0$. Consider a normal location model $X \mid θ\sim N(θ, σ^2)$ with known $σ^2$. Suppose $θ\sim G_0$, where the prior $G_0$ has zero mean and unit variance. Let $G_1$ be a possibly misspecified prior with zero mean and unit variance. We show that the squared error Bayes risk of the posterior mean under $G_1$ is bounded, uniformly over $G_0, G_1, σ^2 > 0$. △ Less

Submitted 15 March, 2023; originally announced March 2023.

arXiv:2302.02476 [pdf, other]

Estimating Time-Varying Networks for High-Dimensional Time Series

Authors: Jia Chen, Degui Li, Yuning Li, Oliver Linton

Abstract: We explore time-varying networks for high-dimensional locally stationary time series, using the large VAR model framework with both the transition and (error) precision matrices evolving smoothly over time. Two types of time-varying graphs are investigated: one containing directed edges of Granger causality linkages, and the other containing undirected edges of partial correlation linkages. Under… ▽ More We explore time-varying networks for high-dimensional locally stationary time series, using the large VAR model framework with both the transition and (error) precision matrices evolving smoothly over time. Two types of time-varying graphs are investigated: one containing directed edges of Granger causality linkages, and the other containing undirected edges of partial correlation linkages. Under the sparse structural assumption, we propose a penalised local linear method with time-varying weighted group LASSO to jointly estimate the transition matrices and identify their significant entries, and a time-varying CLIME method to estimate the precision matrices. The estimated transition and precision matrices are then used to determine the time-varying network structures. Under some mild conditions, we derive the theoretical properties of the proposed estimates including the consistency and oracle properties. In addition, we extend the methodology and theory to cover highly-correlated large-scale time series, for which the sparsity assumption becomes invalid and we allow for common factors before estimating the factor-adjusted time-varying networks. We provide extensive simulation studies and an empirical application to a large U.S. macroeconomic dataset to illustrate the finite-sample performance of our methods. △ Less

Submitted 5 February, 2023; originally announced February 2023.

arXiv:2212.14444 [pdf, other]

Empirical Bayes When Estimation Precision Predicts Parameters

Authors: Jiafeng Chen

Abstract: Empirical Bayes methods usually maintain a prior independence assumption: The unknown parameters of interest are independent from the known standard errors of the estimates. This assumption is often theoretically questionable and empirically rejected. This paper instead models the conditional distribution of the parameter given the standard errors as a flexibly parametrized family of distributions… ▽ More Empirical Bayes methods usually maintain a prior independence assumption: The unknown parameters of interest are independent from the known standard errors of the estimates. This assumption is often theoretically questionable and empirically rejected. This paper instead models the conditional distribution of the parameter given the standard errors as a flexibly parametrized family of distributions, leading to a family of methods that we call CLOSE. This paper establishes that (i) CLOSE is rate-optimal for squared error Bayes regret, (ii) squared error regret control is sufficient for an important class of economic decision problems, and (iii) CLOSE is worst-case robust when our assumption on the conditional distribution is misspecified. Empirically, using CLOSE leads to sizable gains for selecting high-mobility Census tracts. Census tracts selected by CLOSE are substantially more mobile on average than those selected by the standard shrinkage method. △ Less

Submitted 8 April, 2024; v1 submitted 29 December, 2022; originally announced December 2022.

arXiv:2212.06080 [pdf, ps, other]

Logs with zeros? Some problems and solutions

Authors: Jiafeng Chen, Jonathan Roth

Abstract: When studying an outcome $Y$ that is weakly-positive but can equal zero (e.g. earnings), researchers frequently estimate an average treatment effect (ATE) for a "log-like" transformation that behaves like $\log(Y)$ for large $Y$ but is defined at zero (e.g. $\log(1+Y)$, $\mathrm{arcsinh}(Y)$). We argue that ATEs for log-like transformations should not be interpreted as approximating percentage eff… ▽ More When studying an outcome $Y$ that is weakly-positive but can equal zero (e.g. earnings), researchers frequently estimate an average treatment effect (ATE) for a "log-like" transformation that behaves like $\log(Y)$ for large $Y$ but is defined at zero (e.g. $\log(1+Y)$, $\mathrm{arcsinh}(Y)$). We argue that ATEs for log-like transformations should not be interpreted as approximating percentage effects, since unlike a percentage, they depend on the units of the outcome. In fact, we show that if the treatment affects the extensive margin, one can obtain a treatment effect of any magnitude simply by re-scaling the units of $Y$ before taking the log-like transformation. This arbitrary unit-dependence arises because an individual-level percentage effect is not well-defined for individuals whose outcome changes from zero to non-zero when receiving treatment, and the units of the outcome implicitly determine how much weight the ATE for a log-like transformation places on the extensive margin. We further establish a trilemma: when the outcome can equal zero, there is no treatment effect parameter that is an average of individual-level treatment effects, unit-invariant, and point-identified. We discuss several alternative approaches that may be sensible in settings with an intensive and extensive margin, including (i) expressing the ATE in levels as a percentage (e.g. using Poisson regression), (ii) explicitly calibrating the value placed on the intensive and extensive margins, and (iii) estimating separate effects for the two margins (e.g. using Lee bounds). We illustrate these approaches in three empirical applications. △ Less

Submitted 15 November, 2023; v1 submitted 12 December, 2022; originally announced December 2022.

Comments: Accepted, Quarterly Journal of Economics

arXiv:2204.06967 [pdf, other]

doi 10.1016/j.jue.2022.103451

JUE Insight: The (Non-)Effect of Opportunity Zones on Housing Prices

Authors: Jiafeng Chen, Edward Glaeser, David Wessel

Abstract: Will the Opportunity Zones (OZ) program, America's largest new place-based policy in decades, generate neighborhood change? We compare single-family housing price growth in OZs with price growth in areas that were eligible but not included in the program. We also compare OZs to their nearest geographic neighbors. Our most credible estimates rule out price impacts greater than 0.5 percentage points… ▽ More Will the Opportunity Zones (OZ) program, America's largest new place-based policy in decades, generate neighborhood change? We compare single-family housing price growth in OZs with price growth in areas that were eligible but not included in the program. We also compare OZs to their nearest geographic neighbors. Our most credible estimates rule out price impacts greater than 0.5 percentage points with 95% confidence, suggesting that, so far, home buyers don't believe that this subsidy will generate major neighborhood change. OZ status reduces prices in areas with little employment, perhaps because buyers think that subsidizing new investment will increase housing supply. Mixed evidence suggests that OZs may have increased residential permitting. △ Less

Submitted 14 April, 2022; originally announced April 2022.

Comments: To appear in Journal of Urban Economics

arXiv:2202.08426 [pdf, ps, other]

doi 10.3982/ECTA20720

Synthetic Control As Online Linear Regression

Authors: Jiafeng Chen

Abstract: This paper notes a simple connection between synthetic control and online learning. Specifically, we recognize synthetic control as an instance of Follow-The-Leader (FTL). Standard results in online convex optimization then imply that, even when outcomes are chosen by an adversary, synthetic control predictions of counterfactual outcomes for the treated unit perform almost as well as an oracle wei… ▽ More This paper notes a simple connection between synthetic control and online learning. Specifically, we recognize synthetic control as an instance of Follow-The-Leader (FTL). Standard results in online convex optimization then imply that, even when outcomes are chosen by an adversary, synthetic control predictions of counterfactual outcomes for the treated unit perform almost as well as an oracle weighted average of control units' outcomes. Synthetic control on differenced data performs almost as well as oracle weighted difference-in-differences, potentially making it an attractive choice in practice. We argue that this observation further supports the use of synthetic control estimators in comparative case studies. △ Less

Submitted 13 November, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

Journal ref: Econometrica 91 (2023) 465-491

arXiv:2201.09691 [pdf, ps, other]

Multidimensional Manhattan Preferences

Authors: Jiehua Chen, Martin Nöllenburg, Sofia Simola, Anaïs Villedieu, Markus Wallinger

Abstract: A preference profile with $m$ alternatives and $n$ voters is $d$-Manhattan (resp. $d$-Euclidean) if both the alternatives and the voters can be placed into the $d$-dimensional space such that between each pair of alternatives, every voter prefers the one which has a shorter Manhattan (resp. Euclidean) distance to the voter. Following Bogomolnaia and Laslier [Journal of Mathematical Economics, 2007… ▽ More A preference profile with $m$ alternatives and $n$ voters is $d$-Manhattan (resp. $d$-Euclidean) if both the alternatives and the voters can be placed into the $d$-dimensional space such that between each pair of alternatives, every voter prefers the one which has a shorter Manhattan (resp. Euclidean) distance to the voter. Following Bogomolnaia and Laslier [Journal of Mathematical Economics, 2007] and Chen and Grottke [Social Choice and Welfare, 2021] who look at $d$-Euclidean preference profiles, we study which preference profiles are $d$-Manhattan depending on the values $m$ and $n$. First, we show that each preference profile with $m$ alternatives and $n$ voters is $d$-Manhattan whenever $d$ $\geq$ min($n$, $m$-$1$). Second, for $d = 2$, we show that the smallest non $d$-Manhattan preference profile has either three voters and six alternatives, or four voters and five alternatives, or five voters and four alternatives. This is more complex than the case with $d$-Euclidean preferences (see [Bogomolnaia and Laslier, 2007] and [Bulteau and Chen, 2020]. △ Less

Submitted 24 January, 2022; originally announced January 2022.

arXiv:2112.03872 [pdf, ps, other]

Nonparametric Treatment Effect Identification in School Choice

Authors: Jiafeng Chen

Abstract: This paper studies nonparametric identification and estimation of causal effects in centralized school assignment. In many centralized assignment settings, students are subjected to both lottery-driven variation and regression discontinuity (RD) driven variation. We characterize the full set of identified atomic treatment effects (aTEs), defined as the conditional average treatment effect between… ▽ More This paper studies nonparametric identification and estimation of causal effects in centralized school assignment. In many centralized assignment settings, students are subjected to both lottery-driven variation and regression discontinuity (RD) driven variation. We characterize the full set of identified atomic treatment effects (aTEs), defined as the conditional average treatment effect between a pair of schools, given student characteristics. Atomic treatment effects are the building blocks of more aggregated notions of treatment contrasts, and common approaches estimating aggregations of aTEs can mask important heterogeneity. In particular, many aggregations of aTEs put zero weight on aTEs driven by RD variation, and estimators of such aggregations put asymptotically vanishing weight on the RD-driven aTEs. We develop a diagnostic tool for empirically assessing the weight put on aTEs driven by RD variation. Lastly, we provide estimators and accompanying asymptotic results for inference on aggregations of RD-driven aTEs. △ Less

Submitted 23 October, 2023; v1 submitted 7 December, 2021; originally announced December 2021.

Comments: Presented at SOLE 2021

arXiv:2110.06763 [pdf, other]

Efficient Estimation in NPIV Models: A Comparison of Various Neural Networks-Based Estimators

Authors: Jiafeng Chen, Xiaohong Chen, Elie Tamer

Abstract: Artificial Neural Networks (ANNs) can be viewed as nonlinear sieves that can approximate complex functions of high dimensional variables more effectively than linear sieves. We investigate the performance of various ANNs in nonparametric instrumental variables (NPIV) models of moderately high dimensional covariates that are relevant to empirical economics. We present two efficient procedures for e… ▽ More Artificial Neural Networks (ANNs) can be viewed as nonlinear sieves that can approximate complex functions of high dimensional variables more effectively than linear sieves. We investigate the performance of various ANNs in nonparametric instrumental variables (NPIV) models of moderately high dimensional covariates that are relevant to empirical economics. We present two efficient procedures for estimation and inference on a weighted average derivative (WAD): an orthogonalized plug-in with optimally-weighted sieve minimum distance (OP-OSMD) procedure and a sieve efficient score (ES) procedure. Both estimators for WAD use ANN sieves to approximate the unknown NPIV function and are root-n asymptotically normal and first-order equivalent. We provide a detailed practitioner's recipe for implementing both efficient procedures. We compare their finite-sample performances in various simulation designs that involve smooth NPIV function of up to 13 continuous covariates, different nonlinearities and covariate correlations. Some Monte Carlo findings include: 1) tuning and optimization are more delicate in ANN estimation; 2) given proper tuning, both ANN estimators with various architectures can perform well; 3) easier to tune ANN OP-OSMD estimators than ANN ES estimators; 4) stable inferences are more difficult to achieve with ANN (than spline) estimators; 5) there are gaps between current implementations and approximation theories. Finally, we apply ANN NPIV to estimate average partial derivatives in two empirical demand examples with multivariate covariates. △ Less

Submitted 4 October, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

arXiv:2107.14405 [pdf, other]

Semiparametric Estimation of Long-Term Treatment Effects

Authors: Jiafeng Chen, David M. Ritzwoller

Abstract: Long-term outcomes of experimental evaluations are necessarily observed after long delays. We develop semiparametric methods for combining the short-term outcomes of experiments with observational measurements of short-term and long-term outcomes, in order to estimate long-term treatment effects. We characterize semiparametric efficiency bounds for various instances of this problem. These calculat… ▽ More Long-term outcomes of experimental evaluations are necessarily observed after long delays. We develop semiparametric methods for combining the short-term outcomes of experiments with observational measurements of short-term and long-term outcomes, in order to estimate long-term treatment effects. We characterize semiparametric efficiency bounds for various instances of this problem. These calculations facilitate the construction of several estimators. We analyze the finite-sample performance of these estimators with a simulation calibrated to data from an evaluation of the long-term effects of a poverty alleviation program. △ Less

Submitted 17 August, 2023; v1 submitted 29 July, 2021; originally announced July 2021.

arXiv:2103.01368 [pdf, other]

Standing on the Shoulders of Machine Learning: Can We Improve Hypothesis Testing?

Authors: Gary Cornwall, Jeff Chen, Beau Sauley

Abstract: In this paper we have updated the hypothesis testing framework by drawing upon modern computational power and classification models from machine learning. We show that a simple classification algorithm such as a boosted decision stump can be used to fully recover the full size-power trade-off for any single test statistic. This recovery implies an equivalence, under certain conditions, between the… ▽ More In this paper we have updated the hypothesis testing framework by drawing upon modern computational power and classification models from machine learning. We show that a simple classification algorithm such as a boosted decision stump can be used to fully recover the full size-power trade-off for any single test statistic. This recovery implies an equivalence, under certain conditions, between the basic building block of modern machine learning and hypothesis testing. Second, we show that more complex algorithms such as the random forest and gradient boosted machine can serve as map** functions in place of the traditional null distribution. This allows for multiple test statistics and other information to be evaluated simultaneously and thus form a pseudo-composite hypothesis test. Moreover, we show how practitioners can make explicit the relative costs of Type I and Type II errors to contextualize the test into a specific decision framework. To illustrate this approach we revisit the case of testing for unit roots, a difficult problem in time series econometrics for which existing tests are known to exhibit low power. Using a simulation framework common to the literature we show that this approach can improve upon overall accuracy of the traditional unit root test(s) by seventeen percentage points, and the sensitivity by thirty six percentage points. △ Less

Submitted 1 March, 2021; originally announced March 2021.

arXiv:2101.02587 [pdf, other]

doi 10.1371/journal.pone.0306520

Mining the Relationship Between COVID-19 Sentiment and Market Performance

Authors: Ziyuan Xia, Jeffery Chen, Anchen Sun

Abstract: At the beginning of the COVID-19 outbreak in March, we observed one of the largest stock market crashes in history. Within the months following this, a volatile bullish climb back to pre-pandemic performances and higher. In this paper, we study the stock market behavior during the initial few months of the COVID-19 pandemic in relation to COVID-19 sentiment. Using text sentiment analysis of Twitte… ▽ More At the beginning of the COVID-19 outbreak in March, we observed one of the largest stock market crashes in history. Within the months following this, a volatile bullish climb back to pre-pandemic performances and higher. In this paper, we study the stock market behavior during the initial few months of the COVID-19 pandemic in relation to COVID-19 sentiment. Using text sentiment analysis of Twitter data, we look at tweets that contain key words in relation to the COVID-19 pandemic and the sentiment of the tweet to understand whether sentiment can be used as an indicator for stock market performance. There has been previous research done on applying natural language processing and text sentiment analysis to understand the stock market performance, given how prevalent the impact of COVID-19 is to the economy, we want to further the application of these techniques to understand the relationship that COVID-19 has with stock market performance. Our findings show that there is a strong relationship to COVID-19 sentiment derived from tweets that could be used to predict stock market performance in the future. △ Less

Submitted 13 March, 2023; v1 submitted 6 January, 2021; originally announced January 2021.

Comments: 18 pages, 7 figures, 5 tables

arXiv:2011.06158 [pdf, other]

Mostly Harmless Machine Learning: Learning Optimal Instruments in Linear IV Models

Authors: Jiafeng Chen, Daniel L. Chen, Greg Lewis

Abstract: We offer straightforward theoretical results that justify incorporating machine learning in the standard linear instrumental variable setting. The key idea is to use machine learning, combined with sample-splitting, to predict the treatment variable from the instrument and any exogenous covariates, and then use this predicted treatment and the covariates as technical instruments to recover the coe… ▽ More We offer straightforward theoretical results that justify incorporating machine learning in the standard linear instrumental variable setting. The key idea is to use machine learning, combined with sample-splitting, to predict the treatment variable from the instrument and any exogenous covariates, and then use this predicted treatment and the covariates as technical instruments to recover the coefficients in the second-stage. This allows the researcher to extract non-linear co-variation between the treatment and instrument that may dramatically improve estimation precision and robustness by boosting instrument strength. Importantly, we constrain the machine-learned predictions to be linear in the exogenous covariates, thus avoiding spurious identification arising from non-linear relationships between the treatment and the covariates. We show that this approach delivers consistent and asymptotically normal estimates under weak conditions and that it may be adapted to be semiparametrically efficient (Chamberlain, 1992). Our method preserves standard intuitions and interpretations of linear instrumental variable methods, including under weak identification, and provides a simple, user-friendly upgrade to the applied economics toolbox. We illustrate our method with an example in law and criminal justice, examining the causal effect of appellate court reversals on district court sentencing decisions. △ Less

Submitted 18 June, 2021; v1 submitted 11 November, 2020; originally announced November 2020.

Comments: NeurIPS 2020 Workshop on Machine Learning for Economic Policy

arXiv:2011.02407 [pdf, other]

Debiasing classifiers: is reality at variance with expectation?

Authors: Ashrya Agrawal, Florian Pfisterer, Bernd Bischl, Francois Buet-Golfouse, Srijan Sood, Jiahao Chen, Sameena Shah, Sebastian Vollmer

Abstract: We present an empirical study of debiasing methods for classifiers, showing that debiasers often fail in practice to generalize out-of-sample, and can in fact make fairness worse rather than better. A rigorous evaluation of the debiasing treatment effect requires extensive cross-validation beyond what is usually done. We demonstrate that this phenomenon can be explained as a consequence of bias-va… ▽ More We present an empirical study of debiasing methods for classifiers, showing that debiasers often fail in practice to generalize out-of-sample, and can in fact make fairness worse rather than better. A rigorous evaluation of the debiasing treatment effect requires extensive cross-validation beyond what is usually done. We demonstrate that this phenomenon can be explained as a consequence of bias-variance trade-off, with an increase in variance necessitated by imposing a fairness constraint. Follow-up experiments validate the theoretical prediction that the estimation variance depends strongly on the base rates of the protected class. Considering fairness--performance trade-offs justifies the counterintuitive notion that partial debiasing can actually yield better results in practice on out-of-sample data. △ Less

Submitted 30 May, 2021; v1 submitted 4 November, 2020; originally announced November 2020.

Comments: 13 pages, under review

MSC Class: 68T01; 68Q32; 68T05 ACM Class: G.4; I.2.0; J.4

arXiv:1909.12592 [pdf, other]

Debiased/Double Machine Learning for Instrumental Variable Quantile Regressions

Authors: Jau-er Chen, Chien-Hsun Huang, Jia-Jyun Tien

Abstract: In this study, we investigate estimation and inference on a low-dimensional causal parameter in the presence of high-dimensional controls in an instrumental variable quantile regression. Our proposed econometric procedure builds on the Neyman-type orthogonal moment conditions of a previous study Chernozhukov, Hansen and Wuthrich (2018) and is thus relatively insensitive to the estimation of the nu… ▽ More In this study, we investigate estimation and inference on a low-dimensional causal parameter in the presence of high-dimensional controls in an instrumental variable quantile regression. Our proposed econometric procedure builds on the Neyman-type orthogonal moment conditions of a previous study Chernozhukov, Hansen and Wuthrich (2018) and is thus relatively insensitive to the estimation of the nuisance parameters. The Monte Carlo experiments show that the estimator copes well with high-dimensional controls. We also apply the procedure to empirically reinvestigate the quantile treatment effect of 401(k) participation on accumulated wealth. △ Less

Submitted 21 February, 2021; v1 submitted 27 September, 2019; originally announced September 2019.

Comments: 19 pages

arXiv:1710.06893 [pdf, other]

doi 10.1063/1.5004711

The tip** point: a mathematical model for the profit-driven abandonment of restaurant tip**

Authors: Sara M. Clifton, Eileen Herbers, Jack Chen, Daniel M. Abrams

Abstract: The custom of voluntarily tip** for services rendered has gone in and out of fashion in America since its introduction in the 19th century. Restaurant owners that ban tip** in their establishments often claim that social justice drives their decisions, but we show that rational profit-maximization may also justify the decisions. Here, we propose a conceptual model of restaurant competition for… ▽ More The custom of voluntarily tip** for services rendered has gone in and out of fashion in America since its introduction in the 19th century. Restaurant owners that ban tip** in their establishments often claim that social justice drives their decisions, but we show that rational profit-maximization may also justify the decisions. Here, we propose a conceptual model of restaurant competition for staff and customers, and we show that there exists a critical conventional tip rate at which restaurant owners should eliminate tip** to maximize profit. Because the conventional tip rate has been increasing steadily for the last several decades, our model suggests that restaurant owners may abandon tip** en masse when that critical tip rate is reached. △ Less

Submitted 14 December, 2017; v1 submitted 8 September, 2017; originally announced October 2017.

Comments: 14 pages, 5 figures, supplementary material included

Journal ref: Chaos 28, 023109 (2018)

arXiv:1705.09418 [pdf, ps, other]

Nonparametric Regression with Multiple Thresholds: Estimation and Inference

Authors: Yan-Yu Chiou, Mei-Yuan Chen, Jau-er Chen

Abstract: This paper examines nonparametric regression with an exogenous threshold variable, allowing for an unknown number of thresholds. Given the number of thresholds and corresponding threshold values, we first establish the asymptotic properties of the local constant estimator for a nonparametric regression with multiple thresholds. However, the number of thresholds and corresponding threshold values a… ▽ More This paper examines nonparametric regression with an exogenous threshold variable, allowing for an unknown number of thresholds. Given the number of thresholds and corresponding threshold values, we first establish the asymptotic properties of the local constant estimator for a nonparametric regression with multiple thresholds. However, the number of thresholds and corresponding threshold values are typically unknown in practice. We then use our testing procedure to determine the unknown number of thresholds and derive the limiting distribution of the proposed test. The Monte Carlo simulation results indicate the adequacy of the modified test and accuracy of the sequential estimation of the threshold values. We apply our testing procedure to an empirical study of the 401(k) retirement savings plan with income thresholds. △ Less

Submitted 23 February, 2018; v1 submitted 25 May, 2017; originally announced May 2017.

Showing 1–22 of 22 results for author: Chen, J